BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 040836
(758 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
Length = 802
Score = 1056 bits (2730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 510/759 (67%), Positives = 610/759 (80%), Gaps = 17/759 (2%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
R++++ + ++ F +CD+ L Y RAKDLV +MTL EKVQQ+GDLAYGVPRLG+P YEWWS
Sbjct: 55 RYDNLGLDMTTFGFCDSSLSYEVRAKDLVNQMTLKEKVQQLGDLAYGVPRLGIPKYEWWS 114
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGVS +G PGT FD VPGATSFPT ILTTASFNESLWK IGQ S +ARAM
Sbjct: 115 EALHGVSDVG------PGTFFDDLVPGATSFPTTILTTASFNESLWKNIGQA-SAKARAM 167
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG AGLT+WSPN+NVVRDPRWGR +ETPGEDPYVVGRYA+NYVRGLQDVEG E + D
Sbjct: 168 YNLGRAGLTYWSPNVNVVRDPRWGRTVETPGEDPYVVGRYAVNYVRGLQDVEGTENYTDL 227
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
++RPLK+S+CCKHYAAYD++ W+G +R FD+RVTEQDM ETF+ PFEMCV EGDVSSVM
Sbjct: 228 NTRPLKVSSCCKHYAAYDVEKWQGVERLTFDARVTEQDMVETFLRPFEMCVKEGDVSSVM 287
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
CS+NRVNGIPTCADPKLLNQTIRGDW+ HGYIVSDCDSI+ +V++HKFL DT EDAVA+V
Sbjct: 288 CSFNRVNGIPTCADPKLLNQTIRGDWDLHGYIVSDCDSIEVMVDNHKFLGDTNEDAVAQV 347
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
LKAGLDLDCG YYTNFT +V+QGK E ID SL++LY+VLMRLG+FDG+PQY+ LGK
Sbjct: 348 LKAGLDLDCGGYYTNFTETSVKQGKAREEYIDRSLKYLYVVLMRLGFFDGTPQYQKLGKK 407
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+IC +++ELA +AAR+GIVLLKN N LPL+ +K LA+VGPHANAT+ MIGNY G P
Sbjct: 408 DICTKENVELAKQAAREGIVLLKN-NDTLPLSMDKVKNLAVVGPHANATRVMIGNYAGVP 466
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
CRY SP+DGF YS V Y GC D+ C+N S++ A+ AAKNADAT+IVAGLDL++EAE
Sbjct: 467 CRYVSPIDGFSIYSNV-TYEIGC-DVPCKNESLVFPAVHAAKNADATIIVAGLDLTIEAE 524
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
G DR DLLLPG+QT+LIN+VA AA GPV LVIM+AG VDI+FA++N KIK+ILWVGYPG+
Sbjct: 525 GLDRNDLLLPGYQTQLINQVAGAANGPVILVIMAAGGVDISFARDNEKIKAILWVGYPGQ 584
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPV 597
EGG AIADV+FGKYNPGGRLPITWYEA++V ++P T M LRP +PG+TYKF+DG
Sbjct: 585 EGGHAIADVVFGKYNPGGRLPITWYEADFVEQVPMTYMQLRPDEELGYPGKTYKFYDGST 644
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VYPFGYGLSYT F Y + S+ +S I L+K Q CRD+ Y T KP C AVL D + C D
Sbjct: 645 VYPFGYGLSYTTFSYNITSAKRSKHIALNKFQHCRDLRYGNETFKPSCPAVLTDHLPCND 704
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
F ++EVEN G DGSEVVMVYSK P GI G++IKQVIG++RVF+ AG KV F N
Sbjct: 705 -DFELEVEVENTGSRDGSEVVMVYSKTPEGIVGSYIKQVIGFKRVFVQAGSVEKVNFRFN 763
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLN 755
CKS +I+D A S+L SG HTI+VG+ + VS PL +N
Sbjct: 764 VCKSFRIIDYNAYSILPSGGHTIMVGDDI--VSIPLYIN 800
>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 812
Score = 1031 bits (2666), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 491/761 (64%), Positives = 597/761 (78%), Gaps = 14/761 (1%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
R++ + + S F +CD+ L +PERAKDL++RMTL EK Q+G +A GV RLGLP Y WWS
Sbjct: 62 RYDKLGLDFSSFGFCDSSLSFPERAKDLIDRMTLSEKAAQLGHVASGVDRLGLPPYNWWS 121
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGVS +G PGT FD VPGATSFP VI T +SFNE LWK IGQ VSTEARAM
Sbjct: 122 EALHGVSNVG------PGTQFDKVVPGATSFPNVITTASSFNEDLWKTIGQAVSTEARAM 175
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG AGLT+WSP INV+RDPRWGR +ETPGEDP+VVG+YA NYVRGLQDVEG E D
Sbjct: 176 YNLGRAGLTYWSPTINVIRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQDVEGSENVTDL 235
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
+SRPLK+S+CCKHYAAYD+DNW G +R+ FD+RVTEQDM ETF PFEMCV EGDVSSVM
Sbjct: 236 NSRPLKVSSCCKHYAAYDVDNWLGVERYSFDARVTEQDMLETFNKPFEMCVKEGDVSSVM 295
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
CSYNRVNGIPTCADP LL TIRG+W HGYIVSDCDS++ +VE +L DT EDAVA+
Sbjct: 296 CSYNRVNGIPTCADPVLLKDTIRGNWGLHGYIVSDCDSVKVMVEDAHYLQDTNEDAVAQT 355
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
LKAGLDLDCG Y N+T V+QGK+ +ID +L LY+VLMRLGYFDG+ +++LGK
Sbjct: 356 LKAGLDLDCGQIYPNYTESTVRQGKVGMRNIDNALNNLYVVLMRLGYFDGNTGFESLGKP 415
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+IC+ +HIELA EAARQG VLLKNDN LP + N KTLA+VGPHANAT AM+GNY G P
Sbjct: 416 DICSDEHIELATEAARQGTVLLKNDNDTLPFDPSNYKTLAVVGPHANATSAMLGNYAGVP 475
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
CR SPMDG Y+KV Y GC + C+N++ I A++AA+ +DATVI G+DLS+EAE
Sbjct: 476 CRMNSPMDGLSEYAKV-KYQMGCDSVACKNDTFIFGAMEAARTSDATVIFVGIDLSIEAE 534
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRVDLLLPG+QT+L+ +VA +KGPV LVI+SAG +D++FAKNN IK+I+W GYPGE
Sbjct: 535 SLDRVDLLLPGYQTQLVQQVATVSKGPVVLVILSAGGIDVSFAKNNSNIKAIIWAGYPGE 594
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPV 597
EGGRAIADVIFGK+NPGGRLP+TWYE +YV ++P TSMPLRPV + +PGRTYKF+DGPV
Sbjct: 595 EGGRAIADVIFGKFNPGGRLPLTWYENDYVYQLPMTSMPLRPVKSLGYPGRTYKFYDGPV 654
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VYPFG+GLSYT F + + S+ +S+ I L QCRDI YT GT KP C AVL+DD+ C +
Sbjct: 655 VYPFGHGLSYTFFLHNLTSAKRSIAIDLSNRTQCRDIAYTNGTFKPECPAVLVDDLTCTE 714
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
+ FQ+EVEN G+ DGS+V++VYS PP GI+ THIKQV+G++RVF+ AG S V F +N
Sbjct: 715 -EIEFQMEVENTGERDGSQVLLVYSVPPGGISSTHIKQVVGFQRVFLKAGDSETVTFKLN 773
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
ACKSL +VD +LL +G HTI+VG+ G VSFP++L+ N
Sbjct: 774 ACKSLGLVDFTGYNLLPAGGHTIVVGD--GEVSFPVELSFN 812
>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 809
Score = 1013 bits (2618), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/760 (63%), Positives = 591/760 (77%), Gaps = 15/760 (1%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF ++ + + DF YCD+ PY RAKDLV+RMTL EKV Q GD A GV R+GLP Y WWS
Sbjct: 57 RFAALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWS 116
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGVS GR FD VPGATSFPTVIL+ ASFN+SLWK +GQ VSTEARAM
Sbjct: 117 EALHGVSNFGR------CVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAM 170
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YN GNAGLTFWSPNINVVRDPRWGR+LETPGEDP++VG YA+NYVRGLQDV G E D
Sbjct: 171 YNSGNAGLTFWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNYVRGLQDVVGAENTTDL 230
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
+SRPLK+S+CCKHYAAYDLDNW+G DR HFD+RV+ QDM ETF+LPFEMCV EGDVSSVM
Sbjct: 231 NSRPLKVSSCCKHYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVM 290
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
CSYN++NGIP+CAD +LL QTIRG+W+ HGYIVSDCDS++ + K+L+ + D+ A+
Sbjct: 291 CSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQA 350
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
L AG++LDCG + AV QGK +AD+D SLR+LY++LMR+G+FDG P + +LGK+
Sbjct: 351 LNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPAFASLGKD 410
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+IC+ +HIELA EAARQGIVLLKNDN LPL + +K +ALVGPHANAT AMIGNY G P
Sbjct: 411 DICSAEHIELAREAARQGIVLLKNDNATLPLKS--VKNIALVGPHANATDAMIGNYAGIP 468
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C Y SP+D F + +V Y GCAD+ C N + I A++AAK ADAT+I AG DLS+EAE
Sbjct: 469 CYYVSPLDAFSSMGEV-RYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAE 527
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRVDLLLPG+QT+LIN+VAD + GPV LVIMS G VDI+FA++NPKI +ILW GYPGE
Sbjct: 528 ALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGE 587
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPV 597
+GG AIADVI GKYNPGGRLPITWYEA+YV +P TSM LRPV++ +PGRTYKFF+G
Sbjct: 588 QGGNAIADVILGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGST 647
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VYPFGYG+SYT F Y +++S + +I L K Q+CR + Y T P C AVL+DD+ CK+
Sbjct: 648 VYPFGYGMSYTNFSYSLSTSQRWTNINLRKLQRCRSMVYINDTFVPDCPAVLVDDLSCKE 707
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
F++ V+N+G+MDGSEVV+VYS PP GIAGTHIK+V+G+ERVF+ G + KV F+MN
Sbjct: 708 -SIEFEVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMN 766
Query: 717 ACKSLKIVDNAANSLLASGAHTILV-GEGVGGVSFPLQLN 755
CKSL IVD+ +LL SG+HTI V G+ V+FP +N
Sbjct: 767 VCKSLGIVDSTGYALLPSGSHTIKVGGDNTTSVAFPFHVN 806
>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 805
Score = 980 bits (2533), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/762 (61%), Positives = 579/762 (75%), Gaps = 14/762 (1%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF ++ + + DF YCD+ LPY R KDLV+R+TL EK + + D+A GVPR+GLP Y+WWS
Sbjct: 54 RFAALGLDMKDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWS 113
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGV+ +G T FD VPGATSFP VIL+ ASFN+SLWK +GQ VSTEARAM
Sbjct: 114 EALHGVANVGS------ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAM 167
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG+AGLTFWSPNINV RDPRWGR+LETPGEDP VG Y +NYVRGLQD+EG E D
Sbjct: 168 YNLGHAGLTFWSPNINVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDL 227
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
+SRPLKI++ CKH+AAYDLD W DR HFD++V+EQDM ETF+ PFEMCV EGD SSVM
Sbjct: 228 NSRPLKIASSCKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVM 287
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
CS+N +NGIP CADP+ L IR WN HGYIVSDC +I TIV+ KFL+ T E+ VA
Sbjct: 288 CSFNNINGIPPCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALS 347
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
+KAGLDL+CG YY + AV++G+++E D+D SL +LY+VLMR+G+FDG P +LGK
Sbjct: 348 MKAGLDLECGHYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKK 407
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ICN +HIELA EAARQGIVLLKNDN LPL +K LALVGPHANAT AMIGNY G P
Sbjct: 408 DICNDEHIELAREAARQGIVLLKNDNATLPLKP--VKKLALVGPHANATVAMIGNYAGIP 465
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C Y SP+D F V Y GCAD+ C N++ + A +AAKNADAT+I+ G DLS+EAE
Sbjct: 466 CHYVSPLDAFSELGDV-TYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAE 524
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
+DR DLLLPG+QTE++N+V D + GPV LV+M G +DI+FAKNNPKI +ILW G+PGE
Sbjct: 525 ERDREDLLLPGYQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGE 584
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPV 597
+GG AIAD++FGKYNPGGR PITWYE YV +P TSM LRP+ + +PGRTYKFF+G
Sbjct: 585 QGGNAIADIVFGKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGST 644
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VYPFGYGLSYT F Y + + +SV I L + QQCR + Y+ + +P C+AVL+DD+ C D
Sbjct: 645 VYPFGYGLSYTNFSYSLTAPTRSVHISLTRLQQCRSMAYSSDSFQPECSAVLVDDLSC-D 703
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
F FQ+ V+N+G MDGSEVVMVYS PP GI GTHIKQVIG+ERVF+ G + KV F+MN
Sbjct: 704 ESFEFQVAVKNVGSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMN 763
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
CKSL +VD++ LL SG+HTI+ G+ VSFP Q+N ++
Sbjct: 764 VCKSLGLVDSSGYILLPSGSHTIMAGDNSTSVSFPFQVNYHN 805
>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
Length = 694
Score = 973 bits (2514), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/734 (64%), Positives = 576/734 (78%), Gaps = 46/734 (6%)
Query: 27 DLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVP 86
DLV +MTL EKV Q+G+ AYGVPRLGL Y+WWSEALHGVS +G PGT FD +P
Sbjct: 2 DLVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVG------PGTFFDDLIP 55
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
G+TSFPTVI T A+FNESLWK IGQ VSTEARAMYNLG AGLT+WSPNINVVRDPRWGR
Sbjct: 56 GSTSFPTVITTAAAFNESLWKVIGQAVSTEARAMYNLGRAGLTYWSPNINVVRDPRWGRA 115
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
+ETPGEDPY+VGRYA+NYVRGLQDVEG E + D +SRPLK+S+CCKHYAAYD+DNW+G +
Sbjct: 116 IETPGEDPYLVGRYAVNYVRGLQDVEGSENYTDPNSRPLKVSSCCKHYAAYDVDNWKGVE 175
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
R+ FD+RV+EQDM ETF+ PFEMCV +GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW
Sbjct: 176 RYTFDARVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 235
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKI 326
+ HGYIVSDCDS+Q +VE+HK+L GLDLDCG YYT AV+QGK+
Sbjct: 236 DLHGYIVSDCDSLQVMVENHKWL--------------GLDLDCGAYYTENVEAAVRQGKV 281
Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
EADID SL FLY+VLMRLG+FDG PQY + GKN++C+ ++IELA EAAR+G VLLKN+N
Sbjct: 282 READIDKSLNFLYVVLMRLGFFDGIPQYNSFGKNDVCSKENIELATEAAREGAVLLKNEN 341
Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADI 446
+LPL+ +KTLA++GPH+NAT AMIGNY G PC+ +P++G Y+KV +Y GC+DI
Sbjct: 342 DSLPLSIEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKYAKV-DYQMGCSDI 400
Query: 447 VCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG 506
C++ S I A+++AK ADAT+I+AG+DLS+EAE DR DLLLPG+QT+LIN+VA + G
Sbjct: 401 ACKDESFIFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQTQLINQVASVSNG 460
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV LV+MSAG VDI+FAK+N IKSILWVGYPGEEGG AIADVIFGKYNPGGRLP+TW+E
Sbjct: 461 PVVLVLMSAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGKYNPGGRLPLTWHE 520
Query: 567 ANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
A+YV +P TSMPLRP+++ +PGRTYKFF+G VYPFG+GLSYTQF YK+ S+ +S+DI
Sbjct: 521 ADYVDMLPMTSMPLRPIDSLGYPGRTYKFFNGSTVYPFGHGLSYTQFTYKLTSTIRSLDI 580
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KLDK Q C D+ Y + KP EV N G DGSEVV+VY+K
Sbjct: 581 KLDKYQYCHDLGYKNDSFKP-------------------SFEVLNAGAKDGSEVVIVYAK 621
Query: 684 PP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
PP GI T+IKQVIG++RVF+ AG S KV F NA KSL++VD A S+L SG HTI++G
Sbjct: 622 PPEGIDATYIKQVIGFKRVFVPAGGSEKVKFEFNASKSLQVVDFNAYSVLPSGGHTIMLG 681
Query: 743 EGVGGVSFPLQLNL 756
+ + +SF +Q+
Sbjct: 682 DDI--ISFSVQIRF 693
>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 805
Score = 971 bits (2509), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 459/762 (60%), Positives = 578/762 (75%), Gaps = 14/762 (1%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
R+ + + + F +CD L Y ERAKDLV RMTL EKV Q A GV RLGLP Y WWS
Sbjct: 53 RYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWS 112
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHG+S +G PG FD +PGATS PTVIL+TA+FN++LWK +G+ VSTE RAM
Sbjct: 113 EALHGISNLG------PGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG+AGLTFWSPNINVVRD RWGR ET GEDP++VG +A+NYVRGLQDVEG E D
Sbjct: 167 YNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTDL 226
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
+SRPLK+S+CCKHYAAYD+D+W DR FD+RV+EQDM+ETF+ PFE CV EGDVSSVM
Sbjct: 227 NSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVM 286
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
CS+N++NGIP C+DP+LL IR +W+ HGYIVSDC ++ IV++ +LND+K DAVA+
Sbjct: 287 CSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKT 346
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
L+AGLDL+CG YYT+ +V GK+++ ++D +L+ +Y++LMR+GYFDG P Y++LG
Sbjct: 347 LQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGLK 406
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+IC HIELA EAARQGIVLLKND LPL G K +ALVGPHANAT+ MIGNY G P
Sbjct: 407 DICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGNYAGLP 464
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C+Y SP++ F A V YA GC D C N++ A +AAK+A+ T+I G DLS+EAE
Sbjct: 465 CKYVSPLEAFSAIGNV-TYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSIEAE 523
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRVD LLPG QTELI +VA+ + GPV LV++S +DI FAKNNP+I +ILWVG+PGE
Sbjct: 524 FVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGE 583
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPV 597
+GG AIADV+FGKYNPGGRLP+TWYEA+YV +P +SM LRPV+ +PGRTYKFFDG
Sbjct: 584 QGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGST 643
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VYPFGYG+SYT+F Y +A+S S+DI L+K Q+CR + YT P C AVL+DD+ C D
Sbjct: 644 VYPFGYGMSYTKFSYSLATSKISIDIDLNKFQKCRTVAYTEDQKVPSCPAVLLDDMSCDD 703
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
F++ V N+G +DGSEV+MVYS PP GI GTHIKQVIG+++VF+AAG + +V F+MN
Sbjct: 704 -TIEFEVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMN 762
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
ACKSL+IVD+ SLL SG+HTI VG+ S+ LQ+N ++
Sbjct: 763 ACKSLRIVDSTGYSLLPSGSHTIRVGDYSNSASYSLQVNYHY 804
>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 933 bits (2411), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 457/762 (59%), Positives = 557/762 (73%), Gaps = 45/762 (5%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF ++ + + DF YCD+ LPY R KDLV+R+TL EK + + D+A GVPR+GLP Y+WWS
Sbjct: 54 RFAALGLDMKDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWS 113
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGV+ +G T FD VPGATSFP VIL+ ASFN+SLWK +GQ VSTEARAM
Sbjct: 114 EALHGVANVGS------ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAM 167
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG+AGLTFWSPNINV RDPRWGR+LETPGEDP VG Y +NYVRGLQD+EG E D
Sbjct: 168 YNLGHAGLTFWSPNINVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDL 227
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
+SRPLKI++ CKH+AAYDLD W DR HFD++V+EQDM ETF+ PFEMCV EGD SSVM
Sbjct: 228 NSRPLKIASSCKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVM 287
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
CS+N +NGIP CADP+ L IR WN HGYIVSDC +I TIV+ KFL+ T E+ VA
Sbjct: 288 CSFNNINGIPPCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALS 347
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
+KAGLDL+CG YY + AV++G+++E D+D SL +LY+VLMR+G+FDG P +LGK
Sbjct: 348 MKAGLDLECGHYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKK 407
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ICN +HIELA EAARQGIVLLKNDN LPL +K LALVGPHANAT AMIGNY G P
Sbjct: 408 DICNDEHIELAREAARQGIVLLKNDNATLPLKP--VKKLALVGPHANATVAMIGNYAGIP 465
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C Y SP+D F V Y GCAD+ C N++ + A +AAKNADAT+I+ G DLS+EAE
Sbjct: 466 CHYVSPLDAFSELGDV-TYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAE 524
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
+DR DLLLPG+QTE++N+V D + GPV LV+M G +DI+FAKNNPKI +ILW G+PGE
Sbjct: 525 ERDREDLLLPGYQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGE 584
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPV 597
+GG AIAD++FGKYNPGGR PITWYE YV +P TSM LRP+ + +PGRTYKFF+G
Sbjct: 585 QGGNAIADIVFGKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGST 644
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VYPFGYGLSYT F Y + + +SV I L
Sbjct: 645 VYPFGYGLSYTNFSYSLTAPTRSVHISLT------------------------------- 673
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
F FQ+ V+N+G MDGSEVVMVYS PP GI GTHIKQVIG+ERVF+ G + KV F+MN
Sbjct: 674 -SFEFQVAVKNVGSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMN 732
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
CKSL +VD++ LL SG+HTI+ G+ VSFP Q+N ++
Sbjct: 733 VCKSLGLVDSSGYILLPSGSHTIMAGDNSTSVSFPFQVNYHN 774
>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
Length = 746
Score = 915 bits (2365), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/760 (59%), Positives = 549/760 (72%), Gaps = 78/760 (10%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF ++ + + DF YCD+ PY RAKDLV+RMTL EKV Q GD A GV R+GLP Y WWS
Sbjct: 57 RFAALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWS 116
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGVS GR FD VPGATSFPTVIL+ ASFN+SLWK +GQ VSTEARAM
Sbjct: 117 EALHGVSNFGR------CVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAM 170
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YN GNAGLTFWSPNINVVRDPRWGR+LETPGEDP++VG YA+NY
Sbjct: 171 YNSGNAGLTFWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNY---------------- 214
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
HYAAYDLDNW+G DR HFD+RV+ QDM ETF+LPFEMCV EGDVSSVM
Sbjct: 215 ------------HYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVM 262
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
CSYN++NGIP+CAD +LL QTIRG+W+ HGYIVSDCDS++ + K+L+ + D+ A+
Sbjct: 263 CSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQA 322
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
L AG++LDCG + AV QGK +AD+D SLR+LY++LMR+G+FDG P + +LGK+
Sbjct: 323 LNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPAFASLGKD 382
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+IC+ +HIELA EAARQGIVLLKNDN LPL + +K +ALVGPHANAT AMIGNY G P
Sbjct: 383 DICSAEHIELAREAARQGIVLLKNDNATLPLKS--VKNIALVGPHANATDAMIGNYAGIP 440
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C Y SP+D F + +V Y GCAD+ C N + I A++AAK ADAT+I AG DLS+EAE
Sbjct: 441 CYYVSPLDAFSSMGEV-RYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAE 499
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRVDLLLPG+QT+LIN+VAD + GPV LVIMS G VDI+FA++NPKI +ILW GYPGE
Sbjct: 500 ALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGE 559
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPV 597
+GG AIADVI GKYNPGGRLPITWYEA+YV +P TSM LRPV++ +PGRTYKFF+G
Sbjct: 560 QGGNAIADVILGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGST 619
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VYPFGYG+SYT F Y +++S Q C++
Sbjct: 620 VYPFGYGMSYTNFSYSLSTS-----------QSCKE------------------------ 644
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
F++ V+N+G+MDGSEVV+VYS PP GIAGTHIK+V+G+ERVF+ G + KV F+MN
Sbjct: 645 -SIEFEVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMN 703
Query: 717 ACKSLKIVDNAANSLLASGAHTILV-GEGVGGVSFPLQLN 755
CKSL IVD+ +LL SG+HTI V G+ V+FP +N
Sbjct: 704 VCKSLGIVDSTGYALLPSGSHTIKVGGDNTTSVAFPFHVN 743
>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
Length = 745
Score = 888 bits (2295), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/762 (56%), Positives = 544/762 (71%), Gaps = 74/762 (9%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
R+ + + + F +CD L Y ERAKDLV RMTL EKV Q A GV RLGLP Y WWS
Sbjct: 53 RYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWS 112
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHG+S +G PG FD +PGATS PTVIL+TA+FN++LWK +G+ VSTE RAM
Sbjct: 113 EALHGISNLG------PGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG+AGLTFWSPNINVVRD RWGR ET GEDP++VG +A+NYVRGLQDVEG E
Sbjct: 167 YNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTE----- 221
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
+S+CCKHYAAYD+D+W DR FD+RV+EQDM+ETF+ PFE CV EGDVSSVM
Sbjct: 222 -----NVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVM 276
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
CS+N++NGIP C+DP+LL IR +W+ HGYIVSDC ++ IV++ +LND+K DAVA+
Sbjct: 277 CSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKT 336
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
L+AGLDL+CG YYT+ +V GK+++ ++D +L+ +Y++LMR+GYFDG P Y++LG
Sbjct: 337 LQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGLK 396
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+IC HIELA EAARQGIVLLKND LPL G K +ALVGPHANAT+ MIGNY G P
Sbjct: 397 DICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGNYAGLP 454
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C+Y SP++ F A V YA G T+I G DLS+EAE
Sbjct: 455 CKYVSPLEAFSAIGNV-TYATGF-----------------------TIIFVGTDLSIEAE 490
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRVD LLPG QTELI +VA+ + GPV LV++S +DI FAKNNP+I +ILWVG+PGE
Sbjct: 491 FVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGE 550
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPV 597
+GG AIADV+FGKYNPGGRLP+TWYEA+YV +P +SM LRPV+ +PGRTYKFFDG
Sbjct: 551 QGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGST 610
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VYPFGYG+SYT+F Y +A+S S+DI L+K Q+CR
Sbjct: 611 VYPFGYGMSYTKFSYSLATSKISIDIDLNKFQKCR------------------------- 645
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
TF++ V N+G +DGSEV+MVYS PP GI GTHIKQVIG+++VF+AAG + +V F+MN
Sbjct: 646 ---TFEVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMN 702
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
ACKSL+IVD+ SLL SG+HTI VG+ S+ LQ+N ++
Sbjct: 703 ACKSLRIVDSTGYSLLPSGSHTIRVGDYSNSASYSLQVNYHY 744
>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
vinifera]
Length = 781
Score = 879 bits (2272), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/763 (58%), Positives = 551/763 (72%), Gaps = 18/763 (2%)
Query: 1 RFESIKVKLSDFPYCDAKLP-YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWW 59
RF ++ + DF YC++ LP Y R KDLV+RMTL EK + A GV R+GLP Y+WW
Sbjct: 21 RFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEKATNVIYKAAGVERIGLPPYQWW 80
Query: 60 SEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA 119
SEALHGVS + N P T FD VPGATSFP VIL+ ASFN+SLWK I Q VS EARA
Sbjct: 81 SEALHGVSSVS--INGP--TFFDETVPGATSFPNVILSAASFNQSLWKTIRQVVSKEARA 136
Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
YNLG+AGLTFW PN+NV RDPRWGR ET GEDP+ V YA++YVRGLQDVEG E D
Sbjct: 137 TYNLGHAGLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVYAVSYVRGLQDVEGTENTTD 196
Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
+SRPLK+S+ KH+AAYDLDNW DR HF++RV+EQDM ETF+ PFE CV EGDVS V
Sbjct: 197 LNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDMAETFLRPFEACVREGDVSGV 256
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MCS+N +NGIP CADP+L TIR +WN HGYIVSDC SI+TIVE KFL+ T E+AVA
Sbjct: 257 MCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSIETIVEDQKFLDVTGEEAVAL 316
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK 359
LKAGLDL+CG YY + AV G++ + D+D SL LY+VLMRLG+FDG P +LGK
Sbjct: 317 NLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLYVVLMRLGFFDGIPALASLGK 376
Query: 360 NNIC-NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
++IC + +HIELA EAARQGIVLLKNDN LPL + +K LALVGP+A+A AM+GNY G
Sbjct: 377 DDICLSAEHIELAREAARQGIVLLKNDNATLPLKS--VKNLALVGPNADAYGAMMGNYAG 434
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL-DLSV 477
PCR SP D F A V Y GC D++C N++ + A++AAK+AD T+IV G+ D+S+
Sbjct: 435 PPCRSVSPRDAFSAIGNV-TYEMGCGDVLCHNDTYVYKAVEAAKHADTTIIVVGITDVSI 493
Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS--AGAVDINFAKNNPKIKSILWV 535
E KDRVDLLLPG+QT L+N++A A P+ LV+ G +DI+FA++NP I+ ILW
Sbjct: 494 GTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCGHCGGPIDISFARDNPGIEPILWA 553
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKF 592
G+PGEEGG AIADV++GKYNPGGRLP+TWYE YV +P TSM LR V + +PGR YKF
Sbjct: 554 GFPGEEGGNAIADVVYGKYNPGGRLPVTWYENGYVGMLPMTSMALRSVESLGYPGRKYKF 613
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
F G VYPFG GLSYT F Y + + +S+ L K Q CR + Y++ + P C AVL+DD
Sbjct: 614 FSGSTVYPFGCGLSYTNFSYSLTAPTRSIHTHLKKLQPCRSMAYSICSVIPQCPAVLVDD 673
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
+ C + F F++ V+ +G MDGSEVV+VYS PP GI GTHIKQVIG+ERVF+ G KV
Sbjct: 674 LSCNE-TFEFEVAVKTVGSMDGSEVVIVYSSPPSGIVGTHIKQVIGFERVFVKVGXVEKV 732
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILV-GEGVGGVSFPLQ 753
F+MN CKSL IV ++ ++LL SG+ I G+ VSFP Q
Sbjct: 733 KFSMNVCKSLGIVHSSGHTLLPSGSDIIKAGGDNTISVSFPFQ 775
>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 810
Score = 831 bits (2146), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/773 (52%), Positives = 548/773 (70%), Gaps = 31/773 (4%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF ++ ++++ F YCDA LPY +R +DLV R+TL EKV+ +GD A G R+GLP Y WW
Sbjct: 50 RFAALGLEMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPPYLWWG 109
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGVS G P GT F VPGATSFP VI + A+FNE+LW IG VSTE RAM
Sbjct: 110 EALHGVSDTG-----PGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAM 164
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG+A LT+WSPNINVVRDPRWGR ETPGEDP+VVGRYA+++VR +QD++G +
Sbjct: 165 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQDIDGAGPGAGA 224
Query: 181 D--SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSS 238
D +RP+K+S+CCKHYAAYD+D W DR FD++V E+DM ETF PFEMCV +GD S
Sbjct: 225 DPFARPIKVSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIETFERPFEMCVRDGDASC 284
Query: 239 VMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
VMCSYNR+NG+P CA+ +LL++T+RG+W HGYIVSDCDS++ +V K+L +A A
Sbjct: 285 VMCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVMVRDAKWLGYNGVEATA 344
Query: 299 RVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS 351
+KAGLDLDCG D++T F + AV+QGK+ E+++D +LR LY+ LMRLG+FDG
Sbjct: 345 AAMKAGLDLDCGMFWEGAQDFFTAFGLDAVRQGKLRESEVDNALRNLYLTLMRLGFFDGI 404
Query: 352 PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG--PHANAT 409
P+ ++LG N++C +H ELAA+AARQG+VL+KND+G LPL+T + +L+LVG H NAT
Sbjct: 405 PELESLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLDTSKVNSLSLVGLLQHINAT 464
Query: 410 KAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
M+G+Y G PCR +P D A KV++ + VC + + AA K DAT++
Sbjct: 465 DVMLGDYRGKPCRVVTPYD---AIRKVVS---ATSMQVCDHGACSTAA--NGKTVDATIV 516
Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
+AGL++SVE EG DR DLLLP QT IN VA+A+ P+ LVI+SAG VD++FA+NNPKI
Sbjct: 517 IAGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIILVIISAGGVDVSFAQNNPKI 576
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FP 586
+I+W GYPGEEGG AIADV+FGKYNPGGRLP+TWY++ Y+ KIP TSM LRPV + +P
Sbjct: 577 GAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKSEYISKIPMTSMALRPVADKGYP 636
Query: 587 GRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP-P 644
GRTYKF+ GP V+YPFG+GLSY+ F Y ++ SV +++ + C+ + GT P
Sbjct: 637 GRTYKFYGGPEVLYPFGHGLSYSNFSYASDTTGASVTVRVGAWESCKQLTRKPGTTAPLA 696
Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFI 703
C AV + CK+ + +F + V N G DG+ VVMVY+ PP + +KQ++ + RVF+
Sbjct: 697 CPAVNVAGHGCKE-EVSFSLTVANRGSRDGAHVVMVYTVPPAEVDDAPLKQLVAFRRVFV 755
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
AG + +V FT+N CK+ IV+ A +++ SG T+LVG+ SF +++ L
Sbjct: 756 PAGAAVQVPFTLNVCKAFAIVEETAYTVVPSGVSTVLVGDDALSFSFSVKIEL 808
>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
Length = 825
Score = 820 bits (2117), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/778 (52%), Positives = 536/778 (68%), Gaps = 31/778 (3%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF ++ + +S F YCDA LPY ER +DLV R++L EKV+ +GD A G PR+GLP Y+WW
Sbjct: 55 RFAALGLDMSRFRYCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPPYKWWG 114
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGVS +G P GT F VPGATSFP VI + A+FNESLW+ IG VSTE RAM
Sbjct: 115 EALHGVSDVG-----PGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAM 169
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDV---EGVEYH 177
YNLG+A LT+WSPNINVVRDPRWGR ETPGEDP+VVGRYA+N+VRG+QDV G
Sbjct: 170 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVVIAAGAAAT 229
Query: 178 RDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVS 237
D SRP+K+S+CCKH+AAYD+D W DR FD++V E+DM ETF PFEMC+ +GD S
Sbjct: 230 ADPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDAS 289
Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV 297
VMCSYNR+NGIP CAD +LL++T+R W HGYIVSDCDS++ +V K+LN T +A
Sbjct: 290 CVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEAT 349
Query: 298 ARVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
A +KAGLDLDCG D++T + + AV+QGKI EAD+D +L +Y LMRLG+FDG
Sbjct: 350 AAAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEADVDNALGNVYTTLMRLGFFDG 409
Query: 351 SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG--PHANA 408
P++++LG +++C H ELAA+AARQG+VLLKND LPL+ I +++LVG H NA
Sbjct: 410 MPEFESLGADDVCTRDHKELAADAARQGMVLLKNDARRLPLDPSKINSVSLVGLLEHINA 469
Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADA 466
T M+G+Y G PCR +P D A +V+N Y C C + A AK ADA
Sbjct: 470 TDVMLGDYRGKPCRIVTPYD---AIRQVVNATYVHACDSGACSTAEGMGRASRTAKIADA 526
Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
T+++AGL++SVE E DR DLLLP Q+ IN VA+A+ P+ LVIMSAG VD++FA+NN
Sbjct: 527 TIVIAGLNMSVERESNDREDLLLPWNQSSWINAVAEASTTPIVLVIMSAGGVDVSFAQNN 586
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VN 583
KI +I+W GYPGEEGG AIADV+FGKYNPGGRLP+TW++ YV +IP TSM LRP +
Sbjct: 587 TKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAAH 646
Query: 584 NFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG--- 639
+PGRTYKF+ GP V+YPFG+GLSYT F Y ++ +V I + + C+ + Y G
Sbjct: 647 GYPGRTYKFYGGPAVLYPFGHGLSYTSFTYASGTTGATVTIPIGAWEHCKMLTYKSGKAP 706
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIGY 698
+ P C A+ + +C D +F + V N G + G VV VY+ PP G KQ++ +
Sbjct: 707 SPSPACPALNVASHRC-DEVVSFSLRVANTGGVGGDHVVPVYTAPPPEVGDAPRKQLVEF 765
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
RVF+ AG + V F +N CK+ IV+ A +++ SG T++VG+ +SF + +NL
Sbjct: 766 RRVFVPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVIVGDDALALSFAVTINL 823
>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
gi|219887469|gb|ACL54109.1| unknown [Zea mays]
gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 835
Score = 818 bits (2113), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/776 (52%), Positives = 534/776 (68%), Gaps = 29/776 (3%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF ++ + +S F YCDA LPY +R +DLV R+ L EKV+ +GD A G PR+GLP Y+WW
Sbjct: 67 RFVALGLDMSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPPYKWWG 126
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGVS +G P GT F VPGATSFP VI + A+FNESLW+ IG VSTE RAM
Sbjct: 127 EALHGVSDVG-----PGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAM 181
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG+A LT+WSPNINVVRDPRWGR ETPGEDP+VVGRYA+N+VRG+QDV+ Y +
Sbjct: 182 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVDDRPYAAAA 241
Query: 181 D--SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSS 238
D SRP+K+S+CCKH+AAYD+D W DR FD++V E+DM ETF PFEMC+ +GD S
Sbjct: 242 DPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASC 301
Query: 239 VMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
VMCSYNR+NGIP CAD +LL++T+R W HGYIVSDCDS++ +V K+LN T +A A
Sbjct: 302 VMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATA 361
Query: 299 RVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS 351
+KAGLDLDCG D++T + + AV+QGKI E D+D +L +Y LMRLG+FDG
Sbjct: 362 AAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNALSNVYTTLMRLGFFDGM 421
Query: 352 PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG--PHANAT 409
P++++LG +N+C H ELAA+AARQG+VLLKND LPL+ I +++LVG H NAT
Sbjct: 422 PEFESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPLDPNKINSVSLVGLLEHINAT 481
Query: 410 KAMIGNYEGTPCRYTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADAT 467
M+G+Y G PCR +P + A ++N Y C C + A AK ADAT
Sbjct: 482 DVMLGDYRGKPCRIVTP---YNAIRNMVNATYVHACDSGACNTAEGMGRASSTAKIADAT 538
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
+++AGL++SVE E DR DLLLP Q+ IN VA A+ P+ LVIMSAG VD++FA NN
Sbjct: 539 IVIAGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIVLVIMSAGGVDVSFAHNNT 598
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNN 584
KI +I+W GYPGEEGG AIADV+FGKYNPGGRLP+TW++ YV +IP TSM LRP
Sbjct: 599 KIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAALG 658
Query: 585 FPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG--TN 641
+PGRTYKF+ GP V+YPFG+GLSYT F Y ++ +V I + + C+ + Y +G +
Sbjct: 659 YPGRTYKFYGGPAVLYPFGHGLSYTNFSYASGTTGATVTIHIGAWEHCKMLTYKMGAPSP 718
Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIGYER 700
P C A+ + C + +F + V N G + G VV VY+ PP G +KQ++ + R
Sbjct: 719 SPACPALNVASHMCSEV-VSFSLRVANTGGVGGDHVVPVYTAPPPEVGDAPLKQLVAFRR 777
Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
VF+ AG + V F +N CK+ IV+ A +++ SG T++VG+ +SFP+ +NL
Sbjct: 778 VFVPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVVVGDDALVLSFPVTINL 833
>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
ARA-I [Oryza sativa Japonica Group]
Length = 818
Score = 810 bits (2091), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/778 (52%), Positives = 538/778 (69%), Gaps = 34/778 (4%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF + + ++ FPYCDA LPY +R +DLV RMTL EKV +GD A G PR+GLP Y WW
Sbjct: 51 RFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRYLWWG 110
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGVS +G P GT F VPGATSFP VI + ASFNE+LW+ IG VSTE RAM
Sbjct: 111 EALHGVSDVG-----PGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAM 165
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG+A LT+WSPNINVVRDPRWGR ETPGEDP+VVGRYA+N+VRG+QD++G +
Sbjct: 166 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASA 225
Query: 181 D------SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
SRP+K+S+CCKHYAAYD+D W G DR FD+RV E+DM ETF PFEMC+ +G
Sbjct: 226 AAATDAFSRPIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETFERPFEMCIRDG 285
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
D S VMCSYNR+NG+P CAD +LL +T+R DW HGYIVSDCDS++ +V K+L T
Sbjct: 286 DASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGV 345
Query: 295 DAVARVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
+A A +KAGLDLDCG D++T + + AV+QGK+ E+ +D +L LY+ LMRLG+
Sbjct: 346 EATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGF 405
Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG--PH 405
FDG P+ ++LG ++C +H ELAA+AARQG+VLLKND LPL+ + ++AL G H
Sbjct: 406 FDGIPELESLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQH 465
Query: 406 ANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNAD 465
NAT M+G+Y G PCR +P DG KV++ A C S A AAK D
Sbjct: 466 INATDVMLGDYRGKPCRVVTPYDGV---RKVVSSTSVHA---CDKGS-CDTAAAAAKTVD 518
Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
AT++VAGL++SVE E DR DLLLP Q IN VA+A+ P+ LVIMSAG VD++FA++
Sbjct: 519 ATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQD 578
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--V 582
NPKI +++W GYPGEEGG AIADV+FGKYNPGGRLP+TWY+ YV KIP TSM LRP
Sbjct: 579 NPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAE 638
Query: 583 NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
+ +PGRTYKF+ G V+YPFG+GLSYT F Y A++ V +K+ + C+ + Y G +
Sbjct: 639 HGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVS 698
Query: 642 KPP-CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYE 699
PP C AV + C++ + +F + V N G DG+ VV +Y+ PP + G KQ++ +
Sbjct: 699 SPPACPAVNVASHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFR 757
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
RV +AAG + +V F +N CK+ IV+ A +++ SG +LVG+ +SFP+Q++L
Sbjct: 758 RVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPVQIDLQ 815
>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
Length = 821
Score = 807 bits (2085), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/781 (51%), Positives = 541/781 (69%), Gaps = 36/781 (4%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVP-RLGLPLYEWW 59
RF S+ + ++ F YCDA LPY ER +DLV R+TL EKV +GD A G R+GLP Y WW
Sbjct: 50 RFASLGLDMAGFRYCDASLPYAERVRDLVGRLTLEEKVANLGDQAKGAEQRVGLPRYMWW 109
Query: 60 SEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA 119
EALHGVS +P GT F VPGATSFP V+ + A+FNE+LW+ IG STE RA
Sbjct: 110 GEALHGVS-----DTNPGGTRFGDVVPGATSFPLVLNSAAAFNETLWRAIGGATSTEIRA 164
Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
MYNLG+A LT+WSPNINVVRDPRWGR ETPGEDP++VGR+A+++VR +QD++
Sbjct: 165 MYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFLVGRFAVSFVRAMQDIDDGANAGA 224
Query: 180 SDSRP----LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
+ P LK+S+CCKHYAAYD+D W G DR FD+ V E+DM ETF PFEMCV +GD
Sbjct: 225 GAADPFARRLKVSSCCKHYAAYDVDKWFGADRLSFDANVQERDMVETFERPFEMCVRDGD 284
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
S VMCSYNR+NG+P CA+ +LL T+R DW HGYIVSDCDS++ +V K+L
Sbjct: 285 ASCVMCSYNRINGVPACANGRLLTGTVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYDGVQ 344
Query: 296 AVARVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
A A +KAGLDLDCG D++T + + AV+QGK+ EA++D +L LY+ LMRLG+F
Sbjct: 345 ATAAAMKAGLDLDCGMFWEGAKDFFTAYGLQAVRQGKLKEAEVDEALGHLYLTLMRLGFF 404
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG--PHA 406
DGSP++++LG +++C +H E+AAEAARQG+VLLKND+ LPL+ + +LALVG H
Sbjct: 405 DGSPEFQSLGASDVCTEEHKEMAAEAARQGMVLLKNDHDRLPLDANKVNSLALVGLLQHI 464
Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNA 464
NAT M+G+Y G PCR +P + A KV++ C C ++ A AAK
Sbjct: 465 NATDVMLGDYRGKPCRVVTP---YEAIRKVVSGTSMQACDKGACGTTAL--GAAIAAKTV 519
Query: 465 DATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAK 524
DAT+++ GL++SVE EG DR DLLLP QT+ IN VA+A++ P+TLVI+SAG VDI+FA+
Sbjct: 520 DATIVITGLNMSVEREGNDREDLLLPWDQTQWINAVAEASRDPITLVIISAGGVDISFAQ 579
Query: 525 NNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVN 583
NNPKI +ILW GYPGEEGG IADV+FGKYNPGGRLP+TWY+ Y+ K+P TSM LRPV
Sbjct: 580 NNPKIGAILWAGYPGEEGGTGIADVLFGKYNPGGRLPLTWYKNEYIGKLPMTSMALRPVA 639
Query: 584 N--FPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDK--DQQCRDINYTV 638
+ +PGRTYKF+ GP V+YPFG+GLSYT F Y ++ SV +K+ + C+++ Y
Sbjct: 640 DKGYPGRTYKFYSGPDVLYPFGHGLSYTNFTYDSYTTGASVTVKIGTAWEDSCKNLTYKP 699
Query: 639 GT--NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQV 695
GT + PC A+ + C++ + +F ++V N G + GS VV VY+ PP + +KQ+
Sbjct: 700 GTTASTAPCPAINVAGHGCQE-EVSFTLKVSNTGGIGGSHVVPVYTAPPAEVDDAPLKQL 758
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLN 755
+ + R+F+ AG + +V FT++ CK+ IV+ A +++ +G +LVG+ SFP++++
Sbjct: 759 VAFRRMFVPAGDAVEVPFTLSVCKAFAIVEGTAYTVVPAGVSRVLVGDESLSFSFPVKID 818
Query: 756 L 756
L
Sbjct: 819 L 819
>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
Length = 876
Score = 798 bits (2062), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/747 (53%), Positives = 508/747 (68%), Gaps = 37/747 (4%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ + +C+ L Y RAKDLV R++L EKVQQ+ + A GVPRLG+P YEWWSEALHGVS +
Sbjct: 37 AKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDV 96
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PG HF+ VPGATSFP ILT ASFN SLW K+G+ VSTEARAM+N+G AGLT
Sbjct: 97 G------PGVHFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLT 150
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
+WSPN+NV RDPRWGR ETPGEDP VV +YA+NYV+GLQDV H SR LK+S+
Sbjct: 151 YWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDV-----HDAGKSRRLKVSS 205
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKHY AYDLDNW+G DRFHFD++VT+QD+++T+ PF+ CV EGDVSSVMCSYNRVNGI
Sbjct: 206 CCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGI 265
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCADP LL IRG W GYIVSDCDSIQ + T+EDAVA LKAGL+++C
Sbjct: 266 PTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHY-TKTREDAVALALKAGLNMNC 324
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
GD+ +T AV+ K+ +D+D +L + YIVLMRLG+FDG P+ + NLG +++C+
Sbjct: 325 GDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKD 384
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA EAA+QGIVLL+N G LPL +K LA++GP+ANATK MI NY G PC+YTSP
Sbjct: 385 HQMLALEAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSP 443
Query: 427 MDGFYAY-SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
+ G Y + I Y PGC D+ C + ++I AA+ A AD TV+V GLD +VEAEG DRV
Sbjct: 444 IQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRV 503
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+L LPG+Q +L+ VA+AAK V LVIMSAG +DI+FAKN I+++LWVGYPGE GG A
Sbjct: 504 NLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDA 563
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
IA VIFG YNP GRLP TWY + K+ T M +RP + FPGR+Y+F+ G +Y FG
Sbjct: 564 IAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFG 623
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
YGLSY+ F V S+P + IK N + NK +V I V C D K
Sbjct: 624 YGLSYSSFSTFVLSAPSIIHIK---------TNPIMNLNK--TTSVDISTVNCHDLKIRI 672
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIA------GTHIKQVIGYERVFIAAGQSAKVGFTMN 716
I V+N G GS VV+V+ KPP + G + Q++G+ERV + + K +
Sbjct: 673 VIGVKNHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFD 732
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGE 743
CK+L +VD L +G H +++G
Sbjct: 733 VCKALSLVDTHGKRKLVTGHHKLVIGS 759
>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
Precursor
gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
Length = 781
Score = 798 bits (2060), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/747 (53%), Positives = 508/747 (68%), Gaps = 37/747 (4%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ + +C+ L Y RAKDLV R++L EKVQQ+ + A GVPRLG+P YEWWSEALHGVS +
Sbjct: 37 AKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDV 96
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PG HF+ VPGATSFP ILT ASFN SLW K+G+ VSTEARAM+N+G AGLT
Sbjct: 97 G------PGVHFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLT 150
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
+WSPN+NV RDPRWGR ETPGEDP VV +YA+NYV+GLQDV H SR LK+S+
Sbjct: 151 YWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDV-----HDAGKSRRLKVSS 205
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKHY AYDLDNW+G DRFHFD++VT+QD+++T+ PF+ CV EGDVSSVMCSYNRVNGI
Sbjct: 206 CCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGI 265
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCADP LL IRG W GYIVSDCDSIQ + T+EDAVA LKAGL+++C
Sbjct: 266 PTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHY-TKTREDAVALALKAGLNMNC 324
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
GD+ +T AV+ K+ +D+D +L + YIVLMRLG+FDG P+ + NLG +++C+
Sbjct: 325 GDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKD 384
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA EAA+QGIVLL+N G LPL +K LA++GP+ANATK MI NY G PC+YTSP
Sbjct: 385 HQMLALEAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSP 443
Query: 427 MDGFYAY-SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
+ G Y + I Y PGC D+ C + ++I AA+ A AD TV+V GLD +VEAEG DRV
Sbjct: 444 IQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRV 503
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+L LPG+Q +L+ VA+AAK V LVIMSAG +DI+FAKN I+++LWVGYPGE GG A
Sbjct: 504 NLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDA 563
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
IA VIFG YNP GRLP TWY + K+ T M +RP + FPGR+Y+F+ G +Y FG
Sbjct: 564 IAQVIFGDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFG 623
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
YGLSY+ F V S+P + IK N + NK +V I V C D K
Sbjct: 624 YGLSYSSFSTFVLSAPSIIHIK---------TNPIMNLNK--TTSVDISTVNCHDLKIRI 672
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIA------GTHIKQVIGYERVFIAAGQSAKVGFTMN 716
I V+N G GS VV+V+ KPP + G + Q++G+ERV + + K +
Sbjct: 673 VIGVKNHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFD 732
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGE 743
CK+L +VD L +G H +++G
Sbjct: 733 VCKALSLVDTHGKRKLVTGHHKLVIGS 759
>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 807
Score = 795 bits (2054), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/789 (50%), Positives = 531/789 (67%), Gaps = 63/789 (7%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF + + +S + YCDAKLPY +R +DL+ MT+ EKV +GD A G PR+GLP Y+WWS
Sbjct: 49 RFAAAGLDMSRYRYCDAKLPYGDRVRDLIGWMTVEEKVSNLGDWAAGAPRVGLPPYKWWS 108
Query: 61 EALHGVSFIGRRTNSPPGTHFD-----------SEVPGATSFPTVILTTASFNESLWKKI 109
EALHG+S G P T FD + V T F VI + ASFNESLW+ I
Sbjct: 109 EALHGLSSTG------PTTKFDDLKKPRLHSGRAAVFNGTVFANVINSAASFNESLWRSI 162
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ +STEARAMYNLG GLT+WSPNINVVRDPRWGR LETPGEDP+VVGRYA+N+VRG+Q
Sbjct: 163 GQAISTEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPFVVGRYAVNFVRGMQ 222
Query: 170 DVE--GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPF 227
DV+ ++ D SRPLK SACCKHYAAYD+D+W G+ RF FD+RVTE+DM ETF PF
Sbjct: 223 DVDDAAAGFNGDPLSRPLKTSACCKHYAAYDVDDWYGHTRFKFDARVTERDMVETFQRPF 282
Query: 228 EMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK 287
EMCV +GD S+VMCSYNRVNGIP CAD +LL T+R DW HGYIVSDCD+++ + ++
Sbjct: 283 EMCVRDGDASAVMCSYNRVNGIPACADARLLAGTLRRDWGLHGYIVSDCDAVRVMTDNAT 342
Query: 288 FLNDTKEDAVARVLKAGLDLDCG------------DYYTNFTMGAVQQGKIAEADIDTSL 335
+L T +A A LKAGLDLDCG D+ + + M AV+QGK+ E+DID +L
Sbjct: 343 WLGYTPAEASAASLKAGLDLDCGESWIVQKGKPVMDFLSTYGMAAVRQGKMRESDIDNAL 402
Query: 336 RFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGN 395
LY LMRLGYFDG P+Y++L + +IC+ H LA + ARQ +VLLKN +G LPL+
Sbjct: 403 VNLYTTLMRLGYFDGMPRYESLDEKDICSEAHRSLALDGARQSMVLLKNLDGLLPLDASK 462
Query: 396 IKTLALVGPHANA-TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMI 454
+ ++A+ GPHA A K M G+Y G PCRY +P +G SK +N
Sbjct: 463 LASVAVRGPHAEAPEKVMDGDYTGPPCRYITPREGI---SKDVNI--------------- 504
Query: 455 PAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
+ + D T+ + G+++ +E EG DR DLLLP QTE I +VA A+ P+ LVI+S
Sbjct: 505 -----SQQGGDVTIYMGGINMHIEREGNDREDLLLPKNQTEEILRVAAASPSPIVLVILS 559
Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIP 573
G +D++FA+++PKI +ILW GYPG EGG AIADVIFG+YNPGGRLP+TW++ Y+ ++P
Sbjct: 560 GGGIDVSFAQSHPKIGAILWAGYPGGEGGHAIADVIFGRYNPGGRLPLTWFKNKYIHQLP 619
Query: 574 YTSMPLRPV--NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQ 630
TSM LRP + +PGRTYKF+DGP V+YPFGYGLSYT+F+Y++ + +V + + +
Sbjct: 620 MTSMALRPRPEHGYPGRTYKFYDGPDVLYPFGYGLSYTKFRYELLNKETAVTLAPGR-RH 678
Query: 631 CRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAG 689
CR ++Y G+ P C AV + C + +F + V N GK DG+ V+VY+ PP +AG
Sbjct: 679 CRQLSYKTGSVGPDCPAVDVASHACAE-TVSFNVSVVNAGKADGANAVLVYTAPPAELAG 737
Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG-VGGV 748
IKQV + RV + AG + V FT+N CK+ IV+ A +++ SG T++V G V
Sbjct: 738 APIKQVAAFRRVAVKAGAAETVVFTLNVCKAFGIVEKTAYTVVPSGVSTVIVENGDSSAV 797
Query: 749 SFPLQLNLN 757
SFP+Q++ +
Sbjct: 798 SFPVQISFS 806
>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 814
Score = 788 bits (2034), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/785 (50%), Positives = 525/785 (66%), Gaps = 56/785 (7%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF + + +S FPYCDA LPY +R +DL+ MT+ EKV +GD+++G PR+GLP Y+WWS
Sbjct: 57 RFAEMGLNMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPPYKWWS 116
Query: 61 EALHGVSFIGRRT-----NSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
EALHGVS G +S PG H + V AT F VI + ASFNE+LW IGQ VS
Sbjct: 117 EALHGVSSTGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSIGQAVS 176
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
TEARAMYNLG GLT+WSPNINVVRDPRWGR LETPGEDPYV GRYA+N+VRG+QD+ G
Sbjct: 177 TEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQDIPG- 235
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D +RP+K SACCKH+AAYD+DNW RF +D+RV+E+DM ETF+ PFEMCV EG
Sbjct: 236 HYSGDPSARPIKTSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETFLRPFEMCVREG 295
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
DVSSVMCSYNRVNG+P CAD +LL+ T+RG+W+ +GYIVSDCD+++ + ++ +LN T
Sbjct: 296 DVSSVMCSYNRVNGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMTDNATWLNFTAA 355
Query: 295 DAVARVLKAGLDLDCG------------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVL 342
++ A L+AG+DLDC DY + + M AV QGK+ E+DID +L LY+ L
Sbjct: 356 ESSAVSLRAGMDLDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDIDNALTNLYMTL 415
Query: 343 MRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALV 402
MRLGYFD P+Y +L + ++C +H LA + ARQGIVLLKND+G LPL+ +A+
Sbjct: 416 MRLGYFDNIPRYASLNETDVCTDEHKSLALDGARQGIVLLKNDHGLLPLDPKKTLAVAVH 475
Query: 403 GPHANA-TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAA 461
GPHA A K M G+Y G PCRY +P G I + +
Sbjct: 476 GPHARAPEKIMDGDYTGPPCRYVTPRQG------------------------ISRDVKIS 511
Query: 462 KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
A T+ + G++L +E EG DR DLLLP QTE I A A+ P+ LVI+S G +DI+
Sbjct: 512 HKAKMTIYLGGINLYIEREGNDREDLLLPKNQTEEILHFAQASPTPIILVILSGGGIDIS 571
Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR 580
FA+ +PKI +ILW GYPG EGG AIADVIFG+YNPGGRLP+TW++ Y+ +IP TSM R
Sbjct: 572 FAQKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIEQIPMTSMEFR 631
Query: 581 PV--NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY- 636
PV +PGRTYKF+DGP V+YPFGYGLSYT+F+Y+ ++ SV + C+ ++Y
Sbjct: 632 PVPEKGYPGRTYKFYDGPEVLYPFGYGLSYTKFQYETSTDGVSVSLPA-PGGHCKGLSYK 690
Query: 637 -TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQ 694
+V T P C AV + D C + +F + V N G G+ VV+VY+ PP +A IKQ
Sbjct: 691 PSVAT-VPACQAVNVADHACTE-TVSFNVSVTNAGGRGGAHVVLVYTAPPPEVAEAPIKQ 748
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV--GEGVGGVSFPL 752
V + RVF+AA +A V F +N CK+ IV+ A +++ SG +LV G+ VSFP+
Sbjct: 749 VAAFRRVFVAARSTATVPFALNVCKAFGIVERTAYTVVPSGVSKVLVENGDSSSSVSFPV 808
Query: 753 QLNLN 757
+++L+
Sbjct: 809 KIDLS 813
>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
[Glycine max]
Length = 901
Score = 783 bits (2022), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/749 (53%), Positives = 519/749 (69%), Gaps = 23/749 (3%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
K S+FP+CD L Y +RAKDLV R+TL EK QQ+ + + G+ RLG+P YEWWSEALHGVS
Sbjct: 30 KTSNFPFCDTSLSYEDRAKDLVSRLTLQEKTQQLVNPSAGISRLGVPAYEWWSEALHGVS 89
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
+G PGT FD +VPGATSFP VIL+ ASFN SLW+K+GQ VSTEARAMYN+ AG
Sbjct: 90 NLG------PGTRFDKKVPGATSFPAVILSAASFNASLWQKMGQVVSTEARAMYNVDLAG 143
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
LTFWSPN+NV RDPRWGR ETPGEDP VV RYA+ Y+RGLQ+VE + + LK+
Sbjct: 144 LTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVMYLRGLQEVED---EASAKADRLKV 200
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
S+CCKHY AYDLDNW+G DRFHFD++VT+QD+++++ PF+ CV EG VSSVMCSYNRVN
Sbjct: 201 SSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDSYQPPFKSCVVEGHVSSVMCSYNRVN 260
Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
GIPTCADP LL IRG W GYIVSDCDS++ + + T EDAVA LKAGL++
Sbjct: 261 GIPTCADPDLLKGIIRGQWGLDGYIVSDCDSVEVYYNAIHY-TATPEDAVALALKAGLNM 319
Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNP 365
+CGD+ +T AV K+ A +D +L + YIVLMRLG+FD S + NLG +++C
Sbjct: 320 NCGDFLKKYTANAVNLKKVDVATVDQALVYNYIVLMRLGFFDDPKSLPFANLGPSDVCTK 379
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ +LA +AA+QGIVLL+N+NGALPL+ NIK LA++GP+ANAT MI NY G PCRYTS
Sbjct: 380 DNQQLALDAAKQGIVLLENNNGALPLSQTNIKKLAVIGPNANATTVMISNYAGIPCRYTS 439
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G Y +NYAPGC+++ C N S+I AA+ AA +ADA V+V GLD S+EAEG DR
Sbjct: 440 PLQGLQKYISSVNYAPGCSNVKCDNQSLIAAAVKAAASADAVVLVVGLDQSIEAEGLDRE 499
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+L LPGFQ + + VA A KG V LVIM+AG +DI+ K+ I ILWVGYPG+ GG A
Sbjct: 500 NLTLPGFQEKFVKDVAGATKGKVILVIMAAGPIDISSTKSVSNIGGILWVGYPGQAGGDA 559
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
IA VIFG YNPGGR P TWY +YV ++P T M +R NFPGRTY+F++G +Y FG
Sbjct: 560 IAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANKSRNFPGRTYRFYNGNSLYEFG 619
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDI--NYTVGTNKPPCA---AVLIDDVKCKD 657
+GLSY+ F VAS+P S+ I+ + ++ + GT + A+ I + C+D
Sbjct: 620 HGLSYSTFSMYVASAPSSIMIENTSISEPHNMLSSNNSGTQVESLSDGQAIDISTINCQD 679
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPG---IAGTHIKQVIGYERVFIAAGQSAKVGFT 714
F I V+N G ++GS VV+V+ +P + G IKQ+IG+ERV + G + V
Sbjct: 680 LTFLLVIGVKNNGPLNGSHVVLVFWEPATSEFVIGAPIKQLIGFERVQVVVGVTEFVTVK 739
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGE 743
++ C+ + VD+ L G HTILVG
Sbjct: 740 IDICQLISNVDSDGKRKLVIGQHTILVGS 768
>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/738 (51%), Positives = 501/738 (67%), Gaps = 33/738 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FP+C + ER KDL+ R+TL EKV+ + + A GVPRLG+ YEWWSEALHGVS +G
Sbjct: 41 FPFCRKSIGIGERVKDLIGRLTLEEKVRLLVNNAAGVPRLGIKGYEWWSEALHGVSNVG- 99
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PGT F + PGATSFP VI T ASFN SLW+ IGQ VS EARAMYN G AGLTFW
Sbjct: 100 -----PGTKFSGDFPGATSFPQVITTAASFNSSLWEAIGQVVSDEARAMYNGGAAGLTFW 154
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
SPN+N+ RDPRWGR ETPGEDP + G+YA YVRGLQ G LK++ACC
Sbjct: 155 SPNVNIFRDPRWGRGQETPGEDPVLAGKYAARYVRGLQGNAGDR---------LKVAACC 205
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+ AYDLDNW G DRFHFD+RV++Q+M++TF +PF CV EG V+SVMCSYN+VNG+PT
Sbjct: 206 KHFTAYDLDNWNGVDRFHFDARVSKQEMEDTFDVPFRSCVVEGKVASVMCSYNQVNGVPT 265
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
CADP LL T+R W+ +GY+VSDCDS+ ++ + N T E+A A +KAGLDLDCG
Sbjct: 266 CADPNLLRNTVRKQWHLNGYVVSDCDSVGVFYDNQHYTN-TPEEAAADAIKAGLDLDCGP 324
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHI 368
+ T A+++G ++EAD+D++L V MRLG FDG P + +LG ++C+P H
Sbjct: 325 FLAVHTQDAIKKGLVSEADVDSALVNTVTVQMRLGMFDGEPSAQPFGDLGPKDVCSPAHQ 384
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
ELA EAARQGIVLLKN +LPL+T + +++A++GP+++A MIGNY G PC YT+P+
Sbjct: 385 ELAIEAARQGIVLLKNHGHSLPLSTRSHRSIAVIGPNSDANVTMIGNYAGIPCEYTTPLQ 444
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
G YS+ I + GCAD+ C + + AIDAA ADATV+V GLD S+EAE KDR DLL
Sbjct: 445 GIGRYSRTI-HQKGCADVACSEDQLFAGAIDAASQADATVLVMGLDQSIEAEAKDRADLL 503
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
LPG Q EL++KVA A++GP LV+MS G VD++FAK +P+I +I+W GYPG+ GG AIAD
Sbjct: 504 LPGRQQELVSKVAMASRGPTVLVLMSGGPVDVSFAKKDPRIAAIVWAGYPGQAGGAAIAD 563
Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
++FG NPGG+LP+TWY Y+ K+P T+M +R P +PGRTY+F+ GPVVY FG+GL
Sbjct: 564 ILFGVANPGGKLPMTWYPQEYLSKVPMTTMAMRAIPSKAYPGRTYRFYKGPVVYRFGHGL 623
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
SYT F + +A +P +V I L N TV A+ + KC ++
Sbjct: 624 SYTNFVHTIAQAPTAVAIPLHGHH-----NTTVSGK-----AIRVTHAKCNRLSIALHLD 673
Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
V+N+G DGS ++V+SKPP KQ++ +E+V +AA +V ++ CK L +VD
Sbjct: 674 VKNVGNKDGSHTLLVFSKPPAGHWAPHKQLVAFEKVHVAARTQQRVQINIHVCKYLSVVD 733
Query: 726 NAANSLLASGAHTILVGE 743
+ + G H + +G+
Sbjct: 734 RSGIRRIPMGQHGLHIGD 751
>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
Length = 768
Score = 781 bits (2018), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/738 (51%), Positives = 500/738 (67%), Gaps = 32/738 (4%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ P+C KLP +R KDL+ R+TL EKV + + A V RLG+ YEWWSEALHGVS +G
Sbjct: 39 NLPFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIKGYEWWSEALHGVSNVG 98
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PGT F PGATSFP VI T ASFN +LW+ IG+ VS EARAMYN G AGLT+
Sbjct: 99 ------PGTKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAMYNGGAAGLTY 152
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N++RDPRWGR ETPGEDP +VG+YA +YV+GLQ +D LK++AC
Sbjct: 153 WSPNVNILRDPRWGRGQETPGEDPLLVGKYAASYVKGLQG---------NDGERLKVAAC 203
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLDNW G DRFHF+++V++QDM++TF +PF MCV EG V+SVMCSYN+VNGIP
Sbjct: 204 CKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVMCSYNQVNGIP 263
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
TCADP LL +T+R W +GYIVSDCDS+ + + T E+A A +KAGLDLDCG
Sbjct: 264 TCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHY-TSTPEEAAADAIKAGLDLDCG 322
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
+ T AV++G I+EAD++ +L V MRLG FDG P Y NLG ++C P H
Sbjct: 323 PFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNLGPKDVCTPAH 382
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
ELA EA RQGIVLLKN +LPL+ +T+A++GP++N T MIGNY G C+YT+P+
Sbjct: 383 QELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYAGVACQYTTPL 442
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G +Y+K I + GCAD+ C + + AIDAA+ ADATV+V GLD S+EAE +DR L
Sbjct: 443 QGIGSYAKTI-HQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIEAEFRDRTGL 501
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q EL++KVA A+KGP LV+MS G +D++FAK +PKI +ILW GYPG+ GG AIA
Sbjct: 502 LLPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYPGQAGGAAIA 561
Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYG 604
DV+FG NPGG+LP+TWY Y+ +P T M +R + +PGRTY+F+ G VVYPFG+G
Sbjct: 562 DVLFGTINPGGKLPMTWYPQEYITNLPMTEMAMRSSQSKGYPGRTYRFYQGKVVYPFGHG 621
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
+SYT F + +AS+P V + LD + G A+ + KC Q+
Sbjct: 622 MSYTHFVHNIASAPTMVSVPLDGHR---------GNTSISGKAIRVTHTKCNKLSLGIQV 672
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
+V+N+G DG+ ++VYS PP + KQ++ +ERV ++AG +VG +++ CK L +V
Sbjct: 673 DVKNVGSKDGTHTLLVYSAPPAGRWSPHKQLVAFERVHVSAGTQERVGISIHVCKLLSVV 732
Query: 725 DNAANSLLASGAHTILVG 742
D + + G H+I +G
Sbjct: 733 DRSGIRRIPIGEHSIHIG 750
>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
Length = 765
Score = 781 bits (2016), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/742 (50%), Positives = 508/742 (68%), Gaps = 34/742 (4%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
++FP+C A LP P R DL+ R+TL EKV + + A VPR+G+ YEWWSEALHGVS +
Sbjct: 33 NNFPFCKASLPIPTRVNDLIGRLTLQEKVSMLVNNAAAVPRVGIKGYEWWSEALHGVSNV 92
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PGT F + P ATSFP VI T ASFN SLW+ IG+ S EARAMYN G AGLT
Sbjct: 93 G------PGTKFAGQFPAATSFPQVITTVASFNASLWEAIGRVASDEARAMYNGGTAGLT 146
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
+WSPN+N+ RDPRWGR ETPGEDP + G+YA +YVRGLQ +DS LK++A
Sbjct: 147 YWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQG---------TDSSRLKVAA 197
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CKH+ AYDLDNW G DRFHF+++V++QDM++TF +PF MCV EG+V+SVMCSYN+VNG+
Sbjct: 198 SCKHFTAYDLDNWNGVDRFHFNAKVSKQDMEDTFNVPFRMCVKEGNVASVMCSYNQVNGV 257
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCADP LL +TIRG W+ GYIVSDCDS+ + +++ T E+A A +KAGLDLDC
Sbjct: 258 PTCADPNLLKRTIRGQWHLDGYIVSDCDSVG-VFYTNQHYTSTPEEAAADAIKAGLDLDC 316
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G + T AV++G + E D++ +L V MRLG FDG P Y NLG ++C P
Sbjct: 317 GPFLAQHTQNAVKKGLLTETDVNGALANTLTVQMRLGMFDGEPSAQPYGNLGPTDVCTPT 376
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H ELA +AARQGIVLLKN +LPL+T N +T+A++GP++NAT MIGNY G C YTSP
Sbjct: 377 HQELALDAARQGIVLLKNTGPSLPLSTKNHQTVAVIGPNSNATVTMIGNYAGIACGYTSP 436
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G Y++ I + PGCA++ C ++ +A++AA+ ADATV+V GLD S+EAE DR
Sbjct: 437 LQGIGKYARTI-HEPGCANVACNDDKQFGSALNAARQADATVLVMGLDQSIEAEMVDRTG 495
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q +L++KVA A++GP LV+MS G +DI FAKN+P+I ILW GYPG+ GG AI
Sbjct: 496 LLLPGHQQDLVSKVAAASRGPTILVLMSGGPIDITFAKNDPRIMGILWAGYPGQAGGAAI 555
Query: 547 ADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGY 603
AD++FG NPG +LP+TWY Y+K + T+M +RP ++ +PGRTY+F++GPVVYPFGY
Sbjct: 556 ADILFGTTNPGAKLPMTWYPQGYLKNLAMTNMAMRPSSSTGYPGRTYRFYNGPVVYPFGY 615
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F + +AS+PK V + +D ++ N AA+ + +C
Sbjct: 616 GLSYTNFVHTLASAPKVVSVPVDGHRRGNSSNK---------AAIRVTHARCGKLSIRLD 666
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSL 721
I+V+N+G DG+ ++V+S PP G KQ++ +E+V++ A +V ++ CK L
Sbjct: 667 IDVKNVGSKDGTNTLLVFSVPPTGNGHWAPQKQLVAFEKVYVPAKAQQRVRINIHVCKLL 726
Query: 722 KIVDNAANSLLASGAHTILVGE 743
+VD + + GAH+I +G+
Sbjct: 727 SVVDKSGTRRIPMGAHSIHIGD 748
>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
Length = 809
Score = 780 bits (2015), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/786 (50%), Positives = 523/786 (66%), Gaps = 56/786 (7%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF + + +S FPYCDA LPY +R +DL+ MT+ EKV +GD+++G PR+GLP Y+WWS
Sbjct: 50 RFAEMGLNMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDVSHGAPRVGLPPYKWWS 109
Query: 61 EALHGVSFIGRRT-----NSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
EALHGVS G +S PG H + V AT F VI + ASFNE+LWK IGQ VS
Sbjct: 110 EALHGVSSTGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWKSIGQAVS 169
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
TEARAMYNLG GLT+WSPNINVVRDPRWGR LETPGEDP+V GRYA+N+VRG+QD+ G
Sbjct: 170 TEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPFVAGRYAVNFVRGMQDIPGH 229
Query: 175 EYHRDSDS-RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNE 233
+ D S RP+K SACCKHYAAYD+D+W + RF FD+RV+E+DM ETF+ PFEMCV +
Sbjct: 230 DGGGDDPSTRPIKTSACCKHYAAYDVDDWHNHTRFTFDARVSERDMAETFLRPFEMCVRD 289
Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
GD S VMCSYNRVNGIP CAD +LL+ TIRGDW HGYIVSDCD+++ + ++ +L+ T
Sbjct: 290 GDASGVMCSYNRVNGIPACADARLLSGTIRGDWQLHGYIVSDCDAVRVMTDNATWLHFTG 349
Query: 294 EDAVARVLKAGLDLDCG------------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIV 341
++ A ++AGLDLDC D+ + + AV QGK+ E+DID++LR Y+
Sbjct: 350 AESSAASIRAGLDLDCAESWIEEKGRPLRDFLSEYGKAAVAQGKMRESDIDSALRNQYMT 409
Query: 342 LMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLAL 401
LMRLGYFD P+Y +L + +IC +H LA + ARQG+VLLKND+G LPL+ I +A+
Sbjct: 410 LMRLGYFDNIPRYASLNETDICTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKILAVAV 469
Query: 402 VGPHANA-TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDA 460
GPHA A K M G+Y G PCRY +P G I +
Sbjct: 470 HGPHARAPEKIMDGDYTGPPCRYVTPRQG------------------------ISKDVKI 505
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
+ A+ T+ + G++L +E EG DR DLLLP QTE I A A+ P+ LVI+S G +DI
Sbjct: 506 SHRANTTIYLGGINLHIEREGNDREDLLLPKNQTEEILHFAKASPNPIILVILSGGGIDI 565
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPL 579
+FA +PKI +ILW GYPG EGG AIADVIFG+YNPGGRLP+TW++ Y+ +IP TSM
Sbjct: 566 SFAHKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIQQIPMTSMEF 625
Query: 580 RPV--NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
RPV +PGRTYKF+DGP V+YPFGYGLSYT+F Y+ +++ +V + C+ ++Y
Sbjct: 626 RPVPEKGYPGRTYKFYDGPEVLYPFGYGLSYTKFLYETSTNGTAVTLPA-TGGHCKGLSY 684
Query: 637 --TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIK 693
+V T P C AV + C + +F I V N G G+ VV+VY+ PP +A IK
Sbjct: 685 KPSVATT-PACQAVDVAGHACTE-TVSFNISVTNAGGRGGAHVVLVYTAPPPEVAQAPIK 742
Query: 694 QVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV--GEGVGGVSFP 751
QV + RVF+ A +A V FT+N CK+ IV+ A +++ SG +LV G+ VSFP
Sbjct: 743 QVAAFRRVFVPARSTATVPFTLNVCKAFGIVERTAYTVVPSGVSKVLVQNGDSSSSVSFP 802
Query: 752 LQLNLN 757
++++ +
Sbjct: 803 VKIDFS 808
>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
Length = 781
Score = 777 bits (2006), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/750 (53%), Positives = 513/750 (68%), Gaps = 37/750 (4%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
K S+FP+C+ L Y RAKDLV R+TL EK QQ+ + + G+ RLG+P YEWWSEALHGVS
Sbjct: 32 KTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQLVNPSTGISRLGVPAYEWWSEALHGVS 91
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
+G PGT FDS VPGATSFP VIL+ ASFNE+LW +GQ VS EARAMYN+ AG
Sbjct: 92 NVG------PGTRFDSRVPGATSFPAVILSAASFNETLWYTMGQVVSNEARAMYNVDLAG 145
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
LTFWSPN+NV RDPRWGR ETPGEDP VV RYA+NYVRGLQ+V G E D LK+
Sbjct: 146 LTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GDEASAKGDR--LKV 202
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
S+CCKHY AYD+DNW+G DRFHFD++VT+QD+++T+ PF+ CV EG VSSVMCSYNRVN
Sbjct: 203 SSCCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFKSCVLEGHVSSVMCSYNRVN 262
Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
GIPTCADP LL IRG W GYIVSDCDS++ S + T EDAVA LKAGL++
Sbjct: 263 GIPTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHY-TKTPEDAVALALKAGLNM 321
Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNP 365
+CGD+ +T AV K+ + +D +L + YIVLMRLG+F+ S + NLG +++C
Sbjct: 322 NCGDFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFENPKSLPFANLGPSDVCTK 381
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
++ +LA EAA+QGIVLL+N+ GALPL+ IK LA++GP+ANAT MI NY G PCRY+S
Sbjct: 382 ENQQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNANATTVMISNYAGIPCRYSS 441
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G Y + YA GC+D+ C N ++ AA+ AA +ADA V+V GLD S+EAEG DRV
Sbjct: 442 PLQGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADAVVLVVGLDQSIEAEGLDRV 501
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+L LPGFQ +L+ VA A KG + LVIM+AG +DI+F K+ I ILWVGYPG++GG A
Sbjct: 502 NLTLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSVSNIGGILWVGYPGQDGGNA 561
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
IA VIFG YNPGGR P TWY +YV ++P T M +R NFPGRTY+F++G +Y FG
Sbjct: 562 IAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANSSRNFPGRTYRFYNGKSLYEFG 621
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD-------VKC 655
YGLSY+ F +AS+P + I L K+ P + +DD + C
Sbjct: 622 YGLSYSTFSTHIASAPST--IMLQKNTSISK----------PLNNIFLDDQVIDISTISC 669
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPP---GIAGTHIKQVIGYERVFIAAGQSAKVG 712
+ F+ I V+N G DGS VV+V+ +PP ++G +KQ+IG+ER + G++ V
Sbjct: 670 FNLTFSLVIGVKNNGPFDGSHVVLVFLEPPSSEAVSGVPLKQLIGFERAQVKVGKTEFVT 729
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
++ CK L VD+ L G H ILVG
Sbjct: 730 VKIDICKMLSNVDSDGKRKLVIGQHNILVG 759
>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
Length = 775
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/740 (51%), Positives = 507/740 (68%), Gaps = 30/740 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
+S + +CD L +R DLV+R+TL EK+ +G+ A V RLG+P YEWWSEALHGVS
Sbjct: 50 VSSYGFCDKSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSN 109
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
IG PGTHF S VPGATSFP ILT ASFN SL++ IG VS EARAMYN+G AGL
Sbjct: 110 IG------PGTHFSSLVPGATSFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLAGL 163
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPNIN+ RDPRWGR ETPGEDP + +YA YV+GLQ + D DS LK++
Sbjct: 164 TYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQQTD------DGDSDKLKVA 217
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G R+ FD+ V++QD+ +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 218 ACCKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNG 277
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL IRG W +GYIVSDCDS++ + + + T E+A A+ + +GLDLD
Sbjct: 278 KPTCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLFKDQHY-TKTPEEAAAKTILSGLDLD 336
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y +T GAV+QG + EA I+ ++ + LMRLG+FDG P Y NLG ++C P
Sbjct: 337 CGSYLGQYTGGAVKQGLVDEASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTP 396
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
++ ELA EAARQGIVLLKN G+LPL++ IK+LA++GP+ANAT+ MIGNYEG PC+YTS
Sbjct: 397 ENQELAREAARQGIVLLKNSPGSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTS 456
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A+ +YAPGC D+ C N + I A A +ADAT+IV G +L++EAE DRV
Sbjct: 457 PLQGLTAFVPT-SYAPGCPDVQCAN-AQIDDAAKIAASADATIIVVGANLAIEAESLDRV 514
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
++LLPG Q +L+N+VA+ +KGPV LVIMS G +D++FAK N KI SILWVGYPGE GG A
Sbjct: 515 NILLPGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAA 574
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
IADVIFG YNP GRLP+TWY +YV KIP T+M +R P +PGRTY+F+ G V+ FG
Sbjct: 575 IADVIFGSYNPSGRLPMTWYPQSYVEKIPMTNMNMRSDPATGYPGRTYRFYKGETVFSFG 634
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
G+S+ ++K+ +P+ V + L +D +CR + C ++ + D C++ F
Sbjct: 635 DGMSFGTVEHKIVKAPQLVSVPLAEDHECRSLE---------CKSLDVADEHCQNLAFDI 685
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V+NMGKM S V+++ PP + K ++G+E+V +A V F ++ C L
Sbjct: 686 HLSVKNMGKMSSSHSVLLFFTPPNVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLS 745
Query: 723 IVDNAANSLLASGAHTILVG 742
+VD N + G H + VG
Sbjct: 746 VVDELGNRKVPLGDHMLHVG 765
>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 773 bits (1997), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/740 (51%), Positives = 506/740 (68%), Gaps = 30/740 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +CD L +R DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS
Sbjct: 51 LAGYGFCDKSLSLEDRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSN 110
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGTHF S VPGATSFP ILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 111 VG------PGTHFSSLVPGATSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVGLAGL 164
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPNIN+ RDPRWGR ETPGEDP + +YA YV+GLQ + D DS LK++
Sbjct: 165 TYWSPNINIFRDPRWGRGQETPGEDPLLSSKYATGYVKGLQQTD------DGDSNKLKVA 218
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDLDNW+G R+ F++ VT+QDM +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 219 ACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNG 278
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL IRG+W +GYIVSDCDS++ + + + T E+A A + AGLDL+
Sbjct: 279 KPTCADPDLLKGVIRGEWKLNGYIVSDCDSVEVLFKDQHY-TKTPEEAAAETILAGLDLN 337
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG+Y +T GAV+QG + EA I+ ++ + LMRLG+FDG P Y NLG N++C
Sbjct: 338 CGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGFFDGDPSKQTYGNLGPNDVCTS 397
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
++ ELA EAARQGIVLLKN G+LPLN IK+LA++GP+ANAT+ MIGNYEG PC Y S
Sbjct: 398 ENRELAREAARQGIVLLKNSLGSLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCNYIS 457
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ A +YA GC ++ C N+ + A A +ADATVIV G L++EAE DR+
Sbjct: 458 PLQALTALVPT-SYAAGCPNVQCA-NAELDDATQIAASADATVIVVGASLAIEAESLDRI 515
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
++LLPG Q L+++VA+A+KGPV LVIMS G +D++FAK+N KI SILWVGYPGE GG A
Sbjct: 516 NILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKSNDKITSILWVGYPGEAGGAA 575
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
IADVIFG YNP GRLP+TWY +YV K+P T+M +R P +PGRTY+F+ G V+ FG
Sbjct: 576 IADVIFGFYNPSGRLPMTWYPQSYVNKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFG 635
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
G+S++ ++K+ +P+ V + L +D +CR C ++ + D C++ F
Sbjct: 636 DGISFSNIEHKIVKAPQLVSVPLAEDHECR---------SSECMSLDVADEHCQNLAFDI 686
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V+NMGKM S VV+++ PP + K ++G+E+V + A+V F ++ CK L
Sbjct: 687 HLGVKNMGKMSSSHVVLLFFTPPDVHNAPQKHLLGFEKVHLPGKSEAQVRFKVDICKDLS 746
Query: 723 IVDNAANSLLASGAHTILVG 742
+VD N + G H + VG
Sbjct: 747 VVDELGNRKVPLGQHLLHVG 766
>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 775
Score = 773 bits (1997), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/748 (50%), Positives = 508/748 (67%), Gaps = 32/748 (4%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ P+C A L PER KDLV R+TL EKV+ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 42 NMPFCKASLAIPERVKDLVGRLTLQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVG 101
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PG F+++ PGATSFP VI T ASFN SLW+ IGQ VS EARAMYN G AGLT+
Sbjct: 102 ------PGVKFNAQFPGATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTY 155
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP + G YA +YVRGLQ +D LK++AC
Sbjct: 156 WSPNVNIFRDPRWGRGQETPGEDPVLAGTYAASYVRGLQG---------TDGNRLKVAAC 206
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLDNW G DRFHF+++V++QD++ETF +PF MCV+EG V+SVMCSYN+VNG+P
Sbjct: 207 CKHFTAYDLDNWNGMDRFHFNAQVSKQDIEETFDVPFRMCVSEGKVASVMCSYNQVNGVP 266
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
TCADP LL +T+RG W GYIVSDCDS+ ++ + T E+A A +KAGLDLDCG
Sbjct: 267 TCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHY-TPTPEEAAADAIKAGLDLDCG 325
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
+ T AV++G ++EAD++ +L V MRLG FDG P Y LG ++C P H
Sbjct: 326 PFLAVHTQNAVEKGLLSEADVNGALVNTLTVQMRLGMFDGEPSAHAYGKLGPKDVCKPAH 385
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
ELA EAARQGIVLLKN LPL+ T+A++GP++ AT MIGNY G C YT+P+
Sbjct: 386 QELALEAARQGIVLLKNTGPVLPLSPQRHHTVAVIGPNSKATVTMIGNYAGVACGYTNPL 445
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y+K I + GC ++ C+N+ + +AI+AA+ ADATV+V GLD S+EAE DR L
Sbjct: 446 QGIGRYAKTI-HQLGCENVACKNDKLFGSAINAARQADATVLVMGLDQSIEAETVDRTGL 504
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q +L++KVA A+KGP LVIMS G+VDI FAKNNP+I ILW GYPG+ GG AIA
Sbjct: 505 LLPGRQQDLVSKVAAASKGPTILVIMSGGSVDITFAKNNPRIVGILWAGYPGQAGGAAIA 564
Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYG 604
D++FG NPGG+LP+TWY Y+ K+P T+M +R + +PGRTY+F++GPVVYPFG+G
Sbjct: 565 DILFGTTNPGGKLPVTWYPQEYLTKLPMTNMAMRGSKSAGYPGRTYRFYNGPVVYPFGHG 624
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
L+YT F + +AS+P V + L+ R N T +N+ A+ + +C + ++
Sbjct: 625 LTYTHFVHTLASAPTVVSVPLNGH---RRANVTNISNR----AIRVTHARCDKLSISLEV 677
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+++N+G DG+ ++V+S PP G KQ++ +E++ + A +VG ++ CK L
Sbjct: 678 DIKNVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKIHVPAKGLQRVGVNIHVCKLLS 737
Query: 723 IVDNAANSLLASGAHTILVGEGVGGVSF 750
+VD + + G H+ +G+ VS
Sbjct: 738 VVDKSGIRRIPLGEHSFNIGDVKHSVSL 765
>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
Length = 763
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/743 (50%), Positives = 507/743 (68%), Gaps = 34/743 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C +P PER +DL+ R+TL EKV +G+ A +PRLG+ YEWWSEALHGVS +G
Sbjct: 39 FCQLSVPIPERVRDLIGRLTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVG--- 95
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G GLT+WSP
Sbjct: 96 ---PGTKFGGVYPAATSFPQVITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSP 152
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N++RDPRWGR ETPGEDP V G+YA +YVRGLQ +D LK++ACCKH
Sbjct: 153 NVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQG---------NDRSRLKVAACCKH 203
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV EG+V+S+MCSYN+VNG+PTCA
Sbjct: 204 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCA 263
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL +TIR W +GYIVSDCDS+ + ++ + T E+A A +KAGLDLDCG +
Sbjct: 264 DPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHY-TGTPEEAAADSIKAGLDLDCGPFL 322
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIEL 370
T+ AV++ + E+D+D +L V MRLG FDG + Y +LG ++C P H L
Sbjct: 323 GAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGL 382
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAA+QGIVLLKN +LPL++ +T+A++GP+++AT MIGNY G C YTSP+ G
Sbjct: 383 ALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGI 442
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y++ I + GC D+ C ++ + AA++AA+ ADATV+V GLD S+EAE KDR LLLP
Sbjct: 443 TGYARTI-HQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLP 501
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL+++VA AAKGPV LV+MS G +DI+FA+ + KI +I+W GYPG+EGG AIAD++
Sbjct: 502 GKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADIL 561
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSY 607
FG NPGG+LP+TWY +Y+ +P T M +RPV++ PGRTY+F+DGPVVYPFG+GLSY
Sbjct: 562 FGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSY 621
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T+F + +A +PK + I + R N TV ++ + +C +EV
Sbjct: 622 TRFTHNIADAPKVIPIAV------RGRNGTVSGK-----SIRVTHARCDRLSLGVHVEVT 670
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N+G DG+ ++V+S PPG KQ++ +ERV +A G+ +V ++ CK L +VD A
Sbjct: 671 NVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRA 730
Query: 728 ANSLLASGAHTILVGEGVGGVSF 750
N + G H I +G+ VS
Sbjct: 731 GNRRIPIGDHGIHIGDESHTVSL 753
>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
Precursor
gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
Length = 768
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/743 (50%), Positives = 507/743 (68%), Gaps = 34/743 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C +P PER +DL+ R+TL EKV +G+ A +PRLG+ YEWWSEALHGVS +G
Sbjct: 44 FCQLSVPIPERVRDLIGRLTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVG--- 100
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G GLT+WSP
Sbjct: 101 ---PGTKFGGVYPAATSFPQVITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSP 157
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N++RDPRWGR ETPGEDP V G+YA +YVRGLQ +D LK++ACCKH
Sbjct: 158 NVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQG---------NDRSRLKVAACCKH 208
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV EG+V+S+MCSYN+VNG+PTCA
Sbjct: 209 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCA 268
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL +TIR W +GYIVSDCDS+ + ++ + T E+A A +KAGLDLDCG +
Sbjct: 269 DPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHY-TGTPEEAAADSIKAGLDLDCGPFL 327
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIEL 370
T+ AV++ + E+D+D +L V MRLG FDG + Y +LG ++C P H L
Sbjct: 328 GAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGL 387
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAA+QGIVLLKN +LPL++ +T+A++GP+++AT MIGNY G C YTSP+ G
Sbjct: 388 ALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGI 447
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y++ I + GC D+ C ++ + AA++AA+ ADATV+V GLD S+EAE KDR LLLP
Sbjct: 448 TGYARTI-HQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLP 506
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL+++VA AAKGPV LV+MS G +DI+FA+ + KI +I+W GYPG+EGG AIAD++
Sbjct: 507 GKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADIL 566
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSY 607
FG NPGG+LP+TWY +Y+ +P T M +RPV++ PGRTY+F+DGPVVYPFG+GLSY
Sbjct: 567 FGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSY 626
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T+F + +A +PK + I + R N TV ++ + +C +EV
Sbjct: 627 TRFTHNIADAPKVIPIAV------RGRNGTVSGK-----SIRVTHARCDRLSLGVHVEVT 675
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N+G DG+ ++V+S PPG KQ++ +ERV +A G+ +V ++ CK L +VD A
Sbjct: 676 NVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRA 735
Query: 728 ANSLLASGAHTILVGEGVGGVSF 750
N + G H I +G+ VS
Sbjct: 736 GNRRIPIGDHGIHIGDESHTVSL 758
>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
Length = 816
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/790 (50%), Positives = 518/790 (65%), Gaps = 66/790 (8%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF + + +++F YCDA LPY +R +DL+ RMT+ EKV +GD G R+GLP Y WWS
Sbjct: 57 RFAGLGLNMTEFRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWS 116
Query: 61 EALHGVSFIGRRTNSPPGTHFD-----------SEVPGATSFPTVILTTASFNESLWKKI 109
EALHG+S G P T FD S V AT F VI + ASFNE+LWK I
Sbjct: 117 EALHGLSSTG------PTTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSI 170
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ VSTEARAMYN+G GLT+WSPNINVVRDPRWGR LETPGEDPYVVGRYA+N+VRG+Q
Sbjct: 171 GQAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ 230
Query: 170 DV---EGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILP 226
D+ E V D ++RPLK SACCKHYAAYDLD+W + RF FD+RV E+DM ETF P
Sbjct: 231 DIPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRP 290
Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESH 286
FEMCV +GDVSSVMCSYNRVNGIP CAD +LL+QTIR DW HGYIVSDCD+++ + ++
Sbjct: 291 FEMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNA 350
Query: 287 KFLNDTKEDAVARVLKAGLDLDCG-------------DYYTNFTMGAVQQGKIAEADIDT 333
+L T +A A LKAGLDLDCG D+ T + M AV +GK+ E+DID
Sbjct: 351 TWLGYTGAEASAAALKAGLDLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDN 410
Query: 334 SLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNT 393
+L Y+ LMRLGYFD QY +LG+ +IC QH LA + ARQGIVLLKNDN LPL+
Sbjct: 411 ALTNQYMTLMRLGYFDDIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDA 470
Query: 394 GNIKTLALVGPHANA-TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNS 452
+ + + GPH A K M G+Y G PCRY +P G Y +
Sbjct: 471 NKVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYVRF---------------- 514
Query: 453 MIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
+ A+ T+ GL+L++E EG DR D+LLP QTE I +VA A+ P+ LVI
Sbjct: 515 --------SHRANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVI 566
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-K 571
+S G +D++FA+NNPKI +ILW GYPG EGG AIADVIFGK+NP GRLP+TW++ Y+ +
Sbjct: 567 LSGGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQ 626
Query: 572 IPYTSMPLRPV--NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
+P TSM LRPV + +PGRTYKF+DGP V+YPFGYGLSYT+F Y++ ++ ++ + +
Sbjct: 627 LPMTSMDLRPVAKHGYPGRTYKFYDGPDVLYPFGYGLSYTKFLYEMGTNGTALIVPV-AG 685
Query: 629 QQCRDINYTVG-TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG- 686
C+ ++Y G + P C A+ ++ C + +F + V N G GS V+V+SKPP
Sbjct: 686 GHCKKLSYKSGVSTAPACPAINVNGHVCTE-TVSFNVSVTNGGDTGGSHPVIVFSKPPAE 744
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVG 746
+ +KQV+ ++ VF+ A + V F +N CK+ IV+ A +++ SG TILV
Sbjct: 745 VDDAPMKQVVAFKSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTILVENVDS 804
Query: 747 GVSFPLQLNL 756
VSFP++++
Sbjct: 805 SVSFPVKIDF 814
>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
Length = 816
Score = 769 bits (1985), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/792 (50%), Positives = 518/792 (65%), Gaps = 67/792 (8%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF + + +++F YCDA LPY +R +DL+ RMT+ EKV +GD G R+GLP Y WWS
Sbjct: 56 RFAGLGLNMTEFRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWS 115
Query: 61 EALHGVSFIGRRTNSPPGTHFD-----------SEVPGATSFPTVILTTASFNESLWKKI 109
EALHG+S G P T FD S V AT F VI + ASFNE+LWK I
Sbjct: 116 EALHGLSSTG------PTTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSI 169
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ VSTEARAMYN+G GLT+WSPNINVVRDPRWGR LETPGEDPYVVGRYA+N+VRG+Q
Sbjct: 170 GQAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ 229
Query: 170 DV---EGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILP 226
D+ E V D ++RPLK SACCKHYAAYDLD+W + RF FD+RV E+DM ETF P
Sbjct: 230 DIPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRP 289
Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESH 286
FEMCV +GDVSSVMCSYNRVNGIP CAD +LL+QTIR DW HGYIVSDCD+++ + ++
Sbjct: 290 FEMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNA 349
Query: 287 KFLNDTKEDAVARVLKAGLDLDCG-------------DYYTNFTMGAVQQGKIAEADIDT 333
+L T +A A LKAGLDLDCG D+ T + M AV +GK+ E+DID
Sbjct: 350 TWLGYTGAEASAAALKAGLDLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDN 409
Query: 334 SLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNT 393
+L Y+ LMRLGYFD QY +LG+ +IC QH LA + ARQGIVLLKNDN LPL+
Sbjct: 410 ALTNQYMTLMRLGYFDDITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDA 469
Query: 394 GNIKTLALVGPHANA-TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNS 452
+ + + GPH A K M G+Y G PCRY +P G Y +
Sbjct: 470 NKVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYVRF---------------- 513
Query: 453 MIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
+ A+ T+ GL+L++E EG DR D+LLP QTE I +VA A+ P+ LVI
Sbjct: 514 --------SHRANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVI 565
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-K 571
+S G +D++FA+NNPKI +ILW GYPG EGG AIADVIFGK+NP GRLP+TW++ Y+ +
Sbjct: 566 LSGGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQ 625
Query: 572 IPYTSMPLRPV--NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
+P TSM LRPV + +PGRTYKF++GP V+YPFGYGLSYT+F Y++ ++ ++ + +
Sbjct: 626 LPMTSMDLRPVAKHGYPGRTYKFYNGPDVLYPFGYGLSYTKFLYEMGTNGTALTVPV-AG 684
Query: 629 QQCRDINYTVGTNK--PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
C+ ++Y G + P C A+ ++ C + +F + V N G GS V+V+SKPP
Sbjct: 685 GHCKKLSYKSGVSSAAPACPAINVNGHACTE-TVSFNVSVTNGGDTGGSHPVIVFSKPPA 743
Query: 687 -IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
+ IKQV+ + VF+ A + V F +N CK+ IV+ A +++ SG T+LV
Sbjct: 744 EVDDAPIKQVVAFRSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTVLVENVD 803
Query: 746 GGVSFPLQLNLN 757
VSFP++++ +
Sbjct: 804 SSVSFPVKISFS 815
>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
Length = 865
Score = 768 bits (1984), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/747 (51%), Positives = 499/747 (66%), Gaps = 50/747 (6%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ + +C+ L Y RAKDLV R++L EKVQQ+ + A GV RLG+P YEWWSEALHGVS +
Sbjct: 37 AKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVSRLGVPPYEWWSEALHGVSDV 96
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PG F+ VPGATSFP ILT ASFN SLW K+G+ VSTEARAM+N+G AGLT
Sbjct: 97 G------PGVRFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLT 150
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
+WSPN+N+ RDPRWGR ETPGEDP VV +YA+NYV+GLQDV+ SR LK+S+
Sbjct: 151 YWSPNVNIFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDVQDA-----GKSRRLKVSS 205
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKHY AYDLDNW+G DRFHFD++VT+QD+++T+ PF+ CV EGDVSSVMCSYNRVNGI
Sbjct: 206 CCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQPPFKSCVEEGDVSSVMCSYNRVNGI 265
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCADP LL IRG W GYIVSDCDSIQ + + K L+++C
Sbjct: 266 PTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFDDIHY------------TKTRLNMNC 313
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
GD+ +T AV+ K+ +++D +L + YIVLMRLG+FDG P+ + LG +++C+
Sbjct: 314 GDFLGKYTENAVKLKKLNGSEVDEALIYNYIVLMRLGFFDGDPKSLPFGQLGPSDVCSKD 373
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA EAA+QGIVLL+N G LPL+ +K +A++GP+ANATK MI NY G PC+YTSP
Sbjct: 374 HQMLALEAAKQGIVLLEN-RGDLPLSKTAVKKIAVIGPNANATKVMISNYAGVPCKYTSP 432
Query: 427 MDGFYAY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
+ G Y KV+ Y PGC D+ C ++I AA+ A AD TV+V GLD +VEAEG DR
Sbjct: 433 LQGLQKYVPEKVV-YEPGCKDVNCGEQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDR 491
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
V+L LPG+Q +L+ VA+AAK V LVIMSAG +DI+FAKN I ++LWVGYPGE GG
Sbjct: 492 VNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTISAVLWVGYPGEAGGD 551
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
AIA VIFG YNP GRLP TWY + K+ T M +RP + FPGR+Y+F+ G +Y F
Sbjct: 552 AIAQVIFGDYNPSGRLPETWYSQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKF 611
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
GYGLSY+ F V S+P + IK N + NK ++ I V C D K
Sbjct: 612 GYGLSYSAFSTFVLSAPSIIHIK---------TNPILNLNK--TTSIDISTVNCHDLKIR 660
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHI------KQVIGYERVFIAAGQSAKVGFTM 715
I V+N G+ GS VV+V+ KPP + T + Q++G+ERV + + KV
Sbjct: 661 IVIGVKNRGQRSGSHVVLVFWKPPKCSKTLVGAGVPQTQLVGFERVEVGRSMTEKVTVEF 720
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
+ CK+L +VD L +G HT+++G
Sbjct: 721 DVCKALSLVDTHGKRKLVTGHHTLVIG 747
>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
Length = 763
Score = 768 bits (1984), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/743 (50%), Positives = 507/743 (68%), Gaps = 34/743 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C +P ER KDL+ R+TL EKV +G+ A +PRLG+ YEWWSEALHGVS +G
Sbjct: 39 FCQLSVPITERVKDLIGRLTLVEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVG--- 95
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G GLT+WSP
Sbjct: 96 ---PGTKFGGVYPAATSFPQVITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSP 152
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N++RDPRWGR ETPGEDP V G+YA +YVRGLQ +D LK++ACCKH
Sbjct: 153 NVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQG---------NDRSRLKVAACCKH 203
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV EG+V+S+MCSYN VNG+PTCA
Sbjct: 204 FTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNEVNGVPTCA 263
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL +TIR +W +GYIVSDCDS+ + ++ + T E+A A +KAGLDLDCG +
Sbjct: 264 DPNLLKKTIRNEWGLNGYIVSDCDSVGVLYDTQHY-TGTPEEAAADSIKAGLDLDCGPFL 322
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIEL 370
T+ AV++ + E+D+D +L V MRLG FDG + Y +LG ++C P H L
Sbjct: 323 GAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGL 382
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAA+QGIVLLKN +LPL++ +T+A++GP+++AT AMIGNY G C YTSP+ G
Sbjct: 383 ALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVAMIGNYAGIACGYTSPVQGI 442
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y++ + + GC D+ C ++ + AA++AA+ ADATV+V GLD S+EAE KDR LLLP
Sbjct: 443 TGYARTV-HQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLP 501
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q ELI++VA AAKGPV LV+MS G +DI+FA+ + KI +I+W GYPG+EGG AIAD++
Sbjct: 502 GKQQELISRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADIL 561
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSY 607
FG NPGG+LP+TWY +Y+ +P T M +RP+++ PGRTY+F+DGPVVYPFG+GLSY
Sbjct: 562 FGSANPGGKLPMTWYPQDYLTNLPMTEMSMRPIHSKRIPGRTYRFYDGPVVYPFGHGLSY 621
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T+F + +A +PK + I + R N TV ++ + +C ++V
Sbjct: 622 TRFTHSIADAPKVIPIAV------RGRNGTVSGK-----SIRVTHARCNRLSLGVHVDVT 670
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N+G DG+ ++V+S PPG KQ++ +ERV +A G+ +V ++ CK L +VD A
Sbjct: 671 NVGSRDGTHTMLVFSAPPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRA 730
Query: 728 ANSLLASGAHTILVGEGVGGVSF 750
N + G H I +G+ VS
Sbjct: 731 GNRRIPIGDHGIHIGDESHTVSL 753
>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 768 bits (1984), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/740 (51%), Positives = 505/740 (68%), Gaps = 30/740 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
+S + +CD L +R DLV+R+TL EK+ +G+ A V RLG+P YEWWSEALHGVS
Sbjct: 49 VSSYGFCDNSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSN 108
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
IG PGTHF S VPGAT+FP ILT ASFN SL++ IG VS EARAMYN+G AGL
Sbjct: 109 IG------PGTHFSSLVPGATNFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLAGL 162
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPNIN+ RDPRWGR ETPGEDP + +YA YV+GLQ + D DS LK++
Sbjct: 163 TYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQQTD------DGDSDKLKVA 216
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G R+ FD+ V++QD+ +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNG 276
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL IRG W +GYIVSDCDS++ + + + T E+A A+ + +GLDLD
Sbjct: 277 KPTCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLYKDQHY-TKTPEEAAAKTILSGLDLD 335
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y +T GAV+QG + EA I ++ + LMRLG+FDG P Y NLG ++C P
Sbjct: 336 CGSYLGQYTGGAVKQGLVDEASITNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTP 395
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
++ ELA EAARQGIVLLKN +LPL++ IK+LA++GP+ANAT+ MIGNYEG PC+YTS
Sbjct: 396 ENQELAREAARQGIVLLKNSPRSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTS 455
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A+ +YAPGC D+ C N + I A A +ADAT+IV G +L++EAE DRV
Sbjct: 456 PLQGLTAFVPT-SYAPGCPDVQCAN-AQIDDAAKIAASADATIIVVGANLAIEAESLDRV 513
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
++LLPG Q +L+N+VA+ +KGPV LVIMS G +D++FAK N KI SILWVGYPGE GG A
Sbjct: 514 NILLPGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAA 573
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
IADVIFG YNP GRLP+TWY +YV K+P T+M +R P +PGRTY+F+ G V+ FG
Sbjct: 574 IADVIFGSYNPSGRLPMTWYPQSYVEKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFG 633
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
G+S+ ++K+ +P+ V + L +D +CR + C ++ + D C++ F
Sbjct: 634 DGMSFGTVEHKIVKAPQLVSVPLAEDHECRSLE---------CKSLDVADKHCQNLAFDI 684
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V+NMGKM S V+++ PP + K ++G+E+V +A V F ++ C L
Sbjct: 685 HLSVKNMGKMSSSHSVLLFFTPPNVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLS 744
Query: 723 IVDNAANSLLASGAHTILVG 742
+VD N + G H + VG
Sbjct: 745 VVDELGNRKVPLGDHMLHVG 764
>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/754 (50%), Positives = 509/754 (67%), Gaps = 34/754 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +CD L +R DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS
Sbjct: 51 LAGYGFCDKSLSVEDRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSN 110
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGTHF S VPGATSFP ILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 111 VG------PGTHFSSLVPGATSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVGLAGL 164
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPNIN+ RDPRWGR ETPGEDP + +YA YV+GLQ + D DS LK++
Sbjct: 165 TYWSPNINIFRDPRWGRGQETPGEDPLLSSKYATGYVKGLQQTD------DGDSNKLKVA 218
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDLDNW+G R+ F++ VT+QDM +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 219 ACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNG 278
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL IRG+W +GYIVSDCDS++ + + + T E+A A+ + AGLDL+
Sbjct: 279 KPTCADPDLLKGIIRGEWKLNGYIVSDCDSVEVLFKDQHY-TKTPEEAAAQTILAGLDLN 337
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG+Y +T GAV+QG + EA I+ ++ + LMRLG+FDG P Y NLG ++C
Sbjct: 338 CGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTS 397
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
++ ELA EAARQGIVLLKN G+LPLN IK+LA++GP+ANAT+ MIGNYEG PC Y S
Sbjct: 398 ENRELAREAARQGIVLLKNSPGSLPLNAKTIKSLAVIGPNANATRVMIGNYEGIPCNYIS 457
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ A +YA GC ++ C N+ + A A +ADATVI+ G L++EAE DR+
Sbjct: 458 PLQTLTALVPT-SYAAGCPNVQCA-NAELDDATQIAASADATVIIVGASLAIEAESLDRI 515
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
++LLPG Q L+++VA+A+KGPV LVIMS G +D++FAK+N KI SILWVGYPGE GG A
Sbjct: 516 NILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKSNDKITSILWVGYPGEAGGAA 575
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
IADVIFG YNP GRLP+TWY YV K+P T+M +R P +PGRTY+F+ G V+ FG
Sbjct: 576 IADVIFGFYNPSGRLPMTWYPQAYVNKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFG 635
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
G+S++ ++K+ +P+ V + L +D +CR C ++ I D C++ F
Sbjct: 636 DGISFSSIEHKIVKAPQLVSVPLAEDHECR---------SSECMSLDIADEHCQNLAFDI 686
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V+N GKM S VV+++ PP + K ++G+E+V + A+V F ++ CK L
Sbjct: 687 HLGVKNTGKMSTSHVVLLFFTPPDVHNAPQKHLLGFEKVHLPGKSEAQVRFKVDVCKDLS 746
Query: 723 IVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
+VD N + G H + VG + PL L +
Sbjct: 747 VVDELGNRKVPLGQHLL----HVGNLKHPLSLRV 776
>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
Length = 925
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/746 (52%), Positives = 502/746 (67%), Gaps = 24/746 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S FP+C+ LPY +RA DLV R+TL EK +Q+ + A G+ RLG+P YEWWSEALHGVS
Sbjct: 37 SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVS-- 94
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
NS G HF +P T FP VIL+ ASFNESLW +GQ VSTE RAMYN+G AGLT
Sbjct: 95 ----NSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLT 150
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
+WSPN+N+ RDPRWGR ETPGEDP VV RYA+NYVRGLQ+V G E + +D LK+S+
Sbjct: 151 YWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEGNFAADR--LKVSS 207
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKHY AYD+D W+G DRFHFD++VT QD+++T+ PF+ CV EG VSSVMCSYNRVNG+
Sbjct: 208 CCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKXCVEEGHVSSVMCSYNRVNGV 267
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCA+P+LL IR W GYIVSDCDSI E + +T EDAVA LKAGL+L+C
Sbjct: 268 PTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNC 326
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G Y ++T AV GK+ E+ +B +L + YIVLMRLG+FDG P + +G +++C
Sbjct: 327 GSYLGDYTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVD 386
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA +AA+QGIVLL N NGALPL+ KTLA++GP+A+AT M+ NY G PCRYTSP
Sbjct: 387 HQLLALDAAKQGIVLLHN-NGALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSP 445
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G Y ++Y GCA++ C ++I A A ADATV+V GLDL +EAE DRV+
Sbjct: 446 LQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVN 505
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
L LPGFQ +L+ + A AA G V LV+MSAG VDI+F KN KI ILWVGYPG+ GG AI
Sbjct: 506 LTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAI 565
Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGY 603
+ VIFG YNPGGR P TWY YV ++P T M +RP NFPGRTY+F+ G +Y FG+
Sbjct: 566 SQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATXNFPGRTYRFYTGKSLYQFGH 625
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDI---NY-TVGTNKPPCAAVLIDDVKCKDYK 659
GLSY+ F + S+P +V + L +I NY T+ A+ I + C++
Sbjct: 626 GLSYSTFYKFIKSAPXTVLVHLLPQMDMPNIFSSNYPTMPNPNTNGQAIDISAIDCRNLS 685
Query: 660 -FTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
I V+N G++DG+ VV+ + KPP G+ G +++G+ERV + G++ VG ++
Sbjct: 686 NIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLD 745
Query: 717 ACKSLKIVDNAANSLLASGAHTILVG 742
C + VD L G HT++VG
Sbjct: 746 VCGKISNVDEEGKRKLVMGMHTLVVG 771
>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 818
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/747 (51%), Positives = 503/747 (67%), Gaps = 24/747 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S FP+C+ LPY +RA DLV R+TL EK +Q+ + A G+ RLG+P YEWWSEALHGVS
Sbjct: 61 SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVS-- 118
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
NS G HF +P T FP VIL+ ASFNESLW +GQ VSTE RAMYN+G AGLT
Sbjct: 119 ----NSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLT 174
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
+WSPN+N+ RDPRWGR ETPGEDP VV RYA+NYVRGLQ+V G E + +D LK+S+
Sbjct: 175 YWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEGNFAADR--LKVSS 231
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKHY AYD+D W+G DRFHFD++VT QD+++T+ PF+ CV EG VSSVMCSYNRVNG+
Sbjct: 232 CCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGV 291
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCA+P+LL IR W GYIVSDCDSI E + +T EDAVA LKAGL+L+C
Sbjct: 292 PTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNC 350
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G Y ++T AV GK+ E+ ++ +L + YIVLMRLG+FDG P + +G +++C
Sbjct: 351 GSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVD 410
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA +AA+QGIVLL N NGALPL+ KTLA++GP+A+AT M+ NY G PCRYTSP
Sbjct: 411 HQLLALDAAKQGIVLLHN-NGALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSP 469
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G Y ++Y GCA++ C ++I A A ADATV+V GLDL +EAE DRV+
Sbjct: 470 LQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVN 529
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
L LPGFQ +L+ + A AA G V LV+MSAG VDI+F KN KI ILWVGYPG+ GG AI
Sbjct: 530 LTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAI 589
Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGY 603
+ VIFG YNPGGR P TWY YV ++P T M +RP +NFPGRTY+F+ G +Y FG+
Sbjct: 590 SQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGRTYRFYTGKSLYQFGH 649
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDI---NY-TVGTNKPPCAAVLIDDVKCKDYK 659
GLSY+ F + S+P +V + L +I NY T+ A+ I + C++
Sbjct: 650 GLSYSTFYKFIKSAPTTVLVHLLPQMDMPNIFSSNYPTMPNPNTNGQAIDISAIDCRNLS 709
Query: 660 -FTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
I V+N G++DG+ VV+ + KPP G+ G +++G+ERV + G++ VG ++
Sbjct: 710 NIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLD 769
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGE 743
C + VD L G HT++VG
Sbjct: 770 VCGKISNVDEEGKRKLVMGMHTLVVGS 796
>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 772
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/741 (50%), Positives = 495/741 (66%), Gaps = 32/741 (4%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ P+C A L R KDL+ R+TL EKV + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 39 NLPFCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEWWSEALHGVSNVG 98
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PGT F + P ATSFP VI T ASFN SLW+ IG+ S EARAMYN G AGLT+
Sbjct: 99 ------PGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGGTAGLTY 152
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP + G+YA +YVRGLQ +G LK++A
Sbjct: 153 WSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDGNR---------LKVAAS 203
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV EG V+SVMCSYN+VNG+P
Sbjct: 204 CKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVNGVP 263
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
TCADP LL +T+RG W +GYIVSDCDS+ S + T E+A A +KAGLDLDCG
Sbjct: 264 TCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHY-TSTPEEAAADAIKAGLDLDCG 322
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
+ T AV++G I+EAD++ +L V MRLG +DG P Y NLG ++C H
Sbjct: 323 PFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMYDGEPSSHPYNNLGPRDVCTQSH 382
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
ELA EAARQGIVLLKN +LPL+T +T+A++GP++N T MIGNY G C YTSP+
Sbjct: 383 QELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPNSNVTFTMIGNYAGIACGYTSPL 442
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y+K I Y GCA++ C ++ AI+AA+ ADATV+V GLD S+EAE DR L
Sbjct: 443 QGIGTYTKTI-YEHGCANVACTDDKQFGRAINAAQQADATVLVMGLDQSIEAETVDRASL 501
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q +L++KVA A+KGP LVIMS G VDI FAKN+P+I+ ILW GYPG+ GG AIA
Sbjct: 502 LLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNDPRIQGILWAGYPGQAGGAAIA 561
Query: 548 DVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYG 604
D++FG NPGG+LP+TWY Y+K +P T+M +R + +PGRTY+F++GPVVYPFGYG
Sbjct: 562 DILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRTYRFYNGPVVYPFGYG 621
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSYT F + + S+PK V I +D + N NK A+ + +C +
Sbjct: 622 LSYTHFVHTLTSAPKLVSIPVDGHRHGNSSNI---ANK----AIKVTHARCGKLSINLHV 674
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+V+N+G DG ++V+S PP G KQ++ +E+V I A +V ++ CK L
Sbjct: 675 DVKNVGSKDGIHTLLVFSAPPAGNGHWAPHKQLVAFEKVHIPAKAQQRVRVKIHVCKLLS 734
Query: 723 IVDNAANSLLASGAHTILVGE 743
+VD + + G H++ +G+
Sbjct: 735 VVDRSGTRRIPMGLHSLHIGD 755
>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 765 bits (1976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/741 (50%), Positives = 500/741 (67%), Gaps = 32/741 (4%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ P+C A L R KDL+ R+TL EKV + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 38 NLPFCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEWWSEALHGVSNVG 97
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PGT F + P ATSFP VI T ASFN SLW+ IG+ S EARAMYN G AGLT+
Sbjct: 98 ------PGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGGTAGLTY 151
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP + G+YA +YVRGLQ+ +G LK++A
Sbjct: 152 WSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQETDGNR---------LKVAAS 202
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV EG V+SVMCSYN+VNG+P
Sbjct: 203 CKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQVNGVP 262
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
TCADP LL +T+RG W +GYIVSDCDS+ S + T E+A A +KAGLDLDCG
Sbjct: 263 TCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHY-TSTPEEAAADAIKAGLDLDCG 321
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
+ T AV++G I+E D++ +L V MRLG +DG P Y LG ++C P H
Sbjct: 322 PFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMYDGEPSSHPYGKLGPRDVCTPSH 381
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
ELA EAARQGIVLLKN +LPL+T T+A++GP++N T MIGNY G C YTSP+
Sbjct: 382 QELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPNSNVTVTMIGNYAGIACGYTSPL 441
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
+G Y+K I + GCA++ C N+ AI+ A+ ADATV+V GLD S+EAE DR L
Sbjct: 442 EGIGRYTKTI-HELGCANVACTNDKQFGRAINVAQQADATVLVMGLDQSIEAETVDRAGL 500
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q +L++KVA A+KGP LVIMS G VDI FAKNNP+I++ILW GYPG+ GG AIA
Sbjct: 501 LLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNNPRIQAILWAGYPGQAGGAAIA 560
Query: 548 DVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYG 604
D++FG NPGG+LP+TWY Y+K +P T+M +R + +PGRTY+F++GPVVYPFGYG
Sbjct: 561 DILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRTYRFYNGPVVYPFGYG 620
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSYT F + +AS+PK V I +D R N + NK A+ + +C + Q+
Sbjct: 621 LSYTHFVHTLASAPKLVSIPVDGH---RHGNSSSIANK----AIKVTHARCGKLSISLQV 673
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+V+N+G DG+ ++V+S PP G KQ++ ++++ I + +V ++ CK L
Sbjct: 674 DVKNVGSKDGTHTLLVFSAPPAGNGHWAPHKQLVAFQKLHIPSKAQQRVNVNIHVCKLLS 733
Query: 723 IVDNAANSLLASGAHTILVGE 743
+VD + + G H++ +G+
Sbjct: 734 VVDRSGTRRVPMGLHSLHIGD 754
>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
Length = 774
Score = 765 bits (1976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/748 (50%), Positives = 506/748 (67%), Gaps = 29/748 (3%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
+FP+C LP +R +DL+ R+TL EKV+ +G+ A VPRLG+ YEWWSEALHGVS
Sbjct: 40 FRNFPFCQTNLPIGDRVRDLIGRLTLQEKVKLLGNNAAAVPRLGIKGYEWWSEALHGVSN 99
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F E PGATSFP VI T ASFN SLW++IG+ VS EARAMYN GL
Sbjct: 100 VG------PGTKFGGEFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGEMGGL 153
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP V YA YVRGLQ E D DS LK++
Sbjct: 154 TYWSPNVNIFRDPRWGRGQETPGEDPVVAALYAERYVRGLQGNE------DGDS--LKVA 205
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDLDNW G DRFHF+++VT+QD+++TF +PF CV +G V+S+MCSYN+VNG
Sbjct: 206 ACCKHYTAYDLDNWGGVDRFHFNAKVTKQDIEDTFDVPFRSCVKQGKVASIMCSYNQVNG 265
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
IPTCADP+LL +TIRG W +GYIVSDCDS+ ++ + T E+A A +KAGLDLD
Sbjct: 266 IPTCADPQLLRKTIRGGWGLNGYIVSDCDSVGVFYDTQHY-TSTPEEAAAAAIKAGLDLD 324
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNP 365
CG + + T AV G + EA IDT+L V MRLG FDG P QY +LG ++C+P
Sbjct: 325 CGPFLSQHTENAVHIGILKEAAIDTNLANTVAVQMRLGMFDGEPSAQQYGHLGPRDVCSP 384
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
H ELA EAARQGIVLLKN ALPL+ +T+A++GP+++ T MIGNY G C YTS
Sbjct: 385 AHQELAVEAARQGIVLLKNHGPALPLSPRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTS 444
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G Y+K I + GC D+ C ++ + A++AA+ ADATV+V GLD S+EAE +DR
Sbjct: 445 PLQGISKYAKTI-HEKGCGDVACSDDKLFAGAVNAARQADATVLVMGLDQSIEAEFRDRT 503
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPGFQ ELI++V+ A++GPV LV+MS G VD+ FA N+P+I +I+W GYPG+ GG A
Sbjct: 504 GLLLPGFQQELISEVSKASRGPVVLVLMSGGPVDVTFANNDPRIGAIVWAGYPGQGGGAA 563
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
IADV+FG +NPGG+LP+TWY Y+ +P T+M +R +PGRTY+F+ GP+VYPFG
Sbjct: 564 IADVLFGAHNPGGKLPMTWYPQEYLNNLPMTTMDMRSNLAKGYPGRTYRFYKGPLVYPFG 623
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
+GLSYT+F + +PK++ I +D N + +NK ++ + KC
Sbjct: 624 HGLSYTKFITTIFEAPKTLAIPIDGRHT---YNSSTISNK----SIRVTHAKCSKISVQI 676
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
++V+N+G DGS ++V+SKPP KQ++ +++V++ A +V ++ CK L
Sbjct: 677 HVDVKNVGPKDGSHTLLVFSKPPVDIWVPHKQLVAFQKVYVPARSKQRVAINIHVCKYLS 736
Query: 723 IVDNAANSLLASGAHTILVGEGVGGVSF 750
+VD A + G H+I +G+ +S
Sbjct: 737 VVDRAGVRRIPIGEHSIHIGDAKHSLSL 764
>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
Length = 774
Score = 765 bits (1976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/740 (51%), Positives = 498/740 (67%), Gaps = 30/740 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L F +C+ L R DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS+
Sbjct: 49 LGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSY 108
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGTHF+S VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 109 VG------PGTHFNSVVPGATSFPQVILTAASFNASLFEAIGKAVSTEARAMYNVGLAGL 162
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQ + D LK++
Sbjct: 163 TFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSD------DGSPDRLKVA 216
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDLDNW+G DRFHF++ VT+QDM +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNG 276
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
P CADP LL+ +RG+W +GYIVSDCDS+ S + T E+A A+ + AGLDL+
Sbjct: 277 KPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLN 335
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T AV+ G + E+ +D ++ + LMRLG+FDG+P Y LG ++C
Sbjct: 336 CGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTS 395
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H ELA EAARQGIVLLKN G+LPL+ IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 396 EHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 455
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A Y PGC+++ C + I A A ADATV++ G+D S+EAEG+DRV
Sbjct: 456 PLQGLTALVAT-TYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDRV 513
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
++ LPG Q LI +VA A+KG V LV+MS G DI+FAKN+ KI SILWVGYPGE GG A
Sbjct: 514 NIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSILWVGYPGEAGGAA 573
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
IADVIFG YNP GRLP+TWY +YV K+P T+M +R P + +PGRTY+F+ G +Y FG
Sbjct: 574 IADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFG 633
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GLSYTQF + + +PKSV I +++ C + C +V C++ F
Sbjct: 634 DGLSYTQFNHHLVQAPKSVSIPIEEGHSC---------HSSKCKSVDAVQESCQNLVFDI 684
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V N G + GS V ++S PP + + K ++G+E+VF+ A A V F ++ CK L
Sbjct: 685 HLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAKALVRFKVDVCKDLS 744
Query: 723 IVDNAANSLLASGAHTILVG 742
IVD +A G H + VG
Sbjct: 745 IVDELGTRKVALGLHVLHVG 764
>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/745 (50%), Positives = 505/745 (67%), Gaps = 38/745 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+++ +C+ KL R KDLV R+TL EKV + + A V RLG+P YEWWSEALHGVS
Sbjct: 49 LANYGFCNKKLSVDARVKDLVRRLTLQEKVGNLVNSAVDVSRLGIPKYEWWSEALHGVSN 108
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
IG PGTHF + +PGATSFP IL ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 109 IG------PGTHFSNVIPGATSFPMPILIAASFNASLFQTIGKVVSTEARAMHNVGLAGL 162
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPNIN+ RDPRWGR ETPGEDP + +YA YV+GLQ + D DS LK++
Sbjct: 163 TYWSPNINIFRDPRWGRGQETPGEDPLLASKYAAGYVKGLQQTD------DGDSNKLKVA 216
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+D+W+G R+ F++ VT+QD+ +T+ PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDVDDWKGVQRYTFNAVVTQQDLDDTYQPPFKSCVIDGNVASVMCSYNQVNG 276
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL IRG W +GYIVSDCDS+ + ++ + T E+A A+ + AGLDL+
Sbjct: 277 KPTCADPDLLKGVIRGKWKLNGYIVSDCDSVDVLFKNQHY-TKTPEEAAAKSILAGLDLN 335
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + +T GAV+QG I EA I+ ++ + LMRLG+FDG P Y NLG ++C
Sbjct: 336 CGSFLGRYTEGAVKQGLIGEASINNAVYNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTS 395
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA EAARQGIVLLKN G+LPLN IK+LA++GP+ANAT+AMIGNYEG PC+YTS
Sbjct: 396 ANQELAREAARQGIVLLKNCAGSLPLNAKAIKSLAVIGPNANATRAMIGNYEGIPCKYTS 455
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAK----NADATVIVAGLDLSVEAEG 481
P+ G A ++A GC D+ C N AA+D AK +ADATVIV G +L++EAE
Sbjct: 456 PLQGLTALVPT-SFAAGCPDVQCTN-----AALDDAKKIAASADATVIVVGANLAIEAES 509
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR+++LLPG Q +L+ +VA+ AKGPV L IMS G +D++FAK N KI SILWVGYPGE
Sbjct: 510 HDRINILLPGQQQQLVTEVANVAKGPVILAIMSGGGMDVSFAKTNKKITSILWVGYPGEA 569
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
GG AIADVIFG +NP GRLP+TWY +YV K+P T+M +R P +PGRTY+F+ G V
Sbjct: 570 GGAAIADVIFGYHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPATGYPGRTYRFYKGETV 629
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
+ FG G+SY+ F++K+ +P+ V + L +D CR C ++ + C++
Sbjct: 630 FSFGDGISYSTFEHKLVKAPQLVSVPLAEDHVCRS---------SKCKSLDVVGEHCQNL 680
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
F + ++N GKM S+ V ++S PP + K ++ +E+V + A V F ++ C
Sbjct: 681 AFDIHLRIKNKGKMSSSQTVFLFSTPPAVHNAPQKHLLAFEKVLLTGKSEALVSFKVDVC 740
Query: 719 KSLKIVDNAANSLLASGAHTILVGE 743
K L +VD N +A G H + VG+
Sbjct: 741 KDLGLVDELGNRKVALGKHMLHVGD 765
>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 763 bits (1970), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/741 (51%), Positives = 497/741 (67%), Gaps = 30/741 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L F +C+ L R DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS+
Sbjct: 49 LGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSY 108
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGTHF+S VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 109 VG------PGTHFNSIVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGL 162
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQ + D LK++
Sbjct: 163 TFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASAYVRGLQQGD------DGSPDRLKVA 216
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDLDNW+G DR HF++ VT+QDM +TF PF+ CV +G+V+SVMCS+N+VNG
Sbjct: 217 ACCKHYTAYDLDNWKGVDRLHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSFNQVNG 276
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL+ +RG+W +GYIVSDCDS+ S + T E+A A+ + AGLDL+
Sbjct: 277 KPTCADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLN 335
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T AV+ G + E+ +D ++ + LMRLG+FDG+P Y LG ++C
Sbjct: 336 CGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTS 395
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H E+A EAARQGIVLLKN G+LPL+ IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 396 EHQEMAREAARQGIVLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKYTT 455
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A Y PGC+++ C + I A A ADATV++ G+D S+EAEG+DRV
Sbjct: 456 PLQGLTALVAT-TYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDRV 513
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+ LPG Q LI +VA A+KG V LV+MS G DI+FAKN+ KI SILWVGYPGE GG A
Sbjct: 514 SIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKIASILWVGYPGEAGGAA 573
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
IADVIFG YNP GRLP+TWY +YV K+P T+M +R P + +PGRTY+F+ G +Y FG
Sbjct: 574 IADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFG 633
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GLSYTQF + + +PKSV I +++ C + C +V C++ F
Sbjct: 634 DGLSYTQFNHHLVQAPKSVSIPIEEGHSC---------HSSKCKSVDAVQESCQNLAFDI 684
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V N G + GS V ++S PP + + K ++G+E+VF+ A A V F ++ CK L
Sbjct: 685 HLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAEALVRFKVDVCKDLS 744
Query: 723 IVDNAANSLLASGAHTILVGE 743
IVD +A G H + VG
Sbjct: 745 IVDELGTQKVALGLHVLHVGS 765
>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 760 bits (1963), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/744 (51%), Positives = 500/744 (67%), Gaps = 32/744 (4%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
L+ F +C+ L +R DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS
Sbjct: 13 SLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVS 72
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
++G PGTHF S VPGATSFP VILT ASFN SL+ IG+ VSTEARAMYN+G AG
Sbjct: 73 YVG------PGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEARAMYNVGLAG 126
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
LTFWSPNIN+ RDPRWGR ETPGEDP + +Y YV+GLQ + D + LK+
Sbjct: 127 LTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRD------DGNPDGLKV 180
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
+ACCKHY AYDLDNW+G DR+HF++ VT+QDM +TF PF+ CV +G+V+SVMCSYN+VN
Sbjct: 181 AACCKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYNKVN 240
Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG--L 305
GIPTCADP LL+ IRG+W +GYIV+DCDSI S + T E+A A+ + AG L
Sbjct: 241 GIPTCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYNSQHY-TKTPEEAAAKAILAGIRL 299
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNI 362
DL+CG + T AV G + E+ ID ++ + LMRLG+FDG P Y LG ++
Sbjct: 300 DLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDV 359
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
C ++ ELA EAARQGIVLLKN G+LPL+ IK LA++GP+AN TK MIGNYEGTPC+
Sbjct: 360 CTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCK 419
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
YT+P+ G A Y PGC+++ C + + + A A ADATV+V G DLS+EAE +
Sbjct: 420 YTTPLQGLAALVAT-TYLPGCSNVAC-STAQVDDAKKIAAAADATVLVMGADLSIEAESR 477
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DRVD+LLPG Q LI VA+A+ GPV LVIMS G +D++FAK N KI SILWVGYPGE G
Sbjct: 478 DRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAG 537
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVY 599
G AIAD+IFG YNP GRLP+TWY +YV K+P T+M +R P N +PGRTY+F+ G VY
Sbjct: 538 GAAIADIIFGSYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVY 597
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
FG GLSY++F +++ +P V + L+++ C C +V + C++
Sbjct: 598 SFGDGLSYSEFSHELTQAPGLVSVPLEENHVCY---------SSECKSVAAAEQTCQNLT 648
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
F + ++N G GS V ++S PP + + K ++G+E+VF+ A + VGF ++ CK
Sbjct: 649 FDVHLRIKNTGTTSGSHTVFLFSTPPSVHNSPQKHLVGFEKVFLHAQTDSHVGFKVDVCK 708
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
L +VD + +A G H + +G
Sbjct: 709 DLSVVDELGSKKVALGEHVLHIGS 732
>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 765
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/741 (50%), Positives = 504/741 (68%), Gaps = 30/741 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
++ + +CD L R KDLV R+TL EK+ + + A V RLG+P YEWWSEALHGVS
Sbjct: 40 VAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIPKYEWWSEALHGVSN 99
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F + +PGATSFP ILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 100 VG------PGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAMYNVGLAGL 153
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPNIN+ RDPRWGR LETPGEDP + +YA YV+GLQ +G D LK++
Sbjct: 154 TYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDG------GDPNKLKVA 207
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G R+ F++ VT+QDM++TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 208 ACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQPPFKSCVIDGNVASVMCSYNKVNG 267
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL +RG+W +GYIVSDCDS++ + + + T E+A A + AGLDL+
Sbjct: 268 KPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYKDQHY-TKTPEEAAAISILAGLDLN 326
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + +T GAV+QG I EA I+ ++ + LMRLG+FDG P+ Y NLG ++C
Sbjct: 327 CGRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRLGFFDGDPRKQPYGNLGPKDVCTQ 386
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
++ ELA EAARQGIVLLKN +LPLN IK+LA++GP+ANAT+ MIGNYEG PC+Y S
Sbjct: 387 ENQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYIS 446
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A++ +YA GC D+ C N ++ A A +ADATVIV G L++EAE DRV
Sbjct: 447 PLQGLTAFAPT-SYAAGCLDVRCPN-PVLDDAKKIAASADATVIVVGASLAIEAESLDRV 504
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
++LLPG Q L+++VA+A+KGPV LVIMS G +D++FAKNN KI SILWVGYPGE GG A
Sbjct: 505 NILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKNNNKITSILWVGYPGEAGGAA 564
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
IADVIFG +NP GRLP+TWY +YV K+P T+M +R P +PGRTY+F+ G V+ FG
Sbjct: 565 IADVIFGFHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPATGYPGRTYRFYKGETVFAFG 624
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GLSY+ +K+ +P+ V ++L +D CR C ++ + C++ F
Sbjct: 625 DGLSYSSIVHKLVKAPQLVSVQLAEDHVCRS---------SECKSIDVVGEHCQNLVFDI 675
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ ++N GKM + V ++S PP + K ++G+E+V + A V F ++ CK L
Sbjct: 676 HLRIKNKGKMSSAHTVFLFSTPPAVHNAPQKHLLGFEKVHLIGKSEALVSFKVDVCKDLS 735
Query: 723 IVDNAANSLLASGAHTILVGE 743
IVD N +A G H + VG+
Sbjct: 736 IVDELGNRKVALGQHLLHVGD 756
>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 782
Score = 759 bits (1959), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/740 (51%), Positives = 501/740 (67%), Gaps = 30/740 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
+S F +CD+ L + R +DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS+
Sbjct: 57 VSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEWWSEALHGVSY 116
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F + VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 117 VG------PGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGL 170
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQ + D D LK++
Sbjct: 171 TYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRD------DGDPDRLKVA 224
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDLDNW+G DR+HF++ V+ QD+++TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 225 ACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMCSYNQVNG 284
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL IRG W +GYIVSDCDS+ + S + + E+A A+ + AGLDLD
Sbjct: 285 KPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHY-TKSPEEAAAKTILAGLDLD 343
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CGD+ T AV G + EA I ++ + LMRLG+FDG+P Y LG ++C P
Sbjct: 344 CGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGPKDVCTP 403
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H ELA EAARQGIVLLKN +LPL++ IK+LA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 404 EHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGTPCKYTT 463
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A ++ PGCA++ C ++ + A A +ADATV+V G D S+EAE +DRV
Sbjct: 464 PLQGLSAVVST-SFQPGCANVAC-TSAQLDEAKKIAASADATVLVVGSDQSIEAESRDRV 521
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL LPG Q LI +VA A+KGPV LVIM+ G +DI FAK + KI SILWVG+PGE GG A
Sbjct: 522 DLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEAGGAA 581
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
IADVIFG +NP GRLP+TWY +YV K+P T M +RP N FPGRTY+F+ G +Y FG
Sbjct: 582 IADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRTYRFYTGETIYSFG 641
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GLSY+ FK+ + +PK V I L++ C + C ++ + C++ F
Sbjct: 642 DGLSYSDFKHHLVKAPKLVSIPLEEGHIC---------HSSKCHSLEVVQESCQNLGFDV 692
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V+N+G+ GS V +YS PP + + K ++G+E+V + G V F ++ CK L
Sbjct: 693 HLRVKNVGQRSGSHTVFLYSTPPSVHNSPQKHLLGFEKVSLGRGGETVVRFKVDVCKDLS 752
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + +A G H + VG
Sbjct: 753 VADEVGSRKVALGLHILHVG 772
>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
Length = 777
Score = 759 bits (1959), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/740 (51%), Positives = 497/740 (67%), Gaps = 29/740 (3%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ F +C+ L +R DLV R+TL EK+ + + A V RLG+P YEWWSEALHGVS+
Sbjct: 51 LASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVNSAGSVSRLGIPKYEWWSEALHGVSY 110
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGTHF + VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 111 VG------PGTHFSNIVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGL 164
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + +Y YVRGLQ + + DS LK++
Sbjct: 165 TFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQQTD------NGDSERLKVA 218
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDLDNW+G DR+HF++ VT+QD+ +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 219 ACCKHYTAYDLDNWKGTDRYHFNAVVTKQDLDDTFQPPFKSCVIDGNVASVMCSYNQVNG 278
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL IRG+W +GYIVSDCDS+ I S + T E+A A + AGLDL+
Sbjct: 279 KPTCADPDLLAGIIRGEWKLNGYIVSDCDSVDVIYNSQHY-TKTPEEAAAITILAGLDLN 337
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T AV G + + +D ++ + LMRLG+FDG P Y LG ++C
Sbjct: 338 CGSFLGKHTEAAVNAGLLNVSAVDKAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTA 397
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA EAARQGIVLLKN G+LPL+ IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 398 VNQELAREAARQGIVLLKNSPGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 457
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A S Y GC+++ C + + A A +ADATV+V G D S+EAE +DRV
Sbjct: 458 PLQGLTA-SVATTYLAGCSNVACA-AAQVDDAKKLAASADATVLVMGADQSIEAESRDRV 515
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
D+LLPG Q LI +VA+ +KGPV LVIMS G +D++FAK N KI SILWVGYPGE GG A
Sbjct: 516 DVLLPGQQQLLITQVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAA 575
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
IADVIFG YNP GRLP+TWY YV K+P T+M +R P + +PGRTY+F+ G VY FG
Sbjct: 576 IADVIFGYYNPSGRLPMTWYPQAYVDKVPMTNMNMRPDPSSGYPGRTYRFYTGETVYSFG 635
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GLSY+++K+++ +P+ V I L+ D CR + C +V + C+ F
Sbjct: 636 DGLSYSEYKHQLVQAPQLVSIPLEDDHVCR--------SSSKCISVDAGEQNCQGLAFNI 687
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
++V N+GK+ G+ V ++ PP + + K ++ +E+V + A V F ++ CK L
Sbjct: 688 DLKVRNIGKVRGTHTVFLFFTPPSVHNSPQKHLVDFEKVSLDAKTYGMVSFKVDVCKHLS 747
Query: 723 IVDNAANSLLASGAHTILVG 742
+VD + +A G H + VG
Sbjct: 748 VVDEFGSRKVALGGHVLHVG 767
>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 758 bits (1957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/745 (50%), Positives = 504/745 (67%), Gaps = 32/745 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C L ER KDL+ R+TL EKV+ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 41 FCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNLG--- 97
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
P F+++ P ATSFP VI T ASFN SLW+ IGQ VS EARAMYN G AGLT+WSP
Sbjct: 98 ---PAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSP 154
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N+ RDPRWGR ETPGEDP + G YA YVRGLQ G +R LK++ACCKH
Sbjct: 155 NVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQ---GTHANR------LKVAACCKH 205
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+ AYDLDNW G DRFHF+++V++QD+++TF +PF+MCV+EG V+SVMCSYN+VNG+PTCA
Sbjct: 206 FTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPFKMCVSEGKVASVMCSYNQVNGVPTCA 265
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL +T+RG W GYIVSDCDS+ ++ + T E+A A +KAGLDLDCG +
Sbjct: 266 DPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHY-TPTPEEAAADAIKAGLDLDCGPFL 324
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIEL 370
T AV++G ++EAD++ +L V MRLG FDG P Y +LG ++C P H EL
Sbjct: 325 AVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMFDGEPTAHPYGHLGPKDVCKPAHQEL 384
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAARQGIVLLKN LPL++ +T+A++GP++ AT MIGNY G C YT+P+ G
Sbjct: 385 ALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPNSKATITMIGNYAGVACGYTNPLQGI 444
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y++ ++ GC ++ C+N+ + AI+AA+ ADATV+V GLD S+EAE DR LLLP
Sbjct: 445 GRYARTVHQL-GCQNVACKNDKLFGPAINAARQADATVLVMGLDQSIEAETVDRTGLLLP 503
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q +L++KVA A+KGP LV+MS G VDI FAKNNP+I ILW GYPG+ GG AIAD++
Sbjct: 504 GRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKNNPRIVGILWAGYPGQAGGAAIADIL 563
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSY 607
FG NPGG+LP+TWY Y+ K+P T+M +R + +PGRTY+F++GPVVYPFG+GL+Y
Sbjct: 564 FGTANPGGKLPVTWYPEEYLTKLPMTNMAMRATKSAGYPGRTYRFYNGPVVYPFGHGLTY 623
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T F + +AS+P V + L+ R N T +N+ A+ + +C T Q++++
Sbjct: 624 THFVHTLASAPTVVSVPLNGH---RRANVTNISNR----AIRVTHARCDKLSITLQVDIK 676
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
N+G DG+ ++V+S PP G KQ++ +E+V + A +VG ++ CK L +VD
Sbjct: 677 NVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKVHVPAKGQHRVGVNIHVCKLLSVVD 736
Query: 726 NAANSLLASGAHTILVGEGVGGVSF 750
+ + G H+ +G+ VS
Sbjct: 737 RSGIRRIPLGEHSFNIGDVKHSVSL 761
>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 809
Score = 758 bits (1956), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/740 (51%), Positives = 501/740 (67%), Gaps = 30/740 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
+S F +CD+ L + R +DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS+
Sbjct: 84 VSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEWWSEALHGVSY 143
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F + VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 144 VG------PGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGL 197
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQ + D D LK++
Sbjct: 198 TYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRD------DGDPDRLKVA 251
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDLDNW+G DR+HF++ V+ QD+++TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 252 ACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGNVASVMCSYNQVNG 311
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL IRG W +GYIVSDCDS+ + S + + E+A A+ + AGLDLD
Sbjct: 312 KPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHY-TKSPEEAAAKTILAGLDLD 370
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CGD+ T AV G + EA I ++ + LMRLG+FDG+P Y LG ++C P
Sbjct: 371 CGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQLYGKLGPKDVCTP 430
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H ELA EAARQGIVLLKN +LPL++ IK+LA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 431 EHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMIGNYEGTPCKYTT 490
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A ++ PGCA++ C ++ + A A +ADATV+V G D S+EAE +DRV
Sbjct: 491 PLQGLSAVVST-SFQPGCANVAC-TSAQLDEAKKIAASADATVLVVGSDQSIEAESRDRV 548
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL LPG Q LI +VA A+KGPV LVIM+ G +DI FAK + KI SILWVG+PGE GG A
Sbjct: 549 DLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILWVGFPGEAGGAA 608
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
IADVIFG +NP GRLP+TWY +YV K+P T M +RP N FPGRTY+F+ G +Y FG
Sbjct: 609 IADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRTYRFYTGETIYSFG 668
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GLSY+ FK+ + +PK V I L++ C + C ++ + C++ F
Sbjct: 669 DGLSYSDFKHHLVKAPKLVSIPLEEGHIC---------HSSKCHSLEVVQESCQNLGFDV 719
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V+N+G+ GS V +YS PP + + K ++G+E+V + G V F ++ CK L
Sbjct: 720 HLRVKNVGQRSGSHTVFLYSTPPSVHNSPQKHLLGFEKVSLGRGGETVVRFKVDVCKDLS 779
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + +A G H + VG
Sbjct: 780 VADEVGSRKVALGLHILHVG 799
>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
Length = 768
Score = 757 bits (1954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/749 (49%), Positives = 504/749 (67%), Gaps = 33/749 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C LP R +DL+ R+TL EK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 42 FCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVG--- 98
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F PGAT+FP VI T ASFNESLW++IG+ VS EARAMYN G AGLT+WSP
Sbjct: 99 ---PGTKFGGAFPGATAFPQVITTAASFNESLWEEIGRVVSDEARAMYNGGMAGLTYWSP 155
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+NV RDPRWGR ETPGEDP V G+YA +YVRGLQ G+ LK++ACCKH
Sbjct: 156 NVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNNGLR---------LKVAACCKH 206
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW G DR+HF++RV++QD+++T+ +PF+ CV G V+SVMCSYN+VNG PTCA
Sbjct: 207 YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKSCVVAGKVASVMCSYNQVNGKPTCA 266
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL TIRG+W +GYIVSDCDS+ + ++ + T E+A A ++AGLDLDCG +
Sbjct: 267 DPYLLKNTIRGEWGLNGYIVSDCDSVGVLFDTQHY-TATPEEAAASTIRAGLDLDCGPFL 325
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIEL 370
T AV+ G + E D++ +L V MRLG FDG P + NLG ++C P H +L
Sbjct: 326 AIHTENAVKGGLLKEEDVNMALANTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 385
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A +AARQGIVLL+N LPL+ ++T+A++GP+++ T MIGNY G C YT+P+ G
Sbjct: 386 ALQAARQGIVLLQNRGRTLPLSR-TLQTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 444
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y+K +++ PGC D+ C N AA AA++ADAT++V GLD S+EAE +DR LLLP
Sbjct: 445 RRYAKTVHH-PGCNDVFCNGNQQFNAAEVAARHADATILVMGLDQSIEAEFRDRKGLLLP 503
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G+Q EL++ VA A++GP LV+MS G +D++FAKN+P+I +ILWVGYPG+ GG AIADV+
Sbjct: 504 GYQQELVSIVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 563
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
FG NPGG+LP+TWY NY+ K+P T+M +R P +PGRTY+F+ GPVV+PFG+G+SY
Sbjct: 564 FGTANPGGKLPMTWYPHNYLAKVPMTNMGMRADPSRGYPGRTYRFYKGPVVFPFGHGMSY 623
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T F + + +P+ V + L R N T +N A+ + C+ I+V+
Sbjct: 624 TTFAHSLVQAPREVSVPLASLHVSR--NTTGASN-----AIRVSHANCEALALGVHIDVK 676
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N G MDG+ ++V+S PPG + KQ+IG+E+V + G +V ++ CK L +VD
Sbjct: 677 NTGDMDGTHTLLVFSSPPGGKWSTQKQLIGFEKVHLVTGSQKRVKIDIHVCKHLSVVDRF 736
Query: 728 ANSLLASGAHTILVGEGVGGVSFPLQLNL 756
+ G H + +G+ +S LQ NL
Sbjct: 737 GIRRIPIGEHDLYIGDLKHSIS--LQANL 763
>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 769
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/740 (50%), Positives = 493/740 (66%), Gaps = 27/740 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+D+P+C L ER KDL+ R+TL EKV+ + A GVPRLG+ Y+WWSEALHGVS +
Sbjct: 37 TDYPFCRRSLVVEERVKDLIGRLTLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNV 96
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PGT F E P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G GLT
Sbjct: 97 G------PGTRFGGEFPAATSFPQVISTAASFNASLWEAIGRVVSDEARAMYNGGVGGLT 150
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
+WSPN+N+ RDPRWGR ETPGEDP + G YA+NYVRGLQ EG LK++A
Sbjct: 151 YWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNYVRGLQGTEGNR---------LKVAA 201
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV G VSSVMCSYN+VNG+
Sbjct: 202 CCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPFRMCVKGGKVSSVMCSYNQVNGV 261
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCADP LL T+R W+ GYIVSDCDS+ S + T E+A A +KAGLDLDC
Sbjct: 262 PTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQHY-TSTPEEAAAMAIKAGLDLDC 320
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQ 366
G + T AV++G + E+ I+ +L V MRLG FDG + Y +LG ++C+
Sbjct: 321 GSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMFDGDLKTQPYAHLGAKHVCSDH 380
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
+ +LA +AARQGIVLL+N G+LPL+T + +A+VGP++NAT MIGNY G C Y +P
Sbjct: 381 NRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPNSNATLTMIGNYAGIACEYITP 440
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G Y++ I + GC + C++N AI+AA+ ADA V+V GLD S+EAE +DR
Sbjct: 441 LQGISKYTRTI-HQEGCRGVACRSNKFFGGAIEAARVADAVVLVMGLDQSIEAEFRDRAG 499
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q +L+ KVA AKGPV LV+MS G +D++FAK++PKI I+W GYPG+ GG AI
Sbjct: 500 LLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDHPKISGIIWGGYPGQAGGLAI 559
Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
ADV+FG+ NPGG+LP+TWY +YV K+P T+M LRP ++PGRTY+F+ GPVVYPFG+GL
Sbjct: 560 ADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLRPGTSYPGRTYRFYKGPVVYPFGHGL 619
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
SYT F +K+ S+P ++ + + + + + G AV + KC ++
Sbjct: 620 SYTAFTHKILSAPTTLTVPVTGHRHPHNGSEFWGK------AVRVTHAKCDRLSLVIKVA 673
Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
V N+G DG+ ++VYS PP KQ++ +E+V I A +V ++ CK L +VD
Sbjct: 674 VRNIGARDGAHTLLVYSIPPMGVWVPQKQLVAFEKVHIDAQALKEVQINIHVCKLLSVVD 733
Query: 726 NAANSLLASGAHTILVGEGV 745
+ G H I +G+ V
Sbjct: 734 KYGIRRVPMGEHGIDIGDNV 753
>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
Length = 771
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/748 (49%), Positives = 499/748 (66%), Gaps = 33/748 (4%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ P+C+ KL PER KDL+ R+T+ EKV + + A VPR+G+ YEWWSEALHGVS +G
Sbjct: 39 NLPFCNVKLAIPERVKDLIGRLTMQEKVNLLVNNAPAVPRVGMKSYEWWSEALHGVSNVG 98
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PGT F P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G AGLT+
Sbjct: 99 ------PGTRFGGVFPAATSFPQVITTAASFNASLWEAIGRVVSDEARAMYNGGAAGLTY 152
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP + GRYA +YV+GLQ +G LK++AC
Sbjct: 153 WSPNVNIFRDPRWGRGQETPGEDPVLAGRYAASYVKGLQGTDG---------NKLKVAAC 203
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYD+DNW G DRFHF++ V++QD+++TF +PF MCV EG V+SVMCSYN+VNG+P
Sbjct: 204 CKHFTAYDVDNWNGVDRFHFNALVSKQDIEDTFDVPFRMCVKEGKVASVMCSYNQVNGVP 263
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
TCADP LL +T+RG W GYIVSDCDS+ + S + T E+A A +KAGLDLDCG
Sbjct: 264 TCADPNLLKKTVRGVWGLDGYIVSDCDSVGVLYNSQHY-TSTPEEAAADAIKAGLDLDCG 322
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
+ T AV++G + EAD++ +L V MRLG FDG P Y LG ++C P H
Sbjct: 323 PFLGVHTQDAVKKGLLTEADVNNALVNTLKVQMRLGMFDGEPSAQAYGRLGPKDVCKPAH 382
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
ELA EAARQGIVLLKN LPL+ +T+A++GP+++ T MIGNY G C YTSP+
Sbjct: 383 QELALEAARQGIVLLKNTGPTLPLSPQRHRTVAVIGPNSDVTVTMIGNYAGIACGYTSPL 442
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y+K I + GC+++ C+++ A+DAA++ADAT++V GLD S+EAE DR L
Sbjct: 443 QGIGRYAKTI-HQQGCSNVACRDDKQFGPALDAARHADATILVIGLDQSIEAETVDRTSL 501
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q +L++KVA A+KGP LV+MS G VDI FAKN+PK+ ILW GYPG+ GG AIA
Sbjct: 502 LLPGHQQDLVSKVAAASKGPTILVLMSGGPVDITFAKNDPKVAGILWAGYPGQAGGAAIA 561
Query: 548 DVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVN-NFPGRTYKFFDGPVVYPFGYGL 605
D++FG +PGG+LP+TWY Y+K + T+M +RP +PGRTY+F+ GPVVYPFG+GL
Sbjct: 562 DILFGTASPGGKLPVTWYPQEYLKNLAMTNMAMRPSKIGYPGRTYRFYKGPVVYPFGHGL 621
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
+YT F ++++S+P V + + + + N +NK A+ + +C ++
Sbjct: 622 TYTHFVHELSSAPTVVSVPVHGHRHGNNTNI---SNK----AIRVTHARCGKLSIALHVD 674
Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTHI---KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
V+N+G DG+ ++V+S PP G H K ++ +E+V + A +V ++ CK L
Sbjct: 675 VKNVGSRDGTHTLLVFSAPPN-GGNHWVPQKSLVAFEKVHVPAKTKQRVRVNIHVCKLLS 733
Query: 723 IVDNAANSLLASGAHTILVGEGVGGVSF 750
+VD + + G H++ +G+ VS
Sbjct: 734 VVDKSGIRRIPMGEHSLHIGDVKHSVSL 761
>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 769
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/740 (50%), Positives = 493/740 (66%), Gaps = 27/740 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+D+P+C L ER KDL+ R+TL EKV+ + A GVPRLG+ Y+WWSEALHGVS +
Sbjct: 37 TDYPFCRRSLVVGERVKDLIGRLTLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNV 96
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PGT F E P ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G GLT
Sbjct: 97 G------PGTRFGGEFPAATSFPQVISTAASFNASLWEAIGRVVSDEARAMYNGGVGGLT 150
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
+WSPN+N+ RDPRWGR ETPGEDP + G YA+NYVRGLQ EG LK++A
Sbjct: 151 YWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNYVRGLQGTEGNR---------LKVAA 201
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH+ AYDLDNW G DRFHF+++V++QD+++TF +PF MCV G VSSVMCSYN+VNG+
Sbjct: 202 CCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPFRMCVKGGKVSSVMCSYNQVNGV 261
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCADP LL T+R W+ GYIVSDCDS+ S + T E+A A +KAGLDLDC
Sbjct: 262 PTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQHY-TSTPEEAAAMAIKAGLDLDC 320
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQ 366
G + T AV++G + E+ I+ +L V MRLG FDG + Y +LG ++C+
Sbjct: 321 GSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMFDGDLKTQPYAHLGAKHVCSDH 380
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
+ +LA +AARQGIVLL+N G+LPL+T + +A+VGP++NAT MIGNY G C Y +P
Sbjct: 381 NRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPNSNATLTMIGNYAGIACEYITP 440
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G Y++ I + GC + C++N AI+AA+ ADA V+V GLD S+EAE +DR
Sbjct: 441 LQGISKYTRTI-HQEGCRGVACRSNKFFGGAIEAARVADAVVLVMGLDQSIEAEFRDRAG 499
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q +L+ KVA AKGPV LV+MS G +D++FAK++PKI I+W GYPG+ GG AI
Sbjct: 500 LLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKDHPKISGIIWGGYPGQAGGLAI 559
Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
ADV+FG+ NPGG+LP+TWY +YV K+P T+M LRP ++PGRTY+F+ GPVVYPFG+GL
Sbjct: 560 ADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLRPGTSYPGRTYRFYKGPVVYPFGHGL 619
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
SYT F +K+ S+P ++ + + + + + G AV + KC ++
Sbjct: 620 SYTAFTHKILSAPTTLTVPVTGHRHPHNGSEFWGK------AVRVTHAKCDRLSLVIKVA 673
Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
V N+G DG+ ++VYS PP KQ++ +E+V I A +V ++ CK L +VD
Sbjct: 674 VRNIGARDGAHTLLVYSIPPMGVWVPQKQLVAFEKVHIDAQALKEVQINIHVCKLLSVVD 733
Query: 726 NAANSLLASGAHTILVGEGV 745
+ G H I +G+ V
Sbjct: 734 KYGIRRVPMGEHGIDIGDNV 753
>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 775
Score = 753 bits (1945), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/739 (50%), Positives = 499/739 (67%), Gaps = 30/739 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
F +C+ +P R +DL+ R+TLPEK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 49 FKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGYEWWSEALHGVSNVG- 107
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PGT F PGAT FP VI T ASFN+SLW++IG+ VS EARAMYN G AGLT+W
Sbjct: 108 -----PGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYNGGQAGLTYW 162
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
SPN+N+ RDPRWGR ETPGEDP + +YA +YV+GLQ DS LK++ACC
Sbjct: 163 SPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQG--------DSAGNHLKVAACC 214
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KHY AYDLDNW G DRFHF+++V++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PT
Sbjct: 215 KHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVNGKPT 274
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
CADP LL TIRG W +GYIVSDCDS+ ++ + T E+A A +KAGLDLDCG
Sbjct: 275 CADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQHY-TKTPEEAAAEAIKAGLDLDCGP 333
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHI 368
+ T A+++G I+E D++ +L L V MRLG FDG P Y NLG ++C H
Sbjct: 334 FLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMFDGEPSTQPYGNLGPRDVCTSAHQ 393
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+LA EAAR+ IVLL+N +LPL+ ++T+ +VGP+A+AT MIGNY G C YT+P+
Sbjct: 394 QLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPNADATVTMIGNYAGVACGYTTPLQ 453
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
G Y K + GC + C+ N + AA A+ ADA V+V GLD +VEAE +DRV LL
Sbjct: 454 GIARYVKTAHQV-GCRGVACRGNELFGAAETIARQADAIVLVMGLDQTVEAETRDRVGLL 512
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
LPG Q EL+ +VA AAKGPV L+IMS G VDI+FAKN+PKI +ILWVGYPG+ GG AIAD
Sbjct: 513 LPGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKNDPKISAILWVGYPGQAGGTAIAD 572
Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
VIFG NPGGRLP+TWY Y+ K+P T+M +R P +PGRTY+F+ GPVV+PFG+GL
Sbjct: 573 VIFGTTNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPTTGYPGRTYRFYKGPVVFPFGHGL 632
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD-YKFTFQI 664
SY++F + +A +PK V + + Q N T+ + AV + C D + F +
Sbjct: 633 SYSRFSHSLALAPKQVSVPIMSLQAL--TNSTLSSK-----AVKVSHANCDDSLEMEFHV 685
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
+V+N G MDG+ ++++S+PP + IKQ++G+ + + AG +V ++ CK L +V
Sbjct: 686 DVKNEGSMDGTHTLLIFSQPPHGKWSQIKQLVGFHKTHVLAGSKQRVKVGVHVCKHLSVV 745
Query: 725 DNAANSLLASGAHTILVGE 743
D + +G H + +G+
Sbjct: 746 DQFGVRRIPTGEHELHIGD 764
>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
Length = 770
Score = 753 bits (1945), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/741 (49%), Positives = 488/741 (65%), Gaps = 30/741 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L + +CDA L R DLV R+TL EK+ + A GV RLG+P YEWWSEALHGV++
Sbjct: 45 LGNLTFCDASLAVENRVNDLVNRLTLGEKIGFLVSGAGGVSRLGIPKYEWWSEALHGVAY 104
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
G PG HF S VPGATSFP VILT ASFN +L++ IG+ VSTEARAMYN+G AGL
Sbjct: 105 TG------PGVHFTSLVPGATSFPQVILTAASFNVTLFQTIGKVVSTEARAMYNVGLAGL 158
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP + +Y + YV GLQ + D + LK++
Sbjct: 159 TYWSPNVNIFRDPRWGRGQETPGEDPTLTSKYGVAYVEGLQQTD------DGSTNKLKVA 212
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ F++ V +QD+ +TF PF CV EG V+SVMCSYN+VNG
Sbjct: 213 ACCKHYTAYDVDNWKGIERYSFNAVVRQQDLDDTFQPPFRSCVLEGAVASVMCSYNQVNG 272
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTC DP LL +RG+W +GYIV+DCDS+Q I +S + T E+A A L +G+DL+
Sbjct: 273 KPTCGDPNLLAGIVRGEWKLNGYIVTDCDSLQVIFKSQNY-TKTPEEAAALGLNSGVDLN 331
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + + +T GAV Q + E+ ID ++ + LMRLG+FDG+P+ Y NLG ++C P
Sbjct: 332 CGSWLSTYTQGAVNQKLVNESVIDRAISNNFATLMRLGFFDGNPKSRIYGNLGPKDVCTP 391
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
++ ELA EAARQGIVLLKN G+LPL IK+LA++GP+AN TK MIGNYEG PC+YT+
Sbjct: 392 ENQELAREAARQGIVLLKNTAGSLPLTPTAIKSLAVIGPNANVTKTMIGNYEGIPCKYTT 451
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A I Y PGCAD+ C N + I A A ADA V+V G D S+E E DR
Sbjct: 452 PLQGLTASVATI-YKPGCADVSC-NTAQIDDAKQIATTADAVVLVMGSDQSIEKESLDRT 509
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+ LPG Q+ L+ +VA AKGPV LVIMS G +D+ FA +NPKI SILWVG+PGE GG A
Sbjct: 510 SITLPGQQSILVAEVAKVAKGPVILVIMSGGGMDVQFAVDNPKITSILWVGFPGEAGGAA 569
Query: 546 IADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
+ADVIFG YNP GRLP+TWY +Y +P T M +R P N+PGRTY+F+ GP V+ FG
Sbjct: 570 LADVIFGYYNPSGRLPMTWYPQSYADVVPMTDMNMRPNPATNYPGRTYRFYTGPTVFTFG 629
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
+GLSY+QFK+ + +P+ V + L + CR C V C + F
Sbjct: 630 HGLSYSQFKHHLDKAPQFVSLPLGEKHTCR---------LSKCKTVDAVGQSCSNMGFDI 680
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V+N+GK+ GS ++ +++ PP + K ++G+E+V + V F +N CK L
Sbjct: 681 HLRVKNVGKISGSHIIFLFTSPPSVHNAPKKHLLGFEKVHLTPQGEGVVKFNVNVCKHLS 740
Query: 723 IVDNAANSLLASGAHTILVGE 743
+ D N +A G H + +G+
Sbjct: 741 VHDELGNRKVALGPHVLHIGD 761
>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
Length = 784
Score = 751 bits (1940), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/742 (50%), Positives = 498/742 (67%), Gaps = 27/742 (3%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +C+ L R DLV R+TL EK+ + A GV RLG+P YEWWSEALHGVS+
Sbjct: 54 LAAYGFCNTVLKIEYRVADLVARLTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSY 113
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
IG PGTHF S+VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 114 IG------PGTHFSSQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGL 167
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP + +YA YV+GLQ+ +G DS LK++
Sbjct: 168 TYWSPNVNIFRDPRWGRGQETPGEDPLLASKYASGYVKGLQETDG------GDSNRLKVA 221
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ F++ VT+QDM +T+ PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 222 ACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNG 281
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL+ IRG+W +GYIVSDCDS+ + ++ + E A +L AGLDL+
Sbjct: 282 KPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTKTPAEAAAISIL-AGLDLN 340
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T AV+ G + EA ID ++ ++ LMRLG+FDG+P+ Y LG ++C
Sbjct: 341 CGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTS 400
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELAA+AARQGIVLLKN G LPL+ +IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 401 ANQELAADAARQGIVLLKN-TGFLPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 459
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A + Y PGC+++ C + A A AD TV++ G D S+EAE +DRV
Sbjct: 460 PLQGL-AGAVSTTYLPGCSNVACAVAD-VAGATKLAATADVTVLLIGADQSIEAESRDRV 517
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL LPG Q EL+ +VA AAKGPV LVIMS G DI FAKN+PKI ILWVGYPGE GG A
Sbjct: 518 DLNLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIA 577
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
IAD+IFG+YNP GRLP+TWY +YV K+P T M +RP +PGRTY+F+ G VY FG
Sbjct: 578 IADIIFGRYNPSGRLPMTWYPQSYVEKVPMTIMNMRPDKSKGYPGRTYRFYTGETVYAFG 637
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKPPCAAVLIDDVKCKDYKFT 661
GLSYT+F + + +P V + L+++ CR ++ P C + V F
Sbjct: 638 DGLSYTKFSHSLVKAPSLVSLSLEENHVCRSSECQSLDAIGPHCE----NAVSGGGSAFE 693
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
QI+V N G +G V +++ PP I G+ K ++G+E++ + + A V F + CK L
Sbjct: 694 VQIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLLGFEKIRLGKMEEAVVRFKVEVCKDL 753
Query: 722 KIVDNAANSLLASGAHTILVGE 743
+VD + G H + VG+
Sbjct: 754 SVVDEIGKRKIGLGKHLLHVGD 775
>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
Length = 784
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/742 (50%), Positives = 498/742 (67%), Gaps = 27/742 (3%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +C+ L R DLV R+TL EK+ + A GV RLG+P YEWWSEALHGVS+
Sbjct: 54 LAAYGFCNTVLKIEYRVADLVARLTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSY 113
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
IG PGTHF S+VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 114 IG------PGTHFSSQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGL 167
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP + +YA YV+GLQ+ +G DS LK++
Sbjct: 168 TYWSPNVNIFRDPRWGRGQETPGEDPLLASKYASGYVKGLQETDG------GDSNRLKVA 221
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ F++ VT+QDM +T+ PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 222 ACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNG 281
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL+ IRG+W +GYIVSDCDS+ + ++ + E A +L AGLDL+
Sbjct: 282 KPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTKTPAEAAAISIL-AGLDLN 340
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T AV+ G + EA ID ++ ++ LMRLG+FDG+P+ Y LG ++C
Sbjct: 341 CGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTS 400
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELAA+AARQGIVLLKN G LPL+ +IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 401 ANQELAADAARQGIVLLKN-TGCLPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 459
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A + Y PGC+++ C + A A AD +V+V G D S+EAE +DRV
Sbjct: 460 PLQGL-AGTVSTTYLPGCSNVACAVAD-VAGATKLAATADVSVLVIGADQSIEAESRDRV 517
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL LPG Q EL+ +VA AAKGPV LVIMS G DI FAKN+PKI ILWVGYPGE GG A
Sbjct: 518 DLHLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIA 577
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
IAD+IFG+YNP G+LP+TWY +YV K+P T M +RP + +PGRTY+F+ G VY FG
Sbjct: 578 IADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKASGYPGRTYRFYTGETVYAFG 637
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKPPCAAVLIDDVKCKDYKFT 661
GLSYT+F + + +P V + L+++ CR ++ P C + V F
Sbjct: 638 DGLSYTKFSHTLVKAPSLVSLGLEENHVCRSSECQSLDAIGPHCE----NAVSGGGSAFE 693
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
I+V N G +G V +++ PP I G+ K ++G+E++ + + A V F + CK L
Sbjct: 694 VHIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLVGFEKIRLGKREEAVVRFKVEICKDL 753
Query: 722 KIVDNAANSLLASGAHTILVGE 743
+VD + G H + VG+
Sbjct: 754 SVVDEIGKRKIGLGKHLLHVGD 775
>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
Length = 765
Score = 749 bits (1933), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/742 (50%), Positives = 504/742 (67%), Gaps = 32/742 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
+S + +CD RA DL+ R+TL EKV + + +PRLG+P YEWWSEALHGVS+
Sbjct: 40 VSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYEWWSEALHGVSY 99
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F + VPGATSFP ILT ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 100 VG------PGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGLAGL 153
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + +YA+ YV GLQD G S LK++
Sbjct: 154 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGG-------GSDALKVA 206
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ FD+ V++QD+ +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 207 ACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNG 266
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCAD LL+ IRGDW +GYIVSDCDS+ + + + + EDA A +K+GLDL+
Sbjct: 267 KPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKN-PEDAAAITIKSGLDLN 325
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG++ T+ AVQ GK++E+D+D ++ +IVLMRLG+FDG P+ + +LG ++C
Sbjct: 326 CGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTS 385
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA EAARQGIVLLKN GALPL+ +IK++A++GP+ANA+ MIGNYEGTPC+YT+
Sbjct: 386 SNQELAREAARQGIVLLKN-TGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 444
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
P+ G A + Y PGC ++ C NS+ + AA AA +AD TV+V G D SVE E DR
Sbjct: 445 PLQGLGANVATV-YQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDR 503
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
LLLPG Q +L++ VA+A++GPV LV+MS G DI+FAK++ KI +ILWVGYPGE GG
Sbjct: 504 TSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPGEAGGA 563
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
A+AD++FG +NPGGRLP+TWY A++ K+ T M +RP +PGRTY+F+ G VY F
Sbjct: 564 ALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAF 623
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G GLSYT+F + + S+P+ V ++L + C + C +V C F
Sbjct: 624 GDGLSYTKFAHSLVSAPEQVAVQLAEGHACHTEH---------CFSVEAAGEHCGSLSFD 674
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
+ V N G M G V ++S PP + K ++G+E+V + GQ+ V F ++ CK L
Sbjct: 675 VHLRVRNAGGMAGGHTVFLFSSPPSVHSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDL 734
Query: 722 KIVDNAANSLLASGAHTILVGE 743
+VD N +A G+HT+ VG+
Sbjct: 735 SVVDELGNRKVALGSHTLHVGD 756
>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
Length = 896
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/741 (51%), Positives = 489/741 (65%), Gaps = 62/741 (8%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S FP+C+ LPY +RA DLV R+TL EK +Q+ + A G+ RLG+P YEWWSEALHGVS
Sbjct: 61 SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVS-- 118
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
NS G HF +P T FP VIL+ ASFNESLW +GQ VSTE RAMYN+G AGLT
Sbjct: 119 ----NSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLT 174
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
+WSPN+N+ RDPRWGR ETPGEDP VV RYA+NYVRGLQ+V G E + +D LK+S+
Sbjct: 175 YWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEGNFAADR--LKVSS 231
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKHY AYD+D W+G DRFHFD++VT QD+++T+ PF+ CV EG VSSVMCSYNRVNG+
Sbjct: 232 CCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGV 291
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCA+P+LL IR W GYIVSDCDSI E + +T EDAVA LKAGL+L+C
Sbjct: 292 PTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNC 350
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G Y ++T AV GK+ E+ ++ +L + YIVLMRLG+FDG P + +G +++C
Sbjct: 351 GSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVD 410
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA +AA+QGIVLL N NGALPL+ KTLA++GP+A+AT M+ NY G PCRYTSP
Sbjct: 411 HQLLALDAAKQGIVLLHN-NGALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSP 469
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G Y ++Y GCA++ C ++I A A ADATV+V GLDL +EAE DRV+
Sbjct: 470 LQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVN 529
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
L LPGFQ +L+ + A AA G V LV+MSAG VDI+F KN KI ILWVGYPG+ GG AI
Sbjct: 530 LTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAI 589
Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGY 603
+ VIFG YNPGGR P TWY YV ++P T M +RP +NFPGRTY+F+ G +Y FG+
Sbjct: 590 SQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGRTYRFYTGKSLYQFGH 649
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSY+ F YK S+ ID V
Sbjct: 650 GLSYSTF-YKNLSN--------------------------------IDIV---------- 666
Query: 664 IEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
I V+N G++DG+ VV+ + KPP G+ G +++G+ERV + G++ VG ++ C +
Sbjct: 667 IGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLDVCGKI 726
Query: 722 KIVDNAANSLLASGAHTILVG 742
VD L G HT++VG
Sbjct: 727 SNVDEEGKRKLVMGMHTLVVG 747
>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
Length = 810
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/749 (50%), Positives = 506/749 (67%), Gaps = 24/749 (3%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
+ +D+ +C+ L Y +RAKDL+ R+TL EKVQQ+ + A G+PRLG+P YEWWSEALHGVS
Sbjct: 33 QTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPRLGIPAYEWWSEALHGVS 92
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
+G G F+ VPGATSFP +IL+ ASFNE+LW K+GQ VSTEAR M+++G AG
Sbjct: 93 NVGF------GVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTEARTMHSVGLAG 146
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
LT+WSPN+NV RDPRWGR ETPGEDP VV RYA+NYVRGLQ+V G E + +D LK+
Sbjct: 147 LTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GDEGNSTADK--LKV 203
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
S+CCKHY AYDLD W+G DRFHFD++VT+QD+++T+ PF CV E VSSVMCSYNRVN
Sbjct: 204 SSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQPPFRSCVEEAHVSSVMCSYNRVN 263
Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
GIPTCADP LL IRG+WN GYIVSDCDSI+ +S + T EDAVA LKAGL++
Sbjct: 264 GIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDSINY-TATPEDAVALALKAGLNM 322
Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICN 364
+CG++ +T+ AV+ K+ E+ +D +L + +IVLMRLG+FDG P+ + NLG +++C+
Sbjct: 323 NCGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLGFFDGDPKSLLFGNLGPSDVCS 382
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
H +LA +AARQGIVLL N GALPL+ N + LA++GP+AN T MI NY G PC+YT
Sbjct: 383 DGHQKLALDAARQGIVLLYN-KGALPLSKNNTRNLAVIGPNANVTTTMISNYAGIPCKYT 441
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
+P+ G Y + YA GC + C ++++I AA AA ADA V++ GLD S+E EG DR
Sbjct: 442 TPLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAAADAVVLLVGLDQSIEREGLDR 501
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
+L LPGFQ +L+ V +A G V LV+MS+ +D++FA N KIK ILWVGYPG+ GG
Sbjct: 502 ENLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFAVNKSKIKGILWVGYPGQAGGD 561
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPF 601
A+A V+FG YNP GR P TWY Y ++P T M +R NFPGRTY+F+ G +Y F
Sbjct: 562 AVAQVMFGDYNPAGRSPFTWYPQEYAHQVPMTDMNMRANSTANFPGRTYRFYAGNTLYKF 621
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV-GTNKPP---CAAVLIDDVKCKD 657
G+GLSY+ F + S P ++ +K + D + I T T + P A+ I + C +
Sbjct: 622 GHGLSYSTFSNFIISGPSTLLLKTNSDLKPDIILSTHNSTEEHPFINSQAMDITTLNCTN 681
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPG---IAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+ + V N G + G VV+V+ KPP + G Q++G+ RV + G++ V
Sbjct: 682 SLLSLILGVRNNGPVSGDHVVLVFWKPPNSSEVTGAANVQLVGFSRVEVNRGKTQNVTLE 741
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGE 743
++ CK L +VD+ L +G H +G
Sbjct: 742 IDVCKRLSLVDSEGKRKLVTGQHIFTIGS 770
>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
Length = 767
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/741 (50%), Positives = 500/741 (67%), Gaps = 31/741 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +C+ RA DLV R+TL EKV + D +PRLG+PLYEWWSEALHGVS+
Sbjct: 43 LASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALPRLGIPLYEWWSEALHGVSY 102
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F S VP ATSFP ILT ASFN +L++ IG+ VS EARAM+N+G AGL
Sbjct: 103 VG------PGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNVGLAGL 156
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + +YA+ YV GLQD S S LK++
Sbjct: 157 TFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDA-------GSGSGSLKVA 209
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ F++ V++QD+ +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 210 ACCKHYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVNG 269
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCAD LL+ IRGDW +GYI SDCDS+ + + + T EDA A +KAGLDL+
Sbjct: 270 KPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPEDAAAISIKAGLDLN 328
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG++ T+ AVQ GK++E+D+D ++ +I LMRLG+FDG P+ + NLG +++C
Sbjct: 329 CGNFLAQHTVAAVQAGKLSESDVDRAITNNFITLMRLGFFDGDPRKLPFGNLGPSDVCTS 388
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA EAARQGIVLLKN +GALPL+ +IK+LA++GP+ANA+ MIGNYEGTPC+YT+
Sbjct: 389 SNQELAREAARQGIVLLKN-SGALPLSASSIKSLAVIGPNANASFTMIGNYEGTPCKYTT 447
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
P+ G A + Y PGC ++ C NS+ + AA AA +AD TV+V G D S+E E DR
Sbjct: 448 PLQGLGANVATV-YQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDR 506
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
LLLPG Q +L++ VA+A++GP LVIMS G DI+FAK++ KI +ILWVGYPGE GG
Sbjct: 507 TSLLLPGQQPQLVSAVANASRGPCILVIMSGGPFDISFAKSSDKIAAILWVGYPGEAGGA 566
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
AIADV+FG +NP GRLP+TWY ++ K+P M +RP +PGRTY+F+ G VY FG
Sbjct: 567 AIADVLFGHHNPSGRLPVTWYPESFTKVPMIDMRMRPDASTGYPGRTYRFYTGDTVYAFG 626
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GLSYT F + + S+PK V ++L + C C +V + C+ F
Sbjct: 627 DGLSYTSFAHHLVSAPKQVALQLAEGHTCL---------TEQCPSVEAEGAHCEGLAFDV 677
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V N G M G+ V ++S PP + K ++G+E+V + GQ+ V F ++ CK L
Sbjct: 678 HLRVRNAGDMSGAHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLS 737
Query: 723 IVDNAANSLLASGAHTILVGE 743
+VD N +A G HT+ VG+
Sbjct: 738 VVDELGNRKVALGNHTLHVGD 758
>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 779
Score = 748 bits (1931), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/747 (50%), Positives = 486/747 (65%), Gaps = 31/747 (4%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ P+C LP RA+DLV R+T EKV+ + + A GVPRLG+ YEWWSEALHGVS
Sbjct: 36 TRLPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPRLGVEGYEWWSEALHGVSDT 95
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PG F PGAT+FP VI T ASFN SLW+ IG+ VS E RA+YN AGLT
Sbjct: 96 G------PGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDEGRAIYNGRQAGLT 149
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
FWSPN+N+ RDPRWGR ETPGEDP V GRYA YVRGLQ + LK +A
Sbjct: 150 FWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQ---------QHAGRLKTAA 200
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH+ AYDLD W G DRFHF++ VT QD+++TF PF CV EG ++VMCSYN+VNG+
Sbjct: 201 CCKHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFNAPFRACVVEGRAAAVMCSYNQVNGV 260
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCAD L TIRG W GYIVSDCDS+ + T+EDAVA L+AGLDLDC
Sbjct: 261 PTCADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYREQHYTR-TREDAVAATLRAGLDLDC 319
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQ 366
G + +T AV QGK+ EADID ++ V MRLG FDG + + +LG ++C P
Sbjct: 320 GPFLAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRLGMFDGDVAAQPFGHLGPQHVCTPA 379
Query: 367 HIELAAEAARQGIVLLKNDNG---ALPLNTGNIK-TLALVGPHANATKAMIGNYEGTPCR 422
H ELA EAA Q IVLLKN G LPL++ + + T+A+VGPH+ AT AMIGNY G PC
Sbjct: 380 HRELALEAACQSIVLLKNGGGNNMRLPLSSHHRRGTVAVVGPHSEATVAMIGNYAGKPCA 439
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEG 481
YT+P+ G Y++ + GC D+ CQ + I AA+DAA++ADATV+V GLD SVEAEG
Sbjct: 440 YTTPLQGVGRYARATVHQAGCTDVACQGSGQPIDAAVDAARHADATVVVVGLDQSVEAEG 499
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR LLLPG Q EL++ VA A+KGPV LV+MS G VDI FA+N+ + +ILW GYPG+
Sbjct: 500 LDRTTLLLPGRQAELVSAVARASKGPVILVLMSGGPVDIAFAQNDRNVAAILWAGYPGQA 559
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
GG+AIADVIFG +NPGG+LP+TWY +Y+ K P T+M +R P +PGRTY+F+ GP +
Sbjct: 560 GGQAIADVIFGHHNPGGKLPVTWYPEDYLRKAPMTNMAMRADPARGYPGRTYRFYAGPTI 619
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
+PFG+GLSYT+F + +A +P + ++ R T V + +C+
Sbjct: 620 HPFGHGLSYTKFAHTLAHAPAHLTVRRAAGH--RTTAAINTTTASHLNDVRVAHAQCEGL 677
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
+ ++V+N+G DG+ V VY+ PP I G ++Q++ +E+V +AAG A+V ++
Sbjct: 678 SVSVHVDVKNVGSRDGAHTVFVYASPPIAAIHGAPVRQLVAFEKVHVAAGAVARVKMGVD 737
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGE 743
C SL I D + G H +++GE
Sbjct: 738 VCGSLSIADQEGVRRIPIGEHRLMIGE 764
>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
Length = 780
Score = 748 bits (1930), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/742 (50%), Positives = 501/742 (67%), Gaps = 26/742 (3%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +C+ + R DLV R+TL EK+ + +GV RLG+P YEWWSEALHGVS+
Sbjct: 49 LAAYGFCNTAIKIEYRVADLVARLTLQEKIGVLTSKLHGVARLGIPTYEWWSEALHGVSY 108
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F +VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 109 VG------PGTRFSGQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGL 162
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP + +YA YV+GLQ+ + SD+ LK++
Sbjct: 163 TYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVKGLQETD------SSDANRLKVA 216
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ F++ V +QD+ +T+ PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDVDNWKGVERYSFNAVVNQQDLDDTYQPPFKSCVVDGNVASVMCSYNKVNG 276
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL+ IRG+W +GYIVSDCDS+ + ++ + T E+A A + AGLDL+
Sbjct: 277 KPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHY-TKTPEEAAAISINAGLDLN 335
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + + T AV+ G + EA ID ++ ++ LMRLG+FDG P+ Y LG ++C P
Sbjct: 336 CGYFLGDHTEAAVKAGLVKEAAIDKAITNNFLTLMRLGFFDGDPKKQIYGGLGPKDVCTP 395
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELAAEAARQGIVLLKN GALPL+ IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 396 ANQELAAEAARQGIVLLKN-TGALPLSPKTIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 454
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A + Y PGC+++ C + + + A +DATV+V G D S+EAE +DRV
Sbjct: 455 PLQGL-AGTVHTTYLPGCSNVACAV-ADVAGSTKLAAASDATVLVIGADQSIEAESRDRV 512
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL LPG Q EL+ +VA AAKGPV LVIMS G DI FAKN+ KI ILWVGYPGE GG A
Sbjct: 513 DLNLPGQQQELVTQVAKAAKGPVFLVIMSGGGFDITFAKNDAKIAGILWVGYPGEAGGIA 572
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
ADVIFG+YNP GRLP+TWY +YV K+P T+M +RP N +PGRTY+F+ G VY FG
Sbjct: 573 TADVIFGRYNPSGRLPMTWYPQSYVEKVPMTNMNMRPDKSNGYPGRTYRFYTGETVYAFG 632
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKPPCAAVLIDDVKCKDYKFT 661
GLSYT+F + + +P+ V + L+++ CR ++ P C + F
Sbjct: 633 DGLSYTKFSHSLVKAPRLVSLSLEENHVCRSSECQSLNAIGPHCDNAV---SGTGGKAFE 689
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
I+V+N G +G V +++ PP + G+ K ++G+E++ + + A V F ++ CK L
Sbjct: 690 VHIKVQNGGDREGIHTVFLFTTPPAVHGSPRKHLLGFEKIRLGKMEEAVVKFKVDVCKDL 749
Query: 722 KIVDNAANSLLASGAHTILVGE 743
+VD + G H + VG+
Sbjct: 750 SVVDEVGKRKIGLGQHLLHVGD 771
>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 774
Score = 747 bits (1928), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/739 (49%), Positives = 497/739 (67%), Gaps = 30/739 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
F +C+ +P R +DL+ R+TLPEK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 48 FKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGYEWWSEALHGVSNVG- 106
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PGT F PGAT FP VI T ASFN+SLW++IG+ VS EARAMYN G AGLT+W
Sbjct: 107 -----PGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYNGGQAGLTYW 161
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
SPN+N+ RDPRWGR ETPGEDP + +YA +YV+GLQ D LK++ACC
Sbjct: 162 SPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQG--------DGAGNRLKVAACC 213
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KHY AYDLDNW G DRFHF+++V++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PT
Sbjct: 214 KHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVMCSYNQVNGKPT 273
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
CADP LL TIRG W +GYIVSDCDS+ ++ + T E+A A +KAGLDLDCG
Sbjct: 274 CADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQHY-TRTPEEAAAEAIKAGLDLDCGP 332
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHI 368
+ T A+++G I+E D++ +L L V MRLG FDG P + NLG ++C P H
Sbjct: 333 FLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMFDGEPSTQPFGNLGPRDVCTPAHQ 392
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+LA EAAR+ IVLL+N +LPL+ ++ + ++GP+ +AT MIGNY G C YT+P+
Sbjct: 393 QLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPNTDATVTMIGNYAGVACGYTTPLQ 452
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
G Y K + GC + C+ N + AA A+ DATV+V GLD ++EAE +DRV LL
Sbjct: 453 GIARYVKTAHQV-GCRGVACRGNELFGAAEIIARQVDATVLVMGLDQTIEAETRDRVGLL 511
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
LPG Q EL+ +VA AAKGPV LVIMS G VD++FAKNNPKI +ILWVGYPG+ GG AIAD
Sbjct: 512 LPGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKNNPKISAILWVGYPGQAGGTAIAD 571
Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
VIFG NPGGRLP+TWY Y+ K+P T+M +R P +PGRTY+F+ GPVV+PFG+GL
Sbjct: 572 VIFGATNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPATGYPGRTYRFYKGPVVFPFGHGL 631
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT-FQI 664
SY++F +A +PK V +++ Q N T+ + AV + C D T F +
Sbjct: 632 SYSRFSQSLALAPKQVSVQILSLQAL--TNSTLSSK-----AVKVSHANCDDSLETEFHV 684
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
+V+N G MDG+ ++++SKPP + IKQ++ + + + AG ++ +++CK L +V
Sbjct: 685 DVKNEGSMDGTHTLLIFSKPPPGKWSQIKQLVTFHKTHVPAGSKQRLKVNVHSCKHLSVV 744
Query: 725 DNAANSLLASGAHTILVGE 743
D + +G H + +G+
Sbjct: 745 DQFGVRRIPTGEHELHIGD 763
>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 744 bits (1922), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/742 (50%), Positives = 501/742 (67%), Gaps = 30/742 (4%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
L+ +C+ + +R DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS
Sbjct: 47 SLASLGFCNTSIGINDRVVDLVKRLTLQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVS 106
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
++G PGTHF +V GATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AG
Sbjct: 107 YVG------PGTHFSDDVAGATSFPQVILTAASFNTSLFEAIGKVVSTEARAMYNVGLAG 160
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
LTFWSPNIN+ RDPRWGR ETPGEDP + +Y YV+GLQ + D D LK+
Sbjct: 161 LTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVKGLQQRD------DGDPDKLKV 214
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
+ACCKHY AYDLDNW+G+DR+HF++ VT+QDM +TF PF+ CV +G+V+SVMCSYN+VN
Sbjct: 215 AACCKHYTAYDLDNWKGSDRYHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVN 274
Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
G PTCADP LL+ IRG+WN +GYIV+DCDS+ +S + +E A A +L AG+DL
Sbjct: 275 GKPTCADPDLLSGVIRGEWNLNGYIVTDCDSLDVFYKSQNYTKTPEEAAAAAIL-AGVDL 333
Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICN 364
+CG + T AV+ G + E ID ++ + LMRLG+FDG P Y LG ++C
Sbjct: 334 NCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCT 393
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
++ ELA EAARQGIVLLKN G+LPL+ IK LA++GP+AN TK MIGNYEGTPC+YT
Sbjct: 394 AENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYT 453
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
+P+ G A S Y PGC+++ C + + + A A ADATV+V G DLS+EAE +DR
Sbjct: 454 TPLQGL-AASVATTYLPGCSNVAC-STAQVDDAKKLAAAADATVLVMGADLSIEAESRDR 511
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
VD+LLPG Q LI VA+ + GPV LVIMS G +D++FA+ N KI SILWVGYPGE GG
Sbjct: 512 VDVLLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGA 571
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPF 601
AIAD+IFG YNP GRLP+TWY +YV K+P T+M +R P N +PGRTY+F+ G VY F
Sbjct: 572 AIADIIFGYYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSF 631
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G GLSY+QF +++ +P+ V + L++ C + C +V+ + C++ F
Sbjct: 632 GDGLSYSQFTHELIQAPQLVYVPLEESHVC---------HSSECQSVVASEQTCQNSTFD 682
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
+ V+N G + GS V ++S PP + + K ++G+E+VF+ A V F ++ CK L
Sbjct: 683 MLLRVKNEGTISGSHTVFLFSSPPAVHNSPQKHLVGFEKVFLNAQTGRHVRFKVDICKDL 742
Query: 722 KIVDNAANSLLASGAHTILVGE 743
+VD + +A G H + VG
Sbjct: 743 SVVDELGSKKVALGEHVLHVGS 764
>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
Length = 762
Score = 743 bits (1919), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/736 (49%), Positives = 495/736 (67%), Gaps = 31/736 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C+ ++P R +DL+ R+ LPEK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 39 YKFCNTRVPIHARVQDLIGRLALPEKIRLVVNNAIAVPRLGIQGYEWWSEALHGVSNVG- 97
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PGT F ATSFP VI T ASFN+SLW +IG+ VS EARAMYN G AGLTFW
Sbjct: 98 -----PGTKFGGAFSAATSFPQVITTAASFNQSLWLEIGRIVSDEARAMYNGGAAGLTFW 152
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
SPN+N+ RDPRWGR ETPGEDP V G+YA +YV+GLQ + LK++ACC
Sbjct: 153 SPNVNIFRDPRWGRGQETPGEDPTVAGKYAASYVQGLQG--------NGAGNRLKVAACC 204
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KHY AYDLDNW G DRFHF+++V++QD+ +T+ +PF+ CV +G V+SVMCSYN+VNG PT
Sbjct: 205 KHYTAYDLDNWNGVDRFHFNAKVSKQDLADTYDVPFKACVRDGKVASVMCSYNQVNGKPT 264
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
CADP+LL TIRG+W +GYIVSDCDS+ + ++ + T E A A +KAGLDLDCG
Sbjct: 265 CADPELLRNTIRGEWGLNGYIVSDCDSVGVLYDNQHY-TRTPEQAAAAAIKAGLDLDCGP 323
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIEL 370
+ T GA++QG I+E D++ +L L V MRLG FDG Q Y NLG ++C P H ++
Sbjct: 324 FLALHTDGAIKQGLISENDLNLALANLITVQMRLGMFDGDAQPYGNLGTRDVCLPSHNDV 383
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAARQGIVLL+N ALPL+ +T+ ++GP+++ T MIGNY G C YT+P+ G
Sbjct: 384 ALEAARQGIVLLQNKGNALPLSPTRYRTVGVIGPNSDVTVTMIGNYAGIACGYTTPLQGI 443
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y K I+ A GC D+ C N + + A+ ADATV+V GLD S+EAE +DR LLLP
Sbjct: 444 ARYVKTIHQA-GCKDVGCGGNQLFGLSEQVARQADATVLVMGLDQSIEAEFRDRTGLLLP 502
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL+++VA AA+GPV LV+MS G +D+ FAKN+PKI +ILWVGYPG+ GG AIADVI
Sbjct: 503 GHQQELVSRVARAARGPVILVLMSGGPIDVTFAKNDPKISAILWVGYPGQSGGTAIADVI 562
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
FG+ NP GRLP TWY +YV K+P T+M +R P +PGRTY+F+ GPVV+PFG+GLSY
Sbjct: 563 FGRTNPSGRLPNTWYPQDYVRKVPMTNMDMRANPATGYPGRTYRFYKGPVVFPFGHGLSY 622
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
++F + +A +PK V ++ +T +NK A+ + C + + F ++V+
Sbjct: 623 SRFTHSLALAPKQVSVQFTTPLTQA---FTNSSNK----AMKVSHANCDELEVGFHVDVK 675
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N G MDG+ ++VYSK P +KQ++ + + ++ AG +V ++ C L VD
Sbjct: 676 NEGSMDGAHTLLVYSKAP----NGVKQLVNFHKTYVPAGSKTRVKVGVHVCNHLSAVDEF 731
Query: 728 ANSLLASGAHTILVGE 743
+ G H + +G+
Sbjct: 732 GVRRIPMGEHELQIGD 747
>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
Length = 751
Score = 743 bits (1917), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/740 (51%), Positives = 489/740 (66%), Gaps = 53/740 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L F +C+ L R DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS+
Sbjct: 49 LGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSY 108
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGTHF+S VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGL
Sbjct: 109 VG------PGTHFNSVVPGATSFPQVILTAASFNASLFEAIGKAVSTEARAMYNVGLAGL 162
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQ + D LK++
Sbjct: 163 TFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSD------DGSPDRLKVA 216
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDLDNW+G DRFHF++ VT+QDM +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNG 276
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
P CADP LL+ +RG+W +GYIVSDCDS+ S + T E+A A+ + AGLDL+
Sbjct: 277 KPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLN 335
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T AV+ G + E+ +D ++ + LMRLG+FDG+P Y LG ++C
Sbjct: 336 CGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTS 395
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H ELA EAARQGIVLLKN G+LPL+ IKTLA++GP+AN TK MIGNYEGTPC+YT+
Sbjct: 396 EHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTT 455
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A Y PGC+++ C + I A A ADATV++ G+D S+EAEG+DRV
Sbjct: 456 PLQGLTALVAT-TYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIVGIDQSIEAEGRDRV 513
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
++ LPG Q LI +VA A+KG V LV+MS G DI+FAKN+ KI SILWVGYPGE GG A
Sbjct: 514 NIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSILWVGYPGEAGGAA 573
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
IADVIFG YNP GRLP+TWY +YV K+P T+M +R P + +PGRTY+F+ G +Y FG
Sbjct: 574 IADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFG 633
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GLSYTQF + ++ +D Q+ C++ F
Sbjct: 634 DGLSYTQFNHHLS---------VDAVQE-----------------------SCQNLVFDI 661
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V N G + GS V ++S PP + + K ++G+E+VF+ A A V F ++ CK L
Sbjct: 662 HLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAKALVRFKVDVCKDLS 721
Query: 723 IVDNAANSLLASGAHTILVG 742
IVD +A G H + VG
Sbjct: 722 IVDELGTRKVALGLHVLHVG 741
>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
gi|219885199|gb|ACL52974.1| unknown [Zea mays]
gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 794
Score = 741 bits (1914), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/766 (48%), Positives = 487/766 (63%), Gaps = 33/766 (4%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ P+C LP RA+DLV R+T EKV+ + + A GVPRLG+ YEWWSEALHGVS
Sbjct: 36 ASLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 95
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PG F PGAT+FP VI T AS N +LW+ +G+ VS EARAMYN G AGLT
Sbjct: 96 G------PGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAMYNGGRAGLT 149
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY--HRDSDSRPLKI 187
FWSPN+N+ RDPRWGR ETPGEDP V RYA YVRGLQ HR+ LK+
Sbjct: 150 FWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGHRNR----LKL 205
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
+ACCKH+ AYDLD W G DRFHF++ V QD+++TF +PF CV +G +SVMCSYN+VN
Sbjct: 206 AACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMCSYNQVN 265
Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
G+PTCAD L TIRG W GYIVSDCDS+ + T EDA A L+AGLDL
Sbjct: 266 GVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTR-TPEDAAAATLRAGLDL 324
Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICN 364
DCG + + AV GK+A+AD+D +L V MRLG FDG P + LG ++C
Sbjct: 325 DCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGPADVCT 384
Query: 365 PQHIELAAEAARQGIVLLKNDNGA------LPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+H +LA +AARQG+VLLKN GA LPL + +A+VGPHA+AT AMIGNY G
Sbjct: 385 REHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAG 444
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
PCRYT+P+ G AY+ + + GC D+ C+ N I AA++AA+ ADATV+VAGLD VE
Sbjct: 445 KPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGLDQRVE 504
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
AEG DR LLLPG Q ELI+ VA A+KGPV LV+MS G +DI FA+N+P+I ILWVGYP
Sbjct: 505 AEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYP 564
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDG 595
G+ GG+AIADVIFG +NPG +LP+TWY +Y+ K+P T+M +R P +PGRTY+F+ G
Sbjct: 565 GQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPGRTYRFYTG 624
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKL--DKDQQCRDINYTVGTNKPPCAAVLIDDV 653
P +YPFG+GLSYTQF + +A +P + ++L + T P AV +
Sbjct: 625 PTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPVRAVRVAHA 684
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI------AGTHIKQVIGYERVFIAAGQ 707
+C+ ++V N+G DG+ V+VY P A +Q++ +E+V + AG
Sbjct: 685 RCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFEKVHVPAGG 744
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
A+V + C L + D + G H +++GE VS ++
Sbjct: 745 VARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGELTHSVSLGVE 790
>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
Length = 770
Score = 741 bits (1913), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/739 (50%), Positives = 510/739 (69%), Gaps = 29/739 (3%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ P+C LP ERA+DLV R+TL EK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 38 NLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEWWSEALHGVSNVG 97
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PGT F PGATSFP VI T ASFN SLW++IG+ VS EARAMYN G AGLT+
Sbjct: 98 ------PGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGGMAGLTY 151
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP V +YA YVRGLQ RD LK++AC
Sbjct: 152 WSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNA-----RDR----LKVAAC 202
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKHY AYDLD+W G DRFHF++RV++QD+++T+ +PF+ CV EG+V+SVMCSYN+VNG P
Sbjct: 203 CKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQVNGKP 262
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
TCADP LL TIRG+W +GYIVSDCDS+ + + T E+A A +KAGLDLDCG
Sbjct: 263 TCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHY-TATPEEAAAVAIKAGLDLDCG 321
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
+ T A++ GK+ EAD++ +L V MRLG FDG P Y NLG ++C P H
Sbjct: 322 PFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCTPAH 381
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
+LA EAARQGIVL++N ALPL+T +T+A++GP+++ T+ MIGNY G C YT+P+
Sbjct: 382 QQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYTTPL 441
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y++ I+ A GC+ + C+++ AA+ AA+ ADATV+V GLD S+EAE +DRVD+
Sbjct: 442 QGIGRYARTIHQA-GCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDRVDI 500
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q EL++KVA A++GP LV+MS G +D++FAKN+P+I +I+WVGYPG+ GG AIA
Sbjct: 501 LLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGTAIA 560
Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
DV+FG+ NPGG+LP+TWY +Y+ K P T+M +R P +PGRTY+F++GPVV+PFG+G
Sbjct: 561 DVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNGPVVFPFGHG 620
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSY+ F + +A +P +V + L Q + N T+ ++ A+ I C F I
Sbjct: 621 LSYSTFAHSLAQAPTTVSVSLASLQTIK--NSTIVSS----GAIRISHANCNTQPLGFHI 674
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
+V+N G MDGS ++++S PP + K+++ +E+V + AG +V F ++ CK L +V
Sbjct: 675 DVKNTGTMDGSHTLLLFSTPPPGTWSPNKRLLAFEKVHVGAGSQERVRFDVHVCKHLSVV 734
Query: 725 DNAANSLLASGAHTILVGE 743
D+ + G H +G+
Sbjct: 735 DHFGIHRIPMGEHHFHIGD 753
>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 782
Score = 741 bits (1912), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/753 (49%), Positives = 504/753 (66%), Gaps = 35/753 (4%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ +C A LP R +DL+ R+TL EK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 53 NLKFCRANLPIHVRVRDLISRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVG 112
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PG F PGATSFP VI T ASFN+SLW++IG+ VS EARAMYN G AGLT+
Sbjct: 113 ------PGVKFGGAFPGATSFPQVITTAASFNQSLWEQIGRVVSDEARAMYNGGLAGLTY 166
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+NV RDPRWGR ETPGEDP + G+YA +YVRGLQ G++ LK++AC
Sbjct: 167 WSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQSSTGLK---------LKVAAC 217
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKHY AYDLDNW G DR+HF++RV++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG P
Sbjct: 218 CKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKACVVEGKVASVMCSYNQVNGKP 277
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
TCADP LL TIRG W +GYIVSDCDS+ + ++ + T E+A A +KAGLDLDCG
Sbjct: 278 TCADPILLKNTIRGQWGLNGYIVSDCDSVGVLYDNQHY-TSTPEEAAAATIKAGLDLDCG 336
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
+ T AV++G + E D++ +L V MRLG FDG P Y NLG ++C P H
Sbjct: 337 PFLAIHTENAVKKGLLVEEDVNLALANTITVQMRLGMFDGEPSAHPYGNLGPRDVCTPAH 396
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
ELA EAARQGIVLL+N ALPL++ T+A++GP+++ T MIGNY G C+YTSP+
Sbjct: 397 QELALEAARQGIVLLENRGQALPLSSSRHHTIAVIGPNSDVTVTMIGNYAGIACKYTSPL 456
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y+K + + GC D+ C +N AA AA+ ADATV+V GLD S+EAE +DRV L
Sbjct: 457 QGISRYAKTL-HQNGCGDVACHSNQQFGAAEAAARQADATVLVMGLDQSIEAEFRDRVGL 515
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q EL+++VA A++GP LV+MS G +D++FAKN+P++ +ILW GYPG+ GG AIA
Sbjct: 516 LLPGHQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRVGAILWAGYPGQAGGAAIA 575
Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
DV+FG NPGG+LP+TWY Y+ K+P T+M +R P +PGRTY+F+ G VV+PFG+G
Sbjct: 576 DVLFGTTNPGGKLPMTWYPQGYLAKVPMTNMGMRPDPATGYPGRTYRFYKGNVVFPFGHG 635
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
+SYT F + + +PK V + + +N T+ + A+ + + C+ I
Sbjct: 636 MSYTSFSHSLTQAPKEVSLPI---TNLYALNTTISSK-----AIRVSHINCQT-SLGIDI 686
Query: 665 EVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
V+N G MDG+ ++V+S PP G + KQ+IG+E+V + AG +V ++ CK L
Sbjct: 687 NVKNTGTMDGTHTLLVFSSPPSGEKESSNKQLIGFEKVDLVAGSQIQVKIDIHVCKHLSA 746
Query: 724 VDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
VD + G H I +G+ +S LQ N+
Sbjct: 747 VDRFGIRRIPIGDHHIYIGDLKHSIS--LQANM 777
>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
Length = 774
Score = 739 bits (1908), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/743 (50%), Positives = 505/743 (67%), Gaps = 32/743 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
++ + +CD RA DLV R+TL +KV + + + RLG+P YEWWSEALHGVS+
Sbjct: 47 VAGYAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALARLGIPAYEWWSEALHGVSY 106
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F VPGATSFP ILT ASFN SL++ IG+ VS EARAM+N+G AGL
Sbjct: 107 VG------PGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSNEARAMHNVGLAGL 160
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + RYA+ YV GLQD D+D PLK++
Sbjct: 161 TFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQDAGA-----DADG-PLKVA 214
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF PF+ CV +G V+SVMCSYN+VNG
Sbjct: 215 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVIDGKVASVMCSYNKVNG 274
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCAD LL+ IRGDW +GYIVSDCDS+ ++ S + T E+A A +K+GLDL+
Sbjct: 275 KPTCADKDLLSGVIRGDWKLNGYIVSDCDSVD-VLYSQQHYTKTPEEAAAITIKSGLDLN 333
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CGD+ T+ AVQ G ++E+D+D ++ +I+LMRLG+FDG P+ Y +LG ++C
Sbjct: 334 CGDFLAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLGFFDGDPRKLAYGSLGPKDVCTS 393
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA E ARQGIVLLKND GALPL+ +IK++A++GP+ANA+ MIGNYEGTPC+YT+
Sbjct: 394 SNQELARETARQGIVLLKND-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 452
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
P+ G + Y PGC+++ C NS+ + AA AA +AD TV+V G D S+E E DR
Sbjct: 453 PLHGLGNNVATV-YQPGCSNVGCSGNSLQLSAATAAAASADVTVLVVGADQSIEREALDR 511
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
LLLPG Q +LI+ VA+A+KG V LV+MS G DI+FAK + KI +ILWVGYPGE GG
Sbjct: 512 TSLLLPGQQPDLISAVANASKGHVILVVMSGGPFDISFAKASDKISAILWVGYPGEAGGA 571
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPF 601
AIAD+IFGKYNP GRLP+TWY A++ K+P T M +RP N+ +PGRTY+F+ G V+ F
Sbjct: 572 AIADIIFGKYNPSGRLPVTWYPASFADKVPMTDMRMRPDNSTGYPGRTYRFYTGETVFAF 631
Query: 602 GYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
G GLSYT + VA+ P V ++L + C + CA+V C+ F
Sbjct: 632 GDGLSYTTMSHNLVAAPPSEVSMQLAEGHAC---------HTKECASVEAAGDHCEGMAF 682
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
++ V N G+M G+ V+++S PP + K ++G+E++ + GQ+ F ++ CK
Sbjct: 683 EVRLRVHNTGEMAGAHTVLLFSSPPAVHNAPAKHLLGFEKLNLEPGQAGVAAFKVDVCKD 742
Query: 721 LKIVDNAANSLLASGAHTILVGE 743
L +VD N +A G HT+ VG+
Sbjct: 743 LSVVDELGNRKVALGGHTLHVGD 765
>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 773
Score = 738 bits (1906), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/741 (49%), Positives = 496/741 (66%), Gaps = 31/741 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +C+ RA DLV R+TL EKV + D +PRLG+PLYEWWSEALHGVS+
Sbjct: 49 LASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYEWWSEALHGVSY 108
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F VPGATSFP ILT ASFN +L++ IG+ VS EARAM+N+G AGL
Sbjct: 109 VG------PGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNVGLAGL 162
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + +YA+ YV GLQ S + LK++
Sbjct: 163 TFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAV-------SGAGALKVA 215
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ FD+ V++QD+ +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 216 ACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQVNG 275
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCAD LL+ IRGDW +GYI SDCDS+ + + + T EDA A +KAGLDL+
Sbjct: 276 KPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPEDAAAISIKAGLDLN 334
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T+ AVQ GK++E+D+D ++ + LMRLG+FDG P+ + NLG +++C P
Sbjct: 335 CGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVCTP 394
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA EAARQGIVLLKN G LPL+ +IK++A++GP+ANA+ MIGNYEGTPC+YT+
Sbjct: 395 SNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 453
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
P+ G A + Y PGC ++ C NS+ + AA AA +AD TV+V G D S+E E DR
Sbjct: 454 PLQGLGANVATV-YQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESLDR 512
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
LLLPG Q +L++ VA+A+ GP LV+MS G DI+FAK++ KI +ILWVGYPGE GG
Sbjct: 513 TSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPGEAGGA 572
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
AIADV+FG +NP GRLP+TWY ++ K+P T M +R P +PGRTY+F+ G VY FG
Sbjct: 573 AIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFYTGDTVYAFG 632
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GLSYT F + + S+PK + ++L + C C +V + C+ F
Sbjct: 633 DGLSYTSFAHHLVSAPKQLALQLAEGHACL---------TEQCPSVEAEGAHCEGLAFDV 683
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V N G+ G V ++S PP + K ++G+E+V + GQ+ V F ++ CK L
Sbjct: 684 HLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLS 743
Query: 723 IVDNAANSLLASGAHTILVGE 743
+VD N +A G+HT+ VG+
Sbjct: 744 VVDELGNRKVALGSHTLHVGD 764
>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
Length = 772
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/737 (49%), Positives = 490/737 (66%), Gaps = 29/737 (3%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
F +C ++P R +DL+ R+TL EK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 44 FKFCRTRVPVHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG- 102
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PGT F PGATSFP VI T ASFN+SLW++IGQ VS EARAMYN G AGLT+W
Sbjct: 103 -----PGTKFGGAFPGATSFPQVITTAASFNQSLWQEIGQVVSDEARAMYNGGQAGLTYW 157
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
SPN+N+ RDPRWGR ETPGEDP + +YA +YV+GLQ D LK++ACC
Sbjct: 158 SPNVNIFRDPRWGRGQETPGEDPVLSAKYAASYVKGLQG--------DGAGNRLKVAACC 209
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KHY AYDLDNW G DRFHF++RV++QD+ +T+ +PF CV EG V+SVMCSYN+VNG PT
Sbjct: 210 KHYTAYDLDNWNGVDRFHFNARVSKQDLADTYDVPFRGCVLEGKVASVMCSYNQVNGKPT 269
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
CADP LL TIRG+W +GYIVSDCDS+ + + T E+A A +KAGLDLDCG
Sbjct: 270 CADPDLLKNTIRGEWKLNGYIVSDCDSVGVFYDQQHY-TRTPEEAAAEAIKAGLDLDCGP 328
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHI 368
+ T GA++ G + E D+D +L V MRLG FDG P QY NLG ++C P H
Sbjct: 329 FLAIHTEGAIKAGLLPEIDVDYALANTLTVQMRLGMFDGEPSAQQYGNLGPRDVCTPAHQ 388
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
ELA EA+RQGIVLL+N+ LPL+T +T+A+VGP+++ T+ MIGNY G C YT+P+
Sbjct: 389 ELALEASRQGIVLLQNNGHTLPLSTVRHRTVAVVGPNSDVTETMIGNYAGVACGYTTPLQ 448
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
G Y+K I + GC ++ C N + AA AA+ ADATV+V GLD S+EAE +DR DL+
Sbjct: 449 GIGRYTKTI-HQQGCTNVACTTNQLFGAAEAAARQADATVLVMGLDQSIEAEFRDRTDLV 507
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
+PG Q EL+++VA A++GP LV+MS G +D++FAKN+PKI +I+WVGYPG+ GG A+AD
Sbjct: 508 MPGHQQELVSRVARASRGPTVLVLMSGGPIDVSFAKNDPKIGAIIWVGYPGQAGGTAMAD 567
Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
V+FG NP G+LP+TWY +YV K+P T+M +R +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 568 VLFGTTNPSGKLPMTWYPQDYVSKVPMTNMAMRAGRGYPGRTYRFYKGPVVFPFGLGLSY 627
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP-CAAVLIDDVKCKDYKFTFQIEV 666
T F + +A P SV + L + + TN +AV + C + V
Sbjct: 628 TTFAHSLAQVPTSVSVPL--------TSLSATTNSTMLSSAVRVSHTNCNPLSLALHVVV 679
Query: 667 ENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDN 726
+N G DG+ ++V+S PP KQ++G+ +V I AG +V ++ CK L +VD
Sbjct: 680 KNTGARDGTHTLLVFSSPPSGKWAANKQLVGFHKVHIVAGSHKRVKVDVHVCKHLSVVDQ 739
Query: 727 AANSLLASGAHTILVGE 743
+ G H + +G+
Sbjct: 740 FGIRRIPIGEHKLQIGD 756
>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
Length = 774
Score = 735 bits (1898), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/736 (49%), Positives = 488/736 (66%), Gaps = 29/736 (3%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C ++P R +DL+ R+TL EK+ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 46 FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG--- 102
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F + + GATSFP VI T ASFNESLW++IG+ VS EARAMYN G AGLTFWSP
Sbjct: 103 ---PGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDEARAMYNGGAAGLTFWSP 158
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N+ RDPRWGR ETPGEDP + +Y YV+GLQ D LK++ACCKH
Sbjct: 159 NVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQG--------DGAGNRLKVAACCKH 210
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW G DRFHF++RV++QD+++T+ +PF CV +G+V+SVMCSYN+VNG PTCA
Sbjct: 211 YTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFRACVVDGNVASVMCSYNQVNGKPTCA 270
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP+LL TIRG W +GYIVSDCDS+ ++ + T E+A A +KAGLDLDCG +
Sbjct: 271 DPELLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEEAAAYAIKAGLDLDCGPFL 329
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
T AV+ G++ E DI+ +L V MRLG FDG P +Y NLG ++C P EL
Sbjct: 330 GIHTEAAVRFGQVNEIDINYALANTITVQMRLGMFDGEPSAQRYGNLGLADVCKPSSNEL 389
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAARQGIVLL+N +LPL+T +T+A++GP+++ T+ MIGNY G C YT+P+ G
Sbjct: 390 ALEAARQGIVLLENRGNSLPLSTMRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPLQGI 449
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y++ I+ A GC D+ C N +I AA AA+ ADATV+V GLD S+EAE +DR DLLLP
Sbjct: 450 ARYTRTIHQA-GCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTDLLLP 508
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL+++VA A++GP LVIMS G +D+ FAKN+P+I +I+WVGYPG+ GG AIADV+
Sbjct: 509 GHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPGQAGGTAIADVL 568
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
FG NP G+LP+TWY NYV +P T M +R P +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 569 FGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGLGLSY 628
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T+F + +A P V + + N T+ N + + C I+++
Sbjct: 629 TRFSHSLAQGPTLVSVPFTSLVASK--NTTMLGNHD----IRVSHTNCDSLSLDVHIDIK 682
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N G MDG+ ++V++ PP KQ++G+ +V I AG +V + CK L +VD
Sbjct: 683 NSGTMDGTHTLLVFATPPTGKWAPNKQLVGFHKVHIVAGSERRVRVGVQVCKHLSVVDEL 742
Query: 728 ANSLLASGAHTILVGE 743
+ G H + +G+
Sbjct: 743 GIRRIPLGQHKLEIGD 758
>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
Length = 774
Score = 734 bits (1895), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/737 (49%), Positives = 493/737 (66%), Gaps = 29/737 (3%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C A +P R +DL+ R+TL EK++ + + A VPRLG+ YEWWSEALHG+S +G
Sbjct: 49 FCRANVPIHVRVQDLLGRLTLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVG--- 105
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PG F PGATSFP VI T ASFN+SLW++IG+ VS EARAMYN G AGLT+WSP
Sbjct: 106 ---PGAKFGGAFPGATSFPQVITTAASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSP 162
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N++RDPRWGR ETPGEDP V +YA +YVRGLQ + LK++ACCKH
Sbjct: 163 NVNILRDPRWGRGQETPGEDPIVAAKYAASYVRGLQGT--------AAGNRLKVAACCKH 214
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW G DRFHF+++VT+QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 215 YTAYDLDNWNGVDRFHFNAKVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCA 274
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
D LL TIRG W +GYIVSDCDS+ + T E+A AR +KAGLDLDCG +
Sbjct: 275 DENLLKNTIRGQWRLNGYIVSDCDSVDVFFNQQHY-TSTPEEAAARSIKAGLDLDCGPFL 333
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAA 372
FT GAV++G + E DI+ +L V MRLG FDG+ Y NLG ++C P H LA
Sbjct: 334 AIFTEGAVKKGLLTENDINLALANTLTVQMRLGMFDGNLGPYANLGPRDVCTPAHKHLAL 393
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
EAA QGIVLLKN +LPL+ +T+A++GP+++ T+ MIGNY G C YTSP+ G
Sbjct: 394 EAAHQGIVLLKNSARSLPLSPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTSPLQGISR 453
Query: 433 YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
Y++ ++ A GCA + C+ N AA AA+ ADATV+V GLD S+EAE +DR LLLPG+
Sbjct: 454 YARTLHQA-GCAGVACKGNQGFGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGY 512
Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
Q +L+ +VA A++GPV LV+MS G +D+ FAKN+P++ +I+W GYPG+ GG AIA++IFG
Sbjct: 513 QQDLVTRVAQASRGPVILVLMSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFG 572
Query: 553 KYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFK 611
NPGG+LP+TWY +YV K+P T M +R N+PGRTY+F+ GPVV+PFG+GLSYT F
Sbjct: 573 AANPGGKLPMTWYPQDYVAKVPMTVMAMRASGNYPGRTYRFYKGPVVFPFGFGLSYTTFT 632
Query: 612 YKVASSP-KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY-KFTFQIEVENM 669
+ +A SP + + L ++N ++ + C + K +EV N
Sbjct: 633 HSLAKSPLAQLSVSLS------NLNSANTILNSSSHSIKVSHTNCNSFPKMPLHVEVSNT 686
Query: 670 GKMDGSEVVMVYSKPP--GIAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDN 726
G+ DG+ V V+++PP GI G + KQ+I +E+V + AG V ++ACK L +VD
Sbjct: 687 GEFDGTHTVFVFAEPPINGIKGLGVNKQLIAFEKVHVMAGAKQTVQVDVDACKHLGVVDE 746
Query: 727 AANSLLASGAHTILVGE 743
+ G H + +G+
Sbjct: 747 YGKRRIPMGEHKLHIGD 763
>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
Length = 769
Score = 732 bits (1890), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/717 (49%), Positives = 484/717 (67%), Gaps = 28/717 (3%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C +P R +DL+ R+TL EK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 42 FCRTSVPIHVRVQDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVG--- 98
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F PGATSFP VI T A+FNESLW++IG+ VS EARAMYN G AGLT+WSP
Sbjct: 99 ---PGTKFGGAFPGATSFPQVITTAAAFNESLWEEIGRVVSDEARAMYNGGMAGLTYWSP 155
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N+ RDPRWGR ETPGEDP + G+YA +YVR LQ ++ SR LK++ACCKH
Sbjct: 156 NVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQG--------NTGSR-LKVAACCKH 206
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW G DR+HF++RV++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 207 YTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCA 266
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP +L TIRG W GYIVSDCDS+ + + + T E+A A +KAGLDLDCG +
Sbjct: 267 DPDILKNTIRGQWRLDGYIVSDCDSVGVLYNTQHY-TRTPEEAAADAIKAGLDLDCGPFL 325
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIEL 370
T GAV+ G + E D++ + + V MRLG FDG P + NLG ++C P H +L
Sbjct: 326 AIHTEGAVRGGLLREEDVNLASAYTITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQL 385
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A +AA QGIVLLKN LPL+T T+A++GP+++ T MIGNY G C YT+P+ G
Sbjct: 386 ALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 445
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y+K I+ A GC + C N +I AA AA+ ADATV+V GLD S+EAE DR LLLP
Sbjct: 446 SRYAKTIHQA-GCLGVACNGNQLIGAAEVAARQADATVLVMGLDQSIEAEFIDRAGLLLP 504
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL+++VA A++GPV LV+M G VD++FAKN+P+I +ILWVGYPG+ GG AIADV+
Sbjct: 505 GRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVL 564
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG+ NPGG+LP+TWY +YV ++P T M +R +PGRTY+F+ GPVV+PFG+G+SYT
Sbjct: 565 FGRANPGGKLPMTWYPQDYVARLPMTDMRMRAGRGYPGRTYRFYKGPVVFPFGHGMSYTT 624
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD-YKFTFQIEVEN 668
F + ++ +P + + N T+ +N A+ + C D ++V+N
Sbjct: 625 FAHTLSKAPNQFSVPIATSLYAFK-NTTISSN-----AIRVAHTNCNDAMSLGLHVDVKN 678
Query: 669 MGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
G M G+ ++V++KPP + KQ+IG+++V + AG V ++ CK L +VD
Sbjct: 679 TGDMAGTHTLLVFAKPPAGNWSPNKQLIGFKKVHVTAGALQSVRLDIHVCKHLSVVD 735
>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 772
Score = 731 bits (1886), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/740 (49%), Positives = 487/740 (65%), Gaps = 29/740 (3%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS +P+C LP PER KDL+ R+TL EKV+ + + A VPRLG+ YEWWSEALHGVS
Sbjct: 38 LSRYPFCRVALPIPERVKDLIGRLTLQEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSN 97
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F + PGATSFP VI T ASFN SLW+ IG+ VS EARAMYN G AGL
Sbjct: 98 VG------PGTEFGGDFPGATSFPQVITTVASFNVSLWEAIGRVVSDEARAMYNGGAAGL 151
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP V G YA Y++GLQ +D LK++
Sbjct: 152 TYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGLQG---------NDGDRLKVA 202
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDLDNW G DRFHF+++VT QDM +TF +PF CV EG V+SVMCSYN+VNG
Sbjct: 203 ACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNG 262
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+PTCADP LL TIR W +GYIVSDCDS+ ++ + T E+A A +KAGLDLD
Sbjct: 263 VPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHY-TSTAEEAAADAIKAGLDLD 321
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T AV++G + + I+ +L V MRLG FDG+P Y LG N+C+P
Sbjct: 322 CGPFLAVHTEDAVKKGLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSP 381
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
H +LA +AARQGIVLLKN LPL+ + +T+A++GP+++ MIGNY G C Y +
Sbjct: 382 SHQQLALDAARQGIVLLKNRLPGLPLSADHHRTVAVIGPNSDVNVTMIGNYAGVACGYVT 441
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P++G Y+ V+ + GC ++ C + A+ AA ADATV+V GLD SVEAE KDR
Sbjct: 442 PLEGIKRYTTVV-HRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRD 500
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q EL+ KVA A++GP +++MS G +D++FA N+P+I +ILWVGYPG+ GG A
Sbjct: 501 GLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAA 560
Query: 546 IADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
IADV+FG NPGG+LP+TWY +Y+ +P T+M +R +++PGRTY+F+ GPVVY FG+G
Sbjct: 561 IADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNMAMRSTSSYPGRTYRFYAGPVVYEFGHG 620
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSYT F + + +P V I L +Q T + A+ + KC+ +
Sbjct: 621 LSYTNFIHTIVKAPTIVSISLSGHRQ------THSASTLSSKAIRVTHAKCQKLSLVIHV 674
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+VEN G DG ++V+S PP T + KQ++ +E++ +A+ + ++ ++ CK L
Sbjct: 675 DVENKGDRDGFHTMLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHVCKYLS 734
Query: 723 IVDNAANSLLASGAHTILVG 742
+VD + G H I +G
Sbjct: 735 VVDKLGVRRIPLGDHYIHIG 754
>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 729 bits (1882), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/739 (49%), Positives = 490/739 (66%), Gaps = 34/739 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C +P R +DL+ R+TL EK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 46 FCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG--- 102
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F PGATSFP VI T ASFNESLW++IG+ V EARAMYN G AGLT+WSP
Sbjct: 103 ---PGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDEARAMYNGGMAGLTYWSP 159
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N+ RDPRWGR ETPGEDP + +YA YV+GLQ D LK++ACCKH
Sbjct: 160 NVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG--------DGAGNRLKVAACCKH 211
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW G +RFHF++RV++QD+ +T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 212 YTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQVNGKPTCA 271
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL TIRG W +GYIVSDCDS+ + E + T E+A A +KAGLDLDCG +
Sbjct: 272 DPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHY-TRTPEEAAADAIKAGLDLDCGPFL 330
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
T AV++G +++ +I+ +L V MRLG FDG P QY NLG ++C P H +L
Sbjct: 331 AIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVCTPAHQQL 390
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAARQGIVLL+N +LPL+ +T+A++GP+++ T MIGNY G C YT+P+ G
Sbjct: 391 ALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 450
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y++ I+ A GC D+ C N + AA AA+ ADATV+V GLD S+EAE DRV LLLP
Sbjct: 451 GRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRVGLLLP 509
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL+++VA A++GP LV+MS G +D+ FAKN+P+I +I+WVGYPG+ GG AIADV+
Sbjct: 510 GHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAIADVL 569
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
FG NPGG+LP+TWY NYV +P T M +R P +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 570 FGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRGPVVFPFGLGLSY 629
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRD---INYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
T F + +A P SV + L + + ++ V + C A+ DV +
Sbjct: 630 TTFAHNLAHGPTSVSVPLTSLKATANSTMLSKAVRVSHADCNALSPLDV---------HV 680
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
+V+N G MDG+ ++V++ PP KQ++G+ ++ IAAG +V ++ CK L +V
Sbjct: 681 DVKNTGSMDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVV 740
Query: 725 DNAANSLLASGAHTILVGE 743
D + G H + +G+
Sbjct: 741 DRFGIRRIPLGEHKLQIGD 759
>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
Length = 784
Score = 729 bits (1882), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/751 (47%), Positives = 487/751 (64%), Gaps = 24/751 (3%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S L FP+CD KL R +DLV R+TL EKV +M + A G+PRLG+P Y+WW EAL
Sbjct: 41 SSNASLGSFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVPSYQWWQEAL 100
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ S PG F P ATSFP I T ASFN +L+ IG+ VS+EARA++NL
Sbjct: 101 HGVA-------SSPGVQFGGLAPAATSFPMPIATAASFNSTLFYSIGEAVSSEARALHNL 153
Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
G AGLTFWSPN+N+ RDPRWGR ETPGEDP + ++A YVRGLQ G Y +
Sbjct: 154 GRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQ---GGAYEGSASDG 210
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK+SACCKH AYD+DNW+G DR+HF++ V+EQD+ +T+ PF+ C+ +G VSSVMCSY
Sbjct: 211 FLKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVMCSY 270
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRVNG+PTCAD LL +T+R W F+GYIVSDCD++Q + E + + EDAVA + A
Sbjct: 271 NRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYA-PSAEDAVADSILA 329
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKN 360
GLDL+CG + A+Q GKI EAD+D ++ L MRLG FDG P Y +LG
Sbjct: 330 GLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDPNSQPYSSLGAT 389
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+IC+ H +LA +AA QG+VLLKND G+LPL+T +KT+AL+GP+ANAT M+GNYEG P
Sbjct: 390 DICSNDHQQLALDAALQGVVLLKND-GSLPLSTA-LKTVALIGPNANATYTMLGNYEGIP 447
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C+Y SP+ G YS I Y+PGC ++ C ++ +A++ A ADA V+V GLD S E E
Sbjct: 448 CKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVLVVGLDQSQERE 507
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DR LLLPG Q++L++ +A+A P+ LVIMSAG VDI+ K+N +I S++W+GYPG+
Sbjct: 508 TFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRISSVIWLGYPGQ 567
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
GG A+A V+FG YNPGGRLP TWY + + M +R P++ +PGR+Y+F+ G +
Sbjct: 568 SGGAALAHVVFGAYNPGGRLPNTWYHEEFTNVSMLDMQMRPNPLSGYPGRSYRFYTGTPL 627
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDI---KLDKDQQCRDINYTVGTNKPPCAAVLIDDVK- 654
Y FG GLSY+ + YK +P + + C +N + K C + DD++
Sbjct: 628 YNFGDGLSYSTYFYKFLLAPTKLSFFKSNTGNSRGCPAVNRSKA--KSGCFHLPADDLET 685
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
C F +EV N+G GS V+++S PP + G +KQ+I +++V + + + ++ F
Sbjct: 686 CNSILFQVSVEVSNLGPRSGSHSVLIFSAPPPVEGAPLKQLIAFQKVHLESDTTQRLIFG 745
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGEGV 745
++ CK L V L SG H +L+G V
Sbjct: 746 IDPCKHLSSVRRNGKRFLHSGRHKLLIGNAV 776
>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 729 bits (1881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/736 (49%), Positives = 489/736 (66%), Gaps = 29/736 (3%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C ++P R +DL+ R+TL EK+ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 46 FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG--- 102
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F + + GATSFP VI T ASFNESLW++IG+ VS EARAMYN G AGLTFWSP
Sbjct: 103 ---PGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDEARAMYNGGAAGLTFWSP 158
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N+ RDPRWGR ETPGEDP + +Y YV+GLQ D LK++ACCKH
Sbjct: 159 NVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG--------DGAGNRLKVAACCKH 210
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW G DRFHF++RV++QD+++T+ +PF+ CV +G+V+SVMCSYN+VNG PTCA
Sbjct: 211 YTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVMCSYNQVNGKPTCA 270
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL TIRG W +GYIVSDCDS+ ++ + T E A A +KAGLDLDCG +
Sbjct: 271 DPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEAAAAYAIKAGLDLDCGPFL 329
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
T A++ G++ E DI+ +L V MRLG FDG P +Y NLG ++C P EL
Sbjct: 330 GIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNEL 389
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAARQGIVLL+N +LPL+T +T+A++GP+++ T+ MIGNY G C YT+P+ G
Sbjct: 390 ALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPLQGI 449
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y++ I+ A GC D+ C N +I AA AA+ ADATV+V GLD S+EAE +DR LLLP
Sbjct: 450 ARYTRTIHQA-GCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTGLLLP 508
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL+++VA A++GP LVIMS G +D+ FAKN+P+I +I+WVGYPG+ GG AIADV+
Sbjct: 509 GHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYPGQAGGTAIADVL 568
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
FG NP G+LP+TWY NYV +P T M +R P +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 569 FGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGMGLSY 628
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T+F + +A P V + L + N T+ +N V + C F I+++
Sbjct: 629 TRFSHSLAQGPTLVSVPLTSLVAAK--NTTMLSNH----GVRVSHTNCDSLSLDFHIDIK 682
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N G MDG+ ++V++ P KQ++G+ +V I AG +V ++ CK L IVD
Sbjct: 683 NTGTMDGTHTLLVFATQPAGKWAPNKQLVGFHKVHIVAGSERRVRVGVHVCKHLSIVDKL 742
Query: 728 ANSLLASGAHTILVGE 743
+ G H + +G+
Sbjct: 743 GIRRIPLGQHKLEIGD 758
>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 729 bits (1881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/739 (49%), Positives = 490/739 (66%), Gaps = 34/739 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C +P R +DL+ R+TL EK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 46 FCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG--- 102
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F PGATSFP VI T ASFNESLW++IG+ V EARAMYN G AGLT+WSP
Sbjct: 103 ---PGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDEARAMYNGGMAGLTYWSP 159
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N+ RDPRWGR ETPGEDP + +YA YV+GLQ D LK++ACCKH
Sbjct: 160 NVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG--------DGAGNRLKVAACCKH 211
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW G +RFHF++RV++QD+ +T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 212 YTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVMCSYNQVNGKPTCA 271
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL TIRG W +GYIVSDCDS+ + E + T E+A A +KAGLDLDCG +
Sbjct: 272 DPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHY-TRTPEEAAADAIKAGLDLDCGPFL 330
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
T AV++G +++ +I+ +L V MRLG FDG P QY NLG ++C P H +L
Sbjct: 331 AIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLGPRDVCTPAHQQL 390
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAARQGIVLL+N +LPL+ +T+A++GP+++ T MIGNY G C YT+P+ G
Sbjct: 391 ALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGI 450
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y++ I+ A GC D+ C N + AA AA+ ADATV+V GLD S+EAE DRV LLLP
Sbjct: 451 GRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIEAEFVDRVGLLLP 509
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL+++VA A++GP LV+MS G +D+ FAKN+P+I +I+WVGYPG+ GG AIADV+
Sbjct: 510 GHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYPGQAGGTAIADVL 569
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
FG NPGG+LP+TWY NYV +P T M +R P +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 570 FGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRFYRGPVVFPFGLGLSY 629
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRD---INYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
T F + +A P SV + L + + ++ V + C A+ DV +
Sbjct: 630 TTFAHNLAHGPTSVSVPLTSLKATANSTMLSKAVRVSHADCNALSPLDV---------HV 680
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
+V+N G MDG+ ++V++ PP KQ++G+ ++ IAAG +V ++ CK L +V
Sbjct: 681 DVKNTGSMDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVV 740
Query: 725 DNAANSLLASGAHTILVGE 743
D + G H + +G+
Sbjct: 741 DRFGIRRIPLGEHKLQIGD 759
>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 728 bits (1878), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/743 (49%), Positives = 506/743 (68%), Gaps = 30/743 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +C+ K RA+DLV R+TL EKV + + + RLG+P YEWWSEALHGVS+
Sbjct: 46 LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWWSEALHGVSY 105
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F VPGATSFP ILT ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 106 VG------PGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGLAGL 159
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + +YA+ YV GLQD G D LK++
Sbjct: 160 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA-GAGGVTDG---ALKVA 215
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 216 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNG 275
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCAD LL IRGDW +GYIVSDCDS+ ++ + + T E+A A +K+GLDL+
Sbjct: 276 KPTCADKDLLEGVIRGDWKLNGYIVSDCDSVD-VLYTQQHYTKTPEEAAAITIKSGLDLN 334
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG++ T+ AVQ G+++E D+D ++ +I+LMRLG+FDG P+ + +LG ++C
Sbjct: 335 CGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTS 394
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA E ARQGIVLLKN +GALPL+ +IK++A++GP+ANA+ MIGNYEGTPC+YT+
Sbjct: 395 SNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 453
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
P+ G A + Y PGC ++ C NS+ + A+ AA +AD TV+V G D S+E E DR
Sbjct: 454 PLQGLGAKVNTV-YQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDR 512
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
LLLPG QT+L++ VA+A+ GPV LV+MS G DI+FAK + KI +ILWVGYPGE GG
Sbjct: 513 TSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGA 572
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
A+AD++FG +NP GRLP+TWY A+Y + T M +RP +PGRTY+F+ G V+ F
Sbjct: 573 ALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAF 632
Query: 602 GYGLSYTQFKYKVASSPKS-VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
G GLSYT+ + + S+P S V ++L +D CR CA+V C D F
Sbjct: 633 GDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASVEAAGDHCDDLAF 683
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
+++V N G++ G+ V+++S PP K ++G+E+V +A G++ V F ++ C+
Sbjct: 684 DVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRD 743
Query: 721 LKIVDNAANSLLASGAHTILVGE 743
L +VD +A G HT+ VG+
Sbjct: 744 LSVVDELGGRKVALGGHTLHVGD 766
>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 772
Score = 728 bits (1878), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/740 (48%), Positives = 486/740 (65%), Gaps = 29/740 (3%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS +P+C LP PER KDL+ R+TL EKV+ + + A VPRLG+ YEWWSEALHGVS
Sbjct: 38 LSRYPFCRVALPIPERVKDLIGRLTLQEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSN 97
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F + PGATSFP VI T ASFN SLW+ IG+ VS EARAMYN G AGL
Sbjct: 98 VG------PGTEFGGDFPGATSFPQVITTVASFNVSLWEAIGRVVSDEARAMYNGGAAGL 151
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP V G YA Y++GLQ +D LK++
Sbjct: 152 TYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAARYIKGLQG---------NDGDRLKVA 202
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDLDNW G DRFHF+++VT QDM +TF +PF CV EG V+SVMCSYN+VNG
Sbjct: 203 ACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNG 262
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+PTCADP LL TIR W +GYIVSDCDS+ ++ + T E+A A +KAGLDLD
Sbjct: 263 VPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDNQHY-TSTAEEAAADAIKAGLDLD 321
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T AV++ + + I+ +L V MRLG FDG+P Y LG N+C+P
Sbjct: 322 CGPFLAVHTEDAVKKXLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSP 381
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
H +LA +AARQGIVLLKN LPL+ + +T+A++GP+++ MIGNY G C Y +
Sbjct: 382 SHQQLALDAARQGIVLLKNRLPGLPLSAXHHRTVAVIGPNSDVNVTMIGNYAGVACGYVT 441
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P++G Y+ V+ + GC ++ C + A+ AA ADATV+V GLD SVEAE KDR
Sbjct: 442 PLEGIKRYTTVV-HRKGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRD 500
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q EL+ KVA A++GP +++MS G +D++FA N+P+I +ILWVGYPG+ GG A
Sbjct: 501 GLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAA 560
Query: 546 IADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
IADV+FG NPGG+LP+TWY +Y+ +P T+M +R +++PGRTY+F+ GPVVY FG+G
Sbjct: 561 IADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNMAMRSTSSYPGRTYRFYAGPVVYEFGHG 620
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSYT F + + +P V I L +Q T + A+ + KC+ +
Sbjct: 621 LSYTNFIHTIVKAPTIVSISLSGHRQ------THSASTLSSKAIRVTHAKCQKLSLVIHV 674
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+VEN G DG ++V+S PP T + KQ++ +E++ +A+ + ++ ++ CK L
Sbjct: 675 DVENKGDRDGFHTMLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHVCKYLS 734
Query: 723 IVDNAANSLLASGAHTILVG 742
+VD + G H I +G
Sbjct: 735 VVDKLGVRRIPLGDHYIHIG 754
>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
Length = 767
Score = 727 bits (1877), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/736 (50%), Positives = 495/736 (67%), Gaps = 28/736 (3%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C LP R +DL+ R+ L EKV+ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 40 FCRVNLPIRARVQDLIGRLNLQEKVKLLVNNAAPVPRLGISGYEWWSEALHGVSNVG--- 96
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F P ATSFP VI T ASFN SLW+ IGQ VS EARAMYN G AGLT+WSP
Sbjct: 97 ---PGTKFRGAFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSP 153
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N+ RDPRWGR ETPGEDP + +YA +YVRGLQ + Y+++ LK++ACCKH
Sbjct: 154 NVNIFRDPRWGRGQETPGEDPTLASQYAASYVRGLQGI----YNKNR----LKVAACCKH 205
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW DRFHF+++V++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 206 YTAYDLDNWNAVDRFHFNAKVSKQDLEDTYNVPFKGCVQEGRVASVMCSYNQVNGKPTCA 265
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL TIRG W +GYIVSDCDS+ + + + T E+A A +KAGLDLDCG +
Sbjct: 266 DPDLLRNTIRGQWRLNGYIVSDCDSVGVLYDDQHY-TRTPEEAAADTIKAGLDLDCGPFL 324
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIEL 370
T AV++G + EAD++ +L + V MRLG FDG + + +LG ++C+P H +L
Sbjct: 325 AVHTEAAVKRGLLTEADVNQALTNTFTVQMRLGMFDGEAAAQPFGHLGPKDVCSPAHQDL 384
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A +AARQGIVLL+N +LPL+T + +A++GP+A+AT MIGNY G C YTSP+ G
Sbjct: 385 ALQAARQGIVLLQNRGRSLPLSTARHRNIAVIGPNADATVTMIGNYAGVACGYTSPLQGI 444
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y+K ++ A GC + C +N AA AA +ADATV+V GLD S+EAE +DR +LLP
Sbjct: 445 ARYAKTVHQA-GCIGVACTSNQQFGAATAAAAHADATVLVMGLDQSIEAEFRDRASVLLP 503
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL++KVA A++GP LV+M G VD+ FAKN+PKI +ILWVGYPG+ GG AIADV+
Sbjct: 504 GHQQELVSKVALASRGPTILVLMCGGPVDVTFAKNDPKISAILWVGYPGQAGGTAIADVL 563
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
FG NPGG+LP TWY +YV K+P T + +R P N +PGRTY+F+ GPVV+PFG+GLSY
Sbjct: 564 FGTTNPGGKLPNTWYPQSYVAKVPMTDLAMRANPSNGYPGRTYRFYKGPVVFPFGFGLSY 623
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T+F +A +P V + L Q + N T NK A+ + C + + I+V+
Sbjct: 624 TRFTQSLAHAPTKVMVPL--ANQFTNSNIT-SFNKD---ALKVLHTNCDNIPLSLHIDVK 677
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N GK+DGS ++V+S PP + KQ+IG++RV + AG +V ++ C L D
Sbjct: 678 NKGKVDGSHTILVFSTPPKGTKSSEKQLIGFKRVHVFAGSKQRVRMNIHVCNHLSRADEF 737
Query: 728 ANSLLASGAHTILVGE 743
+ G HT+ +G+
Sbjct: 738 GVRRIPIGEHTLHIGD 753
>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
Length = 774
Score = 726 bits (1875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/737 (49%), Positives = 491/737 (66%), Gaps = 29/737 (3%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C +P R +DL+ R+TL EK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 49 FCRVNVPIHVRVQDLIGRLTLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGVSDVG--- 105
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PG+ F PGATSFP VI T ASFN+SLW++IG+ VS EARAMYN G AGLT+WSP
Sbjct: 106 ---PGSKFGGAFPGATSFPQVITTAASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSP 162
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N++RDPRWGR ETPGEDP V +YA +YVRGLQ + LK++ACCKH
Sbjct: 163 NVNILRDPRWGRGQETPGEDPIVAAKYAASYVRGLQGT--------AAGNRLKVAACCKH 214
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW G DRFHF+++VT+QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 215 YTAYDLDNWNGVDRFHFNAKVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCA 274
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
D LL TIRG W +GYIVSDCDS+ + T E+A A +KAGLDLDCG +
Sbjct: 275 DENLLKNTIRGKWRLNGYIVSDCDSVDVFFNQQHY-TSTPEEAAAASIKAGLDLDCGPFL 333
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAA 372
FT GAV++G + E DI+ +L V MRLG FDG+ Y NLG ++C+ H LA
Sbjct: 334 AIFTEGAVKKGLLTENDINLALANTLTVQMRLGMFDGNLGPYANLGPRDVCSLAHKHLAL 393
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
EAA QGIVLLKN +LPL+ +T+A++GP+++ T+ MIGNY G C YT+P+ G
Sbjct: 394 EAAHQGIVLLKNSGRSLPLSPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTTPLQGISR 453
Query: 433 YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
Y++ ++ A GCA + C+ N AA AA+ ADATV+V GLD S+EAE +DR LLLPG+
Sbjct: 454 YARTLHQA-GCAGVACKGNQGFGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGY 512
Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
Q +L+ +VA A++GPV LV+MS G +D+ FAKN+P++ +I+W GYPG+ GG AIA++IFG
Sbjct: 513 QQDLVTRVAQASRGPVILVLMSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFG 572
Query: 553 KYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFK 611
NPGG+LP+TWY +YV K+P T M +R N+PGRTY+F+ GPVV+PFG+GLSYT F
Sbjct: 573 AANPGGKLPMTWYPQDYVAKVPMTVMAMRASGNYPGRTYRFYKGPVVFPFGFGLSYTTFT 632
Query: 612 YKVASSP-KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY-KFTFQIEVENM 669
+A SP + + L ++N ++ + C + K +EV N
Sbjct: 633 NSLAKSPLAQLSVSLS------NLNSANAILNSTSHSIKVSHTNCNSFPKMPLHVEVSNT 686
Query: 670 GKMDGSEVVMVYSKPP--GIAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDN 726
G+ DG+ V V+++PP GI G + KQ+I +E+V + AG V ++ACK L +VD
Sbjct: 687 GEFDGTHTVFVFAEPPKNGIKGLGVNKQLIAFEKVHVMAGAKQTVRVDVDACKHLGVVDE 746
Query: 727 AANSLLASGAHTILVGE 743
+ G H + +G+
Sbjct: 747 YGKRRIPMGKHKLHIGD 763
>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 726 bits (1875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/736 (49%), Positives = 488/736 (66%), Gaps = 29/736 (3%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C ++P R +DL+ R+TL EK+ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 46 FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGYEWWSEALHGVSNVG--- 102
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F + + GATSFP VI T ASFNESLW++IG+ VS EARAMYN G AGLTFWSP
Sbjct: 103 ---PGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDEARAMYNGGAAGLTFWSP 158
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N+ RDPRWGR ETPGEDP + +Y YV+GLQ D LK++ACCKH
Sbjct: 159 NVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG--------DGAGNRLKVAACCKH 210
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW G DRFHF++RV++QD+++T+ +PF+ CV +G+V+SVMCSYN+VNG PTCA
Sbjct: 211 YTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVMCSYNQVNGKPTCA 270
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL TIRG W +GYIVSDCDS+ ++ + T E A A +KAGLDLDCG +
Sbjct: 271 DPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEAAAAYAIKAGLDLDCGPFL 329
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
T A++ G++ E DI+ +L V MRLG FDG P +Y NLG ++C P EL
Sbjct: 330 GIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNEL 389
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAARQGIVLL+N +LPL+T +T+A++GP+++ T+ MIGNY G C YT+P+ G
Sbjct: 390 ALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGIACGYTTPLQGI 449
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y++ I+ A GC D+ C N +I AA AA+ ADATV+V GLD S+EAE +DR LLLP
Sbjct: 450 ARYTRTIHQA-GCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTGLLLP 508
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL+++VA A++GP LVIMS G +D+ FAKN+P I +I+WVGYPG+ GG AIADV+
Sbjct: 509 GHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPCIGAIIWVGYPGQAGGTAIADVL 568
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
FG NP G+LP+TWY NYV +P T M +R P +PGRTY+F+ GPVV+PFG GLSY
Sbjct: 569 FGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYRFYKGPVVFPFGMGLSY 628
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T+F + +A P V + L + N T+ +N V + C F I+++
Sbjct: 629 TRFSHSLAQGPTLVSVPLTSLVAAK--NTTMLSNH----GVRVSHTNCDSLSLDFHIDIK 682
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N G MDG+ ++V++ P KQ++G+ +V I AG +V ++ CK L IVD
Sbjct: 683 NTGTMDGTHTLLVFATQPAGKWAPNKQLVGFHKVHIVAGSERRVRVGVHVCKHLSIVDKL 742
Query: 728 ANSLLASGAHTILVGE 743
+ G H + +G+
Sbjct: 743 GIRRIPLGQHKLEIGD 758
>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 726 bits (1875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/743 (49%), Positives = 506/743 (68%), Gaps = 30/743 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +C+ K RA+DLV R+TL EKV + + + RLG+P YEWWSEALHGVS+
Sbjct: 46 LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWWSEALHGVSY 105
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F VPGATSFP ILT ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 106 VG------PGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGLAGL 159
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + +YA+ YV GLQD G D LK++
Sbjct: 160 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA-GAGGVTDG---ALKVA 215
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 216 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNG 275
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCAD LL IRGDW +GYIVSDCDS+ ++ + + T E+A A +K+GLDL+
Sbjct: 276 KPTCADKDLLEGVIRGDWKLNGYIVSDCDSVD-VLYTQQHYTKTPEEAAAITIKSGLDLN 334
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG++ T+ AVQ G+++E D+D ++ +I+LMRLG+FDG P+ + +LG ++C
Sbjct: 335 CGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTS 394
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA E ARQGIVLLKN +GALPL+ +IK++A++GP+ANA+ MIGNYEGTPC+YT+
Sbjct: 395 SNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 453
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
P+ G A + Y PGC ++ C NS+ + A+ AA +AD TV+V G D S+E E DR
Sbjct: 454 PLQGLGAKVNTV-YQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDR 512
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
LLLPG QT+L++ VA+A+ GPV LV+MS G DI+FAK + KI +ILWVGYPGE GG
Sbjct: 513 TSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGA 572
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
A+AD++FG +NP G+LP+TWY A+Y + T M +RP +PGRTY+F+ G V+ F
Sbjct: 573 ALADILFGSHNPSGKLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAF 632
Query: 602 GYGLSYTQFKYKVASSPKS-VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
G GLSYT+ + + S+P S V ++L +D CR CA+V C D F
Sbjct: 633 GDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASVEAAGDHCDDLAF 683
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
+++V N G++ G+ V+++S PP K ++G+E+V +A G++ V F ++ C+
Sbjct: 684 DVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRD 743
Query: 721 LKIVDNAANSLLASGAHTILVGE 743
L +VD +A G HT+ VG+
Sbjct: 744 LSVVDELGGRKVALGGHTLHVGD 766
>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
Length = 773
Score = 726 bits (1874), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/742 (49%), Positives = 491/742 (66%), Gaps = 28/742 (3%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ +C+A L R DLV R+TL EK+ + A GV RLG+P Y+WWSEALHGVS
Sbjct: 44 LAGLRFCNAGLSIKARVTDLVGRLTLEEKIGFLTSKAIGVSRLGIPSYKWWSEALHGVSN 103
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G G+ F +VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G+AGL
Sbjct: 104 VG------GGSRFTGQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGL 157
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPN+N+ RDPRWGR ETPGEDP + +YA+ YV+GLQ+ +G + +R LK++
Sbjct: 158 TFWSPNVNIFRDPRWGRGQETPGEDPTLSSKYAVAYVKGLQETDGGDPNR------LKVA 211
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW +R F++ V +QD+ +TF PF+ CV +G V+SVMCSYN+VNG
Sbjct: 212 ACCKHYTAYDIDNWRNVNRLTFNAVVNQQDLADTFQPPFKSCVVDGHVASVMCSYNQVNG 271
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL+ IRG W +GYIVSDCDS+ + + T E+AVA+ L AGLDL+
Sbjct: 272 KPTCADPDLLSGVIRGQWQLNGYIVSDCDSVDVLFRKQHYAK-TPEEAVAKSLLAGLDLN 330
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
C + MGAV+ G + E ID ++ + LMRLG+FDG P+ Y LG ++C
Sbjct: 331 CDHFNGQHAMGAVKAGLVNETAIDKAISNNFATLMRLGFFDGDPKKQLYGGLGPKDVCTA 390
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA + ARQGIVLLKN G+LPL+ IKTLA++GP+ANAT+ MIGNY G PC+YT+
Sbjct: 391 DNQELARDGARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNYHGVPCKYTT 450
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G A + Y GC ++ C + + I +A+D A +ADA V+V G D S+E EG DRV
Sbjct: 451 PLQGL-AETVSSTYQLGC-NVACVD-ADIGSAVDLAASADAVVLVVGADQSIEREGHDRV 507
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL LPG Q EL+ +VA AA+GPV LVIMS G DI FAKN+ KI SI+WVGYPGE GG A
Sbjct: 508 DLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAKNDKKITSIMWVGYPGEAGGLA 567
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
IADVIFG++NP G LP+TWY +YV K+P ++M +RP +PGR+Y+F+ G VY F
Sbjct: 568 IADVIFGRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDKSKGYPGRSYRFYTGETVYAFA 627
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKPPCAAVLIDDVKCKDYKFT 661
L+YT+F +++ +P+ V + LD++ CR ++ P C ++ F
Sbjct: 628 DALTYTKFDHQLIKAPRLVSLSLDENHPCRSSECQSLDAIGPHC-----ENAVEGGSDFE 682
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
+ V+N G GS V +++ P + G+ IKQ++G+E++ + + A V F +N CK L
Sbjct: 683 VHLNVKNTGDRAGSHTVFLFTTSPQVHGSPIKQLLGFEKIRLGKSEEAVVRFNVNVCKDL 742
Query: 722 KIVDNAANSLLASGAHTILVGE 743
+VD +A G H + VG
Sbjct: 743 SVVDETGKRKIALGHHLLHVGS 764
>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
Length = 785
Score = 723 bits (1866), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/750 (47%), Positives = 486/750 (64%), Gaps = 22/750 (2%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S L FP+CD KL R +DLV R+TL EKV +M + A G+PRLG+P Y+WW EAL
Sbjct: 42 SSNASLGSFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVPSYQWWQEAL 101
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ S PG F P ATSFP I ASFN +L+ IG+ VS+EARA++NL
Sbjct: 102 HGVA-------SSPGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEAVSSEARALHNL 154
Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
G AGLTFWSPN+N+ RDPRWGR ETPGEDP + ++A YVRGLQ G Y +
Sbjct: 155 GRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQ---GGAYGGSASDG 211
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK+SACCKH AYD+DNW+G DR+HF++ V+EQD+ +T+ PF+ C+ +G VSSVMCSY
Sbjct: 212 FLKVSACCKHLTAYDMDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVMCSY 271
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRVNG+PTCAD LL +T+R W F+GYIVSDCD++Q + E + + EDAVA + A
Sbjct: 272 NRVNGVPTCADRSLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYA-PSAEDAVADSILA 330
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKN 360
GLDL+CG + A+Q GK+ EAD+D ++ L MRLG FDG + Y +LG
Sbjct: 331 GLDLNCGTFLGKHAKSALQAGKVTEADLDHAISNLMRTRMRLGLFDGDLNTRPYSSLGAT 390
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+IC+ H +LA +AA QG+VLLKND G+LPL+T +KT+AL+GP+ANAT M+GNYEG P
Sbjct: 391 DICSNDHQQLALDAALQGVVLLKND-GSLPLSTA-LKTVALIGPNANATYTMLGNYEGIP 448
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C+Y SP+ G Y+ I Y+PGC D+ C ++ +A++ A ADA V+V GLD S E E
Sbjct: 449 CKYVSPLQGMQIYNNNILYSPGCRDVACSEGDLVASAVEVATKADAVVLVVGLDQSQERE 508
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DR LLLPG Q++L++ +A+A P+ LVIMSAG VDI+ K+N +I S++W+GYPG+
Sbjct: 509 TFDRTSLLLPGMQSQLVSNIANAVTCPIVLVIMSAGPVDISTFKDNSRISSVIWIGYPGQ 568
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
GG A+A V+FG YNPGGRLP TWY + + M +R P + +PGR+Y+F+ G +
Sbjct: 569 SGGAALAHVVFGAYNPGGRLPNTWYHEEFTNVSMLDMRMRPNPPSGYPGRSYRFYTGTPL 628
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP--CAAVLIDDVK-C 655
Y FG GLSY+ + YK +P + + RD TV ++ C + DD++ C
Sbjct: 629 YNFGDGLSYSTYLYKFLLAPTRLSFFKSNTRNSRDCP-TVNRSEAEFGCFHLPADDLETC 687
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
F +EV N+G GS V+++S PP + G +KQ+I +++V + + + ++ F +
Sbjct: 688 NSILFQVSVEVSNLGPRSGSHSVLIFSAPPPVEGAPLKQLIAFQKVHLESDTTQRLIFGI 747
Query: 716 NACKSLKIVDNAANSLLASGAHTILVGEGV 745
+ CK L V L SG H +L+G V
Sbjct: 748 DPCKHLSSVRRNGKRFLHSGRHKLLIGNAV 777
>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
Length = 771
Score = 722 bits (1864), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/739 (49%), Positives = 486/739 (65%), Gaps = 22/739 (2%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
+ + +C LP R +DL+ R+TL EK++ + + A V RLG+ YEWWSEALHGVS
Sbjct: 34 IRNLRFCKTSLPIHVRVQDLIARLTLQEKIRLLVNNAAPVQRLGISGYEWWSEALHGVS- 92
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
N+ G F PGATSFP VI T ASFN SLW++IG+ VS E RAMYN G AGL
Sbjct: 93 -----NTGYGVKFGGAFPGATSFPQVITTAASFNASLWEEIGRVVSEEGRAMYNGGAAGL 147
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPN+N+ RDPRWGR ETPGEDP++V +Y ++YV+GLQ G R LK++
Sbjct: 148 TFWSPNVNIFRDPRWGRGQETPGEDPHLVAQYGVSYVKGLQGGGGRGNTR------LKVA 201
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDLD+W G DR+HF+++V+ QD+++T+ PF+ CV EG+V+SVMCSYN++NG
Sbjct: 202 ACCKHYTAYDLDDWNGYDRYHFNAKVSMQDLEDTYNAPFKACVVEGNVASVMCSYNQING 261
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
P+CADP LL TIR W+ +GYIVSDCDS+ + E + EDA A +KAGLDLD
Sbjct: 262 KPSCADPTLLRDTIRNQWHLNGYIVSDCDSVGVLFEKQHYTR-YPEDAAAITIKAGLDLD 320
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNPQH 367
CG + T AV GK+++ +I+ +L V MRLG FDG + Y NLG ++C+P H
Sbjct: 321 CGPFLAIHTDKAVHTGKVSQVEINNALANTITVQMRLGMFDGPNGPYANLGPKDVCSPAH 380
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
+LA +AAR+GIVLLKN ALPL+T +T+A++GP+++AT AMIGNY G PC Y SP+
Sbjct: 381 QQLALQAAREGIVLLKNIGQALPLSTKRHRTVAVIGPNSDATLAMIGNYAGVPCGYISPL 440
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y++ I + GC + C N A AA++ADATV+V GLD S+EAE KDRV L
Sbjct: 441 QGISRYARTI-HQQGCMGVACPGNQNFGLAEVAARHADATVLVMGLDQSIEAEAKDRVTL 499
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q +LI++VA A+KGPV LV+MS G +D+ FAKN+P++ SI+WVGYPG+ GG AIA
Sbjct: 500 LLPGHQQDLISRVAMASKGPVVLVLMSGGPIDVTFAKNDPRVSSIVWVGYPGQAGGAAIA 559
Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
DV+FG NPGG+LP+TWY +YV K+ +M +R P +PGRTY+F+ GP V+PFG G
Sbjct: 560 DVLFGATNPGGKLPMTWYPQDYVAKVSMANMDMRANPSKGYPGRTYRFYKGPTVFPFGAG 619
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
+SYT F + S+P +V + N T T A V C+ I
Sbjct: 620 ISYTTFSQHLVSAPITVSVPTLHSHDLVSNNTT--TLMKAKATVRTIHTNCESLDIDMHI 677
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
+V+N G MDG+ V+++S PP T KQ++ +E+V + AG +V MNACK L +
Sbjct: 678 DVKNTGDMDGTHAVLIFSTPPD--PTETKQLVAFEKVHVVAGAKQRVKINMNACKHLSVA 735
Query: 725 DNAANSLLASGAHTILVGE 743
D + G H I VG+
Sbjct: 736 DEYGVRRIYMGEHKIHVGD 754
>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
Length = 755
Score = 718 bits (1853), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/749 (49%), Positives = 494/749 (65%), Gaps = 33/749 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C +P R +DL+ R+TL EK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 34 FCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVG--- 90
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PGT F PGATSFP VI T ASFN+SLW++IG+ VS EARAM+N G AGLT+WSP
Sbjct: 91 ---PGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARAMFNGGMAGLTYWSP 147
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+NV RDPRWGR ETPGEDP V G+YA +YVRGLQ G LK++ACCKH
Sbjct: 148 NVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNSGFR---------LKVAACCKH 198
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y AYDLDNW G DR+HF++RV++QD+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCA
Sbjct: 199 YTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVASVMCSYNQVNGKPTCA 258
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
DP LL TIRG+W +GYIVSDCDS+ + E+ + +E A A + KAGLDLDCG +
Sbjct: 259 DPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEEAAAATI-KAGLDLDCGPFL 317
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIEL 370
T AV+ G + E D++ +L V MRLG FDG P + LG ++C P H +L
Sbjct: 318 AIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGKLGPRDVCTPAHQQL 377
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A AA+QGIVLL+N LPL+ N+ T+A++GP A+ T MIGNY G C YT+P+ G
Sbjct: 378 ALHAAQQGIVLLQNSGRTLPLSRPNL-TVAVIGPIADVTVTMIGNYAGVACGYTTPLQGI 436
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
Y+K I+ + GC D+ C N A AA ADATV+V GLD S+EAE +DR DLLLP
Sbjct: 437 SRYAKTIHQS-GCIDVACNGNQQFGMAEAAASQADATVLVMGLDQSIEAEFRDRKDLLLP 495
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G+Q ELI++VA A++GP LV+MS G +D++FAKN+P+I +ILW GYPG+ GG AIADV+
Sbjct: 496 GYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAGYPGQAGGAAIADVL 555
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
FG NPGG+LP+TWY +Y+ K+P T+M +R P +PGRTY+F+ GPVV+PFG+G+SY
Sbjct: 556 FGTTNPGGKLPMTWYPQDYLAKVPMTNMGMRADPSRGYPGRTYRFYKGPVVFPFGHGMSY 615
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T F + + +P+ V + + N T N ++ + C+ I+V+
Sbjct: 616 TTFAHSLVQAPQEVAVPFTSLYALQ--NTTAARN-----SIRVSHANCEPLVLGVHIDVK 668
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N G MDG + ++V+S PP + K++IG+E+V I AG +V + CK L +VD
Sbjct: 669 NTGDMDGIQTLLVFSSPPEGKWSANKKLIGFEKVHIVAGSKKRVKIDIPVCKHLSVVDRF 728
Query: 728 ANSLLASGAHTILVGEGVGGVSFPLQLNL 756
L G H + +G+ +S LQ NL
Sbjct: 729 GIRRLPIGKHDLHIGDLKHSIS--LQANL 755
>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 717 bits (1851), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/742 (49%), Positives = 491/742 (66%), Gaps = 29/742 (3%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ +C+ L R DLV R+TL EK+ +G A GV RLG+P Y+WWSEALHGVS
Sbjct: 49 LAGLRFCNTGLNIKSRVTDLVGRLTLEEKIGFLGSNAIGVSRLGIPAYKWWSEALHGVSN 108
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G G+ F +VPGATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G+AGL
Sbjct: 109 VGG------GSSFSGQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGL 162
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPN+N+ RDPRWGR ETPGEDP + +YA+ YVRGLQ+ +G + +R LK++
Sbjct: 163 TFWSPNVNIFRDPRWGRGQETPGEDPELSSKYAVAYVRGLQETDGGDPNR------LKVA 216
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+ RF F++ V +QDM +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 217 ACCKHYTAYDVDNWKDVHRFTFNAVVNQQDMADTFQPPFKSCVVDGNVASVMCSYNQVNG 276
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL+ IRG W +GYIVSDCDS+ + + T E+AVA+ + AGLDL+
Sbjct: 277 KPTCADPDLLSGVIRGQWKLNGYIVSDCDSVDVLYTKQHY-TKTPEEAVAKSILAGLDLN 335
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICN 364
C + + M AV+ G + E ID ++ + LMRLG+FDG P+ Y LG N++C
Sbjct: 336 CDHFTGQYAMKAVKVGLVNETAIDKAISNNFATLMRLGFFDGDPKKQQLYGGLGPNDVCT 395
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
+ ELA +AARQGIVLLKN G+LPL+ IKTLA++GP+ANAT+ MIGNY G PC+YT
Sbjct: 396 ANNQELARDAARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNYNGIPCKYT 455
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
+P+ G A + Y GC ++ C + +A A +ADA V+V G D S+E E DR
Sbjct: 456 TPLQGL-AETVSSTYQLGC-NVACAE-PDLGSAAALAASADAVVLVMGADQSIEQENLDR 512
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
+DL LPG Q EL+ +VA AKGPV LVIMS GA DI FAKN KI I+WVGYPGE GG
Sbjct: 513 LDLYLPGKQQELVTQVAKVAKGPVVLVIMSGGAFDITFAKNEEKITGIMWVGYPGEAGGL 572
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
AIADVIFG++NP G LP+TWY +YV K+P T+M +RP N +PGRTY+F+ G VY F
Sbjct: 573 AIADVIFGRHNPSGNLPMTWYPQSYVEKVPMTNMNMRPDKSNGYPGRTYRFYTGETVYAF 632
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKPPCAAVLIDDVKCKDYKF 660
G GLSYT F +++ +PK V + LD++ CR +V P C D+ F
Sbjct: 633 GDGLSYTNFNHQILKAPKLVSLDLDENHACRSSECQSVDAIGPHC-----DNAVGGGLNF 687
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
Q++V N+G +GS V +++ PP + G+ K ++G+E++ + + + F ++ CK
Sbjct: 688 EVQLKVRNVGDREGSHTVFLFTTPPEVHGSPRKHLLGFEKIRLGEKEETVIRFNVDVCKD 747
Query: 721 LKIVDNAANSLLASGAHTILVG 742
L +VD +A G + + VG
Sbjct: 748 LSVVDEIGKRKIALGHYLLHVG 769
>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
Length = 738
Score = 717 bits (1851), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/739 (49%), Positives = 496/739 (67%), Gaps = 61/739 (8%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ P+C LP ERA+DLV R+TL EK++ + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 38 NLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEWWSEALHGVSNVG 97
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PGT F PGATSFP VI T ASFN SLW++IG+ VS EARAMYN G AGLT+
Sbjct: 98 ------PGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGGMAGLTY 151
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP V +YA YVRGLQ RD LK++AC
Sbjct: 152 WSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNA-----RDR----LKVAAC 202
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKHY AYDLD+W G DRFHF++RV++QD+++T+ +PF+ CV EG+V+SVMCSYN+VNG P
Sbjct: 203 CKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQVNGKP 262
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
TCADP LL TIRG+W +GYIVSDCDS+ + + T E+A A +KAGLDLDCG
Sbjct: 263 TCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHY-TATPEEAAAVAIKAGLDLDCG 321
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
+ T A++ GK+ EAD++ +L V MRLG FDG P Y NLG ++C P H
Sbjct: 322 PFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCTPAH 381
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
+LA EAARQGIVL++N ALPL+T +T+A++GP+++ T+ MIGNY G C YT+P+
Sbjct: 382 QQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYTTPL 441
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y++ I+ A GC+ + C+++ AA+ AA+ ADATV+V GLD S+EAE +DRVD+
Sbjct: 442 QGIGRYARTIHQA-GCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDRVDI 500
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q EL++KVA A++GP LV+MS G +D++FAKN+P+I +I+WVGYPG+ GG AIA
Sbjct: 501 LLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGTAIA 560
Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
DV+FG+ NPGG+LP+TWY +Y+ K P T+M +R P +PGRTY+F++GPVV+PFG+G
Sbjct: 561 DVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNGPVVFPFGHG 620
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSY+ F + +A +P + F I
Sbjct: 621 LSYSTFAHSLAQAPTT--------------------------------------PLGFHI 642
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
+V+N G MDGS ++++S PP + K+++ +E+V + AG +V F ++ CK L +V
Sbjct: 643 DVKNTGTMDGSHTLLLFSTPPPGTWSPNKRLLAFEKVHVGAGSQERVRFDVHVCKHLSVV 702
Query: 725 DNAANSLLASGAHTILVGE 743
D+ + G H +G+
Sbjct: 703 DHFGIHRIPMGEHHFHIGD 721
>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
[Hordeum vulgare]
Length = 777
Score = 716 bits (1849), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/743 (48%), Positives = 501/743 (67%), Gaps = 30/743 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +C+ K RA+DLV R+TL EKV + + + RLG+P YEWWSEALHGVS+
Sbjct: 48 LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWWSEALHGVSY 107
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F VPGATSFP ILT ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 108 VG------PGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGLAGL 161
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + +YA+ YV GLQD G D LK++
Sbjct: 162 TFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA-GAGGVTDG---ALKVA 217
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 218 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNG 277
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCAD LL IRGDW +GYIVSDCDS+ ++ + + T E+A A +K+G+DL+
Sbjct: 278 KPTCADKDLLEGVIRGDWKLNGYIVSDCDSVD-VLYTQQHYTKTPEEAAAITIKSGVDLN 336
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG++ T+ AVQ G+++E D+D ++ +I+LMRLG+FDG P+ + +LG ++C
Sbjct: 337 CGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTS 396
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA E ARQGIVLLKN +GALPL+ +IK++A++GP+ANA+ MIGNYEGTPC+YT+
Sbjct: 397 SNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 455
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
P+ G A + Y PGC ++ C NS+ + A+ AA +AD TV+V G D S+E E DR
Sbjct: 456 PLQGLGAKVNTV-YQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDR 514
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
LLLPG QT+L++ VA+A+ GPV LV+MS G DI+FAK + KI + LWVGYPGE GG
Sbjct: 515 TSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAATLWVGYPGEAGGA 574
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
A+ D +FG +NP GRLP+TWY A+Y + T M +RP +PGRTY+F+ G V+ F
Sbjct: 575 ALDDTLFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAF 634
Query: 602 GYGLSYTQFKYKVASSPKS-VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
G GLSYT+ + + S+P S V ++L +D CR CA+V C D
Sbjct: 635 GDGLSYTKMSHSLVSAPPSYVSMRLAEDHLCR---------AEECASVEAAGDHCDDLAL 685
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
+++V N G++ G+ V+++S PP K ++G+E+V +A G++ V F ++ C+
Sbjct: 686 DVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHLVGFEKVSLAPGEAGTVAFRVDVCRD 745
Query: 721 LKIVDNAANSLLASGAHTILVGE 743
L +VD +A G HT+ G+
Sbjct: 746 LSVVDELGGRKVALGGHTLHDGD 768
>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
Length = 772
Score = 716 bits (1847), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/752 (48%), Positives = 486/752 (64%), Gaps = 48/752 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ FP+C+ LP +R +D V R+TL EK+ Q+ + A G+PRLG+P Y+WW EALHGV+
Sbjct: 39 LAAFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVA- 97
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
S PG F VP ATSFP I T ASFN SL+ IGQ VSTEARAM+NLG +GL
Sbjct: 98 ------SSPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGL 151
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + +A YVRGLQ+ + + S LK+S
Sbjct: 152 TFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQ-------AGSDKLKVS 204
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH AYD+DNW G DR+HF++ VTEQD+++T+ PF+ CV +G VSSVMCSYNR+NG
Sbjct: 205 ACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRLNG 264
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+PTCAD +LL T+R W +GYIVSDCDS+Q ++ + ++ A + AGL+L+
Sbjct: 265 VPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADAL-LAGLNLN 323
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T+ A+QQ K+ EA I+ +L +L V MRLG +DG P+ Y +LG +++C
Sbjct: 324 CGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCTS 383
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA EAARQG+VLLKN GALPL+T IK+LA+VGPHANAT+AMIGNY G PC+YTS
Sbjct: 384 EHQTLALEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYTS 442
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ F Y++V +YAPGCA++ C ++S+I A+ AA ADA V+ GLDL++EAE DR
Sbjct: 443 PLQAFQKYAQV-SYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRT 501
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q EL+++V AAKGPV +VI+SAGA+DI FA ++ +I ILW GYPG+ GG A
Sbjct: 502 SLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAA 561
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGY 603
IA+VIFG +NP G+LP TWY N+ I M +RP +PGRTY+F+ GP ++ FG
Sbjct: 562 IAEVIFGDHNPSGKLPATWYPQNFTSISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFGD 621
Query: 604 GLSYTQFKYKVASSPKSVDIK----------LDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
GLSYT K +P + I L K C ++ T D+
Sbjct: 622 GLSYTSLSAKFIKAPSFLSIPSTAPMQPCTGLKKSSSCFHLDAT-------------DEK 668
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQ-SAK 710
C+ K I V N G M S +M++S PP G G +Q++G+ ++ IA S
Sbjct: 669 SCESLKSQVAISVRNKGAMAISHTLMLFSTPPSAGSDGVPQRQLVGFNKIQIAGDSISNP 728
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V F ++ C+ D LL SG H + G
Sbjct: 729 VIFDLDPCRHFVHADRDGKKLLRSGTHVLTAG 760
>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
Length = 772
Score = 712 bits (1838), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/748 (47%), Positives = 475/748 (63%), Gaps = 27/748 (3%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
+ S FP+CD LP P+R DLV RM L EK+ Q+ A G+PRLG+P Y+WW EALHGV+
Sbjct: 29 RSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALHGVA 88
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
PG F + VP ATSFP VILT ASFN SLW KI Q +S EA AMYN G +G
Sbjct: 89 -------ESPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSG 141
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS--DSRP- 184
LTFWSPNIN+ RDPRWGR ETPGEDP + +YA +VRGLQ+ + E S RP
Sbjct: 142 LTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQRRPT 201
Query: 185 -LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK+S+CCKH+ AYD++ EG D FHF+++VT QD+Q+TF PF C+ +G S +MCSY
Sbjct: 202 RLKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSY 261
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRVNG+P+CAD L +T+R W F GYIVSDCD++ + E + T EDAVA VL A
Sbjct: 262 NRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINY-TTTAEDAVADVLSA 320
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNN 361
G+DL+CG + T A++QGK+ EA +D +L + V MRLG FDG+ Y ++G +
Sbjct: 321 GMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNSGETYNSIGPDA 380
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+C +H +L+ EAA QGIVLLKN LP ++ T+A++GP NAT+ M+GNY G PC
Sbjct: 381 VCTREHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPC 440
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+Y +P G Y+K + + PGC DI+C + ++ AA+ AA+N+DA VIV GLD E EG
Sbjct: 441 QYITPFQGLQEYTKGVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREG 500
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR LLLPG+Q +L+ +V+ AKGPV LV+MS G +D+ FAK N KI S+LWVGYPGE
Sbjct: 501 LDRTSLLLPGYQQDLVLEVSKVAKGPVILVVMSGGPIDVTFAKGNCKISSVLWVGYPGEA 560
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFFDGPVV 598
GG+AIA VIFG +NP GRLP+TWY + + + +M LRP FPGRTY+F+ G V
Sbjct: 561 GGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFPGRTYRFYTGENV 620
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
Y FG+GLSYT F Y S+P ++ + R G P ID C+
Sbjct: 621 YEFGHGLSYTNFTYTNFSAPSNITAR--NTVAIRTPLREDGARHFP-----IDYTGCEAL 673
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI---KQVIGYERVFIAAGQSAKVGFTM 715
F + N G D + ++Y+ PP + + KQ+I ++R + AG+ AKV F +
Sbjct: 674 AFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQHLIAGRCAKVEFDV 733
Query: 716 NACKSLKIVDNAANSLLASGAHTILVGE 743
+ CK L + + A +L G + + +G+
Sbjct: 734 DTCKDLGLTNEAGTKVLVHGDYKLSLGD 761
>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
Length = 772
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/749 (46%), Positives = 475/749 (63%), Gaps = 29/749 (3%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
+ S FP+CD LP P+R DLV RM L EK+ Q+ A G+PRLG+P Y+WW EALHGV+
Sbjct: 29 RSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALHGVA 88
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
PG F + VP ATSFP VILT ASFN SLW KI Q +S EA AMYN G +G
Sbjct: 89 -------ESPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSG 141
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE-----GVEYHRDSDS 182
LTFWSPNIN+ RDPRWGR ETPGEDP + +YA +VRGLQ+ + + + S +
Sbjct: 142 LTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQGSPT 201
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
R LK+S+CCKH+ AYD++ EG D FHF+++VT QD+Q+TF PF C+ +G S +MCS
Sbjct: 202 R-LKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCS 260
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
YNRVNG+P+CAD L +T+R W F GYIVSDCD++ + E + T EDAVA VL
Sbjct: 261 YNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINY-TTTAEDAVADVLS 319
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
AG+DL+CG + T A++QGK+ EA +D +L + V MRLG FDG+ Y ++G +
Sbjct: 320 AGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNSGETYNSIGPD 379
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+C P+H +L+ EAA QGIVLLKN LP ++ T+A++GP NAT+ M+GNY G P
Sbjct: 380 AVCTPEHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVP 439
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C+Y +P G Y+K + + PGC DI+C + ++ AA+ AA+N+DA VIV GLD E E
Sbjct: 440 CQYITPFQGLQEYTKCVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQERE 499
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
G DR LLLPG Q L+ +V+ AKGPV LV+MS G +D+ FAK N KI ++LWVGYPGE
Sbjct: 500 GLDRTSLLLPGNQQGLVLEVSKVAKGPVILVVMSGGPIDVTFAKENCKISNVLWVGYPGE 559
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFFDGPV 597
GG+AIA VIFG +NP GRLP+TWY + + + +M LRP FPGRTY+F+ G
Sbjct: 560 AGGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFPGRTYRFYTGEN 619
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VY FG+GLSYT F Y +P ++ + R G + P ID C+
Sbjct: 620 VYEFGHGLSYTNFTYTNFCAPSNITAR--NTVAIRTPLREDGARQFP-----IDYTGCEA 672
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI---KQVIGYERVFIAAGQSAKVGFT 714
F + N G D + ++Y+ PP + + KQ+I ++R + AG+ AKV F
Sbjct: 673 LAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQHLIAGRCAKVEFD 732
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGE 743
++ CK L + + A +L G + + +G+
Sbjct: 733 VDTCKDLGLTNEAGTKVLVHGDYKLSLGD 761
>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
Length = 770
Score = 711 bits (1835), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/755 (49%), Positives = 488/755 (64%), Gaps = 33/755 (4%)
Query: 7 VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV 66
V + +C L ER KDL+ R+TL EK++ + + A VPRLG+ YEWWSEALHGV
Sbjct: 39 VGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSEALHGV 98
Query: 67 SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
S +G PGT F PGATSFP VI T ASFN+SLW IG+ VS EARAMYN G A
Sbjct: 99 SNVG------PGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTA 152
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
GLT+WSPN+N+ RDPRWGR ETPGEDP + +YA NYV+GLQ +G + LK
Sbjct: 153 GLTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQGNDG--------KKRLK 204
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
++ACCKHY AYDLDNW G DR+HF+++V++QD+++T+ +PF+ CV EG V+SVMCSYN+V
Sbjct: 205 VAACCKHYTAYDLDNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQV 264
Query: 247 NGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
NG PTCADP LL TIRG W GYIVSDCDS+ + +S F T E+A A +KAGLD
Sbjct: 265 NGKPTCADPDLLKNTIRGAWGLDGYIVSDCDSVGVLYDSQHF-TPTPEEAAASTIKAGLD 323
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNIC 363
LDCG + T AV +G + E D++ +L L V MRLG FDG P Y NLG ++C
Sbjct: 324 LDCGPFLAVHTATAVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVC 383
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
P H LA EAARQGIVLL+N GALPL+ +T+A++GP+++AT MIGNY G C Y
Sbjct: 384 TPAHKHLALEAARQGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEY 443
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
T+P+ G Y K I +A GCA++ C + +I A AA+ ADA V+V GLD S+EAE +D
Sbjct: 444 TTPVQGISKYVKTI-HAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEAESRD 502
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +LLPG Q EL+ ++ A KGP +V+MS G +D++FAKN+ KI ILWVGYPG+ GG
Sbjct: 503 RNGVLLPGKQEELVRRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGG 562
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYP 600
AIADV+FG NPGG+LP+TWY +Y+ K+P T+M LR P +PGRTY+F+ GPVV+P
Sbjct: 563 AAIADVLFGATNPGGKLPMTWYPQSYLAKVPMTNMGLRPDPSTGYPGRTYRFYKGPVVFP 622
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FG+GLSY++F A +P I L + + TV + CA+V
Sbjct: 623 FGFGLSYSKFSQSFAEAP--TKISLPLSSLSPNSSATVKVSHTDCASV---------SDL 671
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
I+V+N G +DGS ++V+S P + K +IG+E+V + AG +V ++ C
Sbjct: 672 PIMIDVKNTGTVDGSHTILVFSTVPNQTWSPEKHLIGFEKVHLIAGSQKRVRIGIHVCDH 731
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLN 755
L VD + G H + +G+ +S L
Sbjct: 732 LSRVDEFGTRRIPMGEHKLHIGDLTHSISLQADLQ 766
>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
Length = 772
Score = 711 bits (1835), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/752 (48%), Positives = 485/752 (64%), Gaps = 48/752 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ FP+C+ L +R +D V R+TL EK+ Q+ + A G+PRLG+P Y+WW EALHGV+
Sbjct: 39 LAAFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVA- 97
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
S PG F VP ATSFP I T ASFN SL+ IGQ VSTEARAM+NLG +GL
Sbjct: 98 ------SSPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGL 151
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + +A YVRGLQ+ + + S LK+S
Sbjct: 152 TFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQ-------AGSDKLKVS 204
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH AYD+DNW G DR+HF++ VTEQD+++T+ PF+ CV +G VSSVMCSYNR+NG
Sbjct: 205 ACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRLNG 264
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+PTCAD +LL T+R W +GYIVSDCDS+Q ++ + ++ A + AGL+L+
Sbjct: 265 VPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADAL-LAGLNLN 323
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T+ A+QQ K+ EA I+ +L +L V MRLG +DG P+ Y +LG +++C
Sbjct: 324 CGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCTS 383
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA EAARQG+VLLKN GALPL+T IK+LA+VGPHANAT+AMIGNY G PC+YTS
Sbjct: 384 EHQTLALEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYTS 442
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ F Y++V +YAPGCA++ C ++S+I A+ AA ADA V+ GLDL++EAE DR
Sbjct: 443 PLQAFQKYAQV-SYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRT 501
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q EL+++V AAKGPV +VI+SAGA+DI FA ++ +I ILW GYPG+ GG A
Sbjct: 502 SLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAA 561
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGY 603
IA+VIFG +NP G+LP TWY N+ I M +RP +PGRTY+F+ GP ++ FG
Sbjct: 562 IAEVIFGDHNPSGKLPATWYPQNFTSISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFGD 621
Query: 604 GLSYTQFKYKVASSPKSVDIK----------LDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
GLSYT K +P + I L K C ++ T D+
Sbjct: 622 GLSYTSLSAKFIKAPSFLSIPSTAPMQPCTGLKKSSSCFHLDAT-------------DEK 668
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQ-SAK 710
C+ K I V N G M S +M++S PP G G +Q++G+ ++ IA S
Sbjct: 669 SCESLKSQVAISVRNKGAMAISHTLMLFSTPPNAGSDGVPQRQLVGFNKIQIAGDSISNP 728
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V F ++ C+ D LL SG H + G
Sbjct: 729 VIFDLDPCRHFVHADPDGKKLLRSGTHVLTAG 760
>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
Length = 822
Score = 706 bits (1822), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/794 (47%), Positives = 497/794 (62%), Gaps = 64/794 (8%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ P+C LP RA+DLV R+T EKV+ + + A GVPRLG+ YEWWSEALHGVS
Sbjct: 39 ATLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQ------------------ 111
G PG F PGAT+FP VI T ASFN +LW+ IGQ
Sbjct: 99 G------PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSC 152
Query: 112 --------------TVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVV 157
VS E RAMYN G AGLTFWSPN+N+ RDPRWGR ETPGEDP V
Sbjct: 153 IRISVFMYVYVCAQAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVA 212
Query: 158 GRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQ 217
RYA YVRGLQ + S LK++ACCKH+ AYDLDNW G DRFHF++ VT Q
Sbjct: 213 ARYAAAYVRGLQ-------QQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQ 265
Query: 218 DMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCD 277
D+++TF +PF CV +G +SVMCSYN+VNG+PTCAD L TIR W GYIVSDCD
Sbjct: 266 DLEDTFNVPFRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCD 325
Query: 278 SIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRF 337
S+ + S + T+EDAVA L+AGLDLDCG + +T GAV QGK+ + DID ++
Sbjct: 326 SVD-VFYSDQHYTRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTN 384
Query: 338 LYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG 394
V MRLG FDG P + +LG ++C H ELA EAARQGIVLLKND ALPL+
Sbjct: 385 TVTVQMRLGMFDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPA 444
Query: 395 NIK-TLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM 453
+ +A+VGPHA AT AMIGNY G PCRYT+P+ G Y+ + PGC D+ C +
Sbjct: 445 TARRAVAVVGPHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQ 504
Query: 454 -IPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
I AA+DAA+ ADAT++VAGLD +EAEG DR LLLPG Q ELI+ VA A+KGPV LV+
Sbjct: 505 PIAAAVDAARRADATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVL 564
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-K 571
MS G +DI FA+N+PKI ILW GYPG+ GG+AIADVIFG +NPGG+LP+TWY +Y+ K
Sbjct: 565 MSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQK 624
Query: 572 IPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
+P T+M +R P +PGRTY+F+ GP ++PFG+GLSYT F + +A +P + ++L
Sbjct: 625 VPMTNMAMRANPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHH 684
Query: 630 QCRDINYTVGTNK--PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY------ 681
+ ++ AAV + +C++ + ++V N+G+ DG+ V+VY
Sbjct: 685 AAASASASLNATARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPAS 744
Query: 682 --SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
++ G ++Q++ +E+V + AG +A+V ++ C L + D + G H +
Sbjct: 745 SAAEAAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRL 804
Query: 740 LVGEGVGGVSFPLQ 753
++GE V+ L+
Sbjct: 805 IIGELTHTVTIALE 818
>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 709
Score = 704 bits (1818), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/716 (49%), Positives = 489/716 (68%), Gaps = 33/716 (4%)
Query: 39 QQMGDLAYGVP---RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVI 95
Q++G L P RLG+P YEWWSEALHGVS++G PGT F VPGATSFP I
Sbjct: 7 QKVGFLVNKQPALGRLGIPAYEWWSEALHGVSYVG------PGTRFSPLVPGATSFPQPI 60
Query: 96 LTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPY 155
LT ASFN SL++ IG+ VSTEARAM+N+G AGLTFWSPNIN+ RDPRWGR ETPGEDP
Sbjct: 61 LTAASFNASLFRAIGEVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPL 120
Query: 156 VVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVT 215
+ +YA+ YV GLQD G D LK++ACCKHY AYD+DNW+G +R+ FD++V+
Sbjct: 121 LASKYAVGYVTGLQDA-GAGGVTDG---ALKVAACCKHYTAYDVDNWKGVERYTFDAKVS 176
Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
+QD+ +TF PF+ CV +G+V+SVMCSYN+VNG PTCAD LL IRGDW +GYIVSD
Sbjct: 177 QQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSD 236
Query: 276 CDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSL 335
CDS+ ++ + + T E+A A +K+GLDL+CG++ T+ AVQ G+++E D+D ++
Sbjct: 237 CDSVD-VLYTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAI 295
Query: 336 RFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLN 392
+I+LMRLG+FDG P+ + +LG ++C + ELA E ARQGIVLLKN +GALPL+
Sbjct: 296 TNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLS 354
Query: 393 TGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNS 452
+IK++A++GP+ANA+ MIGNYEGTPC+YT+P+ G A + Y PGC ++ C NS
Sbjct: 355 AKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGAKVNTV-YQPGCTNVGCSGNS 413
Query: 453 M-IPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
+ + A+ AA +AD TV+V G D S+E E DR LLLPG QT+L++ VA+A+ GPV LV
Sbjct: 414 LQLSTAVAAAASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILV 473
Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV- 570
+MS G DI+FAK + KI +ILWVGYPGE GG A+AD++FG +NP GRLP+TWY A+Y
Sbjct: 474 VMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYAD 533
Query: 571 KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS-VDIKLDK 627
+ T M +RP +PGRTY+F+ G V+ FG GLSYT+ + + S+P S V ++L +
Sbjct: 534 TVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAE 593
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
D CR CA+V C D F +++V N G++ G+ V+++S PP
Sbjct: 594 DHPCR---------AEECASVEAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPA 644
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
K ++G+E+V +A G++ V F ++ C+ L +VD +A G HT+ VG+
Sbjct: 645 HNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGD 700
>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
Length = 779
Score = 703 bits (1814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/765 (46%), Positives = 476/765 (62%), Gaps = 69/765 (9%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L F +C+ +LP R +DL+ RMTL EK+ Q+ + A G+PRLGLP YEWW EALHGV+
Sbjct: 41 LLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVAV 100
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
PG F + PGATSFP ILT ASF+ VSTEARAM+N AGL
Sbjct: 101 -------SPGVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGL 144
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQD + LK+S
Sbjct: 145 TYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDT-------NLGGDKLKVS 197
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH AYD+DNW+G RF F++ VT+QD+ +T+ PF+ CV + VSSVMCSYNRVNG
Sbjct: 198 ACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRVNG 257
Query: 249 IPTCADPKLLNQTIRGDWNFHG----------------YIVSDCDSIQTIVESHKFLNDT 292
+PTCAD LL+ T+R WN +G YIVSDCDS+QT ++ + T
Sbjct: 258 VPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNYAK-T 316
Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
ED VA L AGL+LDCG + T A+ GKI EA+++ +LR+LY V MRLG +DG+P
Sbjct: 317 AEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGNP 376
Query: 353 Q---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
+ Y NLG ++C ++ +LA +AA++GIVLLKN+ LP + NI+T+A +GPHA AT
Sbjct: 377 RSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKAT 436
Query: 410 KAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
+AMIGNY+G PC+YT+P DG AY++V+ Y+ GC+D+ C ++S+I +A+ A ADA V+
Sbjct: 437 RAMIGNYQGIPCKYTTPHDGLSAYARVV-YSAGCSDVACYSDSLIGSAVSTASQADAVVL 495
Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
GLDL+ EAEGKDR LLLPG Q EL+ +V AAKGP LVI S G+VD++FAK N K+
Sbjct: 496 FVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVSFAKYNNKV 555
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPG 587
+ ILW GYPGE GG AIA V+FG +NPGGRLP+TWY ++ I M +RP +PG
Sbjct: 556 QGILWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITMLDMNMRPDASRGYPG 615
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS--------VDIKLDKDQQCRDINYTVG 639
RTY+F+ G VY FGYG +Y++ +K +P S V D + C +N
Sbjct: 616 RTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFPEAAAVKRSCDGNLTCFHLNAH-- 673
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIG 697
D++ C +I V N G + V++YS PP G G I+Q+ G
Sbjct: 674 -----------DEITCSTLTSKVRILVHNKGDRPSNRAVLLYSSPPNAGRDGAPIRQLAG 722
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ +V +A G V ++ CK L +L G HT+ VG
Sbjct: 723 FGKVSVAPGAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVG 767
>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
Length = 779
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/765 (46%), Positives = 476/765 (62%), Gaps = 69/765 (9%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L F +C+ +LP R +DL+ RMTL EK+ Q+ + A G+PRLGLP YEWW EALHGV+
Sbjct: 41 LLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVAV 100
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
PG F + PGATSFP ILT ASF+ VSTEARAM+N AGL
Sbjct: 101 -------SPGVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGL 144
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQD + LK+S
Sbjct: 145 TYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDT-------NLGGDKLKVS 197
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH AYD+DNW+G RF F++ VT+QD+ +T+ PF+ CV + VSSVMCSYNRVNG
Sbjct: 198 ACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRVNG 257
Query: 249 IPTCADPKLLNQTIRGDWNFHG----------------YIVSDCDSIQTIVESHKFLNDT 292
+PTCAD LL+ T+R WN +G YIVSDCDS+QT ++ + T
Sbjct: 258 VPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNYAK-T 316
Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
ED VA L AGL+LDCG + T A+ GKI EA+++ +LR+LY V MRLG +DG+P
Sbjct: 317 AEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGNP 376
Query: 353 Q---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
+ Y NLG ++C ++ +LA +AA++GIVLLKN+ LP + NI+T+A +GPHA AT
Sbjct: 377 RSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKAT 436
Query: 410 KAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
+AMIGNY+G PC+YT+P DG AY++V+ Y+ GC+D+ C +NS+I +A A ADA V+
Sbjct: 437 RAMIGNYQGIPCKYTTPHDGLSAYARVV-YSAGCSDVACYSNSLIGSAASTASQADAVVL 495
Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
GLDL+ EAEGKDR LLLPG Q EL+ +V AAKGPV LVI S G+VD++FAK + K+
Sbjct: 496 FVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVSFAKYDKKV 555
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPG 587
+ +LW GYPGE GG AIA V+FG +NPGGRLP+TWY ++ I M +RP +PG
Sbjct: 556 QGMLWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITMLDMNMRPDASRGYPG 615
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS--------VDIKLDKDQQCRDINYTVG 639
RTY+F+ G VY FGYG +Y++ +K +P S V D + C +N
Sbjct: 616 RTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFPEAAAVKRSCDGNLTCFHLNAH-- 673
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIG 697
D++ C +I V N G + V++YS PP G G I+Q+ G
Sbjct: 674 -----------DEITCSTLTSKVRILVHNEGDRPSNRAVLLYSSPPNAGRDGAPIRQLAG 722
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ +V +A G V ++ CK L +L G HT+ VG
Sbjct: 723 FGKVSVAPGAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVG 767
>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 701 bits (1809), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/756 (46%), Positives = 474/756 (62%), Gaps = 44/756 (5%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
SD+P+C+ LP RA+ LV +TL EK+QQ+ D A +PRL +P YEWWSE+LHG++
Sbjct: 38 SDYPFCNTSLPISTRAQSLVSLLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATN 97
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PG F+ V ATSFP V+LT ASFN SLW IG ++ EARAMYN+G AGLT
Sbjct: 98 G------PGVSFNGTVSAATSFPQVLLTAASFNRSLWFSIGSAIAVEARAMYNVGQAGLT 151
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
FW+PNIN+ RDPRWGR ETPGEDP V YA+ +VRG Q DSD L +SA
Sbjct: 152 FWAPNINIFRDPRWGRGQETPGEDPMVASAYAVEFVRGFQG--------DSDGDGLMLSA 203
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH AYDL+ W R+ FD+ V+ QD+++T+ PF CV +G S +MCSYNRVNG+
Sbjct: 204 CCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGV 263
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CA L Q + +W F GYI SDCD++ T+ E + N + EDAVA VLKAG D++C
Sbjct: 264 PACARQDLF-QKAKTEWGFKGYITSDCDAVATVYEYQHYAN-SPEDAVADVLKAGTDINC 321
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G Y T A+ QGK+ E DID +L L+ V MRLG FDG P Y NLG ++C +
Sbjct: 322 GSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKE 381
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA EAARQGIVLLKND LPL+ I +LA++GP A+ + G Y G PC+ S
Sbjct: 382 HRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAIIGPQAD-QPFLGGGYTGIPCKPESL 440
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
++G Y + ++A GC D+ C +++ A+ A+ AD V+VAGLDLS E E DRV
Sbjct: 441 VEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVS 500
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q LI+ VA A + P+ LV+ G +D++FA+ +P+I SILW+GYPGE G +A+
Sbjct: 501 LLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKAL 560
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
A++IFG +NPGGRLP+TWY ++ ++P M +R P +PGRTY+F+ G VY FG G
Sbjct: 561 AEIIFGDFNPGGRLPMTWYPESFTRVPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQG 620
Query: 605 LSYTQFKYKVASSPKSVDIKLDKD---------QQCRDINYTVGTNKPPCAAVLIDDV-K 654
LSYT+F Y+ S+P +++ D Q+ ++NY I+++
Sbjct: 621 LSYTKFAYQFVSAPNKLNLLRSSDTVSSKNLPRQRREEVNY-----------FHIEELDT 669
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGF 713
C +F +I V N+G MDGS VVM++S+ P I GT KQ+IG+ RV + +S +
Sbjct: 670 CDSLRFHVEISVTNVGDMDGSHVVMLFSRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSI 729
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVS 749
++ C+ I + ++ G HTI++G+ V VS
Sbjct: 730 MVDPCEHFSIANEQGKRIMPLGDHTIMLGDVVHSVS 765
>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 774
Score = 699 bits (1804), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/743 (46%), Positives = 483/743 (65%), Gaps = 27/743 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S F +C LP +R +DLV R+TL EK+ Q+ A +PRLG+P YEWWSEALHGV+ +
Sbjct: 39 SSFLFCKTSLPISQRVRDLVSRLTLDEKISQLVSSAPSIPRLGIPAYEWWSEALHGVANV 98
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
GR G HF+ + ATSFP VILT ASF+ W +IGQ + EARA+YN G A G+
Sbjct: 99 GR------GIHFEGAIKAATSFPQVILTAASFDAYQWYRIGQVIGREARAVYNAGQATGM 152
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFW+PNIN+ RDPRWGR ETPGEDP V G+YA++YVRG+Q G + L+ S
Sbjct: 153 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ---GDSFQGGKLKGHLQAS 209
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDLDNW+G +RF FD+RVT QD+ +T+ PF+ CV +G S +MC+YNRVNG
Sbjct: 210 ACCKHFTAYDLDNWKGVNRFVFDARVTMQDLADTYQPPFQSCVQQGKASGIMCAYNRVNG 269
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
IP+CAD LL++T RG W+FHGYI SDCD++ I ++ + + EDAV VLKAG+D++
Sbjct: 270 IPSCADFNLLSRTARGQWDFHGYIASDCDAVSIIYDNQGYAK-SPEDAVVDVLKAGMDVN 328
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y T AV+Q K+ EA ID +L L+ V MRLG F+G+P + N+G + +C+
Sbjct: 329 CGSYLQKHTKAAVEQKKLPEASIDRALHNLFSVRMRLGLFNGNPTEQPFSNIGPDQVCSQ 388
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA EAAR GIVLLKN LPL +LA++GP+AN+ + ++GNY G PC+ +
Sbjct: 389 EHQILALEAARNGIVLLKNSARLLPLQKSKTVSLAVIGPNANSVQTLLGNYAGPPCKTVT 448
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ Y K Y GC + C + S I A+D AK D V++ GLD + E E DR+
Sbjct: 449 PLQALQYYVKNTIYYSGCDTVKCSSAS-IDKAVDIAKGVDRVVMIMGLDQTQEREELDRL 507
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL+LPG Q ELI VA +AK P+ LV++S G VDI+FAK + I SILW GYPGE GG A
Sbjct: 508 DLVLPGKQQELITNVAKSAKNPIVLVLLSGGPVDISFAKYDENIGSILWAGYPGEAGGIA 567
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
+A++IFG +NPGG+LP+TWY +VK+P T M +R P + +PGRTY+F+ G V+ FGY
Sbjct: 568 LAEIIFGDHNPGGKLPMTWYPQEFVKVPMTDMRMRPDPSSGYPGRTYRFYKGRNVFEFGY 627
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK---CKDYKF 660
GLSY+++ Y++ ++ + L++ R I+ N P A L+ + CK+ KF
Sbjct: 628 GLSYSKYSYELKYVSQT-KLYLNQSSTMRIID-----NSDPVRATLVAQLGAEFCKESKF 681
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
+ ++ VEN G+M G V+++++ G +Q+IG++ V + AG+ A++ F ++ C+
Sbjct: 682 SVKVGVENQGEMAGKHPVLLFARHARHGNGRPRRQLIGFKSVILNAGEKAEIEFELSPCE 741
Query: 720 SLKIVDNAANSLLASGAHTILVG 742
+ ++ G H ++VG
Sbjct: 742 HFSRANEDGLRVMEEGTHFLMVG 764
>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
Length = 777
Score = 699 bits (1804), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/754 (47%), Positives = 477/754 (63%), Gaps = 31/754 (4%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +CD +LP +RA DLV ++TL EK+ Q+GD + V RLG+P Y+WWSEALHGV+
Sbjct: 40 SSAAFCDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGVPAYKWWSEALHGVANA 99
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
GR G H D + ATSFP VILT ASFN LW +IGQ + TEAR +YN G A GL
Sbjct: 100 GR------GVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARGVYNNGQAEGL 153
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFW+PNINV RDPRWGR ETPGEDP + G+YA +VRG+Q G +S L+ S
Sbjct: 154 TFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GYGMSGAINSSDLEAS 210
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDL+NW+G RF FD++VTEQD+ +T+ PF+ CV +G S +MCSYNRVNG
Sbjct: 211 ACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSYNRVNG 270
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+PTCAD LL++T RGDW+F+GYI SDCD++ I + + EDAVA VLKAG+D++
Sbjct: 271 VPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAK-APEDAVADVLKAGMDVN 329
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK---NLGKNNICNP 365
CG Y + A QQGKI DID +LR L+ + MRLG FDG+P+Y N+G + +C+
Sbjct: 330 CGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFDGNPKYNRYGNIGADQVCSK 389
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H +LA +AAR GIVLLKND ALPL+ + +LA++GP+ N ++GNY G PC +
Sbjct: 390 EHQDLALQAARDGIVLLKNDGAALPLSKSKVSSLAVIGPNGNNASLLLGNYFGPPCISVT 449
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ Y K + GC VC N S I A+ AA +AD V+ GLD + E E DR+
Sbjct: 450 PLQALQGYVKDARFVQGCNAAVC-NVSNIGEAVHAAGSADYVVLFMGLDQNQEREEVDRL 508
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+L LPG Q L+N VADAAK PV LV++ G VD+ FAKNNPKI +I+W GYPG+ GG A
Sbjct: 509 ELGLPGMQESLVNSVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQAGGIA 568
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IA V+FG +NPGGRLP+TWY + +P T M +R P +PGRTY+F+ G VY FGY
Sbjct: 569 IAQVLFGDHNPGGRLPVTWYPKEFTAVPMTDMRMRADPSTGYPGRTYRFYKGKTVYNFGY 628
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK------CKD 657
GLSY+++ ++ AS K K I T + A + DV+ C
Sbjct: 629 GLSYSKYSHRFAS-------KGTKPPSMSGIEGLKATARASAAGTVSYDVEEMGAEACDR 681
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
+F + V+N G MDG +V+++ + P G Q+IG++ V + A ++A V F ++
Sbjct: 682 LRFPAVVRVQNHGPMDGGHLVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEVS 741
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
CK L ++ G+H + VG+ +SF
Sbjct: 742 PCKHLSRAAEDGRKVIDQGSHFVRVGDDEFELSF 775
>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
Length = 789
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/769 (45%), Positives = 478/769 (62%), Gaps = 49/769 (6%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
SD+P+C+ LP RA+ LV +TL EK+QQ+ D A +PRL +P YEWWSE+LHG++
Sbjct: 38 SDYPFCNTSLPISTRAQSLVSLLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATN 97
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PG F+ V ATSFP V+LT ASFN SLW IG ++ EARAMYN+G AGLT
Sbjct: 98 G------PGVSFNGTVSAATSFPQVLLTAASFNRSLWFSIGSAIAVEARAMYNVGQAGLT 151
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--------DVEGVEYHR--- 178
FW+PNIN+ RDPRWGR ETPGEDP V YA+ +VRG Q ++ G +
Sbjct: 152 FWAPNINIFRDPRWGRGQETPGEDPMVASAYAVEFVRGFQGGNWKGGDEIRGAVGKKRVL 211
Query: 179 --DSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
DSD L +SACCKH AYDL+ W R+ FD+ V+ QD+++T+ PF CV +G
Sbjct: 212 RGDSDGDGLMLSACCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKA 271
Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
S +MCSYNRVNG+P CA L Q + +W F GYI SDCD++ T+ E + N + EDA
Sbjct: 272 SCLMCSYNRVNGVPACARQDLF-QKAKTEWGFKGYITSDCDAVATVYEYQHYAN-SPEDA 329
Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--- 353
VA VLKAG D++CG Y T A+ QGK+ E DID +L L+ V MRLG FDG P
Sbjct: 330 VADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGL 389
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
Y NLG ++C +H LA EAARQGIVLLKND LPL+ I +LA++GP A+ +
Sbjct: 390 YGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAIIGPQAD-QPFLG 448
Query: 414 GNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
G Y G PC+ S ++G Y + ++A GC D+ C +++ A+ A+ AD V+VAGL
Sbjct: 449 GGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGL 508
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
DLS E E DRV LLLPG Q LI+ VA A + P+ LV+ G +D++FA+ +P+I SIL
Sbjct: 509 DLSQETEDHDRVSLLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSFAEQDPRIASIL 568
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYK 591
W+GYPGE G +A+A++IFG +NPGGRLP+TWY ++ ++P M +R P +PGRTY+
Sbjct: 569 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTRVPMNDMNMRADPYRGYPGRTYR 628
Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKD---------QQCRDINYTVGTNK 642
F+ G VY FG GLSYT+F Y+ S+P +++ D Q+ ++NY
Sbjct: 629 FYIGHRVYGFGQGLSYTKFAYQFVSAPNKLNLLRSSDTVSSKNLPRQRREEVNY------ 682
Query: 643 PPCAAVLIDDV-KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYER 700
I+++ C +F +I V N+G MDGS VVM++S+ P I GT KQ+IG+ R
Sbjct: 683 -----FHIEELDTCDSLRFHVEISVTNVGDMDGSHVVMLFSRVPKIVKGTPEKQLIGFSR 737
Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVS 749
V + +S + ++ C+ I + ++ G HTI++G+ V VS
Sbjct: 738 VHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDHTIMLGDVVHSVS 786
>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
Length = 784
Score = 696 bits (1795), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/755 (46%), Positives = 481/755 (63%), Gaps = 36/755 (4%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ P+CD LP R DLV R+T+ EK+ Q+GD + +PRLG+P Y+WWSEALHGV+ G
Sbjct: 49 NIPFCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIPRLGVPAYKWWSEALHGVANAG 108
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
R G H D + ATSFP VILT ASFN LW +IGQ + EARA+YN G A GLT
Sbjct: 109 R------GIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEARAVYNNGQAEGLT 162
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
FW+PNINV RDPRWGR ETPGEDP + G+YA +VRG+Q G +S L+ SA
Sbjct: 163 FWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GYGVAGPVNSTDLEASA 219
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH+ AYDL+NW+G R+ +D++VT QD+++T+ PF+ CV +G S +MCSYNRVNG+
Sbjct: 220 CCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYNPPFKSCVEDGHASGIMCSYNRVNGV 279
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCAD LL++T R W F+GYI SDCD++ I ++ + T EDAVA VLKAG+D++C
Sbjct: 280 PTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHDAQGYAK-TSEDAVADVLKAGMDVNC 338
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G Y + A+QQGKI E DI+ +L L+ V MRLG F+G P+ Y N+G + +C +
Sbjct: 339 GGYVQKYGASALQQGKITEQDINRALHNLFTVRMRLGLFNGDPRRNRYGNIGPDQVCTQE 398
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H +LA EAA+ GIVLLKND GALPL+ + +LA++G +AN +++GNY G PC +P
Sbjct: 399 HQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVIGFNANNATSLLGNYFGPPCVTVTP 458
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ Y K ++ GC C N + IP A+ AA +AD+ V+ GLD + E E DR+D
Sbjct: 459 LQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQNQEREEVDRLD 517
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
L LPG Q LI VA+AAK PV LV++ G VD++FAK NPKI +ILW GYPGE GG AI
Sbjct: 518 LTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGEAGGIAI 577
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
A V+FG++NPGGRLP+TWY ++ K+P T M +R P +PGRTY+F+ GP V+ FGYG
Sbjct: 578 AQVLFGEHNPGGRLPVTWYPQDFTKVPMTDMRMRADPATGYPGRTYRFYRGPTVFNFGYG 637
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK------CKDY 658
LSY+++ ++ + P + + + T G V DV+ C
Sbjct: 638 LSYSKYSHRFVTKPPP---SMSNVAGLKALATTAG-------GVATYDVEAIGSETCDRL 687
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGI---AGTHIKQVIGYERVFIAAGQSAKVGFTM 715
KF + V+N G MDG V+V+ + P +G +Q+IG++ + + A Q+A V F +
Sbjct: 688 KFPAVVRVQNHGPMDGKHPVLVFLRWPNATDGSGRPARQLIGFQSLHLRATQTAHVEFEV 747
Query: 716 NACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
+ CK ++ G+H ++VG+ +SF
Sbjct: 748 SPCKHFSRATEDGRKVIDQGSHFVMVGDDEFEMSF 782
>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
Length = 790
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/759 (46%), Positives = 470/759 (61%), Gaps = 50/759 (6%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ P+C LP RA+DLV R+T EKV+ + + A GV RLG+ YEWWSEALHGVS
Sbjct: 43 TTLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEALHGVSDT 102
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PG F PGAT+FP VI A+ N +LW+ IG+ VS EARAMYN G AGLT
Sbjct: 103 G------PGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRAGLT 156
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
FWSPN+N+ RDPRWGR ETPGEDP + RYA YVRGLQ + D LK++A
Sbjct: 157 FWSPNVNIFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQ--------QPYDHNRLKLAA 208
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH+ AYDLD+W G DRFHF++ V+ QD+++TF +PF CV G +SVMCSYN+VNG+
Sbjct: 209 CCKHFTAYDLDSWGGTDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAASVMCSYNQVNGV 268
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCAD L TIR W GYIVSDCDS+ + T EDAVA L+AGLDLDC
Sbjct: 269 PTCADQGFLRGTIRKAWGLDGYIVSDCDSVDVFFRDQHYTR-TAEDAVAATLRAGLDLDC 327
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G + +T AV + K+++AD+D +L V MRLG FDG P + +LG ++C
Sbjct: 328 GPFLALYTENAVARKKVSDADVDAALLNTVTVQMRLGMFDGDPASGPFGHLGAADVCTKA 387
Query: 367 HIELAAEAARQGIVLLKNDNG-------ALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
H +LA +AARQ +VLLKN G LPL + +A+VGPHA+AT AMIGNY G
Sbjct: 388 HQDLALDAARQSVVLLKNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGK 447
Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQ-NNSMIPAAIDAAKNADATVIVAGLDLSVE 478
PCRYT+P+ G AY+ + + GCAD+ CQ N I AA+DAA+ GL S
Sbjct: 448 PCRYTTPLQGVAAYAARVVHQAGCADVACQGKNQPIAAAVDAARRLTPPSSSPGLTRS-- 505
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
LLLPG Q ELI+ VA AAKGPV LV+MS G +DI FA+N+P+I ILWVGYP
Sbjct: 506 --------LLLPGRQAELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYP 557
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDG 595
G+ GG+AIADVIFG++NPGG+LP+TWY +Y+ K+P T+M +R P +PGRTY+F+ G
Sbjct: 558 GQAGGQAIADVIFGQHNPGGKLPVTWYPQDYLEKVPMTNMAMRANPARGYPGRTYRFYTG 617
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG---TNKPPCAAVLIDD 652
P ++ FG+GLSYTQF + +A +P + ++L + + P AV +
Sbjct: 618 PTIHAFGHGLSYTQFTHTLAHAPAQLTVRLSTSSASASASASAASLLNATRPSRAVRVAH 677
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY------SKPPGIAGTH--IKQVIGYERVFIA 704
+C+ ++V N+G DG+ V+VY S AGT +Q++ +E+V +
Sbjct: 678 ARCEGLTVPVHVDVRNVGDRDGAHAVLVYHVAPSSSSSSAPAGTDAPARQLVAFEKVHVP 737
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
AG A+V ++ C L + D + G H +++GE
Sbjct: 738 AGGVARVEMGIDVCDRLSVADRDGVRRIPVGEHRLMIGE 776
>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
gi|194688848|gb|ACF78508.1| unknown [Zea mays]
gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 780
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/751 (47%), Positives = 475/751 (63%), Gaps = 27/751 (3%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ P+CDA LP R DLV RMT+ EK+ Q+GD + +PRLG+P Y+WWSEALHG+S G
Sbjct: 44 NIPFCDAGLPIDRRVDDLVSRMTVAEKISQLGDQSPAIPRLGVPAYKWWSEALHGISNQG 103
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
R G H D + ATSFP VILT ASFN LW +IGQ + EARA+YN G A GLT
Sbjct: 104 R------GIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEARAVYNNGQAEGLT 157
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
FW+PNINV RDPRWGR ETPGEDP + G+YA +VRG+Q G +S L+ SA
Sbjct: 158 FWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GYGLAGPVNSTGLEASA 214
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH+ AYDL+NW+G R+ FD++VT QD+ +T+ PF+ CV +G S +MCSYNRVNG+
Sbjct: 215 CCKHFTAYDLENWKGVTRYVFDAKVTAQDLADTYNPPFKSCVEDGHASGIMCSYNRVNGV 274
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
PTCAD LL+ T R DW F+GYI SDCD++ I ++ + T EDAVA VLKAG+D++C
Sbjct: 275 PTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAK-TAEDAVADVLKAGMDVNC 333
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G Y + A+QQGKI E DI+ +L L+ V MRLG F+G P+ Y ++G + +C +
Sbjct: 334 GSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQVCTQE 393
Query: 367 HIELAAEAARQGIVLLKNDNGA--LPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
H +LA EAA+ GIVLLKND GA LPL+ N+ +LA++G +AN + GNY G PC
Sbjct: 394 HQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGPPCVTV 453
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
+P+ Y K ++ GC C N + IP A+ AA +AD+ V+ GLD E E DR
Sbjct: 454 TPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQDQEREEVDR 512
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
+DL LPG Q LI VA+AAK PV LV++ G VD++FAK NPKI +ILW GYPGE GG
Sbjct: 513 LDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGEAGGI 572
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
AIA V+FG++NPGGRLP+TWY ++ ++P T M +R P +PGRTY+F+ GP V+ FG
Sbjct: 573 AIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRFYRGPTVFNFG 632
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
YGLSY+++ ++ A+ P + + T G I C KF
Sbjct: 633 YGLSYSKYSHRFATKPPPT----SNVAGLKAVEATAG-GMASYDVEAIGSETCDRLKFPA 687
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGI---AGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
+ V+N G MDG V+V+ + P +G Q+IG++ + + A Q+A V F ++ CK
Sbjct: 688 VVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEFEVSPCK 747
Query: 720 SLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
++ G+H ++VGE +SF
Sbjct: 748 HFSRATEDGRKVIDQGSHFVMVGEDEFEMSF 778
>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
Length = 777
Score = 692 bits (1786), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/749 (44%), Positives = 482/749 (64%), Gaps = 28/749 (3%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+C+ L + RA+ LV +TL EK+QQ+ + A +PRLG+P Y+WWSE LHG++ G
Sbjct: 30 YPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG- 88
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PG F+ + ATSFP V++T ASFN +LW IG ++ EARAM+N+G GLT W
Sbjct: 89 -----PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIW 143
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR--------DSDSR 183
+PNIN+ RDPRWGR ETPGEDP V Y+I +VRGLQ ++ H D+
Sbjct: 144 APNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMG 203
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
L +SACCKH+ AYDL+ W R+ FDS VTEQD+ +T+ PF C+ +G S +MCSY
Sbjct: 204 SLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSY 263
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N VNG+P CA+P LL + R DW GYI SDCD++ T+ E K+ DT EDA+A VLKA
Sbjct: 264 NAVNGVPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKY-TDTPEDAIADVLKA 321
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKN 360
G+D++CG + T A+ QGK+ E ++D++L L+ V RLG+FDG+P ++ LG
Sbjct: 322 GMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQ 381
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
++C QH LA EAARQGIVLLKN+N LPL+ I +L ++G AN + ++G Y G P
Sbjct: 382 DVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVP 441
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C S ++GF Y++ I +A GC D+ C +++ AI AK AD + VAGLD S E E
Sbjct: 442 CSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETE 501
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRV LLLPG Q +L++ VA +K P+ LV++ G +DI+FAK + ++ SILW+G PGE
Sbjct: 502 DLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGE 561
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
GG+A+A+VIFG YNPGGRLP+TWY ++ +P M +R P +PGRTY+F+ G +
Sbjct: 562 AGGKALAEVIFGDYNPGGRLPVTWYPQSFTNVPMNDMHMRPNPSRGYPGRTYRFYTGDRI 621
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV--GTNKPPCAAVLIDDVK-C 655
Y FG GLSYT FKY++ S+PK V++ + R I V G N + + +++V+ C
Sbjct: 622 YGFGEGLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNM---SYMEVEEVESC 678
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+F ++ V N+G+ DGS VVM++S+ P + GT +Q+IG++R+++ QSA+
Sbjct: 679 DLLRFEVKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIM 738
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGE 743
++ C + + D ++ G HTI +G+
Sbjct: 739 VDPCNHVSLADEYGKRVIPLGDHTISLGD 767
>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
Precursor
gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
Length = 792
Score = 692 bits (1786), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/757 (44%), Positives = 481/757 (63%), Gaps = 32/757 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
S +P+C+ L +RA LV + LPEK+ Q+ + A VPRLG+P YEWWSE+LHG++
Sbjct: 37 FSSYPFCNVSLSIKQRAISLVSLLMLPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLA- 95
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
++ PG F+ + ATSFP VI++ ASFN +LW +IG V+ E RAMYN G AGL
Sbjct: 96 -----DNGPGVSFNGSISAATSFPQVIVSAASFNRTLWYEIGSAVAVEGRAMYNGGQAGL 150
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR----- 183
TFW+PNINV RDPRWGR ETPGEDP VV Y + +VRG Q+ + + + S
Sbjct: 151 TFWAPNINVFRDPRWGRGQETPGEDPKVVSEYGVEFVRGFQEKKKRKVLKRRFSDDVDDD 210
Query: 184 --------PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
L +SACCKH+ AYDL+ W R+ F++ VTEQDM++T+ PFE C+ +G
Sbjct: 211 RHDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGK 270
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
S +MCSYN VNG+P CA LL Q R +W F GYI SDCD++ TI +++ + E+
Sbjct: 271 ASCLMCSYNAVNGVPACAQGDLL-QKARVEWGFEGYITSDCDAVATIF-AYQGYTKSPEE 328
Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--- 352
AVA +KAG+D++CG Y T A++QGK++E +D +L L+ V +RLG FDG P
Sbjct: 329 AVADAIKAGVDINCGTYMLRHTQSAIEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRRG 388
Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
QY LG N+IC+ H +LA EA RQGIVLLKND+ LPLN ++ +LA+VGP AN M
Sbjct: 389 QYGKLGSNDICSSDHRKLALEATRQGIVLLKNDHKLLPLNKNHVSSLAIVGPMANNISNM 448
Query: 413 IGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
G Y G PC+ + Y K +YA GC+D+ C +++ A+ AK AD ++VAG
Sbjct: 449 GGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCDSDTGFGEAVAIAKGADFVIVVAG 508
Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
LDLS E E KDRV L LPG Q +L++ VA +K PV LV+ G VD+ FAKN+P+I SI
Sbjct: 509 LDLSQETEDKDRVSLSLPGKQKDLVSHVAAVSKKPVILVLTGGGPVDVTFAKNDPRIGSI 568
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN--FPGRTY 590
+W+GYPGE GG+A+A++IFG +NPGGRLP TWY ++ + + M +R ++ +PGRTY
Sbjct: 569 IWIGYPGETGGQALAEIIFGDFNPGGRLPTTWYPESFTDVAMSDMHMRANSSRGYPGRTY 628
Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+F+ GP VY FG GLSYT+F+YK+ S+P + + QQ + + +
Sbjct: 629 RFYTGPQVYSFGTGLSYTKFEYKILSAPIRLSLSELLPQQSSHKKQL--QHGEELRYLQL 686
Query: 651 DDV---KCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAG 706
DDV C+ +F ++ V N G++DGS VVM++SK PP ++G KQ+IGY+RV + +
Sbjct: 687 DDVIVNSCESLRFNVRVHVSNTGEIDGSHVVMLFSKMPPVLSGVPEKQLIGYDRVHVRSN 746
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+ + F ++ CK L + ++ ++ G+H + +G+
Sbjct: 747 EMMETVFVIDPCKQLSVANDVGKRVIPLGSHVLFLGD 783
>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
sativus]
Length = 767
Score = 692 bits (1785), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/749 (44%), Positives = 482/749 (64%), Gaps = 28/749 (3%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+C+ L + RA+ LV +TL EK+QQ+ + A +PRLG+P Y+WWSE LHG++ G
Sbjct: 20 YPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG- 78
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PG F+ + ATSFP V++T ASFN +LW IG ++ EARAM+N+G GLT W
Sbjct: 79 -----PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIW 133
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR--------DSDSR 183
+PNIN+ RDPRWGR ETPGEDP V Y+I +VRGLQ ++ H D+
Sbjct: 134 APNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMG 193
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
L +SACCKH+ AYDL+ W R+ FDS VTEQD+ +T+ PF C+ +G S +MCSY
Sbjct: 194 SLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSY 253
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N VNG+P CA+P LL + R DW GYI SDCD++ T+ E K+ DT EDA+A VLKA
Sbjct: 254 NAVNGVPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKY-TDTPEDAIADVLKA 311
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKN 360
G+D++CG + T A+ QGK+ E ++D++L L+ V RLG+FDG+P ++ LG
Sbjct: 312 GMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQ 371
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
++C QH LA EAARQGIVLLKN+N LPL+ I +L ++G AN + ++G Y G P
Sbjct: 372 DVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVP 431
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
C S ++GF Y++ I +A GC D+ C +++ AI AK AD + VAGLD S E E
Sbjct: 432 CSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETE 491
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRV LLLPG Q +L++ VA +K P+ LV++ G +DI+FAK + ++ SILW+G PGE
Sbjct: 492 DLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGE 551
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
GG+A+A+VIFG YNPGGRLP+TWY ++ +P M +R P +PGRTY+F+ G +
Sbjct: 552 AGGKALAEVIFGDYNPGGRLPVTWYPQSFTNVPMNDMHMRPNPSRGYPGRTYRFYTGDRI 611
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV--GTNKPPCAAVLIDDVK-C 655
Y FG GLSYT FKY++ S+PK V++ + R I V G N + + +++V+ C
Sbjct: 612 YGFGEGLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNM---SYMEVEEVESC 668
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+F ++ V N+G+ DGS VVM++S+ P + GT +Q+IG++R+++ QSA+
Sbjct: 669 DLLRFEVKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIM 728
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGE 743
++ C + + D ++ G HTI +G+
Sbjct: 729 VDPCNHVSLADEYGKRVIPLGDHTISLGD 757
>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
Length = 791
Score = 691 bits (1783), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/752 (44%), Positives = 469/752 (62%), Gaps = 34/752 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C+ LP R + L+ +T+ EK+ + D +PRLGLP YEWWSE+LHG++ G
Sbjct: 41 YTFCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNG- 99
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
P +F+ ++ G TSFP VILT A+FN +LW I ++ EARAMYNLG AGLTFW
Sbjct: 100 -----PAVNFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQAGLTFW 154
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDV---------------EGVEY 176
+PNIN++RDPRWGR ETPGEDP VV YAI YV G Q + V
Sbjct: 155 APNINILRDPRWGRGQETPGEDPMVVSAYAIEYVTGFQGLNPKAKKGNRNGYGKKRRVLK 214
Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
D+D L +SACCKH+ AYDL+ W R+ F++ VT+QDM++TF PF C+ +G
Sbjct: 215 EDDNDGERLMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKA 274
Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
S +MCSYN VNG+P CAD +LL++ +R DW F GYI SDCD++ TI E+ K+ T EDA
Sbjct: 275 SCLMCSYNSVNGVPACADKELLDK-VRTDWGFDGYITSDCDAVATIYENQKY-TKTPEDA 332
Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---Q 353
VA LKAG +++CG Y A QQG + E D+D +L++L+ V RLG FDG+P Q
Sbjct: 333 VAVALKAGTNINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQ 392
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
+ N G ++C H+ LA +AARQGIVLLKND LPL+ ++ TLA+VGP AN +
Sbjct: 393 FANFGAQDVCTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMANVSSPG- 451
Query: 414 GNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
G Y G PC+ S +GF+ + YA GC D+ C + + AI K AD ++VAG
Sbjct: 452 GTYSGVPCKLKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGS 511
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
DLS E E DR LLLPG QT L+ +A A+K P+ LV+ G VD++FA+ +P+I SIL
Sbjct: 512 DLSEETEDHDRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASIL 571
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYK 591
WV YPGE GG+A++++IFG NPGG+LP+TWY ++ K+P T M +R P N +PGRTY+
Sbjct: 572 WVAYPGETGGKALSEIIFGYQNPGGKLPMTWYLESFTKVPMTDMNMRADPSNGYPGRTYR 631
Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
F+ G V+Y FG+GLSYT F ++ S+P + + L K + R I + + + +D
Sbjct: 632 FYTGDVLYGFGHGLSYTSFSSQLLSAPSRLSLSLAKSNRKRSI---LAKGRSRLGYIHVD 688
Query: 652 DVK-CKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSA 709
+V+ C KF I V N G MDGS V+M++S+ G KQ++G++RV + A +
Sbjct: 689 EVESCHSSKFFVHISVTNDGDMDGSHVLMLFSRVLQNFQGAPQKQLVGFDRVHVPARKYV 748
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
+ ++ C+ ++ N +LA G HT ++
Sbjct: 749 ETSLLVDPCELFSFANDQGNRILALGEHTFIL 780
>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
Length = 775
Score = 690 bits (1780), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/742 (46%), Positives = 478/742 (64%), Gaps = 30/742 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+C LP R DLV R+TL EK+ Q+ + A +PRLG+P YEWWSE+LHGV G+
Sbjct: 43 FCQTGLPISVRVLDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSESLHGVGSAGK-- 100
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
G F+ + GATSFP VILT A+F+E+LW +IGQ + EAR +YN G A G+TFW+
Sbjct: 101 ----GIFFNGSIAGATSFPQVILTAATFDENLWYRIGQVIGVEARGVYNAGQAIGMTFWA 156
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKISAC 190
PNIN+ RDPRWGR ETPGEDP + G+YAI YVRG+Q G + + L+ SAC
Sbjct: 157 PNINIFRDPRWGRGQETPGEDPIMTGKYAIRYVRGVQGDSFNGGQLKKGH----LQASAC 212
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLD W+ DRF F++ VT QDM +TF PF+ C+ + S +MCSYN VNGIP
Sbjct: 213 CKHFTAYDLDQWKNLDRFSFNAIVTPQDMADTFQPPFQDCIQKAQASGIMCSYNSVNGIP 272
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
+CA+ LL +T R W FHGYI SDCD++Q + ++H++ N T ED+ A LKAG+D+DCG
Sbjct: 273 SCANYNLLTKTARQQWGFHGYITSDCDAVQVMHDNHRYGN-TPEDSTAFALKAGMDIDCG 331
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
DY +T AV + K+++ ID +L L+ + MRLG F+G P+ Y N+ + +C PQH
Sbjct: 332 DYLKKYTKSAVMKKKVSQVHIDRALHNLFSIRMRLGLFNGDPRKQLYGNISPSQVCAPQH 391
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
+LA EAAR GIVLLKN LPL+ +LA++G +AN + GNY+G PC+Y +
Sbjct: 392 QQLALEAARNGIVLLKNTGKLLPLSKAKTNSLAVIGHNANNAYILRGNYDGPPCKYIEIL 451
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
Y+K + Y GC C ++ I A++ A+NAD V++ GLD + E E DR DL
Sbjct: 452 KALVGYAKSVQYQQGCNAANC-TSANIDQAVNIARNADYVVLIMGLDQTQEREQFDRDDL 510
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
+LPG Q LIN VA AAK PV LVI+S G VDI+FAK NPKI SILW GYPGE GG A+A
Sbjct: 511 VLPGQQENLINSVAKAAKKPVILVILSGGPVDISFAKYNPKIGSILWAGYPGEAGGIALA 570
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
++IFG++NPGG+LP+TWY +VKIP T M +R P +PGRTY+F+ GP VY FGYGL
Sbjct: 571 EIIFGEHNPGGKLPVTWYPQAFVKIPMTDMRMRPDPKTGYPGRTYRFYKGPKVYEFGYGL 630
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTF 662
SYT + Y S+ + I+L++ + + N +D++ C+ KF+
Sbjct: 631 SYTTYSYGFHSATPNT-IQLNQLLSVKTVE-----NSDSIRYTFVDEIGSDNCEKAKFSA 684
Query: 663 QIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
+ VEN G+MDG V+++ K G+ IKQ++G++ V + AG+++++ F ++ C+ L
Sbjct: 685 HVSVENSGEMDGKHPVLLFVKQDKARNGSPIKQLVGFQSVSLKAGENSQLVFEISPCEHL 744
Query: 722 KIVDNAANSLLASGAHTILVGE 743
+ ++ G+ ++VG+
Sbjct: 745 SSANEDGLMMIEEGSRYLVVGD 766
>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 796
Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/759 (44%), Positives = 478/759 (62%), Gaps = 32/759 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
S +P+C+ L +RA LV +TLPEK+ Q+ A VPRLG+P YEWWSE+LHG++
Sbjct: 37 FSSYPFCNVSLSIKQRAISLVSLLTLPEKIGQLSTTAASVPRLGIPPYEWWSESLHGLA- 95
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
++ PG F+ + ATSFP VI++ ASFN +LW +IG V+ EARAMYN G AGL
Sbjct: 96 -----DNGPGVSFNGSISAATSFPQVIVSAASFNRTLWYEIGSAVAVEARAMYNGGQAGL 150
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD-----VEGVEYHRDS--- 180
TFW+PNIN+ RDPRWGR ETPGEDP VV Y + +VRG Q+ V + D+
Sbjct: 151 TFWAPNINLFRDPRWGRGQETPGEDPKVVSEYGVEFVRGFQEKKKRKVLKTRFGSDNVDD 210
Query: 181 -------DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNE 233
L +SACCKH+ AYDL+ W R+ F++ VTEQDM++T+ PFE C+ +
Sbjct: 211 DARYDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFNAVVTEQDMEDTYQPPFETCIKD 270
Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
G S +MCSYN VNG+P CA LL Q R +W F GYI SDCD++ TI E + +
Sbjct: 271 GKASCLMCSYNAVNGVPACAQGDLL-QKARVEWGFDGYITSDCDAVATIFEYQGY-TKSP 328
Query: 294 EDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
E+AVA +KAG+D++CG Y T A++QGK++E +D +L L+ V +RLG FDG P+
Sbjct: 329 EEAVADAIKAGVDINCGTYMLRNTQSAIEQGKVSEELVDRALLNLFAVQLRLGLFDGDPR 388
Query: 354 ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
Y LG N+IC+ H +LA EAARQGIVLLKND LPLN ++ +LA+VGP AN
Sbjct: 389 GGHYGKLGSNDICSSDHRKLALEAARQGIVLLKNDYKLLPLNKNHVSSLAIVGPMANNIS 448
Query: 411 AMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
M G Y G PC+ + Y K +YA GC+D+ C +++ A+ AK AD ++V
Sbjct: 449 NMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCVSDTGFGEAVAIAKGADFVIVV 508
Query: 471 AGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
AGLDLS E E KDR L LPG Q +L++ VA +K PV LV+ G VD+ FAK +P+I
Sbjct: 509 AGLDLSQETEDKDRFSLSLPGKQKDLVSSVAAVSKKPVILVLTGGGPVDVTFAKTDPRIG 568
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN--FPGR 588
SI+W+GYPGE GG+A+A++IFG +NPGGRLPITWY ++ +P + M +R ++ +PGR
Sbjct: 569 SIIWIGYPGETGGQALAEIIFGDFNPGGRLPITWYPESFADVPMSDMHMRADSSRGYPGR 628
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
TY+F+ GP VY FG GLSYT+F YK+ S+P + + QQ + + +
Sbjct: 629 TYRFYTGPQVYSFGTGLSYTKFDYKIISAPIRLSLSELLPQQSSHKKQLLQHGEEQLQYI 688
Query: 649 LIDDV---KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIA 704
+DDV C+ +F ++ V N G++DGS V+M++SK + +G KQ+IG++RV I
Sbjct: 689 QLDDVMVNSCESLRFNVRVNVRNTGEIDGSHVLMLFSKMARVLSGVPEKQLIGFDRVHIR 748
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+ + + F ++ CK L + ++ ++ G H + +G+
Sbjct: 749 SNEMMETVFVIDPCKYLSVANDVGKRVIPLGIHALFLGD 787
>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/740 (46%), Positives = 483/740 (65%), Gaps = 27/740 (3%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FP+C+ LP +RA+DLV R+TL EK+ Q+ + A +PRLG+P YEWWSEALHGVS
Sbjct: 40 FPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIPRLGIPGYEWWSEALHGVS---- 95
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
N+ PG HF+ + GATSFP VILT ASF+ W +IGQ + EARA+YN G A G+TF
Sbjct: 96 --NAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGKEARALYNAGQATGMTF 153
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PNIN+ RDPRWGR ETPGEDP V G YA +YV+G+Q G + L+ SAC
Sbjct: 154 WAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQ---GDSFEGGKIKGHLQASAC 210
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLDNW+G +RF FD+RVT QD+ +T+ PF+ CV +G S +MC+YN+VNG+P
Sbjct: 211 CKHFTAYDLDNWKGMNRFVFDARVTMQDLADTYQPPFKSCVEQGRASGIMCAYNKVNGVP 270
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
+CAD LL++T R W F GYI SDCD++ +I+ + + EDAV VLKAG+D++CG
Sbjct: 271 SCADSNLLSKTARAQWGFRGYITSDCDAV-SIIHDDQGYAKSPEDAVVDVLKAGMDVNCG 329
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
Y AV+Q K++E+DID +L L+ V MRLG F+G P+ + N+G + +C+ +H
Sbjct: 330 SYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMRLGLFNGRPEGQLFGNIGPDQVCSQEH 389
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
LA EAAR GIVLLKN LPL+ K+LA++GP+AN+ + ++GNY G PCR+ +P+
Sbjct: 390 QILALEAARNGIVLLKNSARLLPLSKSKTKSLAVIGPNANSGQMLLGNYAGPPCRFVTPL 449
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
+Y K Y P C + C + S + A+D AK AD V++ GLD + E E DR DL
Sbjct: 450 QALQSYIKQTVYHPACDTVQCSSAS-VDRAVDVAKGADNVVLMMGLDQTQEREELDRTDL 508
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q ELI VA AAK PV LV+ S G VDI+FAKN+ I SILW GYPGE G A+A
Sbjct: 509 LLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDISFAKNDKNIGSILWAGYPGEGGAIALA 568
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGL 605
+++FG +NPGGRLP+TWY +VK+P T M +RP + +PGRTY+F+ G V+ FGYG+
Sbjct: 569 EIVFGDHNPGGRLPMTWYPQEFVKVPMTDMGMRPEASSGYPGRTYRFYRGRSVFEFGYGI 628
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK---CKDYKFTF 662
SY+++ Y++ + ++ + L++ IN + + LI ++ C+ K
Sbjct: 629 SYSKYSYELTAVSQNT-LYLNQSSTMHIIN-----DFDSVRSTLISELGTEFCEQNKCRA 682
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
+I V+N G+M G V+++++ G KQ+IG++ V + AG+ A++ F ++ C+ L
Sbjct: 683 RIGVKNHGEMAGKHPVLLFARQEKHGNGRPRKQLIGFQSVVLGAGERAEIEFEVSPCEHL 742
Query: 722 KIVDNAANSLLASGAHTILV 741
+ ++ G H ++V
Sbjct: 743 SRANEDGLMVMEEGRHFLVV 762
>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 726
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/749 (47%), Positives = 482/749 (64%), Gaps = 49/749 (6%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+CD L R DLV R+TL EKV Q+ + A +PRL +P YEWW E LHGV+ +
Sbjct: 3 FCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAHVS--- 59
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
F +P ATSFP ILTTASFN+ LW +IGQ STEARA YN G AGLT+WSP
Sbjct: 60 -------FGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIAGLTYWSP 112
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
IN+ RDPRWGR+ ET GEDPY YA ++V+G+Q EG D++S+ LK+SACCKH
Sbjct: 113 VINIARDPRWGRIQETSGEDPYTTSAYATHFVQGMQ--EG-----DANSKRLKLSACCKH 165
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+ AYD+DNWEG DR+HFD++ ++ +T+ PF+ CV EG +S+MCSYN+VNG+PTCA
Sbjct: 166 FTAYDVDNWEGIDRYHFDAKA---NLADTYNPPFQSCVQEGRSASLMCSYNKVNGVPTCA 222
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
+ L T+R W +GYIVSDCDS+ + ES + T EDA A L AGLDL+CGDY
Sbjct: 223 NYDFLENTVRRAWGLNGYIVSDCDSVLVMHESTNYA-PTTEDAAADALNAGLDLNCGDYL 281
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIEL 370
++T GAV GK+ + +D ++ +++V MRLG FDG+P ++ N+G ++C P H EL
Sbjct: 282 ASYTEGAVAMGKVNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGVADVCTPAHQEL 341
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAARQGIVLLKND LPL + NI T A++GP+ANAT M+GNYEG PC+Y +P+ G
Sbjct: 342 AVEAARQGIVLLKNDGNILPL-SKNINT-AVIGPNANATHTMLGNYEGIPCQYITPLQGL 399
Query: 431 YA-----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
Y KV ++ GC + CQ + I +A+ A ADA V+V GL E+E DR
Sbjct: 400 VKFGSGDYHKVW-FSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQVQESEALDRT 458
Query: 486 DLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
LLLPG+Q LI++VA AA G PV LV+M AG VDINFAKN+ +I+SILWVGYPG+ GG+
Sbjct: 459 SLLLPGYQQTLIDEVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILWVGYPGQSGGQ 518
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
AIA+VIFG +NPGG+LP++WY +Y KI T+M +RP +N+PGRTY+F+ G +Y FG
Sbjct: 519 AIAEVIFGAHNPGGKLPMSWYPEDYTKISMTNMNMRPDSRSNYPGRTYRFYTGEKIYDFG 578
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
YGLSYT++K+ A +P +V Q C + G+ C F
Sbjct: 579 YGLSYTEYKHSFALAPTTVMTPSIHSQLCDPHQTSAGSK------------TCSSSNFDV 626
Query: 663 QIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
I VEN+G M G+ ++++ P G GT +KQ+ ++ V+I +G KV T+N C+
Sbjct: 627 HINVENIGAMAGNHTLLLFFTAPSAGKNGTPLKQLAAFDSVYIRSGSQEKVVLTLNPCQH 686
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVS 749
L V +L +G H + VG+ +S
Sbjct: 687 LGTVAEDGTRMLEAGNHILSVGDAKHSLS 715
>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
Length = 745
Score = 683 bits (1762), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/740 (45%), Positives = 474/740 (64%), Gaps = 51/740 (6%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FP+C LP +RA DLV R+TL EK+ Q+ + A +PRLG+P Y+WWSEALHGV++ G
Sbjct: 40 FPFCKTTLPISQRANDLVSRLTLEEKISQLVNSAQPIPRLGIPGYQWWSEALHGVAYAG- 98
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
PG F+ + ATSFP VIL+ ASF+ + W +I Q + EARA+YN G A G+TF
Sbjct: 99 -----PGIRFNGTIKRATSFPQVILSAASFDANQWYRISQAIGKEARALYNAGQATGMTF 153
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PNIN+ RDPRWGR ETPGEDP + G+YA++YVRGLQ G + PL+ SAC
Sbjct: 154 WAPNINIFRDPRWGRGQETPGEDPLMTGKYAVSYVRGLQ---GDSFKGGEIKGPLQASAC 210
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDL+NW G R+ FD+ VT QD+ +T+ PF+ CV EG S +MC+YNRVNGIP
Sbjct: 211 CKHFTAYDLENWNGTSRYVFDAYVTAQDLADTYQPPFKSCVEEGRASGIMCAYNRVNGIP 270
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CAD L++T R W F GYI SDCD++ I ++ + T EDAV VLKAG+D++CG
Sbjct: 271 NCADSNFLSRTARAQWGFDGYIASDCDAVSIIHDAQGYAK-TPEDAVVAVLKAGMDVNCG 329
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQH 367
Y T AV Q K+ ++ID +L L+ V MRLG F+G+P Q+ N+G + +C+ ++
Sbjct: 330 SYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMRLGLFNGNPTGQQFGNIGPDQVCSQEN 389
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
LA +AAR GIVLLKN G LPL+ +LA++GP+AN+ + ++GNY G PC+ +P+
Sbjct: 390 QILALDAARNGIVLLKNSAGLLPLSKSKTMSLAVIGPNANSVQTLLGNYAGPPCKLVTPL 449
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
+Y K PGC + C + S++ A++ AK AD V++ GLD + E EG DR DL
Sbjct: 450 QALQSYIKHTIPYPGCDSVQCSSASIV-GAVNVAKGADHVVLIMGLDDTQEKEGLDRRDL 508
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
+LPG Q ELI VA AAK PV LV++S G VDI+FAKN+ I SILW GYPGE G A+A
Sbjct: 509 VLPGKQQELIISVAKAAKNPVVLVLLSGGPVDISFAKNDKNIGSILWAGYPGEAGAIALA 568
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGL 605
++IFG +NPGG+LP+TWY +VK+P T M +RP + +PGRTY+F+ GP V+ FGYGL
Sbjct: 569 EIIFGDHNPGGKLPMTWYPQEFVKVPMTDMRMRPETSSGYPGRTYRFYKGPTVFEFGYGL 628
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
SY+++ Y++ A+ I + +C++ KF +
Sbjct: 629 SYSKYTYEL-------------------------------RAIYIGEEQCENIKFKVTVS 657
Query: 666 VENMGKMDGSEVVMVYSK--PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
V+N G+M G V+++++ PG G IK+++G++ V + AG+ ++ + ++ C+ L
Sbjct: 658 VKNEGQMAGKHPVLLFARHAKPG-KGRPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSS 716
Query: 724 VDNAANSLLASGAHTILVGE 743
+ ++ G+ +LVG+
Sbjct: 717 ANEDGVMVMEEGSQILLVGD 736
>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 757
Score = 682 bits (1760), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/757 (47%), Positives = 476/757 (62%), Gaps = 79/757 (10%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
L+ F +C+ L +R DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS
Sbjct: 50 SLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVS 109
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIG----QTVSTEARAMYNL 123
++G PGTHF S VPGATSFP VILT ASFN SL+ IG Q VSTEARAMYN+
Sbjct: 110 YVG------PGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVISQVVSTEARAMYNV 163
Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
G AGLTFWSPNIN+ RDPRWGR ETPGEDP + +Y YV+GLQ + D +
Sbjct: 164 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRD------DGNPD 217
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDS-RVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
LK++ACCKHY AYDLDNW+G DR+HF++ VT+QDM +TF PF+ CV +G+V+SVMCS
Sbjct: 218 GLKVAACCKHYTAYDLDNWKGVDRYHFNAVVVTKQDMDDTFQPPFKSCVVDGNVASVMCS 277
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
YN+VNGIPTCADP LL+ IRG+W +G YIV+DCDSI S + T E+A A+
Sbjct: 278 YNKVNGIPTCADPDLLSGVIRGEWKLNGYVYIVTDCDSIDVFYNSQHY-TKTPEEAAAKA 336
Query: 301 LKA--GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YK 355
+ A GLDL+CG + T AV G + E+ ID ++ + LMRLG+FDG P Y
Sbjct: 337 ILAGIGLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYG 396
Query: 356 NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
LG ++C ++ ELA EAARQGIVLLKN
Sbjct: 397 KLGPKDVCTAENQELAREAARQGIVLLKN------------------------------- 425
Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
GTPC+YT+P+ G A Y PGC+++ C + + + A A ADATV+V G DL
Sbjct: 426 -TGTPCKYTTPLQGLAALVAT-TYLPGCSNVAC-STAQVDDAKKIAAAADATVLVMGADL 482
Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
S+EAE +DRVD+LLPG Q LI VA+A+ GPV LVIMS G +D++FAK N KI SILWV
Sbjct: 483 SIEAESRDRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWV 542
Query: 536 GYPGEEGGRAIADVIFGKYN------PGGRLPITWYEANYV-KIPYTSMPLR--PVNNFP 586
GYPGE GG AIAD+IFG YN PGGRLP+TWY +YV K+P T+M +R P N +P
Sbjct: 543 GYPGEAGGAAIADIIFGSYNPSTHQPPGGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYP 602
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY+F+ G VY FG GLSY++F +++ +P V + L+++ C C
Sbjct: 603 GRTYRFYTGETVYSFGDGLSYSEFSHELTQAPGLVSVPLEENHVCY---------SSECK 653
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
+V + C++ F + ++N G GS V ++S PP + + K ++G+E+VF+ A
Sbjct: 654 SVAAAEQTCQN--FDVHLRIKNTGTTSGSHTVFLFSTPPSVHNSPQKHLVGFEKVFLHAQ 711
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+ VGF ++ CK L +VD + +A G H + +G
Sbjct: 712 TDSHVGFKVDVCKDLSVVDELGSKKVALGEHVLHIGS 748
>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
Length = 770
Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/744 (46%), Positives = 476/744 (63%), Gaps = 28/744 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +P+C+A LP+P RA+ LV +TL EK+ Q+ + A G PRLG+P +EWWSE+LHGV
Sbjct: 36 SAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGV--- 92
Query: 70 GRRTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
++ PG +F S V AT FP VIL+ A+FN SLW+ + ++ EARAM+N G AGL
Sbjct: 93 ---CDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGL 149
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFW+PNINV RDPRWGR ETPGEDP VV Y++ YV+G Q G E + +S
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MMLS 202
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDL+ W G R+ F+++V QDM++T+ PF+ C+ EG S +MCSYN+VNG
Sbjct: 203 ACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNG 262
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P CA +L Q R +W F GYI SDCD++ I E+ + + ED++A VLKAG+D++
Sbjct: 263 VPACARKDIL-QRARDEWGFQGYITSDCDAVAIIHENQTY-TASDEDSIAVVLKAGMDIN 320
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T A+++GK+ E DI+ +L L+ V +RLG+FD + + + LG NN+C
Sbjct: 321 CGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTT 380
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H ELAAEA RQG VLLKNDNG LPL + +AL+GP AN + G+Y G PC T+
Sbjct: 381 EHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTT 440
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
+ G AY +A GC D+ C + AI+AAK AD V++AGL+L+ E E DRV
Sbjct: 441 FVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRV 500
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q +LI+ VA K PV LV+M G VD++FAK++P+I SILW+GYPGE GG
Sbjct: 501 SLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNV 560
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
+ +++FGKYNPGG+LPITWY ++ +P M +R +PGRTY+F+ G VVY FGY
Sbjct: 561 LPEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGY 620
Query: 604 GLSYTQFKYKVASSPKSVDIKLDK--DQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKF 660
GLSY+++ Y + +PK + + D R Y T + V ++D+ C+ +F
Sbjct: 621 GLSYSKYSYSILQAPKKISLSRSSVPDLISRKPAY---TRRDGVDYVQVEDIASCEALQF 677
Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
I V N G MDGS V+++ S P G+ IKQ++G+ERV AAG+S V T++ CK
Sbjct: 678 PVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCK 737
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
+ + +L G H ++VG+
Sbjct: 738 LMSFANTEGTRVLFLGTHVLMVGD 761
>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 776
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/743 (46%), Positives = 474/743 (63%), Gaps = 28/743 (3%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+C+ KLP +R KDLV R+TL EK+ Q+ + A +PRLG+P YEWWSEALHG+ +GR
Sbjct: 42 YPFCNPKLPITQRTKDLVSRLTLDEKLAQLVNSAPPIPRLGIPAYEWWSEALHGIGNVGR 101
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G F+ + ATSFP VILT ASF+ LW +IGQ + EARA+YN G A G+TF
Sbjct: 102 ------GIFFNGSITSATSFPQVILTAASFDSHLWYRIGQAIGVEARAIYNGGQAMGMTF 155
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PNIN+ RDPRWGR ET GEDP + YA++YVRGLQ G + L+ SAC
Sbjct: 156 WAPNINIFRDPRWGRGQETAGEDPMMTSNYAVSYVRGLQ---GDSFQGGKLRGHLQASAC 212
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLDNW+G +RFHFD+RV+ QD+ +T+ PF C+ +G S +MC+YNRVNGIP
Sbjct: 213 CKHFTAYDLDNWKGVNRFHFDARVSLQDLADTYQPPFRSCIEQGRASGIMCAYNRVNGIP 272
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
+CAD LL T+R W FHGYIVSDC ++ I + + + EDAVA VL AG+DL+CG
Sbjct: 273 SCADFNLLTNTVRKQWEFHGYIVSDCGAVGIIHDEQGYAK-SAEDAVADVLHAGMDLECG 331
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
Y T+ AVQQ K+ ID +L L+ + +RLG FDG+P + +G N++C+ H
Sbjct: 332 SYLTDHAKSAVQQKKLPIVRIDRALHNLFSIRIRLGQFDGNPAKLPFGMIGPNHVCSENH 391
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK-AMIGNYEGTPCRYTSP 426
+ LA EAAR GIVLLKN LPL +I +LA++GP+ANA+ ++GNY G PC+ +
Sbjct: 392 LYLALEAARNGIVLLKNTASLLPLPKTSI-SLAVIGPNANASPLTLLGNYAGPPCKSITI 450
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ GF Y K + PGC ++ I A+ AKNAD V+V GLD SVE E +DRV
Sbjct: 451 LQGFQHYVKNAVFHPGCDGGPKCASAPIDKAVKVAKNADYVVLVMGLDQSVEREERDRVH 510
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
L LPG Q ELIN VA A+K PV LV++ G +DI+ AKNN KI I+W GYPGE GG A+
Sbjct: 511 LDLPGKQLELINSVAKASKRPVILVLLCGGPIDISSAKNNDKIGGIIWAGYPGELGGIAL 570
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
A +IFG +NPGGRLPITWY +Y+K+P T M +R P +PGRTY+F+ GP VY FG+G
Sbjct: 571 AQIIFGDHNPGGRLPITWYPKDYIKVPMTDMRMRADPTTGYPGRTYRFYKGPTVYEFGHG 630
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI---DDVKCKDYKFT 661
LSYT++ Y+ V + DK + + + N L+ D+ CK +
Sbjct: 631 LSYTKYSYEF------VSVTHDKLHFNQSSTHLMTENSETIRYKLVSELDEETCKSMSVS 684
Query: 662 FQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
+ V+N G + G ++++ +P + +KQ++G+ + + AG+ + VGF ++ C+
Sbjct: 685 VTVGVKNHGNIVGRHPILLFMRPQKHRTRSPMKQLVGFHSLLLDAGEMSHVGFELSPCEH 744
Query: 721 LKIVDNAANSLLASGAHTILVGE 743
L + A ++ G+H + VGE
Sbjct: 745 LSRANEAGLKIIEEGSHLLHVGE 767
>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
Length = 780
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/729 (46%), Positives = 462/729 (63%), Gaps = 19/729 (2%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C+ LP RA+ L+ +TL EK+QQ+ D A G+PRLG+P YEWWSE+LHG+S G
Sbjct: 40 YSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGIPRLGIPHYEWWSESLHGISING- 98
Query: 72 RTNSPPGTHFDSEVP--GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
PG F + P AT FP VI++ ASFN +LW IG ++ EARAMYN+G AGLT
Sbjct: 99 -----PGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSAIAIEARAMYNVGQAGLT 153
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
FW+PNIN+ RDPRWGR ETPGEDP V YAI +V+G Q + + L +SA
Sbjct: 154 FWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHWKNEDGEINDDKLMLSA 213
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH AYDL+ W R+ F++ VTEQDM++T+ PF C+ +G S +MCSYN VNG+
Sbjct: 214 CCKHSTAYDLEKWGNFSRYSFNAVVTEQDMEDTYQPPFRSCIQKGKASCLMCSYNEVNGV 273
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CA LL Q R +W F GYI SDCD++ TI E + + + EDAVA LKAG+D++C
Sbjct: 274 PACAREDLL-QKPRTEWGFKGYITSDCDAVATIFEYQNY-SKSPEDAVAIALKAGMDINC 331
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQ 366
G Y AV++GK+ E DID +L L+ V +RLG FDG P Q+ LG N+C +
Sbjct: 332 GTYVLRNAQSAVEKGKLQEEDIDRALHNLFSVQLRLGLFDGDPRKGQFGKLGPKNVCTKE 391
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA EAARQGIVLLKND LPLN + +LA++GP AN ++ G+Y G PC S
Sbjct: 392 HKTLALEAARQGIVLLKNDKKLLPLNKKAVSSLAIIGPLANMANSLGGDYTGYPCDPQSL 451
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+G AY K +YA GC D+ C +++ AI AK AD +IVAGLDLS E E DRV
Sbjct: 452 FEGLKAYVKKTSYAIGCLDVACVSDTQFHKAIIVAKRADFVIIVAGLDLSQETEEHDRVS 511
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q L++ VA A+K PV LV+ G +D++FAK +P+I SILW+GYPGE G +A+
Sbjct: 512 LLLPGKQMSLVSSVAAASKKPVILVLTGGGPLDVSFAKGDPRIASILWIGYPGEAGAKAL 571
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
A++IFG+YNPGGRLP+TWY ++ ++ T M +R P +PGRTY+F+ G VY FG G
Sbjct: 572 AEIIFGEYNPGGRLPMTWYPESFTEVSMTDMNMRPNPSRGYPGRTYRFYTGNRVYGFGGG 631
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKFTFQ 663
LSYT F YK+ S+P + + R G + + + I+++ C +F Q
Sbjct: 632 LSYTNFTYKILSAPSKLSLSGSLSSNSRKRILQQGGER--LSYININEITSCDSLRFYMQ 689
Query: 664 IEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
I VEN+G MDG VVM++S+ P + G KQ++G++RV + +S ++ ++ C+ L
Sbjct: 690 ILVENVGNMDGGHVVMLFSRVPTVFRGAPEKQLVGFDRVHTISHRSTEMSILVDPCEHLS 749
Query: 723 IVDNAANSL 731
+ + +
Sbjct: 750 VANEQGKKI 758
>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
Length = 783
Score = 678 bits (1750), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/745 (44%), Positives = 472/745 (63%), Gaps = 20/745 (2%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +P+C+ LP R L+ +TL +K+ Q+ + A + LG+P Y+WWSEALHG++
Sbjct: 38 SHYPFCNISLPISTRTTSLISLLTLSDKINQLSNTASSISHLGIPSYQWWSEALHGIATN 97
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PG +F+ V AT+FP VI++ A+FN SLW IG V E RAM+N+G AGL+
Sbjct: 98 G------PGVNFNGSVKSATNFPQVIVSAAAFNRSLWFLIGYAVGVEGRAMFNVGQAGLS 151
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY---HRDSDSRPLK 186
FW+PN+NV RDPRWGR ETPGEDP V YA+ +VRG+Q V+G++ DSD L
Sbjct: 152 FWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVEFVRGIQGVDGIKKVLNDHDSDDDGLM 211
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
+SACCKH+ AYDL+ W R++F++ VT+QD+++T+ PF CV +G S +MCSYN V
Sbjct: 212 VSACCKHFTAYDLEKWGEFSRYNFNAVVTQQDLEDTYQPPFRGCVQQGKASCLMCSYNEV 271
Query: 247 NGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
NG+P CA LL +R W F GYI SDCD++ T+ E K+ + EDAVA VLKAG+D
Sbjct: 272 NGVPACASKDLLG-LVRNKWGFEGYIASDCDAVATVFEYQKYAK-SAEDAVADVLKAGMD 329
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNIC 363
++CG + T A++QG + E D+D +L L+ V MRLG F+G P+ + LG ++C
Sbjct: 330 INCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPEKGKFGKLGPQDVC 389
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
P+H +LA EAARQGIVLLKNDN LPL+ + +LA++GP A T + G Y G PC
Sbjct: 390 TPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMAT-TSELGGGYSGIPCSP 448
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
S DG Y K I+YA GC+D+ C ++ AID AK AD VIVAGLD ++E E D
Sbjct: 449 RSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIVAGLDTTLETEDLD 508
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
RV LLLPG Q +L+++VA A+K PV LV+ G +D++FA++N I SILW+GYPGE GG
Sbjct: 509 RVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLITSILWIGYPGEAGG 568
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPF 601
+A+A++IFG++NP GRLP+TWY ++ +P M +R P +PGRTY+F+ G +Y F
Sbjct: 569 KALAEIIFGEFNPAGRLPMTWYPESFTNVPMNDMGMRADPSRGYPGRTYRFYTGSRIYGF 628
Query: 602 GYGLSYTQFKYKVASSPKSVDI-KLDKDQQCRDINYTVGTNKPPCAAVLIDDVK-CKDYK 659
G+GLSY+ F Y+V S+P + + K R + V + V +D+++ C
Sbjct: 629 GHGLSYSDFSYRVLSAPSKLSLSKTTNGGLRRSLLNKVEKDVFEVDHVHVDELQNCNSLS 688
Query: 660 FTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
F+ I V N+G MDGS VVM++SK P I G+ Q++G R+ + +S + + C
Sbjct: 689 FSVHISVMNVGDMDGSHVVMLFSKWPKNIQGSPESQLVGPSRLHTVSNKSIETSILADPC 748
Query: 719 KSLKIVDNAANSLLASGAHTILVGE 743
+ D +L G H + VG+
Sbjct: 749 EHFSFADEQGKRILPLGNHILNVGD 773
>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
Length = 791
Score = 677 bits (1747), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/754 (44%), Positives = 461/754 (61%), Gaps = 39/754 (5%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
P+C+ KLP +RA DLV RMT EK Q+GD+A GVPRLG+P Y+WW+EALHGV+ G+
Sbjct: 60 LPFCNMKLPASQRAADLVSRMTPAEKASQLGDIANGVPRLGVPSYKWWNEALHGVAISGK 119
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G H + V ATSFP V+ T ASFN++LW +IGQ EARA YN+G A GLT
Sbjct: 120 ------GIHMNQGVRSATSFPQVLHTAASFNDNLWFRIGQATGKEARAFYNIGQAEGLTM 173
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP V RY +VRGLQ G + S L+ SAC
Sbjct: 174 WSPNVNIFRDPRWGRGQETPGEDPAVASRYGAAFVRGLQ---GSSSNTKSVPPVLQTSAC 230
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH AYDL++W+G R+ F + VT QD+ +TF PF CV +G S VMC+Y VNG+P
Sbjct: 231 CKHATAYDLEDWKGVSRYSFKATVTIQDLADTFNPPFRSCVVDGKASCVMCAYTIVNGVP 290
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
+CA+ LL +T RG W GY+ +DCD++ I+ + +F T ED VA LKAGLD+DCG
Sbjct: 291 SCANGDLLTKTFRGSWGLDGYVAADCDAV-AIMRNSQFYRPTAEDTVAATLKAGLDIDCG 349
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
Y + M A+Q+GK+ + D+D +++ L MRLG+FDG P+ Y NLG +IC +H
Sbjct: 350 PYIQQYAMAAIQKGKLTQQDVDKAVKNLLTTRMRLGHFDGDPKTNVYGNLGAGHICTAEH 409
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
LA EAA GIVLLKN G LPL G + + A++G +AN A++GNY G PC T+P+
Sbjct: 410 KNLALEAALDGIVLLKNSAGVLPLKRGTVNSAAVIGHNANDVLALLGNYWGPPCAPTTPL 469
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y K + + GC C N + P A A ++DA ++ GL E+EGKDR L
Sbjct: 470 QGIQGYVKNVKFLAGCNKAAC-NVAATPQATALASSSDAVILFMGLSQEQESEGKDRTTL 528
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q LIN VA+AAK PV LV+++ G VDI FA+ NPKI +ILW GYPG+ GG AIA
Sbjct: 529 LLPGNQQSLINAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGGLAIA 588
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
V+FG+ NP G+LP TWY + +IP T M +R ++PGRTY+F++G +Y FGYGLSY
Sbjct: 589 KVLFGEKNPSGKLPNTWYPEEFTRIPMTDMRMRAAGSYPGRTYRFYNGKTIYKFGYGLSY 648
Query: 608 TQFKYKVASSPKSVD----------IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
++F ++V + K+ + +D + + I DV C
Sbjct: 649 SKFSHRVVTGRKNPAHNTSLLAAGLAAMTEDNLSYHVEH-------------IGDVVCDQ 695
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
KF ++V+N G +DG +++ + P G +Q+IG++ I AG+ A + F ++
Sbjct: 696 LKFLAVVKVQNHGPIDGKHTALMFLRWPSATDGRPTRQLIGFQSQHIKAGEKANLRFEVS 755
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
C+ V ++ G+H + VG+ +SF
Sbjct: 756 PCEHFSRVRQDGRKVIDKGSHFLKVGKHELEISF 789
>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
Length = 780
Score = 677 bits (1746), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/757 (46%), Positives = 468/757 (61%), Gaps = 36/757 (4%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +C+ +LP +RA DLV R+TL EK+ Q+GD + V RLG+P Y+WWSEALHGVS
Sbjct: 42 SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWWSEALHGVSNA 101
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
GR G H D + ATSFP VILT ASFN LW +IGQ + TEARA+YN G A GL
Sbjct: 102 GR------GIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQAEGL 155
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFW+PNINV RDPRWGR ETPGEDP V G+YA +VRG+Q G +S L+ S
Sbjct: 156 TFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQ---GYALAGAINSTDLEAS 212
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDL+NW+G R+ FD++VT QD+ +T+ PF CV +G S +MCSYNRVNG
Sbjct: 213 ACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNG 272
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+PTCAD LL++T RGDW F+GYI SDCD++ I + + T EDAVA VLKAG+D++
Sbjct: 273 VPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAK-TAEDAVADVLKAGMDVN 331
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK---NLGKNNICNP 365
CG Y + A+QQGKI E DI+ +L L+ V MRLG F+G+P+Y N+G + +C
Sbjct: 332 CGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQ 391
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA EAA+ G+VLLKND ALPL+ + ++A++G +AN ++GNY G PC +
Sbjct: 392 EHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISVT 451
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ Y K + GC C N S I A A + D V+ GLD E E DR+
Sbjct: 452 PLQVLQGYVKDTRFLAGCNSAAC-NVSSIGEAAQLASSVDYVVLFMGLDQDQEREEVDRL 510
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+L LPG Q LIN VA+AAK PV LV++ G VD+ FAK NPKI +ILW GYPGE GG A
Sbjct: 511 ELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIA 570
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IA V+FG++NPGGRLP+TWY + +P T M +R P +PGRTY+F+ G VY FGY
Sbjct: 571 IAQVLFGEHNPGGRLPVTWYPKEFTSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGY 630
Query: 604 GLSYTQFKYKVAS------SPKSVD-IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
GLSY+++ + + S S+D +K ++Y V P C
Sbjct: 631 GLSYSKYSHHFVANGTKLPSLSSIDGLKAMATAAAGTVSYDVEEIGPE---------TCD 681
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA---GTHIKQVIGYERVFIAAGQSAKVGF 713
KF + V+N G MDG V+++ + P A G Q+IG++ + + + Q+ V F
Sbjct: 682 KLKFPALVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEF 741
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
++ CK ++ G+H ++VG+ +SF
Sbjct: 742 EVSPCKHFSRATEDGKKVIDHGSHFMMVGDDEFEMSF 778
>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
Length = 774
Score = 676 bits (1745), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/757 (46%), Positives = 468/757 (61%), Gaps = 36/757 (4%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +C+ +LP +RA DLV R+TL EK+ Q+GD + V RLG+P Y+WWSEALHGVS
Sbjct: 36 SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWWSEALHGVSNA 95
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
GR G H D + ATSFP VILT ASFN LW +IGQ + TEARA+YN G A GL
Sbjct: 96 GR------GIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQAEGL 149
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFW+PNINV RDPRWGR ETPGEDP V G+YA +VRG+Q G +S L+ S
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQ---GYALAGAINSTDLEAS 206
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDL+NW+G R+ FD++VT QD+ +T+ PF CV +G S +MCSYNRVNG
Sbjct: 207 ACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNG 266
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+PTCAD LL++T RGDW F+GYI SDCD++ I + + T EDAVA VLKAG+D++
Sbjct: 267 VPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAK-TAEDAVADVLKAGMDVN 325
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK---NLGKNNICNP 365
CG Y + A+QQGKI E DI+ +L L+ V MRLG F+G+P+Y N+G + +C
Sbjct: 326 CGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQ 385
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA EAA+ G+VLLKND ALPL+ + ++A++G +AN ++GNY G PC +
Sbjct: 386 EHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISVT 445
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ Y K + GC C N S I A A + D V+ GLD E E DR+
Sbjct: 446 PLQVLQGYVKDTRFLAGCNSAAC-NVSSIGEAAQLASSVDYVVLFMGLDQDQEREEVDRL 504
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+L LPG Q LIN VA+AAK PV LV++ G VD+ FAK NPKI +ILW GYPGE GG A
Sbjct: 505 ELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIA 564
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IA V+FG++NPGGRLP+TWY + +P T M +R P +PGRTY+F+ G VY FGY
Sbjct: 565 IAQVLFGEHNPGGRLPVTWYPKEFTSVPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGY 624
Query: 604 GLSYTQFKYKVAS------SPKSVD-IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
GLSY+++ + + S S+D +K ++Y V I C
Sbjct: 625 GLSYSKYSHHFVANGTKLPSLSSIDGLKAMATAAAGTVSYDVEE---------IGTETCD 675
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA---GTHIKQVIGYERVFIAAGQSAKVGF 713
KF + V+N G MDG V+++ + P A G Q+IG++ + + + Q+ V F
Sbjct: 676 KLKFPALVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEF 735
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
++ CK ++ G+H ++VG+ +SF
Sbjct: 736 EVSPCKHFSRATEDGKKVIDHGSHFMMVGDDEFEMSF 772
>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 772
Score = 676 bits (1744), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/744 (46%), Positives = 476/744 (63%), Gaps = 34/744 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+C+ KLP P+R KDL+ R+TL EK+ Q+ + A +PRLG+P Y+WWSEALHGVS +G
Sbjct: 38 YPFCNPKLPIPQRTKDLLSRLTLDEKLSQLVNTAPPIPRLGIPAYQWWSEALHGVSGVG- 96
Query: 72 RTNSPPGTHFD--SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
PG FD S + ATSFP VILT ASF+ LW +IG + EARA++N G A GL
Sbjct: 97 -----PGILFDNNSTISSATSFPQVILTAASFDSRLWYRIGHAIGIEARAIFNAGQANGL 151
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFW+PNIN+ RDPRWGR ET GEDP + RYA+++VRGLQ H L S
Sbjct: 152 TFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSFVRGLQGDSFKGAH-------LLAS 204
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDLDNW+G DRF FD+RV+ QD+ +T+ PF+ CV +G S +MC+YNRVNG
Sbjct: 205 ACCKHFTAYDLDNWKGVDRFVFDARVSLQDLADTYQPPFQSCVQQGRASGIMCAYNRVNG 264
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P CAD LL QT R W+F+GYI SDC ++ I + ++ + ED VA VL+AG+DL+
Sbjct: 265 VPNCADYGLLTQTARNQWDFNGYITSDCGAVGFIHDRQRYAK-SPEDVVADVLRAGMDLE 323
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNP 365
CG Y T AV Q K+ ++ID +L+ L+ + MRLG FDG+P + +G N++C+
Sbjct: 324 CGSYLTYHAKSAVLQKKLGMSEIDRALQNLFSIRMRLGLFDGNPTRLSFGLIGSNHVCSK 383
Query: 366 QHIELAAEAARQGIVLLKNDNGALPL-NTGNIKTLALVGPHANATK-AMIGNYEGTPCRY 423
+H LA EAAR GIVLLKN LPL T +LA++GP+AN++ ++GNY G PC+Y
Sbjct: 384 EHQYLALEAARNGIVLLKNSPTLLPLPKTSPSISLAVIGPNANSSPLTLLGNYAGPPCKY 443
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
+ + GF Y K Y PGC +++ I A++ AK D V+V GLD S E E +D
Sbjct: 444 VTILQGFRHYVKNAFYHPGCDGGPKCSSAQIDQAVEVAKKVDYVVLVMGLDQSEEREERD 503
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
RV L LPG Q ELIN VA+A+K PV LV++S G +DI AK N KI ILW GYPGE GG
Sbjct: 504 RVHLDLPGKQLELINGVAEASKKPVILVLLSGGPLDITSAKYNHKIGGILWAGYPGELGG 563
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPF 601
A+A +IFG +NPGGRLP TWY +Y+K+P T M +R P +PGRTY+F+ GP VY F
Sbjct: 564 IALAQIIFGDHNPGGRLPTTWYPKDYIKVPMTDMRMRADPSTGYPGRTYRFYKGPKVYEF 623
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI---DDVKCKDY 658
GYGLSY+++ Y+ V + DK + + + N + L+ D+ C+
Sbjct: 624 GYGLSYSKYSYEF------VSVTHDKLHFNQSSTHLMVENSETISYKLVSELDEQTCQSM 677
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
+ + V+N G M G V+++ +P +G+ +KQ++G+E V + AG+ A V F ++
Sbjct: 678 SLSVTVRVQNHGSMVGKHPVLLFIRPKRQKSGSPVKQLVGFESVMLDAGEMAHVEFEVSP 737
Query: 718 CKSLKIVDNAANSLLASGAHTILV 741
C+ L + A ++ G+H +LV
Sbjct: 738 CEHLSRANEAGAMIIEEGSHMLLV 761
>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
Length = 732
Score = 674 bits (1738), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/732 (45%), Positives = 468/732 (63%), Gaps = 32/732 (4%)
Query: 34 LPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPT 93
LPEK+ Q+ + A VPRLG+P YEWWSE+LHG++ ++ PG F+ + ATSFP
Sbjct: 2 LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLA------DNGPGVSFNGSISAATSFPQ 55
Query: 94 VILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGED 153
VI++ ASFN +LW +IG V+ E RAMYN G AGLTFW+PNINV RDPRWGR ETPGED
Sbjct: 56 VIVSAASFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGED 115
Query: 154 PYVVGRYAINYVRGLQDVEGVEYHRDSDSR-------------PLKISACCKHYAAYDLD 200
P VV Y + +VRG Q+ + + + S L +SACCKH+ AYDL+
Sbjct: 116 PKVVSEYGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLE 175
Query: 201 NWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
W R+ F++ VTEQDM++T+ PFE C+ +G S +MCSYN VNG+P CA LL Q
Sbjct: 176 KWGNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLL-Q 234
Query: 261 TIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA 320
R +W F GYI SDCD++ TI +++ + E+AVA +KAG+D++CG Y T A
Sbjct: 235 KARVEWGFEGYITSDCDAVATIF-AYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSA 293
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIELAAEAARQ 377
++QGK++E +D +L L+ V +RLG FDG P QY LG N+IC+ H +LA EA RQ
Sbjct: 294 IEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQ 353
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
GIVLLKND+ LPLN ++ +LA+VGP AN M G Y G PC+ + Y K
Sbjct: 354 GIVLLKNDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKT 413
Query: 438 NYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI 497
+YA GC+D+ C +++ A+ AK AD ++VAGLDLS E E KDRV L LPG Q +L+
Sbjct: 414 SYASGCSDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLV 473
Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
+ VA +K PV LV+ G VD+ FAKN+P+I SI+W+GYPGE GG+A+A++IFG +NPG
Sbjct: 474 SHVAAVSKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 533
Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVA 615
GRLP TWY ++ + + M +R ++ +PGRTY+F+ GP VY FG GLSYT+F+YK+
Sbjct: 534 GRLPTTWYPESFTDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKIL 593
Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTFQIEVENMGKM 672
S+P + + QQ + + +DDV C+ +F ++ V N G++
Sbjct: 594 SAPIRLSLSELLPQQSSHKKQL--QHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEI 651
Query: 673 DGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
DGS VVM++SK PP ++G KQ+IGY+RV + + + + F ++ CK L + ++ +
Sbjct: 652 DGSHVVMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRV 711
Query: 732 LASGAHTILVGE 743
+ G+H + +G+
Sbjct: 712 IPLGSHVLFLGD 723
>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
Length = 767
Score = 673 bits (1737), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/755 (45%), Positives = 480/755 (63%), Gaps = 40/755 (5%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ P+C LP +R +DL+ R+TL EK++ + + A VPRLG+ YEWWSEALHGVS
Sbjct: 39 NLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIKGYEWWSEALHGVS--- 95
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
N+ PG F PGATSFP VI T ASFN SLW+ IG+ VS EARAMYN G AGLT+
Sbjct: 96 ---NADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDEARAMYNGGMAGLTY 152
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP + G+YA +YVRGLQ G + LK++AC
Sbjct: 153 WSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQGNSGNQ---------LKVAAC 203
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKHY AYDLDNW DR+ F++RV++QD+ +T+ +PF+ CV EG V C++ I
Sbjct: 204 CKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDVPFKACVVEGKYQ-VYCAHT----IK 258
Query: 251 TCADPKLLN--QTIRGDWNFHGYIVSDCDSIQTI--VESHKFLNDTKEDAVARVLKAGLD 306
A+P +L W++H ++ C + H L+ T EDA A +KAGLD
Sbjct: 259 LMANPLVLTLISPQHHPWSWHSWL--HCFRLYRCWGFICHSTLHSTPEDAAAATIKAGLD 316
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNIC 363
L+CG + T AV+QGK+ EAD++ +L V MRLG FDG P Y NLG ++C
Sbjct: 317 LECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMRLGMFDGEPSSQPYGNLGPRDVC 376
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
P H +LA EAARQGIVLL+N +LPL+T +T+A++GP+++ T M+GNY G C +
Sbjct: 377 TPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAVIGPNSDVTVTMLGNYAGVACGF 436
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
T+P+ G Y + I+ + GC + C NN + A AA+ ADATV+V GLD S+E E KD
Sbjct: 437 TTPLQGIERYVRTIHQS-GCDSVACSNNQLFGVAETAARQADATVLVMGLDQSIETEFKD 495
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
RV LLLPG Q EL+++VA A++GPV LV+MS G +D++FAKN+P+I +ILWVGYPG+ GG
Sbjct: 496 RVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVSFAKNDPRIGAILWVGYPGQAGG 555
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYKFFDGPVVYP 600
AIADV+FG+ NPGGRLP+TWY +Y+ K P T+M +R P + +PGRTY+F+ GPVV+P
Sbjct: 556 TAIADVLFGRTNPGGRLPMTWYPQDYLAKAPMTNMAMRANPSSGYPGRTYRFYKGPVVFP 615
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FG+G+SYT F +++A +P +V + L + N T N + + C
Sbjct: 616 FGHGMSYTTFAHELAHAPTTVSVPLTSLYGLQ--NSTTFNN-----GIRVTHTNCDTLIL 668
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
I+V+N G MDG+ V+V+S PP KQ+IG+++V + A +V ++ C
Sbjct: 669 GIHIDVKNTGDMDGTHTVLVFSTPPVGKWGANKQLIGFKKVHVVARGRQRVKIHVHVCNQ 728
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLN 755
L +VD + G H++ +G+ +S + L+
Sbjct: 729 LSVVDQFGIRRIPIGEHSLHIGDIKHSISLQVTLD 763
>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 782
Score = 673 bits (1737), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/739 (44%), Positives = 459/739 (62%), Gaps = 30/739 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+CDA LP +RA DLV R+T EKV Q+GD A GVPRLG+P Y+WWSEALHG++ GR
Sbjct: 52 FCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRLGVPAYKWWSEALHGLATSGR-- 109
Query: 74 NSPPGTHFD---SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
G HFD S ATSFP V+LT A+F++ LW +IGQ + TEARA+YN+G A GLT
Sbjct: 110 ----GLHFDAPGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIGTEARALYNIGQAEGLT 165
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSPN+N+ RDPRWGR ETPGEDP + +YA+ +V+G+Q + S L+ SA
Sbjct: 166 MWSPNVNIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQG---------NSSAILQTSA 216
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH AYDL++W G R++F+++VT QD+++T+ PF CV + + +MC+Y +NG+
Sbjct: 217 CCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKATCIMCAYTGINGV 276
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CA+ LL +T+RGDW GYI SDCD++ + ++ ++ T EDAVA LKAGLD++C
Sbjct: 277 PACANADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRY-TQTPEDAVAVALKAGLDMNC 335
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNP 365
G Y A+QQGK+ E DID +L+ L+ + MRLG+FDG P+ Y LG +IC P
Sbjct: 336 GTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVYGGLGAADICTP 395
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA EAA GIVLLKND G LPL+ + + A++GP+AN A+IGNY G PC T+
Sbjct: 396 EHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIGNYFGPPCESTT 455
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P++G Y K + + GC C + A A ++D + GL E+EG+DR
Sbjct: 456 PLNGILGYIKNVRFLAGCNSAACDVAATD-QAAAVASSSDYVFLFMGLSQKQESEGRDRT 514
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q LI VADAAK PV LV+++ G VD+ FA+ NPKI +ILW GYPG+ GG A
Sbjct: 515 SLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGLA 574
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IA V+FG +NPGGRLP+TWY + K+P T M +R P +PGR+Y+F+ G VY FGY
Sbjct: 575 IARVLFGDHNPGGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGY 634
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSY+ + ++ S K + + R + G + D C+ KF
Sbjct: 635 GLSYSSYSRQLVSGGKPAESYTNLLASLRTTTTSEGDESYHIEEIGTDG--CEQLKFPAV 692
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+EV+N G MDG V++Y + P G Q+IG+ + G+ A + F ++ C+
Sbjct: 693 VEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVGEKANIRFDISPCEHFS 752
Query: 723 IVDNAANSLLASGAHTILV 741
V ++ G+H ++V
Sbjct: 753 RVRKDGKKVIDRGSHYLMV 771
>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
distachyon]
Length = 771
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/741 (45%), Positives = 478/741 (64%), Gaps = 24/741 (3%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+CDA LP+P RA+ LV +TL EK+ Q+ + A GVPRLG+P YEWWSE+LHG++
Sbjct: 37 YPFCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLA---- 92
Query: 72 RTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
++ PG +F S V AT FP VIL+ ASFN SLW+ + + V+ EARAM+N G AGLT+
Sbjct: 93 --DNGPGVNFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQAGLTY 150
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PNINV RDPRWGR ETPGEDP V+ Y++ YV+G Q EY + R + +SAC
Sbjct: 151 WAPNINVFRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQG----EYGDGKEGR-MMLSAC 205
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKHY AYDL+ W R+ F+++V EQD ++T+ PF+ C+ EG S +MCSYN+VNG+P
Sbjct: 206 CKHYVAYDLEKWGNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVP 265
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CA LL Q +R +W F GY+VSDCD++ I + N + ED++A VLKAG+D++CG
Sbjct: 266 ACARKDLL-QKVRDEWGFQGYVVSDCDAVGIIYGYQNYTN-SDEDSIAIVLKAGMDINCG 323
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKNNICNPQH 367
+ T A+Q+GKI E DI+ +L L+ V +RLG FD G+ + LG +NIC +H
Sbjct: 324 SFLIRHTKSAIQKGKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQLGPSNICTKEH 383
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
ELAAEAARQG VLLKNDN LPL + +A++GP AN M G+Y G PC T+ +
Sbjct: 384 RELAAEAARQGTVLLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYTGVPCNPTTFL 443
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G A A GC DI C + AI+ AK AD V++AGL+L+ E E DRV L
Sbjct: 444 KGMQAVVPQTTIAAGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQETEDLDRVSL 503
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q +LIN +A K P+ LVI G VD++FAK + +I S+LW+GYPGE GG+ +
Sbjct: 504 LLPGKQMDLINSIASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGYPGEVGGQVLP 563
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
+++FG+YNPGG+LPITWY ++ +P M +R P ++PGRTY+F+ G VVY FGYGL
Sbjct: 564 EILFGEYNPGGKLPITWYPESFTAVPMNDMNMRADPSRSYPGRTYRFYTGDVVYGFGYGL 623
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYT-VGTNKPPCAAVLIDDV-KCKDYKFTFQ 663
SY+++ Y + +P I L + I+ T + V ++D+ C+ KF+
Sbjct: 624 SYSKYSYNIIQAP--TKISLSRSSAVDFISTKRAHTRRDGLDYVQVEDIASCESIKFSVH 681
Query: 664 IEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
I V N G MDGS V+++++ + G +KQ++G+ER++ AAG++ V T++ CK +
Sbjct: 682 ISVANDGAMDGSHAVLLFTRSKSSVPGFPLKQLVGFERLYAAAGKATNVEITVDPCKLMS 741
Query: 723 IVDNAANSLLASGAHTILVGE 743
+ +L G+H ++VG+
Sbjct: 742 SANTEGRRVLLLGSHLLMVGD 762
>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
Length = 766
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/744 (44%), Positives = 480/744 (64%), Gaps = 28/744 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +P+CDA L P RA+ LV +TL EK+ Q+ + A GVPRLG+P Y+WWSE+LHG++
Sbjct: 32 SAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLA-- 89
Query: 70 GRRTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
++ PG +F S V AT+FP VIL+TA+FN SLW+ + + V+TEA M+N G AGL
Sbjct: 90 ----DNGPGVNFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGL 145
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+W+PNIN+ RDPRWGR ET GEDP V Y++ YV+G Q +G E +++S
Sbjct: 146 TYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQGEEGR-------IRLS 198
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD++ WEG R+ F+++V QD+++T+ PF+ C+ E S +MC+YN+VNG
Sbjct: 199 ACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNG 258
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P CA+ LL +T R +W F GYI SDCD++ I E+ + + ED++A VLKAG+D++
Sbjct: 259 VPMCANKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSDEDSIAIVLKAGMDIN 316
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYK-NLGKNNICNP 365
CG + T AV++GK+ E DID +L L+ V +RLG FD + Q+ LG NN+C
Sbjct: 317 CGSFLVRHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFDKPNNNQWSTQLGPNNVCTK 376
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H ELAAEA RQG VLLKND+ LPL ++ +A++GP AN AM G+Y G C T+
Sbjct: 377 EHRELAAEAVRQGAVLLKNDHSFLPLKRSEVRHVAIIGPSANDVYAMGGDYTGVACNPTT 436
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
+ G AY+ +A GC D+ C + + AI AAK AD V+VAGL+L+ E E DRV
Sbjct: 437 FLKGIQAYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADIVVVVAGLNLTEEREDFDRV 496
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q LI+ VA AK P+ LV++ G VD++FAK +P+I SILW+GYPGE GG+
Sbjct: 497 SLLLPGKQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQV 556
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
+ +++FG+YNPGG+L +TWY ++ IP T M +R P +PGRTY+F+ G VVY FGY
Sbjct: 557 LPEILFGEYNPGGKLAMTWYPESFTAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGY 616
Query: 604 GLSYTQFKYKVASSPKSVDIKLDK--DQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKF 660
GLSY+++ Y + S+PK + + D R +Y + V +D+ C+ F
Sbjct: 617 GLSYSKYSYSILSAPKKITMSRSSVLDIISRKPSY---IRRDGLDFVKTEDIASCEALAF 673
Query: 661 TFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
+ + V N G MDGS V+++++ + G IKQ++G+ERV AAG ++ V +++ CK
Sbjct: 674 SVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFERVHTAAGSASNVEISVDPCK 733
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
+ + +L G H + VG+
Sbjct: 734 HMSAANPEGKRVLLLGDHVLTVGD 757
>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 829
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/752 (44%), Positives = 465/752 (61%), Gaps = 33/752 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
P+C+ KLP +RA DLV RMT EK Q+GD+A GVPRLG+P Y+WW+EALHGV+ G+
Sbjct: 96 LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGK 155
Query: 72 RTNSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
G H D V ATSFP V+LT ASFN++LW +IGQ EARA YN+G A GLT
Sbjct: 156 ------GIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYNIGQAEGLT 209
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSPN+N+ RDPRWGR ETPGEDP V RYA +VRGLQ G + S L SA
Sbjct: 210 MWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQ---GSSSNTKSVPPVLLTSA 266
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH AYDL++W+G R+ F + VT QD+ +TF PF CV +G S VMC+Y VNG+
Sbjct: 267 CCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGV 326
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P+CA+ LL +T RG W GY+ +DCD++ +I+ + +F T ED VA LKAGLD+DC
Sbjct: 327 PSCANADLLTKTFRGSWGLDGYVAADCDAV-SIMRNSQFYRPTAEDTVATTLKAGLDIDC 385
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G Y M A+Q+GK+ + D+D +++ L+ MRLG+FDG P+ Y NLG +IC +
Sbjct: 386 GPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAHICTQE 445
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA EAA GIVLLKN G LPL G++ + A++G +AN A++GNY G PC T+P
Sbjct: 446 HKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPCAPTTP 505
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G Y K + + GC C N + P A A +D+ ++ GL E+EGKDR
Sbjct: 506 LQGIQGYVKNVRFLAGCHKAAC-NVAATPQAAALASTSDSVILFMGLSQEQESEGKDRTT 564
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q LI VA+AAK PV LV+++ G VDI FA+ NPKI +ILW GYPG+ GG AI
Sbjct: 565 LLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGGLAI 624
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLS 606
A V+FG+ NP GRLP+TWY + K+P T M +R ++PGR+Y+F+ G +Y FGYGLS
Sbjct: 625 AKVLFGEKNPSGRLPVTWYPEEFTKVPMTDMRMRSAGSYPGRSYRFYKGKTIYKFGYGLS 684
Query: 607 YTQFKYKVASS----PKSVDIKLDKDQQCR---DINYTVGTNKPPCAAVLIDDVKCKDYK 659
Y++F ++V ++ + + L +++Y V I D C+ K
Sbjct: 685 YSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYHVDH---------IGDELCRQLK 735
Query: 660 FTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
F ++V+N G MDG +++ + P G +Q++G++ I AG+ A + F ++ C
Sbjct: 736 FLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGEKAHLRFEVSPC 795
Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
+ V + ++ G+H + VG+ +SF
Sbjct: 796 EDFSRVRDDGRKVIDKGSHFLKVGKHELEISF 827
>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
Length = 780
Score = 669 bits (1727), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/740 (45%), Positives = 472/740 (63%), Gaps = 18/740 (2%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
P+CD LP RA+ LV +TLPEK+ + + A +PRLG+P Y+WWSE+LHG++ G
Sbjct: 40 PFCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWSESLHGLALNG-- 97
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PG F VP ATSFP VIL+ ASFN SLW + ++ EARAM+N+G AGLTFW+
Sbjct: 98 ----PGVSFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQAGLTFWA 153
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR-PLKISACC 191
PNIN+ RDPRWGR ETPGEDP + YA+ YVRGLQ + G++ D L +SACC
Sbjct: 154 PNINLFRDPRWGRGQETPGEDPMLASAYAVEYVRGLQGLSGIQDAVVVDDDDTLMVSACC 213
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+ AYDLD W R++F++ V++QD+++T+ PF C+ +G S +MCSYN VNG+P
Sbjct: 214 KHFTAYDLDMWGQFSRYNFNAVVSQQDLEDTYQPPFRSCIQQGKASCLMCSYNEVNGVPA 273
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
CA +LL R W F GYI SDCD++ T+ E K+ ++EDAVA VLKAG+D++CG
Sbjct: 274 CASEELLGLA-RDKWGFKGYITSDCDAVATVYEYQKYAK-SQEDAVADVLKAGMDINCGT 331
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHI 368
+ T A++QGK+ E D+D +L L+ V +RLG FDG P ++ LG ++C +H
Sbjct: 332 FMLRHTESAIEQGKVKEEDLDRALLNLFSVQLRLGLFDGDPIRGRFGKLGPKDVCTQEHK 391
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
LA +AARQGIVLLKND LPL+ +LA++GP A TK + G Y G PC +S +
Sbjct: 392 TLALDAARQGIVLLKNDKKFLPLDRDIGASLAVIGPLATTTK-LGGGYSGIPCSSSSLYE 450
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
G +++ I+YA GC D+ C ++ AID AK AD VIVAGLD + E E DRV LL
Sbjct: 451 GLGEFAERISYAFGCYDVPCDSDDGFAEAIDTAKQADFVVIVAGLDATQETEDHDRVSLL 510
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
LPG Q L++ VADA+K PV LV++ G +D++FA+ NP+I SI+W+GYPGE GG+A+A+
Sbjct: 511 LPGKQMNLVSSVADASKNPVILVLIGGGPLDVSFAEKNPQIASIIWLGYPGEAGGKALAE 570
Query: 549 VIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLS 606
+IFG++NP GRLP+TWY + +P M +R P +PGRTY+F+ G VY FG+GLS
Sbjct: 571 IIFGEFNPAGRLPMTWYPEAFTNVPMNEMSMRADPSRGYPGRTYRFYTGGRVYGFGHGLS 630
Query: 607 YTQFKYKVASSPKSVDI-KLDKDQQCRDINYTVGTNKPPCAAVLIDDVK-CKDYKFTFQI 664
++ F Y S+P + + + KD + + Y V V ++ ++ C F+ I
Sbjct: 631 FSDFSYNFLSAPSKISLSRTIKDGSRKRLLYQVENEVYGVDYVPVNQLQNCNKLSFSVHI 690
Query: 665 EVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
V N+G +DGS VVM++SK P + G+ Q++G+ R+ + + + ++ C+ L
Sbjct: 691 SVMNLGGLDGSHVVMLFSKGPKVVDGSPETQLVGFSRLHTISSKPTETSILVHPCEHLSF 750
Query: 724 VDNAANSLLASGAHTILVGE 743
D +L G HT+ VG+
Sbjct: 751 ADKQGKRILPLGPHTLSVGD 770
>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
Precursor
gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
Length = 767
Score = 669 bits (1727), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/756 (45%), Positives = 481/756 (63%), Gaps = 35/756 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C LP +RA+DLV R+T+ EK+ Q+ + A G+PRLG+P YEWWSEALHGV++ G
Sbjct: 36 YQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAYEWWSEALHGVAYAG- 94
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
PG F+ V ATSFP VILT ASF+ W +I Q + EAR +YN G A G+TF
Sbjct: 95 -----PGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNAGQANGMTF 149
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKIS 188
W+PNIN+ RDPRWGR ETPGEDP + G YA+ YVRGLQ +G R + S L+ S
Sbjct: 150 WAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDG----RKTLSNHLQAS 205
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDLD W+G R+ F+++V+ D+ ET+ PF+ C+ EG S +MC+YNRVNG
Sbjct: 206 ACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMCAYNRVNG 265
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
IP+CADP LL +T RG W F GYI SDCD++ I ++ + + EDAVA VLKAG+D++
Sbjct: 266 IPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGYAK-SPEDAVADVLKAGMDVN 324
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y T A+QQ K++E DID +L L+ V +RLG F+G P Y N+ N +C+P
Sbjct: 325 CGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSP 384
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
H LA +AAR GIVLLKN+ LP + ++ +LA++GP+A+ K ++GNY G PC+ +
Sbjct: 385 AHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVT 444
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+D +Y K Y GC + C +N+ I A+ AKNAD V++ GLD + E E DRV
Sbjct: 445 PLDALRSYVKNAVYHQGCDSVAC-SNAAIDQAVAIAKNADHVVLIMGLDQTQEKEDFDRV 503
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL LPG Q ELI VA+AAK PV LV++ G VDI+FA NN KI SI+W GYPGE GG A
Sbjct: 504 DLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNKIGSIIWAGYPGEAGGIA 563
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
I+++IFG +NPGGRLP+TWY ++V I T M +R +PGRTYKF+ GP VY FG+GL
Sbjct: 564 ISEIIFGDHNPGGRLPVTWYPQSFVNIQMTDMRMRSATGYPGRTYKFYKGPKVYEFGHGL 623
Query: 606 SYTQFKYKVAS-SPKSVDIKLDKDQQCRD-INYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
SY+ + Y+ + + ++ + K Q D + YT+ + + C K
Sbjct: 624 SYSAYSYRFKTLAETNLYLNQSKAQTNSDSVRYTLVSE--------MGKEGCDVAKTKVT 675
Query: 664 IEVENMGKMDGSEVVMVYSKPP--GIAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKS 720
+EVEN G+M G V+++++ G G KQ++G++ + ++ G+ A++ F + C+
Sbjct: 676 VEVENQGEMAGKHPVLMFARHERGGEDGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCEH 735
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
L + +L G + + VG+ PL +N+
Sbjct: 736 LSRANEFGVMVLEEGKYFLTVGDS----ELPLIVNV 767
>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 766
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/756 (45%), Positives = 481/756 (63%), Gaps = 35/756 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C LP +RA+DLV R+ + EK+ Q+G+ A G+PRLG+P YEWWSEALHGV++ G
Sbjct: 35 YQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGIPRLGVPAYEWWSEALHGVAYAG- 93
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
PG F+ V ATSFP VILT ASF+ W +I Q + EAR +YN G A G+TF
Sbjct: 94 -----PGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNAGQAQGMTF 148
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKIS 188
W+PNIN+ RDPRWGR ETPGEDP + G YA+ YVRGLQ +G R + S L+ S
Sbjct: 149 WAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDSFDG----RKTLSIHLQAS 204
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDLD W+G R+ F+++V+ D+ ET+ PF+ C+ EG S +MC+YNRVNG
Sbjct: 205 ACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGIMCAYNRVNG 264
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
IP+CADP LL +T RG W F GYI SDCD++ I ++ + T EDAVA VLKAG+D++
Sbjct: 265 IPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHDAQGYAK-TPEDAVADVLKAGMDVN 323
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y T A+QQ K++E DID +L L+ V +RLG F+G P Y N+ N++C+P
Sbjct: 324 CGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNISPNDVCSP 383
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
H LA EAAR GIVLLKN+ LP + ++ +LA++GP+A+ K ++GNY G PC+ +
Sbjct: 384 AHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVAKTLLGNYAGPPCKTVT 443
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+D +Y K Y GC + C +N+ I A+ A+NAD V++ GLD + E E DRV
Sbjct: 444 PLDALRSYVKNAVYHNGCDSVAC-SNAAIDQAVAIARNADHVVLIMGLDQTQEKEDMDRV 502
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL LPG Q ELI VA+AAK PV LV++ G VDI+FA NN KI SI+W GYPGE GG A
Sbjct: 503 DLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFATNNDKIGSIMWAGYPGEAGGIA 562
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
+A++IFG +NPGGRLP+TWY ++V + T M +R +PGRTYKF+ GP V+ FG+GL
Sbjct: 563 LAEIIFGDHNPGGRLPVTWYPQSFVNVQMTDMRMRSATGYPGRTYKFYKGPKVFEFGHGL 622
Query: 606 SYTQFKYKVAS-SPKSVDIKLDKDQQCRD-INYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
SY+ + Y+ + ++ + K Q D + YT+ + + + C K
Sbjct: 623 SYSTYSYRFKTLGATNLYLNQSKAQLNSDSVRYTLVSE--------MGEEGCNIAKTKVI 674
Query: 664 IEVENMGKMDGSEVVMVYSKPP--GIAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKS 720
+ VEN G+M G V+++++ G G KQ++G++ + ++ G+ A++ F + C+
Sbjct: 675 VTVENQGEMAGKHPVLMFARHERGGENGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCEH 734
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
L + ++ G + + VG+ PL +N+
Sbjct: 735 LSRANEVGVMVVEEGKYFLTVGDS----ELPLTINV 766
>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 774
Score = 668 bits (1724), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/743 (45%), Positives = 460/743 (61%), Gaps = 40/743 (5%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+CD L +RA DLV R+T EK+ Q+GD A GVPRLG+P Y+WW+EALHG++ G+
Sbjct: 46 FCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLGVPGYKWWNEALHGLATSGK-- 103
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
G HFD+ V ATSFP V+LT A+F++ LW +IGQ + EARA++N+G A GLT WS
Sbjct: 104 ----GLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREARALFNVGQAEGLTIWS 159
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PN+N+ RDPRWGR ETPGEDP V RYA+ +VRG+Q +S S L+ SACCK
Sbjct: 160 PNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG--------NSSSSLLQTSACCK 211
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H AYDL++W G R+ F +RVTEQD+++TF PF CV E S VMC+Y +NG+P C
Sbjct: 212 HATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTAINGVPAC 271
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
A+ LL T+RGDW GY+ SDCD++ + ++ ++ T EDAVA LKAGLD+DCG Y
Sbjct: 272 ANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYA-PTPEDAVAVSLKAGLDIDCGSY 330
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIE 369
A+QQGK+ E DID +L LY V MRLG+FDG P+ Y LG +IC P+H
Sbjct: 331 VQQHAAAAIQQGKLTEQDIDKALTNLYAVRMRLGHFDGDPRKNMYGVLGAADICTPEHRN 390
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
LA EAA+ GIVLLKND G LPL+ + + A++GP+AN A+I NY G PC T+P+ G
Sbjct: 391 LALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNANDGMALIANYFGPPCESTTPLKG 450
Query: 430 FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
+Y + + GC C + + A+ A + D + GL E+EGKDR LLL
Sbjct: 451 LQSYVNDVRFLAGCNSAAC-DVAATDQAVALAGSEDYVFLFMGLSQKQESEGKDRTSLLL 509
Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
PG Q LI VADA+K PV LV++S G VDI FA++NPKI +ILW GYPG+ GG AIA V
Sbjct: 510 PGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQAGGLAIAKV 569
Query: 550 IFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSY 607
+FG +NP GRLP+TWY + K+P T M +R P + +PGR+Y+F+ G VY FGYGLSY
Sbjct: 570 LFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPTSGYPGRSYRFYQGNTVYKFGYGLSY 629
Query: 608 TQFKYKVA--------SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
+ F ++ SS ++ Q D +Y V I C+ K
Sbjct: 630 STFSRRLVHGTSVPALSSTLLTGLRETMTPQDGDRSYHVDA---------IGTEGCEQLK 680
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
F +EV+N G MDG V+++ + P G Q+IG+ + AG++AK+ F ++ C
Sbjct: 681 FPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHLKAGETAKLRFDISPC 740
Query: 719 KSLKIVDNAANSLLASGAHTILV 741
K V ++ G+H ++V
Sbjct: 741 KHFSRVRADGRKVIDIGSHFLMV 763
>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 779
Score = 667 bits (1722), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/756 (43%), Positives = 463/756 (61%), Gaps = 42/756 (5%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +CD LP RA DLV R+TL EKV Q+GD A VPRLG+P Y+WWSE LHG+SF G
Sbjct: 47 YAFCDKALPVERRAADLVSRLTLAEKVSQLGDEADAVPRLGVPAYKWWSEGLHGLSFWGH 106
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G HFD V TSFP V+LT ASF++ +W +IGQ + TEARA+YNLG A GLT
Sbjct: 107 ------GMHFDGAVRAITSFPQVLLTAASFDQDIWYRIGQAIGTEARALYNLGQAQGLTI 160
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP +YA+ +V+GLQ + + L+ SAC
Sbjct: 161 WSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQG---------TSATTLQTSAC 211
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH AYDL++W G R++F+++VT QD+ +TF PF+ CV EG + VMC+Y +NG+P
Sbjct: 212 CKHATAYDLEDWNGVVRYNFNAKVTLQDLADTFNPPFKSCVEEGKATCVMCAYTNINGVP 271
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CA L+ +T +GDW +GY+ SDCD++ + ++ ++ T ED VA LKAGLDL+CG
Sbjct: 272 ACASSDLITKTFKGDWGLNGYVSSDCDAVALLRDAQRY-RATPEDTVAVALKAGLDLNCG 330
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQ 366
+Y M A+QQGK+ E D+D +L+ L+ V MRLG+FDG P+ Y +LG ++C+P
Sbjct: 331 NYTQVHGMSALQQGKMTEQDVDNALKNLFAVRMRLGHFDGDPRTSALYGSLGAADVCSPA 390
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA EAA+ GIVLLKND G LPL+ + + A +G +AN A+ GNY G PC T+P
Sbjct: 391 HKNLALEAAQSGIVLLKNDAGILPLDPSAVASAAAIGHNANDPAALNGNYFGPPCETTTP 450
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G Y K + + GC C + A+ A ++D ++ GL E EG DR
Sbjct: 451 LQGLQGYVKNVKFLAGCDSAAC-GFAATGQAVTLASSSDYVILFMGLSQKEEQEGIDRTS 509
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q LI VA A+K PV LV+++ G+VDI FAK+NPKI +ILW GYPG+ GG AI
Sbjct: 510 LLLPGKQQNLITAVASASKRPVILVLLTGGSVDITFAKSNPKIGAILWAGYPGQAGGLAI 569
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
A V+FG +NP GRLP+TWY + K+P T M +R P +PGR+Y+F+ G VY FG G
Sbjct: 570 ARVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGDG 629
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC-----AAVLIDDV---KCK 656
LSY++F ++ SS + Q + N G + ++++ C
Sbjct: 630 LSYSKFSRQLVSSTNT--------HQVPNTNLLTGLTARTATDGGMSYYHVEEIGVEGCD 681
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFIAAGQSAKVGFT 714
KF +EV+N G MDG VM++ + P GT + Q++G+ + AG+ A + F
Sbjct: 682 KLKFPAVVEVQNHGPMDGKHSVMMFLRWPNSTGTGRPVSQLVGFRSQHLKAGEKASLTFD 741
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
++ C+ ++ G+H ++VG+ +SF
Sbjct: 742 VSPCEHFARAREDGKKVIDRGSHFLVVGKDEREISF 777
>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
Length = 1411
Score = 666 bits (1719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/740 (45%), Positives = 461/740 (62%), Gaps = 55/740 (7%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C+ L +RA DL+ R+TL EK+ Q+ A +PRLG+P YEWWSEALHG+
Sbjct: 710 YAFCNTTLRISQRASDLISRLTLDEKISQLISSAASIPRLGIPAYEWWSEALHGI----- 764
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G F+ + ATSFP VILT ASF+ LW +IGQ + E RAMYN G A G+TF
Sbjct: 765 --RDRHGIRFNGTIRSATSFPQVILTAASFDAHLWYRIGQAIGIETRAMYNAGQAMGMTF 822
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PNIN+ RDPRWGR ETPGEDP V G+YA++YVRGLQ + L+ SAC
Sbjct: 823 WAPNINIFRDPRWGRGQETPGEDPVVAGKYAVSYVRGLQG----DTFEGGKVDVLQASAC 878
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLDNW DR+ FD+RVT QD+ +T+ PF C+ EG S +MC+YN VNG+P
Sbjct: 879 CKHFTAYDLDNWTSIDRYTFDARVTMQDLADTYQPPFRSCIEEGRASGLMCAYNLVNGVP 938
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CAD LL++T RG W F GYIVSDCD++ + + + + EDAVA VL AG+D+ CG
Sbjct: 939 NCADFNLLSKTARGQWGFDGYIVSDCDAVSLVHDVQGYAK-SPEDAVAIVLTAGMDVACG 997
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
Y AV Q K+ E++ID +L L+ V MRLG F+G+P+ + N+G + +C+ +H
Sbjct: 998 GYLQKHAKSAVSQKKLTESEIDRALLNLFTVRMRLGLFNGNPRKLPFGNIGPDQVCSTEH 1057
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
LA EAAR GIVLLKN + LPL+ G +LA++GP+ANAT ++GNY G PC++ SP+
Sbjct: 1058 QTLALEAARSGIVLLKNSDRLLPLSKGETLSLAVIGPNANATDTLLGNYAGPPCKFISPL 1117
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G +Y Y GC D+ C + S I A+D AK AD V+V GLD + E E DR+DL
Sbjct: 1118 QGLQSYVNNTMYHAGCNDVACSSAS-IENAVDVAKQADYVVLVMGLDQTQEREKYDRLDL 1176
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
+LPG Q +LI VA AAK PV LV++ G VDI+FAK + I SILW GYPGE GG AIA
Sbjct: 1177 VLPGKQEQLITGVAKAAKKPVVLVLLCGGPVDISFAKGSSNIGSILWAGYPGEAGGAAIA 1236
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
+ IFG +NPGGRLP+TWY +++KIP T M +R P + +PGRT++F+ G V+ FG GL
Sbjct: 1237 ETIFGDHNPGGRLPVTWYPKDFIKIPMTDMRMRPEPQSGYPGRTHRFYTGKTVFEFGNGL 1296
Query: 606 SYTQFKYKVAS-SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
SY+ + Y+ S +P + + N+P V
Sbjct: 1297 SYSPYSYEFLSVTPNKLYL-----------------NQPSTTHV---------------- 1323
Query: 665 EVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
VEN GKM G V+++ K G+ +KQ++G++ VF+ AG+S+ V F ++ C+ L
Sbjct: 1324 -VENSGKMAGKHPVLLFVKQAKAGNGSPMKQLVGFQNVFLDAGESSNVEFILSPCEHLSR 1382
Query: 724 VDNAANSLLASGAHTILVGE 743
+ ++ G H ++VG+
Sbjct: 1383 ANKDGLMVMEQGIHLLVVGD 1402
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 315/607 (51%), Positives = 417/607 (68%), Gaps = 21/607 (3%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C LP P+R +DLV R+TL EK+ Q+ + A +PRLG+P YEWWSEALHGV+ G
Sbjct: 41 YHFCKTTLPIPDRVRDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAG- 99
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
PG F+ + ATSFP VILT ASF+ LW +IG+ + EARA+YN G G+TF
Sbjct: 100 -----PGIRFNGTIRSATSFPQVILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTF 154
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD--VEGVEYHRDSDSRPLKIS 188
W+PNIN+ RDPRWGR ETPGEDP V G YA++YVRG+Q + G++ + L+ S
Sbjct: 155 WAPNINIFRDPRWGRGQETPGEDPLVTGSYAVSYVRGVQGDCLRGLKRCGE-----LQAS 209
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDLD+W+G DRF FD+RVT QD+ +T+ PF C+ EG S +MC+YNRVNG
Sbjct: 210 ACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMCAYNRVNG 269
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P+CAD LL T R WNF GYI SDCD++ I +S+ F T EDAV VLKAG+D++
Sbjct: 270 VPSCADFNLLTNTARKRWNFQGYITSDCDAVSLIHDSYGFAK-TPEDAVVDVLKAGMDVN 328
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y N T AV Q K+ E+++D +L L+ V MRLG F+G+P+ Y ++G N +C+
Sbjct: 329 CGTYLLNHTKSAVMQKKLPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSV 388
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA +AAR GIVLLKN LPL G +LA++GP+AN+ K +IGNY G PC++ +
Sbjct: 389 EHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGPPCKFIT 448
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ +Y K Y PGC + C + S I A++ A+ AD V+V GLD + E E DR+
Sbjct: 449 PLQALQSYVKSTMYHPGCDAVACSSPS-IEKAVEIAQKADYVVLVMGLDQTQEREAHDRL 507
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL+LPG Q +LI VA+AAK PV LV++S G VDI+FAK + I SILW GYPG GG A
Sbjct: 508 DLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPGGAGGAA 567
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGY 603
IA+ IFG +NPGGRLP+TWY ++ KIP T M +RP +N +PGRTY+F+ G V+ FGY
Sbjct: 568 IAETIFGDHNPGGRLPVTWYPQDFTKIPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGY 627
Query: 604 GLSYTQF 610
GLSY+ +
Sbjct: 628 GLSYSTY 634
>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 755
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/758 (44%), Positives = 469/758 (61%), Gaps = 43/758 (5%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ + +C+ LP +RA DLV ++TL EKV Q+GD A GVPR G+P Y WWSE LHGVS
Sbjct: 22 AQYAFCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFGVPGYNWWSEGLHGVSMW 81
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
G G HF+ V G T+FP V+LTTASF++S+W +IGQ + TEARAM+NLG A GL
Sbjct: 82 GH------GMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEARAMFNLGQADGL 135
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T WSPN+N+ RDPRWGR ETPGEDP +YA+ +VRGLQ + + L+ S
Sbjct: 136 TIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQG---------TSTTTLQTS 186
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH AYDLD+W R++F+++VT QD++ETF PF+ CV EG + VMC+Y VNG
Sbjct: 187 ACCKHATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPPFKSCVVEGKATCVMCAYTSVNG 246
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
IP CAD LL +TI+G+W +GYI SDCD++ + + + T EDAVA +KAGLD++
Sbjct: 247 IPACADSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR--YSGTPEDAVAAAIKAGLDMN 304
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG----SPQYKNLGKNNICN 364
CG++ M A+QQ K++E D+D +LR L+ + MRLG+FDG SP Y LG ++C+
Sbjct: 305 CGNFSQVHGMAALQQRKMSEQDVDKALRNLFAIRMRLGHFDGDPLQSPLYGRLGAQDVCS 364
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLN--TGNIKTLALVGPHANATKAMIGNYEGTPCR 422
P H +LA EAA+ GIVLLKND LPL+ T + A++GP+AN A++GNY G PC
Sbjct: 365 PAHKDLALEAAQNGIVLLKNDAATLPLSRPTAASASFAVIGPNANEPGALLGNYFGPPCE 424
Query: 423 YTSPMDGFYA-YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
T+P+ YSK + + PGC C N + A A +D T++ GL E EG
Sbjct: 425 TTTPLQALQKFYSKNVRFVPGCDSAAC-NVADTYQASGLAATSDYTILFMGLSQKQEQEG 483
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR LLLPG Q LI VA AAK P+ LV+++ G VDI FAK NPKI +ILW GYPG+
Sbjct: 484 LDRTSLLLPGKQESLITAVAAAAKRPIILVLLTGGPVDITFAKFNPKIGAILWAGYPGQA 543
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVY 599
GG AIA V+FG++NP GRLP+TWY Y K+P M +R P +PGR+Y+F+ G VY
Sbjct: 544 GGLAIAKVLFGEHNPSGRLPVTWYPEEYTKVPMDDMRMRADPATGYPGRSYRFYKGNAVY 603
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA---VLIDDVK-- 654
FGYGLSY++F ++ + S + + + + C A L++++
Sbjct: 604 KFGYGLSYSKFSRQLVRNSSSNNRAPNTE--------LLAAAAVDCGASRYYLVEEIGGE 655
Query: 655 -CKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
C+ KF +EVEN G MDG + V+++ + P G Q++G+ + AG+ A V
Sbjct: 656 VCERLKFPAVVEVENHGPMDGKQSVLLFLRWPTATEGRPASQLVGFRSQDLRAGEKASVS 715
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
F ++ C+ ++ G+H ++V E +SF
Sbjct: 716 FDISPCEHFSRTTVDGTKVIDRGSHFLMVDEDEMEISF 753
>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 830
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/753 (44%), Positives = 465/753 (61%), Gaps = 34/753 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
P+C+ KLP +RA DLV RMT EK Q+GD+A GVPRLG+P Y+WW+EALHGV+ G+
Sbjct: 96 LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGK 155
Query: 72 RTNSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
G H D V ATSFP V+LT ASFN++LW +IGQ EARA YN+G A GLT
Sbjct: 156 ------GIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYNIGQAEGLT 209
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSPN+N+ RDPRWGR ETPGEDP V RYA +VRGLQ G + S L SA
Sbjct: 210 MWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQ---GSSSNTKSVPPVLLTSA 266
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH AYDL++W+G R+ F + VT QD+ +TF PF CV +G S VMC+Y VNG+
Sbjct: 267 CCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGV 326
Query: 250 PTCADPKLLNQTIRGDWNFHG-YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
P+CA+ LL +T RG W G Y+ +DCD++ +I+ + +F T ED VA LKAGLD+D
Sbjct: 327 PSCANADLLTKTFRGSWGLDGRYVAADCDAV-SIMRNSQFYRPTAEDTVATTLKAGLDID 385
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y M A+Q+GK+ + D+D +++ L+ MRLG+FDG P+ Y NLG +IC
Sbjct: 386 CGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAHICTQ 445
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA EAA GIVLLKN G LPL G++ + A++G +AN A++GNY G PC T+
Sbjct: 446 EHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPCAPTT 505
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G Y K + + GC C N + P A A +D+ ++ GL E+EGKDR
Sbjct: 506 PLQGIQGYVKNVRFLAGCHKAAC-NVAATPQAAALASTSDSVILFMGLSQEQESEGKDRT 564
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q LI VA+AAK PV LV+++ G VDI FA+ NPKI +ILW GYPG+ GG A
Sbjct: 565 TLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGGLA 624
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
IA V+FG+ NP GRLP+TWY + K+P T M +R ++PGR+Y+F+ G +Y FGYGL
Sbjct: 625 IAKVLFGEKNPSGRLPVTWYPEEFTKVPMTDMRMRSAGSYPGRSYRFYKGKTIYKFGYGL 684
Query: 606 SYTQFKYKVASS----PKSVDIKLDKDQQCR---DINYTVGTNKPPCAAVLIDDVKCKDY 658
SY++F ++V ++ + + L +++Y V I D C+
Sbjct: 685 SYSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYHVDH---------IGDELCRQL 735
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
KF ++V+N G MDG +++ + P G +Q++G++ I AG+ A + F ++
Sbjct: 736 KFLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGEKAHLRFEVSP 795
Query: 718 CKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
C+ V + ++ G+H + VG+ +SF
Sbjct: 796 CEDFSRVRDDGRKVIDKGSHFLKVGKHELEISF 828
>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 798
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/761 (45%), Positives = 475/761 (62%), Gaps = 48/761 (6%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
P+C+ L +RAKD+V R+TL EK+ Q+ + A +PRLG+P Y+WW EALHGV+ G+
Sbjct: 48 LPFCNLNLTITQRAKDIVSRLTLDEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAGK 107
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G + V GATSFP VILT ASF+ LW +I + + TEAR +YN G A G+TF
Sbjct: 108 ------GIRLNGSVAGATSFPQVILTAASFDSKLWYQISKVIGTEARGVYNAGQAQGMTF 161
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKIS 188
W+PNIN+ RDPRWGR ET GEDP V +Y ++YVRGLQ EG + D LK S
Sbjct: 162 WAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSFEGGKLIGDR----LKAS 217
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRV----------------TEQDMQETFILPFEMCVN 232
ACCKH+ AYDLDNW+G DRF FD++V T QD+ +T+ PF C+
Sbjct: 218 ACCKHFTAYDLDNWKGLDRFDFDAKVSFLFSMAYSPWMINYVTLQDLADTYQPPFHSCIV 277
Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDT 292
+G S +MC+YNRVNG+P CAD LL +T R WNF+GYI SDC++++ I ++ + T
Sbjct: 278 QGRSSGIMCAYNRVNGVPNCADYNLLTKTARQKWNFNGYITSDCEAVRIIYDNQGYAK-T 336
Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
EDAVA VL+AG+D++CGDY T AV Q K+ + ID +L L+ + +RLG FDG+P
Sbjct: 337 PEDAVADVLQAGMDVECGDYLTKHAKAAVLQKKVPISQIDRALHNLFTIRIRLGLFDGNP 396
Query: 353 ---QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN-A 408
QY +G N +C+ ++++LA EAAR GIVLLKN LPL + TL ++GP+AN +
Sbjct: 397 TKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASILPLP--RVNTLGVIGPNANKS 454
Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
+K ++GNY G PCR + GFY Y+ +Y GC D ++ I A++ AK +D +
Sbjct: 455 SKVVLGNYFGRPCRLVPILKGFYTYASQTHYRSGCLDGTKCASAEIDRAVEVAKISDYVI 514
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+V GLD S E E +DR DL LPG Q ELIN VA A+K PV LV++ G VDI FAKNN K
Sbjct: 515 LVMGLDQSQERESRDRDDLELPGKQQELINSVAKASKKPVILVLLCGGPVDITFAKNNDK 574
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFP 586
I I+W GYPGE GGRA+A V+FG YNPGGRLP+TWY +++KIP T M +R P + +P
Sbjct: 575 IGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYP 634
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY+F+ GP VY FGYGLSY+ + Y + +K + + +++ N
Sbjct: 635 GRTYRFYTGPKVYEFGYGLSYSNYSYNF------ISVKNNNLHINQSTTHSILENSETIY 688
Query: 647 AVLIDDV---KCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVF 702
L+ ++ CK + + + N G M G V+++ KP G G +KQ++G+E V
Sbjct: 689 YKLVSELGEETCKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVT 748
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+ G +VGF ++ C+ L + + ++ G H ++VGE
Sbjct: 749 VEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGHLLVVGE 789
>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
Length = 774
Score = 663 bits (1710), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/741 (46%), Positives = 475/741 (64%), Gaps = 25/741 (3%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C LP P+R +DLV R+TL EK+ Q+ + A +PRLG+P YEWWSEALHGV+ G
Sbjct: 41 YHFCKTTLPIPDRVRDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAG- 99
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
PG F+ + ATSFP VILT ASF+ LW +IG+ + EARA+YN G G+TF
Sbjct: 100 -----PGIRFNGTIRSATSFPQVILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTF 154
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD--VEGVEYHRDSDSRPLKIS 188
W+PNIN+ RDPRWGR ETPGEDP V G YA++YVRG+Q + G++ + L+ S
Sbjct: 155 WAPNINIFRDPRWGRGQETPGEDPLVTGSYAVSYVRGVQGDCLRGLKRCGE-----LQAS 209
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDLD+W+G DRF FD+RVT QD+ +T+ PF C+ EG S +MC+YNRVNG
Sbjct: 210 ACCKHFTAYDLDDWKGIDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMCAYNRVNG 269
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P+CAD LL T R WNF GYI SDCD++ I +S+ F T EDAV VLKAG+D++
Sbjct: 270 VPSCADFNLLTNTARKRWNFQGYITSDCDAVSLIHDSYGFAK-TPEDAVVDVLKAGMDVN 328
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y N T AV Q K+ E+++D +L L+ V MRLG F+G+P+ Y ++G N +C+
Sbjct: 329 CGTYLLNHTKSAVMQKKLPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSV 388
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA +AAR GIVLLKN LPL G +LA++GP+AN+ K +IGNY G PC++ +
Sbjct: 389 EHQTLALDAARDGIVLLKNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGPPCKFIT 448
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ +Y K Y PGC + C + S I A++ A+ AD V+V GLD + E E DR+
Sbjct: 449 PLQALQSYVKSTMYHPGCDAVACSSPS-IEKAVEIAQKADYVVLVMGLDQTQEREAHDRL 507
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL+LPG Q +LI VA+AAK PV LV++S G VDI+FAK + I SILW GYPG GG A
Sbjct: 508 DLVLPGKQQQLIICVANAAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPGGAGGAA 567
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGY 603
IA+ IFG +NPGGRLP+TWY ++ KIP T M +RP +N +PGRTY+F+ G V+ FGY
Sbjct: 568 IAETIFGDHNPGGRLPVTWYPQDFTKIPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGY 627
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSY+ + + ++ KL +Q Y + + + C +
Sbjct: 628 GLSYSTYSCETIPVTRN---KLYFNQSSTAHVYENTDSIRYTSVAELGKELCDSNNISIS 684
Query: 664 IEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
I V N G+M G V+++ + AG+ IKQ++ ++ V + G+SA VGF +N C+
Sbjct: 685 IRVRNDGEMAGKHSVLLFVRRLKASAGSPIKQLVAFQSVHLNGGESADVGFLLNPCEHFS 744
Query: 723 IVDNAANSLLASGAHTILVGE 743
+ ++ G H ++VG+
Sbjct: 745 GPNKDGLMVIEEGTHFLVVGD 765
>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 769
Score = 662 bits (1709), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/744 (45%), Positives = 483/744 (64%), Gaps = 28/744 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +P+CDA L P RA+ LV +TL EK+ Q+ + A GVPRLG+P Y+WWSE+LHG++
Sbjct: 35 SAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLA-- 92
Query: 70 GRRTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
++ PG +F S V AT FP VIL+TA+FN SLW+ + + V+TEA M+N G AGL
Sbjct: 93 ----DNGPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGL 148
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+W+PNIN+ RDPRWGR ET GEDP V Y++ YV+G Q EG E +++S
Sbjct: 149 TYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEEGR-------IRLS 201
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD++ WEG R+ F+++V QD+++T+ PF+ C+ E S +MC+YN+VNG
Sbjct: 202 ACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNG 261
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P CA LL +T R +W F GYI SDCD++ I E+ + + ED++A VLKAG+D++
Sbjct: 262 VPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSGEDSIAIVLKAGMDIN 319
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKNNICNP 365
CG + T A+++GKI E DID +L L+ V +RLG FD + + LG N++C
Sbjct: 320 CGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCTK 379
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H ELAAEA RQG VLLKND+ LPL ++ +A++GP AN AM G+Y G PC T+
Sbjct: 380 EHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPTT 439
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
+ G AY+ ++APGC D C + + A++AAK AD V++AGL+L+ E E DRV
Sbjct: 440 FLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDRV 499
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q LI+ +A AK P+ LV++ G VD++FAK +P+I SILW+GYPGE GG+
Sbjct: 500 SLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQV 559
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
+ +++FG+YNPGG+LPITWY ++ IP T M +R P +PGRTY+F+ G VVY FGY
Sbjct: 560 LPEILFGEYNPGGKLPITWYPESFTAIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGY 619
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQ--CRDINYTVGTNKPPCAAVLIDDV-KCKDYKF 660
GLSY+++ Y ++S+PK + + D R Y T + +V +D+ C+ F
Sbjct: 620 GLSYSKYSYSISSAPKKITVSRSSDLGIISRKPAY---TRRDGLGSVKTEDIASCEALVF 676
Query: 661 TFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
+ + V N G MDGS V+++++ + G IKQ++G+E V AAG ++ V T++ CK
Sbjct: 677 SVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASNVEITVDPCK 736
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
+ + +L GAH + VG+
Sbjct: 737 QMSAANPEGKRVLLLGAHVLTVGD 760
>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 780
Score = 661 bits (1706), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/744 (45%), Positives = 473/744 (63%), Gaps = 28/744 (3%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
FP+C+ L +RAKD+V R+TL EK+ Q+ + A +PRLG+P Y+WW+EALHGVS++G
Sbjct: 45 SFPFCNLNLTITQRAKDIVSRLTLDEKISQLVNTAPAIPRLGIPSYQWWNEALHGVSYVG 104
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
+ G + + ATSFP +IL ASF+ LW +I + + TEAR +YN G A G+T
Sbjct: 105 K------GIRLNGSITAATSFPQIILIAASFDPKLWYRISKVIGTEARGVYNAGQAQGMT 158
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
FW+PNIN+ RDPRWGR ET GEDP V +Y ++YVRGLQ + E + R LK SA
Sbjct: 159 FWAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQG-DSFEGGKLIGGR-LKASA 216
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH+ AYDL+NW+G +R+ FD++VT QD+ +T+ F CV +G S +MC+YNRVNG+
Sbjct: 217 CCKHFTAYDLENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGV 276
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CAD LL T R WNF+GYI SDCD+++ I E + T ED VA VL+AG+D++C
Sbjct: 277 PNCADYNLLTNTARKKWNFNGYIASDCDAVRFIYEKQGYAK-TPEDVVADVLRAGMDVEC 335
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQ 366
G+Y T AV Q KI + ID +L L+ + +RLG FDG+P QY +G N +C+ +
Sbjct: 336 GNYMTKHAKSAVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKE 395
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK-AMIGNYEGTPCRYTS 425
+++LA EAAR GIVLLKN LPL + TL ++GP+AN + ++GNY G PC+ S
Sbjct: 396 NLDLALEAARSGIVLLKNTASILPLP--RVNTLGVIGPNANKSSIVLLGNYFGQPCKQVS 453
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
+ GFY Y+ +Y GC D V ++ I A++ AK +D ++V GLD S E E DR
Sbjct: 454 ILKGFYTYASQTHYRSGCTDGVKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRD 513
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
L LPG Q +LIN VA A+K PV LVI+ G VDI FAKNN KI I+W GYPGE GGRA
Sbjct: 514 HLELPGKQQKLINSVAKASKKPVILVILCGGPVDITFAKNNDKIGGIIWAGYPGELGGRA 573
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
+A V+FG YNPGGRLP+TWY +++KIP T M +R P + +PGRTY+F+ GP VY FGY
Sbjct: 574 LAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGY 633
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKF 660
GLSY+ + Y S K+ +I +++ +++ N L+ ++ CK
Sbjct: 634 GLSYSNYSYNFISV-KNNNIHINQST-----THSILENSETIRYKLVSELGKKACKTMSI 687
Query: 661 TFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
+ + + N G M G V+++ KP G G +KQ++G+E V + G +VGF ++ C+
Sbjct: 688 SVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCE 747
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
L + + ++ G + LVGE
Sbjct: 748 HLSRANESGVKVIEEGGYLFLVGE 771
>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 767
Score = 660 bits (1703), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/746 (45%), Positives = 464/746 (62%), Gaps = 37/746 (4%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S + +CDA LP +RA DLV R+T EKV Q+GD A GVPRLG+P Y+WW+EALHG++
Sbjct: 34 SSYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVPRLGVPGYKWWNEALHGLATS 93
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
G+ G HFD V ATSFP V LT A+F++ LW +IGQ + EARA+YNLG A GL
Sbjct: 94 GK------GLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGREARALYNLGQAEGL 147
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T WSPN+N+ RDPRWGR ETPGEDP RYA+ +VRG+Q + + L+ S
Sbjct: 148 TMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQG---------NSTSLLQAS 198
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH AYDL++W G R++FD++VT QD+++TF PF CV +G S VMC+Y +NG
Sbjct: 199 ACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPPFRSCVVDGKASCVMCAYTGING 258
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P CA+ LL +T+RGDW GY SDCD++ + ++ ++ + EDAVA LKAGLD+D
Sbjct: 259 VPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQRYAQ-SPEDAVALALKAGLDID 317
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y A+QQGKI E DID +L+ L+ + MRLG+FDG P+ Y LG +IC
Sbjct: 318 CGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGHFDGDPRTNMYGGLGAADICTA 377
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA +AA+ GIVLLKND G LPL+ + + A++GP+AN A+I NY G PC T+
Sbjct: 378 EHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGPNANNPGALIANYFGPPCESTT 437
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ G Y K + GC+ C + AA A +D + GL E+EG+DR
Sbjct: 438 PLKGIQGYVKDARFLAGCSSTACDVATTDQAAA-LASTSDYVFLFMGLGQRQESEGRDRT 496
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q LI VADAA+ PV LV++S G VD+ FA+ NPKI +ILW GYPG+ GG A
Sbjct: 497 SLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQTNPKIGAILWAGYPGQAGGLA 556
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IA V+FG +NP GRLP+TWY + +P T M +R P N +PGR+Y+F+ G VY FGY
Sbjct: 557 IARVLFGDHNPSGRLPVTWYPEEFTNVPMTDMRMRADPANGYPGRSYRFYQGKTVYKFGY 616
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL-------IDDVKCK 656
GLSY+ + ++ SS S D+ ++ T P +L I C+
Sbjct: 617 GLSYSSYSRRLLSSGTSTPAP------NADLLASLTTTMPSAENILGSYHVEQIGAQGCE 670
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
KF +EV+N G MDG + V++Y + P AG +Q+IG+++ + AG+ A + F +
Sbjct: 671 MLKFPAVVEVQNHGPMDGKQSVLMYLRWPNATAGRPERQLIGFKKEHLKAGEKAHIKFEI 730
Query: 716 NACKSLKIVDNAANSLLASGAHTILV 741
C+ L V N ++ G+H + V
Sbjct: 731 RPCEHLSRVREDGNKVIDRGSHFLRV 756
>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
Length = 772
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/740 (44%), Positives = 457/740 (61%), Gaps = 33/740 (4%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+CD L +RA DLV R+T EK+ Q+GD A GVPRLG+P Y+WW+EALHG++ G+
Sbjct: 43 FCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLGVPGYKWWNEALHGLATSGK-- 100
Query: 74 NSPPGTHFD--SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G HFD V ATSFP V+LT A+F++ LW +IGQ + EARA++N+G A GLT
Sbjct: 101 ----GLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGREARALFNVGQAEGLTI 156
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP V RYA+ +VRG+Q +S S L+ SAC
Sbjct: 157 WSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG--------NSSSSLLQTSAC 208
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH AYDL++W G R+ F +RVT QD+++TF PF CV EG S +MC+Y +NG+P
Sbjct: 209 CKHATAYDLEDWNGVARYSFVARVTAQDLEDTFNPPFRSCVVEGKASCIMCAYTAINGVP 268
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CA+ LL T+RGDW GY+ SDCD++ + ++ ++ T EDAVA LKAGLD+DCG
Sbjct: 269 ACANTDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYA-PTPEDAVAVSLKAGLDIDCG 327
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
Y A+QQGK+ E DID +L L+ V MRLG+FDG P+ Y L +IC P+H
Sbjct: 328 SYIQQHATAAIQQGKLTELDIDKALVNLFAVRMRLGHFDGDPRKNMYGALSAADICTPEH 387
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
LA EAA+ GIVLLKND G LPL+ + + A++GP++N A+I NY G PC T+P+
Sbjct: 388 RSLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNSNDGMALIANYFGPPCESTTPL 447
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G +Y + + GC+ C + ++ A+ + + D + GL E+EGKDR L
Sbjct: 448 QGLQSYVNNVRFLAGCSSAAC-DVAVTDQAVVLSGSEDYVFLFMGLSQQQESEGKDRTSL 506
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q LI VADA+K PV LV++S G VDI FA++NPKI +ILW GYPG+ GG AIA
Sbjct: 507 LLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQAGGLAIA 566
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
V+FG +NP GRLP+TWY ++ K+P T M +R P + +PGR+Y+F+ G VY FGYGL
Sbjct: 567 KVLFGDHNPSGRLPMTWYPEDFTKVPMTDMRMRADPTSGYPGRSYRFYQGNAVYKFGYGL 626
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTF 662
SY+ F ++ + R+ G + IDD+ C+ KF
Sbjct: 627 SYSTFSSRLLYGTSMPALSSTVLAGLRETVTEEGDR-----SYHIDDIGTDGCEQLKFPA 681
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
+EV+N G MDG +++ + P G Q+IG+ + AG++A + F ++ C+
Sbjct: 682 MVEVQNHGPMDGKHSALMFLRWPNTNGGRPASQLIGFMSQHLKAGETANLRFDISPCEHF 741
Query: 722 KIVDNAANSLLASGAHTILV 741
V ++ G+H + V
Sbjct: 742 SRVRADGMKVIDIGSHFLTV 761
>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
Length = 771
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/744 (44%), Positives = 459/744 (61%), Gaps = 29/744 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S + +CDA+LP RA DLV R+T EKV Q+GD A GVPRLG+P Y+WWSE LHG+S+
Sbjct: 36 SGYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLGVPPYKWWSEGLHGLSYW 95
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
G G HF+ V TSFP V+LT A+F++ LW +IGQ + TEARA+YNLG A GL
Sbjct: 96 GH------GMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYNLGQAEGL 149
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T WSPN+N+ RDPRWGR ETPGEDP +YA+ +V+GLQ S L+ S
Sbjct: 150 TIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQG---------STPGTLQTS 200
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH AYDL+ W G R++F+++VT QD+ +TF PF+ CV + S VMC+Y +NG
Sbjct: 201 ACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDING 260
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P CA LL++T RG W GY+ SDCD++ + ++ ++ T ED VA +KAGLDL+
Sbjct: 261 VPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYA-PTPEDTVAVAIKAGLDLN 319
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICN 364
CG+Y M A+QQGK+ E+D+D +L L+ V MRLG+FDG P+ Y +LG ++C
Sbjct: 320 CGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVCT 379
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
H +LA EAA+ GIVLLKND GALPL+ +++ A++GP+AN A+ GNY G PC T
Sbjct: 380 QAHRDLALEAAQDGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCETT 439
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
+P+ G Y + + GC C + A A ++D ++ GL E EG DR
Sbjct: 440 TPLQGVQRYISSVRFLAGCDSPAC-GFAATGQAAALASSSDQVIMFMGLSQDQEKEGLDR 498
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
LLLPG Q LI VA AA+ PV LV+++ G VD+ FAKNNPKI +ILW GYPG+ GG
Sbjct: 499 TSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGL 558
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFG 602
AIA V+FG +NP GRLP+TWY + +IP T M +R P +PGR+Y+F+ G VY FG
Sbjct: 559 AIAKVLFGDHNPSGRLPVTWYPEEFTRIPMTDMRMRADPATGYPGRSYRFYQGNPVYKFG 618
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
YGLSY++F ++ ++ K + +++ I G I + C+ KF
Sbjct: 619 YGLSYSKFSRRLVAAAKPR--RPNRNLLAGVIPKPAGDGGESYHVEEIGEEGCERLKFPA 676
Query: 663 QIEVENMGKMDGSEVVMVYSK-PPGIAGTH--IKQVIGYERVFIAAGQSAKVGFTMNACK 719
+EV N G MDG V+V+ + P AG +Q++G+ + AG+ A++ +N C+
Sbjct: 677 TVEVHNHGPMDGKHSVLVFVRWPNATAGASRPARQLVGFSSQHVRAGEKARLTMEINPCE 736
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
L ++ G+H + VGE
Sbjct: 737 HLSRAREDGTKVIDRGSHFLKVGE 760
>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
Length = 771
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/745 (46%), Positives = 468/745 (62%), Gaps = 76/745 (10%)
Query: 48 VPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWK 107
+PRLG+P YEWWSEALHGVS++G PGT F + VPGATSFP ILT ASFN SL++
Sbjct: 45 LPRLGIPAYEWWSEALHGVSYVG------PGTRFSTLVPGATSFPQPILTAASFNASLFR 98
Query: 108 KIGQT------------------------------------------VSTEARAMYNLGN 125
IG++ VSTEARAM+N+G
Sbjct: 99 AIGESACNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGL 158
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
AGLTFWSPNIN+ RDPRWGR ETPGEDP + +YA+ YV GLQD G S L
Sbjct: 159 AGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGG-------GSDAL 211
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K++ACCKHY AYD+DNW+G +R+ FD+ V++QD+ +TF PF+ CV +G+V+SVMCSYN+
Sbjct: 212 KVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNK 271
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG PTCAD LL+ IRGDW +GYIVSDCDS+ + + + + EDA A +K+GL
Sbjct: 272 VNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKN-PEDAAAITIKSGL 330
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNI 362
DL+CG++ T+ AVQ GK++E+D+D ++ +IVLMRLG+FDG P+ + +LG ++
Sbjct: 331 DLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDV 390
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
C + ELA EAARQGIVLLKN GALPL+ +IK++A++GP+ANA+ MIGNYEGTPC+
Sbjct: 391 CTSSNQELAREAARQGIVLLKN-TGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCK 449
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEG 481
YT+P+ G A + Y PGC ++ C NS+ + AA AA +AD TV+V G D SVE E
Sbjct: 450 YTTPLQGLGANVATV-YQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERES 508
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR LLLPG Q +L++ VA+A++GPV LV+MS G DI+FAK++ KI +ILWVGYP
Sbjct: 509 LDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPRRS 568
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVV 598
R LP+TWY A++ K+ T M +RP +PGRTY+F+ G V
Sbjct: 569 RWRRPRRHPLRIPQ--SWLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTV 626
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
Y FG GLSYT+F + + S+P+ V ++L + C + C +V C
Sbjct: 627 YAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHAC---------HTEHCFSVEAAGEHCGSL 677
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
F + V N G M G V ++S PP + K ++G+E+V + GQ+ V F ++ C
Sbjct: 678 SFDVHLRVRNAGGMAGGHTVFLFSSPPSVHSAPAKHLLGFEKVSLEPGQAGVVAFKVDVC 737
Query: 719 KSLKIVDNAANSLLASGAHTILVGE 743
K L +VD N +A G+HT+ VG+
Sbjct: 738 KDLSVVDELGNRKVALGSHTLHVGD 762
>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
Length = 742
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/741 (44%), Positives = 467/741 (63%), Gaps = 60/741 (8%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+C KLP +R +DLV R+TL EKV Q+ D A +PRLG+P YEWWSEALHGV+
Sbjct: 44 YPFCQTKLPISQRVEDLVSRLTLDEKVSQLVDTAPAIPRLGIPAYEWWSEALHGVAL--- 100
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
+T G F+ + ATSFP VILT ASF+ LW +IGQ + EAR +YN G A G+TF
Sbjct: 101 QTTVRQGIRFNGTIRFATSFPQVILTAASFDAHLWYRIGQVIGKEARGIYNAGQATGMTF 160
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PNIN+ RDPRWGR ETPGEDP V G+YA++YVRG+Q G + + L+ SAC
Sbjct: 161 WAPNINIFRDPRWGRGQETPGEDPLVAGKYAVSYVRGVQ---GDSFGGGTLGEQLQASAC 217
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLD W+G +RF FD+ QD+ +T+ PF+ C+ EG S +MC+YNRVNG+P
Sbjct: 218 CKHFTAYDLDKWKGMNRFVFDA----QDLADTYQPPFQSCIQEGKASGIMCAYNRVNGVP 273
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CAD LL++ RG W F+GYI SDCD++ I + + + EDAVA VLKAG+D++CG
Sbjct: 274 NCADYNLLSKKARGQWGFYGYITSDCDAVAIIHDDQGYAK-SPEDAVADVLKAGMDVNCG 332
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
DY N+T AV++ K+ E++ID +L L+ + MRLG F+G+P Y N+ + +C+ +H
Sbjct: 333 DYLKNYTKSAVKKKKLPESEIDRALHNLFSIRMRLGLFNGNPTKQPYGNIAPDQVCSQEH 392
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
LA +AA+ GIVLLKN + LPL+ K+LA++GP+AN + ++GNY G PC+ +P+
Sbjct: 393 QALALKAAQDGIVLLKNPDKLLPLSKLETKSLAVIGPNANNSTKLLGNYFGPPCKTVTPL 452
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y K Y PGC+ + C + S I A+ AK AD ++V GLD + E E +DRVDL
Sbjct: 453 QGLQNYIKNTRYHPGCSRVACSSAS-INQAVKIAKGADQVILVMGLDQTQEKEEQDRVDL 511
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
+LPG Q ELI VA AAK PV LV+ G VD++FAK + I SI+W GYPGE GG A+A
Sbjct: 512 VLPGKQRELITAVAKAAKKPVVLVLFCGGPVDVSFAKYDQNIGSIIWAGYPGEAGGTALA 571
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGL 605
+IFG +NPGGRLP+TWY ++ K+P T M +RP + +PGRTY+F++G V+ FGYGL
Sbjct: 572 QIIFGDHNPGGRLPMTWYPQDFTKVPMTDMRMRPQLSSGYPGRTYRFYNGKKVFEFGYGL 631
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK---CKDYKFTF 662
SY+ + Y++AS ++ KL R + + N LI ++ C+ KFT
Sbjct: 632 SYSNYSYELASDTQN---KL----YLRASSNQITKNSNTIRHKLISNIGKELCEKTKFTV 684
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V+N G+M AG++A++ + ++ C+ L
Sbjct: 685 TVRVKNHGEM--------------------------------AGENAEIQYELSPCEHLS 712
Query: 723 IVDNAANSLLASGAHTILVGE 743
D+ ++ G+ +L+G+
Sbjct: 713 SPDDRGMMVMEEGSQFLLIGD 733
>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
[Cucumis sativus]
Length = 783
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/757 (45%), Positives = 471/757 (62%), Gaps = 39/757 (5%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
P+C LP RA+DLV R+TL EKV Q+ + +PRLG+P YEWWSEALHGV+ +G
Sbjct: 50 LPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRLGIPAYEWWSEALHGVANVGY 109
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G + + ATSFP VILT ASF+E+LW +IGQ + TEARA+YN G A G+TF
Sbjct: 110 ------GIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTF 163
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD--VEGVEYHRDSDSRPLKIS 188
W+PNIN+ RDPRWGR ETPGEDP + G+Y++ YVRG+Q +EG + LK S
Sbjct: 164 WTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEGGKL-----GNQLKAS 218
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDLD W G R+ FD++VT QDM +T+ PFE CV EG S +MC+YNRVNG
Sbjct: 219 ACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASGIMCAYNRVNG 278
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P+CAD LL T R W F+GYI SDCD++ I ++ + EDAVA VL+AG+D++
Sbjct: 279 VPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAK-IPEDAVADVLRAGMDVN 337
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y T AV+ K+ ID +LR L+ V MRLG FDG+P + +G++ +C+
Sbjct: 338 CGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQIGRDQVCSQ 397
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
QH LA +AAR+GIVLLKN LPL+ N +LA++G + N K + GNY G PC+ +
Sbjct: 398 QHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNYAGIPCKSAT 457
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P G Y K Y GC C + I A+ AK+ D V+V GLD + E E DR
Sbjct: 458 PFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKIAKSVDYVVLVMGLDQTQEREDFDRT 516
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+L LPG Q +LI +VA AAK PV LVI+S G VDI+ AK N KI SILW GYPG+ GG A
Sbjct: 517 ELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVDISSAKYNEKIGSILWAGYPGQAGGTA 576
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IA++IFG +NPGGRLP+TWY +++K P T M +R +PGRTY+F++GP VY FGY
Sbjct: 577 IAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRMRADSSTGYPGRTYRFYNGPKVYEFGY 636
Query: 604 GLSYTQFKYKVASSPKSVDI----KLDKDQQCRD-INYTVGTNKPPCAAVLIDDVKCKDY 658
GLSY+ Y+ S +S + K + + D ++Y + + +D C+
Sbjct: 637 GLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRLVSE--------LDKKFCESK 688
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
+ V N G+M G V+++ KP I G+ +KQ++G+++V I AG+ ++ F ++
Sbjct: 689 TVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLVGFKKVEINAGERREIEFLVSP 748
Query: 718 CKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
C + ++ G+++++VG+ V PL +
Sbjct: 749 CDHISKASEEGLMIIEEGSYSLVVGD----VEHPLDI 781
>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 775
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/750 (44%), Positives = 462/750 (61%), Gaps = 28/750 (3%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+CD LP RA DLV R+T+ EKV Q+GD A GVPRLG+P Y+WWSE LHG++F G
Sbjct: 41 YPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVPPYKWWSEGLHGLAFWGH 100
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G F+ V TSFP V+LTTASF+ESLW +IGQ + EARA+YNLG A GLT
Sbjct: 101 ------GMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGREARALYNLGQAEGLTI 154
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP V +YA+ +VRG+Q + + PL+ SAC
Sbjct: 155 WSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSNPAG----AAAAPLQASAC 210
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH AYDL++W G R++FD+RVT QD+ +TF PF+ CV +G S VMC+Y +NG+P
Sbjct: 211 CKHATAYDLEDWNGVARYNFDARVTLQDLADTFNPPFQSCVVDGKASCVMCAYTVINGVP 270
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CA LL +T RG W GY+ SDCD++ + ++ ++ T ED VA LKAGLDL+CG
Sbjct: 271 ACASSDLLTKTFRGAWGLDGYVSSDCDAVAIMRDAQRY-EPTPEDTVAVALKAGLDLNCG 329
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQ 366
Y M A+QQGK+ E D+D +L L+ V MRLG+FDG P+ Y LG ++C
Sbjct: 330 TYTQQHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGRLGAADVCTAD 389
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA EAA+ GIVLLKND G LPL+ + + A++G +AN + GNY G C T+P
Sbjct: 390 HKNLALEAAQDGIVLLKNDAGILPLDRSAVGSAAVIGHNANDPLVLSGNYFGPACETTTP 449
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
++G +Y + + + GC+ C + A A +A+ + GL E EG DR
Sbjct: 450 LEGLQSYVRNVRFLAGCSSAAC-GYAATGQAAALASSAEYVFLFMGLSQDQEKEGLDRTS 508
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q L+ VA AAK PV LV+++ G VDI FA++NPKI +ILW GYPG+ GG AI
Sbjct: 509 LLLPGKQQSLVTAVASAAKRPVVLVLLTGGPVDITFAQSNPKIGAILWAGYPGQAGGLAI 568
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
A V+FG +NP GRLP+TWY ++ K+P T M +R P +PGRTY+F+ G +Y FGYG
Sbjct: 569 ARVLFGDHNPSGRLPVTWYTEDFTKVPMTDMRMRADPATGYPGRTYRFYRGKTIYKFGYG 628
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD---VKCKDYKFT 661
LSY++F ++ + K++ + + T + +DD V C+ KF
Sbjct: 629 LSYSKFSRQLVTGDKNL-----APNTSLLAHLSAKTQHAATSYYHVDDIGTVGCEQLKFP 683
Query: 662 FQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
++EV N G MDG V+++ + P G ++Q+IG+ I AG+ A V F ++ C+
Sbjct: 684 AEVEVLNHGPMDGKHSVLMFLRWPNATDGRPVRQLIGFRSQHIKAGEKANVRFHVSPCEH 743
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSF 750
++ G+H ++VG+ +SF
Sbjct: 744 FSRTRADGKKVIDRGSHFLMVGKEELEISF 773
>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 781
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/747 (45%), Positives = 462/747 (61%), Gaps = 44/747 (5%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +CDA LP +RA DLV R+T EKV Q+GD A GVPRLG+P Y+WW+EALHG++ G+
Sbjct: 51 YAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVPRLGVPAYKWWNEALHGLATSGK 110
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G HF+ V ATSFP V LT A+F++ LW +IGQ + EARA+YN+G A GLT
Sbjct: 111 ------GLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGREARALYNVGQAEGLTM 164
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP RY + +V+GLQ + S L+ SAC
Sbjct: 165 WSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQG-------NSTSSSLLQTSAC 217
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH AYDL++W G R++FD+RVT QD+++T+ PF CV +G S VMC+Y +NG+P
Sbjct: 218 CKHATAYDLEDWGGVARYNFDARVTAQDLEDTYNPPFRSCVVDGKASCVMCAYTAINGVP 277
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CA+ LL T+R DW GY+ SDCD++ + ++ ++ T EDAVA LKAGLD+DCG
Sbjct: 278 ACANSGLLTNTVRADWGLDGYVASDCDAVAIMRDAQRYA-PTPEDAVALALKAGLDIDCG 336
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
Y A+QQGKI E D+D +L+ L+ + MRLG+FDG P+ Y L +IC P+H
Sbjct: 337 TYMQQHAPAALQQGKITEDDVDKALKNLFAIRMRLGHFDGDPRANIYGGLNAAHICTPEH 396
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
LA EAA+ GIVLLKND G LPL+ I + A++GP+AN +IGNY G PC +P+
Sbjct: 397 RSLALEAAQDGIVLLKNDAGILPLDRAAIASAAVIGPNANNPGLLIGNYFGPPCESVTPL 456
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y K + + GC C AA A ++D ++ GL E+EG+DR L
Sbjct: 457 KGVQGYVKDVRFMAGCGSAACDVADTDQAAT-LAGSSDYVLLFMGLSQQQESEGRDRTSL 515
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q LI VADAAK PV LV+++ G VD+ FAKNNPKI +ILW GYPG+ GG AIA
Sbjct: 516 LLPGQQQSLITAVADAAKRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGLAIA 575
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
V+FG +NPGGRLP+TWY + K+P T M +R P +PGR+Y+F+ G VY FGYGL
Sbjct: 576 RVLFGDHNPGGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGETVYKFGYGL 635
Query: 606 SYTQFKYKVA--SSPKSVDIKLDKDQQCRDINYTVGTNKPPC-----AAVLIDDVK---C 655
SY+ + ++ +P + D+ + T P A+ ++ + C
Sbjct: 636 SYSSYSRRLLSSGTPNT------------DLLAGLSTMPTPAEEGGVASYHVEHIGARGC 683
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+ KF +EVEN G MDG V++Y + AG KQ+IG+ R + AG+ A + F
Sbjct: 684 EQLKFPAVVEVENHGPMDGKHSVLMYLRWANATAGRPAKQLIGFRRQHLKAGEKASLTFD 743
Query: 715 MNACKSLKIVDNAANSLLASGAHTILV 741
++ C+ V N ++ G+H ++V
Sbjct: 744 ISPCEHFSRVRKDGNKVVDRGSHFLMV 770
>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
Length = 783
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/757 (45%), Positives = 471/757 (62%), Gaps = 39/757 (5%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
P+C LP RA+DLV R+TL EKV Q+ + +PRLG+P YEWWSEALHGV+ +G
Sbjct: 50 LPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRLGIPAYEWWSEALHGVANVGY 109
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G + + ATSFP VILT ASF+E+LW +IGQ + TEARA+YN G A G+TF
Sbjct: 110 ------GIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEARAVYNAGQAKGMTF 163
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD--VEGVEYHRDSDSRPLKIS 188
W+PNIN+ RDPRWGR ETPGEDP + G+Y++ YVRG+Q +EG + LK S
Sbjct: 164 WTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGDAIEGGKL-----GNQLKAS 218
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ AYDLD W G R+ FD++VT QDM +T+ PFE CV EG S +MC+YNRVNG
Sbjct: 219 ACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFESCVEEGKASGIMCAYNRVNG 278
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P+CAD LL T R W F+GYI SDCD++ I ++ + EDAVA VL+AG+D++
Sbjct: 279 VPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGYAK-IPEDAVADVLRAGMDVN 337
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG Y T AV+ K+ ID +LR L+ V MRLG FDG+P + +G++ +C+
Sbjct: 338 CGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFDGNPTKLPFGQIGRDQVCSQ 397
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
QH LA +AAR+GIVLLKN LPL+ N +LA++G + N K + GNY G PC+ +
Sbjct: 398 QHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNGNDPKTLRGNYAGIPCKSAT 457
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P G Y K Y GC C + I A+ AK+ D V+V GLD + E E DR
Sbjct: 458 PFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKIAKSVDYVVLVMGLDQTQEREDFDRT 516
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+L LPG Q +LI +VA AAK PV LVI+S G VDI+ AK N KI SILW GYPG+ GG A
Sbjct: 517 ELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVDISSAKYNEKIGSILWAGYPGQAGGTA 576
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IA++IFG +NPGGRLP+TWY +++K P T M +R +PGRTY+F++GP VY FGY
Sbjct: 577 IAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRMRADSSTGYPGRTYRFYNGPKVYEFGY 636
Query: 604 GLSYTQFKYKVASSPKSVDI----KLDKDQQCRD-INYTVGTNKPPCAAVLIDDVKCKDY 658
GLSY+ Y+ S +S + K + + D ++Y + + +D C+
Sbjct: 637 GLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRLVSE--------LDKKFCESK 688
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
+ V N G+M G V+++ KP I G+ +KQ++G+++V I AG+ ++ F ++
Sbjct: 689 TVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLVGFKKVEINAGERREIEFLVSP 748
Query: 718 CKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
C + ++ G+++++VG+ V PL +
Sbjct: 749 CDHISKASEEGLMIIEEGSYSLVVGD----VEHPLDI 781
>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 656 bits (1693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/679 (51%), Positives = 454/679 (66%), Gaps = 52/679 (7%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
L+ +C+ + +R DLV+R+TL EK+ + + A V RLG+P YEWWSEALHGVS
Sbjct: 48 SLASLGFCNTSIGINDRVVDLVKRLTLQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVS 107
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIG-----QTVSTEARAMYN 122
++G PGTHF +V GATSFP VILT ASFN SL++ IG Q VSTEARAMYN
Sbjct: 108 YVG------PGTHFSDDVAGATSFPQVILTAASFNTSLFEAIGKVYYTQVVSTEARAMYN 161
Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
+G AGLTFWSPNIN+ RDPRWGR ETPGEDP + +Y YV+GLQ + D D
Sbjct: 162 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVKGLQQRD------DGDP 215
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRV-TEQDMQETFILPFEMCVNEGDVSSVMC 241
LK++ACCKHY AYDLDNW+G+DR+HF++ V T+QDM +TF PF+ CV +G+V+SVMC
Sbjct: 216 DKLKVAACCKHYTAYDLDNWKGSDRYHFNAVVVTKQDMDDTFQPPFKSCVIDGNVASVMC 275
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGY-------IVSDCDSIQTIVESHKFLNDTKE 294
SYN+VNG PTCADP LL+ IRG+WN +GY IV+DCDS+ +S + +E
Sbjct: 276 SYNQVNGKPTCADPDLLSGVIRGEWNLNGYQWGCCRYIVTDCDSLDVFYKSQNYTKTPEE 335
Query: 295 DAVARVLKA-----GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
A A +L G+DL+CG + T AV+ G + E ID ++ + LMRLG+FD
Sbjct: 336 AAAAAILAGNSLVTGVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFD 395
Query: 350 GSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
G P Y LG ++C ++ ELA EAARQGIVLLKN G+LPL+ IK LA++GP+A
Sbjct: 396 GDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNA 455
Query: 407 NATKAMIGNYEG-TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNAD 465
N TK MIGNYEG TPC+YT+P+ G A S Y PGC+++ C + + + A A AD
Sbjct: 456 NVTKTMIGNYEGGTPCKYTTPLQGLAA-SVATTYLPGCSNVAC-STAQVDDAKKLAAAAD 513
Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
ATV+V G DLS+EAE +DRVD+LLPG Q LI VA+ + GPV LVIMS G +D++FA+
Sbjct: 514 ATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFART 573
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPG----GRLPITWYEANYV-KIPYTSMPLR 580
N KI SILWVGYPGE GG AIAD+IFG YNP GRLP+TWY +YV K+P T+M +R
Sbjct: 574 NDKITSILWVGYPGEAGGAAIADIIFGYYNPSTHQPGRLPMTWYPQSYVDKVPMTNMNMR 633
Query: 581 --PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
P N +PGRTY+F+ G VY FG GLSY+QF +++ +P+ V + L++ C
Sbjct: 634 PDPSNGYPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAPQLVYVPLEESHVC------- 686
Query: 639 GTNKPPCAAVLIDDVKCKD 657
+ C +V+ + C++
Sbjct: 687 --HSSECQSVVASEQTCQN 703
>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 778
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/748 (44%), Positives = 470/748 (62%), Gaps = 36/748 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C+ KLP +RA+DLV R+TL EK+ Q+ + A +PRLG+P Y+WWSEALHGV+ G
Sbjct: 42 YSFCNTKLPITKRAQDLVSRLTLDEKLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGF 101
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G F+ + ATSFP VILT ASF+ +LW +I +T+ EARA+YN G A G+TF
Sbjct: 102 ------GIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGREARAVYNAGQATGMTF 155
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PNINV RDPRWGR ET GEDP + +Y + YVRGLQ G + + L+ SAC
Sbjct: 156 WAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQ---GDSFEGGKLAERLQASAC 212
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLD W+G DRF FD+RVT QD+ +T+ PF+ C+ +G S +MC+YNRVNG+P
Sbjct: 213 CKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQPPFQSCIEQGRASGIMCAYNRVNGVP 272
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CAD LL +T R W F GYI SDC ++ I E + T EDA+A V +AG+D++CG
Sbjct: 273 NCADFNLLTKTARQQWKFDGYITSDCGAVSIIHEKQGYAK-TAEDAIADVFRAGMDVECG 331
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
DY T AV Q K+ + ID +L+ L+ + +RLG FDG+P + +G N +C+ Q
Sbjct: 332 DYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRLGLFDGNPTKLPFGTIGPNEVCSKQS 391
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT-KAMIGNYEGTPCRYTSP 426
++LA EAAR GIVLLKN N LPL N T+AL+GP+ANA+ K +GNY G PC +
Sbjct: 392 LQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANASSKVFLGNYYGRPCNLVTL 450
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ GF Y+K + Y PGC D + I A++ AK D V+V GLD S E E DR
Sbjct: 451 LQGFEGYAKTV-YHPGCDDGPQCAYAQIEEAVEVAKKVDYVVLVMGLDQSQERESHDREY 509
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
L LPG Q ELI VA AAK PV +V++ G VDI AK + K+ ILW GYPGE GG A+
Sbjct: 510 LGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDITSAKFDDKVGGILWAGYPGELGGVAL 569
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
A V+FG +NPGG+LPITWY +++K+P T M +R P + +PGRTY+F+ GP VY FGYG
Sbjct: 570 AQVVFGDHNPGGKLPITWYPKDFIKVPMTDMRMRADPASGYPGRTYRFYTGPKVYEFGYG 629
Query: 605 LSYTQFKYKVAS-SPKSVDIKLDK----DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
LSYT++ YK+ S S ++ I Q I Y + + + + C+
Sbjct: 630 LSYTKYSYKLLSLSHSTLHINQSSTHLMTQNSETIRYKLVSE--------LAEETCQTML 681
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIA----GTHIKQVIGYERVFIAAGQSAKVGFTM 715
+ + V N G + G V+++ + + G +KQ++G++ V + AG++ +VGF +
Sbjct: 682 LSIALGVTNRGNLAGKHPVLLFVRQGKVRNINNGNPVKQLVGFQSVKVNAGETVQVGFEL 741
Query: 716 NACKSLKIVDNAANSLLASGAHTILVGE 743
+ C+ L + + A + ++ G++ +VG+
Sbjct: 742 SPCEHLSVANEAGSMVIEEGSYLFIVGD 769
>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 776
Score = 655 bits (1691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/746 (44%), Positives = 471/746 (63%), Gaps = 33/746 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+C+ +LP +RA+DLV R+TL EK+ Q+ + A +PRLG+P Y+WWSEALHGV+ G
Sbjct: 41 YPFCNTRLPISKRAQDLVSRLTLDEKLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGF 100
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G F+ + ATSFP VILT ASF+ +LW +I +T+ EARA+YN G A G+TF
Sbjct: 101 ------GIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGKEARAVYNAGQATGMTF 154
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PNINV RDPRWGR ET GEDP + +Y + YVRGLQ G + L+ SAC
Sbjct: 155 WAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQ---GDSFEGGKLGERLQASAC 211
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLD+W+G DRF +D+RVT QD+ +T+ PF+ C+ +G S +MC+YNRVNG+P
Sbjct: 212 CKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQPPFQSCIEQGRASGIMCAYNRVNGVP 271
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CA+ LL +T R W F GYI SDC ++ +I+ + T EDA+A V +AG+D++CG
Sbjct: 272 NCANFNLLTKTARQQWKFDGYITSDCGAV-SIIHDEQGYAKTAEDAIADVFRAGMDVECG 330
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQH 367
DY T AV Q K+ + ID +L+ L+ + +RLG DG+P + +G + +C+ Q
Sbjct: 331 DYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLGLLDGNPTKLPFGTIGPDQVCSKQS 390
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT-KAMIGNYEGTPCRYTSP 426
++LA EAAR GIVLLKN N LPL N T+AL+GP+ANA+ K +GNY G PC +
Sbjct: 391 LQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANASSKVFLGNYYGRPCNLVTL 449
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ GF Y+K Y PGC D + I A++ AK D V+V GLD S E E DR
Sbjct: 450 LQGFEGYAKDTVYHPGCDDGPQCAYAQIEGAVEVAKKVDYVVLVMGLDQSQERESHDREY 509
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
L LPG Q ELI VA A+K PV LV++ G VDI AK + K+ ILW GYPGE GG A+
Sbjct: 510 LGLPGKQEELIKSVARASKRPVVLVLLCGGPVDITSAKFDDKVGGILWAGYPGELGGVAL 569
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
A V+FG +NPGG+LPITWY +++K+P T M +R P + +PGRTY+F+ GP VY FGYG
Sbjct: 570 AQVVFGDHNPGGKLPITWYPKDFIKVPMTDMRMRADPASGYPGRTYRFYTGPKVYEFGYG 629
Query: 605 LSYTQFKYKVAS-SPKSVDIKLDK----DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
LSYT++ YK+ S S ++ I Q I Y + + + + C+
Sbjct: 630 LSYTKYSYKLLSLSHNTLHINQSSTHLTTQNSETIRYKLVSE--------LAEETCQTML 681
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIA--GTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
+ + V N G M G V+++ + + G +KQ++G++ V + AG++ +VGF ++
Sbjct: 682 LSIALGVTNHGNMAGKHPVLLFVRQGKVRNNGNPVKQLVGFQSVKLNAGETVQVGFELSP 741
Query: 718 CKSLKIVDNAANSLLASGAHTILVGE 743
C+ L + + A + ++ G++ +LVG+
Sbjct: 742 CEHLSVANEAGSMVIEEGSYLLLVGD 767
>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
Length = 771
Score = 655 bits (1689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/742 (44%), Positives = 458/742 (61%), Gaps = 29/742 (3%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +CDA+LP RA DLV R+T EKV Q+GD A GV RLG+P Y+WWSE LHG+S+ G
Sbjct: 38 YAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLGVPPYKWWSEGLHGLSYWGH 97
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G HF+ V TSFP V+LT A+F++ LW +IGQ + TEARA+YNLG A GLT
Sbjct: 98 ------GMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYNLGQAEGLTI 151
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP +YA+ +V+GLQ S L+ SAC
Sbjct: 152 WSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQG---------STPGTLQTSAC 202
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH AYDL+ W G R++F+++VT QD+ +TF PF+ CV + S VMC+Y +NG+P
Sbjct: 203 CKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYTDINGVP 262
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CA LL++T RG W GY+ SDCD++ + ++ ++ T ED VA +KAGLDL+CG
Sbjct: 263 ACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYA-PTPEDTVAVAIKAGLDLNCG 321
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQ 366
+Y M A+QQGK+ E+D+D +L L+ V MRLG+FDG P+ Y +LG ++C
Sbjct: 322 NYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAADVCTQA 381
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H +LA EAA+ GIVLLKND GALPL+ +++ A++GP+AN A+ GNY G PC T+P
Sbjct: 382 HRDLALEAAQNGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPCETTTP 441
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G Y + + GC C + A A ++D ++ GL E EG DR
Sbjct: 442 LQGVQRYISSVRFLAGCDSPAC-GFAATGQAAALASSSDQVIMFMGLSQDQEKEGLDRTS 500
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q LI VA AA+ PV LV+++ G VD+ FAKNNPKI +ILW GYPG+ GG AI
Sbjct: 501 LLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAGGLAI 560
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
A V+FG +NP GRLP+TWY + +IP T M +R P +PGR+Y+F+ G VY FGYG
Sbjct: 561 AKVLFGDHNPSGRLPVTWYPEEFTRIPMTDMRMRADPATGYPGRSYRFYQGNPVYKFGYG 620
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSY++F ++ ++ K + +++ I G I + C+ KF +
Sbjct: 621 LSYSKFTRRLVAAAKPR--RPNRNLLAGVIPKPAGDGGESYHVEEIGEEGCERLKFPATV 678
Query: 665 EVENMGKMDGSEVVMVYSK-PPGIAGTH--IKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
EV N G MDG V+V+ + P AG +Q++G+ + AG+ A++ +N C+ L
Sbjct: 679 EVHNHGPMDGKHSVLVFVQWPNATAGASRPARQLVGFSSQHVRAGEKARLTMEINPCEHL 738
Query: 722 KIVDNAANSLLASGAHTILVGE 743
+ ++ G+H + VGE
Sbjct: 739 SRARDDGTKVIDRGSHFLKVGE 760
>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 772
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/746 (44%), Positives = 474/746 (63%), Gaps = 26/746 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ + +CD LP+P RA+ LV +TL EK+ Q+ + A GVPRLG+P YEWWSE+LHG++
Sbjct: 36 NSYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYEWWSESLHGLA-- 93
Query: 70 GRRTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
++ PG +F S V AT FP VIL+ A+FN SLW+ + + V+ EARAM+N G AGL
Sbjct: 94 ----DNGPGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHNAGQAGL 149
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+W+PNINV RDPRWGR ETPGEDP ++ Y++ YV+G Q EY + R + +S
Sbjct: 150 TYWAPNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQG----EYGDGREGR-MMLS 204
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDL+ W R+ F++ V QD ++T+ PF+ C+ EG S +MCSYN+VNG
Sbjct: 205 ACCKHYIAYDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNG 264
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P CA LL Q IR +W F GYIVSDCD++ I E+ + + ED+VA VLKAG+D++
Sbjct: 265 VPACARKDLL-QKIRDEWGFKGYIVSDCDAVAIIHENQTY-TSSDEDSVAIVLKAGMDVN 322
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T A+++GKI E DI+ +L L+ V +RLG F+ + + + LG +N+C
Sbjct: 323 CGSFLIRHTKSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKANENQWFTRLGPSNVCTK 382
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H ELAAEA RQG VLLKNDN LPL + +AL+G AN M G+Y G PC +
Sbjct: 383 EHRELAAEAVRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDAYIMGGDYTGVPCDPIT 442
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
+ G A+ A GC D+ C + AI+AAK AD V++AGL+L+ E+E DRV
Sbjct: 443 FLKGMQAFVPQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVVIAGLNLTQESEDLDRV 502
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q +L+N +A K P+ LVI G VD+ FAK +P+I S+LW+GYPGE GG+
Sbjct: 503 TLLLPGRQQDLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRIASVLWIGYPGEVGGQV 562
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
+ +++FG+YNPGG+LP+TWY ++ +P M +R P +PGRTY+F+ G VVY FGY
Sbjct: 563 LPEILFGEYNPGGKLPMTWYPESFTAVPMNDMNMRADPSRGYPGRTYRFYTGEVVYGFGY 622
Query: 604 GLSYTQFKYKVASSPKSVDIKLD--KDQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKF 660
GLSY+++ Y + +P+ + + R Y T + V ++D+ C+ F
Sbjct: 623 GLSYSKYSYNIVQAPQRISLSHSPVPGLISRKPAY---TRRDGLDYVQVEDIASCESLVF 679
Query: 661 TFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
+ I V N G MDGS V+++++ + G +KQ++G+ERV+ AAG S V T++ CK
Sbjct: 680 SVHISVANDGAMDGSHAVLLFARSKSSVPGFPLKQLVGFERVYTAAGSSKNVAITVDPCK 739
Query: 720 SLKIVDNAANSLLASGAHTILVGEGV 745
+ + +L G+H ++VG+ V
Sbjct: 740 YMSAANTEGRRVLLLGSHHLMVGDEV 765
>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 784
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/748 (45%), Positives = 477/748 (63%), Gaps = 30/748 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FP+C+ + +R +DL+ R+T+ EK++Q+ + A V RLG+P Y+WW E LHGV+
Sbjct: 32 FPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAANVSRLGIPPYQWWGEGLHGVAI--- 88
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
P +F P ATSFP L+ S+N +LW KIGQ VSTE RAMYN G +GLT+W
Sbjct: 89 ----SPSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVVSTEGRAMYNQGRSGLTYW 144
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR---PLKIS 188
SPNIN+ RDPRWGR ETPGEDP + YA+++V+GLQ+ + + + SR LKIS
Sbjct: 145 SPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDYDQNQPQAVSRGPRRLKIS 204
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ A+DLD W+ DR HFDS+VT+QD+++T+ F+ CV EG SSVMCSYNR+NG
Sbjct: 205 ACCKHFTAHDLDRWKDYDRDHFDSKVTQQDLEDTYNPSFKSCVKEGQSSSVMCSYNRLNG 264
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN--DTKEDAVARVLKAGLD 306
IP C +LL T+R W F GYIVSDCD++ I H ++N T EDAV+ V+ AG+D
Sbjct: 265 IPMCTHYELLTLTVRNQWGFDGYIVSDCDAVALI---HDYINYAPTSEDAVSYVMLAGMD 321
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNIC 363
L+CG + A+ + I E ID LR L+ V MRLG FDG+P Y +LG ++C
Sbjct: 322 LNCGSTTLVHGLAALDKKLIWEGLIDMHLRNLFRVRMRLGMFDGNPSTLPYGSLGPEDMC 381
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+ LA EAARQ +VLLKN+ ALP + LA++G HA+AT+ M+GNYEG PC++
Sbjct: 382 TEDNQHLALEAARQSLVLLKNEKNALPWKKTHGLKLAVIGHHADATREMLGNYEGYPCKF 441
Query: 424 TSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
SP+ GF +S I++ GC+D C++ I AA +AA ADA V+V G+ + E
Sbjct: 442 VSPLQGFAKVLSDHSPRISHERGCSDAACEDQFYIYAAKEAAAQADAVVLVLGISQAQEK 501
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
EG+DR LLLPG Q EL++ V +A+ G PV LV++S +D++FA ++P+I+SI+W GYP
Sbjct: 502 EGRDRDSLLLPGRQMELVSSVVEASAGRPVVLVLLSGSPLDVSFANDDPRIQSIIWAGYP 561
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP--VNNFPGRTYKFFDGP 596
G+ GG AIA+ IFG NPGGRL +WY NY I ++M +RP +PGRTY+FF
Sbjct: 562 GQSGGEAIAEAIFGLVNPGGRLAQSWYYENYTNIDMSNMNMRPNASTGYPGRTYRFFTDT 621
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
++ FG+GLSY+ FKY + S+P+S+ + Q C + V T+ C + + CK
Sbjct: 622 PLWEFGHGLSYSDFKYTMVSAPQSIMAPHLRYQLCSS-DRAVMTSDLNC--LHYEKEACK 678
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+ F ++ V N G + G V+++SKPP GI G +KQ++ +ERV + AG ++ F
Sbjct: 679 ESSFHVRVWVINHGPLSGDHSVLLFSKPPSRGIDGIPLKQLVSFERVHLEAGAGQEILFK 738
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
+N C+ L V + + G HT++VG
Sbjct: 739 VNPCEDLGTVGDDGIRTVELGEHTLMVG 766
>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
Length = 635
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/646 (48%), Positives = 430/646 (66%), Gaps = 28/646 (4%)
Query: 111 QTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
Q VS EARAM+N G AGLT+WSPN+N+ RDPRWGR ETPGEDP VVG+YA +YVRGLQ
Sbjct: 2 QVVSDEARAMFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQG 61
Query: 171 VEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMC 230
SD LK++ACCKH+ AYDLDNW G DRFHF++ V++QDM++TF +PF MC
Sbjct: 62 ---------SDGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAEVSKQDMEDTFDVPFRMC 112
Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
V EG V+SVMCSYN+VNGIPTCADP LL +T+RG + ++ I+ S+ L
Sbjct: 113 VKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGT------LFQTVTLLEFIMGSNTILQ 166
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
++ + +A LDLDCG + T AV++G + EA+I+ +L V MRLG FDG
Sbjct: 167 PRRKQPRMLLKQASLDLDCGPFLGQHTEDAVKKGLLNEAEINNALLNTLTVQMRLGMFDG 226
Query: 351 SPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
P Y NLG N++C P H ELA EAARQGIVLLKN +LPL+T ++A+VGP++N
Sbjct: 227 EPSSQLYGNLGPNDVCTPAHQELALEAARQGIVLLKNHGPSLPLSTRRHLSVAIVGPNSN 286
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADAT 467
T MIGNY G C YT+P+ G Y++ I + GCAD+ C ++ AAIDAA+ ADAT
Sbjct: 287 VTATMIGNYAGLACGYTTPLQGIQRYAQTI-HRQGCADVACVSDQQFSAAIDAARQADAT 345
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
V+V GLD S+EAE +DR LLLPG Q EL++KVA A+KGP LV+MS G +D++FA+N+P
Sbjct: 346 VLVMGLDQSIEAEFRDRTGLLLPGRQQELVSKVAAASKGPTILVLMSGGPIDVSFAENDP 405
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN-- 584
KI SI+W GYPG+ GG AI+DV+FG NPGG+LP+TWY +Y+ +P T+M +R +
Sbjct: 406 KIGSIVWAGYPGQAGGAAISDVLFGITNPGGKLPMTWYPQDYITNLPMTNMAMRSSKSKG 465
Query: 585 FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
+PGRTY+F+ G VVYPFG+G+SYT F + +AS+P V + LD + + G
Sbjct: 466 YPGRTYRFYKGKVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHR------HGSGNATIS 519
Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
A+ + +C Q++V+N G MDG+ ++VYS+PP KQ++ +E+V +A
Sbjct: 520 GKAIRVTHARCNRLSLGMQVDVKNTGSMDGTHTLLVYSRPPARHWAPHKQLVAFEKVHVA 579
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
AG +VG ++ CKSL +VD + + G H++ +G+ VS
Sbjct: 580 AGTQQRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGDVKHSVSL 625
>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
Length = 764
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/754 (43%), Positives = 458/754 (60%), Gaps = 39/754 (5%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+CDA L +RA DLV +TL EKV Q+GD A GV RLG+P YEWWSE LHG+S GR
Sbjct: 31 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGR-- 88
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
G F+ V TSFP VILT A+F+ LW+++G+ V EARA+YNLG A GLT WS
Sbjct: 89 ----GIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWS 144
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PN+N+ RDPRWGR ETPGEDP RYA+ +V GLQ + G + SACCK
Sbjct: 145 PNVNIFRDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------------EASACCK 192
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H AYDLD W R+++DS+VT QD+++T+ PF+ CV EG + +MC YN +NG+P C
Sbjct: 193 HATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPAC 252
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
A LL + +R +W +GY+ SDCD++ TI ++H + + ED VA +K G+D++CG+Y
Sbjct: 253 ASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHY-TLSPEDTVAVSIKVGMDVNCGNY 311
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHI 368
M AVQ+G + E DID +L L+ V MRLG+FDG P+ Y +LG ++C+P H
Sbjct: 312 TQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHK 371
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
LA EAA+ GIVLLKND GALPL + +LA++GP+A+ A+ GNY G PC T+P+
Sbjct: 372 SLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQ 431
Query: 429 GFYAY-SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y + GC C + A A ++D V+ GL E +G DR L
Sbjct: 432 GIKGYLGDRARFLAGCDSPACAVAATN-EAAALASSSDHVVLFMGLSQKQEQDGLDRTSL 490
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q LI VA+AA+ PV LV+++ G VD+ FAK+NPKI +ILW GYPG+ GG AIA
Sbjct: 491 LLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIA 550
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
V+FG +NP GRLP+TWY + K+P T M +R P +PGR+Y+F+ G VY FGYGL
Sbjct: 551 KVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGL 610
Query: 606 SYTQFKYKVASS---PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYK 659
SY++F ++ SS + ++ L R G + ++ L+ ++ +C
Sbjct: 611 SYSKFSRRMFSSFSTSNAGNLSLLAGVMAR----RAGDDGGGMSSYLVKEIGVERCSRLV 666
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNAC 718
F +EV+N G MDG V++Y + P +G +Q+IG+ + G+ A V F ++ C
Sbjct: 667 FPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPC 726
Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSFPL 752
+ V ++ GAH ++VG+ SF L
Sbjct: 727 EHFSWVGEDGERVIDGGAHFLMVGDEELETSFGL 760
>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
Length = 839
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/646 (49%), Positives = 438/646 (67%), Gaps = 26/646 (4%)
Query: 105 LWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINY 164
++ I VSTEARAM+N+G AGLTFWSPNIN+ RDPRWGR ETPGEDP + +YA+ Y
Sbjct: 204 MYNLIVLVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGY 263
Query: 165 VRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFI 224
V GLQD G S LK++ACCKHY AYD+DNW+G +R+ FD+ V++QD+ +TF
Sbjct: 264 VTGLQDAGG-------GSDALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQ 316
Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
PF+ CV +G+V+SVMCSYN+VNG PTCAD LL+ IRGDW +GYIVSDCDS+ +
Sbjct: 317 PPFKSCVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYN 376
Query: 285 SHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMR 344
+ + + EDA A +K+GLDL+CG++ T+ AVQ GK++E+D+D ++ +IVLMR
Sbjct: 377 NQHYTKN-PEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMR 435
Query: 345 LGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLAL 401
LG+FDG P+ + +LG ++C + ELA EAARQGIVLLKN GALPL+ +IK++A+
Sbjct: 436 LGFFDGDPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKN-TGALPLSAKSIKSMAV 494
Query: 402 VGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDA 460
+GP+ANA+ MIGNYEGTPC+YT+P+ G A + Y PGC ++ C NS+ + AA A
Sbjct: 495 IGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATV-YQPGCTNVGCSGNSLQLSAATQA 553
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A +AD TV+V G D SVE E DR LLLPG Q +L++ VA+A++GPV LV+MS G DI
Sbjct: 554 AASADVTVLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDI 613
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPL 579
+FAK++ KI +ILWVGYPGE GG A+AD++FG +NPGGRLP+TWY A++ K+ T M +
Sbjct: 614 SFAKSSDKISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRM 673
Query: 580 RP--VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
RP +PGRTY+F+ G VY FG GLSYT+F + + S+P+ V ++L + C
Sbjct: 674 RPDSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHAC------ 727
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIG 697
+ C +V C F + V N G M G V ++S PP + K ++G
Sbjct: 728 ---HTEHCFSVEAAGEHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPPSVHSAPAKHLLG 784
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+E+V + GQ+ V F ++ CK L +VD N +A G+HT+ VG+
Sbjct: 785 FEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 830
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 69/103 (66%), Gaps = 6/103 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
+S + +CD RA DL+ R+TL EKV + + +PRLG+P YEWWSEALHGVS+
Sbjct: 40 VSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYEWWSEALHGVSY 99
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQ 111
+G PGT F + VPGATSFP ILT ASFN SL++ IG+
Sbjct: 100 VG------PGTRFSTLVPGATSFPQPILTAASFNASLFRAIGE 136
>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
Length = 785
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/747 (45%), Positives = 469/747 (62%), Gaps = 33/747 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C+ L +RAKD+V R+TL EK+ Q+ + A +PRLG+ Y+WWSEALHGV+ G+
Sbjct: 48 YTFCNLNLTTIQRAKDIVSRLTLDEKLAQLVNTAPAIPRLGIHSYQWWSEALHGVADYGK 107
Query: 72 --RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GL 128
R N + + AT FP VILT ASF+ LW +I + + TEARA+YN G A G+
Sbjct: 108 GIRLNG------NVTIKAATIFPQVILTAASFDSKLWYRISKVIGTEARAVYNAGQAEGM 161
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLK 186
TFW+PNIN+ RDPRWGR ET GEDP V +YA+++VRGLQ EG + + D LK
Sbjct: 162 TFWAPNINIFRDPRWGRGQETAGEDPLVSAKYAVSFVRGLQGDSFEGGKLNEDR----LK 217
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
SACCKH+ AYDLDNW+G DRF FD+ VT QD+ +T+ PF C+ +G S +MC+YNRV
Sbjct: 218 ASACCKHFTAYDLDNWKGVDRFDFDANVTLQDLADTYQPPFHSCIVQGRSSGIMCAYNRV 277
Query: 247 NGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
NGIP CAD LL T R WNF+GYI SDC ++ I + + EDAVA VL+AG+D
Sbjct: 278 NGIPNCADYNLLTNTARKKWNFNGYITSDCSAVDIIHDRQGYAK-APEDAVADVLQAGMD 336
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNIC 363
++CGDY+T+ + AV Q K+ + ID +L L+ + +RLG FDG P +Y +G N +C
Sbjct: 337 VECGDYFTSHSKSAVLQKKVPISQIDRALHNLFSIRIRLGLFDGHPTKLKYGKIGPNRVC 396
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT-KAMIGNYEGTPCR 422
+ Q++ +A EAAR GIVLLKN LPL + ++ ++GP+AN++ + ++GNY G PC
Sbjct: 397 SKQNLNIALEAARSGIVLLKNAASILPL-PKSTDSIVVIGPNANSSSQVVLGNYFGRPCN 455
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
+ + GF YS + Y PGC+D ++ I A++ AK D V+V GLD S E+EG
Sbjct: 456 LVTILQGFENYSDNLLYHPGCSDGTKCVSAEIDRAVEVAKVVDYVVLVMGLDQSQESEGH 515
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DR DL LPG Q ELIN VA A+K PV LV+ G VDI+FAK + KI ILW GYPGE G
Sbjct: 516 DRDDLELPGKQQELINSVAKASKRPVILVLFCGGPVDISFAKVDDKIGGILWAGYPGELG 575
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYP 600
G A+A V+FG YNPGGRLP+TWY +++KIP T M +R P + +PGRTY+F+ GP VY
Sbjct: 576 GMALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYPGRTYRFYTGPKVYE 635
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKD 657
FGYGLSY+ + Y S +K + + Y++ L+ ++ CK
Sbjct: 636 FGYGLSYSNYSYNFIS------VKNNNLHINQSTTYSILEKSQTIHYKLVSELGKKACKT 689
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
+ + + N G M G V+++ KP G G +KQ++G+E V + G +VGF ++
Sbjct: 690 MSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVS 749
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGE 743
C+ L + + ++ G + LVGE
Sbjct: 750 VCEHLSRANESGVKVIEEGGYLFLVGE 776
>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
Length = 776
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/778 (45%), Positives = 475/778 (61%), Gaps = 76/778 (9%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
RF + + ++ FPYCDA LPY +R +DLV RMTL EKV +GD A G PR+GLP Y
Sbjct: 51 RFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRYCGGG 110
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
RR ++P V+ A G + M
Sbjct: 111 RRCTACPTSARRDVVWRRRARRHQLPARHQQRRVVQRDAVARHRRRGVDGD------QGM 164
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG+A LT+WSPNINVVRDPRWGR ETPGEDP+VVGRYA+N+VRG+QD++G +
Sbjct: 165 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASA 224
Query: 181 D------SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
SRP+K+S+CCKHYAA
Sbjct: 225 AAATDAFSRPIKVSSCCKHYAA-------------------------------------- 246
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
VMCSYNR+NG+P CAD +LL +T+R DW HGYIVSDCDS++ +V K+L T
Sbjct: 247 ---CVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGV 303
Query: 295 DAVARVLKAGLDLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
+A A +KAGLDLDCG D++T + + AV+QGK+ E+ +D +L LY+ LMRLG+
Sbjct: 304 EATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGF 363
Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP--H 405
FDG P+ ++LG ++C +H ELAA+AARQG+VLLKND LPL+ + ++AL G H
Sbjct: 364 FDGIPELESLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQH 423
Query: 406 ANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNAD 465
NAT M+G+Y G PCR +P DG KV++ A C S A AAK D
Sbjct: 424 INATDVMLGDYRGKPCRVVTPYDGV---RKVVSSTSVHA---CDKGS-CDTAAAAAKTVD 476
Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
AT++VAGL++SVE E DR DLLLP Q IN VA+A+ P+ LVIMSAG VD++FA++
Sbjct: 477 ATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQD 536
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--V 582
NPKI +++W GYPGEEGG AIADV+FGKYNPGGRLP+TWY+ YV KIP TSM LRP
Sbjct: 537 NPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAE 596
Query: 583 NNFPGRTYKFFDGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
+ +PGRTYKF+ G V+YPFG+GLSYT F Y A++ V +K+ + C+ + Y G +
Sbjct: 597 HGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVS 656
Query: 642 KPP-CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYE 699
PP C AV + C++ + +F + V N G DG+ VV +Y+ PP + G KQ++ +
Sbjct: 657 SPPACPAVNVASHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFR 715
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
RV +AAG + +V F +N CK+ IV+ A +++ SG +LVG+ +SFP+Q++L
Sbjct: 716 RVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPVQIDLQ 773
>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
Length = 779
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/744 (42%), Positives = 448/744 (60%), Gaps = 33/744 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
F +C+A LP +RA DLV R+T EKV Q+GD A GVPRLG+P+Y+WWSEALHG++ G+
Sbjct: 48 FAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWWSEALHGLAISGK 107
Query: 72 RTNSPPGTHF-DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
G HF + ATSFP VI T A+F++ LW +IGQ + E RA YNLG A GL
Sbjct: 108 ------GIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLGQAEGLA 161
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSPN+N+ RDPRWGR ETPGEDP +Y +V+GLQ S L+ SA
Sbjct: 162 MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQG---------SSLTNLQTSA 212
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH AYD++ W+G R++F+++VT QD+ +T+ PF CV +G S +MC+Y +NG+
Sbjct: 213 CCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGV 272
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CA LL +T+RG+W GY SDCD++ + +S F T E+AVA LKAGLD++C
Sbjct: 273 PACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTR-TAEEAVAVALKAGLDINC 331
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNP 365
G Y A+QQGK+ E D+D +L+ L+ + MRLG+FDG P+ Y LG ++C P
Sbjct: 332 GVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLGAADVCTP 391
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
H LA EAAR+G+VLLKND LPL + + A++G +AN A++GNY G PC T+
Sbjct: 392 VHKALALEAARRGVVLLKNDARLLPLRAPTVSSAAVIGHNANDILALLGNYYGLPCETTT 451
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P G Y K + PGC+ C + + A AK++D +V GL E EG DR
Sbjct: 452 PFGGIQKYVKSAKFLPGCSSAAC-DVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRT 510
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q LI VA A+K PV L++++ G VDI FA+ NPKI +ILW GYPG+ GG+A
Sbjct: 511 SLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQA 570
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IADV+FG++NP G+LP+TWY + K T M +R P +PGR+Y+F+ G VY FGY
Sbjct: 571 IADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 630
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKF 660
GLSY++F ++ S + + T A +D++ +C+ +F
Sbjct: 631 GLSYSKFACRIVSGAGNS----SSYGKAALAGLRAATTPEGDAVYRVDEIGDDRCERLRF 686
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
+EV+N G MDG V+++ + G ++Q+IG+ + G+ K+ ++ C+
Sbjct: 687 PVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCE 746
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
L ++ G+H ++V E
Sbjct: 747 HLSRARVDGEKVIDRGSHFLMVEE 770
>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
sativa Japonica Group]
gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 853
Score = 640 bits (1650), Expect = e-180, Method: Compositional matrix adjust.
Identities = 317/744 (42%), Positives = 447/744 (60%), Gaps = 33/744 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
F +C+A LP +RA DLV R+T EKV Q+GD A GVPRLG+P+Y+WWSEALHG++ G+
Sbjct: 122 FAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWWSEALHGLAISGK 181
Query: 72 RTNSPPGTHF-DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
G HF + ATSFP VI T A+F++ LW +IGQ + E RA YNLG A GL
Sbjct: 182 ------GIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLGQAEGLA 235
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSPN+N+ RDPRWGR ETPGEDP +Y +V+GLQ S L+ SA
Sbjct: 236 MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQG---------SSLTNLQTSA 286
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH AYD++ W+G R++F+++VT QD+ +T+ PF CV +G S +MC+Y +NG+
Sbjct: 287 CCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGV 346
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CA LL +T+RG+W GY SDCD++ + +S F T E+AVA LKAGLD++C
Sbjct: 347 PACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHF-TRTAEEAVAVALKAGLDINC 405
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNP 365
G Y A+QQGK+ E D+D +L+ L+ + MRLG+FDG P+ Y L ++C P
Sbjct: 406 GVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTP 465
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
H LA EAAR+G+VLLKND LPL + + A++G +AN A++GNY G PC T+
Sbjct: 466 VHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTT 525
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P G Y K + PGC+ C + + A AK++D +V GL E EG DR
Sbjct: 526 PFGGIQKYVKSAKFLPGCSSAAC-DVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRT 584
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q LI VA A+K PV L++++ G VDI FA+ NPKI +ILW GYPG+ GG+A
Sbjct: 585 SLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQA 644
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IADV+FG++NP G+LP+TWY + K T M +R P +PGR+Y+F+ G VY FGY
Sbjct: 645 IADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 704
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKF 660
GLSY++F ++ S + + T A +D++ +C+ +F
Sbjct: 705 GLSYSKFACRIVSGAGNS----SSYGKAALAGLRAATTPEGDAVYRVDEIGDDRCERLRF 760
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
+EV+N G MDG V+++ + G ++Q+IG+ + G+ K+ ++ C+
Sbjct: 761 PVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCE 820
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
L ++ G+H ++V E
Sbjct: 821 HLSRARVDGEKVIDRGSHFLMVEE 844
>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
Length = 779
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 317/744 (42%), Positives = 447/744 (60%), Gaps = 33/744 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
F +C+A LP +RA DLV R+T EKV Q+GD A GVPRLG+P+Y+WWSEALHG++ G+
Sbjct: 48 FAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKWWSEALHGLAISGK 107
Query: 72 RTNSPPGTHF-DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
G HF + ATSFP VI T A+F++ LW +IGQ + E RA YNLG A GL
Sbjct: 108 ------GIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNLGQAEGLA 161
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSPN+N+ RDPRWGR ETPGEDP +Y +V+GLQ S L+ SA
Sbjct: 162 MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQG---------SSLTNLQTSA 212
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH AYD++ W+G R++F+++VT QD+ +T+ PF CV +G S +MC+Y +NG+
Sbjct: 213 CCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGV 272
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CA LL +T+RG+W GY SDCD++ + +S F T E+AVA LKAGLD++C
Sbjct: 273 PACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTR-TAEEAVAVALKAGLDINC 331
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNP 365
G Y A+QQGK+ E D+D +L+ L+ + MRLG+FDG P+ Y L ++C P
Sbjct: 332 GVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTP 391
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
H LA EAAR+G+VLLKND LPL + + A++G +AN A++GNY G PC T+
Sbjct: 392 VHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTT 451
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P G Y K + PGC+ C + + A AK++D +V GL E EG DR
Sbjct: 452 PFGGIQKYVKSAKFLPGCSSAAC-DVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRT 510
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q LI VA A+K PV L++++ G VDI FA+ NPKI +ILW GYPG+ GG+A
Sbjct: 511 SLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQA 570
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IADV+FG++NP G+LP+TWY + K T M +R P +PGR+Y+F+ G VY FGY
Sbjct: 571 IADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 630
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKF 660
GLSY++F ++ S + + T A +D++ +C+ +F
Sbjct: 631 GLSYSKFACRIVSGAGNS----SSYGKAALAGLRAATTPEGDAVYRVDEIGDDRCERLRF 686
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
+EV+N G MDG V+++ + G ++Q+IG+ + G+ K+ ++ C+
Sbjct: 687 PVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCE 746
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
L ++ G+H ++V E
Sbjct: 747 HLSRARVDGEKVIDRGSHFLMVEE 770
>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
Length = 771
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 330/743 (44%), Positives = 458/743 (61%), Gaps = 25/743 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +P+C+A LP+P RA+ LV +TL EK+ Q+ L + R + GV
Sbjct: 36 SAYPFCNATLPFPARARALVSLLTLDEKIAQL--LQHRRGRPPPRRPAL--RVVVGVPST 91
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
T P T V AT FP VIL+ A+FN SLW+ + ++ EARAM+N G AGLT
Sbjct: 92 ASATTGPGSTSPRGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLT 151
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
FW+PNINV RDPRWGR ETPGEDP VV Y++ YV+G Q G E + +SA
Sbjct: 152 FWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MMLSA 204
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKHY AYDL+ W G R+ F+++V QDM++T+ PF+ C+ EG S +MCSYN+VNG+
Sbjct: 205 CCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGV 264
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CA +L Q R +W F GYI SDCD++ I E+ + + ED++A VLKAG+D++C
Sbjct: 265 PACARKDIL-QRARDEWGFQGYITSDCDAVAIIHENQTY-TASDEDSIAVVLKAGMDINC 322
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G + T A+++GK+ E DI+ +L L+ V +RLG+FD + + + LG NN+C +
Sbjct: 323 GSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTE 382
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H ELAAEA RQG VLLKNDNG LPL + +AL+GP AN + G+Y G PC T+
Sbjct: 383 HRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTF 442
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G AY +A GC D+ C + AI+AAK AD V++AGL+L+ E E DRV
Sbjct: 443 VKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVS 502
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q +LI+ VA K PV LV+M G VD++FAK++P+I SILW+GYPGE GG +
Sbjct: 503 LLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVL 562
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
+++FGKYNPGG+LPITWY ++ +P M +R +PGRTY+F+ G VVY FGYG
Sbjct: 563 PEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYG 622
Query: 605 LSYTQFKYKVASSPKSVDIKLDK--DQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKFT 661
LSY+++ Y + +PK + + D R Y T + V ++D+ C+ +F
Sbjct: 623 LSYSKYSYSILQAPKKISLSRSSVPDLISRKPAY---TRRDGVDYVQVEDIASCEALQFP 679
Query: 662 FQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
I V N G MDGS V+++ S P G+ IKQ++G+ERV AAG+S V T++ CK
Sbjct: 680 VHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKL 739
Query: 721 LKIVDNAANSLLASGAHTILVGE 743
+ + +L G H ++VG+
Sbjct: 740 MSFANTEGTRVLFLGTHVLMVGD 762
>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
Length = 777
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 321/744 (43%), Positives = 460/744 (61%), Gaps = 27/744 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +P+C+A LP P+R DLV R+T+ EK+ Q+ + A +PRLG+ YEWWSE LHG+S
Sbjct: 42 SSYPFCNAALPIPQRVNDLVSRLTVDEKILQLVNGAPEIPRLGISAYEWWSEGLHGISRH 101
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-AGL 128
G+ GT F+ + AT FP +ILT +SF+E+LW +I Q + EARA+YN G G+
Sbjct: 102 GK------GTLFNGTIKAATQFPQIILTASSFDENLWYRIAQAIGREARAVYNAGQLKGI 155
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T W+PNIN++RDPRWGR ETPGEDP +VG+Y + YVRGLQ + E + D L+ S
Sbjct: 156 TLWAPNINILRDPRWGRGQETPGEDPMMVGKYGVAYVRGLQG-DSFEGGKLKDGH-LQTS 213
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKH+ A D+DNW R+ FD++V +QD+ +++ PF+ CV +G SSVMC+YN VNG
Sbjct: 214 ACCKHFIAQDMDNWHNFSRYTFDAQVLKQDLADSYEPPFKDCVEQGKASSVMCAYNLVNG 273
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
IP CA+ LL T RG W GYIVSDCD++ + + + EDAVA LKAG+D++
Sbjct: 274 IPNCANFDLLTTTARGKWGLQGYIVSDCDAVDKMYSEQHYAKEP-EDAVAATLKAGMDVN 332
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNP 365
CG + +T A+++ K+ E+DID +L L+ V MRLG F+G P +Y ++ +C+
Sbjct: 333 CGSHLKTYTKSALEKQKVKESDIDRALHNLFSVRMRLGLFNGDPSKLEYGDISAAEVCSE 392
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H LA EAAR G VLLKN N LPL+ +LA++GP AN ++ ++GNYEG C+ +
Sbjct: 393 EHRALAVEAARSGSVLLKNSNRLLPLSKMKTASLAVIGPKANDSEVLLGNYEGFSCKNVT 452
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
G Y Y PGC I C + + I A++ AK AD V+V GLD ++E E DR
Sbjct: 453 LFQGLQGYVANTMYHPGCDFINCTSPA-IDEAVNIAKKADYVVLVMGLDQTLEREKFDRT 511
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+L LPG Q +LI +A+AA PV LV+M G VD+ FAK+NPKI ILWVGYPGE G A
Sbjct: 512 ELGLPGMQEKLITSIAEAASKPVILVLMCGGPVDVTFAKDNPKIGGILWVGYPGEGGAAA 571
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGY 603
+A ++FG++NPGGR P+TWY + K+ M +RP ++ +PGRTY+F++GP V+ FGY
Sbjct: 572 LAQILFGEHNPGGRSPVTWYPKEFNKVAMNDMRMRPESSSGYPGRTYRFYNGPKVFEFGY 631
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK---CKDYKF 660
GLSYT + Y AS K+ + ++ T K + + DV C
Sbjct: 632 GLSYTNYSYTFASVSKNQLL-------FKNPKINQSTEKGSVLNIAVSDVGPEVCNSAMI 684
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
T ++ V+N G+M G V+++ K + K +IG++ V + AG + +V F + C+
Sbjct: 685 TVKVAVKNQGEMAGKHPVLLFLKHSSTVDEVPKKTLIGFKSVNLEAGANTQVTFDVKPCE 744
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
+ ++ G H +L+G+
Sbjct: 745 HFTRANRDGTLVIDEGKHFLLLGD 768
>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
Length = 750
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 327/752 (43%), Positives = 451/752 (59%), Gaps = 46/752 (6%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+CD LP RA DLV R+T+ EKV Q+GD A GVPRLG+P Y+WWSE LHG++F G
Sbjct: 30 YPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVPPYKWWSEGLHGLAFWGH 89
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
G F+ V G TSFP V+LTTASF++ LW +IGQ + EARA+YNLG A GLT
Sbjct: 90 ------GMRFNGTVTGVTSFPQVLLTTASFDDGLWFRIGQAIGREARALYNLGQAEGLTI 143
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP V +YA+ +VRG+Q + PL+ SAC
Sbjct: 144 WSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSS-----AAGAAAPLQASAC 198
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH AYDL++W G R++FD+RVT QD+ +TF PF+ CV +G + VMC+Y +NG+P
Sbjct: 199 CKHATAYDLEDWNGVARYNFDARVTAQDLADTFNPPFQSCVVDGKATCVMCAYTGINGVP 258
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
CA LL +T RG W GY+ SDCD++ + ++ +++ T ED VA LK
Sbjct: 259 ACASSDLLTKTFRGAWGHDGYVSSDCDAVAIMHDAQRYV-PTPEDTVAVALK-------- 309
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQ 366
M A+QQGK+ E D+D +L L+ V MRLG+FDG P+ Y +LG ++C
Sbjct: 310 ----EHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGHLGAADVCTAD 365
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
H LA EAA+ GIVLLKND G LPL+ + + A++G +AN + GNY G C T+P
Sbjct: 366 HKNLALEAAQDGIVLLKNDAGILPLDRSAMGSAAVIGHNANDALVLRGNYFGPACETTTP 425
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
+ G +Y + + GC+ C + A A +++ + GL E EG DR
Sbjct: 426 LQGVQSYVSNVRFLAGCSSAAC-GYAATGQAAALASSSEYVFLFMGLSQDQEKEGLDRTS 484
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LLLPG Q LI VA AAK PV LV+++ G VDI FA++NPKI +ILW GYPG+ GG AI
Sbjct: 485 LLLPGKQQSLITAVASAAKRPVILVLLTGGPVDITFAQSNPKIGAILWAGYPGQAGGLAI 544
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYG 604
A V+FG +NP GRLP+TWY + K+P T M +R P N +PGR+Y+F+ G +Y FGYG
Sbjct: 545 ARVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPANGYPGRSYRFYRGNTIYKFGYG 604
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL--IDDV---KCKDYK 659
LSY++F ++ + K+Q + T K A +DD+ C+ +
Sbjct: 605 LSYSKFSRQLVTG--------GKNQLASLLAGLSATTKDDDATSYYHVDDIGADGCEQLR 656
Query: 660 FTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
F ++EV+N G MDG V+++ + P G + Q+IG+ I AG+ A V F + C
Sbjct: 657 FPAEVEVQNHGPMDGKHSVLMFLRWPNATDGRPVSQLIGFTSQHIKAGEKANVRFDVRPC 716
Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
+ ++ G+H ++VG+ VSF
Sbjct: 717 EHFSRARADGKKVIDRGSHFLMVGKEEVEVSF 748
>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 1026
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 318/620 (51%), Positives = 411/620 (66%), Gaps = 22/620 (3%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +P+CD KLP +RA DL R+T+ EKV +GD++ GVPRLG+P Y+WWSEALHGV+
Sbjct: 34 SSYPFCDRKLPIGQRAADLASRLTVEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVA-- 91
Query: 70 GRRTNSPP---GTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
N+P G FD V ATSFP V++T ASFN LW +IGQ + EAR +YN G
Sbjct: 92 ----NAPADRAGVRFDDGPVRAATSFPQVLVTAASFNPHLWYRIGQVIGREARGIYNSGQ 147
Query: 126 A-GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
A GLTFW+PNINV RDPRWGR ETPGEDP + G+YA +VRG+Q G +S
Sbjct: 148 AEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GYGASGAVNSSG 204
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
L+ SACCKH+ AYDL+NW G RF F+++V+EQD+ +T+ PF CV +G S +MCSYN
Sbjct: 205 LEASACCKHFTAYDLENWNGVTRFAFNAKVSEQDLADTYNPPFRSCVEDGGASGIMCSYN 264
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RVNG+PTCAD LL++T RGDW F+GYI SDCD++ I + + + EDAVA VLKAG
Sbjct: 265 RVNGVPTCADHNLLSKTARGDWRFNGYITSDCDAVAIIHDVQGYAKE-PEDAVADVLKAG 323
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK---NLGKNN 361
+D++CGDY + A QGKI E DID +L+ L+ + MRLG FDG+P+Y N+G +
Sbjct: 324 MDVNCGDYVQKHGVSAFHQGKITEQDIDRALQNLFAIRMRLGLFDGNPKYNRYGNIGADQ 383
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+C +H +LA EAA+ GIVLLKND G LPL I +LA++G +AN + + GNY G PC
Sbjct: 384 VCKKEHQDLALEAAQDGIVLLKNDAGTLPLPKQKISSLAVIGHNANDAQRLQGNYFGPPC 443
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
SP+ Y + + GC VC N S I A AA A+ V+ GLD E E
Sbjct: 444 ISVSPLQALQGYVRETKFVAGCNAAVC-NVSDIAGAAKAASEAEYVVLFMGLDQDQERED 502
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR++L LPG Q L+N VADAAK PV LV++ G VD+ FAK NPKI +I+W GYPG+
Sbjct: 503 LDRIELGLPGMQESLVNAVADAAKKPVVLVLLCGGPVDVTFAKGNPKIGAIIWAGYPGQA 562
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLR--PVNNFPGRTYKFFDGPVV 598
GG AIA V+FG++NPGGRLP+TWY Y + T M +R +PGRTY+F+ G V
Sbjct: 563 GGIAIAQVLFGEHNPGGRLPVTWYPKEYATAVAMTDMRMRADASTGYPGRTYRFYKGKTV 622
Query: 599 YPFGYGLSYTQFKYKVASSP 618
Y FGYGLSY+++ + S P
Sbjct: 623 YNFGYGLSYSKYSHSFVSKP 642
>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
Length = 793
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 331/782 (42%), Positives = 458/782 (58%), Gaps = 67/782 (8%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+CDA L +RA DLV +TL EKV Q+GD A GV RLG+P YEWWSE LHG+S GR
Sbjct: 32 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGR-- 89
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
G F+ V TSFP VILT A+F+ LW+++G+ V EARA+YNLG A GLT WS
Sbjct: 90 ----GIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWS 145
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PN+N+ RDPRWGR ETPGEDP RYA+ +V GLQ + G + SACCK
Sbjct: 146 PNVNIFRDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------------EASACCK 193
Query: 193 HYAAYDLDNWEGNDRFHFDSR----------------------------VTEQDMQETFI 224
H AYDLD W R+++DS+ VT QD+++T+
Sbjct: 194 HATAYDLDYWNNVVRYNYDSKDGASTGKSGETSSQVEKKHGPYEKGYFAVTLQDLEDTYN 253
Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
PF+ CV EG + +MC YN +NG+P CA LL + +R +W +GY+ SDCD++ TI +
Sbjct: 254 PPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRD 313
Query: 285 SHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMR 344
+H + + ED VA +K G+D++CG+Y M AVQ+G + E DID +L L+ V MR
Sbjct: 314 AHHY-TLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMR 372
Query: 345 LGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLA 400
LG+FDG P+ Y +LG ++C+P H LA EAA+ GIVLLKND GALPL + +LA
Sbjct: 373 LGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLA 432
Query: 401 LVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY-SKVINYAPGCADIVCQNNSMIPAAID 459
++GP+A+ A+ GNY G PC T+P+ G Y + GC C + A
Sbjct: 433 VIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVAATN-EAAA 491
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
A ++D V+ GL E +G DR LLLPG Q LI VA+AA+ PV LV+++ G VD
Sbjct: 492 LASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVD 551
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
+ FAK+NPKI +ILW GYPG+ GG AIA V+FG +NP GRLP+TWY + K+P T M +
Sbjct: 552 VTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRM 611
Query: 580 R--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASS---PKSVDIKLDKDQQCRDI 634
R P +PGR+Y+F+ G VY FGYGLSY++F ++ SS + ++ L R
Sbjct: 612 RADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMAR-- 669
Query: 635 NYTVGTNKPPCAAVLIDDV---KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH 691
G + ++ L+ ++ +C F +EV+N G MDG V++Y + P +G
Sbjct: 670 --RAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGR 727
Query: 692 -IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
+Q+IG+ + G+ A V F ++ C+ V ++ GAH ++VG+ SF
Sbjct: 728 PARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFLMVGDEELETSF 787
Query: 751 PL 752
L
Sbjct: 788 GL 789
>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 655
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 317/656 (48%), Positives = 416/656 (63%), Gaps = 27/656 (4%)
Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY--H 177
MYN G AGLTFWSPN+N+ RDPRWGR ETPGEDP V RYA YVRGLQ H
Sbjct: 1 MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH 60
Query: 178 RDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVS 237
R+ LK++ACCKH+ AYDLD W G DRFHF++ V QD+++TF +PF CV +G +
Sbjct: 61 RNR----LKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAA 116
Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV 297
SVMCSYN+VNG+PTCAD L TIRG W GYIVSDCDS+ + T EDA
Sbjct: 117 SVMCSYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTR-TPEDAA 175
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---Y 354
A L+AGLDLDCG + + AV GK+A+AD+D +L V MRLG FDG P +
Sbjct: 176 AATLRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPF 235
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA------LPLNTGNIKTLALVGPHANA 408
LG ++C +H +LA +AARQG+VLLKN GA LPL + +A+VGPHA+A
Sbjct: 236 GRLGPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADA 295
Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
T AMIGNY G PCRYT+P+ G AY+ + + GC D+ C+ N I AA++AA+ ADATV
Sbjct: 296 TVAMIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATV 355
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+VAGLD VEAEG DR LLLPG Q ELI+ VA A+KGPV LV+MS G +DI FA+N+P+
Sbjct: 356 VVAGLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPR 415
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNF 585
I ILWVGYPG+ GG+AIADVIFG +NPG +LP+TWY +Y+ K+P T+M +R P +
Sbjct: 416 IDGILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGY 475
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL--DKDQQCRDINYTVGTNKP 643
PGRTY+F+ GP +YPFG+GLSYTQF + +A +P + ++L + T
Sbjct: 476 PGRTYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLAR 535
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI------AGTHIKQVIG 697
P AV + +C+ ++V N+G DG+ V+VY P A +Q++
Sbjct: 536 PVRAVRVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVA 595
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
+E+V + AG A+V + C L + D + G H +++GE VS ++
Sbjct: 596 FEKVHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGELTHSVSLGVE 651
>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
Length = 738
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 327/744 (43%), Positives = 453/744 (60%), Gaps = 60/744 (8%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +P+C+A LP+P RA+ LV +TL EK+ Q+ + A G PRLG+P +EWWSE+LHGV
Sbjct: 36 SAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGV--- 92
Query: 70 GRRTNSPPGTHFDS-EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
++ PG +F S V AT FP VIL+ A+FN SLW+ + ++ EARAM+N G AGL
Sbjct: 93 ---CDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGL 149
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFW+PNINV RDPRWGR ETPGEDP VV Y++ YV+G Q G E + +S
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MMLS 202
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYDL+ W G R+ F+++V NG
Sbjct: 203 ACCKHYIAYDLEKWRGFTRYTFNAKV--------------------------------NG 230
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P CA +L Q R +W F GYI SDCD++ I E+ + + ED++A VLKAG+D++
Sbjct: 231 VPACARKDIL-QRARDEWGFQGYITSDCDAVAIIHENQTY-TASDEDSIAVVLKAGMDIN 288
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + T A+++GK+ E DI+ +L L+ V +RLG+FD + + + LG NN+C
Sbjct: 289 CGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTT 348
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H ELAAEA RQG VLLKNDNG LPL + +AL+GP AN + G+Y G PC T+
Sbjct: 349 EHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTT 408
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
+ G AY +A GC D+ C + AI+AAK AD V++AGL+L+ E E DRV
Sbjct: 409 FVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRV 468
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q +LI+ VA K PV LV+M G VD++FAK++P+I SILW+GYPGE GG
Sbjct: 469 SLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNV 528
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
+ +++FGKYNPGG+LPITWY ++ +P M +R +PGRTY+F+ G VVY FGY
Sbjct: 529 LPEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGY 588
Query: 604 GLSYTQFKYKVASSPKSVDIKLDK--DQQCRDINYTVGTNKPPCAAVLIDDV-KCKDYKF 660
GLSY+++ Y + +PK + + D R Y T + V ++D+ C+ +F
Sbjct: 589 GLSYSKYSYSILQAPKKISLSRSSVPDLISRKPAY---TRRDGVDYVQVEDIASCEALQF 645
Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
I V N G MDGS V+++ S P G+ IKQ++G+ERV AAG+S V T++ CK
Sbjct: 646 PVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCK 705
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
+ + +L G H ++VG+
Sbjct: 706 LMSFANTEGTRVLFLGTHVLMVGD 729
>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
var. deliciosa]
Length = 634
Score = 623 bits (1607), Expect = e-176, Method: Compositional matrix adjust.
Identities = 313/637 (49%), Positives = 422/637 (66%), Gaps = 29/637 (4%)
Query: 116 EARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
EARAMYN G AGLTFWSPN+N+ RDPRWGR ETPGEDP + G YA +YVRGLQ
Sbjct: 2 EARAMYNGGMAGLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQG----- 56
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
+D LK++ACCKHY AYDLDNW G DRFHF++RV++QD+++TF +PF CV G
Sbjct: 57 ----NDGERLKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQDIKDTFEIPFRECVLGGK 112
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
V+SVMCSYN+VNGIPTCA+PKLL TIRG W +GYIVSDCDS+ E+ + + E+
Sbjct: 113 VASVMCSYNQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCDSVGVFFENQHYTSK-PEE 171
Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--- 352
AVA +KAGLDLDCG + T AV++G +++ +I+ +L MRLG FDG P
Sbjct: 172 AVAAAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTAQMRLGMFDGEPSAH 231
Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
QY NLG ++C P H +LA EAARQGIVLL+N +LPL+ +T+A++GP+++ T M
Sbjct: 232 QYGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTM 291
Query: 413 IGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
IGNY G C YT+P+ G Y++ I+ A GC D+ C N + AA AA+ ADATV+V G
Sbjct: 292 IGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVMG 350
Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
LD S+EAE DR LLPG Q EL+++VA A++GP LV+MS G +D+ FAKN+P+I +I
Sbjct: 351 LDQSIEAEFVDRAGPLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAI 410
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRT 589
+WVGYPG+ GG AIADV+FG NPGG+LP+TWY NYV +P T M +R P +PGRT
Sbjct: 411 IWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRT 470
Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD---INYTVGTNKPPCA 646
Y+F+ GPVV+PFG GLSYT F + +A P V + L + + ++ V + C
Sbjct: 471 YRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKAVRVSHADCN 530
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
A+ DV ++V+N G MDG+ ++V++ PP KQ++G+ ++ IAAG
Sbjct: 531 ALSPLDV---------HVDVKNTGSMDGTHTLLVFTSPPDGKWAASKQLVGFHKIHIAAG 581
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+V ++ CK L +VD + G H + +G+
Sbjct: 582 SETRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 618
>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
Length = 885
Score = 621 bits (1601), Expect = e-175, Method: Compositional matrix adjust.
Identities = 325/663 (49%), Positives = 432/663 (65%), Gaps = 28/663 (4%)
Query: 111 QTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
Q VS E RAMYN G AGLTFWSPN+N+ RDPRWGR ETPGEDP V RYA YVRGLQ
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQ- 285
Query: 171 VEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMC 230
+ S LK++ACCKH+ AYDLDNW G DRFHF++ VT QD+++TF +PF C
Sbjct: 286 ------QQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSC 339
Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
V +G +SVMCSYN+VNG+PTCAD L TIR W GYIVSDCDS+ + S +
Sbjct: 340 VVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVD-VFYSDQHYT 398
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
T+EDAVA L+AGLDLDCG + +T GAV QGK+ + DID ++ V MRLG FDG
Sbjct: 399 RTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDG 458
Query: 351 SPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIK-TLALVGPHA 406
P + +LG ++C H ELA EAARQGIVLLKND ALPL+ + +A+VGPHA
Sbjct: 459 DPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHA 518
Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNAD 465
AT AMIGNY G PCRYT+P+ G Y+ + PGC D+ C + I AA+DAA+ AD
Sbjct: 519 EATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRAD 578
Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
AT++VAGLD +EAEG DR LLLPG Q ELI+ VA A+KGPV LV+MS G +DI FA+N
Sbjct: 579 ATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQN 638
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PV 582
+PKI ILW GYPG+ GG+AIADVIFG +NPGG+LP+TWY +Y+ K+P T+M +R P
Sbjct: 639 DPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPA 698
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
+PGRTY+F+ GP ++PFG+GLSYT F + +A +P + ++L + + N
Sbjct: 699 KGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLAAHHAAASASASASLNA 758
Query: 643 PP----CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--------SKPPGIAGT 690
AAV + +C++ + ++V N+G+ DG+ V+VY ++ G
Sbjct: 759 TARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGA 818
Query: 691 HIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
++Q++ +E+V + AG +A+V ++ C L + D + G H +++GE V+
Sbjct: 819 PVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTI 878
Query: 751 PLQ 753
L+
Sbjct: 879 ALE 881
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 59/115 (51%), Positives = 71/115 (61%), Gaps = 6/115 (5%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ P+C LP RA+DLV RMT EKV+ + + A GVPRLG+ YEWWSEALHGVS
Sbjct: 39 ATLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
G PG F PGAT+FP VI T ASFN +LW+ IGQ S+ + LG
Sbjct: 99 G------PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRSSLSSMDKTLG 147
>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 883
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 324/661 (49%), Positives = 432/661 (65%), Gaps = 26/661 (3%)
Query: 111 QTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
Q VS E RAMYN G AGLTFWSPN+N+ RDPRWGR ETPGEDP V RYA YVRGLQ
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQ- 285
Query: 171 VEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMC 230
+ S LK++ACCKH+ AYDLDNW G DRFHF++ VT QD+++TF +PF C
Sbjct: 286 ------QQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSC 339
Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
V +G +SVMCSYN+VNG+PTCAD L TIR W GYIVSDCDS+ + S +
Sbjct: 340 VVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVD-VFYSDQHYT 398
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
T+EDAVA L+AGLDLDCG + +T GAV QGK+ + DID ++ V MRLG FDG
Sbjct: 399 RTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDG 458
Query: 351 SPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIK-TLALVGPHA 406
P + +LG ++C H ELA EAARQGIVLLKND ALPL+ + +A+VGPHA
Sbjct: 459 DPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHA 518
Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNAD 465
AT AMIGNY G PCRYT+P+ G Y+ + PGC D+ C + I AA+DAA+ AD
Sbjct: 519 EATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRAD 578
Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
AT++VAGLD +EAEG DR LLLPG Q ELI+ VA A+KGPV LV+MS G +DI FA+N
Sbjct: 579 ATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQN 638
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PV 582
+PKI ILW GYPG+ GG+AIADVIFG +NPGG+LP+TWY +Y+ K+P T+M +R P
Sbjct: 639 DPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPA 698
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
+PGRTY+F+ GP ++PFG+GLSYT F + +A +P + ++L + ++
Sbjct: 699 KGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASASASLNATA 758
Query: 643 --PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--------SKPPGIAGTHI 692
AAV + +C++ + ++V N+G+ DG+ V+VY ++ G +
Sbjct: 759 RLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGAPV 818
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPL 752
+Q++ +E+V + AG +A+V ++ C L + D + G H +++GE V+ L
Sbjct: 819 RQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGELTHTVTIAL 878
Query: 753 Q 753
+
Sbjct: 879 E 879
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/115 (50%), Positives = 71/115 (61%), Gaps = 6/115 (5%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ P+C LP RA+DLV R+T EKV+ + + A GVPRLG+ YEWWSEALHGVS
Sbjct: 39 ATLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
G PG F PGAT+FP VI T ASFN +LW+ IGQ S+ + LG
Sbjct: 99 G------PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRSSLSSMDKTLG 147
>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 464
Score = 603 bits (1556), Expect = e-170, Method: Compositional matrix adjust.
Identities = 281/451 (62%), Positives = 350/451 (77%), Gaps = 3/451 (0%)
Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
MYNLG+AGLTFWSPNINVVRD RWGR ET EDP++VG +A+NYVRGLQDVEG E D
Sbjct: 1 MYNLGHAGLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQDVEGTENVTD 60
Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
+SRPLK+S+CCKHYAAYD+D+W DR FD+RV+EQDM+ETF+ PFE CV EGDVSSV
Sbjct: 61 LNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 120
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MCS+N++NGIP C+DP+LL IR +W+ HGYIVSDC ++ IV++ +LND+K DAVA+
Sbjct: 121 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 180
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK 359
L+AGLDL+CG YYT+ V GK+++ ++D +L+ +Y++LMR+GYFDG P Y++LG
Sbjct: 181 TLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGL 240
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+IC HIELA EAARQGIVLLKND PL G K LALVGPHANAT+ MIGNY G
Sbjct: 241 KDICAADHIELAREAARQGIVLLKNDYEVFPLKPG--KKLALVGPHANATEVMIGNYAGL 298
Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
P +Y SP++ F A V Y GC D C N++ A +AAK+A+ T+I G DLS+EA
Sbjct: 299 PRKYVSPLEAFSAIGNV-TYTTGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSIEA 357
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E DRVD LLPG QTELI +VA+ + GPV LV++S +DI FAKNNP+I +ILWVG+PG
Sbjct: 358 EFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPG 417
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV 570
E+GG AIADV+FGKYNPGGRLP+TWYEA+YV
Sbjct: 418 EQGGHAIADVVFGKYNPGGRLPVTWYEADYV 448
>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
Length = 753
Score = 603 bits (1556), Expect = e-170, Method: Compositional matrix adjust.
Identities = 320/754 (42%), Positives = 448/754 (59%), Gaps = 50/754 (6%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+CDA L +RA DLV +TL EKV Q+GD A GV RLG+P YEWWSE LHG+S GR
Sbjct: 31 FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGR-- 88
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
G F+ V TSFP VILT A+F+ LW+++G+ V EARA+YNLG A GLT WS
Sbjct: 89 ----GIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWS 144
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PN+N+ RDP R PG+ R + G Q + G + SACCK
Sbjct: 145 PNVNIFRDPSGTR----PGD-----ARRGPRH--GEQGIGG------------EASACCK 181
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H AYDLD W R+++DS+VT QD+++T+ PF+ CV EG + +MC YN +NG+P C
Sbjct: 182 HATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPAC 241
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
A LL + +R +W +GY+ SDCD++ TI ++H + + ED VA +K G+D++CG+Y
Sbjct: 242 ASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHY-TLSPEDTVAVSIKVGMDVNCGNY 300
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHI 368
M AVQ+G + E DID +L L+ V MRLG+FDG P+ Y +LG ++C+P H
Sbjct: 301 TQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHK 360
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
LA EAA+ GIVLLKND GALPL + +LA++GP+A+ A+ GNY G PC T+P+
Sbjct: 361 SLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQ 420
Query: 429 GFYAY-SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G Y + GC C ++ AA A ++D V+ GL E +G DR L
Sbjct: 421 GIKGYLGDRARFLAGCDSPACAVDATNEAAA-LASSSDHVVLFMGLSQKQEQDGLDRTSL 479
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LLPG Q LI VA+AA+ PV LV+++ G VD+ FAK+NPKI +ILW GYPG+ GG AIA
Sbjct: 480 LLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIA 539
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGL 605
V+FG +NP GRLP+TWY + K+P T M +R P +PGR+Y+F+ G VY FGYGL
Sbjct: 540 KVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGL 599
Query: 606 SYTQFKYKVASS---PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYK 659
SY++F ++ SS + ++ L R G + ++ L+ ++ +C
Sbjct: 600 SYSKFSRRMFSSFSTSNAGNLSLLAGVMAR----RAGDDGGGMSSYLVKEIGVERCSRLV 655
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNAC 718
F +EV+N G MDG V++Y + P +G +Q+IG+ + G+ A V F ++ C
Sbjct: 656 FPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPC 715
Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSFPL 752
+ V ++ GAH ++VG+ SF L
Sbjct: 716 EHFSWVGEDGERVIDGGAHFLMVGDEELETSFGL 749
>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 613
Score = 603 bits (1554), Expect = e-169, Method: Compositional matrix adjust.
Identities = 304/554 (54%), Positives = 392/554 (70%), Gaps = 25/554 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
++ + +CD L R KDLV R+TL EK+ + + A V RLG+P YEWWSEALHGVS
Sbjct: 40 VAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLGIPRYEWWSEALHGVSN 99
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G GT F + VPGATSFP ILT ASFN SL++ IG+ VSTEA AMYN+G AGL
Sbjct: 100 VGL------GTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAGAMYNVGLAGL 153
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
T+WSPNIN+ RDPRWGR LETPGEDP + +YA YV+GLQ +G D LK++
Sbjct: 154 TYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDG------GDPNKLKVA 207
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+D W+G R+ F++ +T+QD+++TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 208 ACCKHYTAYDVDKWKGIQRYTFNAVLTKQDLEDTFQPPFKSCVIDGNVASVMCSYNKVNG 267
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCADP LL +RG+W +GY+VSDCDS++ + + + T E+A A + AGLDL+
Sbjct: 268 KPTCADPDLLKGVVRGEWKLNGYMVSDCDSVEVLYKYQHY-TKTPEEAAAISILAGLDLN 326
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG + +T GAV+QG I E+ I+ ++ + LMRLG+FDG P+ Y NLG ++C P
Sbjct: 327 CGRFLGQYTEGAVKQGLIDES-INNAVSNNFATLMRLGFFDGDPRKQPYGNLGPKDVCTP 385
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA EAARQGIV LKN +LPLN IK+LA++GP+ANAT+ MIGNYEG PC+Y S
Sbjct: 386 ANQELAREAARQGIVSLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYIS 445
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAK---NADATVIVAGLDLSVEAEGK 482
P+ G A+ +YA GC D+ C N P DA K + DATVIV G L++EAE
Sbjct: 446 PLQGLTAFVPT-SYAAGCLDVRCPN----PVLDDAKKISASGDATVIVVGASLAIEAESL 500
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DRV++LLPG Q L+ +VA+A+KGPV LVIMS G +D++FAK+N KI SILWVGYPGE G
Sbjct: 501 DRVNILLPGQQQLLVTEVANASKGPVILVIMSGGGMDVSFAKDNNKITSILWVGYPGEAG 560
Query: 543 GRAIADVIFGKYNP 556
G AIADVIFG +NP
Sbjct: 561 GAAIADVIFGFHNP 574
>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
Length = 633
Score = 597 bits (1540), Expect = e-168, Method: Compositional matrix adjust.
Identities = 298/642 (46%), Positives = 423/642 (65%), Gaps = 26/642 (4%)
Query: 109 IGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGL 168
IG+ VSTE RAMYN+G AGLT+WSPN+N+ RDPRWGR ET GEDP + RY + YV+GL
Sbjct: 2 IGKVVSTEGRAMYNVGQAGLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKGL 61
Query: 169 QDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFE 228
Q + D LK+++CCKHY AYD+D+W+G R++F+++VT+QD+ +TF PF+
Sbjct: 62 QQRD------DGKKDMLKVASCCKHYTAYDVDDWKGIQRYNFNAKVTQQDLDDTFNPPFK 115
Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
CV +G+V+SVMCSYN+V+G PTC D LL IRG W +GYIV+DCDS+ + + +
Sbjct: 116 SCVLDGNVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNGYIVTDCDSLNEMYWAQHY 175
Query: 289 LNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
T E+ A L AGL L+CG + +T GAV QG + E+ ID ++ + LMRLG+F
Sbjct: 176 -TKTPEETAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVIDRAVTNNFATLMRLGFF 234
Query: 349 DGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPH 405
DG+P+ Y NLG +IC H ELA EAARQGIVLLKN G+LPL+ +IK+LA++GP+
Sbjct: 235 DGNPKNQLYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGSLPLSPKSIKSLAVIGPN 294
Query: 406 ANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNAD 465
AN M+G+YEG+PC+YT+P+DG A + Y GC DI C + + A A AD
Sbjct: 295 ANLAYTMVGSYEGSPCKYTTPLDGLGASVSTV-YQQGC-DIACAT-AQVDNAKKVAAAAD 351
Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
A V+V G D ++E E KDR ++ LPG Q+ L+ +VA +KGPV LVIMS G +D+ FA +
Sbjct: 352 AVVLVMGSDQTIERESKDRFNITLPGQQSLLVTEVASVSKGPVILVIMSGGGMDVKFAVD 411
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PV 582
NPK+ SILWVG+PGE GG A+ADV+FG +NPGGRLP+TWY +YV K+ T+M +R P
Sbjct: 412 NPKVTSILWVGFPGEAGGAALADVVFGYHNPGGRLPMTWYPQSYVDKVDMTNMNMRADPK 471
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
FPGR+Y+F+ GP V+ FG GLSYTQ+K+ + +PK V I L++ CR
Sbjct: 472 TGFPGRSYRFYKGPTVFNFGDGLSYTQYKHHLVKAPKFVSIPLEEGHACRSTK------- 524
Query: 643 PPCAAV-LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERV 701
C ++ +++ C + ++V+N+GKM GS V++++ PP + K ++ ++++
Sbjct: 525 --CKSIDAVNEQGCNNLGLDIHLKVQNVGKMRGSHTVLLFTSPPSVHNAPQKHLLDFQKI 582
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+ V F ++ CK L +VD N +A G H + +G+
Sbjct: 583 HLTPQSEGVVKFNLDVCKHLSVVDEVGNRKVALGLHVLHIGD 624
>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 685
Score = 588 bits (1517), Expect = e-165, Method: Compositional matrix adjust.
Identities = 306/677 (45%), Positives = 423/677 (62%), Gaps = 22/677 (3%)
Query: 78 GTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWSPNIN 136
G + +P ATSFP VILT ASF+ LW +I + + TEAR +YN G A G+ FW+PNIN
Sbjct: 2 GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61
Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
+ RDPRWGR ET GEDP V +Y ++YVRGLQ + E + R LK SACCKH+ A
Sbjct: 62 IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQG-DSFEGGKLIGGR-LKASACCKHFTA 119
Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
YDL+NW+G +R+ FD++VT QD+ +T+ F CV +G S +MC+YNRVNG+P CAD
Sbjct: 120 YDLENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYN 179
Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
LL T R WNF+GYI SDCD+++ I E + T ED VA VL+AG+DL+CG+Y T
Sbjct: 180 LLTNTARKKWNFNGYIASDCDAVRFIYEKQGYAK-TPEDVVADVLRAGMDLECGNYMTKH 238
Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIELAAE 373
AV Q KI + ID +L L+ + +RLG FDG+P QY +G N +C+ ++++LA E
Sbjct: 239 AKSAVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALE 298
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK-AMIGNYEGTPCRYTSPMDGFYA 432
AAR GIVLLKN LPL + TL ++GP+AN + ++GNY G PC+ S + GFY
Sbjct: 299 AARSGIVLLKNTASILPLP--RVNTLGVIGPNANKSSIVLLGNYIGPPCKNVSILKGFYT 356
Query: 433 YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
Y+ +Y GC D ++ I A++ AK +D ++V GLD S E E DR L LPG
Sbjct: 357 YASQTHYHSGCTDGTKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGK 416
Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
Q +LIN VA A+K PV LV++ G VDI FAKNN KI I+W GYPGE GGRA+A V+FG
Sbjct: 417 QQKLINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFG 476
Query: 553 KYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQF 610
YNPGGRLP+TWY +++KIP T M +R P + +PGRTY+F+ GP VY FGYGLSY+ +
Sbjct: 477 DYNPGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNY 536
Query: 611 KYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTFQIEVE 667
Y + +K + + Y++ N L+ ++ CK + + +
Sbjct: 537 SYNF------ISVKNNNLHINQSTTYSILENSETINYKLVSELGEETCKTMSISVTLGIT 590
Query: 668 NMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDN 726
N G M G V+++ KP G G +KQ++G+E V + G +VGF ++ C+ L +
Sbjct: 591 NTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANE 650
Query: 727 AANSLLASGAHTILVGE 743
+ ++ G + LVG+
Sbjct: 651 SGVKVIEEGGYLFLVGQ 667
>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
Length = 900
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 323/760 (42%), Positives = 453/760 (59%), Gaps = 57/760 (7%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
P+C+ L Y +R +DL+ R+ + + + A GV L LP Y+WWSEALHGV
Sbjct: 182 LPFCNTALSYDDRIRDLISRINDSDLPGLLVNSATGVEHLNLPAYQWWSEALHGVGH--- 238
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PG HF +VP ATSFP VI T A+FN++L++KIG +STEARAM N+ AG TFW
Sbjct: 239 ----SPGVHFGGDVPAATSFPQVIHTGATFNKTLYRKIGTVISTEARAMNNVQRAGNTFW 294
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
+PNIN++RDPRWGR ETPGEDP+ G YA N+V G QD E + Y +K S+CC
Sbjct: 295 APNINIIRDPRWGRGQETPGEDPFATGEYAANFVSGFQDGEDMNY--------IKASSCC 346
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+ Y+L+NW G DR H+++ T+QD+ +T++ FE CV G S +MCSYN VNG+P+
Sbjct: 347 KHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVRYGRASGLMCSYNAVNGVPS 406
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
CA+ ++ R W F GYI SDC ++ ++ SHKF +T E + VL+AG+D DCG
Sbjct: 407 CANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNTSE-TIRAVLEAGMDTDCGS 465
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIE 369
+ + A+Q+G + ++T+L L++V RLG FD Y N + P + +
Sbjct: 466 FVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLFDPVSKQPYTNYSVARVNTPANQQ 525
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
LA EAA+QGIVLLKN N LPL TG +AL+GP+A+AT M GNY+GT SP+ G
Sbjct: 526 LALEAAQQGIVLLKNTNARLPLKTG--LHVALIGPNADATTVMQGNYQGTAPFLISPVRG 583
Query: 430 FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
F YS + YA GC D+ C++ S AA+ AAK ADA V+V GLD E+EG DR + L
Sbjct: 584 FKNYSAAVTYAKGC-DVACKDTSGFDAAVAAAKEADAVVVVVGLDQGQESEGHDRTSITL 642
Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
PG Q +L+ +VA AAK P+ + +M+ GAVD++ K N + ILW GYPG+ GG+A+ADV
Sbjct: 643 PGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVAGILWCGYPGQSGGQAMADV 702
Query: 550 IFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGLS 606
+FG +PGGRLP T Y +YV +RP + PGRTY+F+ G VY +G GLS
Sbjct: 703 VFGAVSPGGRLPYTIYPGSYVDACSMLDNGMRPNKTSGNPGRTYRFYTGKPVYEYGTGLS 762
Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT----- 661
YT F Y + ++D L Q + D K +++KF
Sbjct: 763 YTSFSYHI-HYLNTMDTSLATVQ------------------TYVQDAK-QNHKFIRYDAP 802
Query: 662 ----FQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
++ V N+G++ G++VV V+ +P P G IK +IG+ERVF+ GQ V F++
Sbjct: 803 EFTRVEVNVTNVGRVAGADVVQVFVEPKTPAELGAPIKTLIGFERVFLNPGQWTIVQFSV 862
Query: 716 NACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLN 755
NA L VD + + +G + +G ++FP+ +N
Sbjct: 863 NA-HDLTFVDASGKRVARAGEWLVHIGHD-SRLTFPVHVN 900
>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
Length = 776
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 307/757 (40%), Positives = 439/757 (57%), Gaps = 50/757 (6%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +P+C+ LP R L+ +TL +K+ Q+ + A + LG+P Y+WWSEALHG++
Sbjct: 37 SHYPFCNISLPISTRTTSLISLLTLSDKINQLSNTASSISHLGIPSYQWWSEALHGIATN 96
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G PG +F+ V AT+FP VI++ A+FN SLW IG V E RAM+N+G AGL+
Sbjct: 97 G------PGVNFNGSVKSATNFPQVIVSAAAFNRSLWFLIGYAVGVEGRAMFNVGQAGLS 150
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY---HRDSDSRPLK 186
FW+PN+NV RDPRWGR ETPGEDP V YA+ +VRG+Q V+G++ DSD L
Sbjct: 151 FWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVEFVRGIQGVDGIKKVLNDHDSDDDGLM 210
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
+SACCKH+ AYDL+ W R++F++ V T+ PF CV +G S +MCSYN V
Sbjct: 211 VSACCKHFTAYDLEKWGEFSRYNFNAVVN------TYQPPFRGCVQQGKASCLMCSYNEV 264
Query: 247 NGIPTCADPKLLNQTIRGDWNFHGY-IVSDCDSIQTIVESHKFLNDTKEDAVARVLKA-- 303
NG+P CA LL +R W F G I+ + + S K + + + + LK
Sbjct: 265 NGVPACASKDLLG-LVRNKWGFEGVGILPQTVMLWLLFLSIKSMQNLPKMLLLMFLKQVF 323
Query: 304 ---------GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ- 353
+D++CG + T A++QG + E D+D +L L+ V MRLG F+G P+
Sbjct: 324 FYVFENLWFCMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPEK 383
Query: 354 --YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
+ LG ++C P+H +LA EAARQGIVLLKNDN LPL+ + +LA++GP A T
Sbjct: 384 GKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMA-TTSE 442
Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
+ G Y G PC S DG Y K I+YA GC+D+ C ++ AID AK AD VIVA
Sbjct: 443 LGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIVA 502
Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
GLD ++E E DRV LLLPG Q +L+++VA A+K PV LV+ G +D++FA++N I S
Sbjct: 503 GLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLITS 562
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRT 589
ILW+GYP + ++ GRLP+TWY ++ +P M +R P +PGRT
Sbjct: 563 ILWIGYPVD-------------FDAAGRLPMTWYPESFTNVPMNDMGMRADPSRGYPGRT 609
Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI-KLDKDQQCRDINYTVGTNKPPCAAV 648
Y+F+ G +Y FG+GLSY+ F Y+V S+P + + K R + V + V
Sbjct: 610 YRFYTGSRIYGFGHGLSYSDFSYRVLSAPSKLSLSKTTNGGLRRSLLNKVEKDVFEVDHV 669
Query: 649 LIDDVK-CKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAG 706
+D+++ C F+ I V N+G MDGS VVM++SK P I G+ Q++G R+ +
Sbjct: 670 HVDELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNIQGSPESQLVGPSRLHTVSN 729
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+S + + C+ D +L G H + VG+
Sbjct: 730 KSIETSILADPCEHFSFADEQGKRILPLGNHILNVGD 766
>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
Length = 751
Score = 568 bits (1464), Expect = e-159, Method: Compositional matrix adjust.
Identities = 304/748 (40%), Positives = 441/748 (58%), Gaps = 73/748 (9%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
K+S P+CD LP R DLV R+ L + V + + A P + +P YEWW+EALHGV+
Sbjct: 28 KVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWNEALHGVA 87
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
PG F + ATSFP V+ T ASFN +L+ +I + +STEARA YN NAG
Sbjct: 88 L-------SPGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNAG 140
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS-DSRPLK 186
LTFW+PN+N+ RDPRWGR ETPGEDPY+ G YA+ +VRGLQ E +E H + D++ LK
Sbjct: 141 LTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQG-EAMEGHENKDDNKFLK 199
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
IS+CCKH++AY + R D+ VT+QD +T+ FE CV G VSS+MCSYN V
Sbjct: 200 ISSCCKHFSAYS----QEVPRHRNDAIVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAV 255
Query: 247 NGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
NGIP+CAD LL +R W F GYI SDC+++ ++ H F + E A L AG+D
Sbjct: 256 NGIPSCADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHF-TQSPEQTCATTLDAGMD 314
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD-GSPQYKNLGKNNICNP 365
L+CG++ A++QG ++ + +L+ + V+MRLG F+ G+ + N+ K+ +
Sbjct: 315 LNCGEFLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMFEKGTQPFSNITKDAVDTA 374
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIK---TLALVGPHANATKAMIGNYEGTPCR 422
H +LA EAARQ +VLLKN++ LPL T +LAL+GPH NA+ A++GNY G P
Sbjct: 375 AHRQLALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIGPHFNASTALLGNYFGIPSH 434
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIP---AAIDAAKNADATVIVAGLDLSVEA 479
+P+ G +Y + Y+ GC + ++P AI+ K AD V+ GLD S E
Sbjct: 435 IVTPLKGVSSYVPNVAYSLGCK----VSGEVLPDFDEAIEVVKKADRVVVFMGLDQSQER 490
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E DR L LPGFQ L+N++ AA P+ LV++S G+VD++ KN+PK+ +I++ GY G
Sbjct: 491 EEIDRYHLKLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLYKNHPKVGAIVFGGYLG 550
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFFDGP 596
+ GG+A+AD++FGKY+P GRL T+Y+++YV +P M +RP V PGRTY+FF G
Sbjct: 551 QAGGQALADMLFGKYSPAGRLTQTFYDSDYVNTMPIYDMHMRPTFVTGNPGRTYRFFSGA 610
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
VY FG+GLSYT F + CR C A
Sbjct: 611 PVYEFGFGLSYTTFH-----------------KACRS-----------CVA--------- 633
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV-FIAAGQSAKVGF 713
+F+I V N+G ++G + +++Y++PP G G ++ ++ +ER + G++A F
Sbjct: 634 ----SFEITVTNLGDVEGEDAILIYAEPPHAGEGGRPLRSLVAFERTALVTTGKTATADF 689
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILV 741
+ A K+ + + + ++ G TI V
Sbjct: 690 CLEA-KAFALANAEGSWVVEQGNWTIHV 716
>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 694
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 294/648 (45%), Positives = 422/648 (65%), Gaps = 40/648 (6%)
Query: 104 SLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAIN 163
+L +K+G V+ + A+ LG +WS ETPGEDP + +YA+
Sbjct: 70 TLAEKVGFLVNKQP-ALGRLGIPAYEWWS---------------ETPGEDPLLASKYAVG 113
Query: 164 YVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETF 223
YV GLQD G D LK++ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF
Sbjct: 114 YVTGLQDA-GAGGVTDG---ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTF 169
Query: 224 ILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV 283
PF+ CV +G+V+SVMCSYN+VNG PTCAD LL IRGDW +GYIVSDCDS+ ++
Sbjct: 170 QPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVD-VL 228
Query: 284 ESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
+ + T E+A A +K+GLDL+CG++ T+ AVQ G+++E D+D ++ +I+LM
Sbjct: 229 YTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLM 288
Query: 344 RLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLA 400
RLG+FDG P+ + +LG ++C + ELA E ARQGIVLLKN +GALPL+ +IK++A
Sbjct: 289 RLGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMA 347
Query: 401 LVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAID 459
++GP+ANA+ MIGNYEGTPC+YT+P+ G A + Y PGC ++ C NS+ + A+
Sbjct: 348 VIGPNANASFTMIGNYEGTPCKYTTPLQGLGAKVNTV-YQPGCTNVGCSGNSLQLSTAVA 406
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
AA +AD TV+V G D S+E E DR LLLPG QT+L++ VA+A+ GPV LV+MS G D
Sbjct: 407 AAASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFD 466
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMP 578
I+FAK + KI +ILWVGYPGE GG A+AD++FG +NP GRLP+TWY A+Y + T M
Sbjct: 467 ISFAKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMR 526
Query: 579 LRP--VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS-VDIKLDKDQQCRDIN 635
+RP +PGRTY+F+ G V+ FG GLSYT+ + + S+P S V ++L +D CR
Sbjct: 527 MRPDTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR--- 583
Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQV 695
CA+V C D F +++V N G++ G+ V+++S PP K +
Sbjct: 584 ------AEECASVEAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAPAKHL 637
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+G+E+V +A G++ V F ++ C+ L +VD +A G HT+ VG+
Sbjct: 638 LGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGD 685
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 34/53 (64%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSE 61
L+ + +C+ K RA+DLV R+TL EKV + + + RLG+P YEWWSE
Sbjct: 46 LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWWSE 98
>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
gi|223944757|gb|ACN26462.1| unknown [Zea mays]
Length = 630
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 281/633 (44%), Positives = 407/633 (64%), Gaps = 21/633 (3%)
Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
M+N G AGLT+W+PNIN+ RDPRWGR ET GEDP V Y++ YV+G Q EG E
Sbjct: 1 MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEEGEEGR-- 58
Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
+++SACCKHY AYD++ WEG R+ F+++V QD+++T+ PF+ C+ E S +
Sbjct: 59 -----IRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCL 113
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MC+YN+VNG+P CA LL +T R +W F GYI SDCD++ I E+ + + ED++A
Sbjct: 114 MCAYNQVNGVPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSGEDSIAI 171
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKN 356
VLKAG+D++CG + T A+++GKI E DID +L L+ V +RLG FD + +
Sbjct: 172 VLKAGMDINCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQ 231
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
LG N++C +H ELAAEA RQG VLLKND+ LPL ++ +A++GP AN AM G+Y
Sbjct: 232 LGPNSVCTKEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDY 291
Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
G PC T+ + G AY+ ++APGC D C + + A++AAK AD V++AGL+L+
Sbjct: 292 TGVPCNPTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLT 351
Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
E E DRV LLLPG Q LI+ +A AK P+ LV++ G VD++FAK +P+I SILW+G
Sbjct: 352 EEREDFDRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLG 411
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFD 594
YPGE GG+ + +++FG+YNPGG+LPITWY ++ IP T M +R P +PGRTY+F+
Sbjct: 412 YPGEVGGQVLPEILFGEYNPGGKLPITWYPESFTAIPMTDMNMRADPSRGYPGRTYRFYT 471
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQ--CRDINYTVGTNKPPCAAVLIDD 652
G VVY FGYGLSY+++ Y ++S+PK + + D R Y T + +V +D
Sbjct: 472 GDVVYGFGYGLSYSKYSYSISSAPKKITVSRSSDLGIISRKPAY---TRRDGLGSVKTED 528
Query: 653 V-KCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAK 710
+ C+ F+ + V N G MDGS V+++++ + G IKQ++G+E V AAG ++
Sbjct: 529 IASCEALVFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASN 588
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
V T++ CK + + +L GAH + VG+
Sbjct: 589 VEITVDPCKQMSAANPEGKRVLLLGAHVLTVGD 621
>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
queenslandica]
Length = 728
Score = 556 bits (1432), Expect = e-155, Method: Compositional matrix adjust.
Identities = 302/745 (40%), Positives = 426/745 (57%), Gaps = 58/745 (7%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHG 65
K + + YCD PER DL+ RMT+ +K+ Q+ A +P L +P Y+WWSE LHG
Sbjct: 23 KAPFNTYKYCDYTQSIPERVNDLLSRMTILDKIPQLITSAPAIPSLDIPAYQWWSEGLHG 82
Query: 66 VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
V+ PG HF P ATSFP VI A+FN SL + Q +STEARA N G
Sbjct: 83 VA-------GSPGVHFGGNFPNATSFPQVIGLGATFNMSLVLAMAQVISTEARAFANGGQ 135
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
AGLT+++PNIN+ RDPRWGR ETPGEDPY+ +YA N+V+G+Q EG + D+R L
Sbjct: 136 AGLTYFAPNINIFRDPRWGRGQETPGEDPYLSSQYAANFVKGMQ--EGAD-----DTRYL 188
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K A CKHYAAYDL+N+ R F++ V++QD +ET+ F CV EG V S+MCSYN
Sbjct: 189 KTIATCKHYAAYDLENYLNLSRHTFNAIVSDQDFEETYFPAFRSCVEEGKVGSIMCSYNA 248
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG+P+CA+ + N+ RG W F GY+VSDC +I I+ SHK+ ++T +D VA L+ G
Sbjct: 249 VNGVPSCANDFINNEVARGKWGFEGYVVSDCGAISDIINSHKYTSNT-DDTVAAGLRGGC 307
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
DL+CG +Y++ A G I + DID ++ L+ MRLG FD +++ + +
Sbjct: 308 DLNCGHFYSDHAQAAYDNGAITDDDIDRAMTRLFTYRMRLGMFDPPSMQPFRDYTNDKVD 367
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
QH LA +A+R+ IVLL+N+ LPL+ + +ALVGPH A AM GNY+GT
Sbjct: 368 TKQHEALALDASRESIVLLQNNKDILPLSLTTHRKIALVGPHGQAQGAMQGNYKGTAPYL 427
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAK--NADATVIVAGLDLSVEAEG 481
SPM G + +A GC + C + + + +A + V GLD S E+EG
Sbjct: 428 ISPMQGLQDLGLSVTFAAGCTQVACPTIAGFSEVTKLVEEHSIEAIIAVIGLDESQESEG 487
Query: 482 KDRVDLLLPGFQTELINKVADAAKG--PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
DR L LPG Q +L+ + A P +V+MS G VD++ K+ +ILW GYPG
Sbjct: 488 HDRTSLTLPGQQVQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGVKD--IADAILWAGYPG 545
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
+ GG+AIA+VI+GK NP GRLP+T+Y A+Y+ +IPYT+M +R PGR+YKF+ G V
Sbjct: 546 QSGGQAIAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRVP---PGRSYKFYTGTPV 602
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
+PFG+GLSYT F+ K + P +K D D+NY
Sbjct: 603 FPFGFGLSYTTFEMKWKNPPNVTHLKTTHD---VDVNY---------------------- 637
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
++ V N GK GS V+ Y + G +K++ G++++++ QS + F
Sbjct: 638 ----EVVVTNAGKRSGSVSVLAYITST-VPGAPMKELFGFQKIYLKPEQSMTLSFVAEP- 691
Query: 719 KSLKIVDNAANSLLASGAHTILVGE 743
K VD + G + I +G+
Sbjct: 692 KVFTTVDKHGERKIRPGTYKITIGD 716
>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
Length = 1620
Score = 544 bits (1402), Expect = e-152, Method: Compositional matrix adjust.
Identities = 300/738 (40%), Positives = 429/738 (58%), Gaps = 60/738 (8%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+FP+C+A L R +D++ R+++ +KV + A GLP Y+WWSEALHGV F
Sbjct: 923 NFPFCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADAGLPAYQWWSEALHGVGF-- 980
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PG F +V ATSFP VI T+ASFN++LW IG T+STEARAM N+ AGLTF
Sbjct: 981 -----SPGVTFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEARAMNNVNQAGLTF 1035
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PNIN++RDPRWGR ETPGEDPY G YA N+V G+Q+ E D+R +K S+C
Sbjct: 1036 WAPNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQEGE--------DTRYIKASSC 1087
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ Y+L++W DR HF++ T+QD+ +T++ FE CV G SS+MCSYN VNG+P
Sbjct: 1088 CKHFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFESCVRFGRASSLMCSYNAVNGVP 1147
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
+CA+ ++ R W F GYI SDC +++ + +HK+ N T V VL AG+D+DCG
Sbjct: 1148 SCANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYYNTTGA-TVNGVLSAGMDVDCG 1206
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHI 368
+ + A+ G + A +D +L L+ V RLG FD + Y NL + + P+H
Sbjct: 1207 SFLSQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMFDPAEDQPYLNLTTDAVNTPEHQ 1266
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+LA EAARQG+ LL+N + LPL+ +IK LAL+GP+ANAT M GNY G SP
Sbjct: 1267 QLALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNANATGVMQGNYNGKAPFLISPQQ 1326
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
G Y N ++ A+ AAK AD V+V GLD + E+EG DR +
Sbjct: 1327 GVQQY--------------VSNVALELGAVTAAKAADTVVMVIGLDQTQESEGHDREIIA 1372
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
LPG Q EL+ +VA+A+ P+ +V+M+ GAVD+ K+ + G+ GG+A+A+
Sbjct: 1373 LPGMQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDLDNV---------GQAGGQALAE 1423
Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGL 605
+FG NPGGRLP T Y A+ V ++ +RP + PGRTY+F+ G VY +G GL
Sbjct: 1424 TLFGDNNPGGRLPYTLYPADLVNQVSMFDDGMRPNATSGNPGRTYRFYTGTPVYAYGTGL 1483
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
SYT F Y+ ++ V ++ R G + + D+V +DY +
Sbjct: 1484 SYTSFSYETSTPSLRVSA-----ERVRAWVAARGQT-----SFIRDEVDAEDY---ITVT 1530
Query: 666 VENMGKMDGSEVVMVYSK--PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
V+N G + G++VV V+ K PG G IK + G+ERVF+ G++ + F + L +
Sbjct: 1531 VQNNGTVAGADVVQVFIKTTTPGADGNPIKSLCGFERVFLKPGETTSIQFPVTP-HDLSV 1589
Query: 724 VDNAANSLLASGAHTILV 741
V++ + G T+ V
Sbjct: 1590 VNSRGERVAVPGTWTVEV 1607
>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
Length = 779
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 303/751 (40%), Positives = 439/751 (58%), Gaps = 59/751 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L + P+C+ L + +RA DLV R+TL EK+ Q G A GV RLG+ YEWWSEALHGV+
Sbjct: 32 LRNLPFCNPNLAWEQRADDLVGRLTLQEKISQFGTTAPGVARLGVNAYEWWSEALHGVA- 90
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVI--------LTTASFNESLWKKIGQTVSTEARAM 120
PG +F P +T FP +I A+FN + Q +STEARA
Sbjct: 91 ------ESPGVNFTGNTPVSTCFPQIIGNNCSSLSRVGATFNLDSVAAMAQVISTEARAF 144
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
N G+AGLT+++PNIN+ RDPRWGR ETPGEDPY+ RY V+ LQ+ E
Sbjct: 145 ANAGHAGLTYFTPNINIFRDPRWGRGQETPGEDPYLTSRYVETLVQNLQNGE-------- 196
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
D+R LK+ A CKHY AYD+++W G DRFHF++ V++QD+ ETF+ PFE CV G +S+M
Sbjct: 197 DARYLKVVATCKHYTAYDMEDWGGIDRFHFNAVVSDQDLVETFMPPFEACVRVGKGASLM 256
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
CSYN VNGIP+CAD + N+ R W F GYIVSDC +I I +H + N T+ A +
Sbjct: 257 CSYNAVNGIPSCADDFINNEIAREQWGFDGYIVSDCGAIDCIQYTHNYTNTTQATCAAGI 316
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
+ G DLDCGD+Y + M A+ + EAD+D SLR L+ +RLG FD + Y+ +
Sbjct: 317 -QGGCDLDCGDFYQSHLMDAIGNATLHEADLDFSLRRLFGHRIRLGEFDAASIQPYRQIP 375
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ I + +H ELA + AR+ IVLL NDN LP + ++ LA++GP+A+ + ++GNY G
Sbjct: 376 VSAINSQEHQELALQIARESIVLLGNDNNTLPFSLATVRKLAIIGPNADDAETLLGNYYG 435
Query: 419 TPCRYTSPMDGFYAYSKV--INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
+P+ GF I + GC D+ + S AA AAK ADAT++V GL+ +
Sbjct: 436 DAPYLITPLKGFQQLDPTLSITFVKGC-DVNSTDTSGFVAAAAAAKAADATIVVVGLNQT 494
Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
VE+E DR L+LPG Q ELI + AA+GPV LV+MS +D++ + +++ LW+G
Sbjct: 495 VESENLDRTTLVLPGVQAELILALTAAARGPVILVVMSGSPIDLSNVIH--PVRAALWIG 552
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDG 595
YPG+ GGRA+A+ +FG ++P GRLP T Y A+YV ++P T+M +R PGRTY+F+ G
Sbjct: 553 YPGQAGGRALAEAVFGVFSPAGRLPFTVYPADYVNQLPMTNMDMRAG---PGRTYRFYTG 609
Query: 596 PVVYPFGYGLSYTQFKYK--VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
++ FG+GLSY+ F+Y +SS S + + P AV
Sbjct: 610 TPLFEFGHGLSYSTFQYTWSNSSSSSSSSATSQHSLSTAALAAQHLAARAPVEAV----- 664
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSK----------PPGIAGTHIKQVIGYERVFI 703
+F++ V+N GKM +VV+ ++ A I+ ++G+ R+ +
Sbjct: 665 -------SFRVLVQNTGKMASDDVVLAFASFNASSIIDQSSSQFASPPIRSLVGFRRIHL 717
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
A G S ++ F + + + ++ A +L+ S
Sbjct: 718 APGASQEIFFAVTSSQLAQVDSTGAQTLVPS 748
>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
Length = 709
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 296/730 (40%), Positives = 420/730 (57%), Gaps = 71/730 (9%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
R+ + R+ L + V + + A P + +P YEWW+EALHGV+ PG F
Sbjct: 7 RSLHCLTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVAL-------SPGVTFKG 59
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
+ ATSFP V+ T ASFN SL+ +I +STEARA +N +AGLTFW+PN+N+ RDPRW
Sbjct: 60 SITAATSFPQVLSTAASFNRSLFYQIADVISTEARAFHNAKDAGLTFWTPNVNIFRDPRW 119
Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
GR ETPGEDPY+ G YA+ +VRGLQ EG+E +S+ LKIS+CCKH++AY +
Sbjct: 120 GRGQETPGEDPYLTGEYAVAFVRGLQG-EGMEGREVENSKFLKISSCCKHFSAYS----Q 174
Query: 204 GNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIR 263
R ++ VT+QD +T+ FE CV G VSS+MCSYN VNGIP+CAD LL +R
Sbjct: 175 EVPRHRNNAMVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPSCADKGLLTDLVR 234
Query: 264 GDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQ 323
G W F GYI SDC+++ +++ H + + E A L AG+DL+CG++ A++Q
Sbjct: 235 GQWKFDGYIASDCEAVADVIDHHHY-TQSPEQTCATTLDAGMDLNCGEFLRQHLPKALEQ 293
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
G + I +L+ + VLMRLG F+ + N+ K+++ H +LA EAARQ IVLLK
Sbjct: 294 GIVTTEMIHNALKNQFRVLMRLGMFEKVEPFANITKDSVDTTMHRQLALEAARQSIVLLK 353
Query: 384 NDNGALPLNTGNI---KTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYA 440
ND LPL T + ++LAL+GPH NA+ A++GNY G P +P++G + + ++
Sbjct: 354 NDGNTLPLATKDFTRDRSLALIGPHFNASAALLGNYFGIPSHIVTPLEGISQFVPNVAHS 413
Query: 441 PGCADIVCQNNSMIP---AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI 497
GC + ++P AI AK AD ++ GLD S E E DR + LP FQ+ L+
Sbjct: 414 LGCK----VSGEVLPDFDDAIAVAKKADRLIVFVGLDQSQEREEIDRYHIGLPAFQSTLL 469
Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
+V + A P+ V++S G VD++ KN+PK+ +I++ GY G+ GG+A+ADV+FGKYNP
Sbjct: 470 KRVLEVASHPIVFVVISGGCVDLSAYKNHPKVGAIVFGGYLGQAGGQALADVLFGKYNPS 529
Query: 558 GRLPITWYEANYVK-IPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
G+LP T+Y++ YV + M +R PV GRTY+FF G VY FG+GLSYT F
Sbjct: 530 GKLPQTFYDSEYVNAMSIYDMHMRPTPVTGNSGRTYRFFTGVPVYEFGFGLSYTTFH--- 586
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
N C A TF I V N G + G
Sbjct: 587 -------------------------KNCHACVA-------------TFNITVTNAGAISG 608
Query: 675 SEVVMVYSKPP--GIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
+V++ Y +PP G G +K ++ +ER IAAGQ A + A K+ + + A N +
Sbjct: 609 EDVILTYVEPPLAGEGGRPLKSLVAFERTPLIAAGQRATAKICLEA-KAFALANEAGNWV 667
Query: 732 LASGAHTILV 741
+ G TI V
Sbjct: 668 VEPGNWTIHV 677
>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
Length = 690
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 304/732 (41%), Positives = 422/732 (57%), Gaps = 72/732 (9%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
ERA+ LV +TL EK+ MG A V RL +P Y+WWSEALHGV+ + PG F
Sbjct: 3 ERARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVA-------ASPGVVFQ 55
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPR 142
P AT+FP V LT SF++ L+ I +STEAR M N A LT+WSPN+NV RDPR
Sbjct: 56 EPTPFATAFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERANLTYWSPNVNVYRDPR 115
Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
WGR ETPGEDP++V YA+ +VRGLQ+ E D R LK+SACCKHY+AYDL+NW
Sbjct: 116 WGRGQETPGEDPFLVATYAVEFVRGLQEGE--------DPRYLKVSACCKHYSAYDLENW 167
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
G +RF FD+ V+++DM +TF +PFE CV +G VSS+MCSYN +NGIP CAD +LL T
Sbjct: 168 HGVERFEFDAIVSDRDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPACADRELLYGTA 227
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQ 322
RG W F GYI SDC +I TI+ +H + NDT A+ V +A DLDCG +Y + +V+
Sbjct: 228 RGGWGFEGYITSDCGAIDTIIYNHHYTNDTDTTAMLGV-RATCDLDCGGFYQQHILHSVE 286
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGIV 380
G++ EA++D +L L+ V MRLG FD Q Y + G + + +H +A AAR+GI
Sbjct: 287 SGRLKEAEVDDALANLFKVQMRLGLFDPVEQQVYTHYGLDKLNTKEHQAMALRAAREGIA 346
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYA 440
LLKN N LPL+ + K + ++GP+A M+GNY G P ++ A
Sbjct: 347 LLKNQNDFLPLSLKD-KHVVVMGPYAEDAGVMLGNYNGIP-------------EFIVTVA 392
Query: 441 PGCADIVCQNNSMIPA--AIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELIN 498
G + VC + ++ + A+ + D V+ GL+ +E EG DR DLLLP Q L++
Sbjct: 393 QGLRN-VCDHVDVVKSLEALSKLEGVDLIVVTVGLNQEIEREGLDREDLLLPASQRALLD 451
Query: 499 KVADAAKGPVTLVIMSAG-AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
+ PV L ++S G +VDI+ + N + +L VGY G GG+AIA+VI G NP
Sbjct: 452 GLLAQTDVPVVLTLLSGGGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIAEVIVGDVNPS 511
Query: 558 GRLPITWYEANYV-KIPYTSMPLRPVNN--FPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
GRL T Y +YV + Y M +RP FPGRTY+FF GPV++PFG+GLSYT F +
Sbjct: 512 GRLVNTMYYNDYVTNLDYFDMNMRPKEETGFPGRTYRFFAGPVIHPFGFGLSYTTFAH-- 569
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
+V+I ++ + R +A+ ID ++V N G G
Sbjct: 570 -----AVEIGQMRNHRLR-------------SALAID----------VYVKVTNTGSRQG 601
Query: 675 SEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL 732
E V+++ K P G G +K + + RV +A G++ V F + + L + + A +L
Sbjct: 602 DESVLLFVKSPLAGKQGYPLKSLADFSRVSLAPGETQTVHFVLGE-EQLHLANEQAKYVL 660
Query: 733 ASGAHTILVGEG 744
G + V E
Sbjct: 661 LRGEWKVEVEEA 672
>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 755
Score = 526 bits (1356), Expect = e-146, Method: Compositional matrix adjust.
Identities = 298/740 (40%), Positives = 425/740 (57%), Gaps = 61/740 (8%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ YC+ ER KDL+ R+T+ EK+ Q A + RL +P Y+WWSE LHG++
Sbjct: 56 YLYCNYSASITERVKDLLSRLTVLEKMSQTATNASAIERLDIPAYDWWSECLHGLA---- 111
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PG F++++ ATSFP VI A+FN SL +GQ +STEARA N G +GLTF+
Sbjct: 112 ---QSPGVFFENDLTSATSFPQVIGLGATFNMSLVLAMGQVISTEARAFANNGQSGLTFF 168
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
+PNIN+ RDPRWGR ETPGEDPY+ +YA N+V+G+Q EG E D R LK A C
Sbjct: 169 APNINIYRDPRWGRGQETPGEDPYLTSQYAANFVKGIQ--EGSE-----DRRYLKAIATC 221
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KHYAAY+L+ + R +F++ V++QD++ET++ F+ CV EG V S+MCSYN +NG+P
Sbjct: 222 KHYAAYNLERYLDVRRVNFNAIVSDQDLEETYLPAFKACVQEGQVGSIMCSYNAINGVPN 281
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
CA+ + N+ R W F GYIVSDC +I I H + +DT VA LK G DL+CG
Sbjct: 282 CANDFINNKIARDTWGFEGYIVSDCGAILDIQYKHNYTSDTN-ITVADALKGGCDLNCGH 340
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHI 368
+Y + A I E DID SL L+ MRLG FD P+ ++ ++ P+
Sbjct: 341 FYEKYMEDAFDNSTITEEDIDKSLTRLFTSRMRLGMFD-PPEIQPFRQYSVKDVNTPEAQ 399
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+LA AAR+GIVLL+N LPL+ +A +GP+A+AT M GNY G SP+
Sbjct: 400 DLALNAAREGIVLLQNKGSVLPLDIVKHSNIAAIGPNADATHIMQGNYHGIAPYLISPLQ 459
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
GF Y GC + C + P A+ A + DA + V GL+ + E E DR +
Sbjct: 460 GFSNLGINATYQIGCP-VACNDTEGFPDAVKAVQGVDAVIAVIGLNNTQEGESHDRTSIA 518
Query: 489 LPGFQTELINKV-ADAAKG-PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LPG Q +L+ ++ +AAKG P+ +V+MS G+VD+ K+ +ILW GYPG+ GG+AI
Sbjct: 519 LPGHQEDLLLELKKNAAKGTPLIVVVMSGGSVDLTGVKD--IADAILWAGYPGQSGGQAI 576
Query: 547 ADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
A+VI+GK NP GRLP+T+Y A+Y+ +IPYT+M +R PGR+YKF+ G V+PFG+GL
Sbjct: 577 AEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRVP---PGRSYKFYTGTPVFPFGFGL 633
Query: 606 SYTQF--KYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
SYT F K+K S+ K +K D+ +NY +
Sbjct: 634 SYTTFEIKWKDTSTAKDYYLKTTHDEV---VNY--------------------------E 664
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
V N G GS V+ + + G +K++ ++++++ +S V F K
Sbjct: 665 ATVTNSGSRPGSVSVLAFIT-SSVPGAPMKELFAFKKIYLEPTESVDVSFVAEP-KVFTT 722
Query: 724 VDNAANSLLASGAHTILVGE 743
VD + GA+ I++G+
Sbjct: 723 VDIYGIRKIRPGAYKIIIGD 742
>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 525
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 263/493 (53%), Positives = 352/493 (71%), Gaps = 17/493 (3%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ + +C+ K RA+DLV R+TL EKV + + + RLG+P YEWWSEALHGVS+
Sbjct: 46 LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYEWWSEALHGVSY 105
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+G PGT F VPGATSFP ILT ASFN SL++ IG+ VSTEARAM+N+G AGL
Sbjct: 106 VG------PGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNVGLAGL 159
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
TFWSPNIN+ RDPRWGR ETPGEDP + +YA+ YV GLQD G D LK++
Sbjct: 160 TFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDA-GAGGVTDG---ALKVA 215
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
ACCKHY AYD+DNW+G +R+ FD++V++QD+ +TF PF+ CV +G+V+SVMCSYN+VNG
Sbjct: 216 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNG 275
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
PTCAD LL IRGDW +GYIVSDCDS+ ++ + + T E+A A +K+GLDL+
Sbjct: 276 KPTCADKDLLEGVIRGDWKLNGYIVSDCDSVD-VLYTQQHYTKTPEEAAAITIKSGLDLN 334
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNP 365
CG++ T+ AVQ G+++E D+D ++ +I+LMRLG+FDG P+ + +LG ++C
Sbjct: 335 CGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTS 394
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ ELA E ARQGIVLLKN +GALPL+ +IK++A++GP+ANA+ MIGNYEGTPC+YT+
Sbjct: 395 SNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 453
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
P+ G A + Y PGC ++ C NS+ + A+ AA +AD TV+V G D S+E E DR
Sbjct: 454 PLQGLGAKVNTV-YQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDR 512
Query: 485 VDLLLPGFQTELI 497
LLLPG QT+L+
Sbjct: 513 TSLLLPGQQTQLV 525
>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
Length = 618
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 253/624 (40%), Positives = 366/624 (58%), Gaps = 25/624 (4%)
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSPN+N+ RDPRWGR ETPGEDP +Y +V+GLQ S L+ SA
Sbjct: 1 MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQG---------SSLTNLQTSA 51
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH AYD++ W+G R++F+++VT QD+ +T+ PF CV +G S +MC+Y +NG+
Sbjct: 52 CCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGV 111
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CA LL +T+RG+W GY SDCD++ + +S F T E+AVA LKAGLD++C
Sbjct: 112 PACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTR-TAEEAVAVALKAGLDINC 170
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNP 365
G Y A+QQGK+ E D+D +L+ L+ + MRLG+FDG P+ Y L ++C P
Sbjct: 171 GVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTP 230
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
H LA EAAR+G+VLLKND LPL + + A++G +AN A++GNY G PC T+
Sbjct: 231 VHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTT 290
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P G Y K + PGC+ C + + A AK++D +V GL E EG DR
Sbjct: 291 PFGGIQKYVKSAKFLPGCSSAAC-DVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRT 349
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
LLLPG Q LI VA A+K PV L++++ G VDI FA+ NPKI +ILW GYPG+ GG+A
Sbjct: 350 SLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQA 409
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
IADV+FG++NP G+LP+TWY + K T M +R P +PGR+Y+F+ G VY FGY
Sbjct: 410 IADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 469
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKF 660
GLSY++F ++ S + + T A +D++ +C+ +F
Sbjct: 470 GLSYSKFACRIVSGAGNS----SSYGKAALAGLRAATTPEGDAVYRVDEIGDDRCERLRF 525
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGI-AGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
+EV+N G MDG V+++ + G ++Q+IG+ + G+ K+ ++ C+
Sbjct: 526 PVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCE 585
Query: 720 SLKIVDNAANSLLASGAHTILVGE 743
L ++ G+H ++V E
Sbjct: 586 HLSRARVDGEKVIDRGSHFLMVEE 609
>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 732
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 292/758 (38%), Positives = 420/758 (55%), Gaps = 80/758 (10%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHG 65
K K F YC+ LP +R KDL+ RMTL EK+ Q+G+ A + RL +P Y+WWSE LHG
Sbjct: 26 KTKFQSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGLHG 85
Query: 66 VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
V+ PG HF+ ATSFP VI T +SFN++L+ +I +STEARA N
Sbjct: 86 VA-------DSPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARA---FAN 135
Query: 126 AGLTFWSPNINVV--------RDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYH 177
G+ ++ + ++ RDPRWGR ETPGEDPY+ +YAI +V G Q
Sbjct: 136 QGIVYFKQHQQLLSNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQ-------- 187
Query: 178 RDSDSRPLKISACCKHYAAYDLDNW-EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
DS+ LK+ CKH+A YDL+++ +G R F++++T QD +ET+ F+ CV E +V
Sbjct: 188 --GDSKYLKVVTTCKHFAGYDLEDYVDGETRHSFNAKITPQDFEETYYPAFKACVEEANV 245
Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
+S+MCSYN VNG+P+CAD ++ N+ R W F G+I SDC +I I H + N+T +D
Sbjct: 246 ASIMCSYNEVNGVPSCADGQINNKLARDTWGFDGFIASDCGAIDDIQNKHHYTNNT-DDT 304
Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--- 353
VA LK G DL+CG YY + A G I +I+ +L L+ M+LG FD P+
Sbjct: 305 VAAALKGGCDLNCGSYYQSHAQSAFLNGTITIGEINLALTRLFTARMKLGMFD-PPELQP 363
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
Y + + + + +H LA AAR+ IVLL+N+N LPLN T+A+VGPHA AT M
Sbjct: 364 YNAISPDVVNSLEHQALALNAARESIVLLQNNNDVLPLNFEKHSTIAVVGPHAMATDVMQ 423
Query: 414 GNYEGTPCRYTSPMDGF--YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
GNY G SP++GF V+ A GC D+ C+ A D A ADA + V
Sbjct: 424 GNYNGVAPYLISPVEGFENLGIDSVLT-ASGC-DVNCEVTDGFQDAFDIAVKADAVIAVL 481
Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAK-----GPVTLVIMSAGAVDINFAKNN 526
GLD S E+EG DR DL LP Q + + + + K P+ +V+MS +VD+ K +
Sbjct: 482 GLDQSHESEGHDREDLFLPNLQDKFVQDLKNTLKAAGTNAPLIVVVMSGSSVDLTVTKKH 541
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNF 585
+ILW GYPG+ GG+AIA++I+GK NP GRLP+T+Y +Y+ + + M +R +
Sbjct: 542 A--DAILWAGYPGQSGGQAIAEIIYGKVNPSGRLPVTFYPGSYIDLVAFRHMSMR---EY 596
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
PGRTYKF++ + FG GLSYT F Y S P ++ R ++Y
Sbjct: 597 PGRTYKFYNDTPDFSFGDGLSYTTF-YLEWSKPVNM-------SGVRSVSYPT------- 641
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAA 705
+ + V N GKM G+ V+ Y +G K++ G+E+VF+
Sbjct: 642 --------------VVYNVTVTNTGKMPGAISVLAYISYNN-SGAPKKKLFGFEKVFLNP 686
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
QS V F ++ K+ VD + + G + + +G+
Sbjct: 687 LQSVSVTFPADS-KAFSTVDKSGKRSVNPGDYHVTIGD 723
>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
Length = 699
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 276/645 (42%), Positives = 373/645 (57%), Gaps = 89/645 (13%)
Query: 104 SLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAIN 163
S + ++ + VSTEARAMYN+G AGLTFWSPN+N+ +DPRWGR ETPGEDP + +YA
Sbjct: 128 SKFMRLRKVVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASG 187
Query: 164 YVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETF 223
YVRGLQ + D LK++ACCKHY AYDLDNW+G D FHF++ VT QDM +TF
Sbjct: 188 YVRGLQQSD------DGSPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTF 241
Query: 224 ILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV 283
PF+ CV +G+V+SV+ YIVSDCDS+
Sbjct: 242 QPPFKSCVIDGNVASVI------------------------------YIVSDCDSVDVFY 271
Query: 284 ESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
S + T E+A A+ + AGLDL+CG + T AV+ G + E+ +D ++ + LM
Sbjct: 272 NSQHY-TKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLM 330
Query: 344 RLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLA 400
RLG+FDG+P Y LG ++C +H E A EA RQGIV
Sbjct: 331 RLGFFDGNPSKAIYGKLGPKDVCTSEHQERAREAPRQGIV-------------------- 370
Query: 401 LVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDA 460
+ GTPC+YT+P+ G A Y PGC+++ C + I A
Sbjct: 371 ---------------FAGTPCKYTTPLQGLTALVAT-TYLPGCSNVAC-GTAQIDEAKKI 413
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A ADATV++ G+D S+EAEG+DRV++ LPG Q LI +VA +KG V LV+MS G DI
Sbjct: 414 AAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKXSKGNVILVVMSGGGFDI 473
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPL 579
+FAKN+ KI SI WVGYPGE GG AIADVIFG YNP G+LP+TWY +YV K+P T+M +
Sbjct: 474 SFAKNDDKITSIQWVGYPGEAGGAAIADVIFGFYNPSGKLPMTWYPQSYVDKVPMTNMNM 533
Query: 580 R--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
R P + +PGRTY+F+ G +Y FG GLSYTQF + + +PKSV I +++ C
Sbjct: 534 RPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEAHSC------ 587
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIG 697
+ C +V C++ F + V N G + GS V ++S PP + + K ++G
Sbjct: 588 ---HSSKCKSVDAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLG 644
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+E+VF+ A A V F ++ CK L IVD +A G H + VG
Sbjct: 645 FEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVG 689
>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 733
Score = 503 bits (1296), Expect = e-139, Method: Compositional matrix adjust.
Identities = 293/740 (39%), Positives = 414/740 (55%), Gaps = 70/740 (9%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
YC+ +L + +R KDL+ R+TL EK+ Q+G+ A + RLG+P Y+WWSE LHGV+
Sbjct: 37 YCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPGYQWWSEGLHGVAV----- 91
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
PG H + TSFP +I T +SFN+SL+ +IG+ VSTEAR + G GLT+++P
Sbjct: 92 --SPGLHLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFADNGQGGLTYFTP 149
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+VRDPRWGR ET GEDPY+ +YA+N VRG Q +DS KI A CKH
Sbjct: 150 NINIVRDPRWGRGQETAGEDPYLTSQYAVNLVRGAQ---------GNDSEYKKIIATCKH 200
Query: 194 YAAYDLDNW-EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
+AAYDL+++ G+ R F++ VT+QD++ET+ F CV G V S+MCSYN VNG+P+C
Sbjct: 201 FAAYDLESYINGDVRDSFNAEVTKQDLEETYFPAFRSCVTAGGVGSIMCSYNSVNGVPSC 260
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
D N+ R W F GY+VSDC +I ++ H + + T D VA LK G DL+CG +
Sbjct: 261 VDGVFNNKIARNKWKFDGYLVSDCGAIDDVMNKHHYTS-TPTDTVAAGLKGGTDLNCGSF 319
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIE 369
Y M A G I E DID ++ L+ MRLG FD P+Y+ N + QH +
Sbjct: 320 YQTHAMDAFLNGSITEVDIDRAVGRLFTARMRLGLFD-LPKYQPYSYFNTDVVNTKQHQD 378
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
LA +AAR+ IVLL+N NG LPL+ + +A+VGP+ A M G + SP+DG
Sbjct: 379 LALQAARESIVLLQN-NGKLPLSYEDHHKIAVVGPNILANVTMQGISQVIAPYLISPVDG 437
Query: 430 FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
F + + Y+ GC D+ C A K+A A V V GLD +E E DR D+ L
Sbjct: 438 FKSKGLHVTYSLGC-DVKCIVTDGFHDAFKLVKDAKAVVAVMGLDQGIERETVDREDIFL 496
Query: 490 PGFQTELINKVADAAKG-----PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
PG Q + + + D P+ +VIMS +VD++ +K+ +ILWVGYPG+ GG+
Sbjct: 497 PGLQDKFLLGLRDTLTNLQSPVPLIVVIMSGSSVDLSESKS--LADAILWVGYPGQSGGQ 554
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA+VI+G+ NP GRLP+T+Y Y+ + Y M +R PGRTY+F+ V+PFG+
Sbjct: 555 AIAEVIYGEVNPSGRLPLTFYPGEYIDLVAYRHMSMREP---PGRTYRFYTENPVFPFGH 611
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F+ + +V V+ D V D F
Sbjct: 612 GLSYTTFELSWTNKMNNV-----------------------TEIVISDSV---DINIDFD 645
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
I V N G + G+ V+ Y I ++++ +++VFI +S K+ SL
Sbjct: 646 ITVVNTGYLSGAVSVLGYVS-SNIPDAPLRELFDFDKVFIDKYESKKI--------SLFA 696
Query: 724 VDNAANSLLASGAHTILVGE 743
++A ++ G IL GE
Sbjct: 697 TNDAFTTVDEKGRRNILPGE 716
>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 790
Score = 503 bits (1295), Expect = e-139, Method: Compositional matrix adjust.
Identities = 295/745 (39%), Positives = 406/745 (54%), Gaps = 60/745 (8%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD RAK L+ TL EK+ G + GVPRLGL YEWW EALHGV+
Sbjct: 39 CDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALHGVA------- 91
Query: 75 SPPGTHFD--SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
S PG +F E ATSFP IL A+F++ L + +STEARA N AGL FW+
Sbjct: 92 SSPGVNFSVSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRAGLDFWT 151
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN +DPRWGR ETPGEDPY + Y + +RGLQ + Y K+ A CK
Sbjct: 152 PNINPFKDPRWGRGQETPGEDPYHLSSYVHSLIRGLQG-DNPSYK--------KVVATCK 202
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+ AYD++NW GN R+ D+ + QD+ E ++ PF C + +V + MCSYN +NG+PTC
Sbjct: 203 HFVAYDVENWNGNFRYQLDAHINSQDLVEYYMPPFRSCARDSNVGAFMCSYNSLNGVPTC 262
Query: 253 ADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
ADP LL +R WN+ ++ SDCDS+Q + H + + ++E+A A LKAG D++C
Sbjct: 263 ADPYLLQTVLREHWNWTAEEQWVTSDCDSVQNVFLYHNYAS-SREEAAAISLKAGTDINC 321
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-QYKNLGKNNICNPQHI 368
G YY A +QG I E D+DTSL Y L+RLGYFDG Y+NL N++ P
Sbjct: 322 GTYYQEHLPRAYEQGLINETDVDTSLIRQYGSLIRLGYFDGDRVPYRNLTWNDVSTPYAQ 381
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+LA +AA GI LLKND G LPL N +AL+G ANAT M+GNY G P + SP+
Sbjct: 382 DLALKAATSGITLLKND-GILPLQITNGTKIALIGDWANATDQMLGNYHGIPPYFHSPLW 440
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
+ Y G AA +D + + G+D VEAE KDRV +
Sbjct: 441 AAQQTGAEVTYVQGPGGQSDPTTYTWRPIWSAANKSDVIIYIGGMDERVEAEEKDRVSIA 500
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
G Q ++I ++AD P +V M G++D + NP I+++LW GYPG++GG+AI D
Sbjct: 501 WSGPQLDVIGQLADYYDKPTIVVQMGGGSLDSSPLVKNPNIRALLWGGYPGQDGGKAIFD 560
Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGL 605
++ G P GRLPIT Y A+Y+ K+P T LRP + PGRTY + + V+ FGYGL
Sbjct: 561 ILQGISAPAGRLPITQYRADYISKVPMTDTSLRPNATSGSPGRTYIWLNEEPVFEFGYGL 620
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
YT F + D + D Y++ + C +D K TF I+
Sbjct: 621 HYTNFTATI------------PDAESSDTTYSIDSLASDCTESYLDRCPFK----TFSID 664
Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVF-IAAG--QSAKVGFTMN 716
V N G + V + + + G H K+++ Y+R+ I AG Q+A + T+
Sbjct: 665 VTNTGSVTSDYVTLGF-----LTGAHGPEPCPNKRLVSYQRLHNITAGSTQTAALNLTLG 719
Query: 717 ACKSLKIVDNAANSLLASGAHTILV 741
SL VD+ N++L G++ +LV
Sbjct: 720 ---SLSRVDDKGNTVLFPGSYALLV 741
>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
Length = 573
Score = 503 bits (1295), Expect = e-139, Method: Compositional matrix adjust.
Identities = 257/579 (44%), Positives = 361/579 (62%), Gaps = 17/579 (2%)
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
+S L+ SACCKH+ AYDL+NW+G RF FD++VTEQD+ +T+ PF+ CV +G S +M
Sbjct: 1 NSSDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIM 60
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
CSYNRVNG+PTCAD LL++T RGDW+F+GYI SDCD++ I + + EDAVA V
Sbjct: 61 CSYNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAK-APEDAVADV 119
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK---NL 357
LKAG+D++CG Y + A QQGKI DID +LR L+ + MRLG F+G+P+Y N+
Sbjct: 120 LKAGMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNI 179
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
G + +C +H +LA +AA+ GIVLLKND GALPL+ + ++A++GP+ N ++GNY
Sbjct: 180 GADQVCKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYF 239
Query: 418 GTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
G PC +P Y K + GC VC N S I A+ AA +AD V+ GLD +
Sbjct: 240 GPPCISVTPFQALQGYVKDATFVQGCNAAVC-NVSNIGEAVHAASSADYVVLFMGLDQNQ 298
Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
E E DR++L LPG Q L+NKVADAAK PV LV++ G VD+ FAKNNPKI +I+W GY
Sbjct: 299 EREEVDRLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGY 358
Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDG 595
PG+ GG AIA V+FG++NPGGRLP+TWY + +P T M +R P +PGRTY+F+ G
Sbjct: 359 PGQAGGIAIAQVLFGEHNPGGRLPVTWYPKEFTAVPMTDMRMRADPSTGYPGRTYRFYKG 418
Query: 596 PVVYPFGYGLSYTQFKYKVASS---PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
VY FGYGLSY+++ ++ AS P S+ ++ + TV + A
Sbjct: 419 KTVYNFGYGLSYSKYSHRFASEGTKPPSMS-GIEGLKATASAAGTVSYDVEEMGA----- 472
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKV 711
C +F + V+N G MDG V+++ + P G Q+IG++ V + A ++A V
Sbjct: 473 EACDRLRFPAVVRVQNHGPMDGRHPVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHV 532
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
F ++ CK ++ G+H + VG+ +SF
Sbjct: 533 EFEVSPCKHFSRAAEDGRKVIDQGSHFVKVGDDEFELSF 571
>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
Length = 748
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 286/750 (38%), Positives = 414/750 (55%), Gaps = 100/750 (13%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L D P+C+ L +R DLV R+TL + + QMG A VP LG+P Y WW+E LHGV
Sbjct: 10 LKDLPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTECLHGV-L 68
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
TN P TSFP A+FN L K+ + +S EARA+ N G GL
Sbjct: 69 TKCGTNCP------------TSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIGGL 116
Query: 129 TFWSPNI-----------------------NVVRDPRWGRVLETPGEDPYVVGRYAINYV 165
FW+PNI ++ RDPRWGR +E PGEDP++ +Y +++
Sbjct: 117 DFWAPNIKYSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFM 176
Query: 166 RGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFIL 225
RGLQ+ E DSR ++ CKH+AAY L+ W+ DRF FD+ V++ D ET++
Sbjct: 177 RGLQEGE--------DSRYPQVVGTCKHFAAYSLEAWKDYDRFMFDAIVSDYDFVETYLP 228
Query: 226 PFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES 285
F+ C+ EG S+MCSYN VNG+P+CA+ LL +R W+F GY+VSDCD++ TI +
Sbjct: 229 AFKGCIVEGRARSIMCSYNSVNGVPSCANDFLLRTILRDSWSFDGYVVSDCDAVDTIYNN 288
Query: 286 HKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRL 345
H F T E A A L AG DL+CGD+Y A +G++ E ++ +++ L+ M L
Sbjct: 289 HHF-TKTPEGACAVALHAGTDLNCGDFYQKHLGKAHSEGRVTEDEVRLAVKRLFRQRMEL 347
Query: 346 GYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
G +D + YK + + + +H +LA +AAR+ +VLL+N G LPL +++ +A++G
Sbjct: 348 GMWDPPAEQPYKQYPPSVVGSREHSDLALQAARESMVLLQNRRGVLPLRK-SVRRVAVIG 406
Query: 404 PHANATKAMIGNYEGTPCR------YTSPMDGFYAY--SKVINYAPGCADIVCQNNSMIP 455
P+ANAT+ M+GNY G+ C SP A ++ Y GC D+ N + IP
Sbjct: 407 PNANATETMLGNYYGSRCHDGTYDCIVSPYLAIKAKLPQALVTYNLGC-DVDSTNTTGIP 465
Query: 456 AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSA 515
A+ AA+ AD ++V GL+ SVE+EGKDRV + LPG Q LI + A P +V+M
Sbjct: 466 EAVKAAQAADVAIVVLGLNTSVESEGKDRVAITLPGMQDHLIKSIV-ATNTPTVVVMMHG 524
Query: 516 GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG----------GRLPITWY 565
GAV I + K+ ++ I+ YPGE GG+AIADV+FG YNPG GRLP+T
Sbjct: 525 GAVAIEWIKD--QVDGIVDAFYPGENGGQAIADVLFGDYNPGDNKTDGTTLLGRLPVTVL 582
Query: 566 EANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPV-VYPFGYGLSYTQFKYKVASSPKSVDI 623
ANYV +P T+M +R N PGRTY+++ GP ++ FG+GLSYT FK + S+P+ +
Sbjct: 583 PANYVDMVPLTNMSMRASGNNPGRTYRYYTGPAPLWEFGFGLSYTTFKTEWLSTPQPSAL 642
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
K +D +F++ V N+G + G EVV+ +
Sbjct: 643 K----------------------------SYARDEAVSFRVRVTNVGPVAGDEVVLAFVT 674
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
+KQ+ +ERV + G+S ++ F
Sbjct: 675 RDNADRGPLKQLFAFERVHLNPGESKEIFF 704
>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
Length = 800
Score = 500 bits (1287), Expect = e-138, Method: Compositional matrix adjust.
Identities = 295/747 (39%), Positives = 421/747 (56%), Gaps = 52/747 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L D CD+ RA LV+ +TL EK+ G+ + GVPRLG+P Y+WW+EALHGV+F
Sbjct: 35 LKDNLVCDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNEALHGVAF 94
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+F S ATSFP IL A+F++ L ++ VSTEARA N G +GL
Sbjct: 95 TYPGQPMTESGNFSS----ATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGRSGL 150
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+W+PNIN +DPRWGR ETPGEDP+ + Y N +RGL+ + Y KI
Sbjct: 151 DYWTPNINPYKDPRWGRGQETPGEDPFHLASYVQNLIRGLEGNQNDPYK--------KIV 202
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKH+ YD++NW GN R+ FD+++ +DM E ++ PF+ C E V + MCSYN VNG
Sbjct: 203 ATCKHFTGYDMENWNGNFRYQFDAQINMRDMVEYYMPPFQACAREAKVGAFMCSYNAVNG 262
Query: 249 IPTCADPKLLNQTIRGDWNFH---GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
+PTCADP LL +R W ++ ++VSDCD+IQ + H++ +++E AVA L AG
Sbjct: 263 VPTCADPWLLQTVLREHWGWNQEDQWVVSDCDAIQNVYLPHEWA-ESREQAVADTLNAGT 321
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNIC 363
DL+CG YY + GA +QG I + +D +L Y L++LGYFD S Y+ +G ++
Sbjct: 322 DLNCGTYYQRYLPGAYEQGLINDTTLDRALTRTYSSLIKLGYFDNADSQPYRQIGWQDVN 381
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+ ELA +AA++GIVLLKND G LPL+ + ++AL+G ANAT+ M GNY G
Sbjct: 382 SQHAQELALKAAQEGIVLLKND-GLLPLSLDGVSSIALIGSWANATEQMQGNYAGVAPYL 440
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIP---AAIDAAKNADATVIVAGLDLSVEAE 480
SP+ +NYA G + Q+N A AA+N+D ++V G+D +E+E
Sbjct: 441 HSPLYAAEQLGVKVNYAEGAS----QSNPTTDQWGAEYTAAENSDVIIVVGGIDNDIESE 496
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRV + G Q ++I K+A K PV +V M AG +D +N I ++LW GYPG+
Sbjct: 497 ELDRVAIAWSGPQLDMITKLATYGK-PVIVVQMGAGQLDSTPLVSNANISALLWGGYPGQ 555
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
+GG A+ D+I G P GRLPIT Y A Y K + T M LRP + GRTYK+++G V+
Sbjct: 556 DGGTALFDIITGAVAPAGRLPITQYPARYTKEVAMTDMSLRPSSTSAGRTYKWYNGTAVF 615
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFG+GL YT F + S P S ++ + C+A D K
Sbjct: 616 PFGFGLHYTNFSAAIPSPPAS--------------SFAISDLVASCSAN--DTSKLDLCP 659
Query: 660 FT-FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK-QVIGYERVF-IAAG--QSAKVGFT 714
FT +++ N G V + + H K ++ Y+R+ IAAG Q+A++ T
Sbjct: 660 FTSLAVDIANDGTRASDFVALAFLTGEFGPSPHPKSSLVAYQRLHAIAAGETQTARLNLT 719
Query: 715 MNACKSLKIVDNAANSLLASGAHTILV 741
+ SL VD + LL G +++L+
Sbjct: 720 LG---SLVRVDENGDKLLYPGDYSVLI 743
>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
Length = 732
Score = 500 bits (1287), Expect = e-138, Method: Compositional matrix adjust.
Identities = 303/755 (40%), Positives = 413/755 (54%), Gaps = 85/755 (11%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
D P+C LP R DL+ RMTL EKV + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 38 DLPFCQVNLPIHTRVNDLIGRMTLQEKVGLLVNNAAAVPRLGIKGYEWWSEALHGVSNVG 97
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PGT F P ATSFP VI T ASFN +LW+ IG+ VS EARAM+N G AGLT+
Sbjct: 98 ------PGTKFGGAFPVATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGGVAGLTY 151
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+ PRWGR ETPGEDP VVG+YA +YVRGLQ +G+ LK++AC
Sbjct: 152 WSPNVTYSVYPRWGRGQETPGEDPVVVGKYAASYVRGLQGSDGIR---------LKVAAC 202
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLDNW G DRFHF+++V++QDM +TF +PF MCV EG V+SVMCSYN+VNGIP
Sbjct: 203 CKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPFRMCVKEGKVASVMCSYNQVNGIP 262
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
TCADP LL +T+RG W +GYIVSDCDS F + + KAGLDLDCG
Sbjct: 263 TCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQHFTSPRRSS--LGCYKAGLDLDCG 320
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-QYKNLGKNNICNPQHIE 369
+ AV++ EA+I+ + + LG FDGSP Q + P + +
Sbjct: 321 PFLVTHR-DAVKKAA-EEAEINNAWLKTLTFQISLGIFDGSPLQAVGDVVPTMGPPTNQD 378
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA--NATKAMIGNYEGTPCRYTSPM 427
LA A ++ + + KN L + GP A + M+GNYEG PC+Y P+
Sbjct: 379 LAVNAPKR-LFIFKNRAFLL------YSPRHIFGPVALFKSLPFMLGNYEGLPCKYLFPL 431
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G + ++ Y PGC++++C + +A+D A +ADA V+V G D S+E EG DRVD
Sbjct: 432 QGLAGFVSLL-YLPGCSNVICAVAD-VGSAVDLAASADAVVLVVGADQSIEREGHDRVDF 489
Query: 488 LLPGFQTELINKVADAAKGPVTLVIM----SAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
LPG Q EL+ +VA AAKGPV LVIM S G N + G
Sbjct: 490 YLPGKQQELVTRVAMAAKGPVLLVIMDLAISGGGCSYN------------------QVNG 531
Query: 544 RAIADVIFGK-------YNPGGRLP-ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
I+DV G N G +P I++ A + + +T + P ++ + +KF
Sbjct: 532 IPISDVCEGSSYRWPSFSNCHGYMPWISYSRAIWETLRFTKVNWVPTWSW-NKLHKF--- 587
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
G +++ +P+ L R N+ G +ID +
Sbjct: 588 --------GSHHSKCTDDGFGTPRRPPPWL------RKCNHFQGRQSELHMLDVIDSL-- 631
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
Q++V+N G MDG+ ++VY +PP KQ++ +E+V +AAG +VG +
Sbjct: 632 ----LGMQVDVKNTGSMDGTHTLLVYFRPPARHWAPHKQLVAFEKVHVAAGTQQRVGINI 687
Query: 716 NACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
+ CKSL +VD + + G H++ +G+ VS
Sbjct: 688 HVCKSLSVVDGSGIRRIPMGEHSLHIGDVKHSVSL 722
>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
NZE10]
Length = 802
Score = 494 bits (1272), Expect = e-137, Method: Compositional matrix adjust.
Identities = 287/749 (38%), Positives = 403/749 (53%), Gaps = 46/749 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L D CD RA L+ TL EK+ G + GVPRLGLP Y WW EALHGV+
Sbjct: 33 LKDNTVCDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQEALHGVA- 91
Query: 69 IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
S PG +F P ATSFP IL A+F++ L + + +STEARA N A
Sbjct: 92 ------SSPGVNFSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDKRA 145
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
GL FW+PNIN +D RWGR ETPGEDPY + Y + GLQ +Y R
Sbjct: 146 GLDFWTPNINPFKDSRWGRGQETPGEDPYHLSSYVAALIEGLQGSPDDKYKR-------- 197
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
+ A CKH+ AYD+++W GN R+ FD++V+ QD+ E ++ PF+ C + +V + MCSYN +
Sbjct: 198 VVATCKHFVAYDMESWNGNFRYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAFMCSYNAL 257
Query: 247 NGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NG+PTCADP LL +R WN+ ++ SDCD++Q + H + + T+E+A A LKA
Sbjct: 258 NGVPTCADPWLLQTVLREKWNWTSEQQWVTSDCDAVQNVFLPHDYAS-TREEAAALSLKA 316
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNI 362
G D++CG YY + A QG I D+D SL Y L+RLGYFDG + Y+NL N++
Sbjct: 317 GTDINCGTYYQDHLPAAYDQGLINTTDLDISLIRQYSSLVRLGYFDGLAVPYRNLTWNDV 376
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
P +LA +AA +GI LLKND G LPL N ++AL+G ANAT M+GNY+G P
Sbjct: 377 STPHAQQLAYKAAAEGITLLKND-GVLPLTISNGTSIALIGDWANATDQMLGNYDGIPPF 435
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
+ SP+ +N+A G AA +D + G+D SVE+EG
Sbjct: 436 FHSPLYAAQQTGATVNFATGPGGQGDPTTDHWLPVWAAANKSDVIIYAGGIDNSVESEGM 495
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DRV L G Q ++I ++A K PV ++ M G +D + NNP + +++W GYPG++G
Sbjct: 496 DRVSLTWTGAQLDMIGQLAMYGK-PVIVLQMGGGQIDSSPLVNNPNVSALIWGGYPGQDG 554
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVY 599
G A+ D+I G P GRLP T Y A Y+ ++P T M LRP PGRTY +++ V+
Sbjct: 555 GVALFDIIRGITAPAGRLPTTQYPAKYISQVPMTDMTLRPNSTTGSPGRTYIWYNENAVF 614
Query: 600 PFGYGLSYTQFKYKVASS-PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
P+G GL YT F + S P + D Y + T C A D
Sbjct: 615 PYGLGLHYTNFTAAIKPSFPSTYDSSSSNSGSAS---YDISTLTSNCTATYKDLCPFT-- 669
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVFIAAGQSAKVG 712
+F + + N G++ V + + +AG H K+++ Y+R+ S++
Sbjct: 670 --SFSVSITNTGEIMSDYVTLGF-----LAGIHGPAPHPNKRLVSYQRLHNITAGSSQTA 722
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILV 741
+ SL VD N +L G + +LV
Sbjct: 723 WLNLTLGSLARVDEMGNKVLYPGDYALLV 751
>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
Length = 591
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 245/593 (41%), Positives = 355/593 (59%), Gaps = 20/593 (3%)
Query: 156 VVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVT 215
+ +YA+ +V+G+Q + S L+ SACCKH AYDL++W G R++F+++VT
Sbjct: 1 MASKYAVAFVKGMQG---------NSSAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVT 51
Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
QD+++T+ PF CV + + +MC+Y +NG+P CA+ LL +T+RGDW GYI SD
Sbjct: 52 AQDLEDTYNPPFRSCVVDAKATCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASD 111
Query: 276 CDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSL 335
CD++ + ++ ++ T EDAVA LKAGLD++CG Y A+QQGK+ E DID +L
Sbjct: 112 CDAVAIMRDAQRY-TQTPEDAVAVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKAL 170
Query: 336 RFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
+ L+ + MRLG+FDG P+ Y LG +IC P+H LA EAA GIVLLKND G LPL
Sbjct: 171 KNLFAIRMRLGHFDGDPRSNSVYGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPL 230
Query: 392 NTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNN 451
+ + + A++GP+AN A+IGNY G PC T+P++G Y K + + GC C
Sbjct: 231 DRTAVASAAVIGPNANDGLALIGNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVA 290
Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
+ AA A+ ++D + GL E+EG+DR LLLPG Q LI VADAAK PV LV
Sbjct: 291 ATDQAAAVAS-SSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILV 349
Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK 571
+++ G VD+ FA+ NPKI +ILW GYPG+ GG AIA V+FG +NPGGRLP+TWY + K
Sbjct: 350 LLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTK 409
Query: 572 IPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
+P T M +R P +PGR+Y+F+ G VY FGYGLSY+ + ++ S K + +
Sbjct: 410 VPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGYGLSYSSYSRQLVSGGKPAESYTNLLA 469
Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
R + G + D C+ KF +EV+N G MDG V++Y + P G
Sbjct: 470 SLRTTTTSEGDESYHIEEIGTDG--CEQLKFPAVVEVQNHGPMDGKHSVLMYLRWPNAKG 527
Query: 690 TH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
Q+IG+ + G+ A + F ++ C+ V ++ G+H ++V
Sbjct: 528 GRPTTQLIGFRSQHLKVGEKANIRFDISPCEHFSRVRKDGKKVIDRGSHYLMV 580
>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 290/736 (39%), Positives = 418/736 (56%), Gaps = 43/736 (5%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
C+ LP +RAK +V+ T+ E + MG+ + GV RLGLP Y+WWSEALHG I R
Sbjct: 61 CNPHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHG---IARSNF 117
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
+ G E ATSFP IL +FN L+K++G + TEARA N+G AGL F+SPN
Sbjct: 118 TASG-----EYSHATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRAGLDFYSPN 172
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN RD RWGR E E P +VG YA+NYV+GLQ G++ +++ D+ L+++A CKH+
Sbjct: 173 INPFRDARWGRGQEVASESPVLVGNYALNYVQGLQG--GLDSNQNDDT--LQVAATCKHF 228
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
YD+++W + R +++ +++QD+ + ++ F+ CV + + MCSYN VNG+P CA
Sbjct: 229 VGYDMESWNQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMCSYNAVNGVPACAS 288
Query: 255 PKLLNQTIRGDWNFH-GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LN +R ++F G I SDCD+I + H + D A A +KAG+D++CGD Y
Sbjct: 289 EFFLNTVLRDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLG-GAAADAIKAGVDVNCGDTY 347
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIEL 370
N A+ I E I TS+ Y L+RLGYFD SPQ Y+ N++ PQ +L
Sbjct: 348 QNNLGYALGNKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYDWNDVSTPQANQL 406
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A +AA +GI LLKND G LP N ++ +A++GP ANAT M+G+Y GTP SP+ G
Sbjct: 407 AYQAAVEGIALLKND-GTLPFNKQKVRKVAVIGPWANATTQMLGDYAGTPPYMISPLQGA 465
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ + YA G I + S AA++AAK ADA V G+D SVE E DR L P
Sbjct: 466 QSEGFQVEYALGT-QINTTDTSGYTAALNAAKGADAIVYFGGIDNSVENEALDRESLAWP 524
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q +L++K++ K P+ ++ G +D KNN + +I++ GYPG+ GG AI D++
Sbjct: 525 GNQLDLVSKLS-GLKKPLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYPGQSGGTAIWDIL 583
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
GKY P GRL T Y A+Y ++P T M LRP +PGRT+ +++G VY FGYGL YT
Sbjct: 584 SGKYAPAGRLTTTQYPASYADQVPMTDMTLRPRQGYPGRTFMWYNGEPVYEFGYGLHYTT 643
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F +A++P+ Q +I V K + +D TF + ++N
Sbjct: 644 FSASLANAPRG-------GHQSFNIEQVVAAAK---RSQYVDTGLIT----TFDVNIKNT 689
Query: 670 GKMDGSEVVMVYSKPPGIAGTHIKQV-IGYERVF-IAAG--QSAKVGFTMNACKSLKIVD 725
GK ++YSK G H ++ + ++++ I AG Q+AK+ T+ SL D
Sbjct: 690 GKTTSDYAALLYSKTTAGPGPHPNKILVSFDKLHQIHAGQTQTAKLPVTIG---SLLQTD 746
Query: 726 NAANSLLASGAHTILV 741
N L G +T V
Sbjct: 747 TNGNKWLYPGTYTFFV 762
>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
Length = 763
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 286/744 (38%), Positives = 411/744 (55%), Gaps = 50/744 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L D C+ + +RAK L++ T E V + + GVPRLGLP Y+WWSEALHGV+
Sbjct: 31 LKDNLVCNTTANFMDRAKALIDEFTTEELVNNTVNGSPGVPRLGLPPYQWWSEALHGVA- 89
Query: 69 IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
+ PG HF + ATSFP IL A+F++ L ++ +STEARA N G
Sbjct: 90 -----GANPGVHFAPAGEDFDHATSFPQPILMGAAFDDELIHEVATVISTEARAFNNFGF 144
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+G+ F++PNIN RDPRWGR ETPGEDP + RY V LQ G +
Sbjct: 145 SGIDFFTPNINPFRDPRWGRGQETPGEDPLHISRYVFQLVTALQGGLGPSPY-------Y 197
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
KI A CKH+A YDL++WEG DRFHFD+ +T QD+ E + F+ CV + V SVMCSYN
Sbjct: 198 KIVADCKHFAGYDLESWEGIDRFHFDAVITTQDLAEFYTPSFQSCVRDAKVGSVMCSYNS 257
Query: 246 VNGIPTCADPKLLNQTIRGDWNF-HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
VNG+P CA LL +R + G+I SDCD++Q + +H F T+ +A A LKAG
Sbjct: 258 VNGVPACASSYLLQDIVRDFYGLGDGWITSDCDAVQNVFTTHNFTT-TQANASAISLKAG 316
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNN 361
D+DCG+ Y A+ QG + E D+ +L LY L+R GYFD SP+ ++ LG +
Sbjct: 317 TDVDCGNVYAQSLGDALDQGLVEEDDLKQALVRLYGSLVRTGYFD-SPEEQPFRQLGWAD 375
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ P LA AA +GIVLLKND G LPL++ ++ + +VGP NAT M GNY G
Sbjct: 376 VDTPASRRLALLAAEEGIVLLKND-GLLPLSSRDVPNVIMVGPWGNATTMMQGNYFGNAP 434
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
SP GF + + G + S A+ AA + D V V G D VE E
Sbjct: 435 YLVSPRQGFVDAGFNVTFFNGTVGTNGTDTSGFDEAVAAAGDTDLIVFVGGPDNVVERES 494
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
+DR+++ PG Q +LI ++A K P+ ++ M AG VD + K + I +++W GYPG+
Sbjct: 495 RDRINITWPGVQLDLIKELAGVGK-PMIVLQMGAGQVDDTWLKESDAINALIWGGYPGQS 553
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
GG A+A+++ GK P RLPIT Y +Y+ +P T M +RP N+ PGRTYK+F G ++ F
Sbjct: 554 GGTALANIVTGKTAPAARLPITQYPEDYISLPMTDMNVRPSNSSPGRTYKWFTGEPIFEF 613
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL Y++F + A P + ++ +G A V + T
Sbjct: 614 GFGLHYSKFDFAWAEEPPA--------------SFAIGD----LVANASSPVDLATFH-T 654
Query: 662 FQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF---IAAGQSAKVGFTMNA 717
FQ+ V N+G + V M++ + G + +K+++GY R+ + A +A V T+
Sbjct: 655 FQVNVTNLGPVASDFVAMLFGNTTAGPSPAPLKELVGYTRLTNIPVGATVTASVPVTLG- 713
Query: 718 CKSLKIVDNAANSLLASGAHTILV 741
++ D NS+L G +++ +
Sbjct: 714 --TIARADEDGNSVLFPGQYSVWL 735
>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
FP-91666 SS1]
Length = 772
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 302/745 (40%), Positives = 414/745 (55%), Gaps = 44/745 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV-S 67
L D C+ + +RA L+E L + V + + GV RLGLP Y+WW+EALHGV S
Sbjct: 33 LRDNLVCNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNEALHGVGS 92
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
G S P +F S ATSFP IL A+FN+SL I +STEARA N AG
Sbjct: 93 SPGVNWGSGPDANFTS----ATSFPAPILLGATFNDSLIASIADVISTEARAFNNFNYAG 148
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
LTF++PNIN RDPRWGR ETPGEDPY + RY YV GLQ + + K+
Sbjct: 149 LTFFTPNINPFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQGGLSPDPY-------YKV 201
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
A CKH AYD++NWEGNDR F++ VT QD+ E + F+ C+ + +S MCSYN VN
Sbjct: 202 LANCKHVLAYDVENWEGNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGASAMCSYNAVN 261
Query: 248 GIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
G+P+CA +L +R W G+I DC ++Q I + H + DT +A A + AG
Sbjct: 262 GVPSCASSYILKDLVRDFWGLGEREGWITGDCGAVQNIYQPHGY-TDTLVNATAVAMDAG 320
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
DLDCGD Y+ AV +G I I T+L LY L+RLGYFD + Q Y++ +N+
Sbjct: 321 TDLDCGDVYSPNLWTAVVEGLITAGQIQTALIRLYGSLIRLGYFDPAEQQPYRSFDWSNV 380
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
P +LA AA QGIVLL+ND G LPL+T N+K +AL+GP ANAT ++ GNY G
Sbjct: 381 NTPSSQDLAYNAAVQGIVLLEND-GLLPLST-NVKNIALIGPMANATLSLQGNYAGIAPF 438
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
SP F + +A G I +NS A++AA+ AD V V G+D S+EAEG+
Sbjct: 439 VISPQQAFETAGYNVTFAFGTG-ISNSDNSGYSEALEAAQGADVVVFVGGIDNSIEAEGQ 497
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DR + PG Q +LI ++ + K P+ +V M G D + K N + ++LW GYPG+ G
Sbjct: 498 DRTSIEWPGSQLDLIGQLGELGK-PLVVVRMGGGQCDDSTLKANATVNALLWAGYPGQSG 556
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP-VNNFPGRTYKFFDGPVVYP 600
G A+ D+I GK +P GRLP+T Y ++YV +I T M +RP + PGRTYK++ G +YP
Sbjct: 557 GTALVDIISGKQSPSGRLPVTQYPSSYVSEIDMTDMAIRPNSSGSPGRTYKWYTGAPIYP 616
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYG+ YT F+ + S + +DI NK A D + D
Sbjct: 617 FGYGIHYTTFRLAWSDSSSTT-------YNIQDI--VSSANKSGGFA----DTEILD--- 660
Query: 661 TFQIEVENMGKMDGSEVV--MVYSKPPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNA 717
TF + V N G S+ V + + G + +++++GY RV I G +A +
Sbjct: 661 TFSLLVTNTGSNYTSDYVALLFANSTSGPSPAPLQELVGYTRVPHITPGGTATAELNV-T 719
Query: 718 CKSLKIVDNAANSLLASGAHTILVG 742
S+ VD N +L G + + VG
Sbjct: 720 LGSISRVDENGNWILYPGTYNLWVG 744
>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 764
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 299/741 (40%), Positives = 414/741 (55%), Gaps = 42/741 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L D C+ RA LV+ +TL E V + + GVPRLGLP Y WWSEALHGV+
Sbjct: 32 LKDNLVCNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSEALHGVAL 91
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
S PG+ F S ATSFP I+ A+F++ L I +STEARA N G AGL
Sbjct: 92 SPGTNFSVPGSPFSS----ATSFPQPIILGATFDDDLVTSIATVISTEARAFNNAGRAGL 147
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
F++PNIN +DPRWGR ETPGEDP+ + +Y V GLQ + + K+
Sbjct: 148 DFFTPNINPFKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQGGLSPDPY-------YKVI 200
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKH+A YDL+NWEGN R F++ ++ QD+ E + F+ CV + V SVMCSYN VNG
Sbjct: 201 ADCKHFAGYDLENWEGNSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSVMCSYNAVNG 260
Query: 249 IPTCADPKLLNQTIRGDWNF-HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
IP+CA+ LL IRG + G+I SDCD++ I H++ T +A A LKAG D+
Sbjct: 261 IPSCANSYLLQDIIRGHFGLGDGWITSDCDAVANIFSPHQYTT-TLVNASAVALKAGTDV 319
Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNP 365
DCG Y+ + AV Q + E DI S+ LY L+RLGYFD + ++ LG +++ P
Sbjct: 320 DCGTTYSQTLVDAVDQNLVTEDDIKNSMIRLYRSLVRLGYFDSPAEQPFRQLGWSDVNTP 379
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
LA AA +G+ LLKND G LPL++ IK +ALVGP ANAT M GNY+G S
Sbjct: 380 SSQALALTAAEEGVTLLKND-GTLPLSSA-IKRIALVGPWANATTQMQGNYQGIAPFLVS 437
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ + +A G A I ++S AA+ A + ADA + G+D ++E+EG DR
Sbjct: 438 PLQALQDAGFQVTFANGTA-INSTDDSGFAAAVSAVQVADAVIYAGGIDETIESEGNDRE 496
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+ PG Q +L++++A K P ++ M G VD + K+N + +++W GYPG+ GG A
Sbjct: 497 IITWPGNQLDLVSQLAAVGK-PFVVLQMGGGQVDSSSLKSNKAVNALIWGGYPGQSGGAA 555
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
I +++ GK P GRLPIT Y A+YV +IP T M LRP PGRTYK+F G ++ FG+G
Sbjct: 556 IVNILTGKIAPAGRLPITQYPADYVNEIPMTDMALRPNGTSPGRTYKWFTGTPIFGFGFG 615
Query: 605 LSYTQFKYKVASSPKS--VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
L YT F A +P S L + +++ TN P FTF
Sbjct: 616 LHYTTFSLDWAPTPPSSFAISTLVSEANTAGVSF---TNLAPL--------------FTF 658
Query: 663 QIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKS 720
++ V+N GK+ V +++S G +KQ++ Y RV IA GQ+ + S
Sbjct: 659 RVNVKNTGKVGSDYVALLFSNTTAGPQPAPLKQLVSYTRVKGIAPGQTETAELKVT-LGS 717
Query: 721 LKIVDNAANSLLASGAHTILV 741
+ +D +S L G + I V
Sbjct: 718 IARIDENGDSALYPGRYNIWV 738
>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
Length = 785
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 289/746 (38%), Positives = 403/746 (54%), Gaps = 66/746 (8%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD RA L+ T+ EK+ G A GVPRLGLP Y WW EALHGV+
Sbjct: 39 CDFTADPLTRATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQEALHGVA------- 91
Query: 75 SPPGTHFD--SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PG +F + ATSFP IL A+F++ L K + +STEARA N +GL +W+
Sbjct: 92 QSPGVNFSDSGDFRYATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDARSGLDYWT 151
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN +D RWGR ETPGEDPY + Y + + GLQ D + K+ A CK
Sbjct: 152 PNINPFKDSRWGRGQETPGEDPYHLSSYVKSLIAGLQ----------GDGKYKKVVATCK 201
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+ AYDL+ W GN R+ FD V Q++ E ++ PF+ C + +V + MCSYN +NGIPTC
Sbjct: 202 HFVAYDLETWNGNFRYQFDPHVGSQELVEYYMPPFQACARDANVGAFMCSYNSLNGIPTC 261
Query: 253 ADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
ADP LL +R WN+ ++ SDCDSIQ + H++ + T+E+AVA LKAG D++C
Sbjct: 262 ADPYLLQTILREHWNWTSEEQWVTSDCDSIQNVYLPHEYTS-TREEAVAVSLKAGTDVNC 320
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-QYKNLGKNNICNPQHI 368
G YY F GA+ G + E DID +L Y L+RLGYFDG+ +Y++L ++ P
Sbjct: 321 GTYYQEFLPGALSLGLVTEKDIDMALIRQYSSLVRLGYFDGTAVEYRSLSWKDVSTPYAQ 380
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+LA +AA +GI LLKND G LPL +A++G ANAT+ M+GNY+G P SP+
Sbjct: 381 QLALKAAVEGITLLKND-GILPLAITKDTKIAVIGDWANATEQMLGNYDGIPPYLHSPLW 439
Query: 429 GFYAYSKVINYA---PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
+ Y+ G D N I A+D AD + G+D VEAEG DRV
Sbjct: 440 AAQQTGANVTYSGNPGGQGDPTTNNWLHIWTAVD---EADVILFAGGIDNGVEAEGMDRV 496
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+ G Q ++I ++A K PV + M VD NN I ++LW GYPG++GG A
Sbjct: 497 SIAWTGAQLDVIGQLASRGK-PVIVAQMGTNGVDSTPLLNNQNISALLWGGYPGQDGGVA 555
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
+ D+I GK P GRLP T Y A+Y+ K+P T M LRP FPGRTY +++ V+ FG
Sbjct: 556 LLDIIQGKSAPAGRLPTTQYPASYISKVPMTDMHLRPNSTTGFPGRTYMWYNEKPVFEFG 615
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
YGL YT F ++ + + ++++ C +D D K
Sbjct: 616 YGLHYTNFSATISPTDTT--------------SFSIADLTKDCTEHYMDRCPFADMK--- 658
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVF-IAAGQSAKVGFTM 715
I V N G + V + + +AG H K+++ Y+R+ I AG S +
Sbjct: 659 -IAVTNTGNVTSDYVTLGF-----LAGEHGPAPCPNKRLVNYQRLHNITAGASQTTSLNL 712
Query: 716 NACKSLKIVDNAANSLLASGAHTILV 741
SL VD+ N++L G++ +L+
Sbjct: 713 T-LASLARVDDMGNTVLYPGSYALLI 737
>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
SS1]
Length = 770
Score = 484 bits (1247), Expect = e-134, Method: Compositional matrix adjust.
Identities = 296/752 (39%), Positives = 417/752 (55%), Gaps = 46/752 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
C+ + +RAK LV MTL E V + + GVPRLGLP YEWWSEALHGV+
Sbjct: 36 CNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSEALHGVA------- 88
Query: 75 SPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
S PG F++ + GATSFP IL +A+F++ L + T+STEARA N ++GL F++
Sbjct: 89 SSPGVTFETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNHSGLDFFT 148
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN +DPRWGR ETPGEDP RY + GLQ G S KI A CK
Sbjct: 149 PNINPFKDPRWGRGQETPGEDPLHTSRYVYQLITGLQGGVG-------PSPYYKIIADCK 201
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+AAYDL+NWEGN+R F++ V+ QD+ E + F+ CV + V SVMCSYN VNG+P C
Sbjct: 202 HFAAYDLENWEGNNRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVMCSYNAVNGVPAC 261
Query: 253 ADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
P LL +R + +I SDCD++ I + H + T +A A L AG D+DCG
Sbjct: 262 GSPYLLQDLVRDYFELGNDTWITSDCDAVGNIFDPHNYTT-TLTNASAVALLAGTDVDCG 320
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHI 368
Y+ AV +G ++++D++ +L LY L+RLGYFD S Y+ LG +++ P
Sbjct: 321 TSYSETLGEAVSEGLVSKSDVERALVRLYGSLVRLGYFDPEDSVPYRALGASDVNTPAAQ 380
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
LA AA +GIVLLKND G LPL++ N+ +AL+GP ANAT M GNYEG SP+D
Sbjct: 381 TLAYTAAVEGIVLLKND-GLLPLSS-NVSHIALIGPWANATTQMQGNYEGIAPLLISPLD 438
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
GF + +++ G I + S A+ A AD V + G+D +VEAEG+DR +
Sbjct: 439 GFTSAGFNVSFTNGTT-ISGNSTSGFADALSMASAADVIVYIGGIDDTVEAEGQDRTSIT 497
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
PG Q ELI ++ K P ++ M G VD K N + ++LW GYPG+ GG+A+AD
Sbjct: 498 WPGNQLELIGELGAFGK-PFVVIQMGGGQVDDTELKANSSVNALLWGGYPGQAGGKALAD 556
Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF--PGRTYKFFDGPVVYPFGYGL 605
+I G P GRL T Y A+YV ++ T M +RP N+ PGRTYK++ G V+ FG+GL
Sbjct: 557 IITGVQAPAGRLTTTQYPASYVDQVAMTDMSVRPSNSTGSPGRTYKWYTGTPVFEFGFGL 616
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
YT F + A + + Q + + + ++D TF ++
Sbjct: 617 HYTTFDVEWAEGSPAASYSI---QDLVASANSSSSAVAHVDSAILD---------TFTVQ 664
Query: 666 VENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKSLKI 723
V N G + V +++S G + +++++ Y RV I G SA + ++
Sbjct: 665 VTNTGNVTSDYVALLFSNTTAGPSPAPLQELVSYARVKGITPGVSATASLNVT-LGTIAR 723
Query: 724 VDNAANSLLASGAHTILV---GEGVGGVSFPL 752
VD NS++ G + + V G+ SF L
Sbjct: 724 VDEDGNSIIYPGVYNLWVDTTGQAKAVTSFEL 755
>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
Length = 789
Score = 483 bits (1242), Expect = e-133, Method: Compositional matrix adjust.
Identities = 291/749 (38%), Positives = 404/749 (53%), Gaps = 68/749 (9%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
C+ +RAK LV TL EK+ + GVPRLG+P Y+WWSE LHG++
Sbjct: 35 CNTSASPLDRAKSLVTLYTLEEKINATSSGSPGVPRLGIPPYQWWSEGLHGIA------- 87
Query: 75 SPPGTHFDS---EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
P T+F + E +TSFP IL A+F++ L + + +STEARA N GL FW
Sbjct: 88 -GPYTNFSTSGIEYSYSTSFPQPILMGAAFDDHLITDVAKVISTEARAFNNANRTGLDFW 146
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
+PNIN RDPRWGR ETPGED + + Y + GLQ Y R + A C
Sbjct: 147 TPNINPFRDPRWGRGQETPGEDAFHLSSYVKALIAGLQGETTDPYKR--------VVATC 198
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+A YD+++W GN R+ FD+++++QD+ E ++ PF+ CV + +V + MCSYN VNG+PT
Sbjct: 199 KHFAGYDIEDWNGNLRYQFDAQISQQDLVEYYLQPFQACV-QANVGAFMCSYNAVNGVPT 257
Query: 252 CADPKLLNQTIRGDW---NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
CADP LL +R W N ++ SDCD++Q I H++ + T+E AVA L AG DLD
Sbjct: 258 CADPYLLQTILREHWGWTNEEQWVTSDCDAVQNIYLPHQW-SATREQAVADALIAGTDLD 316
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQ 366
CG Y GA QG + E +D +L Y L+RLG+FD + Y+ G +++
Sbjct: 317 CGTYMQEHLPGAFAQGLVNENVLDQALVRQYSSLVRLGWFDDAADQPYRQFGWDSVATDA 376
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
LA AA +GIVLLKND G LPL+ + +L + G ANAT ++GNY G P SP
Sbjct: 377 SQALARRAAVEGIVLLKND-GVLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLHSP 435
Query: 427 MDGFYAYSKVINYA----PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
+ + INYA G D S + AI +D + + G+D S+E EG
Sbjct: 436 LWALQQENLTINYAGGNPGGQGDPTTNRWSSLSGAI---ATSDILIYIGGIDNSIEEEGH 492
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DR L G Q ++I ++A K P +V+M G +D NN I +ILW GYPG++G
Sbjct: 493 DRTSLAWTGAQLDVIFQLAATGK-PTIVVVMGGGQIDSAPLANNANISAILWAGYPGQDG 551
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G AI D++ GK P GRLP T Y A+Y +P T M LRP N PGRTYK+++G Y F
Sbjct: 552 GPAIVDILTGKSPPAGRLPQTQYPASYTSLVPMTDMGLRPSENNPGRTYKWYNGTATYEF 611
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT F V S + D C++ CA +D
Sbjct: 612 GHGLHYTNFSATVTSPMQQSYRIADLMSTCKN---ATSITLERCAFTSVD---------- 658
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVF-IAAGQS--AKVG 712
I V N G + V + Y I+G+H K ++GY+R+F IAAG S A++
Sbjct: 659 --ISVTNTGAVASDYVTLCY-----ISGSHGPAPHPKKSLVGYQRLFGIAAGASDTARID 711
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILV 741
T+ +SL VD N +L G ++++V
Sbjct: 712 LTL---ESLARVDEVGNKVLYPGEYSLMV 737
>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 765
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 286/703 (40%), Positives = 393/703 (55%), Gaps = 57/703 (8%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L RA+ L+ MTL EK+ + GVPRLGLP Y WWSEALHGV+
Sbjct: 37 LSKNAVCDTTLDPVTRAQALLAAMTLEEKINNTQYNSPGVPRLGLPAYNWWSEALHGVA- 95
Query: 69 IGRRTNSPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
PG HF ATSFP+ I A+F++ L K+I + TE RA N G+A
Sbjct: 96 ------GSPGVHFADSGNFSYATSFPSPITLGAAFDDDLVKQIATVIGTEGRAFGNAGHA 149
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
GL +W+PNIN RDPRWGR ETPGEDP+ RY + + GLQD G E +P K
Sbjct: 150 GLDYWTPNINPYRDPRWGRGQETPGEDPFHTSRYVYHLIDGLQDGIGPE-------KP-K 201
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
I A CKH+A YD+++WEGN+R+ FD+ +++QDM E + PF+ C + V +VMCSYN V
Sbjct: 202 IVATCKHFAGYDIEDWEGNERYAFDAVISDQDMAEYYFPPFKTCTRDAKVDAVMCSYNSV 261
Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NGIPTCADP LL +R W + G ++ SDC +I I + HK++ A A + A
Sbjct: 262 NGIPTCADPWLLQTVLREHWEWEGVGHWVTSDCGAIDNIYKDHKYVA-DGAHAAAVAVNA 320
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
G DLDCG Y F A+ QG + +D +L LY L++LGYFD + Y+++G ++
Sbjct: 321 GTDLDCGSVYPQFLGSAISQGLLGNRTLDRALTRLYSSLVKLGYFDPAADQPYRSIGWSD 380
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ P +LA AA +G VLLKND G LPL T+A+VGP+ANAT + GNYEGT
Sbjct: 381 VATPDAEQLAHTAAVEGTVLLKND-GTLPLKKNG--TVAIVGPYANATTQLQGNYEGTAK 437
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ + + YAPG I + S A++AAK +D + G+D VEAE
Sbjct: 438 YIHTMLSAAAQQGYKVKYAPGTG-INSNSTSGFEQALNAAKGSDLVIYFGGIDHEVEAEA 496
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR + PG Q +LI +++D K P+ +V G VD + +N + +LW GYP +
Sbjct: 497 LDRTSIAWPGNQLDLIQQLSDLKK-PLVVVQFGGGQVDDSSLLSNAGVNGLLWAGYPSQA 555
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
GG A+ D++ GK P GRLP+T Y YV ++P T M LRP + PGRTY+++D V+ P
Sbjct: 556 GGAAVFDILTGKTAPAGRLPVTQYPEEYVDQVPMTDMNLRPGPSNPGRTYRWYDKAVI-P 614
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYG+ YT F D + NY AAV ++ +
Sbjct: 615 FGYGMHYTTF-----------------DVSWKRKNY----GPYNTAAVKAENAVLE---- 649
Query: 661 TFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERV 701
TF ++V+N GK+ V +V+ + G IK ++GY+RV
Sbjct: 650 TFSLQVKNTGKVTSDYVALVFLTTTDAGPKPYPIKTLVGYQRV 692
>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
Length = 401
Score = 479 bits (1234), Expect = e-132, Method: Compositional matrix adjust.
Identities = 240/423 (56%), Positives = 303/423 (71%), Gaps = 38/423 (8%)
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
QGK E D+DTSLR LYIVL ++G+FDG P Y++L K ++C +HIELAA+AARQGIVLL
Sbjct: 2 QGKAREEDVDTSLRNLYIVLTQVGFFDGIPSYESLDKKDLCTKEHIELAADAARQGIVLL 61
Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPG 442
KN N LPL+ +K LAL+GPHANAT M+GNY G PC+Y+SP+DGF AY KV Y G
Sbjct: 62 KNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAYGKV-TYEMG 120
Query: 443 CADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVAD 502
C ++ C N + I A++A+KNADAT+++ GLD +VE EG DR DLLLPG+QTELI +V
Sbjct: 121 CNNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQVIV 180
Query: 503 AAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPI 562
A+KGP+ LVIMS AVDI+F+K + ++K+ILW GYPGEEGGRAIADV++GKYNPGGRLP+
Sbjct: 181 ASKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPGGRLPL 240
Query: 563 TWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
TW++ +Y+ +P TSM LRPVNN+PGRTYKFF+G VVYPFG+GLSYT+F Y + SS
Sbjct: 241 TWHQNDYLSMLPMTSMSLRPVNNYPGRTYKFFNGSVVYPFGHGLSYTKFNYTLRSS---- 296
Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
++ CKD+ F IEV+N+G G+EVV+VY
Sbjct: 297 ------------------------------NMSCKDH-FELDIEVKNIGAKHGNEVVLVY 325
Query: 682 SKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
SKPP GI GTH KQVIG++RVF+ AG S V F N CKSL IV A LL SG H I+
Sbjct: 326 SKPPTGIVGTHAKQVIGFKRVFVPAGGSQNVKFEFNVCKSLGIVGYNAYKLLPSGEHKII 385
Query: 741 VGE 743
+G+
Sbjct: 386 IGD 388
>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
gi|223975771|gb|ACN32073.1| unknown [Zea mays]
Length = 507
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 240/510 (47%), Positives = 332/510 (65%), Gaps = 18/510 (3%)
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MCSYN+VNG PTCAD LL+ IRGDW +GYI SDCDS+ + + + T EDA A
Sbjct: 1 MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPEDAAAI 59
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKN 356
+KAGLDL+CG + T+ AVQ GK++E+D+D ++ + LMRLG+FDG P+ + N
Sbjct: 60 SIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGN 119
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
LG +++C P + ELA EAARQGIVLLKN G LPL+ +IK++A++GP+ANA+ MIGNY
Sbjct: 120 LGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNY 178
Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDL 475
EGTPC+YT+P+ G A + Y PGC ++ C NS+ + AA AA +AD TV+V G D
Sbjct: 179 EGTPCKYTTPLQGLGANVATV-YQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQ 237
Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
S+E E DR LLLPG Q +L++ VA+A+ GP LV+MS G DI+FAK++ KI +ILWV
Sbjct: 238 SIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWV 297
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFF 593
GYPGE GG AIADV+FG +NP GRLP+TWY ++ K+P T M +R P +PGRTY+F+
Sbjct: 298 GYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFY 357
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
G VY FG GLSYT F + + S+PK + ++L + C C +V +
Sbjct: 358 TGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACL---------TEQCPSVEAEGA 408
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
C+ F + V N G+ G V ++S PP + K ++G+E+V + GQ+ V F
Sbjct: 409 HCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAF 468
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGE 743
++ CK L +VD N +A G+HT+ VG+
Sbjct: 469 KVDVCKDLSVVDELGNRKVALGSHTLHVGD 498
>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
Length = 523
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 247/520 (47%), Positives = 335/520 (64%), Gaps = 15/520 (2%)
Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
V +G+V+SVMCSYN+VNG PTCADP LL+ IRG+W +GYIVSDCDS+ + ++ +
Sbjct: 3 VVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTK 62
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
E A +L AGLDL+CG + T AV+ G + EA ID ++ ++ LMRLG+FDG
Sbjct: 63 TPAEAAAISIL-AGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDG 121
Query: 351 SPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
+P+ Y LG ++C + ELAA+AARQGIVLLKN G LPL+ +IKTLA++GP+AN
Sbjct: 122 NPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKN-TGCLPLSPKSIKTLAVIGPNAN 180
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADAT 467
TK MIGNYEGTPC+YT+P+ G A + Y PGC+++ C + A A AD +
Sbjct: 181 VTKTMIGNYEGTPCKYTTPLQGL-AGTVSTTYLPGCSNVACAVAD-VAGATKLAATADVS 238
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
V+V G D S+EAE +DRVDL LPG Q EL+ +VA AAKGPV LVIMS G DI FAKN+P
Sbjct: 239 VLVIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDP 298
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNN 584
KI ILWVGYPGE GG AIAD+IFG+YNP G+LP+TWY +YV K+P T M +RP +
Sbjct: 299 KIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKASG 358
Query: 585 FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN-YTVGTNKP 643
+PGRTY+F+ G VY FG GLSYT+F + + +P V + L+++ CR ++ P
Sbjct: 359 YPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEENHVCRSSECQSLDAIGP 418
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFI 703
C + V F I+V N G +G V +++ PP I G+ K ++G+E++ +
Sbjct: 419 HCE----NAVSGGGSAFEVHIKVRNGGDREGIHTVFLFTTPPAIHGSPRKHLVGFEKIRL 474
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+ A V F + CK L +VD + G H + VG+
Sbjct: 475 GKREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGD 514
>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 288/741 (38%), Positives = 412/741 (55%), Gaps = 41/741 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L D C+ LP +RAK +V+ T+ E + MG+ + GV RLGLP Y+WWSE LHG
Sbjct: 55 LKDNDVCNPYLPNNQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEGLHG--- 111
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
I R + G E ATSFP IL +FN L+K++G + TEARA N+G AGL
Sbjct: 112 IARSNFTASG-----EYSHATSFPQPILMGGAFNSDLYKQVGNVIGTEARAFNNVGRAGL 166
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
++SPNIN +DPRWGR E E P +VG YA+NYV+GLQ G++ + + D+ L+++
Sbjct: 167 DYYSPNINPFKDPRWGRGQEVASESPVLVGNYALNYVQGLQG--GIDSNPNDDT--LQVA 222
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKH+A YD+++W+ + R +++ +++QD+ + + F+ CV + + MCSYN +NG
Sbjct: 223 ATCKHFAGYDMESWKQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAMCSYNAING 282
Query: 249 IPTCADPKLLNQTIRGDWNFH-GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
IP CA L IR ++F G I SDCDS+ +I H ++ D A A +KAG+D+
Sbjct: 283 IPVCASEFFLGTVIREGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGA-AAADGIKAGVDV 341
Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICN 364
+CGD Y N A+ I E I S+ Y L+RLGYFD SPQ Y+ +++
Sbjct: 342 NCGDTYQNNLGYALGNKTINEDQIRASVTRQYSNLIRLGYFD-SPQTNKYRTYNWSDVST 400
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
Q +LA +AA +GI LLKND G LP N +K +A++GP ANAT M+G+Y GTP
Sbjct: 401 SQANQLAYQAAVEGITLLKND-GTLPFNKDKVKNVAVIGPWANATTDMLGDYAGTPPYLI 459
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
SP+ G + YA G I + AA++AAK ADA V G+D S+E E DR
Sbjct: 460 SPLQGAQDSGFKVQYAYGT-QINTTLTTNYTAALNAAKGADAIVYFGGIDNSIENEALDR 518
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
L PG Q +L++K++ K P+ +V AG VD KNN + SI++ GYPG+ GG
Sbjct: 519 ESLAWPGNQLDLVSKLSGLNK-PLVVVQFGAGQVDDTEIKNNNNVNSIVYAGYPGQSGGT 577
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AI DV+ G Y P GRL T Y A+Y ++P T M LRP + +PGRT+ +++G VY FGY
Sbjct: 578 AIWDVLNGIYAPAGRLSTTQYPASYADQVPMTDMTLRPRDGYPGRTFMWYNGEPVYEFGY 637
Query: 604 GLSYTQFKYKVASS-PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GL YT F +A++ PK + DQ + + LI TF
Sbjct: 638 GLHYTTFSVSLANAPPKGAPQSFNIDQ------FIAAKSSQYVDTSLIT---------TF 682
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQV-IGYERVF-IAAGQSAKVGFTMNACKS 720
+ ++N GK+ ++YS G H ++ + ++++ I GQ + S
Sbjct: 683 DVNIKNTGKVTSDYAALLYSNTTSGPGPHPNKILVSFDKLHQIHPGQIQTASLPV-TIGS 741
Query: 721 LKIVDNAANSLLASGAHTILV 741
L D N L GA+T V
Sbjct: 742 LLQTDTNGNKWLYPGAYTFFV 762
>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
Length = 805
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 288/754 (38%), Positives = 409/754 (54%), Gaps = 57/754 (7%)
Query: 3 ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
+ + LS+ CD RAK LV T+ EK+ G+ + GVPRLGLP+Y+WW EA
Sbjct: 32 DCVNGPLSNNTVCDKSADPVARAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQEA 91
Query: 63 LHGVSFIGRRTNSPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
LHGV+ S PG F++ + ATSFP IL A+F+++L + + + VSTEARA
Sbjct: 92 LHGVA-------SSPGVTFNATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAF 144
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
N G AGL FW+PNIN RDPRWGR ETPGEDPY + Y + + GLQ E
Sbjct: 145 NNYGRAGLDFWTPNINPYRDPRWGRGQETPGEDPYHLSSYVHSLIMGLQGGE-------- 196
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
D KI+A CKH+A YD+++W GN R+ D ++ ++D+ E ++ F C + +V + M
Sbjct: 197 DPEIRKITATCKHFAGYDIESWNGNLRYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFM 256
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDW---NFHGYIVSDCDSIQTIVESHKFLNDTKEDAV 297
C+Y+ +NG+PTCADP LLN +R W N ++ SDCDSIQ I H F +DT++ A
Sbjct: 257 CTYSALNGVPTCADPWLLNDVLREHWGWTNEEQWVTSDCDSIQNIFLPHNF-SDTRQGAA 315
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKN 356
A L AG DLDCG YY + A QG I + +D +L LY L+R GYFDG + Y+N
Sbjct: 316 AAALNAGTDLDCGTYYQHHLPLAYSQGLINQTTVDQALVRLYTSLVRTGYFDGPNAMYRN 375
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
L +++ +LA +AA +G+VLLKND G LPL+ N +AL+G ANAT M GNY
Sbjct: 376 LTWSDVGTTHAQQLALQAAEEGMVLLKND-GLLPLSISNGTKIALIGSWANATTQMQGNY 434
Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
G P SP+ + YA G AA+ AD + + G+D+S
Sbjct: 435 YGVPTYLHSPLYAAQQTGAQVFYAQGPGGQGDPTTDHWLPVWTAAEKADIIIYIGGVDIS 494
Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
VEAEG DR D+ G Q ++I ++A K P+ L M +D NN I +++W G
Sbjct: 495 VEAEGMDREDINWTGAQLDIIGELAMYGK-PMVLAQM-GDQLDNTPIVNNANISALIWGG 552
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFF 593
YPG++GG A+ ++I GK P GRLP+T Y A+Y+ IP T M LRP PGRTYK++
Sbjct: 553 YPGQDGGVALFNIITGKTAPAGRLPVTQYPAHYIADIPMTDMTLRPNATTGSPGRTYKWY 612
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
+G V+ FGYG+ YT+F ++ KS +Y + + C D
Sbjct: 613 NGTAVFEFGYGMHYTKFSADISPMSKS--------------SYDISSLLSGCNETYKDRC 658
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVFIAAGQ 707
+ + + V N G + + + IAG K ++ Y+R+ AG
Sbjct: 659 AFE----SISVNVHNTGNVTSDYAALGF-----IAGQFGPSPYPKKSLVNYQRLHNIAGG 709
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
S++ SL VD+ N+ L G + +++
Sbjct: 710 SSQTATLNLTLGSLSRVDDHGNTYLYPGDYALMI 743
>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
Length = 522
Score = 475 bits (1223), Expect = e-131, Method: Compositional matrix adjust.
Identities = 254/527 (48%), Positives = 351/527 (66%), Gaps = 23/527 (4%)
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
+NG+P CAD +LL +T+R DW HGYIVSDCDS++ +V K+L T +A A +KAGL
Sbjct: 1 INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGL 60
Query: 306 DLDCG-------DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG 358
DLDCG D++T + + AV+QGK+ E+ +D +L LY+ LMRLG+FDG P+ ++LG
Sbjct: 61 DLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIPELESLG 120
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP--HANATKAMIGNY 416
++C +H ELAA+AARQG+VLLKND LPL+ + ++AL G H NAT M+G+Y
Sbjct: 121 AADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDVMLGDY 180
Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
G PCR +P DG KV++ A C S A AAK DAT++VAGL++S
Sbjct: 181 RGKPCRVVTPYDGV---RKVVSSTSVHA---CDKGS-CDTAAAAAKTVDATIVVAGLNMS 233
Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
VE E DR DLLLP Q IN VA+A+ P+ LVIMSAG VD++FA++NPKI +++W G
Sbjct: 234 VERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAG 293
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFF 593
YPGEEGG AIADV+FGKYNPGGRLP+TWY+ YV KIP TSM LRP + +PGRTYKF+
Sbjct: 294 YPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEHGYPGRTYKFY 353
Query: 594 DGP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP-CAAVLID 651
G V+YPFG+GLSYT F Y A++ V +K+ + C+ + Y G + PP C AV +
Sbjct: 354 GGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVSSPPACPAVNVA 413
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFIAAGQSAK 710
C++ + +F + V N G DG+ VV +Y+ PP + G KQ++ + RV +AAG + +
Sbjct: 414 SHACQE-EVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRVRVAAGAAVE 472
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
V F +N CK+ IV+ A +++ SG +LVG+ +SFP+Q++L
Sbjct: 473 VAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPVQIDLQ 519
>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
Length = 760
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 296/742 (39%), Positives = 406/742 (54%), Gaps = 57/742 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD RA L+ TL EK+ G+ + GVPRLGLP Y+WW EALHGV+
Sbjct: 34 CDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQEALHGVA------- 86
Query: 75 SPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PG F E ATSFP IL A+F++ L ++ VSTEARA N +GL FW+
Sbjct: 87 ESPGVIFAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANRSGLDFWT 146
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN +DPRWGR ETPGEDP+ + Y N + GLQ EY R I A CK
Sbjct: 147 PNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKR--------IVATCK 198
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
HYA YDL+NWEGN R+ FD+ ++ QD+ E + FE C + +V + MCSYN VNG+P+C
Sbjct: 199 HYAGYDLENWEGNVRYGFDALISIQDLSEFYTRSFETCARDANVGAFMCSYNAVNGVPSC 258
Query: 253 ADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
A+ LL +RG WN+ +I SDCD+IQ I E H + T+E VA L AG DLDC
Sbjct: 259 ANSYLLQDILRGHWNWTSDDQWITSDCDAIQNIYEPH-YYAPTRELTVADALNAGADLDC 317
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQH 367
G YY A +G AE+ +D +L Y L++LGYFD + Y+ +G N+ P+
Sbjct: 318 GTYYPENLGAAYDEGLFAESTLDRALIRQYASLVKLGYFDPAENQPYRQIGWANVSTPEA 377
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
ELA AA +GI L+KND G LPL+ +IK+LAL+GP ANAT M GNY G P SP+
Sbjct: 378 EELAYRAAVEGITLIKND-GTLPLSP-SIKSLALIGPWANATTQMQGNYYGQPPYLISPL 435
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
A + + Y+PG + S PAA AA+ ADA + + G+D +VEAE DR L
Sbjct: 436 MAAEALNYTVYYSPGPG-VDDPTTSSFPAAFAAAQAADAIIYIGGIDTTVEAEAMDRYTL 494
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
PG Q + I++++ K P+ ++ M G VD + N + +++W GYPG+ GG A+
Sbjct: 495 DWPGVQPDFIDQLSQFGK-PLVVLQMGGGQVDDSCLLPNTNVNALIWGGYPGQSGGTALM 553
Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLS 606
D+I G P GRLP T Y +YV ++ T M LRP PGRTY ++ G + FG+GL
Sbjct: 554 DIIVGNAAPAGRLPTTQYPLDYVYQVAMTDMSLRPSATNPGRTYMWYTGTPIVEFGFGLH 613
Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEV 666
YT F ++ S P + DI VG C V D+ + ++ + V
Sbjct: 614 YTNFSAEL-SQPSA---------PSYDIASLVGA----CEGVAHLDLCAFE---SYTVNV 656
Query: 667 ENMG-KMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVFIAAGQSAKVGFTMNACK 719
N+G K+ V +++ +AG H K + Y+R+ A S++
Sbjct: 657 TNIGSKVTSDYVALLF-----VAGEHGPAPIPNKVLAAYDRLHTIAPLSSQQATLNLTLG 711
Query: 720 SLKIVDNAANSLLASGAHTILV 741
SL VD N +L G +T+++
Sbjct: 712 SLSRVDEYGNRVLYPGEYTLIL 733
>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 766
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 299/755 (39%), Positives = 416/755 (55%), Gaps = 59/755 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD L RA +V+ T+ E + + GVPRLGLP Y+WWSE LHGV+
Sbjct: 37 CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVA------- 89
Query: 75 SPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PG +F + E ATSFP I+ A+F++ L K +G V E R+ N G AGL FW+
Sbjct: 90 DSPGVNFSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWT 149
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKISACC 191
PNIN +DPRWGR ETPGEDPY + +Y N V+GLQ D +P ++ + C
Sbjct: 150 PNINPFKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG--------GLDPKPYYQVISTC 201
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+AAYDL++W+GN R+ FD+ VT QD+ E ++ F+ C + V + MCSYN VNGIP+
Sbjct: 202 KHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPS 261
Query: 252 CADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
CA+ LL +R W F ++ SDCD++ I + H + T E+AVA LKAG D+DC
Sbjct: 262 CANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNY-TKTPEEAVADALKAGTDIDC 320
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS--PQYKNLGKNNICNPQH 367
G +Y+ + GA Q I E ++ +L Y L+RLGYFD + Y+ NN+ PQ
Sbjct: 321 GTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYFDPTDIQPYRQYNWNNVDTPQA 380
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
+LA +AA +GIVLLKND G LPL++ +IK +AL+GP NAT M GNY G SP+
Sbjct: 381 QQLAYQAAAEGIVLLKND-GTLPLSS-DIKNIALIGPWGNATGEMQGNYYGVAPYLISPL 438
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G A + Y G +I + S AAI AA+ AD + G+D +VE+EG DR +
Sbjct: 439 MGAVATGYNVTYVFGT-NITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYI 497
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
PG Q +L+ ++A K P+ +V G VD K N + ++LW GYPG+ GG A+
Sbjct: 498 TWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALF 556
Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLS 606
D+I GK P GRLP+T Y A+YV +IP T M LRP PGRTYK++ G +Y FGYGL
Sbjct: 557 DIISGKVAPAGRLPVTQYPADYVYEIPMTDMDLRPNATSPGRTYKWYTGTPIYDFGYGLH 616
Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEV 666
YT F YK A +P S Y + T D+ D TF + V
Sbjct: 617 YTTFSYKWAKAPSST--------------YNIQTLVQSGNLYSYLDLAPFD---TFTVNV 659
Query: 667 ENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVF-IAAGQSAKVGFTMNACK 719
N G + +++ + GT+ K +I Y R+ IA+G +A V +
Sbjct: 660 TNTGNVTSDFASLLF-----VNGTYGPSPYPNKSLITYARLHDIASGDTASVALGV-TLG 713
Query: 720 SLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
S+ D N L G + + + + +G +++ QL
Sbjct: 714 SIARADTYGNMWLYPGTYQVTL-DTLGVLTYQFQL 747
>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 732
Score = 474 bits (1219), Expect = e-130, Method: Compositional matrix adjust.
Identities = 300/754 (39%), Positives = 413/754 (54%), Gaps = 61/754 (8%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD L RA +V+ T+ E + + GVPRLGLP Y+WWSE LHGV+
Sbjct: 22 CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVA------- 74
Query: 75 SPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PG +F + E ATSFP I+ A+F++ L K +G V E R+ N G AGL FW+
Sbjct: 75 DSPGVNFSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWT 134
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKISACC 191
PNIN +DPRWGR ETPGEDPY + +Y N V+GLQ D +P ++ + C
Sbjct: 135 PNINPFKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG--------GLDPKPYYQVISTC 186
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+AAYDL++W+GN R+ FD+ VT QD+ E ++ F+ C + V + MCSYN VNGIP+
Sbjct: 187 KHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPS 246
Query: 252 CADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
CA+ LL +R W F ++ SDCD++ I + H + T E+AVA LKAG D+DC
Sbjct: 247 CANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNY-TKTPEEAVADALKAGTDIDC 305
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS--PQYKNLGKNNICNPQH 367
G +Y+ + GA Q I E ++ +L Y L+RLGYFD + Y+ NN+ PQ
Sbjct: 306 GTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYFDPTDIQPYRQYNWNNVDTPQA 365
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
+LA +AA +GIVLLKND G LPL++ +IK +AL+GP NAT M GNY G SP+
Sbjct: 366 QQLAYQAAAEGIVLLKND-GTLPLSS-DIKNIALIGPWGNATGEMQGNYYGVAPYLISPL 423
Query: 428 DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
G A + Y G +I + S AAI AA+ AD + G+D +VE+EG DR +
Sbjct: 424 MGAVATGYNVTYVFGT-NITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYI 482
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
PG Q +L+ ++A K P+ +V G VD K N + ++LW GYPG+ GG A+
Sbjct: 483 TWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALF 541
Query: 548 DVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLS 606
D+I GK P GRLP+T Y A+YV +IP T M LRP PGRTYK++ G +Y FGYGL
Sbjct: 542 DIISGKVAPAGRLPVTQYPADYVYEIPMTDMDLRPNATSPGRTYKWYTGTPIYDFGYGLH 601
Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEV 666
YT F YK A +P S Y + T D+ D TF + V
Sbjct: 602 YTTFSYKWAKAPSST--------------YNIQTLVQSGNLYSYLDLAPFD---TFTVNV 644
Query: 667 ENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYERVF-IAAGQSAKVGFTMNACK 719
N G + +++ + GT+ K +I Y R+ IA+G +A V +
Sbjct: 645 TNTGNVTSDFASLLF-----VNGTYGPSPYPNKSLITYARLHDIASGDTASVALGV-TLG 698
Query: 720 SLKIVDNAANSLLASGAHTI---LVGEGVGGVSF 750
S+ D N L G + + +G VG +F
Sbjct: 699 SIARADTYGNMWLYPGTYQVTLDTLGNSVGANTF 732
>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 786
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 289/754 (38%), Positives = 412/754 (54%), Gaps = 46/754 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L P CD+ RA+ L++ T E +Q + + GVPRLGLP YEWWSEALHGV
Sbjct: 32 LKSTPVCDSAKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEALHGVGH 91
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+P G + ATSFP I+ A+F++ L K + VSTEARA N G AGL
Sbjct: 92 SPGVVFAPSG-----DFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRAGL 146
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKI 187
+++PNIN +DPRWGR ETPGEDP+ + +Y + V GLQ G+ D P +K+
Sbjct: 147 NYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG--GI------DPWPYIKV 198
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
+A CKH+AAYDL+NWEG DRFHFD++V++QD+ E ++ PF+ CV + +SVMCSYN VN
Sbjct: 199 AADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVN 258
Query: 248 GIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G+P CA LL +R W F ++ SDC ++ I +SH F E A A LKAG
Sbjct: 259 GVPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAE-AAAISLKAGT 317
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
D+DCG + + A+ Q I+ D+ + Y L+RLGYFD S Y+ +++
Sbjct: 318 DIDCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSDSQTYRQFDWSDVN 377
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
P+ L+ AA +G+VLLKND G LPL KT+A++GP+ NAT +M GNY G
Sbjct: 378 TPEAQALSRRAAVEGLVLLKND-GLLPLAPDG-KTIAIIGPYTNATSSMQGNYFGNAPII 435
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
TSP G + A G + +++ AI+ AK AD V V G+D ++E EG D
Sbjct: 436 TSPFQGAQDVGFKVVSAAGTT-VNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLD 494
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R + PG Q +L+ +A K P+ +V G VD N K+++I+W GYPG+ GG
Sbjct: 495 RSSISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGG 553
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
AI D+I G P GRLP+T Y A+Y ++ T M LRP ++ PGRTYK++ PV+ +G
Sbjct: 554 TAIFDIIVGSTAPAGRLPVTQYPADYTHQVRMTDMSLRPSSHNPGRTYKWYKTPVL-EYG 612
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
+GL +T F + P + + D + R + D+ D TF
Sbjct: 613 HGLHFTTFDFSWQRQPAA---EYDIQELIR------------ASHSKFLDLAHFD---TF 654
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVF-IAAGQSAKVGFTMNACKS 720
+I V N G + V +++ G H IK ++ Y RV I G SA + + S
Sbjct: 655 EICVRNTGNITSDYVGLLFLSGNTGPGPHPIKSLVAYSRVHDIQGGTSATLTLKVT-LGS 713
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
+ VD + L G + +++ G ++ P +L
Sbjct: 714 VARVDKNGDLWLFPGPYRLVLDTKDGVLTHPFRL 747
>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
bisporus H97]
Length = 763
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 288/754 (38%), Positives = 412/754 (54%), Gaps = 46/754 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L P CD+ RA+ L++ T E +Q + + GVPRLGLP YEWWSEALHGV
Sbjct: 32 LKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEALHGVGH 91
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
+P G + ATSFP I+ A+F++ L K + VSTEARA N G AGL
Sbjct: 92 SPGVVFAPSG-----DFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRAGL 146
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKI 187
+++PNIN +DPRWGR ETPGEDP+ + +Y + V GLQ G+ D P +K+
Sbjct: 147 NYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG--GI------DPWPYIKV 198
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
+A CKH+AAYDL+NWEG DRFHFD++V++QD+ E ++ PF+ CV + +SVMCSYN VN
Sbjct: 199 AADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVN 258
Query: 248 GIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G+P CA LL +R W F ++ SDC ++ I +SH F E A A LKAG
Sbjct: 259 GVPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAE-AAAISLKAGT 317
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
D+DCG + + A+ Q I+ D+ + Y L+RLGYFD S Y+ +++
Sbjct: 318 DIDCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSHSQTYRQFDWSDVN 377
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
P+ L+ AA +G+VLLKND G LPL KT+A++GP+ NAT +M GNY G
Sbjct: 378 TPEAQALSRRAAVEGLVLLKND-GLLPLAPDG-KTIAIIGPYTNATSSMQGNYFGNAPFI 435
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
TSP G + A G + +++ AI+ A+ AD V V G+D ++E EG D
Sbjct: 436 TSPFQGAQDVGFKVVSAAGTI-VNGTSSAGFAEAINTARAADVVVFVGGIDNTLEREGLD 494
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R + PG Q +L+ +A K P+ +V G VD N K+++I+W GYPG+ GG
Sbjct: 495 RSSISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANEKVQAIIWAGYPGQSGG 553
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
AI D+I G P GRLP+T Y A+Y ++ T M LRP ++ PGRTYK++ PV+ +G
Sbjct: 554 TAIFDIIVGATAPAGRLPVTQYPADYTHQVRMTDMSLRPSSHNPGRTYKWYKTPVL-EYG 612
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
+GL +T F + P + + D + R + D+ D TF
Sbjct: 613 HGLHFTTFDFSWQRQPAA---EYDIQELIR------------ASHSKFLDLAHFD---TF 654
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVF-IAAGQSAKVGFTMNACKS 720
+I V N G + V +++ G H IK ++ Y RV I G SA + + S
Sbjct: 655 EICVRNTGNITSDYVGLLFLSGNSGPGPHPIKSLVAYSRVHDIQGGTSATLTLKVT-LGS 713
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
+ VD + L G + +++ G ++ P +L
Sbjct: 714 VARVDKNGDLWLFPGPYRLVLDTKDGVLTHPFRL 747
>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
Length = 741
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 294/738 (39%), Positives = 400/738 (54%), Gaps = 51/738 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD ERA L+ TL EK+ G+ A GVPRLGLP Y+WW EALHGV+
Sbjct: 34 CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALHGVA------- 86
Query: 75 SPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PG F E ATSFP IL A+F+++L + VSTEARA N +G+ FW+
Sbjct: 87 ESPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGIDFWT 146
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN +DPRWGR ETPGEDP+ + Y N + GLQ EY R I A CK
Sbjct: 147 PNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKR--------IVATCK 198
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+AAYDL+NWEGN R+ FD+ V+ QD+ E + F C + +V S MCSYN VNG+P+C
Sbjct: 199 HFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGVPSC 258
Query: 253 ADPKLLNQTIRGDW---NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
A+ LL +R W N YI SDCD+IQ I E H + T+ + VA L AG DLDC
Sbjct: 259 ANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPH-YYTATRAETVADALNAGTDLDC 317
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS--PQYKNLGKNNICNPQH 367
G+YY A QG E+ ++ +L Y L++LGYFD + Y+ +G N+ P+
Sbjct: 318 GEYYPENLGAAYDQGLFTESTLNRALIRQYAALVKLGYFDPADIQPYRQIGWANVSTPEA 377
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
ELA AA +GI LLKND G LPL+ +IKT+AL+GP ANAT M GNY G SP+
Sbjct: 378 EELAYTAAVEGITLLKND-GTLPLSP-SIKTIALIGPWANATTQMQGNYYGVAPYLISPL 435
Query: 428 DGFYAYSKVINYA--PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
+ Y+ PG D S PAA AA+ ADA + G+D++VEAE DR
Sbjct: 436 MAAEELGFTVYYSAGPGVDD---PTTSSFPAAFAAAEAADAIIYAGGIDITVEAEAMDRY 492
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
L PG Q + I++++ K P+ ++ G +D + NP + +++W GYPG+ GG+A
Sbjct: 493 TLDWPGVQPDFIDQLSLLGK-PLIVLQFGGGQIDDSALLPNPGVNALVWGGYPGQSGGKA 551
Query: 546 IADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
I D+I G P GRLPIT Y +YV ++ T M LRP PGRTY ++ G + FG+G
Sbjct: 552 IMDIIVGNAAPAGRLPITQYPLDYVYQVAMTDMSLRPSPTNPGRTYMWYTGTPIVEFGFG 611
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
L YT F ++ Q +Y + T C+ V D+ C +T
Sbjct: 612 LHYTTFTASLS--------------QPSAPSYDIATLVSLCSGVAHPDL-CPFASYT--A 654
Query: 665 EVENMGKMDGSEVV--MVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
V N G S+ V + + G A K ++ Y+R+ A +++ SL
Sbjct: 655 NVTNTGSSVTSDFVSLLFLAGEHGPAPYPNKVLVAYDRLHAIAPLASQTTTLNLTLGSLS 714
Query: 723 IVDNAANSLLASGAHTIL 740
VD+ N++L G +T++
Sbjct: 715 RVDDYGNTILYPGEYTLI 732
>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
SS2]
Length = 770
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 268/617 (43%), Positives = 359/617 (58%), Gaps = 29/617 (4%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD L +RA LVE T+ E + + + GVPRLGLP Y+WWSE LHGV+
Sbjct: 37 CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVA------- 89
Query: 75 SPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PG +F + P ATSFP I+ +A+F+++L K +G V E R+ N G+AGL FW+
Sbjct: 90 DSPGVNFSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHAGLDFWT 149
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN +DPRWGR ETPGEDPY + +Y N ++GLQ E + ++ A CK
Sbjct: 150 PNINPFKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQGGVNPEPY-------FQVVATCK 202
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+A YDL++WE N R+ FD+ +T QD+ E ++ F+ C + + MCSYN VNGIPTC
Sbjct: 203 HFAGYDLEDWENNFRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASMCSYNAVNGIPTC 262
Query: 253 ADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
AD LL +R WNF ++ SDCD+++ I H + + A A L+AG DLDCG
Sbjct: 263 ADTYLLQDILRDYWNFDETRWVTSDCDAVENIYNPHNY-TALPQQAAADALRAGTDLDCG 321
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHI 368
+YT + A Q I E ++ +L Y L+RLGYFD + Q Y+ G +N+ P
Sbjct: 322 TFYTEYLPLAYNQSLITETELRAALTRQYASLVRLGYFDPAAQQPYRQYGWSNVDTPYAQ 381
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+LA AA +GI LLKND G LPL + +K +AL+GP ANAT M GNY G SP+
Sbjct: 382 QLAYTAATEGITLLKND-GTLPLPS-TLKNIALIGPWANATNQMQGNYFGVAPYLVSPLQ 439
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
G A + Y G +I + + AAI AA+ ADA V G+D++VEAE DR ++
Sbjct: 440 GALAAGYNVTYVFGT-NITSNSTAGFAAAIAAAREADAVVYAGGIDVTVEAEAMDRYNVT 498
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
PG Q +LI ++A K P + G VD K N + S++W GYPG+ GG+A+ D
Sbjct: 499 WPGNQLQLIGELAALGK-PFVVAQFGGGQVDDTEIKANASVNSLIWAGYPGQSGGQALFD 557
Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN---FPGRTYKFFDGPVVYPFGYG 604
+I GK P GRL T Y A+YV +IP T M LRP N PGRTYK++ G VY FGYG
Sbjct: 558 IISGKVAPAGRLVTTQYPADYVYEIPMTDMNLRPNANGTTSPGRTYKWYTGAPVYEFGYG 617
Query: 605 LSYTQFKYKVASSPKSV 621
L YT F Y +P S
Sbjct: 618 LHYTNFTYTWTKAPAST 634
>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
Length = 771
Score = 466 bits (1199), Expect = e-128, Method: Compositional matrix adjust.
Identities = 301/764 (39%), Positives = 410/764 (53%), Gaps = 85/764 (11%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L RA+ LV MT EKV + GVPRLGLP Y WWSEALHGV+
Sbjct: 37 LSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVA- 95
Query: 69 IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
PG F P ATSFP IL A+F++ L K++ VSTE RA N G +
Sbjct: 96 ------GSPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRS 149
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
GL FW+PNIN RD RWGR ETPGEDP V RY + V GLQ+ G + P K
Sbjct: 150 GLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIG-------PANP-K 201
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
+ A CKH+AAY L++W G R F++ V+ QD+ E ++ PF+ C + V +VMCSYN +
Sbjct: 202 VVATCKHFAAYGLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNAL 261
Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NG+P CAD LL +R W + +I SDC +I I H F T +A A L A
Sbjct: 262 NGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTT-TPAEAAATALNA 320
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
G DLDCG + + A +G + +D +L LY ++LGYFD + Y+++G +
Sbjct: 321 GTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSFVKLGYFDPAEDQPYRSIGWTD 380
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ P LA +AA +GIVLLKND LPL TLAL+GP+ANATK M GNYEG P
Sbjct: 381 VDTPAVEALAHKAAGEGIVLLKNDK-TLPLKAKG--TLALIGPYANATKQMQGNYEG-PA 436
Query: 422 RYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
+Y + +A ++ + YA G A I + + AA+ AAK AD V G+D ++E
Sbjct: 437 KYIRTL--LWAATQAGYDVKYAAGTA-INTNSTAGFDAALSAAKQADVVVYAGGIDNTIE 493
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
AEG+DR + PG Q LI++++ K P+ +V G VD + +NP++ ++LW GYP
Sbjct: 494 AEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYP 552
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
+EGG AI D++ GK P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D V
Sbjct: 553 SQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMALRPGSNTPGRTYRWYDKAV 612
Query: 598 VYPFGYGLSYTQFKYK--------------VASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
+ PFG+GL YT FK V+ SPK+V I D+ D
Sbjct: 613 L-PFGFGLHYTTFKISWPRRALGPYNTAALVSRSPKNVPI----DRAAFD---------- 657
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV 701
TF I+V N GK V +++ K G +K ++GY R
Sbjct: 658 -----------------TFHIQVTNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRA 700
Query: 702 -FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
I G+ V ++ + +N + +L G +T+ V G
Sbjct: 701 KQIKPGEKRSVDIEVSLGSLARTAEN-GDLVLYPGRYTLEVDVG 743
>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
SS1]
Length = 778
Score = 466 bits (1199), Expect = e-128, Method: Compositional matrix adjust.
Identities = 285/744 (38%), Positives = 402/744 (54%), Gaps = 44/744 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L++ CD+ L RA+ LV +T+ EK + + GVPRLGLP Y WWSE LHGV+
Sbjct: 36 LANTTVCDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSEGLHGVAS 95
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
T +P G +F ATSFP IL A+F+++L I +STEARA N ++GL
Sbjct: 96 SPGVTFAPAGQNFSY----ATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFNHSGL 151
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
FW+PNIN VRDPRWGR LETPGEDP+ + Y V GLQ D + K+
Sbjct: 152 DFWTPNINPVRDPRWGRSLETPGEDPFHLASYVAKLVTGLQ-------FGGDDPKYQKLV 204
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKHYA YDL+NW G R+ FD+ ++ QD+ E F+ PF+ C + +V+SVMCSYN VNG
Sbjct: 205 ATCKHYAGYDLENWGGYARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSVMCSYNAVNG 264
Query: 249 IPTCADPKLLNQTIRGDWNFH--------GYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
IP+CA+ LL +R W + Y+ SDCD++ I H + T E AVA
Sbjct: 265 IPSCANDYLLQSLLRTYWGWEPDSESLNAHYVTSDCDAVSNIYYPHNY-TITPEQAVAVS 323
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG 358
LKAG DLDCG +Y + + +QG + DID +L Y L LGYFD + Y+
Sbjct: 324 LKAGTDLDCGTFYAEWLPSSYEQGLFHQTDIDRALIRSYAALFLLGYFDPAEGQIYRQYN 383
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
NI +LA AA +GI LLKN + LPL + + +AL+GP ANAT M GNY+G
Sbjct: 384 WANINTDYAQQLAYTAAWEGITLLKNIDDMLPLPS-TMTNIALIGPWANATTQMQGNYQG 442
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
SP+ + Y G +I + + AA+ AA+ AD T+ + G+D++VE
Sbjct: 443 IAPFLHSPLYALQQRGINVTYVLGT-NITSNSTAGFAAALAAAQTADLTLYIGGIDITVE 501
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
AE DRV++ PG Q +LI ++A+ + + + M G +D NPK+ +LW GYP
Sbjct: 502 AEAMDRVNITWPGNQLDLIAQLANVSTH-LIVYQMGGGQIDDTVLLENPKVHGLLWGGYP 560
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
G++GG A+ D+++G P GRLP++ Y AN++ ++P T M L P PGRTYK++ G +
Sbjct: 561 GQDGGTAMIDILYGSRAPAGRLPLSQYPANFINEVPMTDMRLHPALGTPGRTYKWYSGDL 620
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
V PFGYGL YT F KD R + N+ ++ +D K
Sbjct: 621 VLPFGYGLHYTTFAKAAL-----------KDHSPRSSDIATLVNEAKQSSAWLD----KA 665
Query: 658 YKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTM 715
+ F EV N G + V + Y + G A ++ Y R+ + G++ V F +
Sbjct: 666 FFDVFAAEVTNTGSLTSDYVALGYLTGEFGPAPYPKSSLVSYTRLSQVTPGETQVVNFDL 725
Query: 716 NACKSLKIVDNAANSLLASGAHTI 739
S+ D + L G +T+
Sbjct: 726 T-LGSIARADYYGDLYLYPGTYTL 748
>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
bisporus H97]
Length = 767
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 295/758 (38%), Positives = 421/758 (55%), Gaps = 54/758 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD RAK L++ T E +Q +++ GVPRLG+P Y+WWSEALHGV+
Sbjct: 32 LSSTAVCDPTKAPAARAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVA- 90
Query: 69 IGRRTNSPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
PG F E ATSFP I+ ++F+ L K + +STEARA N A
Sbjct: 91 ------GSPGVSFAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRA 144
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-L 185
GL +++PNIN +DPRWGR ETPGEDP+ V +Y + + GLQ G+ D RP
Sbjct: 145 GLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG--GI------DPRPYF 196
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K++A CKHYAAYDLD+WEG DRFHFD++V+ QD+ E ++ F+ CV + V+SVMCSYN
Sbjct: 197 KVAADCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNS 256
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
VNGIP CA+P LL +R W F ++ SDCD+I I +H F DT +AVA LKA
Sbjct: 257 VNGIPACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFTTHNF-TDTFAEAVADALKA 315
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G D+DCG Y+ A+ Q I D++ +L Y LMRLGYFD S + L ++
Sbjct: 316 GTDVDCGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRLGYFDPPESQPLRQLAWSD 375
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ P LA AA +G+VLLKND G LP++ KT+A++GP+ANATK M GNY GT
Sbjct: 376 VNKPDAQALAHTAAVEGLVLLKND-GFLPVSASG-KTIAIIGPYANATKDMQGNYFGTAP 433
Query: 422 RYTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
+P G +++V++ A I + + AAI A ++D + G++ S+E+
Sbjct: 434 FIVTPFQGAVDAGFNEVVSAA--GTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIES 491
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E KDR+ + G Q L+ ++A K PV +V G +D + +N +++++W GYPG
Sbjct: 492 EAKDRLTIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPG 550
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
+ GG AI DVI G P GRL +T Y ++V ++ T M LRP + PGRTYK++ G V
Sbjct: 551 QSGGTAIFDVITGAVAPAGRLSVTQYPEDFVNQVGMTDMALRPGSANPGRTYKWYTGRPV 610
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
FG+GL +T F + P + + + +T P D+ D
Sbjct: 611 LEFGHGLHFTTFDFSWRGRPG-------RKYNIQHLLHTADKKFP--------DLIPLD- 654
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMN 716
TF + + N G + V +++ + G A K ++ + R I AG SA V +N
Sbjct: 655 --TFHVNIRNTGNITSDYVALLFLRSNAGFAPHPKKSLVSFARAHRIDAGSSATVDLGVN 712
Query: 717 ACKSLKIVDNAANSLLASGAHTIL--VGEGVGGVSFPL 752
S+ VD +S L +G + ++ +G+GV SF L
Sbjct: 713 -LGSIARVDEHGDSWLFAGDYQLVLDIGDGVLSHSFSL 749
>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 781
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 287/742 (38%), Positives = 400/742 (53%), Gaps = 57/742 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD RAK LV TL EK+ + A GV RLG+P Y+WW+E LHG++
Sbjct: 37 CDPSASTLARAKSLVALYTLEEKINATSNSAPGVARLGVPPYQWWNEGLHGIA------- 89
Query: 75 SPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
P T F + +TSFP IL A+F++ L ++ + +STEARA N GL FW+
Sbjct: 90 -GPFTSFAKQGDYSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRTGLDFWT 148
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN RDPRWGR ETPGED Y + Y + GLQ Y R + A CK
Sbjct: 149 PNINPFRDPRWGRGQETPGEDSYHLSSYVKALIHGLQGNATDPYRR--------VVATCK 200
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
HYA YD++NW GN R+ D ++++QD+ E ++ PFE CV + +V + MCSYN VNG P C
Sbjct: 201 HYAGYDIENWNGNLRYQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPC 259
Query: 253 ADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
ADP LL +R W + ++ SDCD+IQ + H++ + T+E A A L AG DLDC
Sbjct: 260 ADPYLLQTVLREHWGWSSDDHWVTSDCDAIQNVYLPHQW-SSTREGAAADSLNAGTDLDC 318
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQ 366
G Y GAV+QG E +D +L Y L++LGYFD +P+ Y+ LG + +
Sbjct: 319 GTYLQTHLPGAVKQGLTDETTLDKALIRQYSSLIKLGYFD-APENQPYRQLGFDAVATSA 377
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
LA +AA +GIVLLKND G LP+N G+ K + + G ANAT + GNY G TSP
Sbjct: 378 SQALALKAAEEGIVLLKND-GVLPINLGS-KQVGIYGDWANATSQLQGNYFGVAKFLTSP 435
Query: 427 MDGFYAYSKVINYA----PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
+ + YA G D S + I +D + V G+D VE+E +
Sbjct: 436 LMALQNLGVDVKYAGNLPGGQGDPTTGAWSSLSGVI---TTSDVHIWVGGIDNGVESEDR 492
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DR L L G Q ++I ++AD K PV +VIM G +D + NPKI ++LW GYPG++G
Sbjct: 493 DRSWLTLTGGQLDVIGQLADTGK-PVIVVIMGGGQIDTSPLIRNPKISAVLWAGYPGQDG 551
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G AI +++ GK P GRLP T Y + YV ++P T M +RP + PGRTYK++ G ++ F
Sbjct: 552 GTAIVNILTGKAAPAGRLPQTQYPSKYVSEVPMTDMAMRPSDKNPGRTYKWYTGEPIFEF 611
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
GYGL YT F + + PK D + C N T G + +C T
Sbjct: 612 GYGLHYTNFSASITNQPKQSYAISDLVKGC---NSTGGFLE-----------RCPFTGIT 657
Query: 662 FQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACK 719
+ V+N GK+ V + + + G K ++ Y+R+F IAAG S+ +
Sbjct: 658 --VSVQNTGKISSDYVTLGFLTGSFGPKPYPKKSLVAYDRLFNIAAGSSSTATLNLT-LA 714
Query: 720 SLKIVDNAANSLLASGAHTILV 741
SL VD + N +L G + + +
Sbjct: 715 SLARVDESGNKVLYPGDYELQI 736
>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 451
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 228/422 (54%), Positives = 299/422 (70%), Gaps = 18/422 (4%)
Query: 3 ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
++ L+ + +C+ RA DLV R+TL EKV + D +PRLG+PLYEWWSEA
Sbjct: 43 DASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYEWWSEA 102
Query: 63 LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
LHGVS++G PGT F VPGATSFP ILT ASFN +L++ IG+ VS EARAM+N
Sbjct: 103 LHGVSYVG------PGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHN 156
Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
+G AGLTFWSPNIN+ RDPRWGR ETPGEDP + +YA+ YV GLQ S +
Sbjct: 157 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAV-------SGA 209
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
LK++ACCKHY AYD+DNW+G +R+ FD+ V++QD+ +TF PF+ CV +G+V+SVMCS
Sbjct: 210 GALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCS 269
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
YN+VNG PTCAD LL+ IRGDW +GYI SDCDS+ + + + T EDA A +K
Sbjct: 270 YNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPEDAAAISIK 328
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGK 359
AGLDL+CG + T+ AVQ GK++E+D+D ++ + LMRLG+FDG P+ + NLG
Sbjct: 329 AGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGP 388
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+++C P + ELA EAARQGIVLLKN G LPL+ +IK++A++GP+ANA+ MIGNYEGT
Sbjct: 389 SDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNYEGT 447
Query: 420 PC 421
C
Sbjct: 448 SC 449
>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
Length = 534
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 250/543 (46%), Positives = 340/543 (62%), Gaps = 19/543 (3%)
Query: 219 MQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDS 278
+++T+ +PF+ CV EG V+SVMCSYN+VNG PTCADP LL TIRG W GYIVSDCDS
Sbjct: 1 LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDS 60
Query: 279 IQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFL 338
+ + +S F T E+A A +KAGLDLDCG + T AV +G + E D++ +L L
Sbjct: 61 VGVLYDSQHF-TPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANL 119
Query: 339 YIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGN 395
V MRLG FDG P Y NLG ++C P H LA EAARQGIVLL+N GALPL+
Sbjct: 120 LSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTR 179
Query: 396 IKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIP 455
+T+A++GP+++AT MIGNY G C YT+P+ G Y K I +A GCA++ C + +I
Sbjct: 180 HRTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVKTI-HAKGCANVACVGDQLIG 238
Query: 456 AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSA 515
A AA+ ADA V+V GLD S+EAE +DR +LLPG Q EL+ ++ A KGP +V+MS
Sbjct: 239 EAEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSG 298
Query: 516 GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPY 574
G +D++FAKN+ KI ILWVGYPG+ GG AIADV+FG NPGG+LP+TWY +Y+ K+P
Sbjct: 299 GPIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPM 358
Query: 575 TSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
T+M LR P +PGRTY+F+ GPVV+PFG+GLSY++F A +P I L
Sbjct: 359 TNMGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQSFAEAP--TKISLPLSSLSP 416
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
+ + TV + CA+V I+V+N G +DGS ++V+S P +
Sbjct: 417 NSSATVKVSHTDCASV---------SDLPIMIDVKNTGTVDGSHTILVFSTVPNQTWSPE 467
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPL 752
K +IG+E+V + AG +V ++ C L VD + G H + +G+ +S
Sbjct: 468 KHLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEHKLHIGDLTHSISLQA 527
Query: 753 QLN 755
L
Sbjct: 528 DLQ 530
>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 767
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 295/758 (38%), Positives = 420/758 (55%), Gaps = 54/758 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD RA L++ T E +Q +++ GVPRLG+P Y+WWSEALHGV+
Sbjct: 32 LSSTAVCDPTKAPAARATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVA- 90
Query: 69 IGRRTNSPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
PG F E ATSFP I+ ++F+ L K + +STEARA N A
Sbjct: 91 ------GSPGVSFAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRA 144
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-L 185
GL +++PNIN +DPRWGR ETPGEDP+ V +Y + + GLQ G+ D RP
Sbjct: 145 GLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG--GI------DPRPYF 196
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K++A CKHYAAYDLD+WEG DRFHFD++V+ QD+ E ++ F+ CV + V+SVMCSYN
Sbjct: 197 KVAADCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNS 256
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
VNGIP CA+P LL +R W F ++ SDCD+I I +H F DT +AVA LKA
Sbjct: 257 VNGIPACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFTTHNF-TDTFAEAVADALKA 315
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G D+DCG Y+ A+ Q I D++ +L Y LMRLGYFD S + L ++
Sbjct: 316 GTDVDCGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRLGYFDPPESQPLRQLAWSD 375
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ P LA AA +G+VLLKND G LP++ KT+A++GP+ANATK M GNY GT
Sbjct: 376 VNKPDAQALAHTAAVEGLVLLKND-GFLPVSASG-KTIAIIGPYANATKDMQGNYFGTAP 433
Query: 422 RYTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
+P G +++V++ A I + + AAI A ++D + G++ S+E+
Sbjct: 434 FIVTPFQGAVDAGFNEVVSAA--GTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIES 491
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E KDR+ + G Q L+ ++A K PV +V G +D + +N +++++W GYPG
Sbjct: 492 EAKDRLTIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPG 550
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
+ GG AI DVI G P GRL +T Y ++V ++ T M LRP + PGRTYK++ G V
Sbjct: 551 QSGGTAIFDVITGAVAPAGRLSVTQYPEDFVNQVGMTDMALRPGSANPGRTYKWYTGRPV 610
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
FG+GL +T F + P + + + +T P D+ D
Sbjct: 611 LEFGHGLHFTTFDFSWRGRPG-------RKYNIQHLLHTADKKFP--------DLIPLD- 654
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMN 716
TF + + N G + V +++ K G A K ++ + R I AG SA V +N
Sbjct: 655 --TFHVNIRNTGNITSDYVALLFLKSNAGFAPHPKKSLVSFARAHRIDAGSSATVDLGVN 712
Query: 717 ACKSLKIVDNAANSLLASGAHTIL--VGEGVGGVSFPL 752
S+ VD +S L +G + ++ +G+GV SF L
Sbjct: 713 -LGSIARVDEHGDSWLFAGDYQLVLDIGDGVLSHSFSL 749
>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 797
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 289/748 (38%), Positives = 407/748 (54%), Gaps = 54/748 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L RAK LV MTL EK+ + G PRLGLP Y WW+EALHGV+
Sbjct: 56 LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA- 114
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
G + +F ATSFP IL A+F++ L K++ +STEARA N G+AGL
Sbjct: 115 EGHGVSFSDSGNFSY----ATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGL 170
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+W+PNIN RDPRWGR ETPGEDP + RY + V GLQD G E RP K+
Sbjct: 171 DYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVV 222
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKH+AAYDL+NWEG +R+ FD+ V+ QD+ E ++ F+ C + V +VMCSYN +NG
Sbjct: 223 ATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNG 282
Query: 249 IPTCADPKLLNQTIRGDWNFH---GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
IPTCAD LL +R W + ++ DC +I I H ++ A A L AG
Sbjct: 283 IPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGT 341
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
DLDCG + + A+QQG ++ +L LY L++LGYFD + Y+++G N +
Sbjct: 342 DLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVF 401
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
P ELA +A +GIV+LKND G LPL + T+A++GP ANAT + GNYEG P
Sbjct: 402 TPAAEELAHKATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQGNYEGPPKYI 458
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
+ + + ++ G DI +++ AI AAK AD + G+D ++E E +D
Sbjct: 459 RTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQD 517
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R ++ PG Q +LI +++D K P+ +V G VD + N + ++LW GYP + GG
Sbjct: 518 RTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGG 576
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
A+ D++ GK P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D V+ PFG
Sbjct: 577 AAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNNPGRTYRWYDKAVL-PFG 635
Query: 603 YGLSYTQFKYK---VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
+GL YT F P + D + GT P L D
Sbjct: 636 FGLHYTTFNVSWNHAEYGPYNTD------------SVASGTTNAPVDTELFD-------- 675
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGT--HIKQVIGYERV-FIAAGQSAKVGFTMN 716
TF I V N G + + +++ G+ IK ++GY R I GQS +V ++
Sbjct: 676 -TFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVS 734
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEG 744
+ +N + +L G++ + V G
Sbjct: 735 VGSVARTAEN-GDLVLYPGSYKLEVDVG 761
>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 776
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 289/748 (38%), Positives = 407/748 (54%), Gaps = 54/748 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L RAK LV MTL EK+ + G PRLGLP Y WW+EALHGV+
Sbjct: 35 LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA- 93
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
G + +F ATSFP IL A+F++ L K++ +STEARA N G+AGL
Sbjct: 94 EGHGVSFSDSGNFSY----ATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGL 149
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+W+PNIN RDPRWGR ETPGEDP + RY + V GLQD G E RP K+
Sbjct: 150 DYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVV 201
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKH+AAYDL+NWEG +R+ FD+ V+ QD+ E ++ F+ C + V +VMCSYN +NG
Sbjct: 202 ATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNG 261
Query: 249 IPTCADPKLLNQTIRGDWNFH---GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
IPTCAD LL +R W + ++ DC +I I H ++ A A L AG
Sbjct: 262 IPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGT 320
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
DLDCG + + A+QQG ++ +L LY L++LGYFD + Y+++G N +
Sbjct: 321 DLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVF 380
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
P ELA +A +GIV+LKND G LPL + T+A++GP ANAT + GNYEG P
Sbjct: 381 TPAAEELAHKATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQGNYEGPPKYI 437
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
+ + + ++ G DI +++ AI AAK AD + G+D ++E E +D
Sbjct: 438 RTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQD 496
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R ++ PG Q +LI +++D K P+ +V G VD + N + ++LW GYP + GG
Sbjct: 497 RTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGG 555
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
A+ D++ GK P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D V+ PFG
Sbjct: 556 AAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNNPGRTYRWYDKAVL-PFG 614
Query: 603 YGLSYTQFKYK---VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
+GL YT F P + D + GT P L D
Sbjct: 615 FGLHYTTFNVSWNHAEYGPYNTD------------SVASGTTNAPVDTELFD-------- 654
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGT--HIKQVIGYERV-FIAAGQSAKVGFTMN 716
TF I V N G + + +++ G+ IK ++GY R I GQS +V ++
Sbjct: 655 -TFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVS 713
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEG 744
+ +N + +L G++ + V G
Sbjct: 714 VGSVARTAEN-GDLVLYPGSYKLEVDVG 740
>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
Length = 771
Score = 464 bits (1193), Expect = e-127, Method: Compositional matrix adjust.
Identities = 303/764 (39%), Positives = 412/764 (53%), Gaps = 85/764 (11%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L RA+ LV MT EKV + GVPRLGLP Y WWSEALHGV+
Sbjct: 37 LSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVA- 95
Query: 69 IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
PG F P ATSFP IL A+F++ L K++ VSTE RA N G +
Sbjct: 96 ------GSPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRS 149
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
GL FW+PNIN RD RWGR ETPGEDP V RY + V GLQ+ G + P K
Sbjct: 150 GLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIG-------PANP-K 201
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
+ A CKH+AAYDL++W G R F++ V+ QD+ E ++ PF+ C + V +VMCSYN +
Sbjct: 202 VVATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNAL 261
Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NG+P CAD LL +R W + +I SDC +I I H F T +A A L A
Sbjct: 262 NGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTT-TPAEAAATALNA 320
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
G DLDCG + + A +G + +D +L LY L++LGYFD + Y+++G +
Sbjct: 321 GTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSLVKLGYFDPAEDQPYRSIGWTD 380
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ P LA +AA +GIVLLKND LPL TLAL+GP+ANATK M GNYEG P
Sbjct: 381 VDTPAAEALAHKAAGEGIVLLKNDK-TLPLKAKG--TLALIGPYANATKQMQGNYEG-PA 436
Query: 422 RYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
+Y + +A ++ + YA G A I + + AA+ AAK AD V G+D ++E
Sbjct: 437 KYIRTL--LWAATQAGYDVKYAAGTA-INTNSTAGFDAALSAAKQADVVVYAGGIDNTIE 493
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
AEG+DR + PG Q LI++++ K P+ +V G VD + +NP++ ++LW GYP
Sbjct: 494 AEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALLWAGYP 552
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
+EGG AI D++ GK P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D V
Sbjct: 553 SQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMALRPGSNTPGRTYRWYDKAV 612
Query: 598 VYPFGYGLSYTQFKYK--------------VASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
+ PFG+GL YT FK V+ SPK+V I D+ D
Sbjct: 613 L-PFGFGLHYTTFKISWPRRALGPYNTAALVSRSPKNVPI----DRAAFD---------- 657
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV 701
TF I+V N GK V +++ K G +K ++GY R
Sbjct: 658 -----------------TFHIQVTNTGKTTSDYVALLFLKTTDAGPKPYPLKTLVGYTRA 700
Query: 702 -FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
I G+ V ++ + +N + +L G +T+ V G
Sbjct: 701 KQIKPGEKRSVDIEVSLGSLARTAEN-GDLVLYPGRYTLEVDVG 743
>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
Length = 776
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 289/748 (38%), Positives = 407/748 (54%), Gaps = 54/748 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L RAK LV MTL EK+ + G PRLGLP Y WW+EALHGV+
Sbjct: 35 LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA- 93
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
G + +F ATSFP IL A+F++ L K++ +STEARA N G+AGL
Sbjct: 94 EGHGVSFSDSGNFSY----ATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGL 149
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+W+PNIN RDPRWGR ETPGEDP + RY + V GLQD G E RP K+
Sbjct: 150 DYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVV 201
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKH+AAYDL+NWEG +R+ FD+ V+ QD+ E ++ F+ C + V +VMCSYN +NG
Sbjct: 202 ATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNG 261
Query: 249 IPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
IPTCAD LL +R W + ++ DC +I I H ++ A A L AG
Sbjct: 262 IPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGT 320
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
DLDCG + + A+QQG ++ +L LY L++LGYFD + Y+++G N +
Sbjct: 321 DLDCGSVFPEYLRSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVF 380
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
P ELA +A +GIV+LKND G LPL + T+A++GP ANAT + GNYEG P
Sbjct: 381 TPAAEELAHKATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQGNYEGPPKYI 437
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
+ + + ++ G DI +++ AI AAK AD + G+D ++E E +D
Sbjct: 438 RTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQD 496
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R ++ PG Q +LI +++D K P+ +V G VD + N + ++LW GYP + GG
Sbjct: 497 RTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGG 555
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
A+ D++ GK P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D V+ PFG
Sbjct: 556 AAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNNPGRTYRWYDKAVL-PFG 614
Query: 603 YGLSYTQFKYK---VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
+GL YT F P + D + GT P L D
Sbjct: 615 FGLHYTTFNVSWNHAEYGPYNTD------------SVASGTTNAPVDTELFD-------- 654
Query: 660 FTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMN 716
TF I V N G + + +++ + G IK ++GY R I GQS +V ++
Sbjct: 655 -TFSITVTNTGNVASDYIALLFLTADRVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVS 713
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEG 744
+ +N + +L G++ + V G
Sbjct: 714 VGSVARTAEN-GDLVLYPGSYKLEVDVG 740
>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 797
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 289/748 (38%), Positives = 406/748 (54%), Gaps = 54/748 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L RAK LV MTL EK+ + G PRLGLP Y WW+EALHGV+
Sbjct: 56 LSKNNVCDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA- 114
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
G + +F ATSFP IL A+F++ L K++ +STEARA N G+AGL
Sbjct: 115 EGHGVSFSDSGNFSY----ATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGL 170
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+W+PNIN RDPRWGR ETPGEDP + RY + V GLQD G E RP K+
Sbjct: 171 DYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVV 222
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKH+AAYDL+NWEG +R+ FD+ V+ QD+ E ++ F+ C + V +VMCSYN +NG
Sbjct: 223 ATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNG 282
Query: 249 IPTCADPKLLNQTIRGDWNFH---GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
IPTCAD LL +R W + ++ DC +I I H ++ A A L AG
Sbjct: 283 IPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGT 341
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
DLDCG + + A+QQG + +L LY L++LGYFD + Y+++G N +
Sbjct: 342 DLDCGSVFPEYLGSALQQGLYNNQTLYNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVF 401
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
P ELA +A +GIV+LKND G LPL + T+A++GP ANAT + GNYEG P
Sbjct: 402 TPAAEELAHKATVEGIVMLKND-GTLPLKSNG--TVAIIGPFANATTQLQGNYEGPPKYI 458
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
+ + + ++ G DI +++ AI AAK AD + G+D ++E E +D
Sbjct: 459 RTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQD 517
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R ++ PG Q +LI +++D K P+ +V G VD + N + ++LW GYP + GG
Sbjct: 518 RTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGG 576
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
A+ D++ GK P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D V+ PFG
Sbjct: 577 AAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNNPGRTYRWYDKAVL-PFG 635
Query: 603 YGLSYTQFKYK---VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
+GL YT F P + D + GT P L D
Sbjct: 636 FGLHYTTFNVSWNHAEYGPYNTD------------SVASGTTNAPVDTELFD-------- 675
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGT--HIKQVIGYERV-FIAAGQSAKVGFTMN 716
TF I V N G + + +++ G+ IK ++GY R I GQS +V ++
Sbjct: 676 -TFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKLDVS 734
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEG 744
+ +N + +L G++ + V G
Sbjct: 735 VGSVARTAEN-GDLVLYPGSYKLEVDVG 761
>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 849
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 278/731 (38%), Positives = 390/731 (53%), Gaps = 53/731 (7%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHF-- 81
RAK LV TL EK+ + A GV RLG+P Y+WW+E LHG++ P T F
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIA--------GPFTSFAK 165
Query: 82 DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
+ +TSFP IL A+F+++L ++ +STEARA N+ GL FW+PNIN RDP
Sbjct: 166 QGDYSYSTSFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRTGLDFWTPNINPFRDP 225
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RWGR ETPGED Y + Y + GLQ E Y R + A CKHYA YD++N
Sbjct: 226 RWGRGQETPGEDSYHLSSYVKALIHGLQGNETDPYRR--------VVATCKHYAGYDIEN 277
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W GN R+ D ++++QD+ E ++ PFE CV + +V + MCSYN VNG P CADP +L
Sbjct: 278 WNGNLRYQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYMLQTV 336
Query: 262 IRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM 318
+R W + ++ SDCDSIQ + H++ + T+E A A L AG DLDCG Y +
Sbjct: 337 LREHWGWSSDEHWVTSDCDSIQNVYLPHQW-SSTREGAAADSLNAGTDLDCGTYLQSHLP 395
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEAAR 376
GAV+QG E +D +L Y L++LGYFD + Y+ LG + + LA +AA
Sbjct: 396 GAVKQGLTNETTLDNALIRQYSSLIKLGYFDIPENQPYRQLGFDAVATSASQALALKAAE 455
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV 436
+GIVLLKND G LP+N G+ K + + G ANAT + GNY G TSP
Sbjct: 456 EGIVLLKND-GVLPINFGS-KNVGIYGDWANATSQLQGNYFGVAKFLTSPYMALEKLGVN 513
Query: 437 INYA----PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
+ YA G D + + I +D + V G+D +E+E +DR L L G
Sbjct: 514 VRYAGNLPGGQGDPTTGSWPRLSGVI---TTSDVHIWVGGMDNGIESEDRDRSWLTLTGS 570
Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
Q ++I ++AD K PV ++IM G +D + NPKI ++LW GYPG++GG AI +++ G
Sbjct: 571 QLDVIGQLADTGK-PVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGGTAIVNILTG 629
Query: 553 KYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFK 611
K P GRLP T Y YV ++P T M +RP N PGRTYK++ G ++ FGYGL YT F
Sbjct: 630 KAAPAGRLPQTQYLYKYVSEVPMTDMAMRPSNKNPGRTYKWYTGKPIFEFGYGLHYTNFS 689
Query: 612 YKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGK 671
+ + PK D + C + G C I+ + V+N GK
Sbjct: 690 ASITNQPKQSYAISDLVKGCN----STGGFLERCPFTGIN------------VSVQNTGK 733
Query: 672 MDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS 730
V + + + G K ++ Y+R+F A S+ SL VD + N
Sbjct: 734 TSSDYVTLGFLTGSFGPKPYPKKSLVAYDRLFNIAASSSSTATLNLTLASLARVDESGNK 793
Query: 731 LLASGAHTILV 741
+L G + + +
Sbjct: 794 VLYPGDYELQI 804
>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
Length = 771
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 299/761 (39%), Positives = 405/761 (53%), Gaps = 79/761 (10%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L RA+ LV MT EKV + GVPRLGLP Y WWSEALHGV+
Sbjct: 37 LSKLAVCDTSLDVTTRARSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVA- 95
Query: 69 IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
PG F P ATSFP IL A+F++ L K++ VSTE RA N G A
Sbjct: 96 ------GSPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRA 149
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
GL FW+PNIN RD RWGR ETPGEDP V RY + V GLQ+ G + P K
Sbjct: 150 GLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIG-------PANP-K 201
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
+ A CKH+AAYDL++W G R F++ V+ QD+ E ++ PF+ C + V +VMCSYN +
Sbjct: 202 VVATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVMCSYNAL 261
Query: 247 NGIPTCADPKLLNQTIRGDWNFH---GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NG+P CAD LL +R W + +I DC +I I H + T +A A L A
Sbjct: 262 NGVPACADSYLLQTILREHWKWDEPGHWITGDCGAIDDIYNGHNY-TKTPAEAAATALNA 320
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
G DLDCG + + A +G +D +L LY L++LGYFD + Y+++G +
Sbjct: 321 GTDLDCGTVFPKYLGQAADEGLYTNKTLDKALVRLYSSLVKLGYFDPAEDQPYRSIGWKD 380
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ +P LA +AA +GIVLLKND LPL TLAL+GP+ANATK M GNYEG P
Sbjct: 381 VDSPAAEALAHKAAVEGIVLLKNDK-TLPLKAKG--TLALIGPYANATKQMQGNYEGPPK 437
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ + + Y G A I + + AA+ AAK AD V G+D ++EAEG
Sbjct: 438 YIRTLLWAATQAGYDVKYVAGTA-INANSTAGFDAALSAAKQADVVVYAGGIDNTIEAEG 496
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR ++ PG Q +LI++++ K P+ +V G VD + +NP + ++LW GYP +E
Sbjct: 497 HDRTTIVWPGNQLDLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPHVNALLWTGYPSQE 555
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
GG AI D++ GK P GRLP+T Y A+YV ++P T M LRP +N PGRTY+++D V+ P
Sbjct: 556 GGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPLTDMALRPGSNTPGRTYRWYDKAVL-P 614
Query: 601 FGYGLSYTQFKYK--------------VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
FG+GL YT FK V+ SPK+V I D+ D
Sbjct: 615 FGFGLHYTTFKISWPRRALGPYDTAALVSRSPKNVPI----DRAAFD------------- 657
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV-FI 703
TF I+V N GK V +++ K G +K ++GY R I
Sbjct: 658 --------------TFHIQVTNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRAKQI 703
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
G+ V ++ + +N + +L G +T+ V G
Sbjct: 704 KPGEKRSVDIKVSLGSLARTAEN-GDLVLYPGRYTLEVDVG 743
>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 822
Score = 460 bits (1184), Expect = e-126, Method: Compositional matrix adjust.
Identities = 280/741 (37%), Positives = 407/741 (54%), Gaps = 50/741 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L P CD L ER LV+ +TL EK+ + D + G RLGLP YEWWSEA HGV
Sbjct: 74 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-- 131
Query: 69 IGRRTNSPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
S PG F S+ ATSFP ILT ASF+++L +KI + + E RA N G
Sbjct: 132 -----GSAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGF 186
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+G FW+PNIN RDPRWGR ETPGEDP V Y N+V GLQ D +
Sbjct: 187 SGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---------GDDPKNK 237
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
++ A CKHYA YDL+ R+ + T+QD+ + F+ PF+ CV + DV S+MCSYN
Sbjct: 238 QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNS 293
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
V+GIP CA+ LL++ +R WNF+ Y+VSDC ++ I + H F DT+E A + L
Sbjct: 294 VSGIPACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-TDTEEAAASVALN 352
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
AG+DL+CG Y ++ + + +D SL LY L +G+FDG +Y L +++
Sbjct: 353 AGVDLECGSSYLKLNE-SLAANQTSVKVMDQSLARLYSALFTVGFFDGG-KYDKLDFSDV 410
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGTPC 421
P LA EAA +G+ LLKND+ LPL++ + K++A++GP ANAT M G+Y G
Sbjct: 411 STPDAQALAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAP 469
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
SP++ F +NYA G A + QN S A+ AA +D + + G+D S+E+E
Sbjct: 470 YLISPLEAFGDSRWKVNYALGTA-MNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESET 528
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR L PG Q +LI ++ +K P+ +V G VD + N I++++W GYP +
Sbjct: 529 LDRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQS 587
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
GG A+ DV+ GK +P GRLP+T Y A+Y ++ + LRP +++PGRTYK++ G V P
Sbjct: 588 GGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDSYPGRTYKWYTGKPVLP 647
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYGL YT+F + + L+++ +D+ V + + + D+
Sbjct: 648 FGYGLHYTKFMFDWEKT-------LNREYNIQDL---VASCRNSSGGPINDNTPLT---- 693
Query: 661 TFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
T ++ V+N+G V +++ SK G A K ++ Y R+ A S +V
Sbjct: 694 TVKVRVKNVGHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTL 753
Query: 719 KSLKIVDNAANSLLASGAHTI 739
SL D + ++ G + I
Sbjct: 754 GSLARADENGSLVIFPGRYKI 774
>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 882
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 280/741 (37%), Positives = 407/741 (54%), Gaps = 50/741 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L P CD L ER LV+ +TL EK+ + D + G RLGLP YEWWSEA HGV
Sbjct: 134 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-- 191
Query: 69 IGRRTNSPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
S PG F S+ ATSFP ILT ASF+++L +KI + + E RA N G
Sbjct: 192 -----GSAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGF 246
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+G FW+PNIN RDPRWGR ETPGEDP V Y N+V GLQ D +
Sbjct: 247 SGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---------GDDPKNK 297
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
++ A CKHYA YDL+ R+ + T+QD+ + F+ PF+ CV + DV S+MCSYN
Sbjct: 298 QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNS 353
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
V+GIP CA+ LL++ +R WNF+ Y+VSDC ++ I + H F DT+E A + L
Sbjct: 354 VSGIPACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-TDTEEAAASVALN 412
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
AG+DL+CG Y ++ + + +D SL LY L +G+FDG +Y L +++
Sbjct: 413 AGVDLECGSSYLKLNE-SLAANQTSVKVMDQSLARLYSALFTVGFFDGG-KYDKLDFSDV 470
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGTPC 421
P LA EAA +G+ LLKND+ LPL++ + K++A++GP ANAT M G+Y G
Sbjct: 471 STPDAQALAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAP 529
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
SP++ F +NYA G A + QN S A+ AA +D + + G+D S+E+E
Sbjct: 530 YLISPLEAFGDSRWKVNYALGTA-MNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESET 588
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR L PG Q +LI ++ +K P+ +V G VD + N I++++W GYP +
Sbjct: 589 LDRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQS 647
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
GG A+ DV+ GK +P GRLP+T Y A+Y ++ + LRP +++PGRTYK++ G V P
Sbjct: 648 GGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDSYPGRTYKWYTGKPVLP 707
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYGL YT+F + + L+++ +D+ V + + + D+
Sbjct: 708 FGYGLHYTKFMFDWEKT-------LNREYNIQDL---VASCRNSSGGPINDNTPLT---- 753
Query: 661 TFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
T ++ V+N+G V +++ SK G A K ++ Y R+ A S +V
Sbjct: 754 TVKVRVKNVGHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTL 813
Query: 719 KSLKIVDNAANSLLASGAHTI 739
SL D + ++ G + I
Sbjct: 814 GSLARADENGSLVIFPGRYKI 834
>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 775
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 281/741 (37%), Positives = 404/741 (54%), Gaps = 50/741 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L P CD L ER LV+ +TL EK+ + D + G RLGLP YEWWSEA HGV
Sbjct: 27 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVG- 85
Query: 69 IGRRTNSPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
S PG F S+ ATSFP ILT ASF+++L +KI + + E R N G
Sbjct: 86 ------SAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGF 139
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+G FW+PNIN RDPRWGR ETPGEDP V Y N+V GLQ D +
Sbjct: 140 SGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---------GDDPKNK 190
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
++ A CKHYA YDL+ R+ + T+QD+ E F+ PF+ CV + DV S+MCSYN
Sbjct: 191 QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSEYFLAPFKTCVRDTDVGSIMCSYNS 246
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
V+GIP CA+ LL++ +R WNF+ Y+VSDC ++ I + H F DT+E A + L
Sbjct: 247 VSGIPACANEYLLDEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-TDTEEAAASVALN 305
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
AG+DL+CG Y ++ + + +D SL LY L +G+FDG +Y L +++
Sbjct: 306 AGVDLECGSSYLKLNE-SLAANQTSVKVMDQSLARLYSALFTVGFFDGG-KYDKLDFSDV 363
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGTPC 421
P LA EAA +G+ LLKND+ LPL++ + K++A++GP ANAT M G+Y G
Sbjct: 364 STPDAQALAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAP 422
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
SP++ F +NYA G A I QN S A+ AA +D + + G+D S+E+E
Sbjct: 423 YLISPLEAFGDSRWKVNYALGTA-INNQNTSGFEEALAAANKSDLIIYLGGIDNSLESET 481
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR L PG Q +LI ++ +K P+ +V G VD + N I++++W GYP +
Sbjct: 482 LDRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQS 540
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
GG A+ DV+ GK +P GRLP+T Y A+Y ++ + LRP + +PGRTYK++ G V P
Sbjct: 541 GGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDLYPGRTYKWYTGKPVLP 600
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYGL YT+F + + L+++ +D+ V + + + D+
Sbjct: 601 FGYGLHYTKFMFDWEKT-------LNREYNIQDL---VASCRNSSGGPINDNTPLT---- 646
Query: 661 TFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
T + V+N+G V +++ SK G A K ++ Y R+ A S +V
Sbjct: 647 TVKARVKNVGHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTL 706
Query: 719 KSLKIVDNAANSLLASGAHTI 739
SL D + ++ G + I
Sbjct: 707 GSLARADENGSLVIFPGRYKI 727
>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 796
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 292/756 (38%), Positives = 410/756 (54%), Gaps = 64/756 (8%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS-FIGRRT 73
CD ERA +V+ MTL EKV +G A G RLGLP Y+W +EALHGV+ G +
Sbjct: 75 CDTTKSIAERAAAIVKPMTLNEKVANVGSSASGSARLGLPAYQWQNEALHGVAGSTGVQF 134
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
SP G +F + ATSFP IL +A+F+++L K + +STEARA N G AGL FW+P
Sbjct: 135 QSPLGANFSA----ATSFPMPILLSAAFDDALVKSVATAISTEARAFANYGFAGLDFWTP 190
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN RDPRWGR +ETPGED + + Y + V GLQ +++R + CKH
Sbjct: 191 NINPFRDPRWGRGMETPGEDAFRIQGYVLALVDGLQGGIDPDFYR--------TLSTCKH 242
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+AAYD++ N R + T+QDM + ++ FE CV + V+S+MC+YN V+G+P CA
Sbjct: 243 FAAYDIE----NGRTANNLSPTQQDMADYYLPMFETCVRDAKVASIMCAYNAVDGVPACA 298
Query: 254 DPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
D LL +R + F Y+VSDCD+++ + + H + + + A A + AG DLDCG
Sbjct: 299 DSYLLQDVLRDTYGFTEDFNYVVSDCDAVENVFDPHHYAANLTQ-AAAMSINAGTDLDCG 357
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIEL 370
Y N +VQ G EA +D SL LY L+++GYFD +Y +LG N+ Q L
Sbjct: 358 SSY-NVLNASVQAGLTTEATLDKSLIRLYSALVKVGYFDQPAEYNSLGWGNVNTTQSQAL 416
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A +AA +G+ LLKND G LPL+ + +A++GP AN T M GNY GT +P+ F
Sbjct: 417 AHDAATEGMTLLKND-GTLPLSR-TLSNVAVIGPWANVTTQMQGNYAGTAPLLVNPLSVF 474
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ + YA G A I Q+ S AA+ AA ++D V + G+D+SVE EG DR + P
Sbjct: 475 QQKWRNVKYAQGTA-INSQDTSGFNAALSAASSSDVIVYLGGIDISVENEGFDRSSITWP 533
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q LI+++A+ K P+ +V G +D + +N K+ SILW GYPG++GG AI DV+
Sbjct: 534 GNQLNLISQLANLGK-PLVIVQFGGGQIDDSALLSNSKVNSILWAGYPGQDGGNAIFDVL 592
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
G P GRLP+T Y ANYV M LRP N PGRTY ++ G V PFGYGL YT
Sbjct: 593 TGANPPAGRLPVTQYPANYVNNNNIQDMNLRPSNGIPGRTYAWYTGTPVLPFGYGLHYTN 652
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F S+ T G++ A L+++ TF V N+
Sbjct: 653 FSLSFQSTK------------------TAGSD----IATLVNNAGSNKDLATFATIVVNV 690
Query: 670 GKMDGSE--------VVMVYSKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKS 720
G ++ + S G A KQ+ Y RV + G + ++ T+N S
Sbjct: 691 KNTGGKANLASDYVGLLFLKSTNAGPAPHPNKQLAAYGRVRNVGVGATQQLTLTVN-LGS 749
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
L D + + GA+T+++ V+ PL N
Sbjct: 750 LARADTNGDRWIYPGAYTLIL-----DVNGPLTFNF 780
>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 822
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 281/741 (37%), Positives = 404/741 (54%), Gaps = 50/741 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L P CD L ER LV+ +TL EK+ + D + G RLGLP YEWWSEA HGV
Sbjct: 74 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVG- 132
Query: 69 IGRRTNSPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
S PG F S+ ATSFP ILT ASF+++L +KI + + E RA N G
Sbjct: 133 ------SAPGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGF 186
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+G FW+PNIN RDPRWGR ETPGEDP V Y N+V GLQ D +
Sbjct: 187 SGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQ---------GDDPKNK 237
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
++ A CKHYA YDL+ R+ + T+QD+ + F+ PF+ CV + DV S+MCSYN
Sbjct: 238 QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNS 293
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
V+GIP CA+ LL++ +R WNF+ Y+VSDC ++ I + H F DT+E A + L
Sbjct: 294 VSGIPACANEYLLDEVLRKHWNFNSDYYYVVSDCGAVTDIWQYHNF-TDTEEAAASVALN 352
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
AG+DL+CG Y ++ + + +D SL LY L +G+FDG +Y L +++
Sbjct: 353 AGVDLECGSSYLKLNE-SLAANQTSVKVMDRSLARLYSALFTVGFFDGG-KYDKLDFSDV 410
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLN-TGNIKTLALVGPHANATKAMIGNYEGTPC 421
P LA EAA +G+ LLKND+ LPL+ K++A++GP ANAT M G+Y G
Sbjct: 411 STPDAQALAYEAAVEGMTLLKNDD-LLPLDFPHKYKSVAVIGPFANATTQMQGDYSGDAP 469
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
SP++ F +NYA G A I QN S A+ AA +D + + G+D S+E+E
Sbjct: 470 YLISPLEAFGDSRWKVNYALGTA-INNQNTSGFEEALAAANKSDLIIYLGGIDNSLESET 528
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR L PG Q +LI ++ +K P+ +V G VD + N I++++W GYP +
Sbjct: 529 LDRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQS 587
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
GG A+ DV+ GK +P GRLP+T Y A+Y ++ + LRP +++PGRTYK++ G V P
Sbjct: 588 GGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDSYPGRTYKWYTGKPVLP 647
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYGL YT+F + + L+++ +D+ V + + + D+
Sbjct: 648 FGYGLHYTKFMFDWEKT-------LNREYNIQDL---VASCRNSSGGPINDNTPLT---- 693
Query: 661 TFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
T + V+N+G V +++ SK G A K ++ Y R+ A S +V
Sbjct: 694 TVKARVKNVGHKTSDYVSLLFLSSKNAGPAPRPNKSLVSYVRLLNIARGSDQVAELPLTL 753
Query: 719 KSLKIVDNAANSLLASGAHTI 739
SL D + ++ G + I
Sbjct: 754 GSLARADENGSLVIFPGRYKI 774
>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 714
Score = 457 bits (1175), Expect = e-125, Method: Compositional matrix adjust.
Identities = 281/753 (37%), Positives = 400/753 (53%), Gaps = 99/753 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L + +RAKDLV RMTLPEK+ QM A +PRL +P Y WW+E LHGV+ G
Sbjct: 13 YKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGVARAGI-- 70
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------N 125
AT FP I A+FN L K+ + +S EARA ++
Sbjct: 71 --------------ATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGIY 116
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDPY+ R + +V+GLQ D + L
Sbjct: 117 KGLTFWSPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQG---------DDPKYL 167
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K+ A KHYA + + + R FD+RV+++D++ET++ FE CV EG S+M +YNR
Sbjct: 168 KVVATPKHYAVH---SGPESQRHSFDARVSQKDLRETYLPAFEECVKEGKAVSIMGAYNR 224
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
NG P CA LL +R +W F GY+VSDC +I I HK + T ++ A + G
Sbjct: 225 TNGEPCCASKTLLKDILRDEWGFDGYVVSDCGAIDDIHMHHK-VTKTAAESAALAVNNGC 283
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNIC 363
+L+CG Y + AV+QG I+E ID ++ L+ MRLG FD +Y ++ +
Sbjct: 284 ELNCGKTY-EYLCQAVEQGLISEETIDQAVIKLFTARMRLGMFDPPEMVRYAHIPYDVND 342
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+P+H ELA E ARQ IVLLKND LPL+ +KT+A++GP+A+ ++ NY GTP +Y
Sbjct: 343 SPEHRELALETARQSIVLLKNDENILPLSK-KLKTIAVIGPNADDLDVLLANYFGTPSKY 401
Query: 424 TSPMDGFYAY----SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
+P++G +KV+ YA GC ++ + A++ A+ AD ++ GL +E
Sbjct: 402 VTPLEGIKNKVSPDTKVL-YAKGC-EVTGNSVDGFDEAVNIAEMADIVIMCLGLSPRIEG 459
Query: 480 E---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
E G DR+ + LPG Q +L+ + K P+ LV+++ A+ IN+A + +
Sbjct: 460 EEGDVADSDGGGDRLHIDLPGMQEQLLETIYGTGK-PIVLVLLNGSAIAINWAHEH--VP 516
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
+I+ YPGEEGG AIADV+FG YNP GRLPIT+ + P+T N GRTY
Sbjct: 517 AIIEAWYPGEEGGTAIADVLFGDYNPAGRLPITFVRSLDDLPPFTDY------NMKGRTY 570
Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
++F+ +YPFGYGLSYT FKY N + + P L
Sbjct: 571 RYFEKEPLYPFGYGLSYTSFKYS---------------------NLRLSAMRLPAGNNL- 608
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSA 709
++VEN GK+ G EVV +Y S ++Q+ G + + + GQ
Sbjct: 609 ----------DINVDVENTGKLAGREVVQLYISDVEASVEVPMRQLCGIQCITLEPGQKQ 658
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V FT+ + + + D +L G I VG
Sbjct: 659 TVSFTVEP-QHMSLFDYDGKRILEPGQFIIAVG 690
>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
Length = 893
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 285/750 (38%), Positives = 414/750 (55%), Gaps = 55/750 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L P CD L RAK LV+ MT EKVQ + + G RLGLP Y+WW+EALHGV+
Sbjct: 159 LCSNPICDTSLDPLTRAKGLVDAMTFEEKVQNTQNGSPGAARLGLPAYQWWNEALHGVAG 218
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
T P G ATSFP IL +A+F+++L K++G VS E RA N GNAGL
Sbjct: 219 SPGVTFQPSG-----NFSYATSFPQPILMSAAFDDALIKEVGTVVSIEGRAFNNYGNAGL 273
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
FW+PNIN RDPRWGR ETPGEDPY + RY N V GLQ+ G+ + + P ++
Sbjct: 274 DFWTPNINPFRDPRWGRGQETPGEDPYHIARYVYNLVDGLQN--GI-----APANP-RVV 325
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKH+A YD+++WEGN R+ F++ ++ QD+ E ++ PF+ C + V ++MCSYN VNG
Sbjct: 326 ATCKHFAGYDIEDWEGNSRYGFNAIISTQDLSEYYLPPFKSCARDAQVDAIMCSYNAVNG 385
Query: 249 IPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
IPTCAD LL+ +R WN++ ++ SDCD++ I H++ + + A A L AG
Sbjct: 386 IPTCADSYLLDTILRDHWNWNQTGHWVTSDCDAVDNIYSDHRYTS-SLAAAAADALNAGT 444
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICN 364
+LDCG +N A Q A ++++L +LY L+RLG+FD QY +LG +++
Sbjct: 445 NLDCGTTMSNNLAAAAAQDLFKNATLNSALVYLYSSLVRLGWFDSEDSQYSSLGWSDVGT 504
Query: 365 PQHIELAAEAARQGIVLLKNDN-GALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+LA AA +GIVLLKND+ LPL+ + +T+AL+GP+ANAT + GNY GTP
Sbjct: 505 TASQQLANRAAVEGIVLLKNDHKKVLPLSQ-HGQTIALIGPYANATTQLQGNYYGTPAYI 563
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
+ + G + Y G I + S AA+ AAK AD + G+D S+EAE D
Sbjct: 564 RTLVWGAEQMGYTVQYEAGTG-INSTDTSGFAAAVAAAKTADIVIYAGGIDNSIEAEAMD 622
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R + G Q +LI++++ K P+ ++ G +D + N + ++LW GYP + GG
Sbjct: 623 RNTIAWTGNQLQLIDQLSQVGK-PLVVLQFGGGQLDDSALLQNENVNALLWCGYPSQTGG 681
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
+A+ D++ G+ P GRLP+T Y ANY IP T M LRP + PGRTY+++D V+ PFG
Sbjct: 682 QAVFDILTGQSAPAGRLPVTQYPANYTNAIPMTDMSLRPNGSTPGRTYRWYDDAVI-PFG 740
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT- 661
+GL YT F D ++ P A L+ Y+ T
Sbjct: 741 FGLHYTTF----------------------DASWADKKFGPYNTASLVAKASKSKYQDTA 778
Query: 662 ----FQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV-FIAAGQSAKVGFT 714
F + V+N GK+ V ++++ G IK +I Y R I G++ V
Sbjct: 779 PFDSFHVNVKNTGKVTSDFVALLFASTDNAGPKPYPIKTLISYARASSIKPGETRTVSID 838
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGEG 744
+ + N + +L G++T+ + G
Sbjct: 839 VTIGSIARTATN-GDLVLYPGSYTLQLDVG 867
>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
Length = 771
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 287/704 (40%), Positives = 396/704 (56%), Gaps = 51/704 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD RA+ LV+ M+ EKV A GVPRLGLP Y WWSEALHGV+
Sbjct: 37 LSKLAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVA- 95
Query: 69 IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
PG HF P ATSF IL ASF++ L K++ V TE RA N G A
Sbjct: 96 ------GAPGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRA 149
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
GL +W+PNIN RDPRWGR ETPGEDP V RY + V GLQ G +RP +
Sbjct: 150 GLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIG-------PARP-Q 201
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
I+A CKH+AAYD+++W G R FD+RV+ QD+ E ++ F+ CV + V +VMCSYN +
Sbjct: 202 IAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNAL 261
Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NG+PTCADP LL +R W++ ++VSDC +I I H + T +A A L A
Sbjct: 262 NGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNY-TKTGAEAAAVALNA 320
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
G DLDCG + A +QG +D +L LY L++LGYFD + + Y ++G +
Sbjct: 321 GTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFDPAEKQPYGSIGWKD 380
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ P +LA +AA +GIVLLKND LPL TLAL+GP+ANATK M GNY+G P
Sbjct: 381 VDTPAAEQLAHKAAVEGIVLLKNDQ-TLPLKAKG--TLALIGPYANATKQMQGNYQGPP- 436
Query: 422 RYTSPMD-GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
+Y ++ + + Y+PG A I + + AA+ AAK+AD + G+D ++E+E
Sbjct: 437 KYIRTLEWAATQHGYQVQYSPGTA-INNSSTAGFAAALAAAKDADVVLYAGGIDNTIESE 495
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DR + PG Q LI+++++ K P+ ++ G VD NP + ++LW GYP +
Sbjct: 496 TLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQ 554
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
EGG AI D++ GK P GRLPIT Y A Y ++P T M LR + PGRTY+++D VV
Sbjct: 555 EGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGLRAGGDNPGRTYRWYDKAVV- 613
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFG+GL YT F ++ D+ + N N+ P + + D D
Sbjct: 614 PFGFGLHYTSF-----------EVSWDRGR-LGPYNTAALVNRAPGGSHV--DRALFD-- 657
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV 701
TF+++V+N G + V +++ K G +K ++GY RV
Sbjct: 658 -TFRVQVQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLVGYTRV 700
>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
Length = 803
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 287/704 (40%), Positives = 396/704 (56%), Gaps = 51/704 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD RA+ LV+ M+ EKV A GVPRLGLP Y WWSEALHGV+
Sbjct: 69 LSKLAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVA- 127
Query: 69 IGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
PG HF P ATSF IL ASF++ L K++ V TE RA N G A
Sbjct: 128 ------GAPGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRA 181
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
GL +W+PNIN RDPRWGR ETPGEDP V RY + V GLQ G +RP +
Sbjct: 182 GLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIG-------PARP-Q 233
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
I+A CKH+AAYD+++W G R FD+RV+ QD+ E ++ F+ CV + V +VMCSYN +
Sbjct: 234 IAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNAL 293
Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NG+PTCADP LL +R W++ ++VSDC +I I H + T +A A L A
Sbjct: 294 NGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNY-TKTGAEAAAVALNA 352
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
G DLDCG + A +QG +D +L LY L++LGYFD + + Y ++G +
Sbjct: 353 GTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFDPAEKQPYGSIGWKD 412
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ P +LA +AA +GIVLLKND LPL TLAL+GP+ANATK M GNY+G P
Sbjct: 413 VDTPAAEQLAHKAAVEGIVLLKNDQ-TLPLKAKG--TLALIGPYANATKQMQGNYQGPP- 468
Query: 422 RYTSPMD-GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
+Y ++ + + Y+PG A I + + AA+ AAK+AD + G+D ++E+E
Sbjct: 469 KYIRTLEWAATQHGYQVQYSPGTA-INNSSTAGFAAALAAAKDADVVLYAGGIDNTIESE 527
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DR + PG Q LI+++++ K P+ ++ G VD NP + ++LW GYP +
Sbjct: 528 TLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQ 586
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
EGG AI D++ GK P GRLPIT Y A Y ++P T M LR + PGRTY+++D VV
Sbjct: 587 EGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGLRAGGDNPGRTYRWYDKAVV- 645
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFG+GL YT F ++ D+ + N N+ P + + D D
Sbjct: 646 PFGFGLHYTSF-----------EVSWDRGR-LGPYNTAALVNRAPGGSHV--DRALFD-- 689
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV 701
TF+++V+N G + V +++ K G +K ++GY RV
Sbjct: 690 -TFRVQVQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLVGYTRV 732
>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 865
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 260/607 (42%), Positives = 359/607 (59%), Gaps = 27/607 (4%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS-FIGRRT 73
CD L ERA +V+ MTL EKV +G A G RLGLP Y+W +EALHGV+ G +
Sbjct: 144 CDTTLSMAERAAAIVKPMTLDEKVANVGSSASGSARLGLPAYQWQNEALHGVAGSTGVQF 203
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
SP G +F + ATSFP IL +A+F+++L + + +STEARA N G AGL FW+P
Sbjct: 204 QSPLGANFSA----ATSFPMPILLSAAFDDALVQNVATAISTEARAFANYGFAGLDFWTP 259
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN RDPRWGR +ETPGED + + Y + + GLQ ++ R I A CKH
Sbjct: 260 NINPFRDPRWGRGMETPGEDAFRIQGYVLALISGLQGGINPDFFR--------IIATCKH 311
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+AAYD++N + + T+QDM + ++ FE CV + V SVMC+YN V+GIP CA
Sbjct: 312 FAAYDIENGRTGNNLN----PTQQDMADYYLPMFETCVRDAKVGSVMCAYNAVDGIPACA 367
Query: 254 DPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
LL +R + F Y+VSDCD++ + + H + ++ E A A L AG DLDCG
Sbjct: 368 SEYLLQDVLRDGFGFTEDFNYVVSDCDAVDNVFDPHHYASNLTE-AAALSLNAGTDLDCG 426
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIEL 370
Y N +V+ +EA ++ SL LY L+++GYFD +YK+L N+ Q+ L
Sbjct: 427 SSY-NVLNASVEAALTSEAALNQSLVRLYSALIKVGYFDQPSEYKSLSWANVNTTQNQAL 485
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A +AA G+ LLKND G LPL+ + +A++GP NAT M GNY GT +P+D F
Sbjct: 486 AHDAATGGMTLLKND-GTLPLSR-TLSNVAIIGPWVNATTQMQGNYAGTAPFLVNPLDVF 543
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ YA G A I Q+ S AA+ AA ++D V + G+D++VE EG DR ++ P
Sbjct: 544 QQKWGNVKYAQGTA-INSQDTSGFSAALSAASSSDVIVYLGGIDITVENEGFDRGSIVWP 602
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q +LI+++A+ K P+ +V G +D + +NP ++SILW GYPG++GG A+ DV+
Sbjct: 603 GNQLDLISQLANLGK-PLVIVQFGGGQIDDSSLLSNPNVRSILWAGYPGQDGGNAVFDVL 661
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
G P GRLPIT Y A+Y+ M LRP N PGRTY ++ G V PFGYGL YT
Sbjct: 662 TGANPPAGRLPITQYPASYINNNNIQDMNLRPSNGIPGRTYAWYTGTPVLPFGYGLHYTN 721
Query: 610 FKYKVAS 616
F S
Sbjct: 722 FSVSFQS 728
>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 752
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 293/756 (38%), Positives = 406/756 (53%), Gaps = 59/756 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ P CDA L + ERA+ LVE T+PE + + A+GVPRLGLP YEWW+EALHGV
Sbjct: 30 LASNPVCDASLGHVERARALVEEFTVPEMINNTVNAAFGVPRLGLPPYEWWNEALHGVGL 89
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
SP F+ E ATSFP I ++F+++L +G +STEARA N G AGL
Sbjct: 90 ------SPGVVFFEPEPAVATSFPMPINMGSAFDDALMLAMGDVISTEARAFSNAGRAGL 143
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+W+PNIN +DPRWGR ETPGEDP RY + V GLQ G+ D LK++
Sbjct: 144 DYWTPNINPFKDPRWGRGAETPGEDPLHAARYVRSLVEGLQG--GI------DPPSLKVA 195
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKH+AAYDL+NW G R+ FD+ VT QD+ E + PF CV + +S MCSYN VNG
Sbjct: 196 AACKHWAAYDLENWGGVTRYAFDAVVTPQDLAEYYAPPFRSCVRDARAASAMCSYNAVNG 255
Query: 249 IPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
+P CA P LL +R W ++ SDC ++ + + H + D +A LKAG D
Sbjct: 256 VPACASPYLLKTVLRDAWGLAEDRWVTSDCGAVGNVYDPHGYTEDLV-NASTVSLKAGTD 314
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNIC 363
L+CG YT + A +G I E D+ +L LY L+ LGYFD +P+ Y+ + ++
Sbjct: 315 LNCGTNYTQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFD-APEDQPYRQITWADVN 373
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK-AMIGNYEGTPCR 422
P+ LA AA + VLLKND G LPL T + +LAL+GP ANA+ M+GNY G P
Sbjct: 374 TPEAQALAYTAAIKSFVLLKND-GTLPL-TDSTLSLALIGPMANASALQMLGNYFGIPPF 431
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
+P+ GF + Y G ++ + AA+ AA+ AD + V G+D ++E E K
Sbjct: 432 VIAPLQGFLDAGFNVTYVLGT-NVTGNDAGSFDAAVAAAEAADVVIYVGGIDNTLEMEEK 490
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DR ++ P Q L++ + K P+ +V M G +D K + + +ILW GYPG+ G
Sbjct: 491 DRTEISWPDNQLALLSALEGVGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSG 549
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF--PGRTYKFFDGPVVY 599
G AIAD + GK P GRL YV ++ T M LRP N PGRTYK++ G VY
Sbjct: 550 GTAIADTVTGKVAPAGRL--------YVDEVAMTDMTLRPDNATGNPGRTYKWYTGTPVY 601
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
P+GYGL YT AS D + C I G A+ +D
Sbjct: 602 PYGYGLHYTNISVAWAS---------DAPEACYSIQDLTGE-----ASGFVDLAPLD--- 644
Query: 660 FTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNA 717
TF++ V N G + V +++ S G A IK+++ Y R + G S +V +
Sbjct: 645 -TFRVTVTNEGDIASDFVALLFVSTQAGPAPAPIKEMVAYARASDVQPGNSTEVELEVT- 702
Query: 718 CKSLKIVDNAANSLLASGAHTILVG-EGVGGVSFPL 752
+L D + ++ L G + + +G +SF L
Sbjct: 703 LGALARTDESGDASLYPGKYELTFDYDGALSLSFEL 738
>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
Length = 860
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 290/756 (38%), Positives = 408/756 (53%), Gaps = 64/756 (8%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS-FIGRRT 73
CD RA +V+ MTL EKV +G A G RLGLP Y+W +EALHGV+ G +
Sbjct: 139 CDTTKSIAARAAAIVKPMTLNEKVANVGSSASGSGRLGLPAYQWQNEALHGVAGSTGVQF 198
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
SP G +F + ATSFP IL +A+F+++L + + +STEARA N G AGL FW+P
Sbjct: 199 QSPLGANFSA----ATSFPMPILLSAAFDDALVQSVATAISTEARAFANYGFAGLDFWTP 254
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN RDPRWGR +ETPGED + + Y ++ + GLQ ++ R + CKH
Sbjct: 255 NINPFRDPRWGRGMETPGEDAFRIQGYVLSLINGLQGGIDPDFFRTIST--------CKH 306
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+AAYD++ N R + T+QDM + ++ FE CV + V S+MC+YN VNG+P CA
Sbjct: 307 FAAYDIE----NGRTANNLSPTQQDMADYYLPMFETCVRDAKVGSIMCAYNSVNGVPACA 362
Query: 254 DPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
D LL +R + F Y+VSDCD+++ + + H + + + A A L AG DLDCG
Sbjct: 363 DSYLLQSVLRDGYGFTEDFNYVVSDCDAVENVYDPHHYAANLTQ-AAAMSLNAGTDLDCG 421
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIEL 370
Y N +VQ G EA +D SL LY L+++G+FD +Y +LG N+ Q L
Sbjct: 422 SSY-NVLNASVQAGMTTEATLDKSLIRLYSALIKVGWFDQPAKYSSLGWGNVNTTQTRAL 480
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A +AA G+ LLKND G LPL+ ++ +A++GP NAT + GNY GT +P+ F
Sbjct: 481 AHDAATGGMTLLKND-GTLPLSP-TLQNVAVIGPWVNATTQLQGNYAGTAPVLVNPLTVF 538
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ + YA G A I Q+ S AAI AA ++D V + G+D+SVE EG DR + P
Sbjct: 539 QQKWRNVKYAQGTA-INSQDTSGFNAAISAASSSDVIVYLGGIDISVENEGFDRTAITWP 597
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q LI+++A+ K P+ +V G +D + +N K+ SILW GYPG+EGG A+ DV+
Sbjct: 598 GNQLSLISQLANLGK-PLVIVQFGGGQIDDSSLLSNSKVNSILWAGYPGQEGGNALFDVL 656
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
G P GRLPIT Y ANYV M LRP + PGRTY ++ G V PFGYGL YT
Sbjct: 657 TGANPPAGRLPITQYPANYVNNNNIQDMNLRPSGSIPGRTYAWYTGTPVLPFGYGLHYTN 716
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F S+ S GT+ A ++++ + TF V N+
Sbjct: 717 FSVSFQSTKTS------------------GTD----VATIVNNAGSNKDRATFATLVVNV 754
Query: 670 GKMDGSE--------VVMVYSKPPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKS 720
G ++ + S G A KQ+ Y RV + G + ++ T+N S
Sbjct: 755 KNTGGKANLASDYVGLLFLKSTNAGPAPHPNKQLAAYGRVKKVGVGATQQLTLTVN-LGS 813
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
L D + + GA+T+ + V+ PL N
Sbjct: 814 LARADTNGDRWVYPGAYTLTL-----DVNGPLTFNF 844
>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 757
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 247/604 (40%), Positives = 356/604 (58%), Gaps = 35/604 (5%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+R K L++ +TL EK+ + D + G RLGLP YEWW+EA HGV S PG F
Sbjct: 24 QRVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGV-------GSAPGVQF- 75
Query: 83 SEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVV 138
+E P ATSFP ILT ASF+++L ++I + E RA N G +G FW+PNIN
Sbjct: 76 TEKPVNFSYATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFSGFDFWAPNINPF 135
Query: 139 RDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYD 198
RDPRWGR ETPGED +VV Y N++ GLQ D ++ A CKHYAAYD
Sbjct: 136 RDPRWGRGQETPGEDSFVVQSYIRNFIPGLQ---------GDDPEDKQVIATCKHYAAYD 186
Query: 199 LDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLL 258
L+ R+ D T+QD+ + F+ PF+ CV + V S+MC+YN V+GIPTCA LL
Sbjct: 187 LE----TGRYGNDYNPTQQDLADYFLAPFKTCVRDTGVGSIMCAYNAVDGIPTCASEYLL 242
Query: 259 NQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
+Q +R WNF + Y+VSDC ++ I + H F DT+E A + L AG+DL+CG Y
Sbjct: 243 DQVLRKHWNFTADYNYVVSDCGAVTDIWQYHNF-TDTEEAAASVSLNAGVDLECGSSYLK 301
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAA 375
+A +D +L LY L +G+FDG +Y LG ++ P+ LA EAA
Sbjct: 302 LNESLAANQTTVQA-LDQALTRLYSALFTVGFFDGG-KYTALGFADVSTPEAQSLAYEAA 359
Query: 376 RQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
+G+ LLKND LP+ + + K++AL+GP ANAT M G+Y G P SP++ F +
Sbjct: 360 VEGMTLLKNDKRLLPIRSSHKYKSVALIGPFANATTQMQGDYSGIPPFLISPLEAFKGHD 419
Query: 435 KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQT 494
+NYA G I Q + +A+ AA+ +D + + G+D S+EAE DR L PG Q
Sbjct: 420 WEVNYAMGTG-INNQTTTGFASALAAAEKSDLVIYLGGIDNSIEAETLDRTSLTWPGNQL 478
Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
+L+ +++ K P+ +V G +D + N +++++W GYP + GG A+ DV+ GK
Sbjct: 479 DLVTQLSKLHK-PLIVVQFGGGQLDDSALLQNEGVQALVWAGYPSQSGGSALLDVLLGKR 537
Query: 555 NPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYK 613
+ GRLP+T Y A+Y ++ + +RP +++PGRTYK++ G V PFGYGL YT+F+++
Sbjct: 538 SIAGRLPVTQYPASYADQVSIFDINIRPNDSYPGRTYKWYTGMPVVPFGYGLHYTKFEFE 597
Query: 614 VASS 617
A +
Sbjct: 598 WAQT 601
>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
Length = 750
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 268/765 (35%), Positives = 409/765 (53%), Gaps = 103/765 (13%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
++ F Y D L + +RAKDLV RMTL EKV QM ++ +PRLG+P Y WWSEALHGV+
Sbjct: 26 RMEIFDYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVA 85
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-- 125
G AT FP I A+F+E L + + +S E RA ++
Sbjct: 86 RAGV----------------ATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRK 129
Query: 126 ------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
GLTFWSPN+N+ RDPRWGR ET GEDPY+ GR +++++GLQ
Sbjct: 130 GDHGIYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQG--------- 180
Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
D + L+ +AC KH+A + ++R FD+ V+ +D++ET++ F+ CV E +V +V
Sbjct: 181 QDKKYLRAAACAKHFAVHSGPE---SERHSFDAVVSPKDLRETYLPAFKECVKEANVEAV 237
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
M +YNRVNG P C LL +T+R +W F G++VSDC +I+ E+H+ + E +VA
Sbjct: 238 MGAYNRVNGEPCCGSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPE-SVAL 296
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNL 357
L G DL+CG+ Y N + A Q+G + E I+T++ L + M+LG FD + Y N+
Sbjct: 297 ALNNGCDLNCGNMYLNLLI-AYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENVPYTNI 355
Query: 358 G-KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
G N C +H E A E +++ +VLLKN+N LPL+ I ++A++GP+AN+ +A+ GNY
Sbjct: 356 GFHQNDCQ-EHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNY 414
Query: 417 EGTPCRYTSPMDGF---YAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADAT 467
GT Y + ++G +++YA GC A+ + + A+ A+ AD
Sbjct: 415 CGTASNYITVLEGIREAVGKDTIVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIV 474
Query: 468 VIVAGLDLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAV 518
V+ GLD S+E E D++ L LPG Q EL+ + K P+ LV+++ A+
Sbjct: 475 VMCMGLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSAL 533
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
+ +A K+ +I+ YPG EGG+A+A IFG+Y+P G+LPIT+Y +T
Sbjct: 534 AVTWAAE--KVPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEELPEFTDYS 591
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
++ RTY++ +YPFGYGL YT F Y+ ++L++ Q +
Sbjct: 592 MK------NRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTQ------ISA 631
Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIG 697
G N V+C + V+N G E V +Y K I ++ G
Sbjct: 632 GEN-----------VQCS-------VLVKNTGNFASDETVQLYIKDVKASVEVPILELQG 673
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++V + G +V FT+ + L +++ N +L GA I VG
Sbjct: 674 IQKVHLLPGTEQEVFFTLTP-RQLALINEEGNCILEPGAFEIYVG 717
>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
Length = 774
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 284/738 (38%), Positives = 401/738 (54%), Gaps = 35/738 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS+ CD RA L++ T E + + GVPRLGLP Y WWSE LHGV+
Sbjct: 35 LSNNTVCDTSKDPITRATALIDLWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ 94
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
T +P G ATSFP IL A+F++ L + + VSTE RA N+G AGL
Sbjct: 95 SPGVTFAPSG-----NFSYATSFPQPILMGAAFDDPLIQAVASVVSTEGRAFNNVGRAGL 149
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKI 187
+W+PNIN +DPRWGR ETPGEDP+ + Y N + GLQ D P K+
Sbjct: 150 DYWTPNINPFKDPRWGRGQETPGEDPFHLQGYVYNLILGLQG--------GLDPTPYFKV 201
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
A CKH+AAYD+DNWEGN R+ F++ VT+QD+ E ++ F+ CV + V+SVMCSYN VN
Sbjct: 202 VADCKHFAAYDMDNWEGNVRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVMCSYNAVN 261
Query: 248 GIPTCADPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
GIP+CA+ LL +R W F ++ SDCD++Q I H + D A A L AG
Sbjct: 262 GIPSCANSFLLQDILRDYWGFDDTRWVTSDCDAVQNIYTPHNY-TDNPAQAAADALLAGT 320
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
D+DCG + + + A+ QG + D+ + Y L+RLGYFD S Y+ LG +++
Sbjct: 321 DIDCGTFSSTYLPDALSQGLVNATDLKRAAIRQYASLVRLGYFDPPESQPYRQLGWSDVN 380
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
P+ +LA AA +G+VLLKND G LPL+ +++ LAL+GP ANAT M GNY G
Sbjct: 381 TPEAQQLAHTAAVEGMVLLKND-GTLPLSK-HVRKLALIGPWANATTLMQGNYAGIAPYL 438
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
SP+ G + Y G + S AA+ AAK ADA + GLD +VE E D
Sbjct: 439 ISPLLGAQQAGFDVEYVFGTNVTTTNDTSGFAAAVAAAKRADAVIFAGGLDETVEREEVD 498
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R+++ PG Q +L+ ++A K P+ + G +D + K+ + +I+W GYPG+ GG
Sbjct: 499 RLNVTWPGNQLDLVAELASVGK-PLIVAQFGGGQLDDSALKSKRSVNAIIWGGYPGQSGG 557
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
A+ D++ GK P GRLPIT Y A Y ++P T M LRP PGRTYK++ G V+ FG
Sbjct: 558 TALFDILTGKAAPAGRLPITQYPAEYANQVPMTDMTLRPSATNPGRTYKWYTGTPVFEFG 617
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
+GL YT F + AS+ + + I+ + + A + D+ D TF
Sbjct: 618 FGLHYTTFSFAWASNAHA-----NTPAASYSIDALMASGNKSAAFL---DLAPLD---TF 666
Query: 663 QIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
+ V N GKM V +++ S G A KQ++ Y RV A + + + ++
Sbjct: 667 AVRVTNTGKMTSDYVALLFASGTFGPAPHPNKQLVAYTRVHGVAPKQSTIAELTVTLGAI 726
Query: 722 KIVDNAANSLLASGAHTI 739
D + + G +T+
Sbjct: 727 ARADESGAKWVYPGTYTL 744
>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 761
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 290/750 (38%), Positives = 404/750 (53%), Gaps = 49/750 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD L + ERA+ LVE +T+ E + A GVPRLGLP Y WW+EALHGV+
Sbjct: 35 CDTSLGHVERARALVEELTVAEMINNTVHTAPGVPRLGLPPYNWWNEALHGVAASPGVVF 94
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
+ PG F S ATSFP I ++F+++L +G STEARA N G AGL +W+PN
Sbjct: 95 TSPGEEFSS----ATSFPMPINMGSAFDDALMLAVGNVTSTEARAFNNAGLAGLDYWTPN 150
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN +DPRWGR ETPGEDP RY V GLQ G+ D LK++A CKH+
Sbjct: 151 INPFKDPRWGRGAETPGEDPLHAARYVRTLVEGLQG--GI------DPPSLKVAADCKHW 202
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
AAYDL++W G R+ FD+ VT QD+ E + PF+ CV + +SVMCSYN VNG+P CA
Sbjct: 203 AAYDLEDWGGVARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASVMCSYNAVNGVPACAS 262
Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
P LL +R W ++ SDCD++ + + H + D + A LKAG DLDCG
Sbjct: 263 PYLLKTVLRDAWGLAEDRWVTSDCDAVGNVYDPHGYTEDFV-NGSAVSLKAGSDLDCGTT 321
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIE 369
Y+ + A +G I E D+ +L LY L+ LGYFD +P+ Y+ + ++ P
Sbjct: 322 YSQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFD-APEDQPYRQISWADVNTPAAQA 380
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI-GNYEGTPCRYTSPMD 428
LA AA + VLLKND G LPL ++ ++AL+GP ANA+ + GNY G P +P+
Sbjct: 381 LAYTAAIESFVLLKND-GTLPLTDSSL-SIALIGPMANASAVQLQGNYNGIPPFAIAPLQ 438
Query: 429 GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
GF + Y G ++ + I A+ AA+ AD + V G+D +VE E KDR ++
Sbjct: 439 GFLDAGFNVTYVLGT-NVTGNDADDIDGAVAAAEAADVVIYVGGIDSTVEEEAKDRTEIS 497
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
P Q L++ + +A K P+ +V M G +D K + + +ILW GYPG+ GG AIAD
Sbjct: 498 WPDNQLALLSALEEAGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTAIAD 556
Query: 549 VIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF--PGRTYKFFDGPVVYPFGYGL 605
+ GK P GRL IT Y A+YV + T M LRP N+ PGRTYK++ G VYP+GYGL
Sbjct: 557 TVMGKVAPAGRLSITQYPASYVDAVAMTDMTLRPDNSTGNPGRTYKWYTGTPVYPYGYGL 616
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
YT F AS D + C I + +D TF++
Sbjct: 617 HYTNFSVAWAS---------DAPEACYSIQDLTSSADGFVDLAPLD---------TFRVT 658
Query: 666 VENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKI 723
V N G + V +++ S G A +K+++ Y R + G S V + +L
Sbjct: 659 VTNDGDVASDFVALLFVSTQAGPAPAPMKELVAYARASDVQPGDSTDVDLEVT-LGALAR 717
Query: 724 VDNAANSLLASGAHTILVG-EGVGGVSFPL 752
D + ++ L G + + +G +SF L
Sbjct: 718 SDESGDASLYPGDYELTFDYDGALSLSFEL 747
>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 758
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 262/617 (42%), Positives = 357/617 (57%), Gaps = 35/617 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L++ CD RA LV TL EK+ G+ + GVPR+GLP Y+WW+EALHG+++
Sbjct: 28 LANNTVCDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNEALHGIAY 87
Query: 69 IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
GTHF S ATSFP IL A+F+++L + +STEARA N
Sbjct: 88 ---------GTHFAAAGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNANR 138
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GL FW+PNIN +DPRWGR ETPGEDP+ V Y V GLQ G+ D P
Sbjct: 139 YGLNFWTPNINPYKDPRWGRGQETPGEDPFHVSSYVNALVTGLQG--GL------DDLPY 190
Query: 186 KIS-ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K A CKHYA YDL+N G R+ FD+ + QD+++ ++ F+ C + +V S+MCSYN
Sbjct: 191 KKGVATCKHYAGYDLENGGGIQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCSYN 250
Query: 245 RVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
VNG+PTCAD LL +R W + ++ SDCD++Q I +SH + + T E A A L
Sbjct: 251 AVNGVPTCADDWLLQSLLREHWGWVEEDQWVTSDCDAVQNIWDSHNYTS-TPEQAAADAL 309
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
AG DLDCG ++ + A Q + +D SL Y L+RLGYFD + Y+ LG
Sbjct: 310 NAGTDLDCGGFWPTYLGSAYNQSLYNISTLDRSLTRRYASLVRLGYFDPASIQPYRQLGW 369
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+++ P +LA +AA GIVLLKND G LPL + NI +AL+GP ANAT M GNY G
Sbjct: 370 SDVSTPSAEQLALQAAEDGIVLLKND-GILPLPS-NITNVALIGPWANATTQMQGNYYGQ 427
Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
SP+ + Y G ADI N + AAI AAK AD + + G+D S+EA
Sbjct: 428 APYLHSPLIAAQNAGFHVTYVQG-ADIDSTNTTEFTAAIAAAKKADVIIYIGGIDNSIEA 486
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINFAKNNPKIKSILWVGYP 538
E KDR + P Q L+N++A+ + + L+I G +D + N + I+W GYP
Sbjct: 487 EAKDRKTIAWPSSQISLVNQLANLS---IPLIISQMGTMIDSSSLLTNRGVNGIIWAGYP 543
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
G++GG AI +++ GK P GRLPIT Y ++YV ++ +M L P N PGRTYK+F+G
Sbjct: 544 GQDGGTAIFNILTGKTAPAGRLPITQYPSDYVNEVSMNNMNLHPGANNPGRTYKWFNGTS 603
Query: 598 VYPFGYGLSYTQFKYKV 614
++ FG+GL YT F K+
Sbjct: 604 IFDFGFGLHYTTFNAKI 620
>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 761
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 274/731 (37%), Positives = 382/731 (52%), Gaps = 52/731 (7%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD- 82
RA+ LV TL EK+ A GVPRLG+P Y+WWSE LHG++ P T+F
Sbjct: 10 RAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIA--------GPYTNFSD 61
Query: 83 -SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
E +TSFP IL A+F++ L + + +STEARA N GL FW+PNIN RDP
Sbjct: 62 SGEWSYSTSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRDP 121
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RWGR ETPGED Y + Y + GLQ Y R + A CKH+A YD+++
Sbjct: 122 RWGRGQETPGEDAYHLSSYVQALIHGLQGESTDPYKR--------VVATCKHFAGYDVED 173
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W GN R+ D ++T+Q++ E ++ PF+ CV + +V + MCSYN VNG P CADP LL
Sbjct: 174 WNGNLRYQNDVQITQQELVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQTI 232
Query: 262 IRGDW---NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM 318
+R W N ++ DCD++Q + H++ + T+ A A L AG D+ CG Y
Sbjct: 233 LREHWGWTNEEQWVTGDCDAVQNVYLPHQW-SPTRAGAAADSLVAGTDVTCGTYMQEHLP 291
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAAR 376
A QQ + E+ +D +L Y L+RLGYFD S Y+ LG + + LA AA
Sbjct: 292 AAFQQKLLNESSLDQALIRQYSSLVRLGYFDASENQPYRQLGFDAVATNASQALARRAAA 351
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV 436
+GIVLLKND G LPL+ + T+ L G ANAT ++GNY G SP+
Sbjct: 352 EGIVLLKND-GTLPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLHSPLYALEQTGVK 410
Query: 437 INYA----PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
INYA G D S + A +D + V G+D SVE EG+DR L G
Sbjct: 411 INYAGGNPGGQGDPTTNRWSNL---YGAYSTSDVLIYVGGIDNSVEEEGRDRGYLTWTGA 467
Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
Q ++I ++AD K PV +V+ G +D + NNP I +I+W GYPG++GG AI D+I G
Sbjct: 468 QLDVIGQLADTGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIIGG 526
Query: 553 KYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFK 611
K P GRLP T Y ANY + +M LRP N PGRTYK+++G + FGYG+ YT F
Sbjct: 527 KTAPAGRLPQTQYPANYTAAVSMMNMNLRPGENSPGRTYKWYNGSATFEFGYGMHYTNF- 585
Query: 612 YKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGK 671
S +I Q +Y + + C + +C + ++V N G
Sbjct: 586 --------SAEITTQMQQ-----SYAISSLASGCNSTGGFLERCP--FASVNVQVHNTGN 630
Query: 672 MDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS 730
+ + + Y + G A K ++ Y+R+ AG + SL VD N
Sbjct: 631 VTSDYITLGYMAGTFGPAPHPRKTLVSYKRLHSIAGGATSTATLNLTLASLARVDEHGNK 690
Query: 731 LLASGAHTILV 741
+L G +++ +
Sbjct: 691 VLYPGDYSLQI 701
>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
Length = 374
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 215/390 (55%), Positives = 270/390 (69%), Gaps = 19/390 (4%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLGL YEWWSEALHGVS +G PGT F P ATSFP VI T ASFN SLW+ I
Sbjct: 1 RLGLQGYEWWSEALHGVSNVG------PGTKFQGAFPAATSFPQVITTAASFNASLWQAI 54
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ VS EARAMYN G AGLT+WSPN+N+ RDPRWGR ETPGEDP + +YA +YV GLQ
Sbjct: 55 GQAVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQ 114
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
G LK++ACCKHY AYDLDNW G DRFHF+++V++QD+++T+ +PF+
Sbjct: 115 GNYGNR---------LKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQDLEDTYNVPFKA 165
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
CV EG V+SVMCSYN+VNG PTCADP +L TIRG W+ +GYIVSDCDS+ + + +
Sbjct: 166 CVLEGKVASVMCSYNQVNGKPTCADPDILRNTIRGQWHLNGYIVSDCDSVGVLYDDQHYT 225
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
T E+A A + AGLDLDCG + T GA++QG + EA ++ +L V MRLG FD
Sbjct: 226 R-TPEEAAADTINAGLDLDCGPFLAVHTEGAIRQGLVTEAAVNQALANTITVQMRLGMFD 284
Query: 350 GSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
G P + NLG ++C P H +LA +AAR+GIVLLKN G+LPL+T + +A++GP+A
Sbjct: 285 GEPSAQPFGNLGPRDVCTPAHQDLALQAAREGIVLLKNQVGSLPLSTVRHRNIAVIGPNA 344
Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKV 436
AT MIGNY G C YTSP+ G Y++
Sbjct: 345 QATTTMIGNYAGIACGYTSPLQGISRYART 374
>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
Length = 798
Score = 444 bits (1141), Expect = e-121, Method: Compositional matrix adjust.
Identities = 278/748 (37%), Positives = 398/748 (53%), Gaps = 55/748 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L + CD RAK LV TL EK+ A GVPRLG+P Y+WW+E LHG++
Sbjct: 31 LKNVTICDPSASPLARAKSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWNEGLHGIA- 89
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
G TN +H E +TSFP IL A+F++ L ++ + +STEARA N GL
Sbjct: 90 -GPYTNF---SHSGVEWSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRTGL 145
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
FW+PNIN RDPRWGR ETPGED Y + Y + GLQ Y R +
Sbjct: 146 DFWTPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQGEATDPYKR--------VV 197
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKH+A YD+++W GN R+ D ++T+QD+ E ++ PF+ CV + +V + MCSYN VNG
Sbjct: 198 ATCKHFAGYDVEDWNGNLRYQNDVQITQQDLVEYYLAPFQACV-QANVGAFMCSYNAVNG 256
Query: 249 IPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
P CADP LL +R W ++ ++ DCD++Q + H++ + T+ A A L AG
Sbjct: 257 APPCADPYLLQTILREHWGWNKEEQWVTGDCDAVQNVYFPHQW-SSTRAGAAADSLVAGT 315
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNI 362
D+ CG Y A +Q + E+ +D +L Y L+RLGYFD +P+ Y+ LG + +
Sbjct: 316 DITCGTYMQEHLPAAFRQKLLNESSLDLALIRQYSSLVRLGYFD-APENQPYRQLGFDAV 374
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
LA AA +GIVLLKND G LPL+ + T+ L G ANAT ++GNY G
Sbjct: 375 ATNASQALARRAAAEGIVLLKND-GTLPLSLDSSMTVGLFGDWANATTQLLGNYAGVATY 433
Query: 423 YTSPMDGFYAYSKVINYA----PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
SP+ INYA G D S + A +D + V G+D VE
Sbjct: 434 LHSPLYALKQTGVKINYAGGKPGGQGDPTTNRWSNL---YGAYSTSDVLIYVGGIDNGVE 490
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
EG DR L G Q ++I ++A+ K PV +V+ G +D + NNP I +I+W GYP
Sbjct: 491 EEGHDRGYLTWTGPQLDVIGQLAETGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYP 549
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPV 597
G++GG AI D+I GK P GRLP T Y A+Y + +M LRP N PGRTYK+++G
Sbjct: 550 GQDGGSAIIDIISGKTAPAGRLPQTQYPASYAAAVSMMNMNLRPGENNPGRTYKWYNGSA 609
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
V+ FGYG+ YT F +++ + QQ +Y + + C + +C
Sbjct: 610 VFEFGYGMHYTNFSAAIST----------QMQQ----SYAISSLASGCNSTGGFLERCP- 654
Query: 658 YKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAG---QSAKVGF 713
+ ++V N GK+ V + Y + G A K ++ Y+R+ AG +AK+
Sbjct: 655 -FASVDVQVHNTGKVTSDYVTLGYMAGTFGPAPHPRKTLVSYKRLHNIAGGATSTAKLNL 713
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILV 741
T+ S+ VD N +L G +++ +
Sbjct: 714 TL---ASVARVDEYGNKVLYPGHYSLQI 738
>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
Length = 724
Score = 443 bits (1140), Expect = e-121, Method: Compositional matrix adjust.
Identities = 263/761 (34%), Positives = 405/761 (53%), Gaps = 103/761 (13%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
F Y D L + +RAKDLV RMT+ EKV QM + + RLG+P Y WWSEALHGV+ G
Sbjct: 4 FDYQDETLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVARAGV 63
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
AT FP I A+F+E L + + +S EARA ++
Sbjct: 64 ----------------ATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHG 107
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLTFWSPN+N+ RDPRWGR ET GEDPY+ GR +++++GLQ D +
Sbjct: 108 IYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQG---------QDKK 158
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
L+ +AC KH+A + ++R FD+ V+ +D++ET++ F+ CV E +V +VM +Y
Sbjct: 159 YLRAAACAKHFAVHSGPE---SERHRFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAY 215
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRVNG P C LL +T+R +W F G++VSDC +I+ E+H+ + E +VA L
Sbjct: 216 NRVNGEPCCGSNILLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPE-SVALALNN 274
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG-KN 360
G DL+CG+ Y N + A Q+G + E I+T++ L + M+LG FD + Y N+G
Sbjct: 275 GCDLNCGNMYLNLLI-AYQEGLVTEEAINTAVTRLMLTRMKLGLFDAAENVPYTNIGFHQ 333
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
N C +H E A E +++ +VLLKN+N LPL+ I ++A++GP+AN+ +A+ GNY GT
Sbjct: 334 NDCQ-EHREFALEVSKKTLVLLKNENHLLPLDRNTISSIAVIGPNANSREALTGNYFGTA 392
Query: 421 CRYTSPMDGF---YAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVA 471
Y + ++G +++YA GC A+ + + A+ A+ AD V+
Sbjct: 393 SNYITVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEERDRFAEAVSTAERADLVVMCM 452
Query: 472 GLDLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINF 522
GLD S+E E D++ L LPG Q EL+ + K P+ LV+++ A+ + +
Sbjct: 453 GLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYKTGK-PIILVLLAGSALAVTW 511
Query: 523 AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPV 582
A K+ +I+ YPG EGG+A+A IFG+Y+P G+LPIT+Y +T ++
Sbjct: 512 AAE--KVPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEELPEFTDYSMK-- 567
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
RTY++ +YPFGYGL YT F Y+ ++L++ + C
Sbjct: 568 ----NRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTKICAG--------- 606
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERV 701
++V+C I V+N G E V +Y K I + G +++
Sbjct: 607 --------ENVQCS-------ILVKNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKI 651
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ G ++ FT+ + + L +++ N +L G I VG
Sbjct: 652 HLLPGAEQEISFTLTS-RQLALINEKGNCILEPGIFEIYVG 691
>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 783
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 282/737 (38%), Positives = 402/737 (54%), Gaps = 36/737 (4%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD RA L+ T E + + GVPRLGLP Y WWSE LHGV+ T
Sbjct: 41 CDITKDPITRATALIGLWTDEELTSNTVNASPGVPRLGLPAYNWWSEGLHGVAQSPGVTF 100
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
+P G ATSFP IL A+F+++L + I VSTE RA N G AGL +W+PN
Sbjct: 101 APSG-----NFSHATSFPQPILMGAAFDDTLIQAIATIVSTEGRAFNNAGRAGLDYWTPN 155
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKISACCKH 193
IN +DPRWGR ETPGEDP+ + +Y N + GLQ D +P K+ A CKH
Sbjct: 156 INPFKDPRWGRGQETPGEDPFHLSQYVYNLILGLQG--------GLDPKPYFKVVADCKH 207
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+AAYDL+NWEG R FD+ V++QD+ E ++ PF+ CV + V+SVMCSYN VNGIP+CA
Sbjct: 208 FAAYDLENWEGIVRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCA 267
Query: 254 DPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ LL +R W F ++ SDCD+++ I+ HK+ D A A L AG D+DCG
Sbjct: 268 NSFLLQDVLRDHWGFTDDRWVTSDCDAVENILTPHKYTTD-PAQAAADALLAGTDIDCGT 326
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIE 369
+ + + A+Q+G + D+ + Y L+RLGYFD + Y+ LG +++ PQ +
Sbjct: 327 FSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFDDPAAQPYRQLGWSDVNTPQAQQ 386
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
LA AA +GIVLLKND G LP + +++ LAL+GP ANAT + G+Y G SP+ G
Sbjct: 387 LAHTAAVEGIVLLKND-GVLPFSK-HVRKLALIGPWANATSLLQGSYIGVAPYLVSPLQG 444
Query: 430 FYAYSKVINYAPGCADIVCQNN-SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLL 488
+ Y G ++ QN+ S AA+ A + ADA V GLD +VE EG DR+++
Sbjct: 445 AQEAGFEVEYVLGT-NVTTQNDMSGFAAAVAAVRRADAVVFAGGLDETVECEGTDRLNVT 503
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
PG Q +L+ ++ K P+ + G +D K++ + +I+W GYPG+ GG A+ D
Sbjct: 504 WPGNQLDLVAELERVGK-PLIVAQFGGGQLDDTALKHSKAVNAIIWGGYPGQSGGTALFD 562
Query: 549 VIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
++ GK P GRLPIT Y A Y K +P T M LRP PGRTYK++ G V+ FG+GL Y
Sbjct: 563 ILTGKAAPAGRLPITQYPAAYTKQVPMTDMSLRPSATNPGRTYKWYSGTPVFEFGFGLHY 622
Query: 608 TQFKYKVASSPKSVDI----KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
T F + A+ + + + I+ V + A + D+ D TF
Sbjct: 623 TTFVFSWAAPSAAAAVDSTASFGSLAKSYSISQLVAHGQESTAFL---DLAPLD---TFA 676
Query: 664 IEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ V N G++ V +++ S G A KQ++ Y RV A + + V ++
Sbjct: 677 VRVTNTGRVASDYVALLFVSGAFGPAPHPKKQLVAYTRVHGLAPRGSTVAQLPVTLGAIA 736
Query: 723 IVDNAANSLLASGAHTI 739
D + G +T+
Sbjct: 737 RADKNGEKWVHPGTYTL 753
>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
RWD-64-598 SS2]
Length = 770
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 258/616 (41%), Positives = 348/616 (56%), Gaps = 27/616 (4%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD L +RA L++ T+ E + + A GVPRLGLP YEWWSE LHGV+ T
Sbjct: 37 CDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPRLGLPAYEWWSEGLHGVANSAGVTW 96
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
S G ATSFP IL +A+F+++L K +G + E RA N G+AGL FW+PN
Sbjct: 97 SITG-----PFSYATSFPQPILMSAAFDDALIKAVGGVIGMEGRAFNNYGHAGLDFWTPN 151
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKISACCKH 193
IN +DPRWGR ETPGEDPY + +Y N ++GLQ D P ++ A CKH
Sbjct: 152 INPFKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG--------GLDPEPYFQVVATCKH 203
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+A YDL++W+ N R+ +++ ++ QD+ E ++ F+ C + + MCSYN +NGIPTCA
Sbjct: 204 FAGYDLEDWDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASMCSYNAINGIPTCA 263
Query: 254 DPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
D LL +RG W F ++ DCDS++ I + H + + A A LKAG D+DCG
Sbjct: 264 DTYLLQDILRGFWGFDQTRWVTGDCDSVEDIYDFHHY-TALPQQAAADALKAGSDIDCGI 322
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIE 369
+YT + A + I E D+ +L Y L+RLGYFD + + Y+ +N+ E
Sbjct: 323 FYTTWLPLAYTESLITEQDLRAALTRQYASLVRLGYFDPASEQPYRQYNWSNVDTSYAQE 382
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
LA AA +GI LLKND G LP ++ IK +AL+GP AT M GNY G SP G
Sbjct: 383 LAYTAAVEGITLLKND-GTLPFSSA-IKNIALIGPWTFATTQMQGNYYGNAPYLISPYQG 440
Query: 430 FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
I+Y ++ AA AA+ ADA V V G+D +VEAE DR D+
Sbjct: 441 AQLAGYNISYVLET-NVTSNTTDGYAAAFTAAQGADAIVFVGGIDNTVEAEAMDRNDITW 499
Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
P FQ LI ++ K P+ +V G VD NP + ++LW GYPG+ GG+A+ D+
Sbjct: 500 PAFQLWLIGELGKLGK-PLVVVQFGGGQVDDTEINANPDVNALLWGGYPGQSGGQALFDI 558
Query: 550 IFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNN---FPGRTYKFFDGPVVYPFGYGL 605
I GK P GRL T Y A+YV +IP T+M LRP N PGRTYK++ G VY FGYGL
Sbjct: 559 ISGKVAPAGRLVSTQYPADYVNEIPMTNMNLRPDANGTTSPGRTYKWYTGTPVYEFGYGL 618
Query: 606 SYTQFKYKVASSPKSV 621
YT F Y +P +
Sbjct: 619 HYTNFTYAWTKAPAAT 634
>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
Length = 724
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 268/761 (35%), Positives = 404/761 (53%), Gaps = 103/761 (13%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
F Y D L + +RAKDLV RMTL EKV QM ++ +PRLG+P Y WWSEALHGV+ G
Sbjct: 4 FAYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARAGV 63
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
AT FP I A+F+E L + + +S E RA ++
Sbjct: 64 ----------------ATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHG 107
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLTFWSPN+N+ RDPRWGR ET GEDPY+ GR +++++GLQ D +
Sbjct: 108 IYKGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQG---------QDKK 158
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
L+ +AC KH+A + ++R FD+ V+ +D++ET++ F+ CV E +V +VM +Y
Sbjct: 159 YLRAAACAKHFAVHSGPE---SERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAY 215
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRVNG P C LL +T+R +W F G++VSDC +I+ E+H+ + E +VA L
Sbjct: 216 NRVNGEPCCGSNMLLKETLRREWGFTGHVVSDCWAIKDFHENHRVTSSAPE-SVAMALNN 274
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG-KN 360
G DL+CG+ Y N + A Q+G + E I+T++ L + M+LG FD + Y +G
Sbjct: 275 GCDLNCGNMYLNLLI-AYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENVPYTKIGFHQ 333
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
N C +H E A E +++ +VLLKN+N LPL+ I ++A++GP+AN+ +A+ GNY GT
Sbjct: 334 NDCQ-EHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTA 392
Query: 421 CRYTSPMDGF---YAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVA 471
Y + ++G +++YA GC A+ + + A+ A+ AD V+
Sbjct: 393 SNYITVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCM 452
Query: 472 GLDLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINF 522
GLD S+E E D++ L LPG Q EL+ + K P+ LV+++ A+ + +
Sbjct: 453 GLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTW 511
Query: 523 AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPV 582
A KI +I+ YPG EGG+A+A IFG+Y+P G+LPIT+Y +T ++
Sbjct: 512 AAE--KIPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEELPEFTDYSMK-- 567
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
RTY++ +YPFGYGL YT F Y+ ++L++ Q +VG N
Sbjct: 568 ----NRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTQ------ISVGENV 609
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERV 701
+VL V+N G E V +Y K I + G ++V
Sbjct: 610 Q--GSVL----------------VKNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKV 651
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ G +V FT+ + L +++ N +L G I VG
Sbjct: 652 HLLPGTEQEVFFTLTP-RQLALINEEGNCILEPGVFEIYVG 691
>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
Length = 462
Score = 440 bits (1132), Expect = e-120, Method: Compositional matrix adjust.
Identities = 217/455 (47%), Positives = 302/455 (66%), Gaps = 13/455 (2%)
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLG 358
+A LDLDCG + T AV++G + EA+I+ +L V MRLG FDG P Y NLG
Sbjct: 5 QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
++C P H ELA EAARQGIVLLKN LPL+T + +++A++GP++N T MIGNY G
Sbjct: 65 PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
C YT+P+ G Y+K I Y GCAD+ C ++ AA+DAA+ ADATV+V GLD S+E
Sbjct: 125 VACGYTTPLQGIGRYAKTI-YQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIE 183
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
AE +DR +LLLPG Q ELI+KVA A+KGP LV+MS G +D++FA+N+PKI I+W GYP
Sbjct: 184 AESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYP 243
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDG 595
G+ GG AI+DV+FG NPGG+LP+TWY +YV +P T+M +RP N +PGRTY+F+ G
Sbjct: 244 GQAGGAAISDVLFGTTNPGGKLPMTWYPQDYVTNLPMTNMAMRPSKSNGYPGRTYRFYKG 303
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
VVYPFG+G+SYT F + +AS+P V + LD +Q N T+ A+ + +C
Sbjct: 304 KVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRQASR-NATISGK-----AIRVTHARC 357
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
F Q++V+N G MDG+ ++VYSKPP +KQ++ +E+V +AAG +VG +
Sbjct: 358 NRLSFGVQVDVKNTGSMDGTHTLLVYSKPPAGHWAPLKQLVAFEKVHVAAGTQQRVGINV 417
Query: 716 NACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
+ CK L +VD + + GAH++ +G+ VS
Sbjct: 418 HVCKFLSVVDRSGIRRIPMGAHSLHIGDVKHSVSL 452
>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
Length = 799
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 273/722 (37%), Positives = 390/722 (54%), Gaps = 62/722 (8%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L D CD Y +RA+ L+ TL E + + GVPRLGLP YE WSE LHG+
Sbjct: 58 LKDNIVCDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSEGLHGLD- 116
Query: 69 IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
HF E ATSFP IL+ A+ N +L +I ++T+ARA N+G
Sbjct: 117 ---------RAHFVKSGDEWTWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGR 167
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDP-YVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GL ++PNIN R P WGR ETPGED ++ YA Y+ GLQ G+ D
Sbjct: 168 YGLDAYAPNINGFRSPLWGRGQETPGEDANFLTSSYAYEYITGLQG--GI------DPDN 219
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
LKI+A KH+A YDL+NW GN R FD+R+T+QD+ E + F S MCSYN
Sbjct: 220 LKIAATAKHFAGYDLENWGGNSRLGFDARITQQDLAEYYTPQFLAASRYAKARSFMCSYN 279
Query: 245 RVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
VN IP+C+ LL +R W+F +GY+ SDCD++ + H + ++ + A A L+
Sbjct: 280 SVNAIPSCSSSFLLQTLLREQWDFPEYGYVSSDCDAVYNVFNPHGYASN-QSSAAAESLR 338
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-QYKNLGKNN 361
AG D+DCG Y+ + +G + +I+ S+ LY L++LGYFDG +Y+ LG N+
Sbjct: 339 AGTDIDCGQTYSWHLNQSFIEGSVTRGEIERSILRLYSNLVKLGYFDGDKNEYRQLGWND 398
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ ++ EAA +GIVLLKND G LPL + N+K++ALVGP ANATK + GNY GT
Sbjct: 399 VVTTDAWNISYEAAVEGIVLLKND-GVLPL-SKNVKSVALVGPWANATKQLQGNYFGTAP 456
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+P+ G +NYA G +I A+ AAK +D V + G+D ++EAEG
Sbjct: 457 YLITPLQGASDAGYKVNYALGT-NISGNTTDGFANALSAAKKSDVIVYLGGIDNTIEAEG 515
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR+++ P Q +LI +++ K P+ ++ M G VD + K+N K+ +++W GYPG+
Sbjct: 516 TDRMNVTWPRNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKSNSKVNALIWGGYPGQS 574
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRP-VNNFPGRTYKFFDGPVVY 599
GG+AI D++ GK P GRL T Y A Y + P T M LRP + PG+TY ++ G VY
Sbjct: 575 GGKAIFDILKGKRAPAGRLVSTQYPAEYATQFPATDMSLRPDGKSNPGQTYMWYIGKPVY 634
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
FGYGL YT FK KL DI+ V + + P Y+
Sbjct: 635 EFGYGLFYTTFKETAK--------KLGSSSSSFDISEIVSSPRSPS------------YE 674
Query: 660 FTFQI-------EVENMGKMDGSEVVMVYSKP--PGIAGTHIKQVIGYERV-FIAAGQSA 709
++ + ++N GK M+++ G A K ++GY+R+ I G+SA
Sbjct: 675 YSELVPFLNVTATIKNTGKTASPYTAMLFANTTNAGPAPYPNKWLVGYDRLPSIEPGKSA 734
Query: 710 KV 711
+
Sbjct: 735 DL 736
>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
Length = 715
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 273/759 (35%), Positives = 404/759 (53%), Gaps = 104/759 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D + ERAKDLV RMT+ EKV QM + + RLG+P Y WW+EALHGV+ G
Sbjct: 7 YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT-- 64
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+F+E L K+ +STE RA Y+ +
Sbjct: 65 --------------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIY 110
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDPY+ R + +V+GLQ + + L
Sbjct: 111 KGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQG---------NHPKYL 161
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K +AC KH+A + + R F++ V+++D+ ET++ F+ V E V SVM +YNR
Sbjct: 162 KAAACAKHFAVHSGPE---SLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNR 218
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
NG P C LL+ +RG+W F G++VSDC +I+ H + T ++ A ++ G
Sbjct: 219 TNGEPCCGSKTLLSDILRGEWGFKGHVVSDCWAIRDF-HMHHHVTATAPESAALAVRNGC 277
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
DL+CG+ + N + A+++G I E +ID ++ L I M+LG FD Q Y ++ + +
Sbjct: 278 DLNCGNMFGNLLI-ALKEGLITEEEIDRAVTRLMITRMKLGMFDPEDQVPYASISYDFVD 336
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+H ELA + A++ IVLLKND G LPL+ I+++A++GP+A++ +A+IGNYEGT Y
Sbjct: 337 CKEHRELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEY 395
Query: 424 TSPMDGFYAYSK---VINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
+ +DG + I Y+ GC + + + I A+ A++AD ++ GLD
Sbjct: 396 VTVLDGIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLD 455
Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
++E E D+ DL LPG Q EL+ V K P+ LV+++ A+ + +A
Sbjct: 456 STIEGEEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWADE 514
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+ I +IL YPG GGRAIA V+FG+ NP G+LP+T+Y +T +
Sbjct: 515 H--IPAILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYRTTEELPDFTDYSME----- 567
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
RTY+F +YPFG+GLSYT F Y D+KL KD T+ +
Sbjct: 568 -NRTYRFMKNEALYPFGFGLSYTTFDYS--------DLKLSKD--------TIRAGE--- 607
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK--QVIGYERVFI 703
F ++V N GKM G EVV VY K A + Q+ G +RV +
Sbjct: 608 -------------GFNVSVKVTNTGKMAGEEVVQVYIKDLE-ASWRVPNWQLSGMKRVRL 653
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+G++A++ F + + L +V + S++ G I VG
Sbjct: 654 ESGETAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691
>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 792
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 273/754 (36%), Positives = 399/754 (52%), Gaps = 48/754 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA L+ TL E V G+ + GVPRLGLP Y+ W+EALHG+
Sbjct: 57 LSKTIVCDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLD- 115
Query: 69 IGRRTNSPPGTHFDSE--VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
+F E +TSFP ILT ++ N +L ++ +ST+ RA N G
Sbjct: 116 ---------RAYFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRY 166
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPL 185
GL +SPNIN R P WGR ETPGED Y + YA Y+ G+Q GV D + L
Sbjct: 167 GLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG--GV------DPKSL 218
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K+ A KHYA YD++NW+G+ R D +T+QD+ E + F + + V SVMCSYN
Sbjct: 219 KLVATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNA 278
Query: 246 VNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
VNG+P+CA+ L +R + F GYI SDCDS + H++ + A A ++A
Sbjct: 279 VNGVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSS-AAADSIRA 337
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNI 362
G D+DCG Y + AV Q ++ ADI+ + LY LMRLGYFDG S Y+NL N++
Sbjct: 338 GTDIDCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDGNSSAYRNLTWNDV 397
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
++ E +G VLLKND G LPL+ +I+++ALVGP N + + GNY G
Sbjct: 398 VTTNSWNISYEV--EGTVLLKND-GTLPLSE-SIRSIALVGPWMNVSTQLQGNYFGPAPY 453
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
SP+D F +NYA G +I + A+ AAK +DA + G+D S+EAE
Sbjct: 454 LISPLDAFRDSHLDVNYAFGT-NISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETL 512
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DR+++ PG Q ELI++++ K P+ ++ M G VD + K+N + S++W GYPG+ G
Sbjct: 513 DRMNITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSG 571
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G+A+ D+I GK P GRL +T Y A Y + P T M LRP N PG+TY ++ G VY F
Sbjct: 572 GQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNNPGQTYMWYTGTPVYEF 631
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT F+ A + +K+ +D+ +P + ++ +
Sbjct: 632 GHGLFYTTFRVSHARA-----VKIKPTYNIQDL-----LAQPHPGYIHVEQMPF----LN 677
Query: 662 FQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
F +++ N GK M+++ G A K ++G++R+ ++K+ S
Sbjct: 678 FTVDITNTGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTIPVTINS 737
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
+ D N +L G + + + V PL L
Sbjct: 738 MARTDELGNRVLYPGKYELALNNE-RSVVLPLSL 770
>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
Length = 774
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 272/751 (36%), Positives = 399/751 (53%), Gaps = 57/751 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ CDA L P+RA LV MT EK+Q + + G PR+GLP Y WWSEALHGV++
Sbjct: 36 LASLKVCDATLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVAY 95
Query: 69 IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
PGT F D +TSFP +L A+F++ L +K+G+ + TE RA N G
Sbjct: 96 A-------PGTQFRSGDGPFNSSTSFPMPLLMAATFDDELIEKVGEVIGTEGRAFGNAGF 148
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+G +W+PN+N +DPRWGR ETPGED + RYA + +RGLQ G R
Sbjct: 149 SGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQ---GPLPER------- 198
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
++ A CKHYAA D ++W G+ R FD++VT QD+ E ++ PF+ C + V S+MCSYN
Sbjct: 199 RVVATCKHYAANDFEDWNGSTRHDFDAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNA 258
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
VNG+P CA+ L+ +R WN+ YI SDC+++ I +H + T + A +
Sbjct: 259 VNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLDIFANHHYAK-TNAEGTALAFE 317
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNN 361
AG D C ++ GA QG + ++ +D +L LY L+R+GYFDG+ +Y +LG +
Sbjct: 318 AGTDSSCEYESSSDIPGAWTQGLLEQSTVDRALTRLYEGLVRVGYFDGNHSEYASLGWKD 377
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNG-ALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ +P+ E+A + A +GIVLLKND L L T LA++G AN K + G Y G P
Sbjct: 378 VNSPKSQEVALQTAVEGIVLLKNDQTLPLGLKTDPKSKLAMIGFWANDPKTLSGGYSGKP 437
Query: 421 CRYTSPMDGFYAYSKVINYAPG-CADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
SP+ A + A G N++ AA++AA++A+ + GLD S
Sbjct: 438 AFEHSPVYAAEAMGFNVTTAGGPVLQNSTSNDTWTQAALEAAQDANYILYFGGLDTSAAG 497
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E KDR + P Q +LI + K P+ +V M +D + SILW +PG
Sbjct: 498 ETKDRTTINWPEAQLQLIKTLTKLGK-PLVVVQM-GDQLDNTPLLATKTVNSILWANWPG 555
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
++GG A+ ++ G +P GRLP+T Y ANY +P T M LRP + PGRTY+++ V
Sbjct: 556 QDGGTAVMQILTGLKSPAGRLPVTQYPANYTAAVPMTDMNLRPSDRLPGRTYRWYPT-AV 614
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
PFG+GL YT F+ K+A+ + I+ D +C N N P L
Sbjct: 615 QPFGFGLHYTTFQAKIAAPLPRLAIQ-DLLSRCGGDN----ANAYPDTCALP-------- 661
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYERVF-IAAGQ--SAKVG 712
++EV N G VV+ + G AG IK ++ Y R+ ++ G +A +
Sbjct: 662 --PLKVEVTNSGNRSSDYVVLAFLA--GDAGPRPYPIKTLVSYTRLRDVSPGHKTTAHLE 717
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+T+ + D N++L G +T+ V E
Sbjct: 718 WTLG---DIARYDEQGNTVLYPGTYTVTVDE 745
>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
Length = 772
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 266/625 (42%), Positives = 364/625 (58%), Gaps = 34/625 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L++ CD RA L+ TL EKV G+ + GVPR+GLP YEWW+EALHG++
Sbjct: 28 LANNTVCDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIA- 86
Query: 69 IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
PGT F S +TSFP IL A+F++ L K+ VSTEARA N+
Sbjct: 87 ------RSPGTTFAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNR 140
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GL FW+PNIN +DPRWGR ETPGEDP+ Y + GLQ G+ D P
Sbjct: 141 FGLNFWTPNINPYKDPRWGRGQETPGEDPFHTSSYVNALITGLQG--GL------DDLPY 192
Query: 186 KIS-ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K A CKH+A YDL++ +G R+ FD+ + QD+++ ++ PF+ C + +V SVMCSYN
Sbjct: 193 KKGVATCKHFAGYDLESSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYN 252
Query: 245 RVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+NG+PTCAD LL +R W + ++ SDCD+++ I + H + T E + A L
Sbjct: 253 AMNGVPTCADDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNY-TLTPEQSAADAL 311
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGK 359
AG DLDCG ++ + A QG + +D SL Y L+RLGYFD Y+ L
Sbjct: 312 NAGTDLDCGTFWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFDPPSVQPYRQLNW 371
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+N+ P +LA +AA GIVLLKND G LPL++ NI +AL+GP ANATK M GNY GT
Sbjct: 372 DNVSTPAAQQLALQAAEDGIVLLKND-GILPLSS-NITNVALIGPLANATKQMQGNYYGT 429
Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
SP+ + Y G ADI QN + AAI AA++AD + V G+D S+EA
Sbjct: 430 APYLRSPLIAAQNAGFKVTYVQG-ADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEA 488
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINFAKNNPKIKSILWVGYP 538
E DR + P Q LIN++A+ + L+I G +D + +N + ++LW GYP
Sbjct: 489 EEIDRTSISWPSSQLSLINQLANLS---TPLIISQMGCMIDSSSLLSNTGVNALLWAGYP 545
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
G++GG AI +++ GK P GRLPIT Y +NYV ++ T M L+P PGRTYK+++G
Sbjct: 546 GQDGGTAIFNILTGKTAPAGRLPITQYPSNYVNQVTMTDMNLQPSRFNPGRTYKWYNGEP 605
Query: 598 VYPFGYGLSYTQFKYKVA-SSPKSV 621
V+ +GYGL YT F K+ SSP +
Sbjct: 606 VFEYGYGLQYTTFDAKITPSSPNNT 630
>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
Length = 516
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 228/521 (43%), Positives = 317/521 (60%), Gaps = 17/521 (3%)
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MCSYNRVNG+PTCAD LL+ T R DW F+GYI SDCD++ I ++ + T EDAVA
Sbjct: 1 MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAK-TAEDAVAD 59
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKN 356
VLKAG+D++CG Y + A+QQGKI E DI+ +L L+ V MRLG F+G P+ Y +
Sbjct: 60 VLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGD 119
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGA--LPLNTGNIKTLALVGPHANATKAMIG 414
+G + +C +H +LA EAA+ GIVLLKND GA LPL+ N+ +LA++G +AN + G
Sbjct: 120 IGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRG 179
Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
NY G PC +P+ Y K ++ GC C N + IP A+ AA +AD+ V+ GLD
Sbjct: 180 NYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLD 238
Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
E E DR+DL LPG Q LI VA+AAK PV LV++ G VD++FAK NPKI +ILW
Sbjct: 239 QDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILW 298
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKF 592
GYPGE GG AIA V+FG++NPGGRLP+TWY ++ ++P T M +R P +PGRTY+F
Sbjct: 299 AGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRF 358
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
+ GP V+ FGYGLSY+++ ++ A+ P + + T G I
Sbjct: 359 YRGPTVFNFGYGLSYSKYSHRFATKPPPT----SNVAGLKAVEATAG-GMASYDVEAIGS 413
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI---AGTHIKQVIGYERVFIAAGQSA 709
C KF + V+N G MDG V+V+ + P +G Q+IG++ + + A Q+A
Sbjct: 414 ETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTA 473
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
V F ++ CK ++ G+H ++VGE +SF
Sbjct: 474 HVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSF 514
>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 711
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 275/747 (36%), Positives = 389/747 (52%), Gaps = 100/747 (13%)
Query: 20 PYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGT 79
P R DL+ ++TLPEK+ +G + V RLG+P Y WW+EALHGV+ G
Sbjct: 23 PMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYNWWNEALHGVARAGV-------- 74
Query: 80 HFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFW 131
AT FP I A+FN+ L K+ +STEARA YNL A GLTFW
Sbjct: 75 --------ATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNLSLAQGRHLQYMGLTFW 126
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
SPNIN+ RDPRWGR ET GEDP++ +V+GLQ +D R LK SAC
Sbjct: 127 SPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQ---------GNDPRYLKASACA 177
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+A + + N R F++ V E+D++ET++ F V+ G V SVMC+YNRVN P
Sbjct: 178 KHFAVH---SGPENGRHTFNAIVDEKDLRETYLYAFHALVDAG-VESVMCAYNRVNDQPC 233
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
C+ LLN +R +W F G++V+DC ++ I HK + E A A +KAG++LDC +
Sbjct: 234 CSGNFLLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGVEVAAA-AIKAGVNLDCSN 292
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKNNICNPQHI 368
AV+Q + E DID+SL L ++LG++D +P YK G +++ N H
Sbjct: 293 VLQKDVEKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYDDPTANPFYK-YGADSVANTAHA 351
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
LA A+Q +VLLKN N LPL+ + +VG ++ + A++GNY G R S ++
Sbjct: 352 TLARAMAQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDALLGNYHGVSNRAVSFVE 411
Query: 429 GFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL---------DLS 476
G + Y G N++ I AA NAD TV V GL D
Sbjct: 412 GITNAVDAGTRVEYDQGSD----YNDTTHFGGIWAAGNADITVAVIGLTPVYEGEEGDAF 467
Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
+ A+G D+ D+ LP + + A K P+ VI + AVDI+ + P +IL
Sbjct: 468 LAAKGGDKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVDISAIE--PYADAILLAW 525
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
YPGE+GG A+AD++FGK +P GRLP+T+Y++ + +P GRTY++F+G
Sbjct: 526 YPGEQGGNALADILFGKVSPAGRLPVTFYQS------FADVPAYDNYAMKGRTYRYFNGK 579
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
V YPFGYGLSYT F Y+ P +I+ KD
Sbjct: 580 VQYPFGYGLSYTSFAYEWQQMP--ANIRTAKDS--------------------------- 610
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
+F I+V+N G MDG EVV VY + P + +K++ ++RV + AG V T+
Sbjct: 611 ---VSFSIKVKNTGSMDGDEVVQVYVEYPAVERMPLKELKAFKRVHVKAGGEETVQLTIP 667
Query: 717 ACKSLKIVDNAANSL-LASGAHTILVG 742
A L+ D A +S L G++ I G
Sbjct: 668 AS-DLQKWDLATSSWKLYPGSYNIFAG 693
>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 781
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 277/732 (37%), Positives = 392/732 (53%), Gaps = 28/732 (3%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD RA L+ T E + + GVPRLGLP Y WWSE LHGV+ T
Sbjct: 41 CDVTKDPITRATALISIWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQSPGVTF 100
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
+P G ATSFP IL A+F++ L + I VSTE RA N G AGL +W+PN
Sbjct: 101 APSG-----NFSYATSFPQPILMGAAFDDPLIQAIATIVSTEGRAFNNAGRAGLDYWTPN 155
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP-LKISACCKH 193
IN +DPRWGR ETPGEDP+ + +Y N + GLQ D +P K+ A CKH
Sbjct: 156 INPFKDPRWGRGQETPGEDPFHLSQYVYNLILGLQG--------GLDPKPYFKVVADCKH 207
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+AAYD+DNWEG R+ F++ V++QD+ E ++ PF+ CV + V+SVMCSYN VNGIP+CA
Sbjct: 208 FAAYDMDNWEGVVRYGFNAVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCA 267
Query: 254 DPKLLNQTIRGDWNFHG--YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ LL +R W F ++ SDCD++Q I H + D A A L AG D+DCG
Sbjct: 268 NSFLLQDVLRDHWGFTDDRWVTSDCDAVQNIFTPHNYTTD-PAQAAADALLAGTDIDCGT 326
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIE 369
+ + + A+Q+G + D+ + Y L+RLGYFD + Y+ LG +++ Q +
Sbjct: 327 FSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFDDPAAQPYRQLGWSDVNTLQAQQ 386
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
LA AA +G+VLLKND G LPL+ ++ LAL+GP ANAT+ + GNY G SP+ G
Sbjct: 387 LAHTAAVEGMVLLKND-GLLPLSK-RVRKLALIGPWANATRLLQGNYFGIAPYLVSPVQG 444
Query: 430 FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
+ Y G + S AA+ AAK ADA V GLD +VE E DR+++
Sbjct: 445 AQQAGFEVEYVFGTNVTTRNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDRLNVTW 504
Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
PG Q +L+ ++ K P+ + G +D K + + +I+W GYPG+ GG A+ D+
Sbjct: 505 PGNQLDLVAELERVGK-PLIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGTALFDI 563
Query: 550 IFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYT 608
+ GK P GRLPIT Y A Y ++P T M LRP PGRTYK++ G V+ FG+GL YT
Sbjct: 564 LTGKAAPAGRLPITQYPAAYAEQVPMTDMTLRPSATNPGRTYKWYSGTPVFEFGFGLHYT 623
Query: 609 QFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVEN 668
F + A+ + D + + + +A +D TF + V N
Sbjct: 624 TFAFAWAAPGAAADSTASFGGPAKSYSISQLVAHGQESAAFLDLAPLD----TFAVRVTN 679
Query: 669 MGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
GK+ V +++ S G A K ++ Y R+ A + + VG ++ D
Sbjct: 680 TGKVASDYVALLFVSGSFGPAPHPKKTLVAYTRIHGLAPRGSTVGQLPVTLGAIARADEN 739
Query: 728 ANSLLASGAHTI 739
+ G +T+
Sbjct: 740 GEKWVHPGTYTL 751
>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
cellulolyticum H10]
Length = 712
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 267/759 (35%), Positives = 393/759 (51%), Gaps = 106/759 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L + ERA DLV RMTL EK Q+ A V RLG+P Y WW+EALHGV+ G
Sbjct: 6 YLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A F++ +KI ++TE RA YN +
Sbjct: 64 --------------ATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIY 109
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
G+TFWSPN+N+ RDPRWGR ET GEDPY+ R + +V+GLQ D + L
Sbjct: 110 KGITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GDGKYL 159
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K +AC KH+A + + +DR HF++ +++DM ET++ FE V E V SVM +YNR
Sbjct: 160 KSAACAKHFAVH---SGPEDDRHHFNAVASQKDMYETYLPAFEALVKEAKVESVMGAYNR 216
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
NG P LL +R DW F G++VSDC +I+ E H + T ++VA LK G
Sbjct: 217 TNGEPCNGSKTLLKDILRDDWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKNGC 275
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
DL+CG+ Y + A+++GKI E DID + L M+LG FD ++ + +
Sbjct: 276 DLNCGNMYL-LILLALKEGKITEEDIDRAAIRLMTTRMKLGMFDDDCEFDKIPYEVNDSI 334
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H +L+ EAAR+ +VLLKN NG LPL++ IK +A++GP+A+++ A+ NY GTP +
Sbjct: 335 EHNKLSLEAARKSMVLLKN-NGLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSHNIT 393
Query: 426 PMDGFYA---------YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
+DG + YS + + + Q + + A+ A+ +D V+ GLD S
Sbjct: 394 ILDGVRSRVSEDTRVWYSLGSHLFMNREEDLAQPDDRLKEAVSMAERSDVVVLCLGLDAS 453
Query: 477 VEAE-----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
VE E G D+ DL LP Q L+N V K P + ++S A+ I A +
Sbjct: 454 VEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD 512
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
K +I+ YPG +GG A A++IFG Y+P GRLP+T+Y++ P+ +
Sbjct: 513 --KAAAIVQCWYPGSKGGLAFAEMIFGDYSPAGRLPVTFYKSTEELPPFEDYSME----- 565
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
RTYKF G +YPFG+GLSYT F+Y P++V+
Sbjct: 566 -NRTYKFMKGEALYPFGFGLSYTNFEYSNIVCPQAVN----------------------- 601
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIGYERVFI 703
+ ++V+N G +D EVV VY K A + + G++R+F+
Sbjct: 602 ----------NGESLSVSVDVQNAGSVDSDEVVQVYIKDME-ASVRVPNHSLCGFKRIFL 650
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+G+ V F +++ +++ IVD + +G T+ VG
Sbjct: 651 KSGEKKTVTFEIDS-RAMTIVDEEGKRYIENGDFTLYVG 688
>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
Length = 712
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 270/759 (35%), Positives = 397/759 (52%), Gaps = 106/759 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L + ERA DLV RMTL EK Q+ A V RLG+P Y WW+EALHGV+ G
Sbjct: 6 YLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A F++ +KI ++TE RA YN NA
Sbjct: 64 --------------ATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYN-ENAKKGDRDI 108
Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
G+TFWSPN+N+ RDPRWGR ET GEDPY+ R + +V+GLQ D +
Sbjct: 109 YKGITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GDGKY 158
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
LK +AC KH+A + + +DR HFD+ V+++D+ ET++ FE V E V SVM +YN
Sbjct: 159 LKTAACAKHFAVH---SGPEDDRHHFDAVVSQKDLYETYLPAFEALVKEAKVESVMGAYN 215
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
R NG P LL +R W F G++VSDC +I+ E H + T ++VA LK+G
Sbjct: 216 RTNGEPCNGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKSG 274
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN 364
DL+CG+ Y + A+++G+I E DID + L MRLG FD ++ + +
Sbjct: 275 CDLNCGNMYL-LILLALKEGRITEEDIDRAAIRLMTTRMRLGMFDDDCEFDKIPYELNDS 333
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
+H +L+ EAA++ +VLLKND G LPL++ IK +A++GP+A+++ A+ NY GTP +
Sbjct: 334 VEHNKLSLEAAKKSMVLLKND-GLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSQNI 392
Query: 425 SPMDGF---YAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
+ +DG + + Y+ G + + Q + + A+ A+ +D V+ GLD
Sbjct: 393 TILDGIRKRVSEDTRVWYSVGSHLFMNREEDLAQPDDRLKEAVSVAERSDVVVLCLGLDA 452
Query: 476 SVEAE-----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAK 524
SVE E G D+ DL LP Q L+N V K P + ++S A+ I A
Sbjct: 453 SVEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAA 511
Query: 525 NNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN 584
+ K +I+ YPG GG A A++IFG Y+P GRLP+T+Y++ P+ +
Sbjct: 512 D--KAAAIVQCWYPGSRGGLAFAEMIFGDYSPAGRLPVTFYKSTEELPPFADYSME---- 565
Query: 585 FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
RTYKF G +YPFG+GLSYT F+Y P++V+ G N
Sbjct: 566 --NRTYKFMKGEALYPFGFGLSYTNFEYSNIVCPQNVN---------------NGEN--- 605
Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK-QVIGYERVFI 703
+ ++V+N G +D EVV VY K + K + G++R+ +
Sbjct: 606 ---------------LSVSVDVQNAGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHL 650
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+G+ V F +++ ++ IVD A + +G T+ VG
Sbjct: 651 KSGEKKTVTFEIDS-NAMTIVDEAGKRYIENGEFTLYVG 688
>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
2508]
gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 770
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 267/750 (35%), Positives = 401/750 (53%), Gaps = 55/750 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ CD L P+RA LV MT EK+Q + + G PR+GLP Y WWSEALHGV++
Sbjct: 36 LASLKVCDVTLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVAY 95
Query: 69 IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
PGT F D +TSFP +L A+F++ L +K+G+ + TE RA N G
Sbjct: 96 A-------PGTQFWSGDGPFNASTSFPMPLLMAATFDDELIEKVGEVIGTEGRAFGNAGF 148
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+G +W+PN+N +DPRWGR ETPGED + RYA + +RGLQ +R
Sbjct: 149 SGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQ----------GPARER 198
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
++ A CKHYAA D ++W G+ R F+++VT QD+ E ++ PF+ C + V S+MCSYN
Sbjct: 199 RVVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNA 258
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
VNG+P CA+ L+ +R WN+ YI SDC+++ I +H + +T + A +
Sbjct: 259 VNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLDISANHHYA-ETNAEGTALAFE 317
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNN 361
AG+D C ++ GA QG + ++ +D +L+ +Y L+R+GYFDG+ +Y +LG +
Sbjct: 318 AGIDSSCEYESSSDIPGAWTQGLLEQSTVDRALKRIYEGLVRVGYFDGNHSEYASLGWKD 377
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLN--TGNIKTLALVGPHANATKAMIGNYEGT 419
+ +P+ E+A +AA +GIVLLKND LPL+ T LA++G AN K + G Y G
Sbjct: 378 VNSPKSQEVALQAAVEGIVLLKNDK-TLPLDLRTDPKSKLAMIGFWANDPKTLSGGYSGK 436
Query: 420 PCRYTSPMDGFYAYSKVINYAPG-CADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
P SP+ A + A G N++ AA++AAK+A+ + G D S
Sbjct: 437 PAFEHSPVYAAQAMGFSVTTAGGPVLQNSTSNDTWTQAALEAAKDANYILYFGGQDTSAA 496
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
E KDR + P Q +LI ++ K P+ +V M +D + +ILW +
Sbjct: 497 GETKDRTTINWPEAQLQLITTLSKLGK-PLVVVQM-GDQLDNTPLLAAKAVNAILWANWL 554
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPV 597
G++GG A+ ++ G NP GRLP+T Y ANY +P T M LRP + PGRTY+++
Sbjct: 555 GQDGGTAVMQILTGLKNPAGRLPVTQYPANYTAAVPMTDMNLRPSDKLPGRTYRWYPT-A 613
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
V PFG+GL YT F+ K+A + I+ D +C N N P L
Sbjct: 614 VQPFGFGLHYTTFQTKIAVPLPRLAIQ-DLLSRCGGDN----ANAYPDTCALP------- 661
Query: 658 YKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQ--SAKVGF 713
++EV N G VV+ + + G IK ++ Y R+ ++ G +A + +
Sbjct: 662 ---PLKVEVTNSGNRSSDYVVLAFLAGDVGPKPYPIKTLVSYTRLRDLSPGHKTTAHLKW 718
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGE 743
T+ + D N++L G +T+ V E
Sbjct: 719 TLG---DIARYDEQGNTVLYPGTYTVTVDE 745
>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
Length = 783
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 283/762 (37%), Positives = 409/762 (53%), Gaps = 65/762 (8%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS+ C+ +RAK LV +T EK G+ + GVPRLGL Y+WW EALHGV+
Sbjct: 29 LSNNTVCNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQEALHGVA- 87
Query: 69 IGRRTNSPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
S PG +F + + ATSFP IL +A+F+++L + VSTEARA N+ +
Sbjct: 88 ------SSPGVNFSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNRS 141
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
GL FW+PNIN +DPRWGR ETPGED + + Y + GLQ G+ + P+K
Sbjct: 142 GLDFWTPNINPYKDPRWGRGQETPGEDTFHLKSYVAALIDGLQG--GL-------NPPIK 192
Query: 187 -ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ A CKH+ AYDL++W DR++FD+ V+ QD+ E ++ PF+ C + V S+MCSYN
Sbjct: 193 KVIATCKHFVAYDLEDWITTDRYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIMCSYNA 252
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
+NG+PTCADP +L +R WN+ Y+ SDCD+IQ I H + T+E AVA L
Sbjct: 253 MNGVPTCADPYILQTVLREHWNWTDDGQYVTSDCDAIQNIYAPH-YYEPTREQAVADALT 311
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
AG DL+CG YY A +G + ID ++ LY L++LGYFD + Y++L +
Sbjct: 312 AGTDLNCGTYYQTHLPAAFSEGLFNQTVIDQTITRLYSALIKLGYFDPPSATPYRSLNWS 371
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLN--TGNIKTLALVGPHANATKAMIGNYEG 418
++ P LA +AA +GIVLLKND G LPL+ T T+A++G ANAT M GNY G
Sbjct: 372 DVSTPAAEALALKAAEEGIVLLKND-GLLPLSFPTDKNTTVAIIGGWANATTTMQGNYFG 430
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAA------IDAAKNADATVIVAG 472
SP+ YA ++ N + V +P + AA AD +I G
Sbjct: 431 IAPYLHSPL---YALQQLPN-----INAVYGGGFGVPTTDGWDELLGAAGEADLIIIADG 482
Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
L S E+E DR + ++IN++ + G T+ + +D NNP I ++
Sbjct: 483 LTTSDESESNDRYTIGWQPAAIDIINQL--SGMGKPTVFLQMGDQLDNTPLLNNPNISAL 540
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRT 589
+W GYPG GG A+ +++ GK P GRLP+T Y A+YV ++ T M LRP + PGRT
Sbjct: 541 IWGGYPGMAGGDALINILTGKAAPAGRLPVTQYPADYVNQVNMTDMELRPNATSGNPGRT 600
Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
YK+++ V+ PFGYGL YT F ++ ++ + Q N + G + L
Sbjct: 601 YKWYNNAVL-PFGYGLHYTNFSVAASAQGQA------QTQSGPSSNSSQGQGTSYNISSL 653
Query: 650 IDDVKCKDYKF-------TFQIEVENMGKMDGSEVVMV--YSKPPGIAGTHIKQVIGYER 700
+ Y + +F + V N G S+ V + S G IKQ++ Y+R
Sbjct: 654 VSSCDRSQYAYLDLCPFESFNVNVTNTGSKLASDFVALGFISGSYGPQPYPIKQLVAYQR 713
Query: 701 VF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
+F I+AG SA + SL D N++L G + +L+
Sbjct: 714 LFNISAGASATATLNL-TLGSLARHDENGNAVLYPGDYGLLI 754
>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
Length = 797
Score = 430 bits (1106), Expect = e-117, Method: Compositional matrix adjust.
Identities = 250/615 (40%), Positives = 352/615 (57%), Gaps = 26/615 (4%)
Query: 3 ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
+ I L D CD Y +RA+ L+ TL E + + A GVPRLGLP Y+ WSEA
Sbjct: 52 DCINGPLKDNIVCDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEA 111
Query: 63 LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
LHG+ T+ E ATSFP IL+ A+ N +L +I + T+ARA N
Sbjct: 112 LHGLDRANFATSG-------DEWTWATSFPMPILSMAALNRTLINQIAGIIGTQARAFNN 164
Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDP-YVVGRYAINYVRGLQDVEGVEYHRDSD 181
G GL ++PNIN R P WGR ETPGED ++ YA Y+ GLQ GV D
Sbjct: 165 AGRYGLDAYAPNINGFRSPLWGRGQETPGEDANFLSSSYAYEYITGLQG--GV------D 216
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
LK+ A KH+A YDL+NW GN R FD+ +T+QD+ E + F S MC
Sbjct: 217 PDHLKVVATAKHFAGYDLENWGGNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMC 276
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
SYN VNG+P+C+ LL +R +W+F +GY+ SDCD++ + H + ++ + A A
Sbjct: 277 SYNSVNGVPSCSSSFLLQTLLRDNWDFPEYGYVSSDCDAVYNVFNPHGYASN-QSAAAAD 335
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLG 358
L+AG D+DCG Y + +G + +I+ S+ LY L++LGYFDG +Y+ LG
Sbjct: 336 SLRAGTDIDCGQTYPWNLNQSFIEGSVTRGEIERSIVRLYSNLVKLGYFDGDKSEYRQLG 395
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
N++ ++ EAA +GIVLLKND G LPL + ++K++AL+GP ANAT+ + GNY G
Sbjct: 396 WNDVVTTDAWNISYEAAVEGIVLLKND-GILPL-SKHVKSIALIGPWANATEQLQGNYYG 453
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
T +P+ G +NYA G +I+ A+ AAK +D V + G+D ++E
Sbjct: 454 TAPYLITPLQGASDAGYKVNYALGT-NILGNTTEGFADALSAAKKSDVIVYLGGIDNTIE 512
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
AEG DR+++ PG Q +LI +++ K P+ ++ M G VD + K N K+ +++W GYP
Sbjct: 513 AEGTDRMNVTWPGNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKANSKVNALVWGGYP 571
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVN-NFPGRTYKFFDGP 596
G+ GG AI D++ GK P GRL T Y A Y + P T M LRP + PG+TY ++ G
Sbjct: 572 GQSGGTAIFDILSGKRVPAGRLVTTQYPAEYATQFPATDMNLRPDGASNPGQTYMWYTGT 631
Query: 597 VVYPFGYGLSYTQFK 611
VY FGYGL YT FK
Sbjct: 632 PVYDFGYGLFYTTFK 646
>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
Length = 908
Score = 430 bits (1106), Expect = e-117, Method: Compositional matrix adjust.
Identities = 276/713 (38%), Positives = 394/713 (55%), Gaps = 57/713 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD L ER LV+ +TL EK+ + D A G RLGLP YEWW+EA HGV
Sbjct: 163 CDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNEATHGV-------G 215
Query: 75 SPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
S PG F S+ ATSFP IL ASF+ +L +KI + + E RA N G +G FW
Sbjct: 216 SAPGVQFTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNGFSGFDFW 275
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
+PNIN RDPRWGR ETPGED +V Y N++ GLQ D + ++ A C
Sbjct: 276 APNINGFRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQ---------GDDPKNKQVIATC 326
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KHYA YDL+ R+ + T+QD+ + F+ PF+ CV + DV S+MCSYN V+GIP
Sbjct: 327 KHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPA 382
Query: 252 CADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
CA+ LL++ +R W F+ Y+VSDC+++ I + H F DT+E A A L AG+DL+
Sbjct: 383 CANEYLLDEVLRKHWGFNADYHYVVSDCNAVTDIWQYHNF-TDTEEAAAAVALNAGVDLE 441
Query: 309 CGDYY--TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQ 366
CG Y N ++ A Q A +D SL LY L +G+FDG +Y +L +++ P
Sbjct: 442 CGSSYLKLNESLAANQTSVKA---MDQSLARLYSALFTIGFFDGG-KYDHLDFSDVSIPA 497
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGTPCRYTS 425
LA EAA +G+ LLKND G LPL++ + K++A++GP ANAT M G Y G S
Sbjct: 498 AQALAYEAAVEGMTLLKND-GLLPLHSQHKYKSVAVIGPFANATTQMQGGYSGNAPYLIS 556
Query: 426 PMDGFYA-YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
P+ F + + +NYA G A I QN + A++ AAK +D V + G+D S+E+E DR
Sbjct: 557 PLVAFESDHRWKVNYAVGTA-INDQNTTGFEASLAAAKKSDLIVYLGGIDNSIESETIDR 615
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
L PG Q +LI +++ +K P+ +V G VD + N I++++W GYP + GG
Sbjct: 616 TSLAWPGNQLDLIKSLSNLSK-PMVVVQFGGGQVDDSALLENKDIQALIWAGYPSQSGGT 674
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPF 601
A+ D++ GK +P GRLP+T Y A+Y +I + LRP ++ PGRTYK++ G V PF
Sbjct: 675 ALLDILVGKRSPAGRLPVTQYPASYADQINIFDINLRPNSKDSHPGRTYKWYTGKPVIPF 734
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT+FK+ ++ + Y++ C +K T
Sbjct: 735 GHGLHYTKFKFGW--------------EETLNREYSIQELVASCQRSSGGPIKDNTPFTT 780
Query: 662 FQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
+ V N+G V +++ SK G A K ++ Y+R+ A S +V
Sbjct: 781 VKARVRNVGHETSDYVSLLFLSSKNAGPAPRPNKSLVSYKRLHNIAPGSDRVA 833
>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 728
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 244/589 (41%), Positives = 344/589 (58%), Gaps = 34/589 (5%)
Query: 32 MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHF--DSEVPG-A 88
M++ EKV+ + D + GV LGLP + WW+E LHGV F PG F DSE G A
Sbjct: 1 MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGVGF-------SPGVLFAQDSEPFGYA 53
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
TSFP ILT ASF++ L+ IGQ + E RA N G AG FW+PN+N RDPRWGR E
Sbjct: 54 TSFPLPILTAASFDDDLFNAIGQVIGREGRAFSNYGYAGFNFWTPNMNAFRDPRWGRGQE 113
Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRF 208
TPGED VV Y +YV GLQ SD I A CKH+AAYD++ + +
Sbjct: 114 TPGEDVLVVSNYVQSYVTGLQ---------GSDPTDKVIIAACKHFAAYDIETARRANNY 164
Query: 209 HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF 268
+ T+QD+Q+ ++ F CV + V +VMCSYN V+GIP C+ LL + +R W F
Sbjct: 165 N----PTQQDLQDYYLPAFRRCVRDSHVGTVMCSYNSVDGIPACSSEYLLKEVLRDTWGF 220
Query: 269 ---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGK 325
+ ++VSDC ++ + H F N T++DA + + AG DL+CG Y + G++ +
Sbjct: 221 TNDYQFVVSDCGAVTDVWLLHNFTN-TEQDAASVSMAAGTDLECGSSYLHLN-GSLADKQ 278
Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKND 385
+ + +D +L LY L +GYFDGS + +LG +++ ++A EAAR G+ LLKND
Sbjct: 279 VTQERVDEALTRLYKALFTVGYFDGS-SHSSLGWSDVSTIDAQQIACEAARAGMTLLKND 337
Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV-INYAPGCA 444
G LPL G K++AL+GP ANAT M GNY G SP+ F S + +NYA G
Sbjct: 338 -GVLPLADGKYKSVALIGPFANATTQMQGNYFGRAPFVRSPLWAFTQQSSLQVNYAAGT- 395
Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAA 504
DI ++S A+ AAKN+D + G+D ++EAE DRV + PG Q +LI++++
Sbjct: 396 DINSTSDSGFADALAAAKNSDIVIFCGGIDTTIEAETLDRVSITWPGNQLDLISQLSMLG 455
Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
K P+ + G VD +N + ++ W G PG+ GG A+ D++ GK + GRLP T
Sbjct: 456 K-PLVVAQFGGGQVDDTALVDNANVNALFWAGLPGQAGGLAMYDLVVGKASFAGRLPTTQ 514
Query: 565 YEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY 612
Y A+Y + ++ LRP FPGRTYK++ G V+PFG+GL YT+F +
Sbjct: 515 YPASYADLVSIFNINLRPNGTFPGRTYKWYIGEPVFPFGFGLHYTKFNF 563
>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 797
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 277/760 (36%), Positives = 405/760 (53%), Gaps = 57/760 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD+ Y ERA+ L+ TL E + + GVPRLGLP Y+ W+EALHG+ R
Sbjct: 63 CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLD---RANF 119
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
+ G F+ ATSFP ILTTA+ N +L +I +ST+ARA N G GL ++PN
Sbjct: 120 ATKGGQFE----WATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPN 175
Query: 135 INVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
+N R P WGR ETPGED + + Y Y+ G+Q GV D LK++A KH
Sbjct: 176 VNGFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG--GV------DPEHLKVAATVKH 227
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+A YDL+NW R FD+ +T+QD+ E + F S+MC+YN VNG+P+CA
Sbjct: 228 FAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCAYNSVNGVPSCA 287
Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L +R W F GY+ SDCD++ + H + + A A L+AG D+DCG
Sbjct: 288 NSFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYA-SNQSSAAASSLRAGTDIDCGQ 346
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELA 371
Y + G+++ +I+ S+ LY L+RLGYFD QY++LG ++ ++
Sbjct: 347 TYPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNQYRSLGWKDVVKTDAWNIS 406
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
EAA +GIVLLKND G LPL+ ++++AL+GP ANAT M GNY G SP++
Sbjct: 407 YEAAVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYYGPAPYLISPLEAAK 464
Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPG 491
+N+ G +I + + AI AAK +DA + + G+D ++E EG DR D+ PG
Sbjct: 465 KAGYHVNFELGT-EIAGNSTTGFAKAIAAAKKSDAIIYLGGIDNTIEQEGADRTDIAWPG 523
Query: 492 FQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF 551
Q +LI ++++ K P+ ++ M G VD + K+N K+ S++W GYPG+ GG A+ D++
Sbjct: 524 NQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILS 582
Query: 552 GKYNPGGRLPITWYEANYV-KIPYTSMPLRP-VNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
GK P GRL T Y A YV + P M LRP + PG+TY ++ G VY FG GL YT
Sbjct: 583 GKRAPAGRLVTTQYPAEYVHQFPQNDMNLRPDGKSNPGQTYIWYTGKPVYEFGSGLFYTT 642
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
FK +AS PKS+ YT P FTF+ ++N
Sbjct: 643 FKETLASHPKSLKFNTSSILSAPHPGYTYSEQIP---------------VFTFEANIKNS 687
Query: 670 GKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDN 726
GK + M++ + G A K ++G++R+ I G S+K+ + +L VD+
Sbjct: 688 GKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSSKLSIPI-PVSALARVDS 746
Query: 727 AANSLLASGAHTI-------------LVGEGVGGVSFPLQ 753
N ++ G + + LVGE V ++PL+
Sbjct: 747 HGNRIVYPGKYELALNTDESVKLEFELVGEEVTIENWPLE 786
>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
Length = 717
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 259/763 (33%), Positives = 406/763 (53%), Gaps = 106/763 (13%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
++D + D + ERA+ LV MTL EKV Q A + RLG+P Y +W+EALHGV+
Sbjct: 1 MTDKAWLDETKTFEERAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVAR 60
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-- 126
G AT FP I ASF+E L ++ T+STEARA +N+
Sbjct: 61 AGV----------------ATVFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFG 104
Query: 127 ------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
GLTFWSPN+N+ RDPRWGR ET GEDP++ GR ++++RG+Q
Sbjct: 105 DRDIYKGLTFWSPNVNIFRDPRWGRGHETFGEDPFLSGRLGVSFIRGMQG---------D 155
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
D R +K++AC KH+A + + R F++ V+EQD++ET++ F CV E V +VM
Sbjct: 156 DERYMKVAACAKHFAVHSGPE---DQRHSFNAVVSEQDLRETYLPAFHACVTEAGVEAVM 212
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
+YNR NG C KLL +RG+W F G++ SDC +++ E H + +E+ VA
Sbjct: 213 GAYNRTNGEACCGSKKLLVDILRGEWGFRGHVTSDCWALKDFHEFH-MVTKNQEETVALA 271
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG 358
+ +G DL+CG+ Y + + AV+ G + E+ ID ++ L+ M+LG FD S + Y +G
Sbjct: 272 MNSGCDLNCGNLYVHL-LQAVRDGLVEESVIDRAVTRLFTTRMKLGLFDRSEEVPYNGIG 330
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ + + +L EA+R+ + LLKN +G LPL+ ++T+ +VGP+A+ KA++GNYEG
Sbjct: 331 YDRVDTEANRKLNREASRRTVCLLKNADGLLPLDISKLRTIGVVGPNADNRKALVGNYEG 390
Query: 419 TPCRYTSPMDGFYAYS----KVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATV 468
T Y + +DG + +V+ Y+ GC + Q N I A A+ +D +
Sbjct: 391 TASEYVTVLDGIRELAGDDVRVV-YSEGCHLFRDRVQGLGQPNDRIAEARAVAELSDVVI 449
Query: 469 IVAGLDLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
V GLD +E E D+ +L LPG Q E++ + ++ K PV LV++ A+
Sbjct: 450 AVMGLDPGLEGEEGDQGNEFASGDKPNLELPGLQGEVLKALVESGK-PVVLVLLGGSALA 508
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
I +A+ + + +IL YPG +GGRA+ADV+FG+ P G+LP+T+Y + +T +
Sbjct: 509 IPWAEEH--VPAILDAWYPGAQGGRAVADVLFGRACPEGKLPVTFYRTSEELPAFTDYSM 566
Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
+ RTY++ P +YPFGYGLSYT ++ ++ SVD
Sbjct: 567 K------NRTYRYMKQPALYPFGYGLSYTSWELTNTTAEGSVD----------------- 603
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYE 699
D V C+ + N G M G++ V VY K P G + Q+ G
Sbjct: 604 -----------DGVVCRAV-------LRNTGAMAGAQTVQVYVKAPLATGPN-AQLKGLR 644
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++ + G+SA+V +++ ++ + + +L G + I +G
Sbjct: 645 KIRLQPGESAEVAISLDK-EAFGVYNEKGLRVLLPGEYKIYIG 686
>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
M50/1]
Length = 710
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 259/754 (34%), Positives = 392/754 (51%), Gaps = 109/754 (14%)
Query: 21 YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
Y +RA +LV +MTL EKV Q A V RL + Y WW+EALHGV+ G
Sbjct: 13 YRKRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT--------- 63
Query: 81 FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG--------LTFWS 132
AT FP I A+F+E L +++G VSTEARA +N+ G LTFW+
Sbjct: 64 -------ATVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWA 116
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PN+N+ RDPRWGR ET GEDPY+ R + Y+ GLQ D LK +AC K
Sbjct: 117 PNVNIFRDPRWGRGHETFGEDPYLTSRLGVRYIEGLQG---------HDENYLKAAACAK 167
Query: 193 HYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
H+A + G + R FD+ VTEQD++ET++ FE CV EG V +VM +YNR NG+P
Sbjct: 168 HFAVHS-----GPEAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVP 222
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
C + +LL +R +W F G++ SDC +I+ E H + T ++VA + G DL+CG
Sbjct: 223 CCGNKRLLIDILRKEWGFSGHVTSDCWAIRDFHEGH-HVTGTAIESVAMAMNNGCDLNCG 281
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHI 368
+ F + AV+QG + E +D ++ L++ M+LG FD + Y + + +
Sbjct: 282 TLF-GFLVQAVRQGLVKEERLDEAVTNLFMARMKLGVFDKKEENPYDKIPYLAADSREMK 340
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+L AR+ +VLLKN LPL+ IKT+ ++GP+A++ +A++GNYEGT RY + ++
Sbjct: 341 KLNEAVARRTVVLLKNKEHILPLDKNKIKTIGVIGPNADSRRALVGNYEGTASRYITVLE 400
Query: 429 GFYAY---SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
G Y + Y+ GC + Q N + + K +D V V GLD +E
Sbjct: 401 GIEDYVGDDVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEG 460
Query: 480 E---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
E D+ DL LPG Q E++ K PV LV++S A+ +N+A + +
Sbjct: 461 EEGDAGNEYGSGDKPDLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVD 517
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
+I+ YPG GG AIAD++FG+ NP G+LP+T+Y +P + GRTY
Sbjct: 518 AIVQGWYPGARGGAAIADILFGEANPEGKLPVTFYRTT------EELPDFEDYSMQGRTY 571
Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
++ + +YPFGYGLSYT++ Y+ +++ + + T+G
Sbjct: 572 RYMEQEALYPFGYGLSYTEYAYQ--------NVRFLEQEPVVSEGVTIG----------- 612
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFIAAGQS 708
+ V+N GKMDG+E V VY K H +K+++ ++ + AG+
Sbjct: 613 -------------LSVKNTGKMDGTETVQVYVKAEHSKMPHGQLKKIV---KLPLCAGEE 656
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++ + + ++ + D +L SG I VG
Sbjct: 657 KEINIRLES-EAFMLYDENGEKILPSGHFEIFVG 689
>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
XB6B4]
Length = 710
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 259/754 (34%), Positives = 392/754 (51%), Gaps = 109/754 (14%)
Query: 21 YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
Y +RA +LV +MTL EKV Q A V RL + Y WW+EALHGV+ G
Sbjct: 13 YRKRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT--------- 63
Query: 81 FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG--------LTFWS 132
AT FP I A+F+E L +++G VSTEARA +N+ G LTFW+
Sbjct: 64 -------ATVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWA 116
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PN+N+ RDPRWGR ET GEDPY+ R + Y+ GLQ D LK +AC K
Sbjct: 117 PNVNIFRDPRWGRGHETFGEDPYLTSRLGVRYIEGLQG---------HDENYLKAAACAK 167
Query: 193 HYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
H+A + G + R FD+ VTEQD++ET++ FE CV EG V +VM +YNR NG+P
Sbjct: 168 HFAVHS-----GPEAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVP 222
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
C + +LL +R +W F G++ SDC +I+ E H + T ++VA + G DL+CG
Sbjct: 223 CCGNKRLLIDILRKEWGFSGHVTSDCWAIRDFHEGH-HVTGTAIESVAMAMNNGCDLNCG 281
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHI 368
+ F + AV+QG + E +D ++ L++ M+LG FD + Y + + +
Sbjct: 282 TLF-GFLVQAVRQGLVKEERLDEAVTNLFMARMKLGVFDKKEENPYDKIPYLAADSREMK 340
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+L AR+ +VLLKN LPL+ IKT+ ++GP+A++ +A++GNYEGT RY + ++
Sbjct: 341 KLNEAVARRTVVLLKNKEHILPLDKNKIKTVGVIGPNADSRRALVGNYEGTASRYITVLE 400
Query: 429 GFYAY---SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
G Y + Y+ GC + Q N + + K +D V V GLD +E
Sbjct: 401 GIEDYVGDDVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEG 460
Query: 480 E---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
E D+ DL LPG Q E++ K PV LV++S A+ +N+A + +
Sbjct: 461 EEGDAGNEYGSGDKPDLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVD 517
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
+I+ YPG GG AIAD++FG+ NP G+LP+T+Y +P + GRTY
Sbjct: 518 AIVQGWYPGARGGAAIADILFGEANPEGKLPVTFYRTT------EELPDFEDYSMQGRTY 571
Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
++ + +YPFGYGLSYT++ Y+ +++ + + T+G
Sbjct: 572 RYMEQEALYPFGYGLSYTEYAYQ--------NVRFLEQEPVVSEGVTIG----------- 612
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFIAAGQS 708
+ V+N GKMDG+E V VY K H +K+++ ++ + AG+
Sbjct: 613 -------------LSVKNTGKMDGTETVQVYVKAEHSKMPHGQLKKIV---KLPLCAGEE 656
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++ + + ++ + D +L SG I VG
Sbjct: 657 KEINIRLES-EAFMLYDENGEKILPSGHFEIFVG 689
>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
Length = 581
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 213/401 (53%), Positives = 271/401 (67%), Gaps = 46/401 (11%)
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
DVEG E D +SRPLK+S+CCKHYA YD+D+W V+EQDM+ETF PFE
Sbjct: 4 DVEGTENVTDLNSRPLKVSSCCKHYATYDIDSW---------LNVSEQDMKETFFSPFE- 53
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
R +W+ HGYIVSDC ++ IV++ +L
Sbjct: 54 ---------------------------------RDEWDLHGYIVSDCYGLEVIVDNQNYL 80
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
N++K DAVA+ L+AGLDL+CG YYT+ +V GK+++ ++D +L+ +Y++LMR+GYFD
Sbjct: 81 NESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFD 140
Query: 350 GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
G P Y++LG +IC HIELA EAARQGIVLLKND LPL G K L LVGPHANAT
Sbjct: 141 GIPAYESLGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKLVLVGPHANAT 198
Query: 410 KAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
+ MIGNY G P +Y SP++ F A V YA GC D C N++ A +AAK A+ T+I
Sbjct: 199 EVMIGNYAGLPYKYVSPLEAFSAIGNV-TYATGCLDASCSNDTYFSEAKEAAKFAEVTII 257
Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
G DLS+EAE DRVD LLPG QTELI +VA+ + GPV LV++S +DI FAKNNP+I
Sbjct: 258 FVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRI 317
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV 570
+ILWVG+PGE+GG AIADV+FGKYNPGGRLP+TWYEA+YV
Sbjct: 318 SAILWVGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYV 358
>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
Length = 526
Score = 427 bits (1097), Expect = e-116, Method: Compositional matrix adjust.
Identities = 222/483 (45%), Positives = 305/483 (63%), Gaps = 22/483 (4%)
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEAD 330
YIVSDCDS+ + S + T E+A A+ + AGLDL+CG + N T AV++G I EA
Sbjct: 45 YIVSDCDSLGILYGSQHY-TKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEAA 103
Query: 331 IDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNG 387
I+ ++ + LMRLG+FDG+P+ Y LG ++C ++ ELA E ARQGIVLLKN G
Sbjct: 104 INKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAG 163
Query: 388 ALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIV 447
+LPL+ IKTLA++GP+AN TK MIGNYEG C+YT+P+ G Y GC ++
Sbjct: 164 SLPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVT 223
Query: 448 CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGP 507
C + A AA +ADATV+V G D ++E E DR+DL LPG Q EL+ +VA AA+GP
Sbjct: 224 CTEADLDSAKTLAA-SADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGP 282
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
V LVIMS G DI FAKN+ KI SI+WVGYPGE GG AIADVIFG++NP G+LP+TWY
Sbjct: 283 VVLVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQ 342
Query: 568 NYV-KIPYTSMPLRP--VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIK 624
+YV K+P T+M +RP N + GRTY+F+ G VY FG GLSYT F +++ +PK V +
Sbjct: 343 SYVEKVPMTNMNMRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLN 402
Query: 625 LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD-----YKFTFQIEVENMGKMDGSEVVM 679
LD+ Q CR P C ++ C+ F Q++V N+G +G+E V
Sbjct: 403 LDESQSCRS---------PECQSLDAIGPHCEKAVGERSDFEVQLKVRNVGDREGTETVF 453
Query: 680 VYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
+++ PP + G+ KQ++G+E++ + + V F ++ CK L +VD LA G H +
Sbjct: 454 LFTTPPEVHGSPRKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLL 513
Query: 740 LVG 742
VG
Sbjct: 514 HVG 516
>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
Length = 715
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 271/759 (35%), Positives = 400/759 (52%), Gaps = 104/759 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D + ERAKDLV RMT+ EKV QM + + RLG+P Y WW+EALHGV+ G
Sbjct: 7 YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT-- 64
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+F+E L K+ +STE RA Y+ +
Sbjct: 65 --------------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIY 110
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDPY+ R + +V+GLQ + + L
Sbjct: 111 KGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQG---------NHPKYL 161
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K CK+ + + + R F++ V+++D+ ET++ F+ V E V SVM +YNR
Sbjct: 162 KAGGMCKNILPFTV--VPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNR 219
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
NG P C LL+ +RG+W F G++VSDC +I+ H + T ++ A ++ G
Sbjct: 220 TNGEPCCGSKTLLSDILRGEWGFKGHVVSDCWAIRDF-HMHHHVTATAPESAALAVRNGC 278
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
DL+CG+ + N + A+++G I E +ID ++ L I M+LG FD Q Y ++ C
Sbjct: 279 DLNCGNMFGNLLI-ALKEGLITEEEIDRAVTRLMITRMKLGMFDPEDQVPYASISSFVDC 337
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+H ELA + A++ IVLLKND G LPL+ I+++A++GP+A++ +A+IGNYEGT Y
Sbjct: 338 K-EHRELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEY 395
Query: 424 TSPMDGFYAYSK---VINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
+ +DG + I Y+ GC + + + I A+ A++AD ++ GLD
Sbjct: 396 VTVLDGIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLD 455
Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
++E E D+ DL LPG Q EL+ V K P+ LV+++ A+ + +A
Sbjct: 456 STIEGEEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWADE 514
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+ I +IL YPG GGRAIA V+FG+ NP G+LP+T+Y +T +
Sbjct: 515 H--IPAILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYRTTEELPDFTDYSME----- 567
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
RTY+F +YPFG+GLSYT F Y D+KL KD T+ +
Sbjct: 568 -NRTYRFMKNEALYPFGFGLSYTTFDYS--------DLKLSKD--------TIRAGE--- 607
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK--QVIGYERVFI 703
F ++V N GKM G EVV VY K A + Q+ G +RV +
Sbjct: 608 -------------GFNVSVKVTNTGKMAGEEVVQVYIKDLE-ASWRVPNWQLSGMKRVRL 653
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+G++A++ F + + L +V + S++ G I VG
Sbjct: 654 ESGETAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691
>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
Length = 712
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 266/758 (35%), Positives = 394/758 (51%), Gaps = 104/758 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L + ERA DLV +MTL EK Q+ A V RLG+P Y WW+EALHGV+ G
Sbjct: 6 YLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A F++ +KI ++TE RA YN
Sbjct: 64 --------------ATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIY 109
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
G+TFWSPN+N+ RDPRWGR ET GEDPY+ R + +V+GLQ D + L
Sbjct: 110 KGITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GDGKYL 159
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K +AC KHYA + + +DR FD+ V+++D+ ET++ FE V E V S+M +YNR
Sbjct: 160 KTAACAKHYAVH---SGPEDDRHFFDAIVSQKDLYETYLPAFEALVKEAKVESIMGAYNR 216
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
NG P LL +R W F G++VSDC +I+ E H + T ++VA LK+G
Sbjct: 217 TNGEPCNGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKSGC 275
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
DL+CG+ Y + A+++G I E DID + L M+LG FD ++ N+ +
Sbjct: 276 DLNCGNMYL-LILLALKEGLITEEDIDRAAIRLMTTRMKLGMFDDDCEFDNIPYELNDSA 334
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H +++ EAA++ +VLLKND G LPL++ IK +A++GP+A+++ A+ NY GTP + +
Sbjct: 335 EHNKISLEAAKKSMVLLKND-GLLPLDSKKIKNVAVIGPNADSSLALRANYSGTPSQNVT 393
Query: 426 PMDGF---YAYSKVINYAPGCA------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
++G + + + YA G + + Q + + A+ AA+ +D V+ GLD S
Sbjct: 394 IIEGIRKRVSENTRVWYAMGSHLFLNRDEDLAQPDDRLKEAVSAAERSDVVVLCLGLDAS 453
Query: 477 VEAE-----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
VE E G D+ DL LP Q L+N V K P + ++S A+ I A +
Sbjct: 454 VEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD 512
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
K +I+ YPG GG A A++IFG Y+P GRLP+T+Y++ P+ +
Sbjct: 513 --KAAAIVQCWYPGAIGGLAFAEMIFGDYSPAGRLPVTFYKSTEELPPFADYSME----- 565
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
RTYKF G +YPFG+GLSYT F+Y P++V+ G N
Sbjct: 566 -NRTYKFMKGDALYPFGFGLSYTSFEYSNMVCPQTVN---------------NGEN---- 605
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK-QVIGYERVFIA 704
+ ++V+N G +D EVV VY K + K + G++R+ +
Sbjct: 606 --------------LSVSVDVQNTGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHLK 651
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+G+ V F + A ++ IVD A + +G T+ G
Sbjct: 652 SGEKKTVTFEV-ASNAMSIVDEAGKRHIENGEFTLYAG 688
>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
Length = 744
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 270/744 (36%), Positives = 381/744 (51%), Gaps = 114/744 (15%)
Query: 5 IKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
+ P+CDA L RA D V RMT+PEK+ + + LGLP Y WWSEA
Sbjct: 30 LNATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEASS 89
Query: 65 GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
GV +G R P T F ++P + T SFN +LW+ G + EARA+ N G
Sbjct: 90 GV--MGSR----PTTKF--------AYP--VTTAMSFNRTLWRATGAAIGREARALMNAG 133
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
A T+W+P +N+ R+PRWGR +E PGEDPY+ G YA +V G Q YH
Sbjct: 134 AAYSTYWAPVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQAAPEDPYH------- 186
Query: 185 LKISACCKHYAAYDLDNW-----EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
L+ SACCKHY A +L+N E DR H DS VT++D+ +++++PF+ CV +G VSS+
Sbjct: 187 LQASACCKHYVANELENTRQPDGEQWDRQHVDSNVTQRDLVDSYMVPFQACVEKGKVSSL 246
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MCSYN VNG+P+CA+ LL R W+F GYI SDCD+ + ++H + T E+AVA
Sbjct: 247 MCSYNAVNGVPSCANDWLLRTVARDAWHFDGYITSDCDADSNVYDAHHYAA-TPEEAVAD 305
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-------- 351
VLKAG D+DC + A+ +G I EAD+D L L+ V +RLG+FD S
Sbjct: 306 VLKAGTDVDCQSFVGQHARSALDKGLITEADMDARLVNLFKVRLRLGHFDLSFDAAKPRG 365
Query: 352 PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
P + +C+ H++ + E Q LLKND GALPL T A+VGP+A +KA
Sbjct: 366 PLDEIDADAVVCSDAHLDASMEGLAQSATLLKND-GALPLKPSG--TAAVVGPNALLSKA 422
Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
G Y P D ADA V+
Sbjct: 423 DAGYY--------GPTDA----------------------------------ADAVVLAV 440
Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN--FAKNNPKI 529
G DL+ AEGKD ++ Q ELI+ VA A+ PV +V+ SA +D+ A+++ K+
Sbjct: 441 GTDLTWAAEGKDATSIVFTAAQLELIDAVATASATPVVVVVFSATPLDLTPLLARSDGKV 500
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR-------- 580
+++ VG P + + D+++G+ + GR T Y A Y +I +R
Sbjct: 501 GAVVHVGQPSVT-VKGLGDLLYGRRSFAGRAVQTVYPAAYADQISIFDFNMRPGPSAFAR 559
Query: 581 ----------PVNNFPGRTYKFF-DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
P PGRTY+F+ D PVV PFG+GLSYT F Y V S+P +VD L +
Sbjct: 560 PDCATNESACPRGTNPGRTYRFYVDEPVV-PFGFGLSYTTFAYAVRSAPTTVD--LAPLR 616
Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GI 687
+ P L DD T+ ++V N G +D +VV+ + PP G+
Sbjct: 617 AAYAGVAAARGDGGPAFLSLHDDAAAA----TYAVDVTNTGDIDADDVVLGFVTPPGAGV 672
Query: 688 AGTHIKQVIGYERVFIAAGQSAKV 711
G +K++ G+ERV + AG++ V
Sbjct: 673 DGVPLKELFGFERVHVKAGETKTV 696
>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
Length = 884
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 265/711 (37%), Positives = 382/711 (53%), Gaps = 44/711 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS-FIGRRT 73
CD L +R L+ +MT+ EK + D A G+PR+GLP YEWW+EALHGV+ G
Sbjct: 146 CDTSLSQDDRIAALISQMTVEEKATNLVDGALGLPRIGLPPYEWWNEALHGVAGSRGVSF 205
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
+SP G+ F ATSFP IL A+F++ L + + EARA N ++G FW+P
Sbjct: 206 DSPNGSDFSY----ATSFPLPILMGAAFDDPLIYDVASIIGKEARAFANYAHSGYDFWTP 261
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
N+N DPRWGR LE P ED + RY + V GLQ + H+ +I A CKH
Sbjct: 262 NMNTFLDPRWGRGLEVPTEDSFHAQRYVASLVPGLQGGKEKTDHK-------QIIATCKH 314
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+A YD++ +R + T QD+ E ++ F+ CV + +V S+MCSYN V G+P CA
Sbjct: 315 FAVYDVE----TNRHAQNYEPTPQDLGEYYLPAFKTCVRDVNVGSIMCSYNAVYGVPACA 370
Query: 254 DPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
L +R WNF + Y+ SDC++++ I H F DT+ A A L AG D +CG
Sbjct: 371 SEYFLQDVLRDQWNFNEPYHYVTSDCEAVKDIWTPHNF-TDTEPAAAAVALNAGTDTNCG 429
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIEL 370
Y +V EA +D SL LY L +GYFDG P+Y L ++ P
Sbjct: 430 TSYLQLNT-SVANNWTTEAQMDISLTRLYNALFTVGYFDGQPEYDGLSFADVSTPFAQAT 488
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A AA +GI LLKND G LPL + ++AL+GP ANAT M G Y+G SP+
Sbjct: 489 AYRAASEGITLLKND-GLLPLKK-SYNSVALIGPWANATTQMQGIYQGIAPYLVSPLAAA 546
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
A I++ G A I N + +A+ AA++AD + G+D S+E E +DR + P
Sbjct: 547 QAQWGHISFTNGTA-INSTNTTGFASALSAARDADVIIYAGGIDSSIEKESRDRTSISWP 605
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q +L+ ++++ K P+ +V G VD + N + S++W GYPG++GG A+ DV+
Sbjct: 606 GNQLDLVQQLSELGK-PLVVVQFGGGQVDDSALLRNKNVNSLVWAGYPGQDGGSALIDVL 664
Query: 551 FGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
GK +P GRL IT Y A+Y+ +I LRP ++ PGRTYK+++ V PFGYGL YT
Sbjct: 665 VGKQSPAGRLTITQYPADYINQISLFDPNLRPSDSSPGRTYKWYNKEPVLPFGYGLHYTT 724
Query: 610 FKYKVASSPK-SVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVEN 668
F++ A +P+ S DI D +YT K + I+V N
Sbjct: 725 FEFDWAKAPQASYDIASLVDSTA---SYTTSPKKNDASPWT-----------ELSIKVHN 770
Query: 669 MGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVF-IAAGQSAKVGFTMN 716
G + V +V+ + P G A K + Y R+ ++AG SA++ F+++
Sbjct: 771 SGSLGSDYVGLVFLRTPNAGPAPYPNKWLASYARLHGLSAGASAELSFSLS 821
>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
Length = 792
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 267/737 (36%), Positives = 391/737 (53%), Gaps = 41/737 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA LV T E V G+ + GVPRLGLP Y+ WSEALHG+
Sbjct: 57 LSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLD- 115
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
R N + E ATSFP ILT ++ N +L +I ++T+ RA N+G GL
Sbjct: 116 ---RANFTD----EGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGL 168
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKI 187
++PNIN R WGR ETPGED Y + YA Y+ G+Q GV D LK+
Sbjct: 169 DVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--GV------DPEHLKL 220
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
A KHYA YDL+NW+G+ R D +T+Q++ E + F + + V SVMCSYN VN
Sbjct: 221 VATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVN 280
Query: 248 GIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G+P+CA+ L +R + F GY+ SDCDS + H+F + A A ++AG
Sbjct: 281 GVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGT 339
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICN 364
D+DCG Y + A + ++ A+I+ + LY L+RLGYFDG+ Y++L N++
Sbjct: 340 DIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVT 399
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
++ EAA +GIVLLKND G LPL +++++AL+GP N T + GNY G
Sbjct: 400 TDAWNISYEAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLI 457
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
SP++ F +NYA G +I + A+ AAK +D + G+D ++EAE DR
Sbjct: 458 SPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDR 516
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
+++ PG Q +LI++++ K P+ ++ M G VD + K+N + S++W GYPG+ GG+
Sbjct: 517 MNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQ 575
Query: 545 AIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
A+ D+I GK P GRL +T Y A Y + P T M LRP N PG+TY ++ G VY FG+
Sbjct: 576 ALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNNPGQTYMWYTGTPVYEFGH 635
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GL YT F AS P + KD+ +I + P A V + F
Sbjct: 636 GLFYTTFH---ASLPGT-----GKDKTSFNIQDLLTQPHPGFANVEQMPL------LNFT 681
Query: 664 IEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ + N GK+ M+++ G A K ++G++R+ ++ S+
Sbjct: 682 VTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVA 741
Query: 723 IVDNAANSLLASGAHTI 739
D A N +L G + +
Sbjct: 742 RTDEAGNRVLYPGKYEL 758
>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
Length = 792
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 267/737 (36%), Positives = 391/737 (53%), Gaps = 41/737 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA LV T E V G+ + GVPRLGLP Y+ WSEALHG+
Sbjct: 57 LSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLD- 115
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
R N + E ATSFP ILT ++ N +L +I ++T+ RA N+G GL
Sbjct: 116 ---RANFTD----EGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGL 168
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKI 187
++PNIN R WGR ETPGED Y + YA Y+ G+Q GV D LK+
Sbjct: 169 DVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--GV------DPEHLKL 220
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
A KHYA YDL+NW+G+ R D +T+Q++ E + F + + V SVMCSYN VN
Sbjct: 221 VATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVN 280
Query: 248 GIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G+P+CA+ L +R + F GY+ SDCDS + H+F + A A ++AG
Sbjct: 281 GVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGT 339
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICN 364
D+DCG Y + A + ++ A+I+ + LY L+RLGYFDG+ Y++L N++
Sbjct: 340 DIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVT 399
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
++ EAA +GIVLLKND G LPL +++++AL+GP N T + GNY G
Sbjct: 400 TDAWNISYEAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLI 457
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
SP++ F +NYA G +I + A+ AAK +D + G+D ++EAE DR
Sbjct: 458 SPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDR 516
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
+++ PG Q +LI++++ K P+ ++ M G VD + K+N + S++W GYPG+ GG+
Sbjct: 517 MNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQ 575
Query: 545 AIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
A+ D+I GK P GRL +T Y A Y + P T M LRP N PG+TY ++ G VY FG+
Sbjct: 576 ALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNNPGQTYMWYTGTPVYEFGH 635
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GL YT F AS P + KD+ +I + P A V + F
Sbjct: 636 GLFYTTFH---ASLPGT-----GKDKTSFNIQDLLTQPHPGFANVEQMPL------LNFT 681
Query: 664 IEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ + N GK+ M+++ G A K ++G++R+ ++ S+
Sbjct: 682 VTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVA 741
Query: 723 IVDNAANSLLASGAHTI 739
D A N +L G + +
Sbjct: 742 RTDEAGNRVLYPGKYEL 758
>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
Length = 792
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 267/737 (36%), Positives = 391/737 (53%), Gaps = 41/737 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA LV T E V G+ + GVPRLGLP Y+ WSEALHG+
Sbjct: 57 LSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLD- 115
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
R N + E ATSFP ILT ++ N +L +I ++T+ RA N+G GL
Sbjct: 116 ---RANFTD----EGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGL 168
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKI 187
++PNIN R WGR ETPGED Y + YA Y+ G+Q GV D LK+
Sbjct: 169 DVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--GV------DPEHLKL 220
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
A KHYA YDL+NW+G+ R D +T+Q++ E + F + + V SVMCSYN VN
Sbjct: 221 VATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVN 280
Query: 248 GIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G+P+CA+ L +R + F GY+ SDCDS + H+F + A A ++AG
Sbjct: 281 GVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGT 339
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICN 364
D+DCG Y + A + ++ A+I+ + LY L+RLGYFDG+ Y++L N++
Sbjct: 340 DIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVT 399
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
++ EAA +GIVLLKND G LPL +++++AL+GP N T + GNY G
Sbjct: 400 TDAWNISYEAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLI 457
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
SP++ F +NYA G +I + A+ AAK +D + G+D ++EAE DR
Sbjct: 458 SPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDR 516
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
+++ PG Q +LI++++ K P+ ++ M G VD + K+N + S++W GYPG+ GG+
Sbjct: 517 MNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQ 575
Query: 545 AIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
A+ D+I GK P GRL +T Y A Y + P T M LRP N PG+TY ++ G VY FG+
Sbjct: 576 ALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNNPGQTYMWYTGTPVYEFGH 635
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GL YT F AS P + KD+ +I + P A V + F
Sbjct: 636 GLFYTTFH---ASLPGT-----GKDKTSFNIQDLLTQPHPGFANVEQMPL------LNFT 681
Query: 664 IEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
+ + N GK+ M+++ G A K ++G++R+ ++ S+
Sbjct: 682 VTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVA 741
Query: 723 IVDNAANSLLASGAHTI 739
D A N +L G + +
Sbjct: 742 RTDEAGNRVLYPGKYEL 758
>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
Length = 716
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 262/760 (34%), Positives = 393/760 (51%), Gaps = 114/760 (15%)
Query: 19 LPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPG 78
L E AK LVE+MTL EK+ QM + + RL +P Y WW+EALHGV+ G
Sbjct: 3 LETKEYAKRLVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSGV------- 55
Query: 79 THFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTF 130
AT FP I A+F+E L +KIG VSTE RA + + GLTF
Sbjct: 56 ---------ATMFPQAIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTF 106
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PNIN+ RDPRWGR ET GEDP + + Y+RG+Q +D D LK +AC
Sbjct: 107 WAPNINIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQG-------KDPDH--LKAAAC 157
Query: 191 CKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
KH+A + G + R FD++V+ D+ +T++ F+ CV + V +VM +YNRVNG
Sbjct: 158 AKHFAVHS-----GPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVEAVMGAYNRVNG 212
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
P C LL +R + F G++VSDC +I E H + T E++ A + G DL+
Sbjct: 213 EPACGSKTLLQDILREQFGFEGHVVSDCWAILDFHEHHH-VTKTVEESAAMAVNHGCDLN 271
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQH 367
CG + + A +QG + E I ++ L V +RLG + P Y N+ + + P+H
Sbjct: 272 CGKAFLYLSR-ACEQGLVEEKTITEAVERLMDVRIRLGMMEDYPSPYANIPYDVVECPEH 330
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
I L+ EA+++ +VLLKNDN LPL + T+A++GP+AN+ A++GNYEGT RY +P+
Sbjct: 331 IALSLEASKRSMVLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGNYEGTSSRYITPL 390
Query: 428 DGFYAYS---KVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
+G Y+ + YA GC + + + A+ AA+ AD V+ GLD +E
Sbjct: 391 EGIQEYTGEKTRVLYAQGCHLYKDQVEFLGEPKDRFKEALIAAERADVIVMCLGLDAGIE 450
Query: 479 AE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
E D++ L LPG Q EL+ VA K P+ L +++ A+D+++A+ + +I
Sbjct: 451 GEEGDAGNEYASGDKLGLKLPGLQQELLEAVAAVGK-PIVLTVLAGSALDLSWAQEHAQI 509
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
++IL YPG GG+AIA+ +FG+++P G+LP+T+YE +T + GRT
Sbjct: 510 RAILDCWYPGARGGKAIAEALFGEFSPCGKLPVTFYEGTEFLPDFTDYSM------AGRT 563
Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
Y++ D V+YPFGYGL+Y+Q +Y A + D+ G +P
Sbjct: 564 YRYTDRHVLYPFGYGLTYSQIRYSDAHA---------------DVT-DFGILEP------ 601
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-------PPGIAGTHIKQVIGYERVF 702
T + VEN G E V VY + PG Q+ G V
Sbjct: 602 ----------VTVHVTVENTGTYPVQEAVQVYVRFSEREAYDPGY------QLKGIRSVA 645
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ G+ +V T++ + ++ L+ G++ I VG
Sbjct: 646 LECGEKKEVCITLSP-RDFALISEEGKCLVHPGSYEIAVG 684
>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 786
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 267/743 (35%), Positives = 387/743 (52%), Gaps = 44/743 (5%)
Query: 5 IKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
+ LS P C+ L +RA LV+ TL E G+ A GVPRLGLP YE W+EALH
Sbjct: 50 VNSTLSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWNEALH 109
Query: 65 GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
G+S TN GT ATSFP+ IL+ AS N +L +IG +ST+ RA N G
Sbjct: 110 GISHGHFATN---GTW-----SWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAG 161
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
GL ++PNIN R P WGR ETPGED + + YA Y+ G+Q + + +
Sbjct: 162 RYGLDSYAPNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQGGK-------APAV 214
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
P K+ A KH+A YD++NW N R D +T+QD+ + F + +MCSY
Sbjct: 215 P-KLVAVPKHFAGYDIENWNNNSRLGLDVNITQQDLAGYYTPQFRSAIQNAKALGLMCSY 273
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF-HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
N VNG+P+C++ L R W F +G++ SDCD++ + H + +T AVA L+
Sbjct: 274 NAVNGVPSCSNSFFLQTLARDTWGFGNGFVSSDCDAVYNVYNPHGYAANTT-GAVADSLR 332
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNN 361
AG D+DCG Y + + A G ++ DI+ +L Y L+ GYFDG S Y+NLG N+
Sbjct: 333 AGTDIDCGTSYPFYLVPAFNAGLVSRNDIELALTRYYSGLVMQGYFDGNSSLYRNLGWND 392
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ ++ EAA +GI LLKND G LPL+ + +++AL+GP ANAT + GNY
Sbjct: 393 VLTTDAWNISYEAAVEGITLLKND-GTLPLSK-STRSVALIGPWANATLQLQGNYYAAAP 450
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
SP+ F A +N+ G I N S AI A+ +D + G+D S+EAEG
Sbjct: 451 YLISPLQAFRASGMTVNFVNGTT-ISSTNTSGFAEAITLAQQSDVIIYAGGIDNSIEAEG 509
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR ++ PG Q +LI +++ K P+ ++ M G VD + KNN K+ +++W GYPG+
Sbjct: 510 LDRQNITWPGNQLDLIYQLSQVGK-PLVVLQMGGGQVDSSALKNNSKVNALVWGGYPGQS 568
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
GG+A+ D+I G P GRL T Y A+Y +M + PVN G+TY ++ G VYP
Sbjct: 569 GGQALFDIIMGNRAPAGRLVTTQYPASYATSFNQLNMNMAPVNGSLGQTYMWYTGTPVYP 628
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FG+GL YT F P + N T P +++V D+ F
Sbjct: 629 FGHGLFYTNFTTTSTMGPVTT------------YNLTSIFAAPHPGYEFVEEVPIMDFNF 676
Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYER-VFIAAGQSAKVGFTMNAC 718
V N G+ M++ S G IK ++G +R I G A V +
Sbjct: 677 I----VNNTGRTASDWSGMLFASTTSGPTPRPIKWLVGIDREAIIVPGGLASVTIKV-PV 731
Query: 719 KSLKIVDNAANSLLASGAHTILV 741
+L D N ++ G++++++
Sbjct: 732 GALARADANGNLVVYPGSYSLML 754
>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
Length = 797
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 278/760 (36%), Positives = 403/760 (53%), Gaps = 57/760 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD+ Y ERA+ L+ TL E + + GVPRLGLP Y+ W+EALHG+ R
Sbjct: 63 CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLD---RANF 119
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
+ G F+ ATSFP ILTTA+ N +L +I +ST+ARA N G GL ++PN
Sbjct: 120 ATKGGQFE----WATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPN 175
Query: 135 INVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
+N R P WGR ETPGED + + Y Y+ G+Q GV D LK++A KH
Sbjct: 176 VNGFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG--GV------DPEQLKVAATVKH 227
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+A YDL+NW R FD+ +T+QD+ E + F S+MCSYN VNG+P+CA
Sbjct: 228 FAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCSYNSVNGVPSCA 287
Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L +R W F GY+ SDCD++ + H + + A A L+AG D+DCG
Sbjct: 288 NSFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYA-SNQSSAAASSLRAGTDIDCGQ 346
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELA 371
Y + G++ +I+ S+ LY L+RLGYFD QY++LG ++ ++
Sbjct: 347 TYPWHLNESFVAGEVTRGEIERSVTRLYANLVRLGYFDKKNQYRSLGWKDVVKTDAWNIS 406
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
EAA +GIVLLKND G LPL+ ++++AL+GP ANAT M GNY G SP++
Sbjct: 407 YEAAVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYFGPAPYLISPLEAAK 464
Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPG 491
+N+ G +I + + AI AAK +DA V + G+D ++E EG DR D+ PG
Sbjct: 465 KAGYHVNFELGT-EIAGNSTAGFAKAIAAAKKSDAIVYLGGIDNTIEQEGADRTDIAWPG 523
Query: 492 FQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF 551
Q +LI ++++ K P+ ++ M G VD + K+N K+ S++W GYPG+ GG A+ D++
Sbjct: 524 NQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILS 582
Query: 552 GKYNPGGRLPITWYEANYV-KIPYTSMPLRP-VNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
GK P GRL T Y A YV + P M LRP + PG+TY ++ G VY FG GL YT
Sbjct: 583 GKRAPAGRLITTQYPAEYVHQFPQNDMNLRPDGKSNPGQTYIWYTGKPVYEFGSGLFYTT 642
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
FK +AS PK + YT P FTF+ ++N
Sbjct: 643 FKETLASHPKCLKFNTSSILSAPHPGYTYSEQIP---------------VFTFEANIKNS 687
Query: 670 GKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDN 726
GK + M++ + G A K ++G++R+ I G S+K+ + +L VD+
Sbjct: 688 GKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSSKLSIPI-PVSALARVDS 746
Query: 727 AANSLLASGAHTI-------------LVGEGVGGVSFPLQ 753
N ++ G + + LVGE V ++PL+
Sbjct: 747 YGNRIVYPGKYELALNTDESVKLEFELVGEEVTIENWPLE 786
>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
Length = 800
Score = 421 bits (1081), Expect = e-114, Method: Compositional matrix adjust.
Identities = 267/757 (35%), Positives = 400/757 (52%), Gaps = 64/757 (8%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS C+ L PERA LV +T EK+Q + + G PR+GLP Y WWSEALHGV++
Sbjct: 34 LSTNQVCNTTLSPPERAAALVAALTPEEKLQNIVSKSLGAPRIGLPAYNWWSEALHGVAY 93
Query: 69 IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
PGT F D +TSFP +L A+F++ L +KI + + E RA N G
Sbjct: 94 A-------PGTQFWQGDGPFNSSTSFPMPLLMAATFDDELLEKIAEVIGIEGRAFGNAGF 146
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+GL +W+PN+N +DPRWGR ETPGED +V RYA ++GL EG ++
Sbjct: 147 SGLDYWTPNVNPFKDPRWGRGSETPGEDVLLVKRYAAAMIKGL---EGPVPEKER----- 198
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
++ A CKHYAA D ++W G R +F+++++ QDM E + +PF+ CV + V S+MC+YN
Sbjct: 199 RVVATCKHYAANDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSIMCAYNA 258
Query: 246 VNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
VNG+P+CA P LL +R WN+ + YI SDC+++ + +HK+ T + A +
Sbjct: 259 VNGVPSCASPYLLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKYAA-TNAEGTAISFE 317
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNN 361
AG+D C ++ GA QG + E+ +D +L LY ++R GYFDG Y +LG +
Sbjct: 318 AGMDTSCEYEGSSDIPGAWSQGLLKESTVDRALLRLYEGIVRAGYFDGKQSLYSSLGWAD 377
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALP----LNTGNIKTLALVGPHANATKAMIGNYE 417
+ P +L+ +AA G VLLKND G LP L+ K +A++G ++A + G Y
Sbjct: 378 VNKPSAQKLSLQAAVDGTVLLKND-GTLPLSDLLDKSRPKKVAMIGFWSDAKDKLRGGYS 436
Query: 418 GTPCRYTSPMDGFYAYSKV-INYAPGCADI----VCQNNSMIPAAIDAAKNADATVIVAG 472
GT +P YA S++ I ++ I + N S A+ AAK+AD + G
Sbjct: 437 GTAAYLHTPA---YAASQLGIPFSTASGPILHSDLASNQSWTDNAMAAAKDADYILYFGG 493
Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG-AVDINFAKNNPKIKS 531
+D S E KDR DL PG Q LIN + +K L+++ G +D +NPKI +
Sbjct: 494 IDTSAAGETKDRYDLDWPGAQLSLINLLTTLSK---PLIVLQMGDQLDNTPLLSNPKINA 550
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPV--NNFPGR 588
ILW +PG++GG A+ +++ G +P GRLP+T Y +N+ + +P T M LRP N+ GR
Sbjct: 551 ILWANWPGQDGGTAVMELVTGLKSPAGRLPVTQYPSNFTELVPMTDMALRPSAGNSQLGR 610
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
TY+++ P V FG+GL YT F K +V I +D+ + D Y
Sbjct: 611 TYRWYKTP-VQAFGFGLHYTTFSPKFGKKFPAV-IDVDEVLEGCDDKY------------ 656
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGT--HIKQVIGYERVFIAAG 706
+D D + VEN G V + + PG+ IK + + R+ G
Sbjct: 657 -LDTCPLPD----LPVVVENRGNRTSDYVALAFVSAPGVGPGPWPIKTLGAFTRLRGVKG 711
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+ G +L D N+++ G + + + E
Sbjct: 712 GEKREGGLKWNLGNLARHDEEGNTVVYPGKYEVSLDE 748
>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
Length = 797
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 260/744 (34%), Positives = 386/744 (51%), Gaps = 46/744 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA LV +T E V + +G PR+GLP Y+ W+EALHGV+
Sbjct: 57 LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVA- 115
Query: 69 IGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
H D G +TSFP I T A+ N +L +I +ST+ RA N G
Sbjct: 116 -----------HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAG 164
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
GL +SPNIN R P WGR ETPGED Y + YA Y+ G+Q GV D+
Sbjct: 165 RYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--GV------DAN 216
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
PLK+ A KHYA YD++NW+ + R D ++T+QD+ E + F + + V SVMCSY
Sbjct: 217 PLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSY 276
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
N VNG+P+C++ L +R ++F GY+ DC ++ + H + + + A A +
Sbjct: 277 NAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATN-ESSAAADSI 335
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKN 360
+AG D+DCG Y + +++ D++ + LY L+R GYFDG + Y+N+ +
Sbjct: 336 RAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTSPYRNITWS 395
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGN-IKTLALVGPHANATKAMIGNYEGT 419
++ + L+ EAA Q IVLLKND G LPL T + KT+AL+GP ANAT M+GNY G
Sbjct: 396 DVVSTNAQNLSYEAAAQSIVLLKND-GILPLTTSSSTKTIALIGPWANATTQMLGNYYGP 454
Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
SP+ F I Y G +++ A+ AK AD + G+D ++E
Sbjct: 455 APYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLET 514
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E +DR ++ P Q LI K+AD K P+ ++ M G VD + KNN + +++W GYPG
Sbjct: 515 EAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYPG 573
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRPVNNFPGRTYKFFDGPVV 598
+ GG+A+AD+I GK P RL T Y A Y ++ P M LRP + PG+TY ++ G V
Sbjct: 574 QSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSNPGQTYMWYTGTPV 633
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
Y FG+GL YT F ++ + K++ +I+ +G +P L++ +
Sbjct: 634 YEFGHGLFYTNFTASASAGSGT------KNRTSFNIDEVLG--RPHPGYKLVEQMPL--- 682
Query: 659 KFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
F ++V+N G M + + G A K ++G++R+ SAK
Sbjct: 683 -LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPVT 741
Query: 718 CKSLKIVDNAANSLLASGAHTILV 741
SL D N +L G + + +
Sbjct: 742 VDSLARTDEEGNRVLYPGRYEVAL 765
>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 923
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 278/748 (37%), Positives = 389/748 (52%), Gaps = 47/748 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L P C+ LP +R + LV ++TL EK+ + D A G R+GLP YEWWSEALHGV+
Sbjct: 159 LCSSPACNTSLPIADRVRWLVGQLTLQEKITNLVDGASGSARVGLPPYEWWSEALHGVAA 218
Query: 69 I-GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
G P GT F ATSFP I +A+F++ L +I V E RA N G +G
Sbjct: 219 SPGVTFAGPNGTAFSY----ATSFPMPITISAAFDDDLVSQIAAVVGREGRAFANHGLSG 274
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
FW+PNIN RDPRWGR ETPGED + + +Y + + GLQ SD +I
Sbjct: 275 FDFWTPNINPFRDPRWGRGPETPGEDAFRIQQYIRHLIPGLQ---------GSDPLDKQI 325
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
A CKHYA YD++ R+ +D D+ E ++ PF+ CV + + SVMCSYN V+
Sbjct: 326 IATCKHYAVYDVE----TGRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVD 381
Query: 248 GIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
GIP CA LL +R W F + Y+VSDCD+++ I H F D+ A A L AG
Sbjct: 382 GIPACASEYLLQSVLRDHWGFTEPYQYVVSDCDAVRFIYSPHNF-TDSPAAAAAVALNAG 440
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN 364
DL+CG Y N ++ EA +D +L LY L +G+FDGS +Y LG + +
Sbjct: 441 TDLECGSTYLNLNQ-SLASNMTTEAALDRALTRLYTALHTIGFFDGSARYGGLGWDAVGT 499
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
LA +AA G VLLKN+ LPL++ ++ LA++GP ANAT M GNY G
Sbjct: 500 GDAQVLAYQAAVDGAVLLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFGQAAYLV 559
Query: 425 SPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
SP+ F + N +A G I + + AA+ AAK ADA V + G+D SVE+E
Sbjct: 560 SPLAAFQSAWGADNVLFANGTG-IAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESL 618
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DR + PG Q +LI ++A K P+ +V G +D + NP++ ++LW GYPG+ G
Sbjct: 619 DRTAISWPGNQLDLIAQLAAVGK-PLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAG 677
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPV--------NNFPGRTYKFF 593
G AIAD++ GK P GRLP+T Y A+Y ++ LRP + FPGRTYK++
Sbjct: 678 GAAIADLLTGKQAPAGRLPVTQYAASYTSEVSLFDPSLRPRRSGGSKSHSTFPGRTYKWY 737
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
G V PFGYGL YT F+ A P+ + DI N ++
Sbjct: 738 TGKPVLPFGYGLHYTTFRTAWADEPRG---------RAYDIAGLFPANTTTTSSAFSAAD 788
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVF-IAAGQSAKVG 712
+ + G D ++ + ++ G A K ++GY R +A G SA++
Sbjct: 789 TYPVLNVSVTVTNTGRGASDYVGLLFLRTRNAGPAPYPNKWLVGYARARGLAPGSSARLE 848
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTIL 740
+ A SL D ++ G + +L
Sbjct: 849 LAV-ALGSLARADEDGRRVVYPGDYELL 875
>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
Length = 763
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 273/745 (36%), Positives = 395/745 (53%), Gaps = 57/745 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS+ P CD L ERAK LV +TL EK+ G A G RLGLP Y WW+EALHGV+
Sbjct: 33 LSELPICDTSLSPLERAKSLVSALTLEEKINNTGHEAAGSSRLGLPAYNWWNEALHGVA- 91
Query: 69 IGRRTNSPPGTHFDS--EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
G F+ + ATSFP I+ A+FN++L +++ + +STEARA N +A
Sbjct: 92 ------EKHGVSFEESGDFSYATSFPAPIVLGAAFNDALIRRVAEIISTEARAFSNSDHA 145
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
G+ +W+PN+N +DPRWGR ETPGEDP RY +V GLQ D +P K
Sbjct: 146 GIDYWTPNVNPFKDPRWGRGQETPGEDPLHCSRYVKEFVGGLQG--------DDPEKP-K 196
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
+ A CKH AAYDL+ W G RF FD++V+ D+ E ++ PF+ C + V + MCSYN +
Sbjct: 197 VVATCKHLAAYDLEEWGGVSRFEFDAKVSAVDLLEYYLPPFKTCAVDASVGAFMCSYNAL 256
Query: 247 NGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NG+P CAD LL +R W + G ++ DC +++ I H ++ E A A L A
Sbjct: 257 NGVPACADRYLLQTVLREHWGWEGPGHWVTGDCGAVERIQTYHHYVESGPE-AAAAALNA 315
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKN 360
G+DLDCG + ++ A +QG I+ +D +L LY L++LGYFD G P ++LG +
Sbjct: 316 GVDLDCGTWLPSYLGEAERQGLISNETLDAALTRLYTSLVQLGYFDPAEGQP-LRSLGWD 374
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
++ + ELA A QG VLLKN + LPL TLAL+GP N T + NY G
Sbjct: 375 DVATSEAEELAKTVAIQGTVLLKNIDWTLPLKANG--TLALIGPFINFTTELQSNYAGPA 432
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
+ ++ + APG ++ + A+ A ADA + G+D +VE E
Sbjct: 433 KHIPTMIEAAERLGYNVLTAPGT-EVNSTSTDGFDDALAIAAEADALIFFGGIDNTVEEE 491
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DR + PG Q ELI ++A+ + P+T+V G VD + + + +I+W GYP +
Sbjct: 492 SLDRTRIDWPGNQEELILELAELGR-PLTVVQFGGGQVDDSALLASAGVGAIVWAGYPSQ 550
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
GG + DV+ GK P GRLPIT Y +YV ++P T M L+P + PGRTY++++ V+
Sbjct: 551 AGGAGVFDVLTGKAAPAGRLPITQYPKSYVDEVPMTDMNLQPGTDNPGRTYRWYEDAVL- 609
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFG+GL YT F A K D R N ++ ++D
Sbjct: 610 PFGFGLHYTTFNVSWA---KKAFGPYDAATLARGKN---------PSSNIVD-------- 649
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV-FIAAGQSAKVGFTMN 716
TF + V N G + V +V++ P G IK ++GY R I G++ KV +
Sbjct: 650 -TFSLAVTNTGDVASDYVALVFASAPELGAQPAPIKTLVGYSRASLIKPGETRKVDVEVT 708
Query: 717 ACKSLKIVDNAANSLLASGAHTILV 741
+ ++ +L G +T+LV
Sbjct: 709 VAPLTRATED-GRVVLYPGEYTLLV 732
>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 767
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 267/743 (35%), Positives = 394/743 (53%), Gaps = 57/743 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD L PERA LV+ +T+ EK+Q + A G PR+GLP Y WWSEALHGV++
Sbjct: 43 CDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHGVAYA----- 97
Query: 75 SPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PGT+F D E +TS+P +L A+F++ L ++IG + EARA N G AGL +W
Sbjct: 98 --PGTYFPEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWAGLDYW 155
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
+PN+N +DPRWGR ETPGED V RYA RGL E R + + C
Sbjct: 156 TPNVNPFKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGPVPGEQRR--------VISTC 207
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KHYA D ++W G R FD+++T QD+ E +++PF+ C + V S+MC+YN VNG+P+
Sbjct: 208 KHYAGNDFEDWNGTSRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSIMCAYNAVNGVPS 267
Query: 252 CADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
CA+ LL +R WN+ + Y+ SDC+++ + +HK+ T A +AG+D
Sbjct: 268 CANEYLLQNILREHWNWTEHNNYVTSDCEAVLDVSANHKYA-PTNAAGTAICFEAGMDTS 326
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQH 367
C ++ GA QG + E +D +L LY L+R GYFDG Y LG ++ + +
Sbjct: 327 CEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGHEAIYAKLGWKDVNSAEA 386
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
LA +AA +GIVLLKN NG LPL+ +A++G A+A + G Y G +P
Sbjct: 387 QSLALQAAVEGIVLLKN-NGTLPLDLKPSHKVAMIGFWADAPDKLQGGYSGRAAHLHTP- 444
Query: 428 DGFYAYSKVINYAPGCADIVCQNNS---MIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
+ A ++ ++ +NN+ AA++AA+ AD + GLD S E DR
Sbjct: 445 -AYAARQLGLDITLASGPVLQRNNASDNWTAAALEAAEGADYILYFGGLDTSAAGETLDR 503
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
DL P Q LI K++ K P+ + ++ D + + ++ SILW +PG++GG
Sbjct: 504 TDLEWPEAQLMLIKKLSALGK-PLVVNLLGDQLDDTPLLQLD-EVSSILWANWPGQDGGV 561
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AI +I G+ +P GRLP+T Y +NY IP TSM LRP + +PGRTY+++D P+ FG+
Sbjct: 562 AIMKLITGEKSPAGRLPVTQYPSNYTDLIPMTSMDLRPTSQYPGRTYRWYDKPIKR-FGF 620
Query: 604 GLSYTQFKYKVASS-PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
GL YT FK +V + PK++ I D+ + C A
Sbjct: 621 GLHYTTFKAEVGGAFPKTLRIA--------DLVGCGNEHPDTCPAP------------PL 660
Query: 663 QIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKS 720
+ + N G V + Y S G IK + Y+R+ +A G++A V
Sbjct: 661 PVSITNTGNRTSDYVALAYLSGEYGPRPYPIKTLSAYKRLRDVAPGETATVDLAWT-LGD 719
Query: 721 LKIVDNAANSLLASGAHTILVGE 743
+ D N++L G +TI + E
Sbjct: 720 IARHDEQGNTVLYPGEYTITIDE 742
>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
Length = 755
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 260/624 (41%), Positives = 353/624 (56%), Gaps = 49/624 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L++ CD RA L+ TL EKV G+ + GVPR+GLP YEWW+EALHG++
Sbjct: 28 LANNTVCDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIA- 86
Query: 69 IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
PGT F S +TSFP IL A+F++ L K+ VSTEARA N+
Sbjct: 87 ------RSPGTTFAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNR 140
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GL FW+PNIN +DPRWGR ETPGEDP+ Y + GLQ G+ D P
Sbjct: 141 FGLNFWTPNINPYKDPRWGRGQETPGEDPFHTSSYVNALITGLQG--GL------DDLPY 192
Query: 186 KIS-ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K A CKH+A YDL+N +G R+ FD+ + QD+++ ++ PF+ C + +V SVMCSYN
Sbjct: 193 KKGVATCKHFAGYDLENSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYN 252
Query: 245 RVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+NG+PTCAD LL +R W + ++ SDCD+++ I + H + T E + A L
Sbjct: 253 AMNGVPTCADDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNY-TLTPEQSAADAL 311
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGK 359
AG DLDCG ++ + A QG + +D SL Y L+RLGYFD Y+ L
Sbjct: 312 NAGTDLDCGTFWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFDPPSVQPYRQLNW 371
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+N+ P +LA +AA GIVLLKND G LPL++ NI +AL+GP ANATK M GNY GT
Sbjct: 372 DNVSTPAAQQLALQAAEDGIVLLKND-GILPLSS-NITNVALIGPLANATKQMQGNYYGT 429
Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
SP+ + Y G ADI QN + AAI AA++AD + V G+D S+EA
Sbjct: 430 APYLRSPLIAAQNAGFKVTYVQG-ADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEA 488
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E +L T LI + +D + +N + ++LW GYPG
Sbjct: 489 EE------ILANLSTPLI-------------ISQMGCMIDSSSLLSNTGVNALLWAGYPG 529
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
++GG AI +++ GK P GRLPIT Y +NYV ++ T M L+P PGRTYK+++G V
Sbjct: 530 QDGGTAIFNILTGKTAPAGRLPITQYPSNYVNQVTMTDMNLQPSRFNPGRTYKWYNGEPV 589
Query: 599 YPFGYGLSYTQFKYKVA-SSPKSV 621
+ +GYGL YT F K+ SSP +
Sbjct: 590 FEYGYGLQYTTFDAKITPSSPNNT 613
>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
Length = 698
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 250/630 (39%), Positives = 344/630 (54%), Gaps = 42/630 (6%)
Query: 7 VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV 66
+ LS P CD L + RA LV + T E + + A GVPRLG+P Y+WW+EALHGV
Sbjct: 27 LPLSTLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWTEALHGV 86
Query: 67 SFIGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
+ PG +F+ + G ATSFP VI A+F+++L++ + ++ E RA N
Sbjct: 87 A-------RSPGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSN 139
Query: 123 LGNAGLTFWSP-NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
G AGL +SP NIN RDPRWGR ET GEDP + RYA+ VRGLQ G +++
Sbjct: 140 AGRAGLNMYSPLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQ---GPAAQDEAN 196
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
R L ++A CKHY AYDL+ G +R+ FD+ V+ QD+ + + F CV +G +++M
Sbjct: 197 PR-LTLAATCKHYLAYDLEASAGVERYQFDALVSNQDLADLHLPQFRACVRDGGATTLMT 255
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
SYN VNG+P A L R W H Y+ SDCD++ + ++H + D A A
Sbjct: 256 SYNAVNGVPPSASKYYLETLARDTWGLDKHHNYVTSDCDAVANVYDAHHYAADYVHAAAA 315
Query: 299 RVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKN 356
L AG DLDCG Y + A+ Q A I ++ +Y L+RLGYFD + +
Sbjct: 316 S-LNAGTDLDCGATYRDSLAAALAQNLTDVATIRRAVTRMYGSLVRLGYFDAAEAQPLRQ 374
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
LG ++ P +LA EAA I LLKN LPL KT+AL+GP+ NAT A+ GNY
Sbjct: 375 LGWKDVNAPAAQKLAYEAAAASITLLKNRQSTLPLRETAGKTIALIGPYTNATFALRGNY 434
Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAID---------AAKNADAT 467
G +P D A + + A IV N + I D AK+AD
Sbjct: 435 AGPSPLVITPFD---AARRTFS----DAHIVSANGTSIAGPYDTATASAALATAKSADII 487
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVI-MSAGAVDINFAKNN 526
V G+D +VE E DR D+ P Q LI ++ AA G V +V+ G VD K +
Sbjct: 488 VYAGGIDPTVEGESLDRRDIAWPANQLRLIQEL--AALGKVLVVVQFGGGQVDGALLKGD 545
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNF 585
+ +++W GYPG+ G A+ D++ GK P GRLPIT Y ANY + T+M LRP +
Sbjct: 546 DGVGALVWAGYPGQSGALALMDILAGKRAPAGRLPITQYPANYTHALRETTMALRPTATY 605
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVA 615
PGRTYK++ G +PFG+GL YT F+ +A
Sbjct: 606 PGRTYKWYTGTPTFPFGFGLHYTTFRASIA 635
>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
Length = 804
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 270/746 (36%), Positives = 394/746 (52%), Gaps = 53/746 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD+ +RA LV TL E + G+ + GVPRLGLP Y+ WSEALHG++ R
Sbjct: 60 CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLA---RANF 116
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
+ G + ATSFP+ IL+ A+FN +L +I +ST+ RA N G GL +SPN
Sbjct: 117 TDNGAY-----SWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPN 171
Query: 135 INVVRDPRWGRVLETPGEDPY-VVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
IN R P WGR ETPGED Y + YA Y+ G+Q E+ LK++A KH
Sbjct: 172 INTFRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQGGVNPEH--------LKLAATAKH 223
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+A YD++NW+ + R D +T+QD+ E + F + + V S MCSYN VNG+P+C+
Sbjct: 224 FAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCS 283
Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L +R ++F HGY+ DC ++ + H + + + A A + AG D+DCG
Sbjct: 284 NTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAAN-EPSAAADAILAGTDIDCGT 342
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG----SPQYKNLGKNNICNPQH 367
Y ++ G +A DI+ LY L+ LGYFDG S Y++LG ++
Sbjct: 343 SYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDA 402
Query: 368 IELAAEAARQGIVLLKNDNGALPLNT---GNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
++ EAA +GIVLLKND G LPL + G K++AL+GP ANAT + GNY G
Sbjct: 403 WNISYEAAVEGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLI 461
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
SP+D F A ++YAPG +I + + AA+ AA+ AD V + G+D ++EAE +DR
Sbjct: 462 SPVDAFTAAGYTVHYAPGT-EISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDR 520
Query: 485 VDLLLPGFQTELINKVA--DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
+ PG Q ELI+++A + P+ + M G VD + K+N K+ ++LW GYPG+ G
Sbjct: 521 SSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSALKSNAKVNALLWGGYPGQSG 580
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFFDGPVVY 599
G A+ D++ G P GRL T Y A Y + M LRP PG+TY ++ G VY
Sbjct: 581 GLALRDILTGARAPAGRLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPVY 640
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
FG+GL YT F ASS ++ K YT AA +
Sbjct: 641 AFGHGLFYTTFN---ASSAQAAKTK-----------YTFNITDLTSAAHPDTTTVGQRTL 686
Query: 660 FTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAA--GQSAKVGFTM 715
F F + N G+ D +VY + G + K ++G++R+ A G +A++ +
Sbjct: 687 FNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVPV 746
Query: 716 NACKSLKIVDNAANSLLASGAHTILV 741
A L VD A N++L G + + +
Sbjct: 747 -AVDRLARVDEAGNTVLFPGRYEVAL 771
>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 791
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 259/701 (36%), Positives = 371/701 (52%), Gaps = 44/701 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA L+ T E V G++ +PRLGLP Y+ W+EALHG+
Sbjct: 55 LSKTMVCDTTAKPHDRAAALIAMFTFEELVNSTGNVMPAIPRLGLPPYQVWNEALHGLD- 113
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
R N T F + ATSFP+ ILT A+ N +L +IG VST+ RA N G GL
Sbjct: 114 ---RANL---TEF-GDYSWATSFPSPILTMAALNRTLINQIGGIVSTQGRAFNNGGRYGL 166
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+SPNIN R P WGR ETPGED + Y + Y+ GLQ D + LK++
Sbjct: 167 DVYSPNINSFRHPVWGRGQETPGEDIQLCSVYGLEYITGLQG--------GLDPKELKLA 218
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A KH+A YD++NW + R D ++ D + F V + V SVM SYN VNG
Sbjct: 219 ATAKHFAGYDIENWGNHSRLGNDMSISAFDFASYYAPQFVTAVRDARVHSVMASYNAVNG 278
Query: 249 IPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
+P A+ LL +R WNF GY+ SDCDS+ + H + + A + +AG D
Sbjct: 279 VPASANSFLLQTLLRDTWNFVEDGYVSSDCDSVYNVFNPHGYASSASLAAAKSI-QAGTD 337
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNP 365
+DCG Y + + QG+I+ ++I+ + Y L+ LGYFDG + +Y++L +++
Sbjct: 338 IDCGATYQLYLNQSFTQGEISRSEIERAATRFYSNLVSLGYFDGDNSKYRDLDWSDVVAT 397
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
++ EAA +GIVLLKND G LPL+ + ++AL+GP AN T M GNY G T
Sbjct: 398 DAWNISYEAAVEGIVLLKND-GTLPLSK-DTHSVALIGPWANVTTTMQGNYYGAAPYLTG 455
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ A +NYA G +I + S AA+ AA+ +D + G+D SVEAEG DR
Sbjct: 456 PLAALQASDLDVNYAFGT-NISSETTSGFEAALSAARKSDVVIFAGGIDNSVEAEGVDRE 514
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+ PG Q +LI ++++ K P+ ++ M G VD + K N + S++W GYPG+ GG A
Sbjct: 515 TITWPGNQLQLIEQLSELGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPA 573
Query: 546 IADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
I D++ GK P GRL +T Y A Y ++ P T M LRP + PG+TY ++ G VY FG+G
Sbjct: 574 ILDILTGKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGSNPGQTYMWYTGKPVYEFGHG 633
Query: 605 LSYTQFKYKVASSPKS---VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
L YT F+ +A+S + + K + Y V I+ V +Y
Sbjct: 634 LFYTTFETSLANSHGANNGASFDIVKLLSRSNAGYNV-----------IEQVPFMNYT-- 680
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERV 701
IEVEN G + M + H K ++G++R+
Sbjct: 681 --IEVENTGTVTSDYTAMAFVNTKAGPSPHPNKWLVGFDRL 719
>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
Length = 796
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 245/599 (40%), Positives = 340/599 (56%), Gaps = 28/599 (4%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
RA+ LV TL E + + A GVPRLGLP Y+ W+EALHG+ R S G
Sbjct: 73 RAEALVSLFTLEELINNTQNTAPGVPRLGLPQYQVWNEALHGLD---RANFSDSG----- 124
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
E ATSFP IL+ ASFN +L +I ++T+ARA N G GL ++PNIN R P W
Sbjct: 125 EYSWATSFPMPILSMASFNRTLINQIASIIATQARAFNNAGRYGLDSYAPNINGFRSPLW 184
Query: 144 GRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
GR ETPGED + + YA Y+ GLQ GV D +KI A KH+A YDL+NW
Sbjct: 185 GRGQETPGEDAFFLSSAYAYEYITGLQG--GV------DPEHVKIVATAKHFAGYDLENW 236
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
R ++ +T+QD+ E + F S+MCSYN VNG+P+C++ L +
Sbjct: 237 GNVSRLGSNAIITQQDLSEYYTPQFLASARYAKTRSLMCSYNAVNGVPSCSNSFFLQTLL 296
Query: 263 RGDWNF--HGYIVSDCDSIQTIVESHKF-LNDTKEDAVARVLKAGLDLDCGDYYTNFTMG 319
R +NF GY+ SDCD++ + H + LN + A A L AG D+DCG
Sbjct: 297 RESFNFVDDGYVSSDCDAVYNVFNPHGYALN--QSGAAADSLLAGTDIDCGQTMPWHLNE 354
Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIELAAEAARQG 378
+ + ++ DI+ SL LY L+RLGYFDG+ Y+NL N++ ++ EAA +G
Sbjct: 355 SFYERYVSRGDIEKSLTRLYANLVRLGYFDGNNSVYRNLNWNDVVTTDAWNISYEAAVEG 414
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVIN 438
I LLKND G LPL + ++++AL+GP ANAT M GNY GTP SP++ A +N
Sbjct: 415 ITLLKND-GTLPL-SKKVRSIALIGPWANATVQMQGNYYGTPPYLISPLEAAKASGFTVN 472
Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELIN 498
YA G +I + AI AAK +D + G+D ++EAEG+DR DL PG Q +LI
Sbjct: 473 YAFGT-NISTDSTQWFAEAISAAKKSDVIIYAGGIDNTIEAEGQDRTDLKWPGNQLDLIE 531
Query: 499 KVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGG 558
+++ K P+ ++ M G VD + K N + +++W GYPG+ GG A+ D++ GK P G
Sbjct: 532 QLSKVGK-PLVVLQMGGGQVDSSSLKANKNVNALVWGGYPGQSGGAALFDILTGKRAPAG 590
Query: 559 RLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS 616
RL T Y A Y + P M LRP + PG+TY ++ G VY FG+GL YT+F+ A+
Sbjct: 591 RLVSTQYPAEYATQFPANDMNLRPNGSNPGQTYIWYTGTPVYEFGHGLFYTEFQESAAA 649
>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
Length = 737
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 258/757 (34%), Positives = 388/757 (51%), Gaps = 97/757 (12%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
++P+ + L ER DLV ++TL EKV QM + + RL +P Y WW+E LHG IG
Sbjct: 24 NYPFQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNECLHG---IG 80
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA---- 126
R D +V T FP I A++N+ L K++ +S E RA+YN +
Sbjct: 81 RT---------DYKV---TVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSKGNR 128
Query: 127 ----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
GLT+W+PNIN+ RDPRWGR ET GEDP++ G ++V GLQ D+
Sbjct: 129 EIYYGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQG---------DDT 179
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
+ LK +AC KHYA + + N R F++ VT+ D+ +T++ F V E V+ VMC+
Sbjct: 180 KYLKAAACAKHYAVH---SGPENTRHTFNTFVTDYDLWDTYLPAFRNLVVEAKVAGVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
YN NG P C + L+ + +R WNF GY+ SDC +I + HK D K A A +
Sbjct: 237 YNAYNGEPCCGNNFLMQEILREKWNFTGYVTSDCGAIDDFYQHHKTHPDAKY-AAADAVY 295
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
G D+DCG+ + AV+ G I E ID SL+ L+ + RLG FD + +Y + +
Sbjct: 296 NGTDIDCGNEAYKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMFDPAENVKYSQISTS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H +LA + R+ IVLLKN+N LPL + +K +A+VGP+AN +++GNY G P
Sbjct: 356 VLESQKHKDLALKITRESIVLLKNENNTLPL-SKKLKKVAVVGPNANNEVSVLGNYNGFP 414
Query: 421 CRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSM--IPAAIDAAKNADATVIVAGLDLS 476
+P + K + Y G + NS + A + K+ D + V G+
Sbjct: 415 TEIVTPYEAVKQKLKGAEVIYEKGIDFVTPSTNSKEEVSALVKRLKDVDVVIFVGGISPE 474
Query: 477 VEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
+E E G DR + LP QT+ + K A K P V+M+ A+ + N
Sbjct: 475 LEGEEMPVKIEGFTGGDRTSIKLPKIQTDFM-KALVAEKIPTVFVMMTGSAIATEWESQN 533
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
I +I+ Y G++ G AIADV+FG YNP G+LP+T+Y + + +P
Sbjct: 534 --IPAIVNAWYGGQDAGTAIADVLFGDYNPSGKLPVTFYAKD------SDLPAFNSYEMK 585
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
RTY++F+G V+YPFGYGLSYT+F+Y P ++D G N
Sbjct: 586 NRTYRYFNGEVLYPFGYGLSYTKFEYSPIQVPSTID---------------TGNNA---- 626
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAA 705
+ ++N GK++G EVV +Y P G + + G+ RV + A
Sbjct: 627 --------------KVSVSIKNTGKVEGEEVVQLYISYPDTKGQKPLYALKGFNRVSLKA 672
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
G+S V F ++ + L +VD+A +++G I +G
Sbjct: 673 GESKTVEFNLSP-RELGLVDDAGILKVSAGKRKIFIG 708
>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 262/703 (37%), Positives = 374/703 (53%), Gaps = 49/703 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD +RA L+ TL E + G+ GV RLGLP Y+ WSEALHG+ R
Sbjct: 69 CDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
S G + ATSFP ILTTA+ N +L +I +ST+ RA N G GL ++PN
Sbjct: 126 SDSGAY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN R P WGR ETPGED + YA Y+ G+Q D +S LK++A KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPESN-LKLAATAKHY 232
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A YD++NW + R D +T+QD+ E + F + + V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACAD 292
Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
L +R + F HGY+ SDCD+ I H + + ++ A A + AG D+DCG
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
Y ++ G ++ DI+ + LY L++ GYFD + Y++L +++
Sbjct: 352 YQWHLNESITAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDA 411
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
++ +AA QGIVLLKN N LPL + T+AL+GP ANAT ++GNY G
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
SP F +N+A G I + S AA+ AA++AD + G+D ++EAE D
Sbjct: 472 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALD 530
Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R + PG Q +LI K+A AA K P+ ++ M G VD + KNN K+ ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTKVSALLWGGYPGQSG 590
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G A+ D+I GK NP GRL T Y A+Y + P T M LRP + PG+TYK++ G VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLD-KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
G+GL YT F + +S+ + ++KL+ +D R + P
Sbjct: 651 GHGLFYTTFA-ESSSNTTTKEVKLNIQDILSRTHEELASITQLPV--------------L 695
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
F + N GK++ MV++ G A K ++G++R+
Sbjct: 696 NFTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRL 738
>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 798
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 259/743 (34%), Positives = 384/743 (51%), Gaps = 47/743 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA LV +T E V + +G PR+GLP Y+ W+EALHGV+
Sbjct: 57 LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVA- 115
Query: 69 IGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
H D G +TSFP I T A+ N +L +I +ST+ RA N G
Sbjct: 116 -----------HADFSDAGGFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAG 164
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
GL +SPNIN R P WGR ETPGED Y + YA Y+ G+Q GV D+
Sbjct: 165 RYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--GV------DAN 216
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
PLK+ A KHYA YD++NW+ + R D ++T+QD+ E + F + + V SVMCSY
Sbjct: 217 PLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSY 276
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
N VNG+P+C++ L +R ++F GY+ DC ++ + H + + + A A +
Sbjct: 277 NAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATN-ESSAAADSI 335
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKN 360
+AG D+DCG Y + +++ D++ + LY L+R GYFDG + Y+N+ +
Sbjct: 336 RAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTSPYRNITWS 395
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPL--NTGNIKTLALVGPHANATKAMIGNYEG 418
++ + L+ EAA Q IVLLKND G LPL + + KT+AL+GP ANAT M+GNY G
Sbjct: 396 DVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYG 454
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
SP+ F I Y G +++ A+ AK AD + G+D ++E
Sbjct: 455 PAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLE 514
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
E +DR ++ P Q LI K+AD K P+ ++ M G VD + KNN + +++W GYP
Sbjct: 515 TEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYP 573
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRPVNNFPGRTYKFFDGPV 597
G+ GG+A+AD+I GK P RL T Y A Y ++ P M LRP + PG+TY ++ G
Sbjct: 574 GQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSNPGQTYMWYTGTP 633
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VY FG+GL YT F ++S + K++ +I+ +G +P L++ +
Sbjct: 634 VYEFGHGLFYTNFTASASASSGT------KNRTSFNIDEVLG--RPHLGYKLVEQMPL-- 683
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERVFIAAGQSAKVGFTMN 716
F ++V+N G M + H K ++G++R+ SAK
Sbjct: 684 --LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPV 741
Query: 717 ACKSLKIVDNAANSLLASGAHTI 739
SL D N +L G + +
Sbjct: 742 TVDSLARTDEEGNRVLYPGRYEV 764
>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 760
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 270/743 (36%), Positives = 395/743 (53%), Gaps = 58/743 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD RA LV M EK+ + + + GV RLGL Y+WW+EALHGV+ R
Sbjct: 39 CDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWWNEALHGVAH--NR-- 94
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
G + E AT FP I T+A+F+++L ++IG +STEARA N G A L FW+PN
Sbjct: 95 ---GITWGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGRAHLDFWTPN 151
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N RDPRWGR ETPGED + ++A +V+G+Q HR + A CKHY
Sbjct: 152 VNPFRDPRWGRGHETPGEDAFKNKKWAEAFVKGMQGPGPT--HR--------VIATCKHY 201
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
AAYDL+N RF+FD++V+ QD+ E ++ PF+ C + V S+MCSYN VN IP CA+
Sbjct: 202 AAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNEIPACAN 261
Query: 255 PKLLNQTIRGDWNF---HGYIVSDCDSIQTIVES---HKFLNDTKEDAVARVLKAGLDLD 308
P L++ +R WN+ H YIVSDCD++ + + H++ + A+ L+AG D
Sbjct: 262 PYLMDTILRKHWNWTDEHQYIVSDCDAVYYLGNANGGHRY-KPSYAAAIGASLEAGCDNM 320
Query: 309 CGDYYTNFT----MGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNIC 363
C + T T A G+ ++ +DT++ L+ GYFDG Y+NL ++
Sbjct: 321 C--WATGGTAPDPASAFNSGQFSQTTLDTAILRQMQGLVLAGYFDGPGGMYRNLSVADVN 378
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNT-GNIKTLALVGPHANATKAMIGNYEGTPCR 422
+ A +AA GIVLLKND G LPL+ G+ +A++G ANA M+G Y G+P
Sbjct: 379 TQTAQDTALKAAEGGIVLLKND-GILPLSVNGSNFQVAMIGFWANAADKMLGGYSGSPPF 437
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
P+ + +NY G + Q N AA++AA+ ++A V G+D +VE E +
Sbjct: 438 NHDPVTAARSMGITVNYVNGP---LTQPNGDTSAALNAAQKSNAVVFFGGIDNTVEKESQ 494
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DR + P Q LI ++A+ K PV +V+ VD + P +++ILW GYPG++G
Sbjct: 495 DRTSIEWPSGQLALIRRLAETGK-PV-IVVRLGTHVDDTPLLSIPNVRAILWAGYPGQDG 552
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G A+ +I G +P GRLP T Y ++Y + P+T+M LRP +++PGRTY+++ V+PF
Sbjct: 553 GTAVVKIITGLASPAGRLPATVYPSSYTSQAPFTNMALRPSSSYPGRTYRWYSN-AVFPF 611
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT F V P S I D C D + +D +
Sbjct: 612 GHGLHYTNFSVSVRDFPASFAIA-DLLASCGD------------SVAYLDLCPFP----S 654
Query: 662 FQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
+ V N G V + + S G + IK + Y+RVF +V +S
Sbjct: 655 VSLNVTNTGTRVSDYVALGFLSGDFGPSPHPIKTLATYKRVFNIEPGETQVAELDWKLES 714
Query: 721 LKIVDNAANSLLASGAHTILVGE 743
L VD N +L G +T+LV +
Sbjct: 715 LVRVDEKGNRVLYPGTYTLLVDQ 737
>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
Length = 735
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 261/724 (36%), Positives = 394/724 (54%), Gaps = 66/724 (9%)
Query: 47 GVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLW 106
GV RLGL Y+WW+EALHGV+ R G + + AT FP I ++A+F++ L
Sbjct: 47 GVSRLGLSAYQWWNEALHGVAH--NR-----GITWGGQFSAATQFPQAITSSAAFDDHLI 99
Query: 107 KKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVR 166
++IG +STEARA N G A L FW+PN+N RDPRWGR ETPGED + ++A +V+
Sbjct: 100 ERIGVIISTEARAFANNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFRNKKWAEAFVQ 159
Query: 167 GLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILP 226
G+Q E HR + A CKHYAAYDL+N RF+FD++V+ QD+ E ++ P
Sbjct: 160 GMQGTEST--HR--------VIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPP 209
Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIV 283
F+ C + V S+MCSYN VNG+P CA P L++ +R WN+ + Y+VSDCD++ +
Sbjct: 210 FQQCARDSKVGSIMCSYNAVNGVPACASPYLMDTILRKHWNWTDQNQYVVSDCDAVYYLG 269
Query: 284 ES---HKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM----GAVQQGKIAEADIDTSLR 336
+ H++ + A+ L+AG D C + T T A + +A +D ++
Sbjct: 270 NANGGHRY-KSSYAAAIGASLEAGCDNMC--WATGGTTPDPASAFNSRQFTQATLDKAML 326
Query: 337 FLYIVLMRLGYFDG-SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGN 395
L++ GYFDG + Y+NL ++ + A +AA +GIVLLKNDN LPL G
Sbjct: 327 RQMQGLVKAGYFDGPNSLYRNLTAADVNTQVARDTALKAAEEGIVLLKNDN-ILPLTLGG 385
Query: 396 IKT-LALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMI 454
T +A++G ANA M+G Y G+P P+ + +NY G + Q N+
Sbjct: 386 SNTQVAMIGFWANAADKMLGGYSGSPPFSHDPVTAARSMGITVNYVNGP---LTQTNADT 442
Query: 455 PAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
AA++AA+ + + G+D +VE E +DR + P Q +I ++A K PV +V M
Sbjct: 443 SAAVNAAQKSSVVIFFGGIDNTVEKESQDRTSIAWPSGQLTMIQRLAQTGK-PVIVVRMG 501
Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIP 573
VD + P +K+ILW GYPG++GG A+ ++I G +P GRLP+T Y ++Y + P
Sbjct: 502 T-HVDDTPLLSIPNVKAILWAGYPGQDGGTAVMNLITGLASPAGRLPVTVYPSSYTNQAP 560
Query: 574 YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
YT+M LRP +++PGRTY+++ P V+PFG+GL YT F P + I D C+
Sbjct: 561 YTNMALRPSSSYPGRTYRWYKDP-VFPFGHGLHYTNFSVAPLDFPATFSIA-DLLASCKG 618
Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHI 692
+ Y P + + V N G VV+ + + G I
Sbjct: 619 VTYLELCPFP-----------------SVSVSVTNTGSRASDYVVLGFLAGDFGPTPRPI 661
Query: 693 KQVIGYERVF-IAAG--QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE-GVGGV 748
K + Y+RVF + G QSA++ + + +SL VD N +L G +T+L+ + + +
Sbjct: 662 KSLATYKRVFDVQPGKTQSAELDWKL---ESLARVDGKGNRVLYPGTYTLLLDQPTLANI 718
Query: 749 SFPL 752
+F L
Sbjct: 719 TFTL 722
>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39B_FAA]
gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39BFAA]
Length = 706
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 268/759 (35%), Positives = 384/759 (50%), Gaps = 114/759 (15%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
++A+ LV +MTL EK Q+ A V RLG+P Y +W+EALHGV+ G
Sbjct: 13 KKAEKLVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAGV----------- 61
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A F++ KK+G ++TE RA YN +A GLTFWSPN
Sbjct: 62 -----ATMFPQAIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPN 116
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ R + +V G+Q D +K +AC KHY
Sbjct: 117 VNIFRDPRWGRGHETYGEDPYLTSRLGVKFVEGIQ----------GDGPVMKAAACAKHY 166
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + + + R FD++ + +DM ET++ FE V E DV +VM +YNR NG P CA
Sbjct: 167 AVH---SGPESLRHEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAYNRTNGEPCCAH 223
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
L+ +RG W F G+ SDC +I+ E H + T + A L AG DL+CG+ Y
Sbjct: 224 KYLMEDVLRGKWKFEGHYTSDCWAIRDFHE-HHMVTSTPRQSAAMALNAGCDLNCGNTYL 282
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
+ MGA Q G + E I S L LG FDGS +Y + + + +HI+ A +
Sbjct: 283 HM-MGAYQDGLVTEEKITESAVRLLTTRYLLGLFDGS-EYDKIPYSVVECKEHIDEALKM 340
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
AR+ VLLKND G LP++ + T+ ++GP+A++ A+IGNY GT Y + ++G +
Sbjct: 341 ARKSCVLLKND-GVLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSSEYITVLEGIREEA 399
Query: 435 K---VINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
I Y+ GC + + + I A+ A+N+D ++ GL+ ++E E
Sbjct: 400 GDDVRILYSQGCDLYKDKVENLAWDQDRISEAVITAENSDVVILCVGLNETLEGEEGDTG 459
Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
D+VDL LP Q ELI KV K P +V+M+ A+D+N+A++N IL
Sbjct: 460 NSDASGDKVDLHLPKVQEELIEKVTAVGK-PTIVVLMAGSAIDLNYAQDN--CNGILLAW 516
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
YPG GGRAIAD++FGK +P G+LPIT+Y+ MP + RTY++ +
Sbjct: 517 YPGARGGRAIADLLFGKESPSGKLPITFYK------DLEGMPEFTDYSMKNRTYRYMEKE 570
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFGYGL+Y+ A V + D
Sbjct: 571 ALYPFGYGLTYSDTCVTEAEVVGEVSAESD------------------------------ 600
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSK----PPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
+ V+N G +D EVV VY K P + + G++RV + AG+ V
Sbjct: 601 ---IVLKATVKNNGTVDTDEVVQVYIKDLDSPLAVRNYSL---CGFKRVSLKAGEEKSVE 654
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
FT++ K++ IVD N +A G H L GVS P
Sbjct: 655 FTISN-KAMNIVDEDGNRYIA-GKHFRL----FAGVSQP 687
>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
Length = 805
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 270/746 (36%), Positives = 391/746 (52%), Gaps = 52/746 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD+ +RA LV TL E + G+ + GVPRLGLP Y+ WSEALHG +GR
Sbjct: 60 CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHG---LGRANF 116
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
+ G G SFP+ IL+ A+FN +L +I +ST+ RA N G GL +SPN
Sbjct: 117 TDNGALH----AGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPN 172
Query: 135 INVVRDPRWGRVLETPGEDPYVV-GRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
IN R P WGR ETPGED Y + YA Y+ G+Q E+ LK++A KH
Sbjct: 173 INTFRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQGGVNPEH--------LKLAATAKH 224
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+A YD++NW+ + R D +T+QD+ E + F + + V S MCSYN VNG+P+C+
Sbjct: 225 FAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCS 284
Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L +R ++F HGY+ DC ++ + H + + + A A + AG D+DCG
Sbjct: 285 NTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAAN-EPSAAADAILAGTDIDCGT 343
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG----SPQYKNLGKNNICNPQH 367
Y ++ G +A DI+ LY L+ LGYFDG S Y++LG ++
Sbjct: 344 SYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDA 403
Query: 368 IELAAEAARQGIVLLKNDNGALPLNT---GNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
++ EAA +GIVLLKND G LPL + G K++AL+GP ANAT + GNY G
Sbjct: 404 WNISYEAAVEGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLI 462
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
SP+D F A ++YAPG +I + + AA+ AA+ AD V + G+D ++EAE +DR
Sbjct: 463 SPVDAFTAAGYTVHYAPGT-EISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDR 521
Query: 485 VDLLLPGFQTELINKVA--DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
+ PG Q ELI+++A + P+ + M G VD + K N K+ ++LW GYPG+ G
Sbjct: 522 SSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSSLKFNAKVNALLWGGYPGQSG 581
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNNFPGRTYKFFDGPVVY 599
G A+ D++ G P GRL T Y A Y + M LRP PG+TY ++ G VY
Sbjct: 582 GLALRDILTGARAPAGRLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPVY 641
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
FG+GL YT F ASS ++ K YT AA +
Sbjct: 642 AFGHGLFYTTFN---ASSAQAAKTK-----------YTFNITDLTSAAHPDTTTVGQRTL 687
Query: 660 FTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAA--GQSAKVGFTM 715
F F + N G+ D +VY + G + K ++G++R+ A G +A++ +
Sbjct: 688 FNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVPV 747
Query: 716 NACKSLKIVDNAANSLLASGAHTILV 741
A L VD A N++L G + + +
Sbjct: 748 -AVDRLARVDEAGNTVLFPGRYEVAL 772
>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 475
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 207/450 (46%), Positives = 291/450 (64%), Gaps = 17/450 (3%)
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKN 356
V AGLDL+CG + T+ AVQ GK++E+D+D ++ + LMRLG+FDG P+ + N
Sbjct: 28 VAAAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGN 87
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
LG +++C P + ELA EAARQGIVLLKN G LPL+ +IK++A++GP+ANA+ MIGNY
Sbjct: 88 LGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNY 146
Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDL 475
EGTPC+YT+P+ G A + Y PGC ++ C NS+ + AA AA +AD TV+V G D
Sbjct: 147 EGTPCKYTTPLQGLGANVATV-YQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQ 205
Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
S+E E DR LLLPG Q +L++ VA+A+ GP LV+MS G DI+FAK++ KI +ILWV
Sbjct: 206 SIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWV 265
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFF 593
GYPGE GG AIADV+FG +NP GRLP+TWY ++ K+P T M +R P +PGRTY+F+
Sbjct: 266 GYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFY 325
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
G VY FG GLSYT F + + S+PK + ++L + C C +V +
Sbjct: 326 TGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLTEQ---------CPSVEAEGA 376
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
C+ F + V N G+ G V ++S PP + K ++G+E+V + GQ+ V F
Sbjct: 377 HCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLLGFEKVSLEPGQAGVVAF 436
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGE 743
++ CK L +VD N +A G+HT+ VG+
Sbjct: 437 KVDVCKDLSVVDELGNRKVALGSHTLHVGD 466
>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 261/702 (37%), Positives = 378/702 (53%), Gaps = 47/702 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD +RA L+ TL E + G+ GV RLGLP Y+ WSEALHG+ R
Sbjct: 69 CDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
S G++ ATSFP ILTTA+ N +L +I +ST+ RA N G GL ++PN
Sbjct: 126 SDSGSY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN R P WGR ETPGED + YA Y+ G+Q D DS LK++A KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPDSN-LKLAATAKHY 232
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A YD++NW + R D +T+QD+ E + F + + V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACAD 292
Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
L +R + F HGY+ SDCD+ I H + + ++ A A + AG D+DCG
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
Y ++ G ++ DI+ + LY L++ GYFD + Y++L +++
Sbjct: 352 YQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDA 411
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
++ +AA QGIVLLKN N LPL + T+AL+GP ANAT ++GNY G
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
SP F +N+A G I + S AA+ AA++AD + G+D ++EAE D
Sbjct: 472 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALD 530
Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R + PG Q +LI K+A +A P+ ++ M G VD + KNN + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSG 590
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G A+ D+I GK NP GRL T Y A+Y + P T M LRP + PG+TYK++ G VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT F + +S+ + ++KL+ +DI + A++ V
Sbjct: 651 GHGLFYTTFA-ESSSNTTTKEVKLN----IQDI---LSQTHEELASITQLPV------LN 696
Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
F ++N GK++ MV++ G A +K ++G++R+
Sbjct: 697 FTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRL 738
>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 705
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 255/749 (34%), Positives = 383/749 (51%), Gaps = 105/749 (14%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
E+A +LV +MTL EK Q+ A +PRLG+P Y WW+EALHGV+ G
Sbjct: 9 EKAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAGV----------- 57
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
ATSFP I A+F++ L K +G V+ E RA YN + GLTFWSPN
Sbjct: 58 -----ATSFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPN 112
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ R + YV GLQ + D +K +AC KH+
Sbjct: 113 VNIFRDPRWGRGHETYGEDPYLTSRLGVAYVEGLQGSQ--------DDDFMKTAACAKHF 164
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + + + R FD++ +++DM ET++ FE CV E V +VM +YNR NG P C
Sbjct: 165 AVH---SGPESVRHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGAYNRTNGEPCCGS 221
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
P L+ +R +W+F G+ VSDC +I H + T E++ A LK+G D++CG Y
Sbjct: 222 PTLIQNILREEWDFQGHYVSDCWAIADF-HMHHMVTKTPEESAALALKSGCDVNCGVTYL 280
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
+ + A QQG + E +I + L+ LG FD + +Y ++ + +H+ELA +
Sbjct: 281 HL-LKAYQQGLVTEEEITQAAERLFTTRFLLGCFDKN-EYDDIPYEVVECKEHLELAQKM 338
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---FY 431
A++ +VLLKND G LPLN +KT+ ++GP+A++ ++GNY GT RY + ++G F
Sbjct: 339 AKESMVLLKND-GILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTSSRYITLLEGIQDFV 397
Query: 432 AYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
+ Y+ GC + + I A+ A+++D V+ GLD ++E E
Sbjct: 398 GEDVRVYYSEGCHIYKDRVEGLGWKQDRISEALTVAEHSDVVVLCLGLDENLEGEEGDTG 457
Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
D+ DL LP Q EL+ VA K PV L +MS A+D+ FA + + +IL V
Sbjct: 458 NSYASGDKKDLELPESQRELLEAVAGCGK-PVVLCMMSGSAIDMQFAAEH--VNAILQVW 514
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
YPG GG+A A+++FG +P G+LP+T+Y+ P + GRTY++ +
Sbjct: 515 YPGARGGKAAAEILFGACSPSGKLPVTFYK------DLEGFPAFEDYSMKGRTYRYLEKE 568
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFGYGL+Y Q K A +V+ +
Sbjct: 569 PLYPFGYGLTYGQVCVKAAELTGAVE---------------------------------E 595
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSK---PPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
+ T + VEN GK D +V+ VY K H + ++RV + G+ A++
Sbjct: 596 GKELTIKAMVENSGKYDTDDVIQVYIKDLDSKNAVPNH--SLCAFKRVSLKKGEKAEILL 653
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
+ ++L VD + S + VG
Sbjct: 654 KV-PYEALMAVDEEGKKYVDSSHFVLSVG 681
>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
Length = 461
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 205/383 (53%), Positives = 267/383 (69%), Gaps = 12/383 (3%)
Query: 120 MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
MYN+G AGLTFWSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQ + D
Sbjct: 1 MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQQSD------D 54
Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
LKI+ACCKHY AYDLDNW+G DRFHF++ VT+QDM +TF PF+ CV +G+V+SV
Sbjct: 55 GSPDRLKIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASV 114
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MCSYN+VNG P CADP LL+ +RG+W +GYIVSDCDS+ S + T E+A A+
Sbjct: 115 MCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAK 173
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKN 356
+ AGLDL+CG + T AV+ G + E+ +D ++ + LMRLG+FDG+P Y
Sbjct: 174 AILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGK 233
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
LG ++C +H ELA EAARQGI+LLKN G+LPL+ IKTLA++GP+AN TK MIGNY
Sbjct: 234 LGPKDVCTLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNY 293
Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
EGTPC+YT+P+ G A Y GC+++ C + + I A A ADATV++ G+D S
Sbjct: 294 EGTPCKYTTPLQGLMALV-ATTYLSGCSNVAC-STAQIDEAKKIAAAADATVLIVGIDQS 351
Query: 477 VEAEGKDRVDLLLPGFQTELINK 499
+EAEG+DRV++ LPG Q LI +
Sbjct: 352 IEAEGRDRVNIQLPGQQPLLITE 374
>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
Length = 797
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 253/631 (40%), Positives = 358/631 (56%), Gaps = 31/631 (4%)
Query: 3 ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
+ I L D C+ + Y ERA+ L+ TL E + + A GVPRLGLP Y+ WSE
Sbjct: 52 DCINGPLKDNIVCNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEG 111
Query: 63 LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
LHG+ R N E ATSFP IL+ A+ N +L +I ++T+ARA N
Sbjct: 112 LHGLD----RAN---WAKSGEEWKWATSFPMPILSMAALNRTLINQIASIIATQARAFNN 164
Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDP-YVVGRYAINYVRGLQDVEGVEYHRDSD 181
+G GL ++PNIN R P WGR ETPGED ++ YA Y+ GLQ GV D
Sbjct: 165 VGRYGLDAYAPNINGFRSPLWGRGQETPGEDAGFLSSSYAYEYITGLQG--GV------D 216
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
LKI A KH+A YDL+NW N R FD+ +T+QD+ E + F S MC
Sbjct: 217 PEHLKIVATAKHFAGYDLENWNNNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMC 276
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
SYN VNG+P+C+ LL +R +W+F +GY+ SDCD+ + H + + A A
Sbjct: 277 SYNSVNGVPSCSSSFLLQTLLRENWDFPDYGYVSSDCDAAYNVFNPHGYAINISA-AAAD 335
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLG 358
L+AG D+DCG Y + + +G + +I+ SL LY L++LGYFDG+ +Y+ LG
Sbjct: 336 SLRAGTDIDCGQTYPWYLNQSFIEGSVTRGEIERSLIRLYSNLVKLGYFDGNQSEYRQLG 395
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
N++ ++ EAA +GIVLLKND G LPL+ +K++A++GP ANAT+ + GNY G
Sbjct: 396 WNDVVATDAWNISYEAAVEGIVLLKND-GVLPLSE-KLKSVAVIGPWANATQQLQGNYFG 453
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
+P+ +NYA G +I+ AA+ AAK +D + + G+D ++E
Sbjct: 454 PAPYLITPLQAARDAGYKVNYAFGT-NILGNTTDGFAAALSAAKKSDVIIYLGGIDNTIE 512
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
AEG DR+++ PG Q +LI +++ K P+ ++ M G VD + K+N + +++W GYP
Sbjct: 513 AEGTDRMNVTWPGNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSLKSNNNVNALVWGGYP 571
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRP-VNNFPGRTYKFFDGP 596
G+ GG+AI D++ GK P GRL T Y A Y + P T M LRP + PG+TY ++ G
Sbjct: 572 GQSGGKAIFDILSGKRAPAGRLVTTQYPAEYATQFPATDMNLRPDGKSNPGQTYIWYTGK 631
Query: 597 VVYPFGYGLSYTQFK---YKVASSPKSVDIK 624
VY FGY L YT FK K+ASS S DI
Sbjct: 632 PVYEFGYALFYTTFKETAEKLASS--SFDIS 660
>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 798
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 258/743 (34%), Positives = 383/743 (51%), Gaps = 47/743 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA LV +T E V + +G PR+GLP Y+ W+EALHGV+
Sbjct: 57 LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVA- 115
Query: 69 IGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
H D G +TSFP I T A+ N +L +I +ST+ RA N G
Sbjct: 116 -----------HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAG 164
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
GL +SPNIN R P WGR ETPGED Y + YA Y+ G+Q GV D+
Sbjct: 165 RYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--GV------DAN 216
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
PLK+ A KHYA YD++NW+ + R D ++T+QD+ E + F + + V SVMCSY
Sbjct: 217 PLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSY 276
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
N VNG+P+C++ L +R ++F GY+ DC ++ + H + + + A A +
Sbjct: 277 NAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATN-ESSAAADSI 335
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKN 360
+AG D+DCG Y + +++ D++ + LY L+R GYFDG + Y+N+ +
Sbjct: 336 RAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTSPYRNITWS 395
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPL--NTGNIKTLALVGPHANATKAMIGNYEG 418
++ + L+ EAA Q IVLLKND G LPL + + KT+AL+GP ANAT M+GNY G
Sbjct: 396 DVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYG 454
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
SP+ F I Y G +++ A+ AK AD + G+D ++E
Sbjct: 455 PAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLE 514
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
E +DR ++ P Q LI K+AD K P+ ++ M G VD + KNN + +++W GYP
Sbjct: 515 TEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYP 573
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRPVNNFPGRTYKFFDGPV 597
G+ GG+A+AD+I GK P RL T Y A Y ++ P M LRP + PG+TY ++ G
Sbjct: 574 GQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSNPGQTYMWYTGTP 633
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VY FG+GL YT F ++ + K++ +I+ +G +P L++ +
Sbjct: 634 VYEFGHGLFYTNFTASASAGSGT------KNRTSFNIDEVLG--RPHPGYKLVEQMPL-- 683
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERVFIAAGQSAKVGFTMN 716
F ++V+N G M + H K ++G++R+ SAK
Sbjct: 684 --LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPV 741
Query: 717 ACKSLKIVDNAANSLLASGAHTI 739
SL D N +L G + +
Sbjct: 742 TVDSLARTDEEGNRVLYPGRYEV 764
>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
Length = 795
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 275/759 (36%), Positives = 398/759 (52%), Gaps = 57/759 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD+ Y ERA+ L+ TL E + + GVPRLGLP Y+ W+EALHG+ R
Sbjct: 63 CDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLD---RANF 119
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
+ G F ATSFP IL+ A+ N +L +I +ST+ARA N G GL ++PN
Sbjct: 120 ATKGGQFQ----WATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPN 175
Query: 135 INVVRDPRWGRVLETPGEDPYVV-GRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
IN R P WGR ETPGED V+ Y Y+ G+Q GV D LKI+A KH
Sbjct: 176 INGFRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG--GV------DPENLKIAATAKH 227
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+A YDL+NW R FD+ +T+QD+ E + F S MC+YN VNG+P+CA
Sbjct: 228 FAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCA 287
Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L +R W F GY+ SDCD++ + H + + A A L+AG D+DCG
Sbjct: 288 NSFFLQTLLRESWGFPEWGYVSSDCDAVYNVWNPHDYA-SNQSSAAASSLRAGTDIDCGQ 346
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELA 371
Y + G+++ +I+ S+ LY L+RLGYFD +Y++LG ++ ++
Sbjct: 347 TYPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLGWKDVVKTDAWNIS 406
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
EAA +GIVLLKND G LPL+ ++++AL+GP ANAT M GNY G SP++
Sbjct: 407 YEAAVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYFGAAPYLISPLEAAK 464
Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPG 491
+N+ G + + + AI AAK +DA + G+D +VE EG DR D+ PG
Sbjct: 465 KAGYQVNFELGT-ETASTSTAGFAKAIAAAKKSDAIIFAGGIDNTVEQEGADRTDIAWPG 523
Query: 492 FQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF 551
Q +LI ++++ K P+ ++ M G VD + K+N K+ S++W GYPG+ GG A+ D++
Sbjct: 524 NQLDLIKQLSELGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILS 582
Query: 552 GKYNPGGRLPITWYEANYV-KIPYTSMPLRP-VNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
GK P GRL T Y A+YV + P M LRP + PG+TY ++ G VY FG G+ YT
Sbjct: 583 GKRAPAGRLVSTQYPADYVHQFPQNDMNLRPDGKSNPGQTYIWYTGKPVYQFGDGIFYTT 642
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
FK ++ S K + + YT P TF +EN
Sbjct: 643 FKETLSGSSKGLKFNVSSVLAAPHPGYTYSEQTP---------------VLTFTANIENS 687
Query: 670 GKMDG--SEVVMVYSKPPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKSLKIVDN 726
GK D S ++ V + G A K ++G++R+ I G S+K+ + +L VD+
Sbjct: 688 GKTDSPYSAMLFVRTANAGPAPYPNKWLVGFDRLATIKPGHSSKLSIPI-PVSALARVDS 746
Query: 727 AANSLLASGAHTI-------------LVGEGVGGVSFPL 752
N ++ G + + LVGE V ++PL
Sbjct: 747 LGNRIVYPGKYELALNTDESIKLEFELVGEEVTIENWPL 785
>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
Length = 804
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 262/702 (37%), Positives = 378/702 (53%), Gaps = 47/702 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD +RA L+ TL E + G+ GV RLGLP Y+ WSEALHG+ R
Sbjct: 69 CDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
S G++ ATSFP ILTTA+ N +L +I +ST+ RA N G GL ++PN
Sbjct: 126 SDLGSY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN R P WGR ETPGED + YA Y+ G+Q D DS LK++A KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAIYAYEYITGIQG-------PDPDSN-LKLAATAKHY 232
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A YD++NW + R D +T+QD+ E + F + + V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACAD 292
Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
L +R + F HGY+ SDCD+ I H + + ++ A A + AG D+DCG
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
Y ++ G ++ DI+ + LY L++ GYFD + Y++L +++
Sbjct: 352 YQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDA 411
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
++ +AA QGIVLLKN N LPL + T+AL+GP ANAT ++GNY G
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
SP F +N+A G I + S AA+ AA++AD + G+D ++EAE D
Sbjct: 472 ISPRAAFEEAGYNVNFAEGTG-ISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALD 530
Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R + PG Q +LI K+A +A P+ ++ M G VD + KNN + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSG 590
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G A+ D+I GK NP GRL T Y A+Y + P T M LRP + PG+TYK++ G VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT F + +S+ + +IKL+ +DI + A++ V
Sbjct: 651 GHGLFYTTFA-ESSSNTTTREIKLN----IQDI---LSQTHEDLASITQLPV------LN 696
Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
F ++N GK++ MV++ G A +K ++G++R+
Sbjct: 697 FTANIKNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRL 738
>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 793
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 249/614 (40%), Positives = 349/614 (56%), Gaps = 26/614 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA LV TL E V G+ GVPRLGLP Y+ WSE+LHGV
Sbjct: 57 LSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGV-- 114
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
R N + + ATSFP ILT A+ N +L +IG +ST+ARA N+G GL
Sbjct: 115 --YRANWAS----EGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGL 168
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPY-VVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
++PNIN R P WGR ETPGED Y + YA Y+ G+Q GV D LK+
Sbjct: 169 DTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG--GV------DPETLKL 220
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
A KHYA YD++NW+G+ R D ++T+QD+ E + F + + V SVMCSYN VN
Sbjct: 221 VATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVN 280
Query: 248 GIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G+P+C++ L +R + F GY+ DC ++ H++ + + A A ++AG
Sbjct: 281 GVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAAN-ESSASADSIRAGT 339
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICN 364
D+DCG Y A +G+I+ DI+ + LY L+RLGYFDG S QY++L +++
Sbjct: 340 DIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQYRDLTWSDVQT 399
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
++ EAA +G VLLKND G LPL +I+++AL+GP ANAT M GNY G T
Sbjct: 400 TDAWNISHEAAVEGTVLLKND-GTLPL-ADSIRSVALIGPWANATTQMQGNYYGPAPYLT 457
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
SP+ A ++YA G +I + A+ AA+ ADA + G+D ++E E DR
Sbjct: 458 SPLAALEASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDR 516
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
+++ PG Q +LIN+++ K P+ ++ M G VD + K+N + ++LW GYPG+ GG
Sbjct: 517 MNITWPGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGT 575
Query: 545 AIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
A+ D+I G P GRL T Y A Y + P M LRP PG+TY ++ G VY FG+
Sbjct: 576 ALLDIIRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGTNPGQTYMWYTGTPVYEFGH 635
Query: 604 GLSYTQFKYKVASS 617
GL YT F+ K AS+
Sbjct: 636 GLFYTTFEAKRAST 649
>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 260/748 (34%), Positives = 384/748 (51%), Gaps = 104/748 (13%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
E+AK+LV +MTL EK +Q+ + V RL +P Y WW+E LHGV+ G
Sbjct: 14 EKAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAGT----------- 62
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A F++ L I + +STE RA YN + G+TFWSPN
Sbjct: 63 -----ATVFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPN 117
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ R + +V+GLQ + + LK +AC KH+
Sbjct: 118 VNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GEGKYLKAAACAKHF 167
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + EG R FD+ V+++D+ ET++ FE CV EGDV +VM +YNR NG P C
Sbjct: 168 AVHS--GPEGL-RHEFDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAYNRTNGEPCCGS 224
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
LL +RG WNF G++VSDC +I H+ + T ++ A +K G DL+CG+ Y
Sbjct: 225 KTLLRDILRGKWNFKGHVVSDCWAIADFHLHHR-VTSTATESAALAMKNGCDLNCGNVYL 283
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG-KNNICNPQHIELAAE 373
+ A ++G + E DI T+ L +RLG FD +Y + + N C +H EL+ +
Sbjct: 284 QLLL-AYKEGLVTEEDITTAAERLMATRIRLGMFDEECEYNKIPYELNDCK-EHNELSLK 341
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY-- 431
AAR +VLLKN NG LPLN N+K++A++GP+A++ + GNY GT RY + ++G +
Sbjct: 342 AARNSMVLLKN-NGILPLNKNNLKSIAVIGPNADSQIMLKGNYSGTASRYITVLEGIHEA 400
Query: 432 -AYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE---- 480
+ Y+ GC + + + N + AI A+ +D ++ GLD ++E E
Sbjct: 401 VGEDVRVYYSEGCHLFRDRVEELAEPNDRLKEAISIAERSDVAILCLGLDSTIEGEQGDA 460
Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
D+ L LPG Q EL+ K+ + PV LVI + A+ N A++ K +IL
Sbjct: 461 GNSEGAGDKASLNLPGRQQELLEKIIETGT-PVILVIGAGSALTFNNAED--KCSAILDA 517
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
YPG GGRA+AD+IFGK +P G+LPIT+Y +P + RTY++
Sbjct: 518 WYPGSRGGRAVADLIFGKCSPSGKLPITFYRNT------KDLPEFIDYSMKDRTYRYMSC 571
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+YPFGYGL+Y+ K P D+K D +DV+
Sbjct: 572 ESLYPFGYGLTYSTVKLSELHVP---DVKSD-----------------------FEDVE- 604
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+++ N G D EV+ Y K + G++RV + G+S K+
Sbjct: 605 ------VSVKITNTGNFDIEEVIQCYIKDLESKYAVRNHSLAGFKRVRLKIGES-KIAKM 657
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
S ++V++ +L S + VG
Sbjct: 658 KIKKSSFEVVNDDGERILDSKRFKLFVG 685
>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
1015]
Length = 804
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 262/702 (37%), Positives = 376/702 (53%), Gaps = 47/702 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD +RA L+ TL E + G+ GV RLGLP Y+ WSEALHG+ R
Sbjct: 69 CDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
S G + ATSFP ILTTA+ N +L +I +ST+ RA N G GL ++PN
Sbjct: 126 SDSGAY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN R P WGR ETPGED + YA Y+ G+Q D +S LK++A KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPESN-LKLAATAKHY 232
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A YD++NW + R D +T+QD+ E + F + + V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACAD 292
Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
L +R + F HGY+ SDCD+ I H + + ++ A A + AG D+DCG
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
Y ++ G ++ DI+ + LY L++ GYFD + Y++L +++
Sbjct: 352 YQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDA 411
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
++ +AA QGIVLLKN N LPL + T+AL+GP ANAT ++GNY G
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
SP F +N+A G I + S AA+ AA++AD + G+D ++EAE D
Sbjct: 472 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALD 530
Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R + PG Q +LI K+A AA K P+ ++ M G VD + KNN + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSG 590
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G A+ D+I GK NP GRL T Y A+Y + P T M LRP + PG+TYK++ G VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT F + +S+ + ++KL+ +DI + A++ V
Sbjct: 651 GHGLFYTTFA-ESSSNTTTKEVKLN----IQDI---LSQTHEDLASITQLPV------LN 696
Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
F + N GK++ MV++ G A K ++G++R+
Sbjct: 697 FTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRL 738
>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
Length = 810
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 262/702 (37%), Positives = 376/702 (53%), Gaps = 47/702 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD +RA L+ TL E + G+ GV RLGLP Y+ WSEALHG+ R
Sbjct: 69 CDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
S G + ATSFP ILTTA+ N +L +I +ST+ RA N G GL ++PN
Sbjct: 126 SDSGAY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN R P WGR ETPGED + YA Y+ G+Q D +S LK++A KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPESN-LKLAATAKHY 232
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A YD++NW + R D +T+QD+ E + F + + V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACAD 292
Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
L +R + F HGY+ SDCD+ I H + + ++ A A + AG D+DCG
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
Y ++ G ++ DI+ + LY L++ GYFD + Y++L +++
Sbjct: 352 YQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDA 411
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
++ +AA QGIVLLKN N LPL + T+AL+GP ANAT ++GNY G
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
SP F +N+A G I + S AA+ AA++AD + G+D ++EAE D
Sbjct: 472 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALD 530
Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R + PG Q +LI K+A AA K P+ ++ M G VD + KNN + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSG 590
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G A+ D+I GK NP GRL T Y A+Y + P T M LRP + PG+TYK++ G VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT F + +S+ + ++KL+ +DI + A++ V
Sbjct: 651 GHGLFYTTFA-ESSSNTTTKEVKLN----IQDI---LSQTHEDLASITQLPV------LN 696
Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
F + N GK++ MV++ G A K ++G++R+
Sbjct: 697 FTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRL 738
>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
Length = 778
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 260/702 (37%), Positives = 378/702 (53%), Gaps = 47/702 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD +RA L+ TL E + G+ GV RLGLP Y+ WSEALHG+ R
Sbjct: 43 CDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 99
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
S G++ ATSFP ILTTA+ N +L +I +ST+ RA N G GL ++PN
Sbjct: 100 SDSGSY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 154
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN R P WGR ETPGED + YA Y+ G+Q D DS LK++A KHY
Sbjct: 155 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPDSN-LKLAATAKHY 206
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A YD++NW + R D +T+QD+ E + F + + V SVMC+YN V+G+P CAD
Sbjct: 207 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPACAD 266
Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
L +R + F HGY+ SDCD+ I H + + ++ A A + AG D+DCG
Sbjct: 267 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 325
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
Y ++ G ++ DI+ + LY L++ GYFD + Y++L +++
Sbjct: 326 YQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDA 385
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
++ +AA QGIVLLKN N LPL + T+AL+GP ANAT ++GNY G
Sbjct: 386 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 445
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
SP F +N+A G I + S AA+ AA++AD + G+D ++EAE D
Sbjct: 446 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALD 504
Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R + PG Q +LI K+A +A P+ ++ M G VD + KNN + ++LW GYPG+ G
Sbjct: 505 RESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSG 564
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G A+ D+I GK NP GRL T Y A+Y + P T M LRP + PG+TYK++ G VY F
Sbjct: 565 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 624
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT F + +S+ + ++KL+ +DI + A++ V
Sbjct: 625 GHGLFYTTFA-ESSSNTTTKEVKLN----IQDI---LSQTHEELASITQLPV------LN 670
Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
F ++N GK++ MV++ G A +K ++G++R+
Sbjct: 671 FTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRL 712
>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
Length = 798
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 259/743 (34%), Positives = 385/743 (51%), Gaps = 47/743 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA LV +T E V + +G PR+GLP Y+ W+EALHGV+
Sbjct: 57 LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVA- 115
Query: 69 IGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
H D G +TSFP I T A+ N +L +I +ST+ RA N G
Sbjct: 116 -----------HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAG 164
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
GL +SPNIN R P WGR ETPGED Y + YA Y+ G+Q GV D+
Sbjct: 165 RYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--GV------DAN 216
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
PLK+ A KHYA YD++NW+ + R D ++T+QD+ E + F + + V SVMCSY
Sbjct: 217 PLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSY 276
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
N VNG+P+C++ L +R ++F GY+ DC ++ + H + + + A A +
Sbjct: 277 NAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATN-ESSAAADSI 335
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKN 360
+AG D+DCG Y + +++ D++ + LY L+R GYFDG + Y+N+ +
Sbjct: 336 RAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTSPYRNITWS 395
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPL--NTGNIKTLALVGPHANATKAMIGNYEG 418
++ + L+ EAA Q IVLLKND G LPL + + KT+AL+GP ANAT M+GNY G
Sbjct: 396 DVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYG 454
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
SP+ F I Y G +++ A+ AK AD + G+D ++E
Sbjct: 455 PAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLE 514
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
E +DR ++ P Q LI K+AD K P+ ++ M G VD + KNN + +++W GYP
Sbjct: 515 TEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYP 573
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRPVNNFPGRTYKFFDGPV 597
G+ GG+A+AD+I GK P RL T Y A Y ++ P M LRP + PG+TY ++ G
Sbjct: 574 GQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSNPGQTYMWYTGTP 633
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VY FG+GL YT F ++ + K++ +I+ +G +P L++ +
Sbjct: 634 VYEFGHGLFYTNFTASASAGSGT------KNRTSFNIDEVLG--RPHPGYKLVEQMPL-- 683
Query: 658 YKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
F ++V+N G M + + G A K ++G++R+ SAK
Sbjct: 684 --LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPV 741
Query: 717 ACKSLKIVDNAANSLLASGAHTI 739
SL D N +L G + +
Sbjct: 742 TVDSLARTDEEGNRVLYPGRYEV 764
>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
Length = 798
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 259/743 (34%), Positives = 385/743 (51%), Gaps = 47/743 (6%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA LV +T E V + +G PR+GLP Y+ W+EALHGV+
Sbjct: 57 LSKTLVCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVA- 115
Query: 69 IGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
H D G +TSFP I T A+ N +L +I +ST+ RA N G
Sbjct: 116 -----------HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAG 164
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSR 183
GL +SPNIN R P WGR ETPGED Y + YA Y+ G+Q GV D+
Sbjct: 165 RYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--GV------DAN 216
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
PLK+ A KHYA YD++NW+ + R D ++T+QD+ E + F + + V SVMCSY
Sbjct: 217 PLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSY 276
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
N VNG+P+C++ L +R ++F GY+ DC ++ + H + + + A A +
Sbjct: 277 NAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATN-ESSAAADSI 335
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKN 360
+AG D+DCG Y + +++ D++ + LY L+R GYFDG + Y+N+ +
Sbjct: 336 RAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTSPYRNITWS 395
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPL--NTGNIKTLALVGPHANATKAMIGNYEG 418
++ + L+ EAA Q IVLLKND G LPL + + KT+AL+GP ANAT M+GNY G
Sbjct: 396 DVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYG 454
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
SP+ F I Y G +++ A+ AK AD + G+D ++E
Sbjct: 455 PAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLE 514
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
E +DR ++ P Q LI K+AD K P+ ++ M G VD + KNN + +++W GYP
Sbjct: 515 TEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNALIWGGYP 573
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRPVNNFPGRTYKFFDGPV 597
G+ GG+A+AD+I GK P RL T Y A Y ++ P M LRP + PG+TY ++ G
Sbjct: 574 GQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSNPGQTYMWYTGTP 633
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
VY FG+GL YT F ++ + K++ +I+ +G +P L++ +
Sbjct: 634 VYEFGHGLFYTNFTASASAGSGT------KNRTSFNIDEVLG--RPHPGYKLVEQMPL-- 683
Query: 658 YKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
F ++V+N G M + + G A K ++G++R+ SAK
Sbjct: 684 --LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPGSAKTMVIPV 741
Query: 717 ACKSLKIVDNAANSLLASGAHTI 739
SL D N +L G + +
Sbjct: 742 TVDSLARTDEEGNRVLYPGRYEV 764
>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
Length = 804
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 260/702 (37%), Positives = 378/702 (53%), Gaps = 47/702 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD +RA L+ TL E + G+ GV RLGLP Y+ WSEALHG+ R
Sbjct: 69 CDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANF 125
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
S G++ ATSFP ILTTA+ N +L +I +ST+ RA N G GL ++PN
Sbjct: 126 SDSGSY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN R P WGR ETPGED + YA Y+ G+Q D DS LK++A KHY
Sbjct: 181 INTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPDSN-LKLAATAKHY 232
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A YD++NW + R D +T+QD+ E + F + + V SVMC+YN V+G+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPACAD 292
Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
L +R + F HGY+ SDCD+ I H + + ++ A A + AG D+DCG
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
Y ++ G ++ DI+ + LY L++ GYFD + Y++L +++
Sbjct: 352 YQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDA 411
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
++ +AA QGIVLLKN N LPL + T+AL+GP ANAT ++GNY G
Sbjct: 412 WNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
SP F +N+A G I + S AA+ AA++AD + G+D ++EAE D
Sbjct: 472 ISPRAAFEEAGYKVNFAEGTG-ISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALD 530
Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R + PG Q +LI K+A +A P+ ++ M G VD + KNN + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSG 590
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G A+ D+I GK NP GRL T Y A+Y + P T M LRP + PG+TYK++ G VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT F + +S+ + ++KL+ +DI + A++ V
Sbjct: 651 GHGLFYTTFA-ESSSNTTTKEVKLN----IQDI---LSQTHEELASITQLPV------LN 696
Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
F ++N GK++ MV++ G A +K ++G++R+
Sbjct: 697 FTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRL 738
>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 762
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 275/756 (36%), Positives = 399/756 (52%), Gaps = 85/756 (11%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ CDA L P+RA LV MT EK+Q + + G PR+GLP Y WWSEALHGV++
Sbjct: 43 LASLKVCDATLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVAY 102
Query: 69 IGRRTNSPPGTHFDS---EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
PGT F S +TSFP +L A+F++ L +++G+ + E RA N G
Sbjct: 103 A-------PGTQFRSGNGTFNSSTSFPMPLLMAATFDDELIERVGEVIGIEGRAFGNAGF 155
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+G +W+PN+N +DPRWGR ETPGED + RYA + +RGL EG R+
Sbjct: 156 SGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGL---EGPVRERER----- 207
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+I A CKHYAA D ++W G+ R F+++VT QD+ E ++ PF+ C + V S+MCSYN
Sbjct: 208 RIVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNA 267
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
VNG+P CA+ L+ +R WN+ YI SDC+++ I +H + T + A +
Sbjct: 268 VNGVPACANTYLMQTILRDHWNWTAPGNYITSDCEAVLDISANHHYAK-TNAEGTALAFE 326
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNN 361
AG+D C ++ +GA QG + ++ +D +LR LY L+++GYFDG+ +Y +LG N+
Sbjct: 327 AGIDSSCEYEGSSDILGAWTQGLLKQSTVDRALRRLYEGLVQVGYFDGNRSEYASLGWNH 386
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPL---NTGNIKTLALVGPHANATKAMIGNYEG 418
+ P+ E+A +AA +GIVLLKND LPL G LA++G AN K + G Y G
Sbjct: 387 VNRPKSQEVALQAAVEGIVLLKNDK-TLPLGVKKNGPKLKLAMIGFWANDPKTLSGGYSG 445
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGCADIVCQN----NSMIPAAIDAAKNADATVIVAGLD 474
TP SP+ A + A G V QN ++ AA+ AAK+A+ + G D
Sbjct: 446 TPAFEHSPVYATQAMGFKVTTAGGP---VLQNSTSKDTWTQAALAAAKDANYILYFGGQD 502
Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
S E KDR + P Q +LI ++ K P+ +V M +D + I SILW
Sbjct: 503 TSAAGETKDRTTINWPEAQLQLITDLSKLGK-PLVVVQM-GDQLDNTPLLASKAINSILW 560
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFF 593
+P P GRLP+T Y ANY +P T M LRP + PGRTY+++
Sbjct: 561 ANWP----------------VPAGRLPVTQYHANYTAAVPMTDMTLRPSDKLPGRTYRWY 604
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
P V PFG+GL YT FK K+ P+ IK D +C + Y PP
Sbjct: 605 PTP-VQPFGFGLHYTTFKTKIVRLPR-FAIK-DLLSRCGNA-YPDTCGLPP--------- 651
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYERVF-IAAGQ-- 707
++EV N GK VV+ + K G G IK ++ Y R+ ++ G+
Sbjct: 652 --------LKVEVTNTGKRSSDYVVLAFLK--GDVGPKPYPIKTLVSYTRLRDLSPGRKT 701
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+A + +T+ + D N++L G +T++V E
Sbjct: 702 TAHLDWTLG---DIARYDEQGNTVLYPGTYTVIVDE 734
>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
Length = 776
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 249/614 (40%), Positives = 349/614 (56%), Gaps = 26/614 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA LV TL E V G+ GVPRLGLP Y+ WSE+LHGV
Sbjct: 75 LSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGV-- 132
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
R N + + ATSFP ILT A+ N +L +IG +ST+ARA N+G GL
Sbjct: 133 --YRANWAS----EGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGL 186
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPY-VVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
++PNIN R P WGR ETPGED Y + YA Y+ G+Q GV D LK+
Sbjct: 187 DTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG--GV------DPETLKL 238
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
A KHYA YD++NW+G+ R D ++T+QD+ E + F + + V SVMCSYN VN
Sbjct: 239 VATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVN 298
Query: 248 GIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G+P+C++ L +R + F GY+ DC ++ H++ + + A A ++AG
Sbjct: 299 GVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAAN-ESSASADSIRAGT 357
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICN 364
D+DCG Y A +G+I+ DI+ + LY L+RLGYFDG S QY++L +++
Sbjct: 358 DIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQYRDLTWSDVQT 417
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
++ EAA +G VLLKND G LPL +I+++AL+GP ANAT M GNY G T
Sbjct: 418 TDAWNISHEAAVEGTVLLKND-GTLPL-ADSIRSVALIGPWANATTQMQGNYYGPAPYLT 475
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
SP+ A ++YA G +I + A+ AA+ ADA + G+D ++E E DR
Sbjct: 476 SPLAALEASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDR 534
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
+++ PG Q +LIN+++ K P+ ++ M G VD + K+N + ++LW GYPG+ GG
Sbjct: 535 MNITWPGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGT 593
Query: 545 AIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
A+ D+I G P GRL T Y A Y + P M LRP PG+TY ++ G VY FG+
Sbjct: 594 ALLDIIRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGTNPGQTYMWYTGTPVYEFGH 653
Query: 604 GLSYTQFKYKVASS 617
GL YT F+ K AS+
Sbjct: 654 GLFYTTFEAKRAST 667
>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
16/4]
Length = 713
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 257/751 (34%), Positives = 390/751 (51%), Gaps = 107/751 (14%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RAK+LV +MT+ EK QM A + RLG+P Y WW+EALHGV+ G
Sbjct: 7 KRAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAG------------ 54
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A+F+E L +K+ STE RA YN GLT+W+PN
Sbjct: 55 ----DATVFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPN 110
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ G+ + YVRGLQ D P K +AC KH+
Sbjct: 111 VNIFRDPRWGRGHETYGEDPYLTGQLGMAYVRGLQG--------DDLDNP-KSAACAKHF 161
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + E R HFD++V +QD+ +T++ F+ V + V +VM +YNRVNG P C
Sbjct: 162 AVHSGPEAE---RHHFDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAYNRVNGEPACGS 218
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
+LL +RGDW F G++VSDC +I+ E+HK E A A + G DL+CG Y
Sbjct: 219 KRLLKDILRGDWGFEGHVVSDCWAIRDFHENHKVTGCEVESA-ALAVNNGCDLNCGCVYE 277
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF-DGSPQYKNLGKNNICNPQHIELAAE 373
+ A + + E I S+ L + +RLG + +Y ++ + +H ELA E
Sbjct: 278 KL-LYAYKANLVTEETITESVERLIELRLRLGTLPERRSKYDDIPYEVVECKEHKELAIE 336
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA++ +VLLKND G LPL IKT+ ++GP++N+ A++GNYEG Y + ++G Y
Sbjct: 337 AAKRSMVLLKND-GLLPLKKDEIKTIGVIGPNSNSRMALVGNYEGISSEYITVLEGIQQY 395
Query: 434 ---SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE---- 480
+ ++ G ++ + A+ A+++D V+ GLD ++E E
Sbjct: 396 VGDDVRVFHSDGTPLWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMGLDSTIEGEEGDA 455
Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
D+ L LPG Q EL+ K+ K PV L++++ A+D+++A N + +I+
Sbjct: 456 GNEFGSGDKKGLKLPGLQQELLEKITAIGK-PVVLLVLAGSAMDLSWANEN--VNAIMHC 512
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
YPG GG+AIA V+FG+ +P G+LP+T+Y+++ P+ + GRTY++F G
Sbjct: 513 WYPGARGGKAIAQVLFGEDSPSGKLPLTFYKSDADLPPFEDYSME------GRTYRYFKG 566
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+YPFGYGLSY+ +Y A I+ T G I D
Sbjct: 567 TPLYPFGYGLSYSDIQYSNAG-----------------IDKTEGA---------IGD--- 597
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSK----PPGIAGTHIKQVIGYERVFIAAGQSAKV 711
KFT ++ V+N G E V VY K +A ++++ +V + G+S +V
Sbjct: 598 ---KFTVKVTVKNAGDYKAHETVQVYVKDVEASTRVANCSLRKIA---KVELLPGESKEV 651
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++A + I+D + ++ G + VG
Sbjct: 652 SLELSA-RDFAIIDEKGHCIVEPGKFKVFVG 681
>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 730
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 226/617 (36%), Positives = 347/617 (56%), Gaps = 68/617 (11%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
E+A+ LV++MTL EKV Q + A + RLG+ Y WW+E LHGV+ G
Sbjct: 23 EKAEYLVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNEGLHGVARAGV----------- 71
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
AT FP I A+F+E L + +G+ VSTEARA Y++ GLT W+PN
Sbjct: 72 -----ATIFPQAIGLAATFDEDLIETVGEAVSTEARAKYHMQQRYGDTDIYKGLTLWAPN 126
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN+ RDPRWGR ET GEDP++ R I Y+RGLQ S + LK +AC KH+
Sbjct: 127 INIFRDPRWGRGHETYGEDPWLTSRLGIRYIRGLQG---------SHEKYLKTAACVKHF 177
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + R FD+ V+E+D++ET++ FE CV +GDV +VM +YNRVNG+P C +
Sbjct: 178 AVHSGPE---ELRHSFDAEVSEKDLRETYLPAFEACVKDGDVEAVMGAYNRVNGVPCCGN 234
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
LL +R +W FHG++VSDC +I+ E H + D+ ++V+ + G DL+CG+ +T
Sbjct: 235 EYLLETILRKEWGFHGHVVSDCWAIKDFHEGHG-VTDSPVESVSMAMNHGCDLNCGNLFT 293
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELA 371
+ + AV++GK+ E +D ++ L+ ++LG + Y + + +P +L
Sbjct: 294 -YLIQAVKEGKVKEERLDEAVIRLFTTRLKLGALGKMEEDDPYAGISYLEVDSPAMKKLN 352
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
AA + +VLLKN G LP++T KT+ ++GP+A++ +A++GNYEGT Y + ++G
Sbjct: 353 RSAAGKSVVLLKNTEGLLPIDTKRYKTIGVIGPNADSRRALVGNYEGTASEYVTVLEGIR 412
Query: 432 AYSK---VINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE-- 480
++ + Y+ GC + N + + +D + GLD ++E E
Sbjct: 413 EAAEPEARVLYSEGCHLYKSNVSGLGARNDRLSEVKGICRESDIVIACMGLDSTLEGEQG 472
Query: 481 -------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
G D+ DL+LPG Q +++ D+ K PV LV+++ A+ + +A + + +IL
Sbjct: 473 DTGNIYAGGDKPDLMLPGLQQKILETAYDSGK-PVVLVLLAGSAMAVTWADEH--LPAIL 529
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
YPG EGGR +ADV+FG NP GRLP+T+Y +T+ + GRTY+F
Sbjct: 530 TAWYPGAEGGRGVADVLFGTVNPEGRLPVTFYRTTEELPDFTNYSME------GRTYRFM 583
Query: 594 DGPVVYPFGYGLSYTQF 610
+YPFG+GLSYT+F
Sbjct: 584 KQKALYPFGFGLSYTEF 600
>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
Length = 717
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 264/756 (34%), Positives = 396/756 (52%), Gaps = 97/756 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D + ++A++LV++MTL EK Q+ A +PRL +P Y WW+E+LHGV+ G
Sbjct: 5 DVRKRARKQAEELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGGT---- 60
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------NAG 127
AT FP I ASF+ + ++IG+ ++ E RA YN G
Sbjct: 61 ------------ATVFPQAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKG 108
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
LTFW+PN+N+ RDPRWGR ET GEDPY+ R ++Y+RGLQ D +K
Sbjct: 109 LTFWAPNVNIFRDPRWGRGHETYGEDPYLSSRLGVSYIRGLQ----------GDGETMKA 158
Query: 188 SACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+AC KH+A + G + R FD+ V+E+D++ET++ F+ CV EG V +VM +YN
Sbjct: 159 AACAKHFAVHS-----GPEALRHEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAYNC 213
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG P C LL + +R +W F G++VSDC +I+ E+H + T + A ++AG
Sbjct: 214 VNGEPCCGSETLLKKILREEWGFDGHVVSDCWAIKDFHENH-LVTGTPVQSAALAMEAGC 272
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
DL+CG Y + + A Q+G + EA I + L+ LG FDGS +Y ++ +
Sbjct: 273 DLNCGVTYLHL-VHACQEGLVTEAQITEAAIRLFTTRFLLGMFDGS-EYDSVPYTVVECK 330
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H +L+ AAR+ IVLLKN NG LPL+ +KT+ ++GP+A++ KA+IGNY GT Y +
Sbjct: 331 EHRDLSERAARESIVLLKN-NGILPLDREKLKTIGIIGPNADSRKALIGNYHGTSSEYIT 389
Query: 426 PMDG---FYAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
++G I Y+ GC + + + + A A+ +D ++ GLD +
Sbjct: 390 VLEGVRRLVGDEVRILYSDGCHLYENKTENLAREQDRLSEARIVARESDVVILCLGLDET 449
Query: 477 VEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
+E E D+VDL LP Q L+ VA K P L +M+ +D++FA+ +
Sbjct: 450 LEGEEGDTGNSYASGDKVDLRLPKSQRMLMEAVA-MEKKPTVLCLMAGSDIDLSFAEKHF 508
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
LW YPG GG A AD++FGK +P G+LPIT+YE+ V + +R G
Sbjct: 509 DAIVDLW--YPGAYGGAAAADILFGKCSPSGKLPITFYESLEVLPSFEDYSMR------G 560
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
RTY++ + YPFGYGL+YT+ K + + +KD + T G N AA
Sbjct: 561 RTYRYLEQKAQYPFGYGLTYTKMKIRNVWLENA-----EKDMK----EVTDGENAE--AA 609
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAG 706
V++ C EVEN G MD EV+ +Y + T + G+ER+F+ G
Sbjct: 610 VIV----CA--------EVENCGGMDSQEVLQIYIRDTESEHETPHPHLAGFERIFVEKG 657
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V +N + +VD + SG + I G
Sbjct: 658 VKKLVKIPVNR-SAFTVVDESGRRFTDSGKYEIFAG 692
>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 794
Score = 407 bits (1046), Expect = e-110, Method: Compositional matrix adjust.
Identities = 267/735 (36%), Positives = 389/735 (52%), Gaps = 45/735 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD+ Y ERA+ L+ TL E + + GVPRLGLP Y+ W+EALHG+ R
Sbjct: 64 CDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLD---RANF 120
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
+ G F+ TSFP IL+ A+ N +L +I +ST+ARA N G GL ++PN
Sbjct: 121 ATKGGEFE----WGTSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGRYGLDVYAPN 176
Query: 135 INVVRDPRWGRVLETPGEDPYVV-GRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
IN R P WGR ETPGED V+ Y Y+ G+Q GV D LKI+A KH
Sbjct: 177 INGFRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG--GV------DPENLKIAATAKH 228
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+A YDL+N+ R FD+ +T+QD+ E + F S MC+YN VNG+P+C+
Sbjct: 229 FAGYDLENYNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCS 288
Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L +R W F +GY+ SDCD+I + H + N ++ A A LKAG D+DCG
Sbjct: 289 NSFFLQTLLRESWGFPEYGYVSSDCDAIYNVWNPHNYAN-SQSSAAADSLKAGTDIDCGQ 347
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELA 371
Y + G ++ +I+ S+ LY L+RLGYFD +Y++LG ++ ++
Sbjct: 348 TYPWHLNESFVAGTVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLGWKDVVKTDAWNIS 407
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
EAA +GIVLLKND G LPL + ++++AL+GP NAT+ + GNY GT SP+
Sbjct: 408 YEAAVEGIVLLKND-GTLPL-SKKVRSIALIGPWVNATEQLQGNYFGTAPYLISPLQAAK 465
Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPG 491
+NY G I Q + AI AAK +DA + + G+D ++E EG DR D+ PG
Sbjct: 466 KAGYEVNYELGTG-INNQTTAGFAKAIAAAKKSDAIIFIGGIDNTIEQEGADRTDIAWPG 524
Query: 492 FQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF 551
Q +LI ++++ K P+ ++ M G VD + K+N K+ S++W GYPG+ GG A+ D++
Sbjct: 525 NQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSIKSNKKVNSLVWGGYPGQSGGYALFDILS 583
Query: 552 GKYNPGGRLPITWYEANYV-KIPYTSMPLRP-VNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
GK P GRL T Y A YV + M LRP PG+TY ++ G VY FG GL YT
Sbjct: 584 GKRAPAGRLVSTQYPAEYVHQFAQNDMNLRPDGKKNPGQTYIWYTGKPVYQFGDGLFYTT 643
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
FK + K +K + Q +G P V FTF ++N
Sbjct: 644 FKETLG---KQSTLKFNASQ-------ILGAGHPGYTYSEQTPV------FTFTANIQNS 687
Query: 670 GKMDG--SEVVMVYSKPPGIAGTHIKQVIGYERV-FIAAGQSAKVGFTMNACKSLKIVDN 726
GK S + V + G K ++G++R+ I G S+ + + +L VD+
Sbjct: 688 GKTASPYSAMAFVRTSNAGPKPYPNKWLVGFDRLATIKPGHSSTLSIPI-PLNALSRVDS 746
Query: 727 AANSLLASGAHTILV 741
N ++ G + +++
Sbjct: 747 NGNKIVYPGKYELVL 761
>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
Length = 732
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 263/757 (34%), Positives = 396/757 (52%), Gaps = 99/757 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+ + +L + RA+ LV MT+ EK+ Q+ +PRL +P Y WW+EALHG++ G+
Sbjct: 30 WFNPELSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVPQYNWWNEALHGIARNGK-- 87
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY----NLGN---- 125
AT FP I A+F+ L +++ +S EARA Y ++GN
Sbjct: 88 --------------ATIFPQAIGLGATFDPELAQEVANAISDEARAKYAIAQSIGNQGQY 133
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
AGLTFW+PN+N+ RDPRWGR ET GEDP + + +V+GLQ D + L
Sbjct: 134 AGLTFWTPNVNIFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQG---------DDPKYL 184
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K + KH+A + + R FD +++D+ ET++ FE V + V+ VMC+YN
Sbjct: 185 KSAGVAKHFAVHSGPE---SLRHQFDVEPSKKDLYETYLPAFEALVTQAKVAGVMCAYNG 241
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
V G P+CA LL + ++ W F+GY+VSDC ++ HK ++ E A A L+AG+
Sbjct: 242 VYGQPSCASEFLLGEMLKKKWQFNGYVVSDCGALHDFHSGHKVTHNRVESA-ALALRAGV 300
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNIC 363
DL+CG Y A ++G I ++ ID L+ L ++ RLG FD S + +G+ I
Sbjct: 301 DLNCGFTYEKSLKAAFEEGLITQSLIDQRLKNLLMIRFRLGLFDPSELNPHNAIGQEVIH 360
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+ +HIELA + A + IVLLKN+ LPL+ +IK + GP A ++ ++GNY G
Sbjct: 361 SLEHIELARKVAAKSIVLLKNEKQVLPLSK-DIKVPYVTGPFAASSDMLMGNYYGISDSL 419
Query: 424 TSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE-- 478
+ ++G + +NY G N + A + AK ADA + V G+ +E
Sbjct: 420 VTVLEGIAGKVSLGSSLNYRAGALPFHSNINPL-NWAPEVAKTADAVIAVVGISADMEGE 478
Query: 479 -------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
A+ DRV + LP Q + + ++A+ KGP+ LV+ + VDI ++ +P +
Sbjct: 479 EVDAIASADRGDRVAITLPQNQVDYVKQLAENKKGPLILVVAAGSPVDI--SELDPLADA 536
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF--PGRT 589
ILW+ YPGE+GG A+ADVIFG NP G LP+T +VK T L P +++ GRT
Sbjct: 537 ILWIWYPGEQGGNAVADVIFGDTNPSGHLPLT-----FVK---TIDDLPPFDDYTMTGRT 588
Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
YKF +YPFG+GLSYTQFK+ S K Q+ +IN +V
Sbjct: 589 YKFLKKLPLYPFGFGLSYTQFKFGKLSLSKRA------PQEGENINISV----------- 631
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQS 708
EVEN +DG VV VY P + I + ++RV I A +
Sbjct: 632 ---------------EVENSTALDGETVVQVYLSPQVPLKNEAITNLKAFKRVHIGAYEK 676
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
+ FT+ K+L V++A ++ SGA+T+ VG+ +
Sbjct: 677 RLIEFTIEG-KNLYRVNDAGENVWPSGAYTLAVGDSL 712
>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 758
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 273/755 (36%), Positives = 391/755 (51%), Gaps = 70/755 (9%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDL------AYGVPRLGLPLYEWWSEALHGVSF 68
CD P+RA LVE M + EK+ + + + G PRLGLP YEWWSEALHGV+
Sbjct: 11 CDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWWSEALHGVA- 69
Query: 69 IGRRTNSPPGTHFD---SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
+ PG F+ ATSF I +A+F++ L +K+ +STEARA N G+
Sbjct: 70 ------ASPGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFANAGS 123
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
AGL FW+PNIN RDPRWGR ETPGEDP + Y + +RGL+ E ++
Sbjct: 124 AGLDFWTPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGLEGEESIK---------- 173
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K+ A CKHYAAYDL+ W R+ FD+ V+ QD+ E ++ PF+ C + V S+MCSYN
Sbjct: 174 KVIATCKHYAAYDLERWHNITRYEFDAIVSLQDLSEYYLPPFQQCARDSKVGSIMCSYNS 233
Query: 246 VNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIV-ESHKFLNDTKEDAVARVL 301
+NG P CA+ L++ +R W + + YI SDC++I+ + + H F E A A
Sbjct: 234 LNGTPACANTYLMDDILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFTQTAAEAAAAAYT 293
Query: 302 KAGL---DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYK 355
++ YT+ +GA Q ++E ID +LR LY L+R GYFD SP Y+
Sbjct: 294 AGTDTVCEVAGSPPYTD-VVGAYDQKLLSEEVIDRALRRLYEGLVRAGYFDPASASP-YR 351
Query: 356 NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
++G +++ + LA ++A G+VLLKND G LP+ KT+AL+G A+ T++M+G
Sbjct: 352 DIGWSDVNTAEAQALALQSASDGLVLLKND-GTLPIKLEG-KTVALIGHWASGTRSMLGG 409
Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPG-CADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
Y G P Y SP+ + YA G A ++ A+ AA +D + GLD
Sbjct: 410 YSGIPPYYHSPVYAAGQLNLTYKYASGPVAPASAARDTWTADALSAANKSDVILYFGGLD 469
Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG-AVDINFAKNNPKIKSIL 533
SV +E KDR + P Q LI +A K LV++ G VD NP + +IL
Sbjct: 470 QSVASEDKDRDSIAWPPAQLTLIQTLAGLGK---PLVVIQLGDQVDDTPLLTNPNVSAIL 526
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTY 590
W GYPG+ GG A+ + I G P GRLP+T Y ++Y ++P T M LR P + PGRTY
Sbjct: 527 WAGYPGQSGGTAVLNAITGVSPPAGRLPVTQYPSSYTSQLPLTDMSLRPDPASGRPGRTY 586
Query: 591 KFF-DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
++ V PFGYGL YT F + ++ T PC
Sbjct: 587 RWLPRNATVLPFGYGLHYTNFT--------------ARPNPAQNFTLTPSALLAPCKLAH 632
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVF-IAAG 706
D C + +EV N G V +V+ ++ G +K ++ Y R+ IA G
Sbjct: 633 RD--LCP-LPYPVTVEVTNTGARTSDYVGLVFATTRDAGPPPHPLKTLVAYARLRGIAPG 689
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
++A+ + A L VD A N +L G + ++
Sbjct: 690 RTARAQVQV-ALGDLARVDAAGNRVLYPGRYGFVL 723
>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
Length = 847
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 265/763 (34%), Positives = 397/763 (52%), Gaps = 75/763 (9%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI-GRRT 73
CD ERA LV+ M L EK++ + + + G PR+GLP YEWWSEALHGV+ G
Sbjct: 96 CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAKSPGVTF 155
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
N G F S ATSF I+ +A+F++ L + + +STEARA N G AGL +W+P
Sbjct: 156 NKSSGAAFSS----ATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTP 211
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN +DPRWGR +ETPGED + +Y +RGL+ SD K+ A CKH
Sbjct: 212 NINPYKDPRWGRGMETPGEDALRISKYVKALLRGLE---------GSDPTTRKMVANCKH 262
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV------- 246
YAA DL+ W G R++FD+ VT QD+ E ++ F+ C + +V S MC+YN +
Sbjct: 263 YAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKGKDL 322
Query: 247 --NGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
NG P CA L+N +R W + + +I SDC+++ + H + +DT+E+A
Sbjct: 323 SWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHW-SDTREEAAGSAY 381
Query: 302 KAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLG 358
AG D C +Y GA +G + E +D +L+ LY L+R GYFDG Y+N+
Sbjct: 382 TAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDAPYRNIT 441
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNI----KTLALVGPHANATKAMIG 414
++ P+ +LA +A +G+VL KN NG LP+ + KT+AL+G + + M+G
Sbjct: 442 WADVNTPEARKLAHRSAVEGMVLTKN-NGVLPIKLEELQKKGKTVALIGNWVDNGEQMLG 500
Query: 415 NYEGTPCRYTSPMDGFYAYS-KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
Y G +P+ A + K++ +S A++AA AD + G+
Sbjct: 501 TYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYFGGI 560
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
DLSVEAE +DR L P Q +L++ + +A G T+V+ +D +N I +I+
Sbjct: 561 DLSVEAEDRDRYSLAWPSAQAKLLSDI--SALGKPTVVVQLGTMLDDTALLDNKNISAII 618
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF------- 585
W GYPG++GG A D+I GK P GRLP+T Y A Y ++P T M +RP +
Sbjct: 619 WAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGAASN 678
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
PGRTY+++D V+PFG+GL +T F VA S S D + C+ +
Sbjct: 679 PGRTYRWYD-EAVHPFGFGLHFTNFTTSVAVSSSSAISTSDLESGCKSEKH--------- 728
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYE 699
ID KC + + ++ V N DG Y+ + G + +K ++ Y
Sbjct: 729 ----ID--KCS-FPSSLEVSVTN----DGKSTTSSYAALAFVRGEYGPKPYPLKTLVAYG 777
Query: 700 RVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
++ IA GQ+ KV + + +N + +L G + +LV
Sbjct: 778 KLHDIAPGQTKKVKLELTLGDLARTAEN-GDLVLYPGKYEVLV 819
>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 792
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 265/763 (34%), Positives = 397/763 (52%), Gaps = 75/763 (9%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI-GRRT 73
CD ERA LV+ M L EK++ + + + G PR+GLP YEWWSEALHGV+ G
Sbjct: 41 CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAKSPGVTF 100
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
N G F S ATSF I+ +A+F++ L + + +STEARA N G AGL +W+P
Sbjct: 101 NKSSGAAFSS----ATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTP 156
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN +DPRWGR +ETPGED + +Y +RGL+ SD K+ A CKH
Sbjct: 157 NINPYKDPRWGRGMETPGEDALRISKYVKALLRGLE---------GSDPTTRKMVANCKH 207
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV------- 246
YAA DL+ W G R++FD+ VT QD+ E ++ F+ C + +V S MC+YN +
Sbjct: 208 YAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKGKDL 267
Query: 247 --NGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
NG P CA L+N +R W + + +I SDC+++ + H + +DT+E+A
Sbjct: 268 SWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHW-SDTREEAAGSAY 326
Query: 302 KAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLG 358
AG D C +Y GA +G + E +D +L+ LY L+R GYFDG Y+N+
Sbjct: 327 TAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDAPYRNIT 386
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNI----KTLALVGPHANATKAMIG 414
++ P+ +LA +A +G+VL KN NG LP+ + KT+AL+G + + M+G
Sbjct: 387 WADVNTPEARKLAHRSAVEGMVLTKN-NGVLPIKLEELQKKGKTVALIGNWVDNGEQMLG 445
Query: 415 NYEGTPCRYTSPMDGFYAYS-KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
Y G +P+ A + K++ +S A++AA AD + G+
Sbjct: 446 TYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYFGGI 505
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
DLSVEAE +DR L P Q +L++ + +A G T+V+ +D +N I +I+
Sbjct: 506 DLSVEAEDRDRYSLAWPSAQAKLLSDI--SALGKPTVVVQLGTMLDDTALLDNKNISAII 563
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF------- 585
W GYPG++GG A D+I GK P GRLP+T Y A Y ++P T M +RP +
Sbjct: 564 WAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGAASN 623
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
PGRTY+++D V+PFG+GL +T F VA S S D + C+ +
Sbjct: 624 PGRTYRWYD-EAVHPFGFGLHFTNFTTSVAVSSSSAISTSDLESGCKSEKH--------- 673
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQVIGYE 699
ID KC + + ++ V N DG Y+ + G + +K ++ Y
Sbjct: 674 ----ID--KCS-FPSSLEVSVTN----DGKSTTSSYAALAFVRGEYGPKPYPLKTLVAYG 722
Query: 700 RVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
++ IA GQ+ KV + + +N + +L G + +LV
Sbjct: 723 KLHDIAPGQTKKVKLELTLGDLARTAEN-GDLVLYPGKYEVLV 764
>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Subdoligranulum variabile DSM 15176]
Length = 717
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 231/639 (36%), Positives = 343/639 (53%), Gaps = 73/639 (11%)
Query: 21 YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
Y ERA+ LV +MTL EK+ QM A +PRLG+P Y WW+E +HGV G
Sbjct: 11 YRERARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGVGRAGT--------- 61
Query: 81 FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWS 132
AT FP I ASF+E L ++G+ V EAR YN+ + GLT W+
Sbjct: 62 -------ATVFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWA 114
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PN+N+ RDPRWGR ET GEDPY+ R + +V G+Q D L+ +AC K
Sbjct: 115 PNVNIFRDPRWGRGHETYGEDPYLTSRLGVRFVEGMQG---------DDPDYLRAAACAK 165
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+A + + R +FD++V++QD+ ET++ F V E V +VM +YNR NG P C
Sbjct: 166 HFAVHSGPE---DQRHYFDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAYNRTNGEPCC 222
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
LL +RG WNF G++ SDC +I+ E H + D+VA + G DL+CGD
Sbjct: 223 GSKTLLVDILRGKWNFQGHVTSDCWAIKDFHEGH-MVTSGPVDSVALAVNNGCDLNCGDL 281
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIEL 370
Y + AV +GK+ E ID SL L+ M+LG FD + Y +G + + + + L
Sbjct: 282 YA-YLEEAVAEGKVKEETIDRSLVRLFTTRMKLGMFDAEEKVPYNKIGYDAVDSREMQAL 340
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
E A + +VLLKN+N LPL+ + +A+VGP+A+ KA++GNYEGT RY + +DG
Sbjct: 341 NLEVAEKILVLLKNENHTLPLDKSKLHRVAVVGPNADNRKALVGNYEGTASRYVTVLDGI 400
Query: 431 YAY---SKVINYAPGC---ADIV---CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE- 480
Y + Y+ GC AD + ++N +I D + GLD +E E
Sbjct: 401 QEYLGEDVQVRYSEGCHLYADKIQGLAKSNELISEVRGVCAECDVVICCLGLDAGLEGEE 460
Query: 481 --------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
D+ L LPG Q ++ ++ K PV +V++S A+ + A+ ++
Sbjct: 461 GDQGNQFASGDKQSLSLPGNQESVLKACIESGK-PVVVVVLSGSALALGTAQEGA--AAV 517
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
L YPG +GGRA+A +FG+ NP G+LP+T+Y ++ +T ++ GRTY++
Sbjct: 518 LQAWYPGAQGGRAVARALFGECNPQGKLPVTFYHSDEDLPAFTDYAMK------GRTYRY 571
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASS------PKSVDIKL 625
+ +YPFGYGLSY+ F ++ A + P VD+++
Sbjct: 572 MEKEPLYPFGYGLSYSHFTFRDAKADAAQIGPDGVDVRV 610
>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
Length = 756
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 260/724 (35%), Positives = 380/724 (52%), Gaps = 58/724 (8%)
Query: 32 MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSF 91
M EK++ + + GV RLGLP Y WW EALHGV+ PG +F ATSF
Sbjct: 52 MQTQEKLENLVSKSKGVARLGLPAYNWWGEALHGVA-------GAPGINFTGSYRTATSF 104
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPG 151
P +L +A+F++ L +I + EARA N G A + FW+P+IN RDPRWGR ETPG
Sbjct: 105 PMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPVDFWTPDINPFRDPRWGRGSETPG 164
Query: 152 EDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD 211
ED + Y + + GL EG + R KI A CKHY YD++NW G DR HFD
Sbjct: 165 EDILRIKGYTKSLLSGL---EGDKAQR-------KIIATCKHYVGYDVENWNGTDRHHFD 214
Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF--- 268
+++T QD+ E F+ PF+ C + V S MCSYN VNG+PTCAD +L +R WN+
Sbjct: 215 AKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDS 274
Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAE 328
+ YI SDC++++ I HK++ T ++A A G+DL C T+ GA QG +
Sbjct: 275 NNYITSDCEAVKDISLRHKYVA-TLQEATAIAFNNGMDLSCEYSGTSDIPGAFSQGLLNV 333
Query: 329 ADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNG 387
+ ID +L Y L+ GYFDG + Y +LG +I P+ +L + A +G+ LLKND+
Sbjct: 334 SVIDRALTRQYEGLVHAGYFDGAAATYAHLGVQDINTPEAQKLVLQVAAEGLTLLKNDD- 392
Query: 388 ALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV-INYAPGCADI 446
LPL+ + +A+VG AN T + G Y G +P+ YA +K+ ++ A I
Sbjct: 393 TLPLSLKSGSKVAMVGFWANTTSKLSGIYSGPAPYLHTPV---YAGNKLGLDMAVATGPI 449
Query: 447 VCQN---NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA 503
+ + ++ A++AAK +D + GLD S AEG DR D+ P Q +LI K+ A
Sbjct: 450 LQTSGAADNWTTTALNAAKKSDFILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKL--A 507
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
A G +VI VD + S++W +PG++GG A+ VI G++ GRLPIT
Sbjct: 508 ALGKPLVVIALGDMVDHTPILKMKGVNSLIWANWPGQDGGTAVMQVITGEHAIAGRLPIT 567
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPK-SVD 622
Y A Y ++ M +RP N PGRTY++++ V PFG+GL YT+F K SS +V+
Sbjct: 568 QYPAEYTQLSMLDMNMRPGGNNPGRTYRWYN-ESVQPFGFGLHYTKFAAKFGSSSGLTVN 626
Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
I+ DI + + P V ++ V N G + + +
Sbjct: 627 IQ--------DIMKSCTKDHPDLCDVP-----------PIEVAVTNEGNRTSDFIALAFI 667
Query: 683 KPPGIAGTH---IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
K G G +K ++ Y R+ +G K+ +L VD + N + G +T+
Sbjct: 668 K--GEVGPKPYPLKTLVSYARLRDISGSQTKMASLALTLGALSRVDQSGNLVAYPGEYTL 725
Query: 740 LVGE 743
L+ E
Sbjct: 726 LLDE 729
>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
Length = 788
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 257/689 (37%), Positives = 373/689 (54%), Gaps = 47/689 (6%)
Query: 28 LVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPG 87
L+ TL E + G+ GV RLGLP Y+ WSEALHG+ R S G++
Sbjct: 66 LISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD---RANFSDSGSY-----NW 117
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVL 147
ATSFP ILTTA+ N +L +I +ST+ RA N G GL ++PNIN R P WGR
Sbjct: 118 ATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPNINTFRHPVWGRGQ 177
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
ETPGED + YA Y+ G+Q D DS LK++A KHYA YD++NW + R
Sbjct: 178 ETPGEDVSLAAVYAYEYITGIQG-------PDPDSN-LKLAATAKHYAGYDIENWHNHSR 229
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D +T+QD+ E + F + + V SVMC+YN VNG+P CAD L +R +
Sbjct: 230 LGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFG 289
Query: 268 F--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGK 325
F HGY+ SDCD+ I H + + ++ A A + AG D+DCG Y ++ G
Sbjct: 290 FVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTTYQWHLNESITAGD 348
Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQHIELAAEAARQGIV 380
++ DI+ + LY L++ GYFD + Y++L +++ ++ +AA QGIV
Sbjct: 349 LSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIV 408
Query: 381 LLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV 436
LLKN N LPL + T+AL+GP ANAT ++GNY G SP F
Sbjct: 409 LLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYK 468
Query: 437 INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTEL 496
+N+A G I + S AA+ AA++AD + G+D ++EAE DR + PG Q +L
Sbjct: 469 VNFAEGTG-ISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDL 527
Query: 497 INKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYN 555
I K+A +A P+ ++ M G VD + KNN + ++LW GYPG+ GG A+ D+I GK N
Sbjct: 528 IQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKN 587
Query: 556 PGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
P GRL T Y A+Y + P T M LRP + PG+TYK++ G VY FG+GL YT F +
Sbjct: 588 PAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEFGHGLFYTTFA-ES 646
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
+S+ + ++KL+ +DI + A++ V F ++N GK++
Sbjct: 647 SSNTTTKEVKLN----IQDI---LSQTHEELASITQLPV------LNFTANIKNTGKLES 693
Query: 675 SEVVMVYSKPP--GIAGTHIKQVIGYERV 701
MV++ G A +K ++G++R+
Sbjct: 694 DYTAMVFANTSDAGPAPYPVKWLVGWDRL 722
>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
Length = 785
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 251/693 (36%), Positives = 376/693 (54%), Gaps = 42/693 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD +RA L MTL E + G+ +PRLGLP Y+ W+EALHG+ ++ T
Sbjct: 62 CDRTASAHDRAAALTSMMTLEELMNSTGNRIPAIPRLGLPPYQIWNEALHGL-YLANFTE 120
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
S P + +TSFP+ ILT A+ N +L +I Q ++T+ RA N G GL +SPN
Sbjct: 121 SGPFSW-------STSFPSPILTMATLNRTLIHQIAQIIATQGRAFNNAGRYGLNAFSPN 173
Query: 135 INVVRDPRWGRVLETPGEDPYVV-GRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
IN R P WGR ETPGED + YA Y+ GLQ ++ KI A KH
Sbjct: 174 INAFRHPVWGRGQETPGEDANCLCSAYAYEYITGLQ----------GNATNPKIIATAKH 223
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
YA YD++NW RF D +T+QD+ E F F + V + V SVM SYN VNG+P+ A
Sbjct: 224 YAGYDIENWRQRSRFGNDLNITQQDLAEYFTPQFVVAVRDAQVRSVMPSYNAVNGVPSSA 283
Query: 254 DPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ LL +R W F GY+ SDCD++ + H + + A A L+AG D+DCG
Sbjct: 284 NTFLLQTLVRDSWGFIQDGYMASDCDAVYNVFNPHGYAANLSS-ASAMSLRAGTDIDCGI 342
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNPQHIEL 370
Y ++ QG+I+ ++I+ ++ Y L+ GYFDG Y++L +++ +
Sbjct: 343 SYLTTLNESLTQGQISRSEIERAVTRFYSNLVSAGYFDGPDAPYRDLSWSDVVRTNRWNV 402
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A EAA G+VLLKND G LPL + +++ +AL+GP ANAT+ M GNY G TSP+
Sbjct: 403 AYEAAVAGVVLLKND-GVLPL-SKSVQRVALIGPWANATEQMQGNYHGVAPYLTSPLAAV 460
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
A +NYA G +I + AA+ AA+ +D + G+D ++EAE DR ++ P
Sbjct: 461 QASGLEVNYAFGT-NITSNVTNCFAAALAAAEKSDIIIFAGGIDNTLEAEELDRANITWP 519
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q ELI+++ + K P+ ++ M G VD + K + K+ ++LW GYPG+ GG+A+ D++
Sbjct: 520 GNQLELIHRLGELGK-PLVVLQMGGGQVDSSALKASEKVGALLWGGYPGQAGGQALWDIL 578
Query: 551 FGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
G+ P GRL T Y A Y ++ P T M LRP + PG+TY ++ G VY FG+GL YT
Sbjct: 579 TGQRAPAGRLTTTQYPAEYALQFPATDMSLRPRGDNPGQTYMWYTGEPVYAFGHGLFYTT 638
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F +A + + ++ DI + +P L++ + F ++V N
Sbjct: 639 FATALAGPGQ-------EPERSFDIGALLA--RPHAGYNLVEQLPF----LNFTVKVTNT 685
Query: 670 GKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERV 701
G++ M ++ H K ++G++R+
Sbjct: 686 GEVISDYTAMAFANTTAGPRPHPNKWLVGFDRI 718
>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
Length = 792
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 260/753 (34%), Positives = 388/753 (51%), Gaps = 45/753 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA L TL E V G++ VPRLGLP Y+ WSEALHG+
Sbjct: 56 LSKTIVCDTTAKPHDRAAALTSMFTLEELVNSTGNVIPAVPRLGLPPYQVWSEALHGLD- 114
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
R N + ATSFP+ IL A+ N +L +IG+ +ST+ RA N G GL
Sbjct: 115 ---RANLTESGDYS----WATSFPSPILIMAALNRTLINQIGEIISTQGRAFNNGGRYGL 167
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
++PNIN R P WGR ETPGED + Y + Y+ G+Q + R LK++
Sbjct: 168 DVYAPNINSFRHPVWGRGQETPGEDVQLCSIYGVEYITGIQG--------GLNPRDLKLA 219
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A KH+A YDL+NW + R + ++ D+ + F V + V SVM SYN VNG
Sbjct: 220 ATAKHFAGYDLENWGNHSRLGNNVAISSFDLASYYTPQFITAVRDARVHSVMSSYNAVNG 279
Query: 249 IPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
+P+ A+ LL +R WNF GY+ SDCD++ + H + + A + +AG D
Sbjct: 280 VPSSANSFLLQTLLRETWNFVEDGYVSSDCDAVFNVFNPHGYASSASLAAAKSI-QAGTD 338
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNP 365
+DCG Y + ++ +I+ ++I+ ++ Y L+ LGYFDG + +Y++L ++
Sbjct: 339 IDCGATYQLYLNESLSHDEISRSEIERAVTRFYSTLVSLGYFDGDNSKYRHLHWPDVVAT 398
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
++ EAA +GIVLLKND G LPL + N +++AL+GP AN T + GNY G T
Sbjct: 399 DAWNISYEAAVEGIVLLKND-GTLPL-SNNTRSVALIGPWANVTTTLQGNYYGAAPYLTG 456
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ A + +NYA G +I + S AA+ AA ++ + G+D +VEAEG DR
Sbjct: 457 PLAALQASNLDVNYAFGT-NISSDSTSGFEAALSAAGKSEVIIFAGGIDNTVEAEGVDRE 515
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
+ PG Q +LI +++ K P+ ++ M G VD + K N + S++W GYPG+ GG A
Sbjct: 516 SITWPGNQLQLIEQLSKLGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPA 574
Query: 546 IADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
I D++ GK P GRL +T Y A Y ++ P T M LRP N PG+TY ++ G VY FG+G
Sbjct: 575 ILDILTGKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGNNPGQTYMWYTGKPVYEFGHG 634
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
L YT FK +A + + ++P +++ + +Y +
Sbjct: 635 LFYTTFKVSLA--------HFHGAENGTSFDIVQLLSRPNAGYSVVEQIPFINYT----V 682
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
EV N G + M + H K ++G++R+ G S + TM +L
Sbjct: 683 EVMNTGNVTSDYTAMAFVNTKAGPSPHPNKWLVGFDRL---GGISPRTTQTMTIPITLDN 739
Query: 724 V---DNAANSLLASGAHTI-LVGEGVGGVSFPL 752
V D N ++ G + + L E +SF L
Sbjct: 740 VARTDERGNRIVYPGKYELTLNNERSAVLSFTL 772
>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
Length = 782
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 257/759 (33%), Positives = 395/759 (52%), Gaps = 66/759 (8%)
Query: 9 LSDFP-YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
LS P CD +P+ RA LV + T E + + A GVPRLG+P Y+WW+EALHGV+
Sbjct: 30 LSKIPDICDPTIPFYTRATSLVNQFTTEELLNNTINYAPGVPRLGIPNYQWWTEALHGVA 89
Query: 68 FIGRRTNSPPGTHFD-----SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
PG +FD +E AT FP I A+F++ L+++I +++E RA N
Sbjct: 90 -------KSPGVNFDLSDPHAEFTSATQFPQTINLGATFDDDLYQQIASVIASEVRAYNN 142
Query: 123 LGNAGLTFWSP-NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
G AGL +SP NIN RDPRWGR ET GEDP + R+A++ V GLQ G +++
Sbjct: 143 AGKAGLNLYSPLNINCFRDPRWGRGQETVGEDPLHMSRFAVSIVHGLQ---GPHAQNEAE 199
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
L ++A CKH+ AYDL+ ++ +R+ FD+ V++QD+ + + F CV +G +++M
Sbjct: 200 GNKLTVAATCKHFLAYDLEQYDRGERYQFDAIVSKQDLSDFHLPQFRACVRDGGATTLMT 259
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
SYN VN +P A L R W H Y+ SDCD++ + + H++ + E A A
Sbjct: 260 SYNAVNNVPPSASKYYLQTLARQAWGLDKTHNYVTSDCDAVANVYDGHRYAQNYVE-AAA 318
Query: 299 RVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKN 356
+ + AG DLDCG Y+ A++Q A I ++ +Y L+RLGYFD S +
Sbjct: 319 KSINAGTDLDCGATYSENLGAALKQKLTDIATIRRAVIRMYASLVRLGYFDDPASQPLRQ 378
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
L ++ +P LA +A I LLKN + LP+ K +A++GP+ N + + GNY
Sbjct: 379 LTWKDVNSPSSQRLAYTSALSSITLLKNLDSTLPIKQKPTK-IAIIGPYTNVSTSFSGNY 437
Query: 417 EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNS-----MIPA-AIDAAK---NADAT 467
G P + M +A S+V A IV N + IP+ A DA K +AD+
Sbjct: 438 AG-PAAFNMTM--VHAASQVFP----DAKIVWVNGTDISGPYIPSDAQDAVKLTSDADSV 490
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA----AKGPVTLVIMSAGAVDINFA 523
V G+D S+E E DR D+ P Q LI++++ + K + +V G +D
Sbjct: 491 VFAGGIDASIERESHDRKDIAWPPNQLRLIHELSQSRKKDKKSKLVVVQFGGGQLDGASL 550
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPV 582
K++ + +++W GYPG+ A+ D++ GK P GRLP+T Y A+Y+ +P ++M LRP
Sbjct: 551 KSDDAVGALVWAGYPGQSASLAVWDILAGKAVPAGRLPVTQYPASYIDGLPESAMSLRPK 610
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
+PGRTYK++ G YPFG+GL YT F +A Q I T
Sbjct: 611 AGYPGRTYKWYKGVPTYPFGHGLHYTTFSASLAKP------------QPYAIPTTPAAKG 658
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERV 701
P V + + D Q ++N GK+ +++++ G A K ++GY +V
Sbjct: 659 P--EGVHAEHISVAD----VQANIKNTGKVASDYTALLFARHSNGPAPYPRKTLVGYTKV 712
Query: 702 F-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
++AG+ + V + +L D N L G++ +
Sbjct: 713 KNLSAGEESSVTIKITQA-ALARADEEGNQFLYPGSYQL 750
>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 1425
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 257/756 (33%), Positives = 381/756 (50%), Gaps = 97/756 (12%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ + +L +R DLV R+TL EKV+QM + A + RLG+P Y WW+E LHGV GR
Sbjct: 712 YPFRNPQLSIEQRVDDLVSRLTLEEKVRQMLNNAPAIKRLGIPAYNWWNECLHGV---GR 768
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
T FP I AS+N+ L K++ +++ E RA+YN
Sbjct: 769 TKYH------------VTVFPQAIGMAASWNDVLMKEVASSIADEGRAIYNDAQKRGDYS 816
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LT+W+PNIN+ RDPRWGR ET GEDPY+ + +V GLQ D R
Sbjct: 817 QYHALTYWTPNINIFRDPRWGRGQETYGEDPYLTSKIGKAFVLGLQG---------DDPR 867
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK SAC KHYA + +R F+S V+ D+ +T++ F V + +VS VMC+Y
Sbjct: 868 YLKASACAKHYAVHSGPE---KNRHSFNSDVSTYDLWDTYLPAFRTLVVDANVSGVMCAY 924
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N G P C + L+ +R WNF GY+ SDC +I I HK D A V
Sbjct: 925 NAFKGQPCCGNDLLMQSILRDKWNFKGYVTSDCGAIDDIFNHHKAHPDAATAAADAVFH- 983
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
G DLDCG + AV+ G I E +D S++ L+ + RLG FD + Q Y ++ +
Sbjct: 984 GTDLDCGQSAYLALVKAVKNGIITEKQLDVSVKRLFTIRFRLGLFDPAEQVDYAHIPISV 1043
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ +H +LA + AR+ +VLLKND LPL +K + ++GP+A+ A++GNY G P
Sbjct: 1044 LECKKHQDLAKQLARESMVLLKNDR-LLPLQKNKLKKVVVMGPNADCKDALLGNYNGHPS 1102
Query: 422 RYTSPMDGFYAYSKVIN---YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
R +P+ K + Y G I + + ++ AK ADA + + G+ +E
Sbjct: 1103 RMLTPLQAIRERLKGVAEVVYVSGIDYINTVSEDELKRYVNQAKGADAVIFIGGISPRLE 1162
Query: 479 AE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINF-AKNNP 527
E G DR + LP QT+L+ + A + P V+M+ A+ I + AK+ P
Sbjct: 1163 GEEMSVNKDGFDGGDRTSIALPTVQTQLMKALV-AGRIPTVFVMMTGSALAIPWEAKHVP 1221
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
+IL Y G+ GG AIADV+FG YNP G+LP+T+Y + + +P + G
Sbjct: 1222 ---AILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKD------SDLPDFESYDMQG 1272
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
RTY++F G +YPFGYGLSYT F+Y P + +
Sbjct: 1273 RTYRYFKGKALYPFGYGLSYTDFRYSSLKMPTACN------------------------- 1307
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAG 706
D + + V+N GKMDG EVV +Y S P + + G++R+++ AG
Sbjct: 1308 -------TTDKEIPVTVTVKNTGKMDGEEVVQLYVSHPDKKILVPVTALKGFKRIYLKAG 1360
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++ ++ F++++ + L VD + G I VG
Sbjct: 1361 EAKQITFSLSS-EDLSCVDENGIRKVLPGTVKIQVG 1395
>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
Length = 804
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 261/702 (37%), Positives = 377/702 (53%), Gaps = 47/702 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD +RA L+ TL E + G+ GV RLGLP+Y+ WSEALHG+ R
Sbjct: 69 CDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPVYQVWSEALHGLD---RANF 125
Query: 75 SPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPN 134
S G++ ATSFP ILTTA+ N +L +I +ST+ RA N G GL ++PN
Sbjct: 126 SDSGSY-----NWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPN 180
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN R P GR ETPGED + YA Y+ G+Q D DS LK++A KHY
Sbjct: 181 INTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQG-------PDPDSN-LKLAATAKHY 232
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A YD++NW + R D +T+QD+ E + F + + V SVMC+YN VNG+P CAD
Sbjct: 233 AGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACAD 292
Query: 255 PKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
L +R + F HGY+ SDCD+ I H + + ++ A A + AG D+DCG
Sbjct: 293 SYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTT 351
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-----YKNLGKNNICNPQH 367
Y ++ G ++ DI+ + LY L++ GYFD + Y++L +++
Sbjct: 352 YQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDA 411
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTG----NIKTLALVGPHANATKAMIGNYEGTPCRY 423
++ +AA QGIVLLKN N LPL + T+AL+GP ANAT ++GNY G
Sbjct: 412 WNISYQAATQGIVLLKNSNKVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYM 471
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
SP F +N+A I N S AA+ AA++AD + G+D ++EAE D
Sbjct: 472 ISPRVAFEEAGYNVNFAERTG-ISSTNTSGFAAALSAAQSADVIIYAGGIDNTLEAEALD 530
Query: 484 RVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R + PG Q +LI K+A +A P+ ++ M G VD + KNN + ++LW GYPG+ G
Sbjct: 531 RESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSG 590
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
G A+ D+I GK NP GRL T Y A+Y + P T M LRP + PG+TYK++ G VY F
Sbjct: 591 GFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDNPGQTYKWYTGEAVYEF 650
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GL YT F + +S+ + +IKL+ +DI + A++ V
Sbjct: 651 GHGLFYTTFA-ESSSNTTTREIKLN----IQDI---LSQTHEDLASITQLPV------LN 696
Query: 662 FQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERV 701
F ++N GK++ MV++ G A +K ++G++R+
Sbjct: 697 FTANIKNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRL 738
>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
Length = 759
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 262/747 (35%), Positives = 388/747 (51%), Gaps = 65/747 (8%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L Y RA LV T E + + A GVPRLG+P Y+WW+EALHGV+
Sbjct: 30 LSANAVCDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWWTEALHGVA- 88
Query: 69 IGRRTNSPPGTHF--DSEVP--GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
PG +F D E P AT+FP +I A+F+++L++++ ++ E RA N G
Sbjct: 89 ------GSPGVNFADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFNNAG 142
Query: 125 NAGLTFWSP-NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
AGL +SP NIN RDPRWGR ET GEDP + RYA+ V+GLQ E
Sbjct: 143 KAGLNMYSPLNINCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGPNQDE-------- 194
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
L+++A CKHY AYDL+ W+G +R+ FD++V+ Q++ E ++ F CV +G ++M SY
Sbjct: 195 -LRLAATCKHYLAYDLEKWDGVERYQFDAQVSRQELAEFYLPQFRACVRDGKAVTLMTSY 253
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
N VN +P A L R +W H Y+ SDCD++ + + H + D+ A A
Sbjct: 254 NAVNNVPPSASRYYLETLARKEWGLDKKHNYVTSDCDAVANVFDGHHYA-DSYVQAAADS 312
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNL 357
+ AG DL+CG Y++ A++Q I T++ +Y +RLG FD G P + L
Sbjct: 313 INAGTDLNCGATYSDNLGQALEQNLTDVETIRTAVARMYASQVRLGLFDPKQGQP-LREL 371
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
G ++ +LA +A + LLKN NG LP++ G K +A++GP++NAT A+ GNY
Sbjct: 372 GWEHVNTKAAQDLAYSSAAASVTLLKN-NGTLPVD-GATK-VAVIGPYSNATFALRGNYA 428
Query: 418 GT-PCRYTSPMDGFYAYSK-VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
G P T +S+ I+ A G N++ AA+ AK AD + G+D
Sbjct: 429 GPGPFAITMTEAAQRVFSQATISSANGTTISGTYNHTDAEAAMQLAKEADLVIFAGGIDP 488
Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
++E+E DR + P Q +LI+ + AK + +V G +D K + I ++LW
Sbjct: 489 TIESEELDRATIAWPPNQLQLIHALGGMAK-KMAVVQFGGGQIDGASIKADGNIGALLWA 547
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRTYKFFD 594
GYPG+ G A+ DVI G P GRLPIT Y A Y+ + T+M LRP +PGRTYK++
Sbjct: 548 GYPGQSGALAVMDVIAGNTAPAGRLPITQYPAEYIDGLAETTMALRPNATYPGRTYKWYS 607
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
G YP+ +GL YT+FK ++A + YT+ T + V
Sbjct: 608 GTPTYPYAHGLHYTEFKAELA----------------QPAPYTIAT----AGYAEFERVA 647
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERV-FIAAGQSAKVG 712
T Q + N G+ +V+++ H K ++GY++V IA G+S V
Sbjct: 648 ------TVQATITNAGQRTSDYAALVFARHTNGPAPHPNKTLVGYKKVKAIAPGESRSVE 701
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTI 739
+ +L D N +L G + +
Sbjct: 702 VEITQA-ALARGDEEGNLVLYPGKYEL 727
>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 756
Score = 404 bits (1037), Expect = e-109, Method: Compositional matrix adjust.
Identities = 260/724 (35%), Positives = 379/724 (52%), Gaps = 58/724 (8%)
Query: 32 MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSF 91
M EK+ + + GV RLGLP Y WW EALHGV+ PG +F ATSF
Sbjct: 52 MQTQEKLDNLVSKSKGVARLGLPAYNWWGEALHGVA-------GAPGINFTGPYRTATSF 104
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPG 151
P +L +A+F++ L +I + EARA N G A + FW+P+IN RDPRWGR ETPG
Sbjct: 105 PMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPVDFWTPDINPFRDPRWGRGSETPG 164
Query: 152 EDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD 211
ED + Y + + GL EG + R KI A CKHY YD+++W G DR FD
Sbjct: 165 EDILRIKGYTKSLLSGL---EGDKAQR-------KIIATCKHYVGYDMEDWNGTDRHSFD 214
Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF--- 268
+++T QD+ E F+ PF+ C + V S MCSYN VNG+PTCAD +L +R WN+
Sbjct: 215 AKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDS 274
Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAE 328
+ YI SDC++++ I HK++ T ++A A G+DL C ++ GA QG +
Sbjct: 275 NNYITSDCEAVKDISLRHKYVA-TLQEATAIAFNNGMDLSCEYSGSSDIPGAFSQGLLNV 333
Query: 329 ADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNG 387
+ ID +L Y L+ GYFDG + Y NLG +I P+ +L + A +G+ LLKND+
Sbjct: 334 SVIDRALTRQYEGLVHAGYFDGAAATYANLGVQDINTPEAQKLVLQVAAEGLTLLKNDD- 392
Query: 388 ALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV-INYAPGCADI 446
LPL+ + +A+VG AN + + G Y G +P+ YA +K+ ++ A I
Sbjct: 393 TLPLSLKSGSKVAMVGFWANDSSKLSGIYSGPAPYLHNPV---YAGNKLGLDMAVATGPI 449
Query: 447 VCQN---NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA 503
+ ++ ++ A+DAAK +D + GLD S AEG DR D+ P Q +LI K+ A
Sbjct: 450 LQKSGAADNWTTKALDAAKKSDTILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKL--A 507
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
A G +VI VD N + S++W +PG++GG A+ VI G++ GRLPIT
Sbjct: 508 ALGKPLVVIALGDMVDHMPILNMKGVNSLIWANWPGQDGGTAVMQVITGEHAIAGRLPIT 567
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS-SPKSVD 622
Y A Y ++ M LRP N PGRTY++++ V PFG+GL YT+F K S S +V+
Sbjct: 568 QYPAKYTQLSMLDMNLRPGGNNPGRTYRWYN-ESVQPFGFGLHYTKFAAKFGSNSSLTVN 626
Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
I+ DI + + P V ++ V N G + + +
Sbjct: 627 IQ--------DIMKSCTKDHPDLCDVP-----------PIEVAVTNKGNRTSDFIALAFI 667
Query: 683 KPPGIAGTH---IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
K G G +K ++ Y R+ +G K +L VD + N + G +T+
Sbjct: 668 K--GEVGPKPYPLKTLVSYARLRDISGSQTKTASLALTLGTLSRVDQSGNLVAYPGEYTL 725
Query: 740 LVGE 743
L+ E
Sbjct: 726 LLDE 729
>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
Length = 723
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 254/748 (33%), Positives = 390/748 (52%), Gaps = 107/748 (14%)
Query: 25 AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE 84
AK+LV +MTL EK +Q+ + V RL +P Y WW+E LHGV+ G
Sbjct: 29 AKELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAGT------------- 75
Query: 85 VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNIN 136
AT FP I A F+E K+ ++TE RA YN + GLT+WSPN+N
Sbjct: 76 ---ATVFPQAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSPNVN 132
Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
+ RDPRWGR ET GEDPY+ R + +V+GLQ D + LK+SAC KH+A
Sbjct: 133 IFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GDGKYLKLSACAKHFAV 182
Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
+ + + R F++ V+++D+ ET++ FE CV E +V SVM +YNR NG P C
Sbjct: 183 H---SGPESLRHEFNAVVSQKDLHETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKA 239
Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
LL +RG W F G++VSDC ++ HK + T ++VA ++ G DL+CG+ Y N
Sbjct: 240 LLKDILRGKWGFKGHVVSDCWALADFHMHHK-VTSTATESVALAIENGCDLNCGNMYLNL 298
Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG-KNNICNPQHIELAAEAA 375
+ A ++G + E I T+ L +LG FD +Y + + N C +H +++ EA+
Sbjct: 299 LL-AYKEGLVTEEQITTAAERLMTTRFKLGMFDEDCEYNQIPYEVNDCK-EHNQVSLEAS 356
Query: 376 RQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY-- 433
R+ +VLLKN NG LPL+ +K +A++GP+AN+ + GNY GT +YT+ +DG +
Sbjct: 357 RKSMVLLKN-NGILPLDKSKLKAVAVIGPNANSEIMLKGNYSGTASKYTTILDGIHDVLD 415
Query: 434 -SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE------ 480
+ Y+ GC + + + + + A+ A+ AD ++ GLD ++E E
Sbjct: 416 DDVRVYYSEGCHLYKEKVEDLARRDDRLAEAVSVAERADVVILCLGLDSTIEGEQGDAGN 475
Query: 481 ---GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
D++DL LPG Q EL+ KV + K PV +V+ + + +N A+ + +IL Y
Sbjct: 476 GYGAGDKLDLNLPGIQQELLEKVLETGK-PVVVVLGTGSGLTLNGAEE--RCAAILNAWY 532
Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFPGRTYKFFD-G 595
PG GG A AD++FGK +P G+LP+T+Y+ + K+P +T ++ GRTY++ D
Sbjct: 533 PGSHGGTAAADILFGKCSPSGKLPVTFYK-DTDKLPEFTDYAMK------GRTYRYMDES 585
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+YPFGYGL+Y+ + P +V + D ID
Sbjct: 586 NCLYPFGYGLTYSTVELSNLQVP-AVRGEFDG----------------------ID---- 618
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFT 714
+E+EN G D EVV Y K + + G++RV + G+S V
Sbjct: 619 ------ISVEIENTGSYDIEEVVQCYIKDLESKYAVLNHSLAGFKRVSLKKGESKTVTMK 672
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
+N ++ + VD+A +L S + VG
Sbjct: 673 LNR-RAFEAVDDAGERILDSKKFKLFVG 699
>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
Length = 731
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 253/757 (33%), Positives = 385/757 (50%), Gaps = 99/757 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+ D + + +RA LV MT+ EK+ Q+ + RL +P Y WW+EALHG++ G+
Sbjct: 29 WFDPDISFAQRANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNGK-- 86
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY----NLGN---- 125
AT FP I A+F+ L ++ +S EARA Y ++GN
Sbjct: 87 --------------ATIFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQY 132
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
AGLTFW+PN+N+ RDPRWGR ET GEDP++ + +V+GLQ D + L
Sbjct: 133 AGLTFWTPNVNIFRDPRWGRGQETYGEDPFLTAQMGTAFVKGLQG---------DDPKYL 183
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K + KH+A + + + R HFD +++D+ ET++ FE V + V+ VMC+YN
Sbjct: 184 KSAGVAKHFAVH---SGPESLRHHFDVEPSQKDLYETYLPAFEALVTQAKVAGVMCAYNA 240
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG P CA +LL+ ++ W FHGYIVSDC ++ HK E A A L++G+
Sbjct: 241 VNGEPACASAQLLDGILKKQWGFHGYIVSDCGALNDFQAGHKVTKSGPESA-ALALQSGV 299
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
+L+CG Y +F A++Q + ID L L ++ +LG+FD G Y + + I
Sbjct: 300 NLNCGSTYEHFLKAALEQNLVPLELIDQRLTQLLMIRFQLGFFDPAGLNPYNEVTPDVIH 359
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+P+HI L+ + AR+ IVLLKNDN LPL+ +IK + GP A ++ +IGNY G
Sbjct: 360 SPEHINLSRDVARKSIVLLKNDNHVLPLSK-DIKVPYVTGPFAASSDMLIGNYYGISDSL 418
Query: 424 TSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPA--AIDAAKNADATVIVAGLDLSVE 478
S ++G + +NY G +N++ P A AK ADA + V G+ +E
Sbjct: 419 VSVLEGIAGKVSLGSSLNYRSGSLPF---HNNINPLNWAPQVAKTADAVIAVVGVSADME 475
Query: 479 ---------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
A+ DRV + LP Q + + ++A KGP+ LV+ + VDI + P
Sbjct: 476 GEEVDAIASADRGDRVAITLPQNQVDYVKQLAAHKKGPLILVVAAGSPVDI--SDLEPLA 533
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
+ILW+ YPGE+GG A+ADV+FG NP G LP+T+ ++ P+ + GRT
Sbjct: 534 DAILWIWYPGEQGGNAVADVLFGDTNPSGHLPLTFVKSIDDLPPFDDYAMT------GRT 587
Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
YKF + +YPFG+G SYT+F + + TV K
Sbjct: 588 YKFLEKAPLYPFGFGRSYTEFSFN---------------------DLTVSQGK------- 619
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG-IAGTHIKQVIGYERVFIAAGQS 708
+ T +EVEN G + G VV Y P + I + ++R+ +A ++
Sbjct: 620 ----AIEGEALTLSVEVENRGDIAGETVVQAYLSPIARMNNEAISSLKSFKRIHLAPKET 675
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
V T+ K L V+NA ++ G +++ VG+ +
Sbjct: 676 RWVELTIQG-KDLYQVNNAGETVWPQGRYSLAVGDSL 711
>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
17629]
Length = 714
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 230/626 (36%), Positives = 349/626 (55%), Gaps = 68/626 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
E AK LV +MT+ EK+ QM + + RLG+P Y WW+EALHGV+ G
Sbjct: 7 EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------- 55
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A+F+ L +KIG VSTE R +N + GLTFW+PN
Sbjct: 56 -----ATVFPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPN 110
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ G+ Y+RGLQ D LK +AC KH+
Sbjct: 111 VNIFRDPRWGRGHETYGEDPYLTGKLGCAYIRGLQG---------DDPDHLKSAACAKHF 161
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + + R FD++ ++ DM +T++ F+ CV + V +VM +YNRVNG P C
Sbjct: 162 AVH---SGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGS 218
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
LL +R ++ F G++VSDC +I E H + DT E++ A + G DL+CG +
Sbjct: 219 RTLLKDILRDEFGFEGHVVSDCWAILDFHE-HHHVTDTVEESAAMAVNNGCDLNCGSAFL 277
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIELAAE 373
+ A +G +++ I ++ L V +RLG P Y+++ + +H+EL+ E
Sbjct: 278 HLK-DAYDKGLVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVE 336
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AAR+ +VLLKN + LPL+ N+KT+A++GP+AN+ A+IGNY GT RY +P++G Y
Sbjct: 337 AARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQY 396
Query: 434 ----SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
++V+ YA GC + + A+ A+ +D V+ GLD ++E E
Sbjct: 397 LGEDTRVL-YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGD 455
Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
D++ L+LPG Q EL+ VA K PV LV+ + A+D+++A+ + + +I+
Sbjct: 456 AGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEEH--VDAIID 512
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
YPG GG+A+A+ IFG+Y+P G+LP+T+Y+ ++P + RTY++ +
Sbjct: 513 SWYPGARGGKAVAEAIFGEYSPSGKLPVTFYQGT------ENLPEFTDYSMAHRTYRYTN 566
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKS 620
V+YPFGYGL Y + Y S K+
Sbjct: 567 ENVLYPFGYGLHYGETNYDGMSVDKA 592
>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
Length = 714
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 230/626 (36%), Positives = 349/626 (55%), Gaps = 68/626 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
E AK LV +MT+ EK+ QM + + RLG+P Y WW+EALHGV+ G
Sbjct: 7 EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------- 55
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A+F+ L +KIG VSTE R +N + GLTFW+PN
Sbjct: 56 -----ATVFPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPN 110
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ G+ Y+RGLQ D LK +AC KH+
Sbjct: 111 VNIFRDPRWGRGHETYGEDPYLTGKLGCAYIRGLQG---------DDPDHLKSAACAKHF 161
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + + R FD++ ++ DM +T++ F+ CV + V +VM +YNRVNG P C
Sbjct: 162 AVH---SGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGS 218
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
LL +R ++ F G++VSDC +I E H + DT E++ A + G DL+CG +
Sbjct: 219 RTLLKDILRDEFGFEGHVVSDCWAILDFHE-HHHVTDTVEESAAMAVNNGCDLNCGSAFL 277
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIELAAE 373
+ A +G +++ I ++ L V +RLG P Y+++ + +H+EL+ E
Sbjct: 278 HLK-DAYDKGMVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVE 336
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AAR+ +VLLKN + LPL+ N+KT+A++GP+AN+ A+IGNY GT RY +P++G Y
Sbjct: 337 AARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQY 396
Query: 434 ----SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
++V+ YA GC + + A+ A+ +D V+ GLD ++E E
Sbjct: 397 LGEDTRVL-YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGD 455
Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
D++ L+LPG Q EL+ VA K PV LV+ + A+D+++A+ + + +I+
Sbjct: 456 AGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEEH--VDAIID 512
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
YPG GG+A+A+ IFG+Y+P G+LP+T+Y+ ++P + RTY++ +
Sbjct: 513 SWYPGARGGKAVAEAIFGEYSPNGKLPVTFYQGT------ENLPEFTDYSMAHRTYRYTN 566
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKS 620
V+YPFGYGL Y + Y S K+
Sbjct: 567 ENVLYPFGYGLHYGETNYDGLSVDKA 592
>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
Length = 714
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 230/626 (36%), Positives = 349/626 (55%), Gaps = 68/626 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
E AK LV +MT+ EK+ QM + + RLG+P Y WW+EALHGV+ G
Sbjct: 7 EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------- 55
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A+F+ L +KIG VSTE R +N + GLTFW+PN
Sbjct: 56 -----ATVFPQAIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPN 110
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ G+ Y+RGLQ D LK +AC KH+
Sbjct: 111 VNIFRDPRWGRGHETYGEDPYLTGKLGCAYIRGLQG---------DDPDHLKSAACAKHF 161
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + + R FD++ ++ DM +T++ F+ CV + V +VM +YNRVNG P C
Sbjct: 162 AVH---SGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGS 218
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
LL +R ++ F G++VSDC +I E H + DT E++ A + G DL+CG +
Sbjct: 219 RTLLKDILRDEFGFEGHVVSDCWAILDFHE-HHHVTDTVEESAAMAVNNGCDLNCGSAFL 277
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIELAAE 373
+ A +G +++ I ++ L V +RLG P Y+++ + +H+EL+ E
Sbjct: 278 HLK-DAYDKGLVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVE 336
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AAR+ +VLLKN + LPL+ N+KT+A++GP+AN+ A+IGNY GT RY +P++G Y
Sbjct: 337 AARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQY 396
Query: 434 ----SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
++V+ YA GC + + A+ A+ +D V+ GLD ++E E
Sbjct: 397 LGDDTRVL-YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGD 455
Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
D++ L+LPG Q EL+ VA K PV LV+ + A+D+++A+ + + +I+
Sbjct: 456 AGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAEEH--VDAIID 512
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
YPG GG+A+A+ IFG+Y+P G+LP+T+Y+ ++P + RTY++ +
Sbjct: 513 SWYPGARGGKAVAEAIFGEYSPSGKLPVTFYQGT------ENLPEFTDYSMAHRTYRYTN 566
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKS 620
V+YPFGYGL Y + Y S K+
Sbjct: 567 ENVLYPFGYGLHYGETNYDGLSVDKA 592
>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 763
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 269/752 (35%), Positives = 383/752 (50%), Gaps = 68/752 (9%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD P ERA LV M EK+ + + GV RLGLP Y WW EALHGV+
Sbjct: 31 LSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEALHGVA- 89
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
PG F ATSFP IL +A+F++ L KI + EARA N G A +
Sbjct: 90 ------GAPGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVAPM 143
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+W+P+IN VRD RWGR E+PGED + Y + GL EG + R KI
Sbjct: 144 DYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAGL---EGDQAQR-------KII 193
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKHY YD++ W G DR +F +++T QD+ E ++ PF+ C + V S MCSYN VNG
Sbjct: 194 ATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNG 253
Query: 249 IPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
+PTCAD +L +R WN+ + YI SDC+++ I E+HK++ +T A G+
Sbjct: 254 VPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYV-ETLAQGTALAFAKGM 312
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICN 364
DL C ++ GA QG + + ID +L Y L+ GYFDG+ Y NL N+I
Sbjct: 313 DLSCEYSGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDGAKATYANLSYNDINT 372
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
P+ +L+ + +G+V+LKND+ LPL +A++G AN + + G Y G P
Sbjct: 373 PEARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRH 431
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIP-----AAIDAAKNADATVIVAGLDLSVEA 479
SP+ F ++ A ++ NS +P A+DAA+ +D + G D +V
Sbjct: 432 SPV--FAGEQMGLDMAIAWGPMI--QNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQ 487
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG-AVDINFAKNNPKIKSILWVGYP 538
EG DR + P Q +L+ K+A K LV+++ G D + + I SI+W +P
Sbjct: 488 EGYDRTTISFPQVQIDLLAKLAKLGK---PLVVITLGDMTDHSPLLSMEGINSIIWANWP 544
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
G++GG AI +VI G + P GRLPIT Y A+YVK+ M LRP PGRTY++F+ V
Sbjct: 545 GQDGGPAILNVISGVHAPAGRLPITEYPADYVKLSMLDMNLRPHAESPGRTYRWFN-ESV 603
Query: 599 YPFGYGLSYTQFKYKVASSPK-SVDIKLDKD---QQCRDINYTVGTNKPPCAAVLIDDVK 654
PFG+GL YT F+ AS + DI+ D QQ +D+ C
Sbjct: 604 QPFGFGLHYTTFEAGFASEEGLTYDIQETLDSCTQQYKDL----------CEVA------ 647
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYERVFIAAGQSAKV 711
++ V N G V + + K G G +K +I Y R+ G + K
Sbjct: 648 ------PLEVTVANKGNRTSDFVALAFIK--GEVGPKPYPLKTLITYGRLRDIHGGAKKS 699
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
L VD + N+++ G +T+L+ E
Sbjct: 700 ASLPLTLGELARVDQSGNTVIYPGEYTLLLDE 731
>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
[Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
nidulans FGSC A4]
Length = 803
Score = 400 bits (1029), Expect = e-108, Method: Compositional matrix adjust.
Identities = 236/605 (39%), Positives = 340/605 (56%), Gaps = 27/605 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS P CD L +RA LV T E V G+ GV RLGLP Y+ W EALHGV
Sbjct: 55 LSLTPVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVG- 113
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
R N +F ATSFP I A+ N++L +IG VST+ RA N G G+
Sbjct: 114 ---RANFVESGNFS----WATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGV 166
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+SPNIN R P WGR ETPGED ++ Y Y+ LQ GV D LKI
Sbjct: 167 DVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQG--GV------DPETLKII 218
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A KHYA YD+++W + R D ++T+Q++ E + PF + + V SVMCSYN VNG
Sbjct: 219 ATAKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNG 278
Query: 249 IPTCADPKLLNQTIRGDWNFH--GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
+P+CA+ L +R + F GY+ DC ++ + H + ++ + A A + AG D
Sbjct: 279 VPSCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASN-EAAASADSILAGTD 337
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNP 365
+DCG Y + A + ++ +DI+ + LY L++ GYFDG Y+++ +++ +
Sbjct: 338 IDCGTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDAPYRDITWDDVLST 397
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+A EAA +GIVLLKND LPL+ +IK++A++GP AN T+ + GNY G S
Sbjct: 398 DAWNIAYEAAVEGIVLLKNDE-TLPLSK-DIKSVAVIGPWANVTEELQGNYFGPAPYLIS 455
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ GF ++YA G ++ + S A+ AAK ADA + G+D ++EAE DR
Sbjct: 456 PLTGFRDSGLDVHYALGT-NLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRE 514
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
++ PG Q +LI+K+++ K P+ ++ M G VD + K+N + +++W GYPG+ GG A
Sbjct: 515 NITWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHA 573
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
+AD+I GK P GRL T Y A Y ++ P M LRP + PG+TY ++ G VY FG
Sbjct: 574 LADIITGKRAPAGRLVTTQYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFG 633
Query: 603 YGLSY 607
+GL Y
Sbjct: 634 HGLFY 638
>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
Length = 755
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 268/751 (35%), Positives = 378/751 (50%), Gaps = 67/751 (8%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L D CD ERA LVE M EK+ +L GV RLGLP Y WW EALHGV+
Sbjct: 28 LKDNKICDVTAAPAERAAALVEAMQTNEKLD---NLMRGVTRLGLPKYNWWGEALHGVA- 83
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
PG +F ATSFP +L +A+F++ L KI + EARA N G A +
Sbjct: 84 ------GAPGINFTGAYKTATSFPMPLLMSAAFDDDLIFKIANIIGNEARAFGNGGVAPV 137
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
FW+P+IN RDPRWGR ETPGED + Y + + GL EG + R KI
Sbjct: 138 DFWTPDINPFRDPRWGRGSETPGEDIVRIKGYTKHLLAGL---EGDKPQR-------KII 187
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKHY YD++ W G DR F++++ QD+ E ++ PF+ C + V S MCSYN VNG
Sbjct: 188 ATCKHYVGYDMEAWGGIDRHSFNAKINMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNG 247
Query: 249 IPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
+PTCAD +L +R WN+ + YI SDC++++ I HK+ T + AG+
Sbjct: 248 VPTCADTYVLQTILRDHWNWTESNNYITSDCEAVKDISLKHKYAK-TNAEGTGLAFTAGM 306
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICN 364
D C ++ GA Q ++ ID +L+ Y L+R GYFDG + Y NLG +I
Sbjct: 307 DNSCEYTGSSDIPGAFNQSYLSIPTIDRALKRQYEGLVRAGYFDGAAATYANLGVKDINT 366
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
P+ +L+ + A +G+VLLKND+ LPL+ N +A++G AN T + G Y G
Sbjct: 367 PEAQQLSLQVASEGLVLLKNDD-TLPLSLTNGSKVAMLGFWANDTSKLSGIYSGPAPYLR 425
Query: 425 SPMDGFYAYSKV-INYAPGCADIVCQNNS-----MIPAAIDAAKNADATVIVAGLDLSVE 478
SP+ +A K+ ++ A I+ Q+NS A+ AA+ +D + GLD S
Sbjct: 426 SPV---WAGQKLGLDMAIASGPILQQSNSSTRDNWTTNALAAAEKSDYILYFGGLDPSAA 482
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP-----KIKSIL 533
AEG DR + P Q +LI K+A K V LV+ + N+P + S++
Sbjct: 483 AEGFDRNSIAWPTAQVDLIKKLAAIGKPLVVLVLG-------DLMDNSPLLELDGVNSVI 535
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
W +PG++GG A+ V+ G GRLPIT Y ANY ++ M +RP ++ PGRTY++F
Sbjct: 536 WANWPGQDGGSAVMQVVTGAVAVAGRLPITQYPANYTELSMLDMNMRPSSSSPGRTYRWF 595
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
+G V PFG GL YT F K A++ I Y + C D
Sbjct: 596 NG-AVQPFGTGLHYTTFDAKFAAN--------------STIEYDISNITKECTNQYPDTC 640
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVG 712
+ + V N G + + + K G A +K +I Y RV G K
Sbjct: 641 SVP----SIPVAVTNSGNRTSDFIALAFIKGENGPAPYPLKTLISYTRVRDVKGGQTKSA 696
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+L VD N++L G +T+L+ E
Sbjct: 697 EMQLTLGNLARVDQMGNTVLYPGEYTVLLDE 727
>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 696
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 236/641 (36%), Positives = 347/641 (54%), Gaps = 80/641 (12%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
++AK LV MTL E+ Q+ + + RLG+P Y WW+EALHGV+ G
Sbjct: 8 KKAKALVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAGV----------- 56
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
ATSFP I A+F++ L K++ + ++ E RA YN + GLTFWSPN
Sbjct: 57 -----ATSFPQAIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPN 111
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ R + +V+GLQ EG LK +AC KH+
Sbjct: 112 VNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGEEG-----------LKTAACAKHF 160
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + DR HFD+RV+++D+ ET++ FE V E +V SVM +YNR NG P C
Sbjct: 161 AVHSGPE---ADRHHFDARVSQKDLWETYLPAFEALVKEAEVESVMGAYNRTNGEPCCGS 217
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
P L+ +R W F G+ VSDC +I+ E H + T +++ A LK+G DL+CG+ Y
Sbjct: 218 PTLMKDILREKWGFQGHYVSDCWAIKDFHE-HHMVTSTAQESAALALKSGCDLNCGNTYL 276
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
+ M A Q G + E +I T+ L+ LG FDGS Y + + + H+ +A EA
Sbjct: 277 HILM-AYQNGLVTEEEITTAAERLFTTRYLLGLFDGST-YDAIPYEVVESKPHLSVADEA 334
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA-- 432
+ IVLLKN NG LPLN +IKT+ ++GP+AN+ KA+IGNY GT +Y + ++G
Sbjct: 335 TAKSIVLLKN-NGLLPLNKESIKTIGVIGPNANSRKALIGNYHGTSSQYITILEGLQKEV 393
Query: 433 -------YSKVIN-YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE---- 480
YS+ + YA + Q + + A I AK++D ++ GLD ++E E
Sbjct: 394 GDEVRILYSEGSHLYADRVEPLAYQRDRLSEAKI-VAKHSDVVIVCVGLDETLEGEEGDT 452
Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
D+ DL LP Q EL+ +A K PV L + + A+D+ +A + ++L
Sbjct: 453 GNAYASGDKRDLALPEPQQELVEAMAKMGK-PVILCLSAGSAIDLQYA--DAHYDAVLQA 509
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
YPG GG+ IA + G+ P G+LP+T+Y + +P + GRTY++
Sbjct: 510 WYPGARGGQVIAKALLGEIVPSGKLPVTFYR------DLSGLPAFEDYSMQGRTYRYMQE 563
Query: 596 PVVYPFGYGLSYTQFKYKVASSPK---------SVDIKLDK 627
+YPFGYGL+Y + + + AS + VD KL++
Sbjct: 564 EALYPFGYGLTYGKCRIEEASYDQGSLRVLVHNEVDFKLEE 604
>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 763
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 267/752 (35%), Positives = 382/752 (50%), Gaps = 68/752 (9%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD P ERA LV M EK+ + + GV RLGLP Y WW EALHGV+
Sbjct: 31 LSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEALHGVA- 89
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
PG F ATSFP IL +A+F++ L KI + EARA N G A +
Sbjct: 90 ------GAPGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVAPV 143
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+W+P+IN VRD RWGR E+PGED + Y + GL EG + R KI
Sbjct: 144 DYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAGL---EGDQAQR-------KII 193
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A CKHY YD++ W G DR +F +++T QD+ E ++ PF+ C + V S MCSYN VNG
Sbjct: 194 ATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNG 253
Query: 249 IPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
IPTCAD +L +R WN+ + YI SDC+++ I E+HK++ +T A G+
Sbjct: 254 IPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYV-ETLAQGTALAFAKGM 312
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICN 364
DL C ++ GA QG + + ID +L Y L+ GYFDG+ Y NL +I
Sbjct: 313 DLSCEYTGSSDIPGAWAQGLLNISVIDKALTRQYEGLVHAGYFDGAKATYANLSYKDINT 372
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
P+ +L+ + +G+V+LKND+ LPL +A++G AN + + G Y G P
Sbjct: 373 PEARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRH 431
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIP-----AAIDAAKNADATVIVAGLDLSVEA 479
SP+ F ++ A ++ NS +P A+DAA+ +D + G D +V
Sbjct: 432 SPV--FAGEQMGLDMAIAWGPMI--QNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQ 487
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG-AVDINFAKNNPKIKSILWVGYP 538
EG DR + P Q +L+ K+A K LV+++ G D + + + SI+W +P
Sbjct: 488 EGYDRTTISFPQVQIDLLTKLAKLGK---PLVVITLGDMTDHSPLLSMEGVNSIIWANWP 544
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
G++GG AI +V+ G + P GRLPIT Y A+YVK+ M LRP PGRTY++F+ V
Sbjct: 545 GQDGGPAILNVVSGAHAPAGRLPITEYPADYVKLSMLDMNLRPHTESPGRTYRWFN-ESV 603
Query: 599 YPFGYGLSYTQFKYKVASSPK-SVDIKLDKD---QQCRDINYTVGTNKPPCAAVLIDDVK 654
PFG+GL YT F+ AS + DI+ D QQ +D+ C
Sbjct: 604 QPFGFGLHYTTFEASFASEEGLTYDIEEILDGCTQQYKDL----------CEVA------ 647
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYERVFIAAGQSAKV 711
++ V N G V + + K G G +K +I Y R+ G + K
Sbjct: 648 ------PLEVTVANKGNRTSDFVALAFIK--GEVGPKPYPLKTLITYGRLRDIHGGAKKS 699
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
L VD + N+++ G +T+L+ E
Sbjct: 700 ASLPLTLGELARVDQSGNTVIYPGEYTLLLDE 731
>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 824
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 269/763 (35%), Positives = 388/763 (50%), Gaps = 68/763 (8%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD L ERA LV +T+ EK+ + + A GVPRL +P YEWWSE LHGV+
Sbjct: 65 CDETLSPKERAAALVAELTIWEKLDNLVNEAPGVPRLAIPPYEWWSEGLHGVA------- 117
Query: 75 SPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW- 131
S PGT F ATSFP I+ ++F++ L K IG+ VS EARA N G +GL +
Sbjct: 118 SSPGTKFAKSGNFSYATSFPQPIVLGSAFDDDLVKAIGEVVSKEARAFSNRGRSGLDLYV 177
Query: 132 --------------------SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDV 171
SPNIN +DPRWGR ETPGEDP+ + Y + GL
Sbjct: 178 SSISRHIEPEVRDDMLTEPESPNINAFKDPRWGRGQETPGEDPFHLQNYVAAMLTGL--- 234
Query: 172 EGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCV 231
EG + + K+ A CKHYAA D +N++G DR FD+ +T QD+ E ++ PF+ C
Sbjct: 235 EGGDPSK-------KLIATCKHYAANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCA 287
Query: 232 NEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKF 288
+ V S MCSYN +NG P CA+P LL +R W ++G Y+ +DCD + +V H +
Sbjct: 288 VDKKVGSFMCSYNAINGEPLCANPYLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHY 347
Query: 289 LNDTKEDAVARVLKAGLDLDCGDYYTNFTMG-AVQQGKIAEADIDTSLRFLYIVLMRLGY 347
D A A +KAG DL+C + + + A Q I+E ++D SL +Y L+ +G
Sbjct: 348 APDLGH-AAAWAMKAGTDLECNAFPGSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQ 406
Query: 348 FD---GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG-NIKTLALVG 403
FD G P ++L +++ + +LA +A +G VLLKND G LPL+ K AL+G
Sbjct: 407 FDSARGQP-LRSLSWDDVNTKEAQKLAYQAVIEGAVLLKND-GILPLSAAWREKKYALIG 464
Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKN 463
P NAT M GNY G S + Y+ G + + A+D+A
Sbjct: 465 PWINATTQMQGNYFGPAPYLISLYQAAKEFGLDFTYSLGSR--INSTDDSFKQALDSAHA 522
Query: 464 ADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
A V G+D ++EAE +DR L P Q +L+ V+ K PV ++ G VD
Sbjct: 523 AALIVFAGGVDNTLEAETRDRKTLAWPESQLDLLRAVSALGK-PVIVLQFGGGQVDDTEL 581
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLR-- 580
N I ++LW GYPG+ GG+A+ D++FG+ P GRL +T Y A+Y + +P T M LR
Sbjct: 582 LANHSINALLWGGYPGQSGGKAVIDLLFGRAAPAGRLSVTQYPASYNEDVPSTDMNLRPG 641
Query: 581 PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
P N+ GRTY +++G V P+G+GL YT F K+ + S IK ++ +Y GT
Sbjct: 642 PGNSGLGRTYMWYNGDAVVPYGFGLHYTTFDAKLKARQASALIKTEEVSSLLSNDYVSGT 701
Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYE 699
L+ + I V N G + V +++ + G K + GY
Sbjct: 702 --------LVWQQILTKPVVSVLITVSNTGNVASDYVALLFLRSNAGPTPQPTKTLAGYH 753
Query: 700 RVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
R I G ++ ++ + L VD N +L G++ + V
Sbjct: 754 RFRNIQPGDRSEREVSIT-IERLVRVDELGNRVLHPGSYELFV 795
>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
Length = 739
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 255/756 (33%), Positives = 381/756 (50%), Gaps = 97/756 (12%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FP+ D +LP +R +DLV R+TL EKV+QM + V RLG+P Y WW+E LHG IGR
Sbjct: 25 FPFRDPQLPVEQRVEDLVSRLTLEEKVKQMLNSTPPVERLGIPAYNWWNECLHG---IGR 81
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
T FP I A++N++L K++ +++ E RA+YN
Sbjct: 82 TKYH------------VTVFPQAIGMAAAWNDALIKEVASSIADEGRAIYNDTQRKEDYS 129
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LT+W+PNIN+ RDPRWGR ET GEDPY+ R +V+GLQ + R
Sbjct: 130 QYHALTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGEAFVQGLQG---------DNPR 180
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK SAC KHYA + + +R F+S V+ D+ +T++ F V + VS VMC+Y
Sbjct: 181 YLKASACAKHYAVH---SGPEKNRHSFNSDVSTYDLWDTYLPAFRTLVVDAKVSGVMCAY 237
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N G P C + L+ +R WNF GY+ SDC +I I HK D A V
Sbjct: 238 NAFQGQPCCGNDLLMQSILRDKWNFTGYVTSDCGAIDDIFNHHKTHPDAATAAADAVFH- 296
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNN 361
G DLDCG + AV+ G I E +D S++ L+ + RLG FD Y + +
Sbjct: 297 GTDLDCGHSAYLALVKAVKDGIITEKQLDVSVKRLFTIRFRLGLFDPVELVDYARIPISI 356
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ +H +LA + AR+ +VLLKND LPL +K + ++GP+A++ ++++GNY G P
Sbjct: 357 LECRKHQDLAKQLARESMVLLKNDQ-LLPLQKNKLKKVVVMGPNADSRESLLGNYNGNPS 415
Query: 422 RYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
R +P+ +++V Y G + + + ++ AK ADA + + G+ +
Sbjct: 416 RMLTPLQAIRERLGGWTEV-EYIEGVDHVNTISADDLKQYVNRAKGADAVIFIGGISPRL 474
Query: 478 EAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
E E G DR + LP QT+++ K A P V+M+ A+ I + N
Sbjct: 475 EGEEMPVSKDGFDGGDRTTIALPAVQTQMM-KAWVAEHIPTVFVMMTGSALAIPWEAQN- 532
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
+ +IL Y G+ GG AIADV+FG YNP G+LP+T+Y + + +P + G
Sbjct: 533 -VPAILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKD------SDLPDFESYDMQG 585
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
RTY++F+G +YPFGYGLSYT F Y PK CR
Sbjct: 586 RTYRYFNGKALYPFGYGLSYTSFAYSSLKLPKV----------CR--------------- 620
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAG 706
D + + V+N G +G EVV +Y S P + + G++R+ + AG
Sbjct: 621 -------TTDKEIEVTVTVKNTGHTEGEEVVQLYVSHPDKKILVPLTALKGFKRIQLKAG 673
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++ +V F++++ + L VD + +G I VG
Sbjct: 674 EAQRVTFSLSS-EDLSCVDENGIRKVWAGTVKIQVG 708
>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
Length = 740
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 266/764 (34%), Positives = 384/764 (50%), Gaps = 108/764 (14%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
P+ D L R +DLV R+TL EKV QM A +PRLG+P Y +W+E LHGV+ GR
Sbjct: 22 PFRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNGR- 80
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
AT FP +I A+++ L ++ +S EARA ++ A
Sbjct: 81 ---------------ATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQ 125
Query: 127 -----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLTFW+PNIN+ RDPRWGR ET GEDP++ R A +VRGLQ D+
Sbjct: 126 TQQYQGLTFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQG--------DTP 177
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
LK++AC KHYA + + N+R F++RVT D+ ++++ FE V V SVM
Sbjct: 178 DTHLKLAACAKHYAVH---SGPENERHTFNARVTPHDLWDSYLPAFEHLVRHARVESVMG 234
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+YNR P CA LL +R W F G++VSDC +++ I E+H+ D E A A L
Sbjct: 235 AYNRTLDEPCCASQFLLLDILRERWGFEGHVVSDCWALRDIHETHRITTDPVESA-ALAL 293
Query: 302 KAGLDLDCGDYYTNFTM--GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK 359
G DL CG T F + AVQ+G I EADID +L +LG FD + +N
Sbjct: 294 TKGCDLACG---TTFELLGEAVQRGLITEADIDRALSRHLRARFKLGMFDPADDNRNPWS 350
Query: 360 NN------ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
N + H LA EAA VLL+N N LPL +++++ + GP A A++
Sbjct: 351 NPPAPEAIVTCAAHTALACEAAVASCVLLQNHNHILPLRP-DVRSIYITGPLAATQDALL 409
Query: 414 GNYEGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
GNY G P R + +DG A +Y PG + N++ A D A + D T+
Sbjct: 410 GNYYGLPPRAITLLDGLAAALPEGIRADYRPGALLSTPKQNALEWAEFDCA-SCDVTIAC 468
Query: 471 AGLDLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
GL +E E DR D+ LP Q + + +G +VI+ G+ ++
Sbjct: 469 LGLTALLEGEEGEAIASSLHGDRDDISLPPPQRLFLESLIQ--RGARVIVILFGGSA-LS 525
Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
K+++ILW GYPG+EGGRA+AD++ G+ +P GRLPIT+YE PY + +R
Sbjct: 526 LGPLADKVEAILWAGYPGQEGGRALADILLGRASPSGRLPITFYENINDLPPYANYSMR- 584
Query: 582 VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
GRT+++FDG +PFG+GL+YT+F Y D + D+ Y+ G +
Sbjct: 585 -----GRTHRWFDGTPAWPFGFGLTYTRFTY--------------SDLRVSDV-YSPGND 624
Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK---PPGIAGTHIKQVIGY 698
P C +VL+ N G + +E+V +Y PG + + +
Sbjct: 625 SPLCGSVLL----------------TNTGDHEAAEIVQIYLTDFDAPGNGPVPRENLADF 668
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
RV +A GQS +V F++ + + +VD A A T+ VG
Sbjct: 669 HRVTLAPGQSRRVEFSIPP-EHILLVDTNGRRTRAPLAFTVHVG 711
>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
Length = 709
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 224/612 (36%), Positives = 338/612 (55%), Gaps = 69/612 (11%)
Query: 25 AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE 84
A+ +V RMTL EK+ Q+ A +PRL +P Y WW+EALHGV+ G
Sbjct: 14 ARRIVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGI------------- 60
Query: 85 VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------NAGLTFWSPNIN 136
AT FP I A F+ + ++IG +STE RA YN GLTFWSPN+N
Sbjct: 61 ---ATVFPQAIGLAAMFDSDMMERIGAVISTEGRAKYNEAVRHGDRDIYKGLTFWSPNVN 117
Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
+ RDPRWGR ET GEDPY+ R A+ ++RG+Q D + LK +AC KH+A
Sbjct: 118 IFRDPRWGRGQETYGEDPYLTARLAVAFIRGIQ----------GDGKYLKAAACAKHFAV 167
Query: 197 YDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
+ G + R FD+RV+++D+ ET++ F+ V E V VM +YNRVNG+P CA
Sbjct: 168 HS-----GPEALRHEFDARVSQKDLHETYLSAFKAAVKEAQVEIVMGAYNRVNGVPACAS 222
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
+LL+ +R +W F G++VSD ++++ I + H ++ D + +A LKAG +L C
Sbjct: 223 HELLSDILRSEWGFEGHVVSDYEALEDIFKHHHYVAD-EAHTMAVALKAGCNL-CAGKIA 280
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
+V +G I+E +I ++ L+ + +G Y ++G P+H +LA EA
Sbjct: 281 RHLRSSVDEGLISEDEITEAVERLFTTRIMMGMMADDCPYDSIGYEENDTPEHHQLAVEA 340
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---FY 431
A + VLLKND G LPL I ++A++GP+AN+ K + GNY GT RY + ++G
Sbjct: 341 ASRSFVLLKND-GLLPLEMEKISSIAVIGPNANSRKMLEGNYNGTASRYVTVLEGIQDLV 399
Query: 432 AYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
S + Y+ GC + N + A+ AA++AD V+ GLD ++E E
Sbjct: 400 GDSVRVWYSEGCHLYKNFHSSLSGRNDRLAEAVSAAQHADVVVLCLGLDATLEGEEGDVE 459
Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
D+ +L LPG Q L++ + K PV L++ S A+ + +N+ +K+IL +
Sbjct: 460 VGFGSGDKPNLSLPGRQQLLLDTMLTVGK-PVILLLASGSALTLGGRENDENLKAILQIW 518
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
YPG GG+A+ADV+FG+ P G+LP+T+Y + +P + GRTY++ G
Sbjct: 519 YPGAMGGKAVADVLFGRRAPAGKLPVTFYASA------DELPAFEDYSMAGRTYRYMKGN 572
Query: 597 VVYPFGYGLSYT 608
+YPFGYGL+Y+
Sbjct: 573 ALYPFGYGLTYS 584
>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
Length = 802
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 233/605 (38%), Positives = 337/605 (55%), Gaps = 27/605 (4%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS P CD L +RA LV T E V G+ GV RLGLP Y+ W EALHGV
Sbjct: 54 LSLTPVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVG- 112
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
R N +F ATSFP I A+ N++L +IG VST+ RA N G G+
Sbjct: 113 ---RANFVESGNFS----WATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGV 165
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+SPNIN R P WGR ETPGED ++ Y Y+ LQ D KI
Sbjct: 166 DVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQGA--------VDPETSKII 217
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
A KHYA YD+++W + R D ++T+Q++ E + PF + + V SVMCSYN VNG
Sbjct: 218 ATAKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNG 277
Query: 249 IPTCADPKLLNQTIRGDWNFH--GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
+P+CA+ L +R + F GY+ DC ++ + H + ++ + A A + AG D
Sbjct: 278 VPSCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASN-EAAASADSILAGTD 336
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNP 365
+DCG Y + A + ++ +DI+ + LY L++ GYFDG Y+++ +++ +
Sbjct: 337 IDCGTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDAPYRDITWDDVLST 396
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+A EAA +GIVLLKND LPL+ +IK++A++GP AN T+ + GNY G S
Sbjct: 397 DAWNIAYEAAVEGIVLLKNDE-TLPLSK-DIKSVAVIGPWANVTEELQGNYFGPAPYLIS 454
Query: 426 PMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
P+ GF ++YA G ++ + S A+ AAK ADA + G+D ++EAE DR
Sbjct: 455 PLTGFRDSGLDVHYALGT-NLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRE 513
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
++ PG Q +LI+K+++ K P+ ++ M G VD + K+N + +++W GYPG+ GG A
Sbjct: 514 NITWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHA 572
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKI-PYTSMPLRP--VNNFPGRTYKFFDGPVVYPFG 602
+AD+I GK P GRL T Y A Y ++ P M LRP + PG+TY ++ G VY FG
Sbjct: 573 LADIITGKRAPAGRLVTTQYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFG 632
Query: 603 YGLSY 607
+GL Y
Sbjct: 633 HGLFY 637
>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 835
Score = 397 bits (1021), Expect = e-107, Method: Compositional matrix adjust.
Identities = 245/640 (38%), Positives = 349/640 (54%), Gaps = 37/640 (5%)
Query: 3 ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
+ K LSD CD LP ERA LV +T EK+Q + A G PR+GLP Y WWSEA
Sbjct: 17 DCTKPPLSDIKVCDRTLPEAERAAALVAALTDEEKLQNLVSKAPGAPRIGLPAYNWWSEA 76
Query: 63 LHGVSFIGRRTNSPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEAR 118
LHGV+ PGT F + PG +TSFP +L A+F++ L + +G + TEAR
Sbjct: 77 LHGVAHA-------PGTQF-RDGPGDFNSSTSFPMPLLMAAAFDDELIEAVGDVIGTEAR 128
Query: 119 AMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
A N G +GL +W+PN+N RDPRWGR ETPGED + RYA + +RGL+
Sbjct: 129 AFGNAGWSGLDYWTPNVNPFRDPRWGRGSETPGEDVVRLKRYAASMIRGLEGRSSSSSSC 188
Query: 179 DSDS--RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
S P ++ + CKHYA D ++W G R FD+ ++ QD+ E ++ PF+ C + V
Sbjct: 189 SFGSGGEPPRVISTCKHYAGNDFEDWNGTTRHDFDAVISAQDLAEYYLAPFQQCARDSRV 248
Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTK 293
SVMC+YN VNG+P+CA+ L+N +RG WN+ Y+ SDC+++ + H + DT
Sbjct: 249 GSVMCAYNAVNGVPSCANSYLMNTILRGHWNWTEHDNYVTSDCEAVLDVSAHHHYA-DTN 307
Query: 294 EDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--S 351
+ +AG+D C ++ GA G + +D +L LY L+R+GYFDG S
Sbjct: 308 AEGTGLCFEAGMDTSCEYEGSSDIPGASAGGFLTWPAVDRALTRLYRSLVRVGYFDGPES 367
Query: 352 PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL---------NTGNIKTLALV 402
P + +LG ++ P+ ELA AA +GIVLLKNDN LPL G + +A++
Sbjct: 368 P-HASLGWADVNRPEAQELALRAAVEGIVLLKNDNDTLPLPLPDDVVVTADGGRRRVAMI 426
Query: 403 GPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGC---ADIVCQNNSMIPAAID 459
G A+A + G Y G P SP + A G D + ++ A++
Sbjct: 427 GFWADAPDKLFGGYSGAPPFARSPASAARQLGWNVTVAGGPVLEGDSDEEEDTWTAPAVE 486
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
AA +AD V GLD S E KDR+ + P Q LI+++A K PV +V M D
Sbjct: 487 AAADADYIVYFGGLDTSAAGETKDRMTIGWPAAQLALISELARLGK-PVVVVQMGDQLDD 545
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMP 578
+ + + ++LW +PG++GG A+ ++ G +P GRLP+T Y ANY +P T M
Sbjct: 546 TPLFELD-GVGAVLWANWPGQDGGTAVVRLLSGAESPAGRLPVTQYPANYTDAVPLTDMT 604
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP 618
LRP PGRTY+++ P V PFG+GL YT F+ + P
Sbjct: 605 LRPSATNPGRTYRWYPTP-VRPFGFGLHYTTFRAEFGPHP 643
>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 738
Score = 397 bits (1021), Expect = e-107, Method: Compositional matrix adjust.
Identities = 259/763 (33%), Positives = 382/763 (50%), Gaps = 109/763 (14%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ + L ER DLV RMTL EKV QM + A + RLG+P Y WW+E LHGV+
Sbjct: 31 YPFNNPALSMDERVADLVGRMTLEEKVSQMLNSAPAIERLGVPAYNWWNECLHGVA---- 86
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGN---- 125
T F T +P I A+++++ +G + E RA+YN + N
Sbjct: 87 ------RTPFK-----VTVYPQAIAMAATWDKTSMHVMGDYTAEEGRAVYNESIKNDKHD 135
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLT+W+PNIN+ RDPRWGR ET GEDP++ G +V+GLQ D R
Sbjct: 136 IYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGEMGSAFVKGLQ---------GDDPR 186
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK + C KHYA + + + R F++ +++ D+ +T++ F V + V+ VMC+Y
Sbjct: 187 YLKAAGCAKHYAVH---SGPEDLRHKFNTDISDYDLWDTYLPAFRKLVVDAKVTGVMCAY 243
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARVL 301
N G P C L+N + W F GY+ SDC I +H+ D E A A +
Sbjct: 244 NAFKGQPCCGSDLLMNSILHDKWKFTGYVTSDCGGIDDFYRENTHQTQPDA-ESAAADAV 302
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGK 359
G D++CG+ + AV+ GK++E ID SL+ L+ V +LG FD + +Y +GK
Sbjct: 303 LHGTDVECGNVTYKSLVKAVKDGKLSEKQIDQSLKRLFSVRFKLGMFDPADAVKYNQIGK 362
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ + P H A + A Q IVLLKN+ LPL+ N+K +A++GP+A+ +++GNY GT
Sbjct: 363 DALEAPAHGAQALKMAHQSIVLLKNEGNLLPLSK-NLKKIAVLGPNADNAVSVLGNYNGT 421
Query: 420 PCRYTSPMDGF---------YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
P R + + G Y K ++Y AD + N AA K+ADA + +
Sbjct: 422 PSRIVTALQGIKNKLPAGTEVIYDKAVDY---VADSAARYNYAAMAA--KVKDADAIIYI 476
Query: 471 AGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
G+ +E E G DR +LLPG QTEL+ K A PV V+M+ A+
Sbjct: 477 GGISPELEGEEMPVSKPGFHGGDRSTILLPGVQTELL-KALKATGKPVVFVMMTGSAIAT 535
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
+ N + +I+ Y G+ G AIADV+FG YNP GRLP+T+Y ++ +P
Sbjct: 536 PWEAEN--LPAIVNAWYGGQAAGTAIADVLFGDYNPAGRLPVTFYGSD------KDLPSF 587
Query: 581 PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
+ RTY++F G +Y FGYGLSY++F+Y +P ++
Sbjct: 588 TDYSMDNRTYRYFKGKPLYAFGYGLSYSKFEYAPLDAPLTLK------------------ 629
Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYE 699
T ++V N KMDG EV +Y GI T I+ + G+E
Sbjct: 630 ---------------AGEALTVHVKVTNKSKMDGEEVTELYLSHIGIKQKTAIRALKGFE 674
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
R I AG++ + F +++ L I D N + ASG I VG
Sbjct: 675 RTLIKAGETKDITFKLSSA-DLSITDLNGNLVKASGKIAISVG 716
>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
Length = 1241
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 197/326 (60%), Positives = 236/326 (72%), Gaps = 17/326 (5%)
Query: 111 QTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
Q VSTEARAMYN+G GLT+WSPNINVVRDPRWGR LETPGEDPYVVGRYA+N+VRG+QD
Sbjct: 916 QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 975
Query: 171 V---EGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPF 227
+ E V D ++RPLK SACCKHYAAYDLD+W + RF FD+RV E+DM ETF PF
Sbjct: 976 IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 1035
Query: 228 EMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK 287
EMCV +GDVSSVMCSYNRVNGIP CAD +LL+QTIR DW HGYIVSDCD+++ + ++
Sbjct: 1036 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 1095
Query: 288 FLNDTKEDAVARVLKAGLDLDCG-------------DYYTNFTMGAVQQGKIAEADIDTS 334
+L T +A A LKAGLDLDCG D+ T + M AV +GK+ E+DID +
Sbjct: 1096 WLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEAVNKGKMRESDIDNA 1155
Query: 335 LRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG 394
L Y+ LMRLGYFD QY +LG+ +IC QH LA + ARQGIVLLKNDN LPL+
Sbjct: 1156 LTNQYMTLMRLGYFDDIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDAN 1215
Query: 395 NIKTLALVGPHANA-TKAMIGNYEGT 419
+ + + GPH A K M G+Y GT
Sbjct: 1216 KVGFVNVRGPHVQAPEKIMDGDYTGT 1241
>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 721
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 247/742 (33%), Positives = 375/742 (50%), Gaps = 96/742 (12%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
R +DL+ R+TL EKV +G + VPRL +P Y WW+E LHGV+ G
Sbjct: 40 RVQDLISRLTLAEKVSLLGYRSQAVPRLNIPAYNWWNEGLHGVARAGE------------ 87
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNI 135
AT FP I A+F+++L K++ VSTEARA YNL A GLTFWSPNI
Sbjct: 88 ----ATIFPQAIAMAATFDDNLVKQVANVVSTEARAKYNLSTAMGRHLQYMGLTFWSPNI 143
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDP++ + YV GLQ +D LK SA KH+
Sbjct: 144 NIFRDPRWGRGQETYGEDPFLTSKMGNAYVHGLQ---------GTDPLHLKTSATAKHFV 194
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A+ EG +R +FD+ V E+D+++T++ F+ V +G V S+M +YNRVNG+P +
Sbjct: 195 AH--SGPEG-ERDYFDALVDEKDLRDTYLYAFKSLV-DGGVESIMTAYNRVNGVPNSINK 250
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
L+N + +W F G++V+DC ++ + ++HK L + E A A +KAG+DLDC +
Sbjct: 251 TLVNDIVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRMEVAAA-AIKAGVDLDCSSIFQT 309
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELAAE 373
+ A+ + E +D +L + +LG+FD S + + G ++I N H+ LA +
Sbjct: 310 DIINAINNKLLTEKQVDAALAAVLSTQFKLGFFDAPSSSPFYSFGADSIHNDSHVMLARQ 369
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA- 432
A++ +VLLKND LPL N ++ +VGP+A + A++ +Y G + + ++G A
Sbjct: 370 MAQKSMVLLKNDKQILPLKMQNYSSIMVVGPNAASLDALVASYHGVSSKAVNFVEGITAA 429
Query: 433 --YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE---------G 481
+ Y G AD ++ I A NAD TV V GL +E E G
Sbjct: 430 VDKGTRVEYDLG-ADY---RDTTHFGGIWGAGNADVTVAVIGLTPVLEGEAGDAFLSQTG 485
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
D+ DL LP + + + K P+ V+ S VDI A P +++ YPGE+
Sbjct: 486 GDKKDLSLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDI--AAIAPYADAVILAWYPGEQ 543
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
GG A+AD++FGK +P G LP+T+Y + +P + GRTY++F G V YPF
Sbjct: 544 GGNALADILFGKISPSGHLPLTFYNS------VNDLPAYNNYSMKGRTYRYFAGAVQYPF 597
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
G+GLSYT F Y+ PK+ KD
Sbjct: 598 GFGLSYTTFNYQWQQQPKT-------------------------------SYSAKD-TIQ 625
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
+ V+N G + EVV Y P + +K++ G++R+ + G ++ ++ +
Sbjct: 626 LSVVVKNTGNISADEVVQAYIGYPTLNRMPLKELKGFKRITLNKGSTSLASISIPVTELQ 685
Query: 722 KIVDNAANSLLASGAHTILVGE 743
K + L G +T+ +G
Sbjct: 686 KWNSSKHQFELYPGNYTVYLGS 707
>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
Length = 805
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 271/783 (34%), Positives = 391/783 (49%), Gaps = 99/783 (12%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGD--------------------LAYGVPRLGLP 54
CD P RA LV+ + + EK+ + + ++ G R+GLP
Sbjct: 36 CDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITPHSMSLGAERIGLP 95
Query: 55 LYEWWSEALHGVSFIGRRTNSPPGTHFDS---EVPGATSFPTVILTTASFNESLWKKIGQ 111
Y WW+EALHGV+ + PG F+ E ATSF I A+F+ L ++
Sbjct: 96 AYAWWNEALHGVA-------ASPGVSFNQAGQEFSHATSFANTITLAAAFDNDLVYEVAD 148
Query: 112 TVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE------------------TPGED 153
T+STEARA N AGL +W+PNIN +DPRWGR E TPGED
Sbjct: 149 TISTEARAFSNAELAGLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGED 208
Query: 154 PYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR 213
P + Y + GL+ + + K+ A CKH+AAYDL+ W+G R+ F++
Sbjct: 209 PVHIKGYVQALLEGLEGRDKIR----------KVIATCKHFAAYDLERWQGALRYRFNAV 258
Query: 214 VTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF---HG 270
VT QD+ E ++ PF+ C + V S MCSYN +NG P CA L++ +R WN+ +
Sbjct: 259 VTSQDLSEYYLQPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKHWNWTEHNN 318
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC---GDYYTNFTMGAVQQGKIA 327
YI SDC++IQ + + + T A A AG D C G +GA Q ++
Sbjct: 319 YITSDCNAIQDFLPNFHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLTDVIGAYNQSLLS 378
Query: 328 EADIDTSLRFLYIVLMRLGYFD-GSPQ-YKNLGKNNICNPQHIELAAEAARQGIVLLKND 385
E ID +LR LY L+R GY D SP Y + + + P+ LA ++A GIVLLKN
Sbjct: 379 EEIIDRALRRLYEGLIRAGYLDSASPHPYTKISWSQVNTPKAQALALQSATDGIVLLKN- 437
Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCAD 445
NG LPL+ N KT+AL+G ANAT+ M+G Y G P Y +P+ + ++APG +
Sbjct: 438 NGLLPLDLTN-KTIALIGHWANATRQMLGGYSGIPPYYANPIYAATQLNVTFHHAPGPVN 496
Query: 446 IV--CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA 503
N++ A+ AA +D + + G DLS+ AE +DR + P Q L+ +A
Sbjct: 497 QSSPSTNDTWTSPALSAASKSDIILYLGGTDLSIAAEDRDRDSIAWPSAQLSLLTSLAQM 556
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K T+V VD +NP I SILWVGYPG+ GG A+ ++I G +P RLP+T
Sbjct: 557 GKP--TIVARLGDQVDDTPLLSNPNISSILWVGYPGQSGGTALLNIITGVSSPAARLPVT 614
Query: 564 WYEANYVK-IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
Y Y IP T+M LRP + PGRTY+++ PV+ PFG+GL YT F K +S+
Sbjct: 615 VYPETYTSLIPLTAMSLRPTSARPGRTYRWYPSPVL-PFGHGLHYTTFTAKFGVF-ESLT 672
Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
I + + + +N C +D + + V N G++ V +V+
Sbjct: 673 INIAE----------LVSN---CNERYLDLCRFPQ----VSVWVSNTGELKSDYVALVFV 715
Query: 683 KPP-GIAGTHIKQVIGYERVF-IAAGQS--AKVGFTMNACKSLKIVDNAANSLLASGAHT 738
+ G IK ++GY+R+ I G + A VG + L VD N +L G +
Sbjct: 716 RGEYGPEPYPIKTLVGYKRIRDIEPGTTGAAPVGVVVG---DLARVDLGGNRVLFPGKYE 772
Query: 739 ILV 741
L+
Sbjct: 773 FLL 775
>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 249/746 (33%), Positives = 383/746 (51%), Gaps = 104/746 (13%)
Query: 25 AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE 84
AK+LV +MTL EK +Q+ + + L +P Y WW+E LHGV+ G
Sbjct: 16 AKELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT------------- 62
Query: 85 VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNIN 136
AT FP I A F++ K+ ++TE RA YN + GLT+WSPNIN
Sbjct: 63 ---ATVFPQAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNIN 119
Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
+ RDPRWGR ET GEDPY+ R + +++GLQ + + LK++AC KH+A
Sbjct: 120 IFRDPRWGRGHETYGEDPYLTSRLGVAFIKGLQ----------GEGKYLKLAACAKHFAV 169
Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
+ EG R F++ V ++D+ ET++ FE CV E +V SVM +YNR NG P C
Sbjct: 170 HS--GPEGL-RHEFNAVVNKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKT 226
Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
LL +RG W F G++VSDC ++ H + T ++VA ++ G DL+CG+ Y N
Sbjct: 227 LLKDILRGKWGFKGHVVSDCWALADF-HLHHMVTSTATESVALAIENGCDLNCGNMYLNL 285
Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
+ A ++G + E I T+ L +LG FD +Y + + +H E+A A+R
Sbjct: 286 LL-AYKEGLVTEEQITTAAERLMTTRFKLGMFDEECEYNKIPYEVNDSREHNEVALIASR 344
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY---AY 433
+ +VLLKN NG LPL+ N+K++A++GP+AN+ + GNY GT +YT+ ++G +
Sbjct: 345 KSMVLLKN-NGTLPLDKSNLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHDAVGN 403
Query: 434 SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE------- 480
+ Y+ GC + + + + + AI A+ +D V+ GLD ++E E
Sbjct: 404 DVRVYYSEGCHLFKDKVEDLARPDDRLSEAISVAERSDVVVLCLGLDSTIEGEQGDAGNS 463
Query: 481 --GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
D+ +L LPG Q L+ KV + K PV +V+ + A+ +N A+ K +IL YP
Sbjct: 464 YGAGDKENLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSALTLNGAEE--KCAAILNAWYP 520
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFPGRTYKFFDGPV 597
G GG A+AD++FGK +P G+LP+T+Y+ + K+P +T ++ GRTY++
Sbjct: 521 GSHGGTAVADILFGKCSPSGKLPVTFYK-DTAKLPDFTDYSMK------GRTYRYLGHES 573
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
+YPFGYGL+Y+ + P VK
Sbjct: 574 LYPFGYGLTYSTVELSNLQVP---------------------------------SVKQGF 600
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMN 716
F IE++N G+ D EVV Y K + + G++RV + G+S V +N
Sbjct: 601 GSFDISIEIKNTGEYDIEEVVQCYVKDIESKYAVLNHSLAGFKRVSLKKGESKIVTIKLN 660
Query: 717 ACKSLKIVDNAANSLLASGAHTILVG 742
KS ++V++ LL S + VG
Sbjct: 661 K-KSFEVVNDDGERLLDSKKFKLFVG 685
>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 718
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 260/766 (33%), Positives = 400/766 (52%), Gaps = 106/766 (13%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S+K + D + + + ERA+ +V+++T+ EK+ Q+ + A V RL +P Y+WW+E L
Sbjct: 8 SLKAQ-EDCSFRNPDISLDERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWWNECL 66
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ GR AT FP I A+++ +L ++G +STEARA YN+
Sbjct: 67 HGVARAGR----------------ATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNV 110
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
+ GLTFW+PN+N+ RDPRWGR ET GEDP++ R +++V+GLQ
Sbjct: 111 FSKHGYRGQYKGLTFWTPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQG----- 165
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
+ + LK++A KHYA + N R FD++V+ +D+ ET++ FE V E
Sbjct: 166 ----NHPKYLKVAALAKHYAVH---NGPEALRHEFDAKVSMKDLWETYLPAFEALVKEAG 218
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
V VM +YNR NG P CA P L+ + +R W F GY VSDC +I HK + DT E+
Sbjct: 219 VEGVMGAYNRTNGDPCCAHPYLMQEVLREKWGFDGYYVSDCGAIMDFYTGHKIV-DTPEE 277
Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQ 353
A A L AG +L+CGD Y + + ++++G E +ID S++ L+ +RLG F +G+
Sbjct: 278 AAAMALNAGCNLNCGDTYASL-LKSLEKGLTTEEEIDRSVKQLFKTRLRLGLFAPEGAVP 336
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
Y + + I + +H +LA EAAR+ +VLLKN+ LP+ ++K + + GP A +A++
Sbjct: 337 YDTISTDVIRSKEHQKLALEAARKSVVLLKNEANTLPV-ARDVKKVYVTGPTATHVQALL 395
Query: 414 GNYEGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
NY G T+ ++G + Y G N+M + AA +AD TV
Sbjct: 396 ANYYGVSEDMTTILEGIVGKVSPQTSVQYRQGALLYEANRNTMDWFS-GAAASADVTVAC 454
Query: 471 AGLDLSVEAEG---------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
G+ +E E DR LP Q + + ++ +AK + +VI S A+ +
Sbjct: 455 LGISQLIEGEEGEAIASEHRGDRERTRLPQNQIDFLKRIRASAK-KLVVVITSGSAISL- 512
Query: 522 FAKNNPKI----KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
P+I ++L+V YPGE+GG+A+ADV+FG P GRLP+T ++ PY +
Sbjct: 513 -----PEIYDMADALLYVWYPGEQGGKAVADVLFGDAVPSGRLPVTVVKSVDDLPPYENY 567
Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
++ GRTY++ + +PFG+GLSYT F Y N T
Sbjct: 568 DMK------GRTYRYMEVSPQFPFGFGLSYTDFTYS---------------------NLT 600
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQ-VI 696
+ +NK VK + ++ N G+ D EVV Y + KQ +I
Sbjct: 601 LESNK----------VKSGE-SVRLSFDLTNEGEYDADEVVQFYITDVEASVNVPKQSLI 649
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
G++RV +AAG+S K+ FT+ +KIVDN +L SG I +G
Sbjct: 650 GFKRVGLAAGESTKIEFTVTP-DMMKIVDNNGEKILESGEFKIYIG 694
>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
Length = 736
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 260/772 (33%), Positives = 383/772 (49%), Gaps = 108/772 (13%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHG 65
+ ++ + P+ +A LP R KDLV R+TL EKV M + +PRLG+P Y+WW+EALHG
Sbjct: 18 QAQVENLPFRNADLPLEVRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHG 77
Query: 66 VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--- 122
V+ RT + T FP I A+F+ +K+G STE RA++N
Sbjct: 78 VA----RT-----------LEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDW 122
Query: 123 ------LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY 176
GLT+W+PNIN+ RDPRWGR ET GEDPY+ + VRGL+
Sbjct: 123 KAGKTGTRYRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEG------ 176
Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
D LK AC KHYA + + +R FD+R + D+ +T++ F V + V
Sbjct: 177 ---EDPHYLKSVACAKHYAVHSGPEY---NRHSFDARPSVFDLWDTYMPAFRELVTKAKV 230
Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
VMC+YNR+NG P C + LL +R W+F GY+ SDC +++ E HK + A
Sbjct: 231 HGVMCAYNRLNGQPCCGNDPLLVDILRNQWHFDGYVTSDCWALKDFAEFHK-THPEHTIA 289
Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--Y 354
++ L AG DL+CG+ Y G V++G +E DI+ SL L+ +L ++G FD + + Y
Sbjct: 290 MSDALLAGTDLECGNLYHLLAEG-VKKGLHSERDINVSLSRLFTILFKIGMFDPAERVPY 348
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
++G+ + H + A A++ IVLL+N N LPL+ IK++AL+GP+A+ + +
Sbjct: 349 SSIGREVLECEAHKQHAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLA 408
Query: 415 NYEGTPCRYTSPMDGFYAY--SKV-INYAPGCA--DIVCQNNSMIPAAIDAAKNADATVI 469
NY GTP +P K+ INY PG D + S + A AA+ +D V
Sbjct: 409 NYFGTPSEIVTPYMSLKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQ-SDVIVF 467
Query: 470 VAGLDLSVE-------------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
V+G+ E DR + LP Q EL+ K+ + P+ +V MS
Sbjct: 468 VSGISADYEGEAGDAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMSGS 526
Query: 517 AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS 576
+ + N ++L Y G+ G AI DV+FG NP GR+P+T Y+++
Sbjct: 527 VMSFEWESQNA--DALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSD------ND 578
Query: 577 MPLRPVNNFP--GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDI 634
+P P N+ GRTY++F G YPFGYGLSYT F Y D QC D
Sbjct: 579 LP--PFENYSMLGRTYRYFKGEPRYPFGYGLSYTTFAY--------------SDVQCVDE 622
Query: 635 NYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHI 692
+T T + + V N G DG EVV +Y P G +
Sbjct: 623 THTGDTAR-------------------VTVTVSNTGDCDGDEVVQLYVVHPQDGRKQIPL 663
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
+ G++R+ + G+S V FT+ + L + + N + +G T+ VG G
Sbjct: 664 CALKGFKRIHLKRGESTSVSFTLTP-EELALTETDGNLVEKNGQVTLFVGGG 714
>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
Length = 461
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 210/468 (44%), Positives = 296/468 (63%), Gaps = 21/468 (4%)
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QY 354
A +KAGLDLDCG + T AV++G +++ +I+ +L V MRLG FDG P QY
Sbjct: 1 ADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQY 60
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
NLG ++C P H +LA EAARQGIVLL+N +LPL+T +T+A++GP+++ T MIG
Sbjct: 61 GNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIG 120
Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
NY G C YT+P+ G Y++ I+ A GC D+ C N + AA AA+ ADATV+V GLD
Sbjct: 121 NYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVMGLD 179
Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
S+EAE DR LLLPG Q EL+++VA A++GP LV+MS G +D+ FAKN+P+I +I+W
Sbjct: 180 QSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIW 239
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGRTYK 591
VGYPG+ GG AIA+V+FG NPGG+LP+TWY NYV +P T M +R P +PGRTY+
Sbjct: 240 VGYPGQAGGTAIANVLFGTANPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYR 299
Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD---INYTVGTNKPPCAAV 648
F+ GPVV+PFG GLSYT F + +A P V + L + + ++ TV + P C A+
Sbjct: 300 FYIGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKTVRVSHPDCNAL 359
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
DV ++V+N G MDG+ ++V++ PP KQ++G+ ++ IA G
Sbjct: 360 SPLDV---------HVDVKNTGSMDGTHTLLVFTSPPDGKWASSKQLMGFHKIHIATGSE 410
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
+V ++ CK L +VD + G H + +G+ VS LQ NL
Sbjct: 411 KRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVS--LQTNL 456
>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
Length = 729
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 262/759 (34%), Positives = 385/759 (50%), Gaps = 103/759 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+ D L + ER LV+ MTL EK+ Q+ + V RL +P Y WW+EALHGV+ G+
Sbjct: 26 WLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVARNGK-- 83
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL----GN---- 125
+T FP I A+F+ L K++ +S EARA +N+ GN
Sbjct: 84 --------------STVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQY 129
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
AGLTFW+PN+N+ RDPRWGR ET GEDPY+ + + +V+GLQ + + L
Sbjct: 130 AGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQG---------NHPKYL 180
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K +AC KH+A + + R HF++ +++D+ ET++ FE V + +V VM +YN
Sbjct: 181 KSAACAKHFAVH---SGPEELRHHFNANPSKKDLYETYLPAFEALVKQANVEGVMSAYNA 237
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
V G+P + LL +T+R W F GYIVSDC ++ I + HK + T +A A LKAG+
Sbjct: 238 VYGVPAGSSEFLLKETLRKSWGFDGYIVSDCGALGDIFKGHKQVK-TMPEAAAVALKAGV 296
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
+L+CG Y AVQQG ++E IDT L+ L +LG+FD Y + + I
Sbjct: 297 NLNCGYVYNGALEKAVQQGLVSEELIDTRLKQLLKTRFKLGFFDPKEANPYNAIPTSVIH 356
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+ HI LA + A++ IVLLKN N LPL+ NIK + GP A+++ ++ NY G
Sbjct: 357 SDDHIALARKTAQKSIVLLKNKNHTLPLDK-NIKVPYVTGPFASSSDVLLANYYGMTTNL 415
Query: 424 TSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPA--AIDAAKNADATVIVAGLDLSVE 478
S ++G + +NY G N ++ P A + AK ADA + V GL E
Sbjct: 416 VSVLEGIADKVSLGTSLNYRMGALPF---NKNLNPKNWAPNVAKTADAVIAVVGLSADFE 472
Query: 479 AE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
E D+ DL LP Q + + ++A KGP+ LV+ S AV + +
Sbjct: 473 GEEVDAIASPNKGDKKDLKLPQNQIDYVKEMAAKKKGPLILVVASGSAVALGELYDLADA 532
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP--G 587
++W YPGE+GG A+ADV+FG +P G LP+T+ P + L P ++ G
Sbjct: 533 IVLMW--YPGEQGGNAVADVLFGDVSPSGHLPVTF--------PKSVAQLPPFEDYSMQG 582
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
RTYK+ + ++PFG+GLSYT FK+ +V I +K
Sbjct: 583 RTYKYMEEEPLFPFGFGLSYTDFKF------SNVQISEEK-------------------- 616
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK-QVIGYERVFIAAG 706
+K KD FT V N GK+DG EVV +Y P K Q++ ++R+ I
Sbjct: 617 -----IKKKD-SFTVSCSVANNGKVDGEEVVQLYLVPLNSNKDLPKYQLLKFKRIEIQKN 670
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
S V F + A K L V+ G + ++V +
Sbjct: 671 TSKTVSFNLEA-KDLFQVNKEGKKTWIKGKYKLVVANAL 708
>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
Length = 705
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 247/749 (32%), Positives = 380/749 (50%), Gaps = 103/749 (13%)
Query: 21 YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
Y ++A++LV +MTL EK Q+ + + RLG+P Y WW+EALHGV+ G
Sbjct: 7 YKKKAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAGT--------- 57
Query: 81 FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWS 132
AT FP I A F++ KI ++ EARA YN + GLT WS
Sbjct: 58 -------ATVFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWS 110
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN+ RDPRWGR ET GEDP++ G+ + +++GLQ D + +AC K
Sbjct: 111 PNINIFRDPRWGRGHETYGEDPFLSGKLGVAFIKGLQ----------GDKDVMMTAACVK 160
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+AAY + + R F++ VT++D+ ET++ FE CV + V +VM YNR NG P C
Sbjct: 161 HFAAY---SGPEDLRHGFNAEVTKKDLWETYLPAFETCVKDAKVEAVMGGYNRTNGEPCC 217
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
LL +R W F G++VSDC +I+ H + T E++VA + AG DL+CG+
Sbjct: 218 GSYTLLRDILREKWGFEGHVVSDCWAIKDFHTDH-MVTKTPEESVALAIDAGCDLNCGNM 276
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
Y + A+Q+G I E I + ++ +LG F+GS ++ N+ + +H E+A
Sbjct: 277 YLMLLI-ALQEGLITEEHITRAAVRIFTTRFKLGLFEGS-EFDNIPYEVVECSEHKEMAI 334
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF-- 430
EAAR+ VLLKND G LP+N G IKT+ ++GP+AN+ A+ GNY GT RY + ++G
Sbjct: 335 EAARKSAVLLKND-GILPINKGAIKTIGVIGPNANSRIALKGNYHGTSSRYITLLEGIQD 393
Query: 431 -YAYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK- 482
+ Y+ GC +++ N + A+ A+++D V+ GLD ++E E
Sbjct: 394 EVGDEVRVLYSNGCELVKDRTEVLAYANDRLAEAVTVAEHSDLVVLCLGLDETIEGEQSD 453
Query: 483 --------DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
D+ DL LP Q L+ K+ K P L +M+ A+++++A + IL
Sbjct: 454 EGNNGGSGDKKDLDLPEVQKSLLEKIVATGK-PTVLCLMAGSAINLSYAHEH--CNGILL 510
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
YPG GG+A+AD++FG +P G+LP+T+Y + P T ++ RTY++ +
Sbjct: 511 TWYPGARGGKAVADILFGNASPSGKLPVTFYRSLDNLPPITDYSMK------NRTYRYIE 564
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+YPFGYGL+Y + K +V+I+ +DI TV
Sbjct: 565 EAPLYPFGYGLTYGDVELKHVEIKGTVEIE-------KDIYITV---------------- 601
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
++N G + EVV Y K + + + RV + A + +V
Sbjct: 602 ----------TLQNRGSVAVEEVVQAYIKDEQSMYAVTNTSLCAFMRVGLGANEEKQVSM 651
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
+ SLK+V+ +L S T+ G
Sbjct: 652 RI-PFDSLKVVNLDGEKVLDSKKFTLFAG 679
>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 643
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 241/628 (38%), Positives = 345/628 (54%), Gaps = 68/628 (10%)
Query: 21 YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
+ +RA+ LV +MTL EKV QM A + RLG+P Y WW+E LHGV G
Sbjct: 4 FAQRARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSGT--------- 54
Query: 81 FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGNAG----LTFWS 132
AT FP I ASF+ESL + + Q +S EARA YN G G LTFWS
Sbjct: 55 -------ATVFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWS 107
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN+ RDPRWGR ET GEDP + GR ++RGLQ+ E DS+ K+ A K
Sbjct: 108 PNINLFRDPRWGRGHETYGEDPLLTGRMGTAFIRGLQEGE--------DSQYRKLDATVK 159
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+AA+ R F++ V+ +DM ++++ F C+ ++VM +YNR+NG P C
Sbjct: 160 HFAAHSGPE---AGRHSFNAEVSAEDMADSYLWAFRYCIEHAKPAAVMGAYNRINGEPAC 216
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
A L + +W F GY+VSDC +IQ I E+H + KE A A + G L+CG
Sbjct: 217 ASSTYLKGVLYEEWKFDGYVVSDCGAIQDINENHHVTKNEKESA-ALAVNNGCQLNCGKA 275
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
Y ++ AV+ G I+E + ++ L+ RLG FD Y ++ N I +H EL
Sbjct: 276 Y-HWVKAAVEDGLISEDTVTCAVERLFEARFRLGMFDSDCVYDSIPMNVIECRKHRELNR 334
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
+ A++ IVLLKN NG LPLN KT+A++GP+A+ ++GNY GTP +T+ + G
Sbjct: 335 KMAQESIVLLKN-NGILPLNPE--KTIAVIGPNADDKTVLLGNYNGTPSHWTTLLRGIQD 391
Query: 433 YSK-VINYAPGCADIVCQNNSM------IPAAIDAAKNADATVIVAGLDLSVE------- 478
++ + YA G ++ + ++ + AI AK AD V+ GL +E
Sbjct: 392 QARGEVYYARG--SVLVEKEALPWAEKPLHEAIYTAKAADVVVLCLGLSPLLEGEEGDAY 449
Query: 479 --AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
A+ DR D+ LP Q +L+ + D K PV LV +S G VD+ A + + +IL
Sbjct: 450 NGADSGDRKDISLPDIQQQLLCAILDTEK-PVVLVNVSGGCVDLRQA--DERCAAILQCF 506
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
YPG EGG A+AD++FG+ +P GRLP+T+Y P+T ++ GRTY+FFDG
Sbjct: 507 YPGAEGGNALADILFGRVSPSGRLPVTFYRTVEDLPPFTDYSMK------GRTYRFFDGK 560
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIK 624
+YPFG+GL+Y K + + P +V +K
Sbjct: 561 PLYPFGHGLTYADIKEQW-TDPYTVRVK 587
>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
Length = 711
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 251/757 (33%), Positives = 387/757 (51%), Gaps = 107/757 (14%)
Query: 21 YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
+ AK+LV +M L EK Q+ A + RLG+P Y WW+EALHGV+ G
Sbjct: 4 FKNEAKELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGV--------- 54
Query: 81 FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWS 132
AT FP I A F+E +I ++ E RA YN + G+TFW+
Sbjct: 55 -------ATVFPQAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWA 107
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN+ RDPRWGR ET GEDPY+ R + +++GLQ E +Y LK +AC K
Sbjct: 108 PNINIFRDPRWGRGHETYGEDPYLTARLGVAFIKGLQGDENEDY--------LKAAACAK 159
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+A + + DR HFD+ V+++D+ ET++ FE V E +V VM +YNRVNG P C
Sbjct: 160 HFAVH---SGPEEDRHHFDAIVSKKDLYETYLPAFEAAVKEANVIGVMGAYNRVNGEPAC 216
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
LL ++ DW F GYIVSDC +I+ H + T ++ A + G +L+CG+
Sbjct: 217 GSKTLLVDILKKDWGFDGYIVSDCWAIRDFHTEH-MVTHTAAESAALAINNGCELNCGNT 275
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELA 371
Y + + A Q+G + E I + L + M+LG FD + +Y + N C H E+A
Sbjct: 276 YLHM-LEAHQEGLVKEEIITEAAEKLMRIRMQLGLFDKNCKYNEIPYAVNDCKV-HREVA 333
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
EA+R+ +V+LKND G LPLN +K++ ++GP AN + GNY GT RYT+ ++G
Sbjct: 334 LEASRRSMVMLKND-GILPLNKDKLKSIGIIGPTANNRTVLEGNYNGTASRYTTFVEGIQ 392
Query: 432 AY---SKVINYAPGC-------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE- 480
Y + Y+ GC +++ +N+ A I A+ +D V+ GLD ++E E
Sbjct: 393 DYVGDDVRVYYSEGCHLFANGMSNLAWENDREAEALI-VAEQSDVVVLCLGLDSTIEGEQ 451
Query: 481 --------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
G D++ L L G Q +L+ KV K PV LV+ + A+ IN+A + +I
Sbjct: 452 GDTGNAFAGGDKLSLNLIGRQQQLLEKVVAVGK-PVILVLSTGSAMAINYA--DEHCNAI 508
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
YPG +GG+A+A ++FG+Y+P G+LP+T+Y+ +P + RTY++
Sbjct: 509 FQTWYPGAQGGKALAQLLFGEYSPSGKLPVTFYKTT------EELPAFEDYSMKDRTYRY 562
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
+YPFGYGLSY K +SV + LD + N++ G
Sbjct: 563 MPNEALYPFGYGLSYADIKV------QSVKV-LDGAKGEEITNFSAGQT----------- 604
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-------PGIAGTHIKQVIGYERVFIAA 705
K+ ++E+EN +D +VV +Y K P + + ++ VF+ A
Sbjct: 605 ------KYKVKVELENKSNVDSYDVVQIYIKDMESQYAVPNFS------LCSFKSVFLKA 652
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
G+S +V + K+ +++ ++ S + +G
Sbjct: 653 GESKEVTLNVGE-KAFTVINEEGKRIVDSKKFKLFIG 688
>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 918
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 259/742 (34%), Positives = 390/742 (52%), Gaps = 48/742 (6%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD L +RA LV +T+ EK+ + + A G+PRL +P YEWWSE LHGV+
Sbjct: 170 CDESLSDKQRAAALVAELTIWEKLDNLVNEAPGIPRLRVPPYEWWSEGLHGVA------- 222
Query: 75 SPPGTHFDSE--VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PGT F S+ ATSFP IL ++F++ L + +G+ VS EARA N G +GL +S
Sbjct: 223 RSPGTKFTSKGNFSYATSFPQPILLGSAFDDELVRAVGEVVSREARAFSNAGRSGLDLYS 282
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN +DPRWGR ETPGED + + +Y + GL+ D K+ A CK
Sbjct: 283 PNINAFKDPRWGRGQETPGEDTFHLQKYVSAMLSGLE----------GDDPDKKLIATCK 332
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
HYAA D +N++G DR F++ ++ QD+ E ++ PF+ C E +V S MCSYN +NG P C
Sbjct: 333 HYAANDFENYKGVDRSGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPLC 392
Query: 253 ADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
A+ L+ +R W ++G Y+ +DCD + +V H + D A A ++AG DL+C
Sbjct: 393 ANSYLIEDILRKHWGWNGDGQYVSTDCDCVALMVSYHHYAPDLGH-AAAWSMQAGTDLEC 451
Query: 310 GDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQ 366
+ + + A Q I+E D+D +L +Y L+ +G FD + ++LG + + +
Sbjct: 452 NAFPGSEALQSAWNQSLISEKDVDKALTRMYTSLVSVGLFDLDRKDPLRSLGWDEVNTKE 511
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP 426
+LA AA +G VL+KND G LPL+ + K AL+GP +AT M GNY G SP
Sbjct: 512 AQDLAYRAAVEGAVLMKND-GILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLISP 570
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
Y G +++S AI AA+ AD + + G+D ++E E DR
Sbjct: 571 RKAAKDLGLDFTYFLGSR--TNKSDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNT 628
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
L P Q +L+ +++ K P+ ++ G VD N + +ILW GYPG+ GG+AI
Sbjct: 629 LAWPEPQLQLLRALSEVGK-PLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAI 687
Query: 547 ADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGY 603
D++FG+ P GRL +T Y A+Y +P T M LR P N+ GRTY+++ G P+G+
Sbjct: 688 LDIVFGRAAPAGRLSVTQYPASYNDAVPATDMNLRPGPGNSGLGRTYRWYTGETPVPYGF 747
Query: 604 GLSYTQFK--YKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
GL YT+F K AS+ ++DI Q + N + P L + T
Sbjct: 748 GLHYTKFSVDMKPASNVHNIDIA----QMAAEANDDAASEIPSWQRGL------ERRMVT 797
Query: 662 FQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACK 719
+ +N G + V +V+ + G K ++GY R+ I G+ K + +
Sbjct: 798 VTVSAKNEGNVISDYVALVFLRSEAGPKPWPQKTLVGYTRLRNIKPGEERKEEIIIKM-E 856
Query: 720 SLKIVDNAANSLLASGAHTILV 741
L VD N +L G +++ +
Sbjct: 857 QLVRVDEVGNRVLYEGLYSLFL 878
>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 849
Score = 389 bits (1000), Expect = e-105, Method: Compositional matrix adjust.
Identities = 268/777 (34%), Positives = 390/777 (50%), Gaps = 71/777 (9%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD +RA ++ M + EK+ + D++YG RLGLP YEWWSEALHGV+
Sbjct: 43 CDTNATPYQRASAIINAMNITEKLANLLDVSYGSARLGLPPYEWWSEALHGVA------- 95
Query: 75 SPPGTHFDSE--VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PG +F S ATSFP I +++F++ + I +STEARA N GL +++
Sbjct: 96 GSPGVNFTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNAARGGLDYFT 155
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN +DPRWGR ETPGEDP + Y N + GL+ + Y S S K+ A CK
Sbjct: 156 PNINPFKDPRWGRGSETPGEDPLRIQGYVKNLLIGLEGTDD-GYFNTSHSGYKKMIATCK 214
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+A YDL++W+G R+ +D+ +T QD+ E ++ PF+ C + +V+S+MCSYN VN +P C
Sbjct: 215 HFAGYDLEDWDGYIRYGYDAEITTQDLAEYYLPPFQTCARDQNVASIMCSYNSVNSVPAC 274
Query: 253 ADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
A+ L +R W + + YI SDC++I I +H + + A L G+D C
Sbjct: 275 ANSYLQETILREHWGWTIDNNYITSDCNAISDIYYNHNY-SVNNAAAAGLSLSNGMDTAC 333
Query: 310 GDYYTNFTM---GAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICN 364
T G+ G + EA I T+L Y L+ GYFD S Y+++G +++
Sbjct: 334 IVANTGVMTDVNGSYYGGYVTEATITTALIRQYEALVIAGYFDPASSNPYRSIGWSSVNT 393
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
P LA +AA +G LLKN G LP + +A++G AN T M G Y G
Sbjct: 394 PAAQTLARQAATEGTTLLKN-TGLLPYKFTSQTKVAMIGMWANGTSQMQGGYSGPAPYLH 452
Query: 425 SPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
SP+ YA S++ NYA G + ++ A AA+NAD + G+D SVEAE
Sbjct: 453 SPL---YAASQLGLSYNYANGPINQTTLTSNYSQNATAAAQNADVILFFGGIDWSVEAEA 509
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR + PG Q LI ++ AA G +V+ +D +N I +++WVGYPG++
Sbjct: 510 MDRYQIAWPGAQQALIAQL--AALGKPMIVLQMGSMLDATPILSNNNISALVWVGYPGQD 567
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
GG A D++ G P GRLP+T Y A+YV ++P T+M LRP PGRTYK+++ V+ P
Sbjct: 568 GGVAAFDILTGAVAPAGRLPVTMYPADYVNQVPMTNMSLRPGPGNPGRTYKWYNNAVL-P 626
Query: 601 FGYGLSYTQFK--------------YKVASSPKSVDIK--------------LDKDQQCR 632
F YGL YT FK ++P S ++ + Q
Sbjct: 627 FAYGLHYTTFKATFNGGPPGPGSPWSPPWNAPWSAKVRRGWGWGNWGPPNWGWTQPSQVA 686
Query: 633 DIN------YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PP 685
N Y + + C A D + I V+N G+ V +V+S
Sbjct: 687 PGNGGLSSSYNIQSLLSSCTAAHPDLCAFP----SVAISVQNAGQTTSDFVALVFSNTTA 742
Query: 686 GIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
G A K + Y R+ +AAGQ+ M L D+ N +L G + +L+
Sbjct: 743 GPAPYPYKSLASYTRLHSVAAGQTVTASLNMT-LGVLARRDDQGNQILYPGTYNLLL 798
>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
Length = 735
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 251/755 (33%), Positives = 390/755 (51%), Gaps = 99/755 (13%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FP+ + L + +R DLV R+TL EK+ QM + A + RLG+P Y+WW+E LHGV GR
Sbjct: 27 FPFQNPDLSFEKRVDDLVSRLTLEEKISQMLNKAPAIERLGIPAYDWWNECLHGV---GR 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
T FP I A+++++L++++ +++ E RA+Y+ +
Sbjct: 84 TPYK------------VTVFPQAIGMAATWDDALFQQVASSIADEGRAIYHDAISKGVHE 131
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLT+W+PNIN+ RDPRWGR ET GEDPY+ G +V GLQ D +
Sbjct: 132 IYHGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQ---------GDDPK 182
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK SAC KHYA + + R F++ V+ D+ +T++ F V + VSSVMC+Y
Sbjct: 183 YLKASACAKHYAVH---SGPEISRHFFNTEVSMYDLWDTYLPAFRDLVVDAKVSSVMCAY 239
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N + G P C + L+ +R W F GY+ SDC +I ++ HK D + VL
Sbjct: 240 NALAGQPCCGNDLLMQDILRKQWKFTGYVTSDCGAIDDFLK-HKTHADAAHASADAVLH- 297
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNN 361
G DL+CG + AV+QG I EA ID S++ L++ RLG FD + +Y + +
Sbjct: 298 GTDLECGQNIYVKLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFDPADRVKYADTPLSV 357
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ +H LA + +R+ +VLLKNDN LPL N+K +A++GP+A+ + ++GNY G P
Sbjct: 358 LECDEHKALALKMSRESVVLLKNDN-VLPLRK-NLKKIAVIGPNADDSTVVLGNYNGFPS 415
Query: 422 RYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
+ +P++ + ++VI Y + + + A I+ K D + V G+ +
Sbjct: 416 KVITPLEAIRSKVGKRTQVI-YDRAIDCVKPSDEKTLNALIERLKGVDQVIFVGGISPRL 474
Query: 478 EAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
E E G DR + LP QTEL+ K+ +A PV V+M+ A+ I + N
Sbjct: 475 EGEELPISVDGFRGGDRTTIALPEVQTELMKKMKEAGL-PVIFVMMTGSALGIEWESQN- 532
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
I +IL Y G+ G+AIADV+FG YNP G+LP+T+Y ++ P+ + +
Sbjct: 533 -IPAILNAWYGGQFAGQAIADVLFGDYNPSGKLPVTFYRSDSDLPPFGAFSM------AN 585
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
RTY++F G +YPFG+GLSYT F Y V P+ V + G P
Sbjct: 586 RTYRYFKGEALYPFGFGLSYTMFDYSV---PQVV---------------SGGKVGEPIKV 627
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQ 707
++V+N+GK +G EVV +Y G+ I + G++RV++ AG+
Sbjct: 628 ---------------SVKVKNIGKKNGDEVVQLYLSHEGVEKAPITALKGFKRVYLKAGE 672
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ F ++ + + + D+ + G TI G
Sbjct: 673 EKTLSFEISP-RDMSLPDDNGIITVFPGKKTIYAG 706
>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
Length = 709
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 251/727 (34%), Positives = 382/727 (52%), Gaps = 108/727 (14%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RAK+LV +MT+ EK Q+ A + RLG+P Y WW+EALHGV+ G
Sbjct: 8 KRAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAGT----------- 56
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A+F+E L ++G+ ++ EARA YN + GLTFW+PN
Sbjct: 57 -----ATMFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPN 111
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDP++ R A+ +V+ +Q D +K +AC KH+
Sbjct: 112 VNIFRDPRWGRGHETYGEDPFLTSRLAVPFVKAMQ----------GDGEYMKAAACAKHF 161
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + E R FD++ +++D++ET++ FE V E +V +VM +YNR NG P CA+
Sbjct: 162 AVHSGPEGE---RHFFDAKASKKDLEETYLPAFEALVKEAEVEAVMGAYNRTNGEPCCAN 218
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
L+ T+RG W F G+ VSDC +I+ E+HK + + E++ L+ G DL+CG Y
Sbjct: 219 KPLMVDTLRGKWGFQGHFVSDCWAIKDFHENHK-VTSSPEESAKLALEMGCDLNCGCTYQ 277
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
+ M V+ G I E I S L+ LG FD + ++ + + +H+ +A A
Sbjct: 278 SI-MNGVRAGLIDEKLITESCERLFTTRFLLGMFDKT-EFDEIPYEKVECKEHLAVAKRA 335
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
AR+ +VLLKND G LPLN +IKT+ +VGP+AN+ ++IGNY GT RY + ++G
Sbjct: 336 ARESVVLLKND-GLLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSSRYITVLEGI--QD 392
Query: 435 KV-----INYAPGCADIVCQNN----------SMIPAAIDAAKNADATVIVAGLDLSVEA 479
KV + Y+ GC + QNN + A A ++D V+V GLD ++E
Sbjct: 393 KVGDDVRVLYSEGCD--IFQNNISNLADPNLPDRLSEAQAVADHSDVVVVVVGLDENLEG 450
Query: 480 E---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
E D+++L LP Q +L+N V D K P ++ M+ A+D++ A++ +
Sbjct: 451 EEGDAGNQFASGDKINLNLPLSQRQLLNAVLDCGK-PTIVIDMAGSAIDLSKAQD--EAN 507
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
++L YPG GG +AD++FG +P G+LP+T+Y++ +P + RTY
Sbjct: 508 AVLQAFYPGARGGADVADILFGDVSPSGKLPVTFYKSA------DDLPDFKDYSMKNRTY 561
Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
K+F G +YPFGYGL+Y K D ++ V
Sbjct: 562 KYFTGTPLYPFGYGLTYGDCYVKP------------------DYDFNVK---------YA 594
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSA 709
D K + T + V N GK+D EVV +Y K T ++G++RV + AG
Sbjct: 595 DADKVSGAEIT--VTVVNDGKLDTDEVVQLYIKDMDSYFATTNPSLVGFKRVHVPAGGET 652
Query: 710 KVGFTMN 716
+V T++
Sbjct: 653 RVTLTVS 659
>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
Length = 697
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 245/749 (32%), Positives = 378/749 (50%), Gaps = 111/749 (14%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
++A+ LV RMTL EK Q+ A + RLG+P Y WW+E LHGV+ G+
Sbjct: 8 KKAEALVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAGQ----------- 56
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A+F+ ++ V+TE RA YN + GLTFWSPN
Sbjct: 57 -----ATVFPQAIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPN 111
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ +++V+ LQ + +K +AC KH+
Sbjct: 112 VNIFRDPRWGRGHETYGEDPYLTKELGVSFVKALQ----------GNGDTMKAAACAKHF 161
Query: 195 AAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
A + G + R FD+ + +DM+ET++ FE V E V +VM +YNR NG P C
Sbjct: 162 AVHS-----GPEALRHEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAYNRTNGEPCC 216
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
P L + +RG+W F G+ VSDC +I+ E H + DT ++ A + G DL+CG+
Sbjct: 217 GSP-TLQKKLRGEWKFQGHFVSDCWAIRDFHEHH-MVTDTAVESAALAINNGCDLNCGNT 274
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
Y + M A ++G + E I + L+ LG FDGS +Y NL + +P+H++ A
Sbjct: 275 YLHI-MKAYEKGLVTEETITRAAVRLFTTRYLLGLFDGS-EYDNLSYMEVESPRHLDAAE 332
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
+AA + VLLKN NG LPL+ +KT+ ++GP+A++ +A+IGNY GT RY + +G
Sbjct: 333 KAAEKSFVLLKN-NGILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASRYITIQEGIQD 391
Query: 433 Y---SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
Y I + GC + + I A A+N+D ++ GLD ++E E
Sbjct: 392 YVGDDVRILTSRGCDLFRDRTEHLAFTRDRIAEAKVVAENSDVVILCMGLDETLEGEEGD 451
Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
D+ D+ LPG Q EL+ +AD K PV +++ +D+ +A +LW
Sbjct: 452 TGNSYVSGDKEDIELPGVQRELMEAIADTGK-PVVFCLLAGSDLDLKYAAEKFDAVMMLW 510
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
YPG +GG+A A V+FG+ +P G+LP+T+YE+ +T ++ GRTY++ +
Sbjct: 511 --YPGCQGGKAAAKVLFGEISPSGKLPVTFYESLEELPDFTDYSMK------GRTYRYME 562
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+PFGYGL+Y++ + +DK + VK
Sbjct: 563 RKAQFPFGYGLTYSK-------------VAVDKAE-----------------------VK 586
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGF 713
K ++EV+N G D +VV +Y K ++ G++R+F+ AG+ K+
Sbjct: 587 TCGQKINVEVEVQNNGAYDTEDVVQIYVKNIDSKNAIPNPMLAGFQRIFLKAGECRKIEI 646
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
+ K+ +VD + I G
Sbjct: 647 PIWE-KAFTVVDETGKRMEEGKKFEIYAG 674
>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 734
Score = 387 bits (995), Expect = e-104, Method: Compositional matrix adjust.
Identities = 257/785 (32%), Positives = 376/785 (47%), Gaps = 120/785 (15%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S P+ + KL + R DLV R+TL EKV+QM + A +PRLG+P Y+WWSE LHGV+
Sbjct: 24 SQLPFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPAYDWWSEVLHGVART 83
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
T T +P I A+++ + + E RA++N
Sbjct: 84 PYHT---------------TVYPQAIAMAATWDTVALYTMADQSAREGRAIHNKATEEGK 128
Query: 126 -----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
GLT+W+PNIN+ RDPRWGR ET GEDP++ +VRGLQ
Sbjct: 129 NGDRYVGLTYWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQ---------GE 179
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
D + LK +AC KHYA + + R FD V++ D+ T++ F+ V V+ VM
Sbjct: 180 DPKYLKAAACAKHYA---IHSGPEAVRHSFDVDVSDYDLWNTYLPAFKELVTHAKVAGVM 236
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
C+YN P C L+ +R W F GY+ SDC +I HK + E A
Sbjct: 237 CAYNAFRKKPCCGSDLLMTDILRRQWGFTGYVTSDCGAIDDFFNYHK-THPNAEAAAIDA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLG 358
+ G D++CG+ AV+ G+IAE +ID S++ L+++ MRLG FD Y
Sbjct: 296 VTNGTDVECGNRAYLTLTDAVKTGRIAEKEIDRSVKRLFMIRMRLGMFDPVSMVSYAQTS 355
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ + H A + A++ IVLLKN+N LPL+ +IK +A+VGP+A+ + A++GNY G
Sbjct: 356 PAVLESAPHKAQALKMAQESIVLLKNENHLLPLSK-SIKKIAVVGPNADNSIAVLGNYNG 414
Query: 419 TPCRYTSPMDGFYA---------YSKVINYAPGCADIVCQNNSMIP-------AAIDAAK 462
TP + + +DG A Y K +N+ N+M+P A K
Sbjct: 415 TPSKIVTALDGIKAKLGTNGSVVYEKAVNF----------TNAMLPEGKTDFAALTSRVK 464
Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
+ADA + V G+ +E E DR +LLP QTE + K A PV V+
Sbjct: 465 DADAIIFVGGISPQLEGEEMKVNEPGFNSGDRTTILLPTVQTEAM-KALKATGKPVVFVM 523
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
M+ A+ I + + N I +I+ Y G+ G AIADV+FG YNP GRLP+T+Y+++
Sbjct: 524 MTGSALAIPWEQEN--IPAIVNAWYGGQAAGTAIADVLFGDYNPSGRLPVTFYKSD---- 577
Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
+P RTY++F G +YPFGYGLSYT F+Y+ P +V K+
Sbjct: 578 --ADLPAFDDYRMENRTYRYFSGQALYPFGYGLSYTTFRYEGLKVPTTVKNKV------- 628
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI-AGTH 691
+ I++ N G G EVV +Y G
Sbjct: 629 --------------------------RIPVSIQLTNTGAKGGEEVVQLYISYQGQPIKKP 662
Query: 692 IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
+K + G++RV++ GQ+ + F + +L I L G I VG G V+ P
Sbjct: 663 LKALKGFQRVWLNRGQTKTIKFLLTP-DALAIAGENGKLLNPKGKLRISVGGGQPDVNTP 721
Query: 752 LQLNL 756
N+
Sbjct: 722 ATSNV 726
>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
Length = 709
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 244/746 (32%), Positives = 378/746 (50%), Gaps = 104/746 (13%)
Query: 25 AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE 84
AK+LV +MTL E+ +Q+ + + L +P Y WW+E LHGV+ G
Sbjct: 16 AKELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT------------- 62
Query: 85 VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNIN 136
AT FP I A F+E +I +STE RA YN + GLT+WSPN+N
Sbjct: 63 ---ATVFPQAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVN 119
Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
+ RDPRWGR ET GEDPY+ R + +++GLQ + + LK++AC KH+A
Sbjct: 120 IFRDPRWGRGHETYGEDPYLTSRLGVAFIKGLQ----------GEGKYLKLAACAKHFAV 169
Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
+ EG R F++ V ++D+ ET++ FE CV E +V SVM +YNR NG P C
Sbjct: 170 HS--GPEGL-RHEFNAVVEKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKT 226
Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
LL +RG W F G++VSDC ++ H + T ++VA ++ G DL+CG+ Y N
Sbjct: 227 LLKDILRGKWGFKGHVVSDCWALADF-HLHHMITSTATESVALAIENGCDLNCGNMYLNL 285
Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG-KNNICNPQHIELAAEAA 375
+ A ++G + E I T+ L +LG FD +Y + + N C +H E+A A+
Sbjct: 286 LL-AYKEGLVTEEQITTAAERLMTTRFKLGMFDEDCEYNRIPYEVNDCK-EHNEIALIAS 343
Query: 376 RQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY---A 432
R+ +VLLKND G LPL+ ++K++A++GP+AN+ + GNY GT +YT+ ++G +
Sbjct: 344 RKSMVLLKND-GTLPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHNAVG 402
Query: 433 YSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE------ 480
+ + Y+ GC + + + + AI A+ +D ++ GLD ++E E
Sbjct: 403 DNIRVYYSEGCHLFKDKVEDLAGPDDRLSEAISVAERSDVVILCLGLDSTIEGEQGDAGN 462
Query: 481 ---GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
D+ L LPG Q L+ KV + K PV +V+ + A+ N A+ K +IL Y
Sbjct: 463 SYGAGDKESLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSALTFNGAEE--KCAAILNAWY 519
Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV 597
PG GG A+AD++FGK +P G+LP+T+Y+ +T ++ GRTY++ +
Sbjct: 520 PGSHGGTAVADILFGKCSPSGKLPVTFYKDTANLPEFTDYSMK------GRTYRYLEHES 573
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
+YPFGYGL+Y++ + P VK
Sbjct: 574 LYPFGYGLTYSKVELSNLQVPF---------------------------------VKADF 600
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMN 716
F I++ N G EVV Y K + + G++RV + G+S V ++
Sbjct: 601 ESFDISIDIRNTGNYGIEEVVQCYVKDLKSKYAVLNHSLAGFKRVSLKKGESKTVTIELS 660
Query: 717 ACKSLKIVDNAANSLLASGAHTILVG 742
+S + V+N LL S + + VG
Sbjct: 661 K-RSFEAVNNDGERLLDSKSFKLFVG 685
>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
Length = 905
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 260/764 (34%), Positives = 384/764 (50%), Gaps = 112/764 (14%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+ D+ P RA DL+ RM+L EKV Q+ + A G+PRLGLP Y++W+EA HG++ G
Sbjct: 205 WRDSSKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEAAHGIANNGI-- 262
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN------LGNA- 126
AT FP I A++N +L + G + E RA +N G++
Sbjct: 263 --------------ATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRHNGDSK 308
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLT+W+PNIN+ RDPRWGR ET GEDP++ I +V+G+Q D R
Sbjct: 309 WWTGLTYWAPNINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQG---------DDPR 359
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
+ AC KHYA + R F++ + E+D+ +T++ FE V EG V+ VM +Y
Sbjct: 360 YMLAMACAKHYAVHSGPE---RTRHSFNAEIPERDLFDTYLPHFERVVREGKVAGVMSAY 416
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVL 301
N VNG+P A+ LL + +R W F GY+ SDCD+I+ I + H ++ T E+A A +
Sbjct: 417 NAVNGVPASANSFLLTELLRKRWGFEGYVPSDCDAIRDIYGEKQHHYVK-TAEEAAALAV 475
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK----NL 357
KAG +L CG Y N + AVQQG + E D+D +L RLG FD + Q L
Sbjct: 476 KAGCNLCCGGDY-NALVRAVQQGLVTEKDLDGALYHTLWTRFRLGLFDPAEQVPFSGYTL 534
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
N++ P H ++A E ARQ IVLLKND G LPL+ +K +A++GP+A + + GNY
Sbjct: 535 KDNDL--PAHSQVALELARQAIVLLKND-GTLPLDRTKLKQIAVIGPNAASKSMLEGNYH 591
Query: 418 GTPCRYTSPMDGF-----------YAYSKVINYAPGCADIVCQNNSM-------IPAAID 459
G+ R S +D +A + PG A Q+N+ A+
Sbjct: 592 GSASRSISILDDIRNLVGSEIKITHAMGSPVTTKPGTAPWSGQDNTTDRPVAELKAEALK 651
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
A ADA + V G+ + E E DR + LP Q +LI + K PV +V S A+
Sbjct: 652 LAAEADAIIYVGGITPAQEGESFDRESIELPSEQEDLIRALHATGK-PVVMVNCSGSAMA 710
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
+ + N + +I+ YPG+EGGRA+A+V+FG+ NP G LPIT+Y + +P
Sbjct: 711 LTWQDEN--LPAIVQAWYPGQEGGRAVAEVLFGETNPSGHLPITFYRST------ADLPD 762
Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
+ RTY++F G +Y FG+GLSY+ F+Y N V
Sbjct: 763 FSDYSMKNRTYRYFTGRPLYAFGHGLSYSTFEYA---------------------NLRVA 801
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIGY 698
P A + T +++ N GK DG +VV +Y+ PP + ++ + G+
Sbjct: 802 ----PAA----------NGALTVTLDLTNSGKRDGDDVVQLYATPPASSQPQELRALCGF 847
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
R + AG++ V T+ A + + + SG TI G
Sbjct: 848 RRTHVKAGETRTVTVTVPAVALRRWDIAKKDYAIPSGDWTIAAG 891
>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 723
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 257/754 (34%), Positives = 380/754 (50%), Gaps = 103/754 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D P R +DL+ ++TL EKV QM D++ VPRL LP Y WW+EALHGV+ G
Sbjct: 24 YLDPFNPTDVRVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSGV-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-------- 125
AT FP I A+F++ L K+ +S EARAMYN
Sbjct: 82 --------------ATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKY 127
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFW+PNIN+ RDPRWGR ET GEDP++ + + +++GLQ D L
Sbjct: 128 GGLTFWTPNINIFRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQ---------GDDPEHL 178
Query: 186 KISACCKHYAAYDLDNWEGNDRFH--FDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K++AC KH+A + G +R F++ + +D++ET++ F+ VN V +VMC+Y
Sbjct: 179 KVAACAKHFAVHS-----GPERLRHSFNAIASPKDLRETYLPAFKALVN-ARVEAVMCAY 232
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NR N C LL+Q +R +W+F G++VSDC +I HK + E AVA +K
Sbjct: 233 NRTNSEVCCGSNLLLDQILRDEWHFTGHVVSDCGAIVDFYMGHKVVPGQPE-AVALAVKH 291
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKN 360
G+DL+CGD Y + AV++G I E +ID +L L +LG FD SP Y N+ +
Sbjct: 292 GVDLNCGDEYPAL-IEAVKRGLITEKEIDKALATLLKTRFKLGLFDPKQNSP-YNNIPVS 349
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + H LA E A + IVLLKN+ LPL N+ + GP+A + A++GNY G
Sbjct: 350 VINSTDHRALAKEVALKSIVLLKNEK-CLPLKN-NLSKYYITGPNAASVDALMGNYYGVN 407
Query: 421 CRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
++ ++G + Y PG + NN+ I AK +D T +V G+ +
Sbjct: 408 PHMSTILEGIAGAIQPGSQMQYKPGIL-LDRDNNNPIDWTTGDAKASDVTFVVMGITGLL 466
Query: 478 EAEG---------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
E E DR+D LP Q + + K+ K V +I G +N ++ +
Sbjct: 467 EGEEGEAIASPNYGDRLDYNLPKNQIDFLRKIRKGNKNKVVAII--TGGSPMNLSEVHEL 524
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR 588
++L YPGEEGG A+AD++FGK +P GRLP+T+ ++ PY ++ GR
Sbjct: 525 ADAVLLAWYPGEEGGNAVADILFGKVSPSGRLPVTFPKSFAQLPPYEDYSMK------GR 578
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
TY++ +Y FGYGLSY+ + Y + L + Q +++ T
Sbjct: 579 TYRYMTAEPMYTFGYGLSYSTYTYS--------SLTLSEKQIKKNMTIIAET-------- 622
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
V N GKM+G EVV +Y P + G++RV + AG+S
Sbjct: 623 ----------------MVTNTGKMEGEEVVQLYITVPQTEKNPQYSLKGFKRVNLKAGES 666
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
KV F + +K VD + +L SG++ + +G
Sbjct: 667 RKVQFQITP-DLMKSVDANGSEVLLSGSYVVRIG 699
>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
Length = 702
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 256/754 (33%), Positives = 373/754 (49%), Gaps = 108/754 (14%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
R KDLV R+TL EKV M + +PRLG+P Y+WW+EALHGV+ RT
Sbjct: 2 RVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVA----RT---------- 47
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN---------LGNAGLTFWSPN 134
+ T FP I A+F+ +K+G STE RA++N GLT+W+PN
Sbjct: 48 -LEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPN 106
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN+ RDPRWGR ET GEDPY+ + VRGL+ D LK AC KHY
Sbjct: 107 INIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEG---------EDPHYLKSVACAKHY 157
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + + +R FD+R + D+ +T++ F V + V VMC+YNR+NG P C +
Sbjct: 158 AVHSGPEY---NRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGN 214
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
LL +R W+F GY+ SDC +++ E HK + A++ L AG DL+CG+ Y
Sbjct: 215 DPLLVDILRNQWHFDGYVTSDCWALKDFAEFHK-THPEHTIAMSDALLAGTDLECGNLYH 273
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAA 372
G V++G +E DI+ SL L+ +L ++G FD + + Y ++G+ + H + A
Sbjct: 274 LLAEG-VKKGLHSERDINVSLSRLFTILFKIGMFDPAERVPYSSIGREVLECEAHKQHAE 332
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
A++ IVLL+N N LPL+ IK++AL+GP+A+ + + NY GTP +P
Sbjct: 333 RMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKR 392
Query: 433 Y--SKV-INYAPGCA--DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE--------- 478
K+ INY PG D + S + A AA+ +D V V+G+ E
Sbjct: 393 RLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQ-SDVIVFVSGISADYEGEAGDAGAA 451
Query: 479 ----AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
DR + LP Q EL+ K+ + P+ +V MS + + N ++L
Sbjct: 452 GYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMSGSVMSFEWESQNA--DALLQ 508
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP--GRTYKF 592
Y G+ G AI DV+FG NP GR+P+T Y+++ +P P N+ GRTY++
Sbjct: 509 AWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSD------NDLP--PFENYSMLGRTYRY 560
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
F G YPFGYGLSYT F Y D QC D +T T +
Sbjct: 561 FKGEPRYPFGYGLSYTTFAY--------------SDVQCVDETHTGDTAR---------- 596
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGYERVFIAAGQSAK 710
+ V N G DG EVV +Y P G + + G++R+ + G+S
Sbjct: 597 ---------VTVTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALKGFKRIHLKRGESTS 647
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
V FT+ + L + + N + +G T+ VG G
Sbjct: 648 VSFTLTP-EELALTETDGNLVEKNGQVTLFVGGG 680
>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
Length = 738
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 253/759 (33%), Positives = 375/759 (49%), Gaps = 101/759 (13%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ D KL +R DLV R+TL EKV QM + + RL +P Y WW+E LHG IGR
Sbjct: 24 YPFRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHG---IGR 80
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
T + T FP I A+++ L K + +S E RA+YN +A
Sbjct: 81 -------TEYK-----VTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYS 128
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLT+W+PN+N+ RDPRWGR ET GEDPY+ G ++V GLQ DS+
Sbjct: 129 IYHGLTYWTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQG---------DDSQ 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK +AC KHYA + + N R F++ VT D+ +T++ F V + V+ VMC+Y
Sbjct: 180 YLKAAACAKHYAVH---SGPENTRHTFNTFVTTFDLWDTYLPAFRDLVVDAKVAGVMCAY 236
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +G P C + L+ + +R W F GY+ SDC +I HK D K A A + +
Sbjct: 237 NAFSGEPCCGNNLLMQEILRDKWGFTGYVTSDCGAIDDFYRHHKTHPDAKY-AAADAVYS 295
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNN 361
G D+DCG+ + AV+ G I E ID SL+ L+ + RLG FD + ++ + +
Sbjct: 296 GTDIDCGNEAYKALVDAVKTGLITEEQIDISLKRLFEIRFRLGMFDPAEDVKFSKIPLSV 355
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ + H +LA + R+ IVLLKN+N LPL + +K +A++GP+A+ +++GNY G P
Sbjct: 356 LESQPHKDLALKITRESIVLLKNENNFLPL-SKKLKKVAVIGPNADNEVSVLGNYNGFPT 414
Query: 422 RYTSPMDGFYAYSK--VINYAPGCADIVCQNNSM--IPAAIDAAKNADATVIVAGLDLSV 477
+ +P K + Y G + NS I A K D + G+ +
Sbjct: 415 QIITPYKAIKNKLKNTEVIYEKGIDFVKPSENSKEEIAALAKRLKGMDVVIFAGGISPEL 474
Query: 478 EAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
E E G DR + LP QTEL+ + A + P V+M+ A+ + N
Sbjct: 475 EGEEMPVKIEGFTGGDRTSIKLPKIQTELMQAL-KAERIPTVFVMMTGSAIAAEWESQN- 532
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
+ +IL Y G++ G AIADV+FG YNP G+LP+T+Y + + +P
Sbjct: 533 -VPAILNAWYGGQDAGTAIADVLFGDYNPSGKLPVTFYTKD------SDLPAFNSYEMKN 585
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
RTY++FDG V+YPFGYGLSYT+F+Y P S+ G N
Sbjct: 586 RTYRYFDGQVLYPFGYGLSYTKFEYSPIQMPASIK---------------AGEN------ 624
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH----IKQVIGYERVFI 703
I V+N GK DG EVV +Y GT+ + + +ER+ +
Sbjct: 625 ------------MEVSITVKNTGKTDGEEVVQLYISHDN-NGTNRQLPLYALKSFERISL 671
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
AG+S V F ++ + + + D + G + +G
Sbjct: 672 KAGESKSVTFKLSP-REMALADEDGVLKMTKGKSKLYIG 709
>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
Length = 733
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 258/764 (33%), Positives = 384/764 (50%), Gaps = 96/764 (12%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+D P+ D +LP ER + L++ MTL EK Q+ + + RLGLP Y++W+EALHGV+
Sbjct: 22 NDHPWFDTQLPTNERIESLIDAMTLKEKASQLVNGNVAIERLGLPEYDFWNEALHGVARN 81
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN 125
GR AT FP I A+F++ L + +S EARA +N +GN
Sbjct: 82 GR----------------ATVFPQAIGMAATFDQDLLLQAATVISDEARAKFNVSSEIGN 125
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
+GLTFW+PNIN+ RDPRWGR ET GEDPY+ + V GLQ
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQG---------DH 176
Query: 182 SRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
+ LK +A KH+A + G + R FD+ +E+DM ET+ FE V E DV +V
Sbjct: 177 PKYLKTAAAAKHFAVH-----SGPEALRHEFDAIASEKDMYETYFPAFEALVTEADVETV 231
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
M +YNRVNG P LLN +R W F G+IVSDC + E HK + E A A
Sbjct: 232 MAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHIVSDCWGLADFHEYHKVTANAVESA-AL 290
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNL 357
+ G DL+CG YT AV+ G + E IDT L + +LG+FD Y ++
Sbjct: 291 AINTGTDLNCGSVYTALP-DAVEAGLVDEKTIDTRLHKVLATKFKLGFFDPKDDNPYNSI 349
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ + + H ++A E A + IVLL+N+N LPL+ NI+ + + GP A++++ ++GNY
Sbjct: 350 SADVVNSDAHADVAYEMAVKSIVLLQNENQVLPLDK-NIRNVYVTGPFASSSEVLLGNYY 408
Query: 418 GTPCRYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
G + T+ +DG A V INY G N + +A + D + V GL
Sbjct: 409 GLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLS 468
Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
+ E E DR+ L LP Q E + K+ PV +V+++AG +N +
Sbjct: 469 GAYEGEEGEAIASPHKGDRLSLDLPEHQIEFLRKLRKDNDKPV-IVVLTAG-TPVNVTEI 526
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+I++ YPG+EGG+A+AD++FG+ +P GRLPIT+ ++ PY ++
Sbjct: 527 AQLADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITFPKSEAQLPPYDDYSMQ----- 581
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
GRTY++ +YPFG+GLSY K+ N T+G
Sbjct: 582 -GRTYRYMTEEPMYPFGFGLSYATVKFD---------------------NITLGN----- 614
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFI 703
A + + + V N G + EVV +Y K P AG I+ + G++R+ +
Sbjct: 615 -AEALSSTDGQKGTLDVSVNVTNTGTRELEEVVQLYLKTPN-AGIDQPIQSLKGFQRIKL 672
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
A GQ+ +V FT++ K L ++ +L G + ++VG G
Sbjct: 673 APGQTGQVSFTVSK-KQLYSINAKGKPVLLEGDYHVIVGNASPG 715
>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
Length = 689
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 252/753 (33%), Positives = 396/753 (52%), Gaps = 114/753 (15%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D +L ERA L + ++ E+ QQ+ A + + GLP Y WW+E LHGV+ G
Sbjct: 4 YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+F++ + ++G+ VSTEARAMYN
Sbjct: 62 --------------ATVFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIY 107
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+PNIN+ RDPRWGR ET GEDPY+ R +N+V+G+Q E + L
Sbjct: 108 KGLTLWAPNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE----------KYL 157
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ +AC KH+A + + + R FD+RV+E+D++ET++ F+ V EG V VM +YNR
Sbjct: 158 RAAACAKHFAVH---SGPESLRHEFDARVSEKDLEETYLPAFKALVKEGRVEGVMGAYNR 214
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG P+CA KL+ + +R +W F GY VSDC +I+ +HK + DT + A LKAG
Sbjct: 215 VNGEPSCASEKLMGK-LR-EWGFDGYFVSDCGAIRDFHTNHK-ITDTAPQSAAMALKAGC 271
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
D++CG+ Y + + A+++G I + DI T+ +RLG D + ++ +L + I
Sbjct: 272 DVNCGNTYLHI-LAALEEGLITKQDIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACD 329
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ L+ EAA + +VLL ND G LPL+ I ++A++GP+A++ A++GNYEGTP R +
Sbjct: 330 GNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYEGTPDRSVT 388
Query: 426 PMDGFY-AYSKVINYAPGCADIVCQNNSM-IPA-----AIDAAKNADATVIVAGLDLSVE 478
++G A+ + YA GC + + +P A+ A + AD TV+ GLD ++E
Sbjct: 389 FLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDSTLE 448
Query: 479 AE-------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
E D+ DL LP Q L+ K+ D K P+ +V+ + +V+ N +
Sbjct: 449 GEEGDTENKSGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVNTECEGN-----A 502
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYK 591
++ YPG+ GG+A+A+++FG+ +P G+LP+T+Y++ + +T ++ RTY+
Sbjct: 503 LINAWYPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLPDFTDYSMK------NRTYR 556
Query: 592 FFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
F D V+YPFGYGL+Y+ F +C DI+Y
Sbjct: 557 FCDDESNVLYPFGYGLTYSHF-------------------ECGDISY------------- 584
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSA 709
KD T + V N G +V+ VY + H + +ERV + G+S
Sbjct: 585 ------KDN--TLAVNVTNTGSRSAEDVLQVYIRSENGVKNH--SLCAFERVSLFDGESR 634
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ + + + VD+ + SG +T+ G
Sbjct: 635 TISINIPE-GAFETVDDNGVRAVRSGRYTLYAG 666
>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 839
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 275/802 (34%), Positives = 389/802 (48%), Gaps = 108/802 (13%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD PERA LV+++T+ EK+ + D A G R+GLP Y WWSE LHGV+
Sbjct: 37 CDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSEGLHGVA------- 89
Query: 75 SPPGTHFDSE---VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PG F++ ATSF I ASF++ L ++G +STEARA N G GL +W
Sbjct: 90 GSPGVTFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFGFGGLDYW 149
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
+PN+N +DPRWGR ETPGEDP + Y + GL+ E V K+ A C
Sbjct: 150 TPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGNETVR----------KVIATC 199
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV----- 246
KHYAAYDL+ W G R+ F++ VT QD+ E ++ PF+ C + V S+MCSYN +
Sbjct: 200 KHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDM 259
Query: 247 -------------NGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLN 290
P CA+ L+ +R WN+ + YI SDC++I + + +
Sbjct: 260 AGGSKPDEIINLTTAQPACANTYLMT-ILRDHWNWTEHNNYITSDCNAILDFLPDNHNFS 318
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFT--MGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
T +A A KAG D C + T +GA Q + EA IDT+LR LY L+R GY
Sbjct: 319 QTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGYL 378
Query: 349 D--------------GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG 394
D SP Y L N++ P ELA +A +GIVLLKN LPL+
Sbjct: 379 DHGRSAVAGGDGGSFSSPAYDALNWNDVNTPSTQELALRSATEGIVLLKNSGSLLPLDFS 438
Query: 395 NIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMI 454
K +AL+G ANAT M G Y G P Y +P+ + ++YA G ++
Sbjct: 439 G-KKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANGPVVNASDPDTWT 497
Query: 455 PAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
A+ AA+ AD + G D +V +E DR + P Q +L++++A K PV +VI
Sbjct: 498 APALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPEAQMKLLSELAGLGK-PV-VVIQL 555
Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIP 573
VD + NN + SILWVGYPG+ GG A+ DV+ GK P GRLP+T Y YV ++P
Sbjct: 556 GDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVP 615
Query: 574 YTSMPLRPVNN-------------------------------FPGRTYKFFDGPVVYPFG 602
T M LRP N+ PGRTYK++ PV+ PFG
Sbjct: 616 LTEMALRPFNHSSSNLEEEVSVQGGASLTIQARSTPGNKTLSSPGRTYKWYSTPVL-PFG 674
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK-CKDYKFT 661
YGL YT F ++ L ++++ + PC A +D
Sbjct: 675 YGLHYTTF-----------NVSLSLSSNASSPSFSIPSLLTPCTATHLDLCPFSPSANSA 723
Query: 662 FQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACK 719
+ + N G V +++ S G +K ++ Y+RV I G++ V +
Sbjct: 724 LSVSITNTGTHTSDYVALLFLSGEFGPEPYPLKTLVSYKRVKDIKPGETVTVKDVPVSLG 783
Query: 720 SLKIVDNAANSLLASGAHTILV 741
++ VD N++L G + +V
Sbjct: 784 AISRVDGDGNTVLYPGTYRFVV 805
>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 714
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 252/760 (33%), Positives = 378/760 (49%), Gaps = 109/760 (14%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D ER +DLV +MTL EKV Q+ A V RLG+P Y WW+EALHGV+ G
Sbjct: 5 YLDESRTDEERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG--- 61
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL----GNAGL- 128
AT FP I A F+E+L +KIG + E RA Y+ G+ GL
Sbjct: 62 -------------AATVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLY 108
Query: 129 ---TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
TFWSPNIN+ RDPRWGR ET GEDP + GR Y++G+Q + + L
Sbjct: 109 KGITFWSPNINIFRDPRWGRGHETYGEDPCLTGRMGTAYIKGMQ----------GNGKRL 158
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K +AC KH+AA+ + R F+S V+++D+ ET+ FE CV E V VM YNR
Sbjct: 159 KAAACVKHFAAH---SGPEKGRHSFNSVVSKKDLTETYFPAFERCVKEAGVEGVMGGYNR 215
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
+NG C L+ + +R W F GY VSDC +I+ H L DT +++ A LK+G
Sbjct: 216 LNGEAACGSHHLITEILREKWGFDGYYVSDCGAIKDF-HMHHGLTDTPQESAALALKSGC 274
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICN 364
DL+CG Y + M A QG ++ DID ++ L + MRLG FD ++ + N C
Sbjct: 275 DLNCGAVYLH-VMSAYNQGLVSAEDIDRAVTHLMMTRMRLGMFDQHTEFDEIPYEINDC- 332
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
+H LA +AA + +VLLKND G LPL+ +KT+A++GP+ ++ + + GNY GT
Sbjct: 333 AEHHGLALKAAEESMVLLKND-GILPLDKTALKTVAVIGPNGDSEEILKGNYNGTATEKY 391
Query: 425 SPMDGFYAY---------SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
+ ++G A S+ + + + + + + A+ A +D + GL+
Sbjct: 392 TILEGIRAVLGKETRIFCSEGSHLYRDNVENLAEADDRLKEAVSMAVRSDVVFLCLGLNG 451
Query: 476 SVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
++E E G D+ DL LP Q L+ V PV L++ + A+ IN+A +
Sbjct: 452 TLEGEEGDANNSYAGADKADLNLPESQMRLLKAVCGTGT-PVILLLAAGSAMAINYAAEH 510
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
+IL + YPG+ GG A A ++ G+ P GRLP+T+Y+ +T ++
Sbjct: 511 --CSAILHIWYPGQMGGLAAARLLTGEAVPSGRLPVTFYQTTEELPEFTDYSMK------ 562
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++ + +YPFGYGLSY F+Y S+ K+ + D Q R
Sbjct: 563 GRTYRYMEREALYPFGYGLSYGDFEY---SNFKAEQTEAGPDGQVR-------------- 605
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK----QVIGYERVF 702
F +++ N K + E+ VY + IA + + + + R+
Sbjct: 606 ---------------FSVKITNRSKAECDEIAEVYVR---IADSELAAPGGSLADFRRIH 647
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ AG+S V FT+ K+ +V+ +L + G
Sbjct: 648 MKAGESVTVPFTL-PVKAFMVVNEEGEYILDGSTAVVTCG 686
>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6578]
gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
thermophila DSM 6578]
Length = 693
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 261/750 (34%), Positives = 384/750 (51%), Gaps = 114/750 (15%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
ER L+ +M++ EK M A G+PRLG+P Y WW+EALHGV+ G
Sbjct: 5 ERMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSGE----------- 53
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-LGNA-------GLTFWSPN 134
AT FP I A+F+ L +++ + +STEARA +N +G GLTFWSPN
Sbjct: 54 -----ATVFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPN 108
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN+ RDPRWGR ET GEDP++ + +++V+GLQ Y+ ++++AC KHY
Sbjct: 109 INIYRDPRWGRGQETYGEDPFLTSKIGVSFVKGLQGDH--PYY-------MRVAACAKHY 159
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + EG R FD+RV+E+D+ ET++ FE V G V +VM +YNRVNG P C
Sbjct: 160 AVH--SGPEGL-RHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGS 215
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
+LL++ +R W F G++VSDC +I HK D E ++A L+AG DL+CG+ Y
Sbjct: 216 KRLLDEILRKRWGFKGHVVSDCWAIADFHLHHKVTKDPIE-SIAMALEAGCDLNCGNTYE 274
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
+ + AV+ G ++E +D S+ L L RLG F Y L ++I H LA EA
Sbjct: 275 HL-LDAVKAGVVSEELVDRSVARLLSTLDRLGLFTDDHPYARLSLSDIDWEAHRALAREA 333
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
A + +VLLKN NG LP + ++ + + GP+A A++GNY G R + ++G Y+
Sbjct: 334 AEKSVVLLKN-NGILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYA 392
Query: 435 K---VINYAPGCADIVCQNNSMIPA--AIDAAKNADATVIVAGLDLSVEAEGKDRV---- 485
+ Y GC Q N + P A A+ AD TV V G D +VE E D +
Sbjct: 393 GPGITVTYKIGCP---LQGNKINPIDWASGVARYADVTVAVMGRDSTVEGEEGDAIFSDN 449
Query: 486 -----DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK----SILWVG 536
DL LP Q E + ++ + K P+ +V++S V +P+++ +I++
Sbjct: 450 YGDLSDLDLPREQIEYLRRIKEIGK-PLVVVLLSGAPV------CSPELEELADAIVYAW 502
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
YPGEEGG AIA V+FG+ +P GRLPIT+ P+T + GRTY++
Sbjct: 503 YPGEEGGNAIARVLFGEISPSGRLPITFPRGVDQLPPFTDYSME------GRTYRYMREE 556
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFG+GLSY F Y+ S S + DK + ++ C
Sbjct: 557 PLYPFGFGLSYATFSYRGLQSSAS---RWDKRETL--------------------ELVC- 592
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSK----PPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
EVEN + EVV +Y + P + +K G+ RV + AG+ +V
Sbjct: 593 --------EVENTSSIPADEVVQLYVRWEDAPFRVPLWSLK---GFTRVSLGAGERKQVR 641
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
F ++ + L +D +L G VG
Sbjct: 642 FVLSP-EELSFIDEEGRKVLPEGRLHFHVG 670
>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 710
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 243/750 (32%), Positives = 383/750 (51%), Gaps = 107/750 (14%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
E+AK+LV +MTL E+ +Q+ A + L + Y WW+E LHGV+ G
Sbjct: 14 EKAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAGT----------- 62
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A F++ L +KI ++TE RA YN + GLTFWSPN
Sbjct: 63 -----ATVFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPN 117
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ R + +V+GLQ D + LKI+AC KH+
Sbjct: 118 VNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQ----------GDEKYLKIAACAKHF 167
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + EG R F++ V+++D+ ET++ FE CV E DV +VM +YNR N P C
Sbjct: 168 AVHS--GPEGL-RHEFNAVVSKKDLYETYLPAFEACVKEADVEAVMGAYNRTNDEPCCGS 224
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
LL +RG W F G++VSDC +I H + T ++ A +K G DL+CG+ Y
Sbjct: 225 SLLLKDILRGKWQFKGHVVSDCWAIADFHLYHG-VTSTATESAALAIKNGCDLNCGNVYL 283
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAE 373
+ A ++G + E DI + L +RLG FD ++ + N C H E++
Sbjct: 284 QMLL-AYKEGLVTEEDITRAAERLMATRIRLGMFDEECEFNKIPYTMNDCKEHH-EVSLM 341
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY-- 431
A+R+ IV+L+N NG LPL+ +K++ ++GP+A++ + GNY GT +Y + ++G +
Sbjct: 342 ASRKSIVMLRN-NGLLPLDKSKLKSIGIIGPNADSELMLKGNYFGTASKYITVLEGIHEA 400
Query: 432 --AYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
+ + I Y+ GC + + + + A+ A+++D ++ GLD S+E E
Sbjct: 401 VDSENIRIFYSEGCHLYKDRVQDLAEPDDRMAEAVTVAEHSDVVILCLGLDSSIEGEQGD 460
Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
D+++L LPG Q EL+ KV K PV +V+ + A+ + + N +IL
Sbjct: 461 AGNSDGAGDKLNLNLPGKQQELLEKVIATGK-PVIVVLGAGSALTLQGQEEN--CAAILN 517
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
YPG GGRAIAD+IFGK +P G+LP+T+Y+ +T ++ RTY++
Sbjct: 518 AWYPGSFGGRAIADLIFGKCSPSGKLPVTFYKTTEELPEFTDYSMK------NRTYRYMK 571
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+YPFG+GL+Y++ + D DI+
Sbjct: 572 NESLYPFGFGLTYSKVQL--------------SDLSVSDIS------------------- 598
Query: 655 CKDYK-FTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVG 712
KD++ I++ N+G D EV+ Y K + ++RV + G+S V
Sbjct: 599 -KDFEGVEVSIKISNVGNFDIEEVLQCYIKDLESKYAVDNHSLSAFKRVALNKGESKVVK 657
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
T+N ++ ++V++ + +L S + VG
Sbjct: 658 MTINK-RAFEVVNDEGDRILDSKKFKLFVG 686
>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
Length = 701
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 234/634 (36%), Positives = 336/634 (52%), Gaps = 70/634 (11%)
Query: 21 YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
+ E+AK LV M+L E Q+ A +PRLGLP Y WW+EALHG + G
Sbjct: 6 FREQAKQLVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSGT--------- 56
Query: 81 FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWS 132
AT FP I A F++ K+I +STE RA YN +A GLT WS
Sbjct: 57 -------ATVFPQAIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWS 109
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PN+N+ RDPRWGR ET GEDPY+ + +++++GLQ D LK +AC K
Sbjct: 110 PNVNIFRDPRWGRGQETYGEDPYLASQLGVSFIQGLQ----------GDGPYLKTAACVK 159
Query: 193 HYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
H+A + G + R F++ V+ +D+ ET++ FE CV EG+V++VM +Y+ VNG P
Sbjct: 160 HFAVHS-----GPEPLRHDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAYSAVNGEP 214
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
C P L+ +R DW F G +SDC +I+ +H + + D+VA L AG DL+CG
Sbjct: 215 CCGSPFLITDILRNDWGFEGMYISDCWAIRDFHLNHA-VTKNQVDSVALALNAGCDLNCG 273
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIEL 370
Y + A QQG I I + + LG F Y N+G +H ++
Sbjct: 274 CEYLSLEK-AYQQGLIDRKTITQACIRVMTTRFALGLFSEDCTYSNIGYEQNDTEEHRKV 332
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A +A+ +VLLKND G LPL++ ++ +A++GP+A++ +A+ GNY GT YT+ ++GF
Sbjct: 333 AFKASCNSLVLLKND-GMLPLDSRSLHAIAIIGPNADSREALWGNYHGTSSTYTTVLEGF 391
Query: 431 ---YAYSKVINYAPGCA------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE- 480
S + Y+ G A + + + N I AI A +D ++ G D +VE E
Sbjct: 392 RKTLGESVKVKYSQGSAIQKEKLERLAEPNDRIAEAIAVATVSDTIILCLGYDETVEGEM 451
Query: 481 --------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
D+ DL LP Q L+ VA K P+ LV++S GA+D + P +K++
Sbjct: 452 HDDGNGGWAGDKQDLRLPPCQRALLKAVASTGK-PIVLVLLSGGAIDPEIER-FPNVKAL 509
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
L YPG+EGG AIA I G NP G LP+T+Y + T +P GRTY++
Sbjct: 510 LQGWYPGQEGGLAIAHTILGLNNPSGHLPVTFYRSE------TVLPDFCDYRMEGRTYRY 563
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
V+YPFG+GLSYT F Y S+ K D L+
Sbjct: 564 VQEKVLYPFGFGLSYTTFSYGNLSTGKQADGNLE 597
>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
siraeum DSM 15702]
Length = 691
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 253/755 (33%), Positives = 396/755 (52%), Gaps = 116/755 (15%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D +L ERA L + ++ E+ QQ+ A + + GLP Y WW+E LHGV+ G
Sbjct: 4 YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+F++ + ++G+ +STEARAMYN
Sbjct: 62 --------------ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIY 107
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+PNIN+ RDPRWGR ET GEDPY+ R +N+V+G+Q E EY L
Sbjct: 108 KGLTLWAPNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE--EY--------L 157
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ +AC KH+A + + + R FD+RV+E+DM+ET++ F+ V EG V VM +YNR
Sbjct: 158 RAAACAKHFAVH---SGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNR 214
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG P+CA KL+ + +R +W F GY VSDC +I+ +HK + DT + A LKAG
Sbjct: 215 VNGEPSCASEKLMGK-LR-EWGFDGYFVSDCWAIRDFHTTHK-ITDTAPQSAAMALKAGC 271
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
D++CG+ Y + + A+++G I + +I T+ +RLG D + ++ +L + I
Sbjct: 272 DVNCGNTYLHI-LAALEEGLITKQNIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACD 329
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ L+ EAA + +VLL ND G LPL+ I ++A++GP+A++ A++GNY GTP R +
Sbjct: 330 GNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVT 388
Query: 426 PMDGFY-AYSKVINYAPGCADIVCQNNSM-IPA-----AIDAAKNADATVIVAGLDLSVE 478
++G A+ + YA GC + + +P A+ A + AD TV+ GLD ++E
Sbjct: 389 FLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDATLE 448
Query: 479 AE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
E D+ DL LP Q L+ K+ D K P+ +V+ + +V+ N
Sbjct: 449 GEEGDTGNEFASGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVNTECEGN---- 503
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
+++ YPG+ GG+A+A+++FG+ +P G+LP+T+Y++ + +T ++ RT
Sbjct: 504 -ALINAWYPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLPDFTDYSMK------NRT 556
Query: 590 YKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
Y+F D V+YPFGYGL+Y+ F +C DI+Y
Sbjct: 557 YRFCDDESNVLYPFGYGLTYSHF-------------------ECGDISY----------- 586
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQ 707
KD T + V N G +V+ VY K H + +ERV + G+
Sbjct: 587 --------KDN--TLAVNVTNTGSRSAEDVLQVYIKSENGVKNH--SLCAFERVSLFDGE 634
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
S + + + DN +++ SG +T+ G
Sbjct: 635 SRTISINIPEGAFETVDDNGVRAVI-SGRYTLYAG 668
>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
Length = 743
Score = 381 bits (978), Expect = e-102, Method: Compositional matrix adjust.
Identities = 251/753 (33%), Positives = 366/753 (48%), Gaps = 95/753 (12%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD +RA L+ TL E V G+ + GVPRLGLP Y+ W+EALHG+
Sbjct: 57 LSKTIVCDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLD- 115
Query: 69 IGRRTNSPPGTHFDSE--VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
+F E +TSFP ILT ++ N +L ++ +ST+ RA N G
Sbjct: 116 ---------RAYFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRY 166
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR-YAINYVRGLQDVEGVEYHRDSDSRPL 185
GL +SPNIN R P WGR ETPGED Y + YA Y+ G+Q GV D + L
Sbjct: 167 GLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG--GV------DPKSL 218
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K+ A KHYA YD++NW+G+ R D +T+QD+ E + F + + V SVMCSYN
Sbjct: 219 KLVATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNA 278
Query: 246 VNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
VNG+P+CA+ L +R + F GYI SDCDS + H++ + A A ++A
Sbjct: 279 VNGVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSS-AAADSIRA 337
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
G D+DCG Y + AV Q ++ ADI+ + LY LMRLGYFD P
Sbjct: 338 GTDIDCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDVGPWM--------- 388
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
N+ T + GNY G
Sbjct: 389 -------------------------------NVST------------QLQGNYFGPAPYL 405
Query: 424 TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
SP+D F +NYA G +I + A+ AAK +DA + G+D S+EAE D
Sbjct: 406 ISPLDAFRDSHLDVNYAFGT-NISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLD 464
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R+++ PG Q ELI++++ K P+ ++ M G VD + K+N + S++W GYPG+ GG
Sbjct: 465 RMNITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGG 523
Query: 544 RAIADVIFGKYNPGGRLPITWYEANY-VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
+A+ D+I GK P GRL +T Y A Y + P T M LRP N PG+TY ++ G VY FG
Sbjct: 524 QALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNNPGQTYMWYTGTPVYEFG 583
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
+GL YT F+ A + +K+ +D+ +P + ++ + F
Sbjct: 584 HGLFYTTFRVSHARA-----VKIKPTYNIQDL-----LAQPHPGYIHVEQMPF----LNF 629
Query: 663 QIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
+++ N GK M+++ G A K ++G++R+ ++K+ S+
Sbjct: 630 TVDITNTGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTIPVTINSM 689
Query: 722 KIVDNAANSLLASGAHTILVGEGVGGVSFPLQL 754
D N +L G + + + V PL L
Sbjct: 690 ARTDELGNRVLYPGKYELALNNE-RSVVLPLSL 721
>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
Length = 738
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 254/767 (33%), Positives = 373/767 (48%), Gaps = 108/767 (14%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
D+P+ + LP R +D++ R+TL EKVQ M A VPRLG+P Y WW+EALHGV+
Sbjct: 25 DYPFRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALHGVA--- 81
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGNA 126
RT T FP I A+F+ +K+G S+E RA++N G
Sbjct: 82 -RTKEK-----------VTVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKT 129
Query: 127 G-----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
G LT+W+PNIN+ RDPRWGR ET GEDPY+ + V GL+ ++
Sbjct: 130 GEIYRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGLEG---------NN 180
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
LK AC KHYA + + ++R +D+RV+ D+ +T++ F V + V VMC
Sbjct: 181 PEYLKSVACAKHYAVH---SGPEHNRHSYDARVSMYDLWDTYLPAFRELVTKAKVHGVMC 237
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-FLNDTKEDAVARV 300
+YNR G P C +LL +R W F GY+ SDC ++ + HK NDT +AVA
Sbjct: 238 AYNRFEGTPCCGHNELLQDILRNQWKFDGYVTSDCWAVSDFAKYHKTHSNDT--EAVADA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG 358
+ G DL+CG+ Y G V++G I+E DI+ SL L+ + +LG +D + + Y ++G
Sbjct: 296 VLNGTDLECGNLYQKLQQG-VEKGLISEKDINVSLARLFEIQFKLGMYDPADRVPYASIG 354
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ I H + A E A++ +VLLKN+ LPLN IK +AL+GP+ + ++ NY G
Sbjct: 355 REVIECDAHKKHAYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPNMDNGSTLLANYFG 414
Query: 419 TPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPA---AIDAAKNADATVIVAG 472
TP +P + S I+ G + Q P+ AK AD + V G
Sbjct: 415 TPSEIITPYKSLQKRFGNSIQIDTLTGVG--IVQKLEGAPSFAQVAAQAKKADIIIFVGG 472
Query: 473 LDLSVE-------------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
+ E DR + LP QTEL+ ++ + P+ LV MS +
Sbjct: 473 ISADYEGEAGDAGAAGYGGFASGDRTTMKLPPVQTELMKELKKTGR-PLILVNMSGSVMS 531
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
++ N +IL Y G+ G AI DV+FG YNP GR+P+T Y + +P
Sbjct: 532 FDWESRNA--DAILQAWYGGQAAGDAITDVLFGDYNPAGRMPLTTYMND------EDLPD 583
Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
+ RTY++F G V YPFGYGLSYT F Y + +V K + + T
Sbjct: 584 FEDYSMANRTYRYFKGDVRYPFGYGLSYTTFGYAPLQNASTV-----KTGESIQVTTT-- 636
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI--KQVIG 697
V N GK G EVV +Y P T + + + G
Sbjct: 637 --------------------------VTNTGKRAGDEVVQLYISHPQNGNTRVPLRALKG 670
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
++R+ + G+S +V FT++ + L +VD N + G + +G G
Sbjct: 671 FKRIHLDTGESRQVTFTLSP-EELSLVDEKGNQVEKEGTVELYIGGG 716
>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 454
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/446 (42%), Positives = 275/446 (61%), Gaps = 9/446 (2%)
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNN 361
+D++CG Y AV +GK+ E DID +L L+ V +RLG FDG + + LG +
Sbjct: 1 MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+C +H +LA EAARQGIVLLKN+ LPLN + +LA++GP AN ++ G+Y G C
Sbjct: 61 VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
S DG AY K +YA GC+++ C ++ P AI AK AD ++VAG+DLS E E
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
+DR+ LLLPG Q L++ VA A+K PV LV+ G VD++FAK + +I SILW+GYPGE
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVY 599
G +A+AD+IFG+YNPGGRLP+TWY ++ +P M +R P +PGRTY+F+ G VY
Sbjct: 241 GAKALADIIFGEYNPGGRLPMTWYPESFTNVPMNDMNMRANPNRGYPGRTYRFYTGERVY 300
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV-KCKDY 658
FG GLSYT + YK S+P + + R + + ID++ C
Sbjct: 301 GFGEGLSYTNYAYKFLSAPSKLSLSGSLTATSR--KRILHQRGDRLDYIFIDEISSCNSL 358
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
+FT QI V N+G MDGS VVM++S+ P ++ GT KQ++G+ER+ + +S + ++
Sbjct: 359 RFTVQISVMNVGDMDGSHVVMLFSRVPQVSEGTPEKQLVGFERINTVSHKSTETSILLDP 418
Query: 718 CKSLKIVDNAANSLLASGAHTILVGE 743
CK L I + ++ G+H +L+G+
Sbjct: 419 CKHLSIANGQGKRIMPVGSHVLLLGD 444
>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 735
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 257/766 (33%), Positives = 390/766 (50%), Gaps = 103/766 (13%)
Query: 3 ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
+ K+ S+F + D L ER DL+ R+TL EK QQM + + + RLG+P Y+WW+EA
Sbjct: 23 QQTKIDKSEFDFYDTDLSMDERIDDLISRLTLEEKAQQMLNASPAIERLGIPAYDWWNEA 82
Query: 63 LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
LHG+ G AT FP I A+F++ L K+ +S EARA +N
Sbjct: 83 LHGLGRSGV----------------ATVFPQAIGMGATFDDDLILKVSTAISDEARANFN 126
Query: 123 LGNA----------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE 172
NA GLTFW+PN+N+ RDPRWGR ET GEDPY+ + +V+GLQ
Sbjct: 127 --NAVKHGYHRKYGGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSKLGEAFVKGLQG-- 182
Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
D+D + LK +A KHYA + + R F++ V+E+D+ ET++ F+ V
Sbjct: 183 ------DND-KYLKTAAAAKHYAVH---SGPEKLRHEFNADVSEKDLWETYLPAFKTLV- 231
Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDT 292
+ +V ++MC+YN NG P CA+ +L+N +R W F+G++VSDC ++Q V H + ++
Sbjct: 232 DANVETIMCAYNSTNGEPCCANNRLINDILRDKWGFNGHVVSDCWALQDFVSGHDIV-ES 290
Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--G 350
E A A ++ G++L+CGD Y NF AV+ G ++E +D L L +LG FD
Sbjct: 291 PEAAAALAVEVGIELNCGDTY-NFLAKAVEDGLVSEELVDKRLHKLLETRFKLGLFDPEE 349
Query: 351 SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
S Y +G + + +H LA E AR+ IVLLKND G LPL N+ + GP+A +
Sbjct: 350 SNPYNKIGVEVMNSDEHRALARETARKSIVLLKND-GVLPLKN-NLSKYFITGPNATNIE 407
Query: 411 AMIGNYEGTPCRYTSPMDGFYAYSK---VINYAPGCADIVCQNNSMIPAAIDAAKNADAT 467
++GNY G + ++G K + Y G + N A+ +A N+DAT
Sbjct: 408 VLLGNYHGVNPDMVTVLEGIAKAIKPESQLQYRMGTRLNLPNENPQDWASPNAG-NSDAT 466
Query: 468 VIVAGLDLSVEAEG---------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAV 518
+V G+ +E E DR+D LP Q + + KV++AA+ + I++ G+
Sbjct: 467 FVVMGISGLLEGEEGESIASPTFGDRMDYNLPQNQIDYLQKVSEAAEDRPVVAIVTGGS- 525
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
+N + + ++L V YPGEEGG A+AD+IFGK +P GRLPIT+ + +P
Sbjct: 526 PMNLTEVHKLADAVLLVWYPGEEGGNAVADIIFGKNSPSGRLPITF------PMTIEDLP 579
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
GRTYK+ D +YPFGYGLSYT F+Y K K + +
Sbjct: 580 AYEDYTMEGRTYKYMDVVPMYPFGYGLSYTDFEYSEIKLSKDKIKKKESVEA-------- 631
Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK--QVI 696
+I V N G + EVV VY K A + + +++
Sbjct: 632 ------------------------RISVTNTGDFEADEVVQVYLKDVK-ASSRVPNFELV 666
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++ + + G+S ++ F + + L +D+ L GA I +G
Sbjct: 667 AFKNIHLKRGESKELTFEITP-EMLSFIDDNGKEKLEKGAFEIYIG 711
>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 721
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 256/743 (34%), Positives = 379/743 (51%), Gaps = 98/743 (13%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
AK+++ RMT+ EK+ Q+ + + + LG+ Y+WWSE LHGV GR
Sbjct: 32 RHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR----------- 80
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------NAGLTFWSPN 134
AT FP I A+F+E+L ++IG V+TE RA +N+ NAGLTFWSPN
Sbjct: 81 -----ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPN 135
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR +ET GEDP + G YVRGLQ D+ LK AC KHY
Sbjct: 136 VNIFRDPRWGRGMETYGEDPLLSGMLGTAYVRGLQ---------GDDAFYLKTGACAKHY 186
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + EG R D + +D+ ET++ F+M V +G V +VM +YNRV G P
Sbjct: 187 AVH--SGPEGT-RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGS 243
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
LL +R W F+G+IVSDCD+I H+++ T E+A A +KAGL+++CG +
Sbjct: 244 KYLLTDILRKSWGFNGHIVSDCDAINDFYGGHRYVK-TPEEACAAAIKAGLNVECGHTFK 302
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQYKNLGKNNICNPQHIELAA 372
GA+ QG +AEAD+D +L L + ++LG D + Y + ++ IC+P H LA
Sbjct: 303 AM-QGALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALAL 361
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
AA + +VLLKN NG LPL+ NI+TL + GP A+ ++GNY G RY++ + G +
Sbjct: 362 RAADEAMVLLKN-NGILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVS 419
Query: 433 Y---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--------- 480
+N+ P I + N M A++ A A+ ++V G + ++E E
Sbjct: 420 RVSSGTSVNFRPAFMQITEELNDM-NWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASAS 478
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRV + LP Q + +V A KG +V+++ G+ I+ K + +++ YPG+
Sbjct: 479 RGDRVGIGLPASQLNYLRRV-KARKGGRIVVVLTGGS-PIDLRKISKLADAVVMAWYPGQ 536
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
EGG A+ D++FG N GRLPIT+ S+P + GRTYK+ G V+YP
Sbjct: 537 EGGEALGDLLFGDKNFSGRLPITF------PADVDSLPAFDDYSMNGRTYKYMSGNVMYP 590
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYGLSY + Y A VG K K
Sbjct: 591 FGYGLSYGRVTYTDAR--------------------VVGRIK-------------KGEPL 617
Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
++ + N G EV Y + P G+ + ++G+ RV I S K F + +
Sbjct: 618 AVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKIVPER 677
Query: 720 SLKIVDNAANSLLASGAHTILVG 742
+ I + ++ LL G +T+ +G
Sbjct: 678 LMTIQSDGSSKLL-KGNYTLTIG 699
>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6192]
gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
Length = 693
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 263/750 (35%), Positives = 380/750 (50%), Gaps = 114/750 (15%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
ER L+ RM++ EK M A GVPRLG+P Y WW+EALHGV+ G
Sbjct: 5 ERMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSGE----------- 53
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-LGNA-------GLTFWSPN 134
AT FP I A+F+ L +++ +S EARA +N +G GLTFWSPN
Sbjct: 54 -----ATVFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPN 108
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN+ RDPRWGR ET GEDP++ + + +V+GLQ Y+ L+++AC KHY
Sbjct: 109 INIYRDPRWGRGQETYGEDPFLTSKIGVAFVKGLQGDH--PYY-------LRVAACAKHY 159
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + EG R FD+RV+E+D+ ET++ FE V G V +VM +YNRVNG P C
Sbjct: 160 AVH--SGPEGL-RHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGS 215
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
+LL + +R W F G++VSDC +I HK D E ++A L+AG DL+CG+ Y
Sbjct: 216 KRLLEEILRKKWGFKGHVVSDCWAIADFHLHHKVTKDPIE-SIAMALEAGCDLNCGNTYE 274
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
+ + AV+ G ++E +D S+ L L RLG F Y L +I H LA EA
Sbjct: 275 HL-LDAVKAGAVSEELVDRSVARLLSTLDRLGLFTDDHPYVRLSLADIDWEAHRALAREA 333
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
A + +VLLKN NG LPL+ ++ + + GP+A A++GNY G R + ++G Y+
Sbjct: 334 AEKSVVLLKN-NGILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYA 392
Query: 435 K---VINYAPGCADIVCQNNSMIPA--AIDAAKNADATVIVAGLDLSVEAEGKDRV---- 485
+ Y GC Q N + P A A+ AD TV V G D +VE E D +
Sbjct: 393 GPGITVTYKIGCP---LQGNKINPIDWASGVARYADVTVAVMGRDSAVEGEEGDAIFSDN 449
Query: 486 -----DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK----SILWVG 536
DL L Q + + ++ + K P+ +V++S V +P+++ +I++
Sbjct: 450 YGDLSDLNLSREQIDYLRRIKEIGK-PLVVVLLSGAPV------CSPELEELADAIVYAW 502
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
YPGEEGG AIA V+FG+ +P GRLPIT+ + P+T + GRTY++
Sbjct: 503 YPGEEGGNAIARVLFGEVSPSGRLPITFPKGVDQLPPFTDYSME------GRTYRYMKEE 556
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFG+GLSY F Y+ PKS + DK + +V C
Sbjct: 557 PLYPFGFGLSYATFSYR---DPKSSASRWDKRETL--------------------EVVC- 592
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSK----PPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
EVEN + EVV +Y + P + +K G+ RV + G+ +V
Sbjct: 593 --------EVENTSSIPADEVVQLYVRWEDAPFRVPLWSLK---GFTRVSLGTGERIQVR 641
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
F ++ + L +D +L G VG
Sbjct: 642 FVLSP-EDLSFIDEKGRKVLPEGRLRFHVG 670
>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
Length = 691
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 252/755 (33%), Positives = 395/755 (52%), Gaps = 116/755 (15%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D +L ERA L + ++ E+ QQ+ A + + GLP Y WW+E LHGV+ G
Sbjct: 4 YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+F++ + ++G+ +STEARAMYN
Sbjct: 62 --------------ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIY 107
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+PNIN+ RDPRWGR ET GEDPY+ R +++V+G+Q E EY L
Sbjct: 108 KGLTLWAPNINIFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGEE--EY--------L 157
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ +AC KH+A + + + R FD+RV+E+DM+ET++ F+ V EG V VM +YNR
Sbjct: 158 RAAACAKHFAVH---SGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNR 214
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG P+CA KL+ + +R +W F GY VSDC +I+ +HK + DT + A LKAG
Sbjct: 215 VNGEPSCASEKLMGK-LR-EWGFDGYFVSDCWAIRDFHTTHK-ITDTAPQSAAMALKAGC 271
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
D++CG+ Y + + A+++G I + DI T+ +RLG D + ++ +L + I
Sbjct: 272 DVNCGNTYLHI-LAALEEGLITKQDIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACD 329
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+ L+ EAA + +VLL ND G LPL+ I ++A++GP+A++ A++GNY GTP R +
Sbjct: 330 GNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVT 388
Query: 426 PMDGFY-AYSKVINYAPGCADIVCQNNSM-IPA-----AIDAAKNADATVIVAGLDLSVE 478
++G A+ + YA GC + + +P A+ A + AD TVI GLD ++E
Sbjct: 389 FLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVICVGLDATLE 448
Query: 479 AE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
E D+ DL LP Q L+ + D K P+ +V+ + +V+ N
Sbjct: 449 GEEGDTGNEFASGDKPDLRLPEVQRVLLQNLKDTGK-PLIIVLAAGSSVNTECEGN---- 503
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
+++ YPG+ GG+A+A+++FG+ +P G+LP+T+Y++ + +T ++ RT
Sbjct: 504 -ALINAWYPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLPDFTDYSMK------NRT 556
Query: 590 YKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
Y+F D V+YPFGYGL+Y+ F +C D++Y
Sbjct: 557 YRFCDDESNVLYPFGYGLTYSHF-------------------ECGDVSY----------- 586
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQ 707
KD T + V N G +V+ VY K H + +ERV + G+
Sbjct: 587 --------KDN--TLAVNVTNTGSRSAEDVLQVYIKSENGVKNH--SLCAFERVSLFDGE 634
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
S + + + + VD+ + SG +T+ G
Sbjct: 635 SRTISINIPE-GAFETVDDNGIRAVRSGRYTLYAG 668
>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 720
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 254/753 (33%), Positives = 377/753 (50%), Gaps = 98/753 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+ D L RAK L+ +TL EK+ +G V RL +P Y WW+EALHGV+ G
Sbjct: 29 FRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWNEALHGVARAGE-- 86
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+F+ +L +I +STEAR+ YN+ +
Sbjct: 87 --------------ATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRSKGFQNQY 132
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
G+TFW+PNIN+ RDPRWGR ET GEDP++ +V+GLQ E R L
Sbjct: 133 LGITFWTPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQGSE--------PERRL 184
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K +A KH+A + DR HF++ V E+D++ET++ F+ V G V+++MC+YNR
Sbjct: 185 KTAAGAKHFAVHSGPE---ADRHHFNAVVDEKDLRETYLPAFKALVENG-VTTIMCAYNR 240
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG P C LL +R +W F G +V+DC ++ I HK + T+ + A +KAG+
Sbjct: 241 VNGEPCCTGKTLLQDILRDEWGFKGQVVTDCWALDDIWLRHKTI-PTRVEVAAAAVKAGV 299
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
+LDC + A+++ + +D++L ++LG++D Y++ G +++
Sbjct: 300 NLDCANILQEDVQDAIEKRLLTLEQVDSALLPTLQTQLKLGFYDDPSHSPYRHYGIDSVN 359
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
N HI LA EAA + +VLLKND G LPL I ++ +VG +A + A+ GNY G
Sbjct: 360 NSYHISLAKEAAEKSMVLLKND-GILPLKKDTISSIMVVGENAASISALTGNYHGLSGNM 418
Query: 424 TSPMDGFYAYS---KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
+ ++G + Y GC+ + S I AA D T+ V GL +E E
Sbjct: 419 VTFVEGLVKAGGPGMSVQYDYGCS---FADTSHF-GGIWAAGFTDVTIAVIGLSPLLEGE 474
Query: 481 ---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
G D+ DL +P + K+ ++ PV V+ A+DI+ + P +
Sbjct: 475 HGDAFLSNWGGDKKDLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALDISAIE--PYADA 532
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYK 591
I++ YPGE+GG A+AD+IFG+ +P GRLPIT+Y+ PY N RTY+
Sbjct: 533 IIYAWYPGEQGGTALADLIFGEVSPSGRLPITFYKDIKDLPPYHDY------NMTNRTYR 586
Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
+F G V+YPFGYGLSYT F Y+ S P + V D
Sbjct: 587 YFQGDVLYPFGYGLSYTSFHYEWLSKPST--------------------------KVSED 620
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
D+ + I V N G MD EV+ VY P I ++++ G+ R+ I AGQ+
Sbjct: 621 DI------ISVNIAVTNTGTMDADEVIQVYIVYPDIERMPLRELKGFSRIHIKAGQTQNT 674
Query: 712 GFTMNACKSLKIVDNAANSL-LASGAHTILVGE 743
+ K+LK D+ N L G + I V +
Sbjct: 675 DIQI-PVKNLKKWDSKNNRWKLYKGKYKIQVSQ 706
>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
Length = 721
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 255/743 (34%), Positives = 379/743 (51%), Gaps = 98/743 (13%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
AK+++ RMT+ EK+ Q+ + + + LG+ Y+WWSE LHGV GR
Sbjct: 32 RHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR----------- 80
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------NAGLTFWSPN 134
AT FP I A+F+E+L ++IG V+TE RA +N+ NAGLTFWSPN
Sbjct: 81 -----ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPN 135
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR +ET GEDP + G YVRGLQ D+ LK AC KHY
Sbjct: 136 VNIFRDPRWGRGMETYGEDPLLSGMLGTAYVRGLQ---------GDDAFYLKTGACAKHY 186
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + EG R D + +D+ ET++ F+M V +G V +VM +YNRV G P
Sbjct: 187 AVH--SGPEGT-RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGS 243
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
LL +R W F+G+IVSDCD+I H+++ T E+A A +KAGL+++CG +
Sbjct: 244 KYLLTDILRKSWGFNGHIVSDCDAINDFYGGHRYVK-TPEEACAAAIKAGLNVECGHTFK 302
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQYKNLGKNNICNPQHIELAA 372
GA+ QG +AEAD+D +L L + ++LG D + Y + ++ IC+P H LA
Sbjct: 303 AM-QGALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALAL 361
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
AA + +VLLKN NG LPL+ NI+TL + GP A+ ++GNY G RY++ + G +
Sbjct: 362 RAADEAMVLLKN-NGILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVS 419
Query: 433 Y---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--------- 480
+N+ P I + N M A++ A A+ ++V G + ++E E
Sbjct: 420 RVSSGTSVNFRPAFMQITEELNDM-NWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASAS 478
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRV + LP Q + +V A KG +V+++ G+ I+ + + +++ YPG+
Sbjct: 479 RGDRVGIGLPASQMNYLRRV-KARKGGRIVVVLTGGS-PIDLREISKLADAVVMAWYPGQ 536
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
EGG A+ D++FG N GRLPIT+ S+P + GRTYK+ G V+YP
Sbjct: 537 EGGEALGDLLFGDKNFSGRLPITF------PADVDSLPAFDDYSMNGRTYKYMSGNVMYP 590
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYGLSY + Y A VG K K
Sbjct: 591 FGYGLSYGRVTYTDAR--------------------VVGRIK-------------KGEPL 617
Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
++ + N G EV Y + P G+ + ++G+ RV I S K F + +
Sbjct: 618 AVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKIVPER 677
Query: 720 SLKIVDNAANSLLASGAHTILVG 742
+ I + ++ LL G +T+ +G
Sbjct: 678 LMTIQSDGSSKLL-KGNYTLTIG 699
>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 793
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 259/767 (33%), Positives = 389/767 (50%), Gaps = 79/767 (10%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD L ERA LV + + EK+ + A G R+GLP Y WWSEALHGV++
Sbjct: 44 CDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALHGVAYA----- 98
Query: 75 SPPGTHFDSEVPG----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PGT F PG +TSFP +L ASF++SL +KIG + TE+RA N +GL +
Sbjct: 99 --PGTQF-RRGPGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGNGRWSGLDY 155
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
W+PN+N +DPRWGR ETPGED + RYA + ++GL+ H + + R + +
Sbjct: 156 WTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIKGLEGP-----HPEKERR---VVST 207
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKHYAA D ++W G R FD+R++ QD+ E +++PF+ C + V S+MC+YN VNG+P
Sbjct: 208 CKHYAANDFEDWNGTSRHDFDARISAQDLAEYYLMPFQQCARDSRVGSIMCAYNAVNGVP 267
Query: 251 TCADPKLLNQTIRGDWNFHG---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
+CA+ LL+ +R W + G Y+ SDC+++ + HK+ T + A +AG D
Sbjct: 268 SCANSYLLDTVLRKHWGWTGHNNYVTSDCEAVLDVSAGHKYAR-TNAEGTAMCFEAGTDT 326
Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNICNPQ 366
C ++ GA QG + E +D +L LY L+R+GYFDG S + ++ ++ P
Sbjct: 327 SCEYTPSSDIRGAYAQGLLREETMDRALLRLYEGLVRVGYFDGNSSAFSDISWADVNAPA 386
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKT-------------LALVGPHANATKAMI 413
+L+ ++A +GIV+LKND G LPL G + LA++G A+A + +
Sbjct: 387 AQDLSLQSAVEGIVMLKND-GTLPLPLGAKCSSKSKKRSSSGGPKLAMIGFWADAPEKLR 445
Query: 414 GNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQN------NSMIPAAIDAAKNADAT 467
G Y GT +P YA ++ V Q ++ A+ AA+ AD
Sbjct: 446 GGYSGTAAYLRTPA---YAARQMGLDVVTAGGPVLQGAAAAAADNWTAPALAAAEGADYI 502
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
V GLD + E KDR D+ PG Q L+ ++ AA G +V+ +D N
Sbjct: 503 VYFGGLDETAAGENKDRWDVEWPGAQLALVKRL--AALGKPLVVVQMGDQLDGTPLLANA 560
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRP--VNN 584
+ ++LW +PG++GG A+ ++ G +P GRLP+T Y ANY + +P T M LRP +
Sbjct: 561 GVGAVLWASWPGQDGGPAVMRLLSGAASPAGRLPVTQYPANYTRLVPMTEMALRPSASGS 620
Query: 585 FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP----KSVDIKLDKDQQCRDINYTVGT 640
PGRTY+++ PV+ PFG+GL YT F V P S + CRD +
Sbjct: 621 RPGRTYRWYSTPVL-PFGFGLHYTNFTPAVTVPPALAAASGVTTSSLLEACRDPHPERCA 679
Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYE 699
P ++ V N G+ V + + S G IK + Y
Sbjct: 680 LPP------------------LRVAVANTGRRASDYVALAFVSGDYGPRPRPIKTLAAYA 721
Query: 700 RVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
R+ + AG SA+ + D N++L G + + + E V
Sbjct: 722 RLRGVRAGGSAEADLAWT-LGDIARHDEDGNTVLYPGTYKVQIDEPV 767
>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
Length = 2278
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 255/771 (33%), Positives = 391/771 (50%), Gaps = 95/771 (12%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAY---GVPRLGLPLYEWWSEALHGVSF 68
FP+C++ L R +DL++R+ L EKV+ + A +PRLG+P Y W + +HGV
Sbjct: 34 FPFCNSSLSLDLRVEDLLQRLQLDEKVRMLTARASTHGSIPRLGVPEYNWGANCVHGV-- 91
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG---- 124
S GTH ATSFP + A F+ + K+ Q + E RA+ G
Sbjct: 92 -----QSTCGTH------CATSFPNPVNLGAIFDPNEIYKMAQVIGKELRALRLEGAREN 140
Query: 125 -----NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
+ GL WSPNIN+ RDPRWGR +ETP EDPYV +Y + Y +GLQ+ +
Sbjct: 141 YARGPHIGLDCWSPNININRDPRWGRAMETPSEDPYVNAKYGVAYTKGLQEGQ------- 193
Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
DSR L+ KHY AY +N+ G DR FD+ V+ D +T+ FE V +G +
Sbjct: 194 -DSRFLQAVVTLKHYLAYSYENYGGTDRTQFDAIVSAYDFADTYFPAFEASVVDGKAKGI 252
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MCSYN +NGIPTCA+ K LNQ +R D F GYI SD +IQ I + HK+ T +A
Sbjct: 253 MCSYNSLNGIPTCAN-KWLNQLLRDDLEFDGYITSDTGAIQGIFDGHKY-TKTLCEATKI 310
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK 359
+++G+D+ G+ Y N + + A ID ++R + +LG FD + G
Sbjct: 311 AMESGVDICSGNAYWN-CLKQLANSTNFSASIDEAIRRTLKLRFQLGLFDAIGDQPHFGP 369
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
++ + ++L+ + AR+ IVLL+N LPL G +A++GPH+ + ++GNY G
Sbjct: 370 EDVRTAKSLQLSLDLARKSIVLLQNHGNTLPLRLG--LRIAVIGPHSMTRRGIMGNYYGQ 427
Query: 420 PCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
C SP++ + + N + GC I + + A+ A + AD V+
Sbjct: 428 LCHGDYDEVRCIQSPLEAIQSVNGRNNTHHVNGCG-INDTSTAEFDDALQAVRTADVAVL 486
Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
G+D+S+E E KDR ++ +P Q EL+ + A K P +V+ + G + I K
Sbjct: 487 FLGIDISIERESKDRDNIDVPHIQLELLKAIRVAGK-PTVVVLFNGGILGIE--KLILYA 543
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY---VKIPYTSMPLRPVNNFP 586
S+L YPG G +AIA+++FG NP G+LP+T Y +N+ V + SM L +P
Sbjct: 544 DSVLEAFYPGFFGAQAIAEILFGSINPSGKLPVTMYRSNFINDVDMKSMSMTL-----YP 598
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GR+Y+++ VY FG+GLSYT F S +S+D R +N+ V T +P
Sbjct: 599 GRSYRYYTEVPVYSFGWGLSYTTF------SIQSID-----SHDTRAMNH-VLTAQPK-- 644
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-----IKQVIGYERV 701
++I + N GK G EV+ + +P I T +Q+ Y RV
Sbjct: 645 --------------MYRILITNNGKYYGEEVLFAFFRPLDIHATGPVESLQQQLFNYTRV 690
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV-GGVSFP 751
+ G +V + ++L + D N + G + +++ GV ++FP
Sbjct: 691 RLDPGDMREVPLHVKD-ENLALHDRNGNLCVFEGFYELIISNGVEEQLTFP 740
>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
Length = 699
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 247/745 (33%), Positives = 375/745 (50%), Gaps = 103/745 (13%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
++A+ L+ MTL EK+ QM + G+PRLG+ Y+WW+E LHGV GR
Sbjct: 11 QKARRLINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDGR----------- 59
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
AT FP I A+FN +L ++IG ++TE RA YN+ GLTFWSPN
Sbjct: 60 -----ATVFPQPIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPN 114
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN+ RDPRWGR +ET GEDP++ G I YV+G+Q +D LK++AC KHY
Sbjct: 115 INIFRDPRWGRGMETYGEDPFLTGTLGIAYVQGMQ---------GNDPFYLKVAACGKHY 165
Query: 195 AAYDLDNWEGNDRFHFDSRV--TEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
A + G + ++ V T++D+ ET++ F+M V +G V ++M +YNRV G
Sbjct: 166 AVHS-----GPEATRHEANVSPTKRDLFETYLPAFKMLVQQGHVEAIMGAYNRVYGEACS 220
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
LL +R W F G+IVSDCD++ I HK + T+ +A A +KAGL+++CG
Sbjct: 221 GSKYLLTDVLRKQWGFRGHIVSDCDAVADIHAGHKIVK-TEAEACAIAIKAGLNIECGHT 279
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY--FDGSPQYKNLGKNNICNPQHIEL 370
+ AV Q + E +ID +L L + ++LG +D Y + + IC+P+HI L
Sbjct: 280 FEAMKQ-AVAQKLLTEQEIDRALLPLMMTRLKLGILEYDAECPYNEVKETEICSPEHIAL 338
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A +AA + +VLLKN NG LPL+ N+ TL + GP A+ + ++GNY G RY + + G
Sbjct: 339 ARKAATESMVLLKN-NGILPLDK-NLHTLFIAGPGASDSFWLMGNYFGISNRYCTYLQGI 396
Query: 431 ---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG------ 481
+ +N+ P + N+ I A+D A A+ T++V G + ++E E
Sbjct: 397 ADKVSSGTAVNFRPAFGESTPTKNT-INWALDEAIAAEKTIVVMGNNGNLEGEEGESIAS 455
Query: 482 ---KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
DRV + LP Q + + + A K + +V+ +D+ + W YP
Sbjct: 456 ETRGDRVSMRLPASQMKFLRDL-KARKNGIVVVLTGGSPIDVREISRLADAVVMAW--YP 512
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
G+EGG A+AD++FG N GRLP+T+ E+ P+ ++ GRTYK+ +
Sbjct: 513 GQEGGYALADLLFGDENFSGRLPVTFPESTDALPPFEDYAMK------GRTYKYQTAHIQ 566
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
YPFGYGLSYT Y A ++ + K
Sbjct: 567 YPFGYGLSYTTVTYAHAK---------------------------------VETMPQKGR 593
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGT-HIKQVIGYERVFIAAGQSAKVGFTMNA 717
T ++N G EV VY PG T + ++ ++R+ + G+ V F +
Sbjct: 594 GMTVSAVLKNTGNKAVDEVAQVYLSAPGAGTTAALASLVAFKRIGLQPGEQQLVRFDIPF 653
Query: 718 CKSLKIVDNAANSLLASGAHTILVG 742
+ L + ++ LL G +TI VG
Sbjct: 654 DRLLTVQEDGTAQLL-KGNYTITVG 677
>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
4H-3-7+YE-5]
Length = 733
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 254/763 (33%), Positives = 380/763 (49%), Gaps = 94/763 (12%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+D P+ D +LP ER L++ MTL EK Q+ + + RLGLP Y++W+EALHGV+
Sbjct: 22 NDQPWFDTQLPTQERIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARN 81
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN 125
GR AT FP I A+F++ L K +S EARA +N +GN
Sbjct: 82 GR----------------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
+GLTFW+PNIN+ RDPRWGR ET GEDPY+ + V GLQ
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQG---------DH 176
Query: 182 SRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
+ LK +A KH+A + G + R FD+ + +DM ET+ FE + E +V +V
Sbjct: 177 PKYLKTAAAAKHFAVHS-----GPEALRHEFDAIASPKDMYETYFPAFEALITEANVETV 231
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
M +YNRVNG P LLN +R W F G++VSDC + + HK + E A A
Sbjct: 232 MAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-AL 290
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNL 357
+ G DL+CG Y N AV+ G + E ID L + +LG+FD Y N+
Sbjct: 291 AINTGTDLNCGAVY-NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNI 349
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ + + H ++A E A + IVLL+N N LPL+ NI+ L + GP A++++ ++GNY
Sbjct: 350 SADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYY 408
Query: 418 GTPCRYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
G + T+ +DG A V INY G N + +A + D + V GL
Sbjct: 409 GLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLS 468
Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
+ E E DR+ L LP Q + K+ PV +V+++AG +N +
Sbjct: 469 GAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV-IVVLTAG-TPVNLTEI 526
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+I++ YPG+EGG+A+AD++FG+ +P GRLPIT+ ++ PY ++
Sbjct: 527 AELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITFPKSEAQLPPYDDYSMQ----- 581
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
GRTY++ +YPFG+GLSY Q K+ +I L Q N+P
Sbjct: 582 -GRTYRYMTQEPMYPFGFGLSYAQVKFD--------NITLGNTQAL------ASKNEP-- 624
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIA 704
T + V N G+ + EVV +Y K P + + + G+ R+ +A
Sbjct: 625 -----------QENMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLA 673
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
AGQ+ +V F++ K L ++ +L G ++++VG G
Sbjct: 674 AGQTEQVLFSI-PKKHLYSINEQGKPVLLKGQYSVIVGNASPG 715
>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
Length = 712
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 256/775 (33%), Positives = 394/775 (50%), Gaps = 116/775 (14%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ RMTL EK+ QM + +PRLG+P Y++WSEALHGV+ G+
Sbjct: 8 YLNPDAPLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNGK-- 65
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-----LGNA-- 126
AT FP I A+++ L +++ +++EARA ++ G
Sbjct: 66 --------------ATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDI 111
Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLT WSPNIN+ RDPRWGR ET GEDPY+ G +VRGLQ D
Sbjct: 112 YQGLTMWSPNINIFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQG---------KDPHY 162
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
LK +AC KHY + E R F++ VT +++ +T++ F+ V E V +VM +YN
Sbjct: 163 LKTAACAKHYTVHSGPEKE---RHTFNAIVTRRELFDTYLPAFKKLVTEAKVEAVMGAYN 219
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
R G P C P LL + +R W F G++VSDC +I H+ D E A A +K G
Sbjct: 220 RTLGEPCCGSPYLLKEILRNQWGFKGHVVSDCGAINDFHLHHQVTKDGAESA-ALGIKNG 278
Query: 305 LDLDCGDYYT--NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLG 358
D+ C Y+ N T A+ +G I E DID +LR +LG FD PQ Y ++
Sbjct: 279 CDMACICTYSYENLTE-ALNRGLITEEDIDHALRNTLRTRFKLGLFD--PQEKVPYAHIS 335
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ + H +LA E A + VLLKN N LP+ ++K++ +VGP+A ++GNY G
Sbjct: 336 MSVVGCEAHRKLAYETAVKSAVLLKNHNHILPVKP-DVKSILIVGPNAGNVHVLLGNYYG 394
Query: 419 TPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
T+ M+G + + PG + ++ I A A +++A + L
Sbjct: 395 LSDSMTTFMEGLVGRLPEGVRMEFMPGS---LLTDSKKIKNDWSVASAASFDLVIAFMGL 451
Query: 476 SVEAEGK----------DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
S EG+ DR D+ LP Q E I +A A + LV+ A+ +N ++
Sbjct: 452 SPLLEGEEGEAILSDNGDREDIALPKAQQEYIRDLA-ATGAKIVLVLTGGSAIALNGIED 510
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+++ILWVGYPG+EGGRAIAD+IFG ++P G+LPIT+ P ++ L P +
Sbjct: 511 --LVEAILWVGYPGQEGGRAIADLIFGDHSPSGKLPITF--------PVSTDQLPPFREY 560
Query: 586 P--GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
RTY++ ++PFG+GLSYTQF+YK +++L+ P
Sbjct: 561 SMKERTYRYMTSSPLFPFGFGLSYTQFEYK--------NLQLE---------------HP 597
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF 702
+A + + TF E+ N+G+ +G EVV VY S ++++I ++RV
Sbjct: 598 VLSA-------GEALRGTF--ELANVGEYEGEEVVQVYLSDLEASTIVPLQKLISFQRVR 648
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
+ G++ ++ F + +++ ++D+ N +L G + +G P+Q +L+
Sbjct: 649 LKPGETVQLSFAIQP-EAMMMIDDEGNQVLEPGKFKLTIGGAA-----PIQRSLD 697
>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
Length = 743
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 261/760 (34%), Positives = 377/760 (49%), Gaps = 109/760 (14%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L + ERA+DLV RMTL EK+ QM A + RLG+P Y WW+EALHGV+ G
Sbjct: 30 YRDENLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNEALHGVARAGV-- 87
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-------- 125
+T FP I A+F+ L +K +STE RA Y+
Sbjct: 88 --------------STMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRKGDRDIY 133
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSP IN+ RDPRWGR ET GEDPY+ R A++++RG+Q R L
Sbjct: 134 KGLTFWSPTINIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQ----------GRGRYL 183
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K +AC KH+A + ++R F++ V+++D+ ET++ FE V E V+ VM +YNR
Sbjct: 184 KAAACAKHFAVHSGPE---SERHQFNAEVSQKDLWETYLPAFEASVKEAKVAGVMGAYNR 240
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG P C LL +RG+W F GY+ SDC +I+ I E H + T E++ A +K+G
Sbjct: 241 VNGEPCCGSGTLLGDVLRGEWEFGGYVTSDCWAIKDINEGHG-VTKTIEESSALAVKSGC 299
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG-KNNI 362
DL+CG Y + + A + G I E +IDT++ L + MRLG FD + Y ++ + N
Sbjct: 300 DLNCGCAYASL-VKAYRAGLIGEKEIDTAVHRLMLTRMRLGMFDAPEKVPYSSIPYEKND 358
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
C +H A E A + +VLL+N +G LPL+ I+++A++GP+A++ A+ GNY GT
Sbjct: 359 C-AEHRAFALEVAEKSLVLLRNRSGFLPLDRSRIRSVAVIGPNADSRVALEGNYNGTASE 417
Query: 423 YTSPMDGFYAY---SKVINYAPGCADI------VCQNNSMIPAAIDAAKNADATVIVAGL 473
Y + +DG + YA G + Q N + A AA+ AD V+ GL
Sbjct: 418 YVTVLDGIREAVGDRARVYYAEGSHLFRNSMGGLSQKNDRLAEAAAAAERADVAVVCLGL 477
Query: 474 DLSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAK 524
+ +E E D+ DL LPG Q EL+ V A PV LV++S A+ +N+A
Sbjct: 478 NRDIEGEEGDPSNEYPAGDKRDLRLPGLQEELLETV-KATGTPVVLVLLSGSALAVNWAD 536
Query: 525 NNPKIKSILWVGYPGEEG-GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVN 583
N W YPG + GR A +FG P G P + + R
Sbjct: 537 ENADAVVQAW--YPGAQAEGRRGA--LFGIIRPAGGFP--------SRSTVRTRTSRIFG 584
Query: 584 NFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
G +YPFGYGLSYT+F+Y D+KL ++
Sbjct: 585 TIHENRLPLLQGDPLYPFGYGLSYTKFQYG--------DLKL-------------AASEI 623
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK-QVIGYERVF 702
P +D + + V N G+ D EVV +Y + + K Q+ G+ RV
Sbjct: 624 PAG----EDAEV-------SVTVRNAGERDSDEVVQLYLQDLESSVPVPKWQLAGFRRVH 672
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ G+SA V FT+ A + + ++D +L G + G
Sbjct: 673 LKPGESAGVRFTV-AARQMALIDEDGRCVLEPGGFRVYAG 711
>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 723
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 260/764 (34%), Positives = 377/764 (49%), Gaps = 119/764 (15%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY + L ERA DLV R+TL EK+ M + + V RLG+ YEWW+EALHGV+ G
Sbjct: 24 PYQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGVARNGL- 82
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY----NLGN--- 125
AT +P I ASFN++L ++ ++S EAR Y GN
Sbjct: 83 ---------------ATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKR 127
Query: 126 -AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFW+PNIN+ RDPRWGR ET GEDPY+ R ++ V GLQ + +Y+
Sbjct: 128 YTGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQNTKYN------- 180
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K AC KHYA + W +R F++ + +D+ ET++ F+ V +G+V VMC+Y
Sbjct: 181 -KTHACAKHYAVHSGPEW---NRHSFNAENINPRDLWETYLPAFQDLVIQGNVKEVMCAY 236
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-----ESHKFLNDTKEDAVA 298
NR G P C +LL +R +WN+ G +VSDC +I E+HK K DA A
Sbjct: 237 NRFEGDPCCGSDRLLINILRNEWNYKGLVVSDCGAIDNFYFKGRHETHK----NKADASA 292
Query: 299 RVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG 358
+ +G DL+CG YT + AV++G I E+ ID SL L LG D + + L
Sbjct: 293 AAVLSGTDLECGRSYTGL-ISAVKEGLINESAIDQSLCRLMKARFELGEMDDTTPWDQLP 351
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ + H +LA + AR+ + LL+N LPL+ T+AL+GP+AN + NY G
Sbjct: 352 DSLLSCHAHQQLALQMARESMTLLQNHKNILPLDKE--MTVALIGPNANDSVMQWANYNG 409
Query: 419 TPCRYTSPMDGFYAY--SKVINYAPGCADIVCQNNSM-------IPAAIDAAKNADATVI 469
P + ++G Y + + Y P +I Q I A I+ A AD +
Sbjct: 410 FPVHTITLLEGLTQYLPQERLIYIPQ-KNIEVQKYPWVNYYPNDIQAVINQAAKADVIIY 468
Query: 470 VAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
G+ S+E E G DR + LP Q +L+ K A P+ V S A+
Sbjct: 469 AGGISASLEGEEMDVDAEGFRGGDRTTIELPNVQRKLV-KALKATGKPIVFVNFSGCAMG 527
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
+ + +IL YPG+ GG AIA+V+FG YNP GRLPIT+Y+ + +P
Sbjct: 528 LQ--PESQICDAILQAWYPGQAGGTAIAEVLFGDYNPAGRLPITFYKKD------NQLPD 579
Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
N GRTY++ + +YPFG+GLSYT F Y S+P
Sbjct: 580 FEDYNMQGRTYRYLNYEPLYPFGHGLSYTTFSY---STP--------------------- 615
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYE 699
I++ K K ++V N G +G EV+ +Y K +K + G++
Sbjct: 616 ---------FIENGKLK-------VKVTNSGNYNGDEVIQLYIKRYDDPDGPLKTLRGFQ 659
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
R+ I AGQ+++V F + + + D +N++ G + ILVG
Sbjct: 660 RIHIPAGQTSEVSFPLTS-DTFTWWDKDSNTVHPLQGRYKILVG 702
>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
S85]
Length = 727
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 256/768 (33%), Positives = 389/768 (50%), Gaps = 108/768 (14%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
D + D ERA+ LV +MTL EK+ Q+ + A + RL +P Y+WW+EALHGV+ G
Sbjct: 18 DLSFLDTDKSIEERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEALHGVARNG 77
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY----NLGN- 125
+ AT FP I A+F+ L ++ +STEARA Y +GN
Sbjct: 78 K----------------ATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMGNH 121
Query: 126 ---AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
AGLTFW+PN+N+ RDPRWGR ET GEDPY++ + + +V+GLQ D
Sbjct: 122 SRYAGLTFWTPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQG---------DDP 172
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
LK +AC KHYA + + + R F++ T+QD+ ET++ FE V + +V VM +
Sbjct: 173 NYLKSAACAKHYAVH---SGPESLRLEFNAVPTQQDLYETYLPAFEALVKDANVEGVMPA 229
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
+N V G P A+ LL +R W F GY+V+DC +I+ I HK++ D++ A A LK
Sbjct: 230 HNAVFGAPMAANKFLLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYV-DSEVAAAAVALK 288
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGK 359
AG +L+CG Y A+ QG + E + + L+ RLG FD Y +G
Sbjct: 289 AGTNLNCGATYKELKK-AIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSKNPYSKIGP 347
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
I + +HIELA EAA++ IV+LKN N LPL T +IK + GP AN++ ++G+Y G
Sbjct: 348 ELIHSKEHIELAREAAQKSIVMLKNKNNLLPLPT-DIKVPYVTGPFANSSDMLMGSYYGV 406
Query: 420 PCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
+ + G + +NY G +N + A + A +D T+ V GL
Sbjct: 407 SPGVVTILAGITDAVSLGTSLNYRSGALPF-QKNINPKNWAPNVAGMSDVTICVVGLTAD 465
Query: 477 VEAEG---------KDRVDLLLPGFQTELINKVADAAK-GPVTLVIMSAGAVDINFAKNN 526
E EG DR+DL LP Q + ++A K P+ LVI S V + + +
Sbjct: 466 REGEGVDAIASNHKGDRLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSPVSLEGIEEH 525
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
+IL + YPGE+GG A+ADV+FGK +P G LP+T+ ++ +P +
Sbjct: 526 --CDAILQIWYPGEQGGNAVADVLFGKVSPTGHLPMTFPKS------VAQLPDYKDYSMK 577
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTYK+ ++PFG+GL+Y++ ++K
Sbjct: 578 GRTYKYMTEEPMFPFGFGLTYSKTEFK--------------------------------- 604
Query: 647 AVLIDDVKC-KDYKFTFQIEVENMGKMDGSEVVMVYSKPP------GIAGTHIKQVIGYE 699
++++D K K +EV N+G D E+V +Y P G+ T +K ++
Sbjct: 605 NLVVEDAKLRKKESLKVSVEVTNVGDFDIDEIVQLYISPKSQKEGEGLPFTTLK---AFK 661
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
RV + G++ KV FT++ +SLK+++ + GA+ + VG G
Sbjct: 662 RVALKKGETQKVEFTIHP-ESLKVINVKGQKVWRKGAYKVTVGNSSPG 708
>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
phytofermentans ISDg]
Length = 717
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 220/622 (35%), Positives = 334/622 (53%), Gaps = 67/622 (10%)
Query: 21 YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
+ +RA +LV++MTL EKV Q A +PRL + Y +W+EALHGV+ G
Sbjct: 10 FQQRATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAGV--------- 60
Query: 81 FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWS 132
AT FP I A+F+E L ++I T+STE R +N GLTFWS
Sbjct: 61 -------ATVFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWS 113
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PN+N+ RDPRWGR ET GEDP++ G +V G+Q D LK +AC K
Sbjct: 114 PNVNIFRDPRWGRGHETFGEDPFLSGTLGGRFVDGIQG---------HDETYLKAAACAK 164
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+A + + + R F++ V+EQD++ET++ F+ V E V +VM +YNR NG P C
Sbjct: 165 HFAVH---SGPEDIRHSFNAEVSEQDLRETYLPAFKKLVKEHKVEAVMGAYNRTNGEPCC 221
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
LL +RG+W F G++ SDC +I+ E H ++ E +VA + G DL+CG+
Sbjct: 222 GSKTLLEDILRGEWEFVGHVTSDCWAIKDFHEHHMVTSNAVE-SVALAMNRGCDLNCGNL 280
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIEL 370
Y N + AV+ G + E IDT+L L+ M+LG FD S + + + + EL
Sbjct: 281 YVNL-LQAVRDGLVEEETIDTALIRLFTTRMKLGLFDKEESIPFNTITYDQVDTKSSKEL 339
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
+A+++ +VLLKN++ LPLN I ++ ++GP+AN A++GNYEGT Y + ++G
Sbjct: 340 NIKASKKCVVLLKNEDNILPLNPKKITSVGVIGPNANNRNALVGNYEGTASEYITVLEGI 399
Query: 431 YAY---SKVINYAPGCADI------VCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE- 480
+ ++ GC + Q N I +++D + GLD +E E
Sbjct: 400 KQVVPEDVRVYFSEGCHLFKNKLSNLSQENDRIAEVRAVCEHSDVVIACLGLDPGLEGEE 459
Query: 481 --------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
D+ L LPG Q +++ + + K PV L+++S A+ + +A + I +I
Sbjct: 460 GDQGNQFASGDKKTLALPGIQEDVLKTIYECGK-PVILILLSGSALAVPWA--DEHIPAI 516
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
L YPG +GGRAIA++IFG NP G+LP+T+Y +T ++ RTY++
Sbjct: 517 LQGWYPGAQGGRAIAELIFGDGNPEGKLPVTFYRTTEELPEFTDYAMK------NRTYRY 570
Query: 593 FDGPVVYPFGYGLSYTQFKYKV 614
+YPFGYGLSYT F++ +
Sbjct: 571 MKNEALYPFGYGLSYTTFEHTL 592
>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
Length = 411
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 192/411 (46%), Positives = 261/411 (63%), Gaps = 21/411 (5%)
Query: 343 MRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTL 399
MRLG+FDG+P+ Y LG ++C ++ ELA E ARQGIVLLKN G+LPL+ IKTL
Sbjct: 1 MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60
Query: 400 ALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAID 459
A++GP+AN TK MIGNYEG C+YT+P+ G Y GC ++ C + A
Sbjct: 61 AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTCTEADLDSAKTL 120
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
AA +ADATV+V G D ++E E DR+DL LPG Q EL+ +VA AA+GPV LVIMS G D
Sbjct: 121 AA-SADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFD 179
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMP 578
I FAKN+ KI SI+WVGYPGE GG AIADVIFG++NP G+LP+TWY +YV K+P T+M
Sbjct: 180 ITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSYVEKVPMTNMN 239
Query: 579 LRP--VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
+RP N + GRTY+F+ G VY FG GLSYT F +++ +PK V + LD+ Q CR
Sbjct: 240 MRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLNLDESQSCRS--- 296
Query: 637 TVGTNKPPCAAVLIDDVKCKD-----YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH 691
P C ++ C+ F Q++V N+G +G+E V +++ PP + G+
Sbjct: 297 ------PECQSLDAIGPHCEKAVGERSDFEVQLKVRNVGDREGTETVFLFTTPPEVHGSP 350
Query: 692 IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
KQ++G+E++ + + V F ++ CK L +VD LA G H + VG
Sbjct: 351 RKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVG 401
>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
Gv29-8]
Length = 722
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 256/728 (35%), Positives = 383/728 (52%), Gaps = 62/728 (8%)
Query: 32 MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI--GRRTNSPPGTHFDSEVPGAT 89
+TL EK + + A GV RLGLP YEW +EALHG++ + G+ NS T + +T
Sbjct: 12 LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINST-FTQGNVAFNSST 70
Query: 90 SFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLET 149
FP+ I+ A+F++ L I VSTEARA N AGL +W+PNIN RDPRWGR ET
Sbjct: 71 QFPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKAGLDYWAPNINPYRDPRWGRGQET 130
Query: 150 PGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFH 209
PGEDPY V +YA NYV GL+ G K+ + CKH+A YD+++ +G R
Sbjct: 131 PGEDPYHVAQYAYNYVVGLKGGVG--------PAKSKVVSTCKHFAGYDIEDSDGVVRGS 182
Query: 210 FDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFH 269
+++ ++ QD+ E ++ F C + +VMCSYN VNG P+CA+ +L+ +R W +
Sbjct: 183 YNAIISTQDLAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSCANSYMLDTVLRDHWGWG 242
Query: 270 G---YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKI 326
++ DC ++ + H + + VA + G DLDCG Y + AVQ
Sbjct: 243 SSAHWVTGDCGAVDGVFNQHH-VGQSAAQGVAFAINNGTDLDCGTAYASNIASAVQNNYT 301
Query: 327 AEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
EA +D +L LY L+ LGYFD +Y+ LG +++ P +LA A +GI +L
Sbjct: 302 TEAQLDQALSRLYSSLIVLGYFDPPEGQEYRTLGVSDVNTPSTQKLAYTALVEGINILP- 360
Query: 385 DNGALPLNTGNIKTLALVGPHA-NATKAMIGNYEGTPCRYTSPMD--GFYAYSKVINYAP 441
P+ +T+ VGP A NA+ +M GNY G T P+ AY+ + Y+
Sbjct: 361 ---IRPMG----QTVLFVGPWANNASVSMFGNYNGVAPYKTIPVPTANSSAYNWNVTYSQ 413
Query: 442 GCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVA 501
G ++ + S AA+ AA+ AD V + G+D VEAE DR + PG Q LI ++
Sbjct: 414 GLQYVLSNDTSQFAAAVSAAQEADVVVYIGGIDEQVEAEAHDRTSIDWPGAQLNLIKQL- 472
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
AA PV +V + G VD + N +K +LW+GYPG+E G + D++ G P GRLP
Sbjct: 473 -AAVKPVVVVQVGGGQVDDSSLLQNKNVKGLLWMGYPGQEFGSGLIDILSGASAPAGRLP 531
Query: 562 ITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQF--KYKVASSP 618
+T Y ANY+ ++P T LRP ++ PGRTY++++G V+ PFG G+ YT+F +K S
Sbjct: 532 VTQYPANYITQVPMTDQSLRPSSSNPGRTYRWYNGSVI-PFGTGIHYTKFNISWKTGGSG 590
Query: 619 KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY-KF-TFQIEVENMGKMDGSE 676
+ D I+ KD +F FQI VEN+G
Sbjct: 591 RGTYDTAD----------------------FINAEDPKDLAEFDVFQINVENVGSTTSDY 628
Query: 677 VVMVY--SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLA 733
V +++ S G +K ++ Y R G++ K+ +N + + D++ N +L
Sbjct: 629 VALLFVKSSDSGPQPYPLKTLVSYARAHGTQPGETTKIDLRVNVGQ-IARNDSSGNLVLY 687
Query: 734 SGAHTILV 741
GA+T+ +
Sbjct: 688 PGAYTLEI 695
>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
Length = 733
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 252/763 (33%), Positives = 380/763 (49%), Gaps = 94/763 (12%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+D P+ D +LP +R L++ MTL EK Q+ + + RLGLP Y++W+EALHGV+
Sbjct: 22 NDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARN 81
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN 125
GR AT FP I A+F++ L K +S EARA +N +GN
Sbjct: 82 GR----------------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
+GLTFW+PNIN+ RDPRWGR ET GEDPY+ + V GLQ
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQG---------DH 176
Query: 182 SRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
+ LK +A KH+A + G + R FD+ + +DM ET+ FE V E +V +V
Sbjct: 177 PKYLKTAAAAKHFAVHS-----GPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETV 231
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
M +YNRVNG P LLN +R W F G++VSDC + + HK + E A A
Sbjct: 232 MAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-AL 290
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNL 357
+ G DL+CG Y N AV+ G + E ID L + +LG+FD Y N+
Sbjct: 291 AINTGTDLNCGAVY-NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNI 349
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ + + H ++A E A + IVLL+N N LPL+ NI+ L + GP A++++ ++GNY
Sbjct: 350 SADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYY 408
Query: 418 GTPCRYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
G + T+ +DG A V INY G N + +A + D + V GL
Sbjct: 409 GLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLS 468
Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
+ E E DR+ L LP Q + K+ PV +V+++AG +N +
Sbjct: 469 GAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV-IVVLTAG-TPVNLTEI 526
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+I++ YPG+EGG+A+AD++FG+ +P GRLPIT+ ++ PY ++
Sbjct: 527 AELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITFPKSEAQLPPYDDYSMQ----- 581
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
GRTY++ +YPFG+GLSY Q K+ N T+G +
Sbjct: 582 -GRTYRYMTQEPMYPFGFGLSYAQVKFD---------------------NITLGNTQALA 619
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIA 704
+ + + T + V N G+ + EVV +Y K P + + + G+ R+ +A
Sbjct: 620 SKNELQE------NMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLA 673
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
AGQ+ +V F + K L ++ +L G ++++VG G
Sbjct: 674 AGQTEQVLFNI-PKKHLYSINEQGKPVLLKGQYSVIVGNASPG 715
>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 721
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 255/743 (34%), Positives = 377/743 (50%), Gaps = 98/743 (13%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
AK+++ RMT+ EK+ Q+ + + + LG+ Y+WWSE LHGV GR
Sbjct: 32 RHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR----------- 80
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------NAGLTFWSPN 134
AT FP I A+F+E+L ++IG V+TE RA +N+ NAGLTFWSPN
Sbjct: 81 -----ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPN 135
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RD RWGR +ET GEDP + G YVRGLQ D+ LK AC KHY
Sbjct: 136 VNIFRDLRWGRGMETYGEDPLLSGMLGTAYVRGLQG---------DDAFYLKTGACAKHY 186
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + EG R D + +D+ ET++ F+M V +G V +VM +YNRV G P
Sbjct: 187 AVHS--GPEGT-RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGS 243
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
LL +R W F+G+IVSDCD+I H+++ T E+A A +KAGL+++CG +
Sbjct: 244 KYLLTDILRKSWGFNGHIVSDCDAINDFYGGHRYVK-TPEEACAAAIKAGLNVECGHTFK 302
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQYKNLGKNNICNPQHIELAA 372
GA+ QG +AEAD+D +L L + ++LG D + Y + ++ IC+P H LA
Sbjct: 303 AM-QGALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALAL 361
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
AA + +VLLKN NG LPL+ NI+TL + GP A+ ++GNY G RY++ + G +
Sbjct: 362 RAADEAMVLLKN-NGILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVS 419
Query: 433 Y---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--------- 480
+N+ P I + N M A++ A A+ ++V G + ++E E
Sbjct: 420 RVSSGTSVNFRPAFMQITEELNDM-NWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASAS 478
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
DRV + LP Q + +V A KG +V+++ G+ I+ + + +++ YPG+
Sbjct: 479 RGDRVGIGLPASQLNYLRRV-KARKGGRIVVVLTGGS-PIDLREISKLADAVVMAWYPGQ 536
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
EGG A+ D++FG N GRLPIT+ S+P + GRTYK+ G V+YP
Sbjct: 537 EGGEALGDLLFGDKNFSGRLPITF------PADVDSLPAFDDYSMNGRTYKYMSGNVMYP 590
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYGLSY + Y A VG K K
Sbjct: 591 FGYGLSYGRVTYTDAR--------------------VVGRIK-------------KGEPL 617
Query: 661 TFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
++ + N G EV Y + P G+ + ++G+ RV I S K F + +
Sbjct: 618 AVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPPKSSVKAVFKI-VPE 676
Query: 720 SLKIVDNAANSLLASGAHTILVG 742
L V + +S L G +T+ +G
Sbjct: 677 RLMTVQSDGSSKLLKGNYTLTIG 699
>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
[Marvinbryantia formatexigens DSM 14469]
Length = 700
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 243/736 (33%), Positives = 371/736 (50%), Gaps = 113/736 (15%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA+ LV +MT+ EK Q+ A + RLG+P Y WW+EALHGV+ G+
Sbjct: 8 KRAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAGQ----------- 56
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A+F+E+L +I ++TE RA YN A GLTFWSPN
Sbjct: 57 -----ATVFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPN 111
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDP + R + +V+GLQ D +K +AC KH+
Sbjct: 112 VNIFRDPRWGRGHETYGEDPCLTSRLGVAFVKGLQ----------GDGETMKAAACAKHF 161
Query: 195 AAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
A + G + R F++ + +DM+ET++ FE V E DV +VM +YNR NG C
Sbjct: 162 AVHS-----GPEAVRHEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAYNRTNGEACC 216
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
A P +L + +R DW F G+ VSDC +I+ E H L T +++ A + +G DL+CG+
Sbjct: 217 ASP-VLQKILREDWGFEGHFVSDCWAIRDFHE-HHMLTATAKESAAMAINSGCDLNCGNT 274
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
Y + + A + G ++E I + L+ LG FDGS +Y ++ + + +H+ LA
Sbjct: 275 YLHI-LHAYRDGLVSEETITEAAVRLFTTRFLLGLFDGS-EYDDIPYTVVESKEHLALAE 332
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
+AA + VLLKN NG LPL ++T+ ++GP+A++ A+ GNY GT RY + G
Sbjct: 333 KAALESAVLLKN-NGILPLKKERLRTVGVIGPNADSRAALAGNYHGTASRYETIQQGLQD 391
Query: 433 Y----SKVINYAPGCA-------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE- 480
Y +V+ + GCA + + + A I A+N+D ++ GLD ++E E
Sbjct: 392 YLGEDVRVLT-SVGCALSEDRTEKLALAGDRLAEAQI-VAENSDVVILCLGLDETLEGEE 449
Query: 481 --------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
D+ LLLP Q +L+ VA K PV L +MS +D+++A + +I
Sbjct: 450 GDTGNSYASGDKETLLLPEAQRDLMEAVAATGK-PVVLCMMSGSDLDMSYAAEH--FDAI 506
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
L + YPG +GG A A ++FG+ +P G+LP+T+YE +P + GRTY++
Sbjct: 507 LQLWYPGSQGGSAAAKLLFGEVSPSGKLPVTFYET------LEELPAFEDYSMKGRTYRY 560
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
P YPFG+GL+Y D + D N +
Sbjct: 561 MGHPAQYPFGFGLTY-------------------GDVRVTDANIRGAS------------ 589
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKV 711
+ T + EN G EV+ +Y K A + + R+ + AG+ +
Sbjct: 590 ---AEGDLTLAVTAENAGNAVTDEVLQIYVKCTDSANAVPNPALAAFGRIHLEAGEKKTI 646
Query: 712 GFTMNACKSLKIVDNA 727
T+ A ++ +VD A
Sbjct: 647 EMTVPA-RAFTVVDEA 661
>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
gi|224035251|gb|ACN36701.1| unknown [Zea mays]
gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 405
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 189/407 (46%), Positives = 263/407 (64%), Gaps = 17/407 (4%)
Query: 343 MRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTL 399
MRLG+FDG P+ + NLG +++C P + ELA EAARQGIVLLKN G LPL+ +IK++
Sbjct: 1 MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSM 59
Query: 400 ALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSM-IPAAI 458
A++GP+ANA+ MIGNYEGTPC+YT+P+ G A + Y PGC ++ C NS+ + AA
Sbjct: 60 AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATV-YQPGCTNVGCSGNSLQLDAAT 118
Query: 459 DAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAV 518
AA +AD TV+V G D S+E E DR LLLPG Q +L++ VA+A+ GP LV+MS G
Sbjct: 119 KAAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPF 178
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
DI+FAK++ KI +ILWVGYPGE GG AIADV+FG +NP GRLP+TWY ++ K+P T M
Sbjct: 179 DISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMR 238
Query: 579 LR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
+R P +PGRTY+F+ G VY FG GLSYT F + + S+PK + ++L + C
Sbjct: 239 MRPDPSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLTEQ- 297
Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI 696
C +V + C+ F + V N G+ G V ++S PP + K ++
Sbjct: 298 --------CPSVEAEGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAVHNAPAKHLL 349
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
G+E+V + GQ+ V F ++ CK L +VD N +A G+HT+ VG+
Sbjct: 350 GFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 396
>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
Length = 695
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 228/613 (37%), Positives = 336/613 (54%), Gaps = 74/613 (12%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
+A LVE+MTL E+ QM A VPRLG+P Y WW E LHGV+ G
Sbjct: 9 KAVRLVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAGT------------ 56
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL----GN----AGLTFWSPNI 135
AT FP I A F+ L ++I VSTE RA YN G+ GLTFWSPN+
Sbjct: 57 ----ATMFPQAIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNV 112
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDPY+ R +VRGLQ D LKI+AC KH+A
Sbjct: 113 NIFRDPRWGRGHETYGEDPYLTSRLGTAFVRGLQ----------GDGEHLKIAACAKHFA 162
Query: 196 AYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+ G + R F + +++D+ ET++ FE CV E V SVM +YN +G P CA
Sbjct: 163 VHS-----GPEALRHEFWADTSKKDLWETYLPAFEACVKEAHVESVMGAYNSYHGEPCCA 217
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
+ L+ + +RG W F G+ VSDC +I+ ++ + DT ++ A +K G DL+CG+ Y
Sbjct: 218 NTLLMEEILRGQWGFEGHFVSDCWAIRDFHMNY-MVTDTAMESAALAVKKGCDLNCGNTY 276
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAE 373
+ A ++G + +A + ++ L+ LG + + +Y ++ + +H ELA E
Sbjct: 277 LQ-VLKACEEGLLDDACVTEAVVRLFTTRYLLGMGEET-EYDDIPYEVVECKEHRELAVE 334
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF--- 430
AAR+ +VLLKND G LPL+ + T+A++GP+A+ A+IGNY GT YT+ ++G
Sbjct: 335 AARRSMVLLKND-GLLPLHAEKLNTIAVIGPNADNRTALIGNYHGTSSCYTTILEGIQDA 393
Query: 431 YAYSKVINYAPGC-------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
+ YA GC + + + A I AK++D V+ GLD ++E E
Sbjct: 394 VGEDVRVLYAEGCHLFKDRVEHLAVAGDRLSEARI-VAKHSDVVVLCVGLDETLEGEEGD 452
Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
D+ DLLLP Q L+ ++ + K PV + MS A+D++ A+ K +++
Sbjct: 453 TGNSHASGDKKDLLLPESQRRLMEEILNLGK-PVVVCNMSGSAIDLSLAQE--KAGAVIQ 509
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
V YPG EGGRA+AD++FGK +P G+LP+T+Y+ ++P + GRTY++
Sbjct: 510 VWYPGAEGGRALADLLFGKASPSGKLPVTFYK------DLENLPPFEDYSMDGRTYRYLT 563
Query: 595 GPVVYPFGYGLSY 607
+YPFG+GL+Y
Sbjct: 564 AEPLYPFGFGLTY 576
>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
Length = 733
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 253/763 (33%), Positives = 378/763 (49%), Gaps = 94/763 (12%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+D P+ D +LP +R L++ MTL EK Q+ + + RLGLP Y++W+EALHGV+
Sbjct: 22 NDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARN 81
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN 125
GR AT FP I A+F++ L K +S EARA +N +GN
Sbjct: 82 GR----------------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
+GLTFW+PNIN+ RDPRWGR ET GEDPY+ + V GLQ
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQG---------DH 176
Query: 182 SRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
+ LK +A KH+A + G + R FD+ + +DM ET+ FE V E +V +V
Sbjct: 177 PKYLKTAAAAKHFAVHS-----GPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETV 231
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
M +YNRVNG P LLN +R W F G++VSDC + + HK + E A A
Sbjct: 232 MAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-AL 290
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNL 357
+ G DL+CG Y N AV+ G + E ID L + +LG+FD Y N+
Sbjct: 291 AINTGTDLNCGAVY-NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNI 349
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ + + H ++A E A + IVLL+N N LPL+ NI+ L + GP A++++ ++GNY
Sbjct: 350 SADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYY 408
Query: 418 GTPCRYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
G + T+ +DG A V INY G N + +A + D + V GL
Sbjct: 409 GLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLS 468
Query: 475 LSVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
+ E E DR+ L LP Q + K+ PV +V+++AG +N +
Sbjct: 469 GAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV-IVVLTAG-TPVNLTEI 526
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+I++ YPG+EGG+A+AD++FG+ +P GRLPIT+ ++ PY ++
Sbjct: 527 AELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPITFPKSEAQLPPYDDYSMQE---- 582
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
RTY++ +YPFG+GLSY Q K+ +I L Q N+P
Sbjct: 583 --RTYRYMTQEPMYPFGFGLSYAQVKFD--------NITLGNTQAL------ASKNEP-- 624
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIA 704
T + V N G+ + EVV +Y K P + + + G+ R+ +A
Sbjct: 625 -----------QENMTVTVNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLA 673
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
AGQ+ +V F + K L ++ +L G ++++VG G
Sbjct: 674 AGQTEQVLFNI-PKKHLYSINAQGKPVLLKGQYSVIVGNASPG 715
>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
Length = 699
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 257/743 (34%), Positives = 375/743 (50%), Gaps = 111/743 (14%)
Query: 28 LVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPG 87
L+ M+L EK+ M A G+PRLG+P Y WW+EALHGV+ G
Sbjct: 15 LISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNGE---------------- 58
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-LG-------NAGLTFWSPNINVVR 139
AT FP I A+F+E L ++ + +S EARA +N +G + GLTFW+PNIN+ R
Sbjct: 59 ATVFPQAIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFR 118
Query: 140 DPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDL 199
DPRWGR ET GEDP + R YVRGLQ SD L+ +AC KH+A +
Sbjct: 119 DPRWGRGQETYGEDPVLTSRLGTAYVRGLQ---------GSDPYYLRAAACAKHFAVH-- 167
Query: 200 DNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLN 259
EG R F++ V+++D++ET++ F+ V G V SVM +YNRVNG P C LL
Sbjct: 168 SGPEGL-RHTFNAEVSQKDLEETYLPAFKALVKSG-VESVMGAYNRVNGEPACGSTYLLK 225
Query: 260 QTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMG 319
Q +R +W F G++VSDC +I ++HK ND E ++A L++G DL+CGD Y N+
Sbjct: 226 QKLREEWQFQGHVVSDCWAICDFHKNHKVTNDILE-SIALALRSGCDLNCGDAY-NYLAE 283
Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGI 379
AV +G + E DI+ ++ L I L +LG Y+ + + I +H LA EAA + I
Sbjct: 284 AVLKGYVTEDDINRAVVRLLITLDKLGLIHDDGPYQGITIHQIDWKKHDSLALEAAEKSI 343
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---V 436
VLLKN NG LPL I + + GP+A + A++GNY G R + ++ +
Sbjct: 344 VLLKN-NGVLPLKKDKISYIYVTGPNATNSDALLGNYAGVSSRLLTVLEAIVEEAGPEIT 402
Query: 437 INYAPGC--ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV--------- 485
+ Y GC A+ N A K AD T+ V G D SVE E D +
Sbjct: 403 VTYKKGCPLAERRVNPNDW---ASGVTKYADVTIAVMGRDTSVEGEEGDAILSSTYGDFE 459
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
DL L Q ++K+ ++ K P+ +V+M G I + + +IL YPG+ GG A
Sbjct: 460 DLNLNDEQLSYLHKLKESGK-PLIVVLM--GGAPICSPELHEIADAILVAWYPGQAGGTA 516
Query: 546 IADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP--GRTYKFFDGPVVYPFGY 603
+++++FGK NP G+LP+T+ P + L N+ GRTY++ +YPFG+
Sbjct: 517 VSNIVFGKTNPSGKLPVTF--------PKSVRQLPEFENYSMQGRTYRYMTEEPLYPFGF 568
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT+ ++K + G K P LI
Sbjct: 569 GLSYTKMEFK----------------------HVTGRWKSPEKDELI-----------VS 595
Query: 664 IEVENMGKMDGSEVVMVY----SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
E+ N G +DG EVV +Y P + +I ++RV +AAG S F + +
Sbjct: 596 TELYNQGTIDGEEVVQLYYHWKDAPFAVPNW---SLIDFKRVLVAAGASCICEFKI-PLE 651
Query: 720 SLKIVDNAANSLLASGAHTILVG 742
L+ +D + ++ +G VG
Sbjct: 652 KLQCIDPSGKGVIPTGTLQFYVG 674
>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
Length = 721
Score = 371 bits (952), Expect = e-99, Method: Compositional matrix adjust.
Identities = 247/757 (32%), Positives = 372/757 (49%), Gaps = 109/757 (14%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
SI+ +++ +P+ DA+L + +RA DL +R+TL EK M + + VPRLG+ ++WW EAL
Sbjct: 17 SIQAQVT-YPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQWWGEAL 75
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HG + G AT FP I ASF++ L ++ STEARA YN+
Sbjct: 76 HGSARTGL----------------ATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNV 119
Query: 124 G--------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
+ ++ W+PN+N+ RDPRWGR ET GEDPY+ R V GLQ +G
Sbjct: 120 AAKKGYFDTSWSVSLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQGGKGPH 179
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
+ K AC KH+A + W + D V+ +D ET++ F+ V G
Sbjct: 180 KY-------YKAFACAKHFAVHSGPEWNRHS-ISIDD-VSPRDFHETYLPAFKHLVQVGG 230
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
V VMC+YN ++G P C+D +LL Q +R +W F G +VSDC +I I K ++ + D
Sbjct: 231 VKEVMCAYNSIDGEPCCSDQRLLEQLLRDEWGFKGIVVSDCGAIDDIWR--KGFHEVEPD 288
Query: 296 AV---ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DG 350
A AR +K G D+ CG Y + AV+ GK+ E ID SL+ L + M+LG F D
Sbjct: 289 AAHASARAVKGGTDMSCGQTYGSLPE-AVRLGKVTEERIDKSLKRLIVGRMQLGEFDPDS 347
Query: 351 SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
++ + ++ P E+A + AR+ + LL N ALPL+ +K + ++GP+AN +
Sbjct: 348 ITRWNAISMKDVSTPASREVALKMARETMTLLHNPMHALPLSK-QLKQVVVMGPNANDSV 406
Query: 411 AMIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGCADIVCQ---NNSMIPAAIDAAKNAD 465
M GNY GTP + +DG ++ + + GC + N ++ + +
Sbjct: 407 MMWGNYNGTPHHTVTILDGIRRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLVEEVGDN 466
Query: 466 ATVI--------VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
TVI + G L VEA +G DRV + LP Q E+I + A K +++++
Sbjct: 467 KTVIFVGGISPQLEGEQLEVEAKGFKGGDRVTIELPQVQREMIAALHAAGK---QVIMVN 523
Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
I +IL YPGE GG A+ADV+FG YNP G+LP+T+Y +
Sbjct: 524 CSGSAIGLVPEVTHTDAILQAWYPGERGGEAVADVLFGDYNPAGKLPVTFYRDD------ 577
Query: 575 TSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDI 634
+ +P N RTY++F G ++PFG+GLSYT FK
Sbjct: 578 SQLPDYLDYNMRNRTYRYFKGKPLFPFGHGLSYTSFK----------------------- 614
Query: 635 NYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQ 694
I K ++ K T + V+N GK DG EVV +Y IK
Sbjct: 615 ---------------IGKAKMRNGKLT--VSVKNTGKRDGEEVVQLYISCLDDPNGPIKS 657
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
+ G++R+ + AG+ V + KS + D N++
Sbjct: 658 LRGFKRMALQAGEQRTVTLNLPR-KSFERFDEQTNTI 693
>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
Length = 734
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 261/760 (34%), Positives = 382/760 (50%), Gaps = 113/760 (14%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+F + D + +RAK LV +TL EK+ M D + + RL +P Y WW+E LHGV+ G
Sbjct: 38 NFEWFDTNKSFEKRAKALVASLTLEEKISLMVDQSAPIDRLNIPEYNWWNECLHGVARNG 97
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN- 125
R AT FP I A+F++ L K+ +STEARA +N +GN
Sbjct: 98 R----------------ATVFPQAIGLAATFDQDLIFKVADAISTEARAKFNASIAIGNR 141
Query: 126 ---AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
AGLTFW+PNIN+ RDPRWGR ET GEDPY+ + +N+V+GLQ +
Sbjct: 142 GKYAGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSQIGVNFVKGLQG---------NHP 192
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
+ LK +AC KHYA + + R FD+ +++DM ET++ FE V E V VM +
Sbjct: 193 KYLKSAACAKHYAVH---SGPEELRHEFDAIASKKDMAETYLPAFEALVKEAKVEGVMGA 249
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
YNRVNG CA P LL + ++ W F GYIVSDC ++ + + HK + T E++ A L
Sbjct: 250 YNRVNGEGACASPYLLEKLLKDTWGFKGYIVSDCWALSDLHKFHK-VTQTAEESAAAALN 308
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GL+++CG+ Y GA++QG +E +D L+ + +LG+FD S Y + +
Sbjct: 309 VGLNVNCGNVYPALD-GAIKQGLTSEKQLDNVLQHQLLTRFKLGFFDPSNNNPYNKITTD 367
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + H +A EAA++ IVLLKN+N L ++K++ + GP+A ++GNY G
Sbjct: 368 VVDSEAHRAIALEAAQKSIVLLKNNNNLL-PLKKDLKSVYVAGPNAAREDVLLGNYYGVT 426
Query: 421 CRYTSPMDGFYAYSKV-----INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
+ + +DG SKV INY G +N + I + AD +IV GL
Sbjct: 427 SKTQTILDGI--VSKVSAGTSINYKQGLLPF-QKNVNPIDWSTGEISRADVGIIVMGLSG 483
Query: 476 SVEAE---------GKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAKN 525
+ E E DRVD+ LP Q + I K+ G P+ LV+ G I +
Sbjct: 484 NYEGEEGEAIASESKGDRVDIRLPQNQIDYIKKIKAKNTGNPLVLVL--TGGSPIAMPEV 541
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+ +I++ YPGEEGG+A+AD++FG P G+LPIT+ P + L P N++
Sbjct: 542 YDLVDAIVFAWYPGEEGGQAVADILFGDVVPSGKLPITF--------PKSVDDLPPYNDY 593
Query: 586 P--GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
GRTYK+ +PFG+GLSYT FKY
Sbjct: 594 AMKGRTYKYMTKTPQFPFGFGLSYTSFKY------------------------------- 622
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF 702
D++K K +F I N G +D EV VY S P G + ++G+ RV
Sbjct: 623 -------DNLKVYKEKASFSI--TNNGNVDAEEVAQVYVSSPNAGKGDPLNTLVGFTRVS 673
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ AG + +V + K+ D+ + G +TI VG
Sbjct: 674 LKAGATKQVSIPFSK-KAFVQFDSDGKEITRKGTYTIHVG 712
>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 742
Score = 369 bits (946), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 254/727 (34%), Positives = 366/727 (50%), Gaps = 99/727 (13%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ D ER KDLV R+TL EK QM A + RLG+ Y WW+EALHGV+ GR
Sbjct: 38 YPFQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGILPYSWWNEALHGVARTGR 97
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
AT FP + A+F+E L +IGQ +S EA A YN+
Sbjct: 98 ----------------ATVFPENVGLAATFDEDLVYRIGQAISDEAWAKYNIAQRLENYG 141
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
+G+TF++PN+N+ RDPRWGR ET GEDP++ R + YV+G+Q +D +
Sbjct: 142 QYSGITFYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQG---------NDPK 192
Query: 184 PLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
LK +AC KHY + G + R +D+ +D ET++ FE V EG V SVMC
Sbjct: 193 YLKTAACAKHYVVHS-----GPEALRHSYDAEPPMKDFMETYVPAFETLVKEGKVESVMC 247
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+YNR G P C LL+ +R W F GY+ +DC +IQ H D+ E A A +
Sbjct: 248 AYNRTFGKPCCGSSFLLHDLLREKWGFTGYVTTDCWAIQNFYLHHGAAKDSLE-ACALAI 306
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLG 358
K+G++L+CG+ + N+ AV++G + E ++D +L L RLG FD SP Y +
Sbjct: 307 KSGVNLNCGNEF-NYLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFD-SPNENPYAKIK 364
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ I + Q+I+LA EAA + +VLL+N N LPL ++K+L +VGP+A ++GNY G
Sbjct: 365 EEVIGSQQNIDLAYEAAAKSLVLLQNKNNTLPLKK-DMKSLYVVGPYAANQDILLGNYNG 423
Query: 419 TPCRYTSPMD---GFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVI--VAGL 473
R T+ M G + +NY G NSM + +AA + ++G+
Sbjct: 424 VNSRLTTIMQAIVGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADADAVVAVFGISGV 483
Query: 474 DLSVEAEGK------DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
E E DR+DL LP Q + + ++ K P+ LV+ G I +
Sbjct: 484 FEGEEGESTASTSRGDRLDLNLPQNQLDYLRELKKKCKKPIILVL--TGGSPICTPELAD 541
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
+ +IL+V YPG+EGG A+ADVIFG NP GRL IT+ ++ + +P + G
Sbjct: 542 MVDAILFVWYPGQEGGHAVADVIFGDVNPSGRLCITFPKS------VSQLPAFEDYSMKG 595
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
RTY++ +YPFG+GLSYT + Y S D K Q + T
Sbjct: 596 RTYRYMTEEPLYPFGFGLSYTNYSY----SNIKTDKDKIKKGQSVHVTAT---------- 641
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAG 706
V N GK G EV +Y + A T + + G +RV +AAG
Sbjct: 642 ------------------VSNTGKTAGEEVAQLYITDVKASAPTPLYALKGTKRVKLAAG 683
Query: 707 QSAKVGF 713
+S +V F
Sbjct: 684 ESKEVSF 690
>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
Length = 721
Score = 368 bits (944), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 243/728 (33%), Positives = 365/728 (50%), Gaps = 74/728 (10%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAY-------GVPRLGLPLYEWWSEAL 63
D P+CD L + +RA DL +R+TL E QQ+ ++ GVPRLGL Y + +E L
Sbjct: 41 DLPFCDLSLDFRDRAWDLAQRLTLDELAQQLNTYSFTPQAYAPGVPRLGLRNYSYHAEGL 100
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
HG+ N P AT +P V A+ N SL ++ + TE RA+ N
Sbjct: 101 HGIR-DANVVNYP-----------ATLYPQVTAMAATANASLIHEMSTIMGTELRAVNNR 148
Query: 123 -------LGNAG-LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
G G L+ + P +N++RD RWGR E+ EDP++ G YA+N+V GL+
Sbjct: 149 AQELGEIFGRGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGLE----- 203
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGN-DRFHFDSRVTEQDMQETFILPFEMCVNE 233
R+S S+ L+ + CKH AY + + R F++ + E D+ +T++ F CV
Sbjct: 204 --QRNS-SKYLQAATSCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTYLPAFRACVEL 260
Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
G V +MCSYN VNGIP CA + N +R W F G IVSDCD++ I +H + T
Sbjct: 261 GHVQQIMCSYNSVNGIPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIYNTHNY-TRTP 319
Query: 294 EDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGS 351
EDAV L+ G DLDCGD+Y+ AVQQ A + S+ + + LG F D S
Sbjct: 320 EDAVTVALQGGCDLDCGDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFLLGEFDPDTS 379
Query: 352 PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
Y+ LG+ I P + + A+R+ +VLL+N LP+ +AL+GP+ N T
Sbjct: 380 VPYRQLGREAIDTPFARDSSLRASRESVVLLENRIKLLPVTLSADIKVALIGPYVNLTTI 439
Query: 412 MI-GNYEGTPCRYTSPMDGFYAYSKV-INYAPGCADIVCQNNSMIPAAIDAAKNADATVI 469
M+ G + TP T+ GF A + +PGC +I + A+ A AD V+
Sbjct: 440 MMGGKLDYTPSFITTYFQGFQAIGITHLTSSPGC-NITAPLPGALDKAVQIATQADLVVL 498
Query: 470 VAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA-AKGPVTLVIMSAGAVDINFAKNN-P 527
GL +E EG DR L LP Q +L + ++ A + +V+++ G V ++ K
Sbjct: 499 TLGLSSDIEHEGGDRETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGPVSVDRIKYGIA 558
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRP--VNN 584
+ +I+ Y G+ G A+A+ IFG+ NP G LP T + +N +P+T M LRP
Sbjct: 559 RTPTIIEAFYGGQSAGTALAETIFGQNNPSGTLPYTVFFSNITAHVPFTDMHLRPDAATG 618
Query: 585 FPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
FPGRT++FFD PV++PFG+GLSY+ F +D+ I T G P
Sbjct: 619 FPGRTHRFFDAPVMWPFGHGLSYSTFSLAW------------QDETVPSI--TTGDFTQP 664
Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFI 703
L+ + + V N G + G + +Y + P ++ ++G ++ ++
Sbjct: 665 ---TLMHQL--------LSVNVTNHGPLPGRRALHLYVTVPVTNVSVPLRNLVGLQKHWL 713
Query: 704 AAGQSAKV 711
A QS V
Sbjct: 714 AVDQSMTV 721
>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
FD-1]
Length = 690
Score = 366 bits (939), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 242/750 (32%), Positives = 367/750 (48%), Gaps = 118/750 (15%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L ERA+DL R+TL E+ Q+ A V RL +P Y WWSE LHGV+ G
Sbjct: 4 YKDKSLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAGT-- 61
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A F+E K+G + EARA YN +A
Sbjct: 62 --------------ATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIY 107
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GL WSPN+N+ RDPRWGR ET GEDPY+ R + + +GLQ + L
Sbjct: 108 KGLCLWSPNVNIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQ----------GEGEVL 157
Query: 186 KISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K +AC KH A + G + R FD+ + +DM+ET++ FE V E V VM +Y
Sbjct: 158 KTAACAKHLAVH-----SGPEAIRHEFDAVASPKDMEETYLPAFEALVKEAKVEGVMGAY 212
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRVNG P CA L+ + +W F GY VSDC +I+ +H + T ++ A LK
Sbjct: 213 NRVNGEPACASKFLMGKL--DEWGFDGYFVSDCWAIRDFHTNH-MVTKTAPESAAMALKL 269
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
G DL+CG+ Y + + A +G I + DI + L +RLG FD +Y L + +
Sbjct: 270 GCDLNCGNTYLHL-LHAYNEGLINDEDIKKACTHLMRTRVRLGMFDDETEYDKLDYSIVA 328
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
N ++ A + + + +V+LKN NG LPL+ IKT+ ++GP+A++ A+ GNY G RY
Sbjct: 329 NEENKAYARKCSERSMVMLKN-NGILPLDPSKIKTIGVIGPNADSRPALEGNYNGRADRY 387
Query: 424 TSPMDGFY-AYSKVINYAPG-------CADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
+ ++G A+ + Y+ G C + ++ + A I +++D V+ GLD
Sbjct: 388 ITFLEGIQDAFGGRVLYSEGSHLYKDRCMGLAVADDRLSEAEI-VTEHSDVVVLCVGLDA 446
Query: 476 SVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
++E E D+ DL LP Q +L+ V K PV +V + A+++
Sbjct: 447 TIEGEEGDTGNEFSSGDKNDLRLPEAQRKLVETVMRKGK-PVIIVTAAGSAINV-----E 500
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNF 585
+++ YPG+ GG A+AD++FGK +P G+LP+T+Y + K+P +T ++
Sbjct: 501 ADCDALIHAWYPGQFGGTALADILFGKISPSGKLPVTFY-TDTTKLPEFTDYSMK----- 554
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
GRTY++ ++YPFGYGL+Y++ +
Sbjct: 555 -GRTYRYTQDNILYPFGYGLTYSKTE---------------------------------- 579
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAA 705
+ D+K ++ K + ++V N G D +VV Y K G + G+ RVF+
Sbjct: 580 ----VSDLKFENGKAS--VKVTNTGDFDTEDVVQFYIKGEGSDYVPFYSLCGFRRVFLKK 633
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASG 735
G+S V T+ + +N S A G
Sbjct: 634 GESTVVEVTLGDSAFEAVDENGRRSRSAKG 663
>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 747
Score = 365 bits (938), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 247/744 (33%), Positives = 368/744 (49%), Gaps = 100/744 (13%)
Query: 2 FESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG-------VPRLGLP 54
F I L DFP+ + LP+ ER DLV R+TL E V QM G + RLG+
Sbjct: 15 FSLISTILGDFPFRNTSLPWSERVDDLVGRLTLEEIVLQMSRGGTGSNGPAPPIDRLGIG 74
Query: 55 LYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
Y W +E LHG D ATSFP A+F+ L ++I +
Sbjct: 75 PYSWNTECLHG----------------DVAAGPATSFPQAFGLAATFDAVLIEQIANATA 118
Query: 115 TEARAMYNL--------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVR 166
E RA YN + GL+ +SP IN+ R P WGR+ ET GEDPY+ G A +YV
Sbjct: 119 YEVRAKYNNYAKHKEYGDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASYVN 178
Query: 167 GLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILP 226
GLQ + R + +A CKH+ AY + R FD++V+++D++ TF+
Sbjct: 179 GLQG---------NHPRYVTANAGCKHFDAYAGPEDIPSSRSTFDAKVSDRDLRMTFLPA 229
Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESH 286
F C+ G S+MCSYN +NG+P CA+ KLL +R +WNF GY++SD +++ + ++H
Sbjct: 230 FHECIQAG-THSLMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAH 288
Query: 287 KFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVL 342
+ D + A+A V +GL+L+ D T AV+QG + + + L+
Sbjct: 289 HYTKDMLDTAIACV-NSGLNLELSSNLEDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTR 347
Query: 343 MRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTL 399
MRLG FD P+ Y L + I + +H EL+ +AA + VLLKN+N LPL I L
Sbjct: 348 MRLGEFD-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKL 405
Query: 400 ALVGPHANATKAMIGNYEGTPCRYT-SPMDGFYAYSKVINYAPGCADIVCQ--NNSMIPA 456
A+VGP A+ A+ G+Y TP YT +P +G + +YA GC + C+ ++ + +
Sbjct: 406 AVVGPLADNVDALYGDYSATPNNYTVTPRNGLARLAGNTSYASGCDNPKCRKYDSGQVKS 465
Query: 457 AIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
A+ AD V+ G +E+EG DR +L LPG Q L+ PV L++ +AG
Sbjct: 466 AVSG---ADMVVVCVGTGTDIESEGNDRHELALPGKQLSLLQDAVKFGTKPVILLLFNAG 522
Query: 517 AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG---KYNPGGRLPITWYEANYVKIP 573
+D+++A NP +++I+ +P + G A+ + + NP GRLP+TW + P
Sbjct: 523 PLDVSWAVENPAVQTIVACFFPAQATGDALYRMFMNTSPESNPAGRLPMTWPRSMEQVPP 582
Query: 574 YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
T ++ GRTY++ D ++PFG+GLSYT FKY S+ +V D
Sbjct: 583 MTDYTMK------GRTYRYSDADPLFPFGFGLSYTLFKYYNTSASPTVIKSCD------- 629
Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK 693
T + V N+G G EV+ VY + T K
Sbjct: 630 -------------------------TVTIPLTVTNVGDFPGDEVMQVYISWSNASVTVPK 664
Query: 694 -QVIGYERVF-IAAGQSAKVGFTM 715
Q++G+ RV I SA V F +
Sbjct: 665 LQLVGFRRVREIEPSASATVHFAV 688
>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
Length = 696
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 201/490 (41%), Positives = 297/490 (60%), Gaps = 22/490 (4%)
Query: 275 DCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTS 334
DCD++ I ++ + + EDAVA VLKAG+D++CG Y T A+QQ K++E DID +
Sbjct: 221 DCDAVSIIYDAQGYAK-SPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRA 279
Query: 335 LRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
L L+ V +RLG F+G P Y N+ N +C+P H LA +AAR GIVLLKN+ LP
Sbjct: 280 LLNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPF 339
Query: 392 NTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNN 451
+ ++ +LA++GP+A+ K ++GNY G PC+ +P+D +Y K Y GC + C +N
Sbjct: 340 SKRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVAC-SN 398
Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
+ I A+ AKNAD V++ GLD + E E DRVDL LPG Q ELI VA+AAK PV LV
Sbjct: 399 AAIDQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLV 458
Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK 571
++ G VDI+FA NN KI SI+W GYPGE GG AI+++IFG +NPGGRLP+TWY ++V
Sbjct: 459 LICGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVN 518
Query: 572 IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS-SPKSVDIKLDKDQQ 630
I T M +R +PGRTYKF+ GP VY FG+GLSY+ + Y+ + + ++ + K Q
Sbjct: 519 IQMTDMRMRSATGYPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQSKAQT 578
Query: 631 CRD-INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GI 687
D + YT+ + + C K +EVEN G+M G V+++++ G
Sbjct: 579 NSDSVRYTLVSE--------MGKEGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGGE 630
Query: 688 AGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVG 746
G KQ++G++ + ++ G+ A++ F + C+ L + +L G + + VG+
Sbjct: 631 DGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDS-- 688
Query: 747 GVSFPLQLNL 756
PL +N+
Sbjct: 689 --ELPLIVNV 696
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 106/198 (53%), Positives = 134/198 (67%), Gaps = 13/198 (6%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C LP +RA+DLV R+T+ EK+ Q+ + A G+PRLG+P YEWWSEALHGV++ G
Sbjct: 36 YQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAYEWWSEALHGVAYAG- 94
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTF 130
PG F+ V ATSFP VILT ASF+ W +I Q + EAR +YN G A G+TF
Sbjct: 95 -----PGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNAGQANGMTF 149
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKIS 188
W+PNIN+ RDPRWGR ETPGEDP + G YA+ YVRGLQ +G R + S L+ S
Sbjct: 150 WAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDG----RKTLSNHLQAS 205
Query: 189 ACCKHYAAYDLDNWEGND 206
ACCKH+ AYDLD W+ D
Sbjct: 206 ACCKHFTAYDLDRWKDCD 223
>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
Length = 658
Score = 364 bits (934), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 249/700 (35%), Positives = 361/700 (51%), Gaps = 71/700 (10%)
Query: 56 YEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVST 115
Y WWSEAL+ F S ATSFP I A+F++ L I +ST
Sbjct: 1 YNWWSEALN----------------FSS----ATSFPAPITMGATFDDGLIHAIATVIST 40
Query: 116 EARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
EARA N+ GL F++PNIN +DPRWGR ETPGEDP+ + +Y V GLQ G
Sbjct: 41 EARAFNNVNRGGLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQGGVG-- 98
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
LKI+A CKH+AAYDL+N G RF FD++VT QD+ E + F+ C+ +
Sbjct: 99 ------PTNLKIAADCKHWAAYDLENL-GVSRFEFDAKVTMQDLAEFYSPSFQSCIRDAK 151
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF--HGYIVSDCDSIQTIVESHKFLNDTK 293
V+S+MCSYN VNGIP+CA+ LL R W +I DC ++ I H + +D
Sbjct: 152 VASIMCSYNAVNGIPSCANRYLLQTLARDFWGLGEEQWITGDCGAVGNIFARHHYTDD-P 210
Query: 294 EDAVARVLKAGLDLDC---GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
+ A L AG D+DC Y+ A+ + ++E + T++ Y L+RL + D
Sbjct: 211 ANGTAVALNAGTDIDCDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQYNSLVRLSWDD- 269
Query: 351 SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
+ +LA +AA +GIVLLKND G LPL + ++K +A+VGP ANAT
Sbjct: 270 -----------VNTEPAQQLAYQAAVEGIVLLKND-GILPLAS-SVKKVAVVGPMANATT 316
Query: 411 AMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
M NY G SP F + +A G + + S AAI AA +AD V
Sbjct: 317 QMQSNYNGIAPFLVSPQQAFRNAGFNVTFANGTG-LNSSDTSGFSAAIAAADDADVVFYV 375
Query: 471 AGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
G+D ++E E +DR ++ G Q L+ ++A K P+ ++ M G VD + ++N +
Sbjct: 376 GGIDTTIEREDRDRPEISWTGNQLALVQQLASLGK-PLIVLQMGGGQVDSSSLRDNTSVN 434
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPVNNFPGRT 589
+++W GYPG+ GG A+ D+I GK P GRLPIT Y A+YV P T M LRP ++ PGRT
Sbjct: 435 ALIWGGYPGQSGGTALVDLITGKQAPAGRLPITQYPASYVDGFPMTDMTLRPSSSNPGRT 494
Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
YK++ G ++ FG+GL YT F + AS S ++ D ++ + V
Sbjct: 495 YKWYTGAPIFEFGFGLHYTTFDAEWASGGDSFSVQ-DLVSSAKN------------SGVA 541
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERV-FIAAGQ 707
D+ D TF + V N G + V +++S+ G + K+++ Y RV I G
Sbjct: 542 HVDLGVLD---TFNVTVTNSGTVASDYVALLFSRTTAGPSPAPNKELVSYTRVKGIEPGA 598
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
S+ + ++ D N +L G + +L+ G G
Sbjct: 599 SSAASLKVT-LGAVARTDEQGNRVLYPGEYVLLLDTGAEG 637
>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
Length = 511
Score = 364 bits (934), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 197/487 (40%), Positives = 285/487 (58%), Gaps = 20/487 (4%)
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEAD 330
Y+ SDCD++ TI ++H + + ED VA +KAG+D++CG+Y M AVQ+G + E D
Sbjct: 16 YVASDCDAVATIRDAHHY-TLSPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEKD 74
Query: 331 IDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
ID +L L+ V MRLG+FDG P+ Y +LG ++C+P H LA EAA+ GIVLLKND
Sbjct: 75 IDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDA 134
Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY-SKVINYAPGCAD 445
GALPL + +LA++GP+A+ A+ GNY G PC T+P+ G Y + GC
Sbjct: 135 GALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDS 194
Query: 446 IVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAK 505
C + A A ++D V+ GL E EG DR LLLPG Q LI VA+AA+
Sbjct: 195 PACAVAATN-EAAALASSSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAAR 253
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
PV LV+++ G VD+ FAK+NPKI +IL GYPG+ GG AIA V+FG +NP GRLP+TWY
Sbjct: 254 RPVILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWY 313
Query: 566 EANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASS---PKS 620
+ K+P T M +R P +PGR+Y+F+ G VY FGYGLSY++F ++ SS +
Sbjct: 314 PEEFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNA 373
Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTFQIEVENMGKMDGSEV 677
++ L R G + ++ L+ ++ +C F +EV+N G MDG
Sbjct: 374 GNLSLLAGVMAR----RAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHS 429
Query: 678 VMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGA 736
V++Y + P G +Q+IG+ + G+ A V F ++ C+ V ++ GA
Sbjct: 430 VLMYLRWPTKSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGA 489
Query: 737 HTILVGE 743
H ++VG+
Sbjct: 490 HFLMVGD 496
>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
(Beta-D-glucoside glucohydrolase) [Treponema
azotonutricium ZAS-9]
Length = 706
Score = 363 bits (933), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 250/755 (33%), Positives = 378/755 (50%), Gaps = 110/755 (14%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
R K+++ +MTL EKV Q+ A V G+P Y WW+E LHGV+ G
Sbjct: 6 RIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAGL------------ 53
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGNA----GLTFWSPNI 135
AT FP I A+F+E+ + + +S E RA YN GN GLTFW+PN+
Sbjct: 54 ----ATVFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNV 109
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDPY+ GR + +++GLQ D+ LK++AC KHYA
Sbjct: 110 NIFRDPRWGRGQETYGEDPYLTGRIGLAFMKGLQ---------GDDTEHLKVAACAKHYA 160
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
+ + R FD+ V+++D+ ET++ F++ V G V +VM +YNR G P
Sbjct: 161 VH---SGPEKLRHTFDAVVSKKDLFETYLPAFKLLVENG-VEAVMGAYNRTLGEPCGGST 216
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL + +RG W F G++ SDC +I+ E+HK + + E++ A L AG DL+CG Y
Sbjct: 217 YLLKEILRGRWGFKGHVTSDCWAIRDFHENHK-VTKSPEESAAMALNAGCDLNCGCTYPY 275
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAE 373
T+ + ++G + + IDT+L L +LG FD Q Y+NLG + + +H LA E
Sbjct: 276 LTV-SHKKGLVTDETIDTALTRLLRTRFKLGLFDPPEQDPYRNLGNDIVGCEKHRNLALE 334
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA++ IVLLKND+ LPL+ K L L+GP A ++ NY G R + ++G
Sbjct: 335 AAQKSIVLLKNDSNILPLDDSARKIL-LMGPGAANILTLLANYYGMSSRLVTILEGLAEK 393
Query: 434 SKV-----INYAPGCADIVCQNNSMIP---------AAIDAAKNADATVIVAGLDLSVEA 479
K Y G + S +P A I D + V GLD S+E
Sbjct: 394 IKTKTAISFEYRQGSLMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDLVIAVYGLDGSMEG 453
Query: 480 E---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
E DR + LP +Q + ++ A K +V++ G I F ++
Sbjct: 454 EEGDSIASDANGDRDTIELPSWQLNFLRRIRKAGK---KVVLILTGGSPIAFPED--LAD 508
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
++L+ YPGE+GG A+AD++FG +P G+LPIT+ ++ PY L+ GRTY
Sbjct: 509 AVLFAWYPGEQGGNAVADILFGDVSPSGKLPITFPQSTAQLPPYDDYALK------GRTY 562
Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
++ +YPFG+GLSYT F++ SV++ K + G
Sbjct: 563 RYMKETPLYPFGFGLSYTSFRF------DSVELSSSK--------ISAG----------- 597
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSA 709
+ VK K ++V N GK D EVV +Y +K + G+ R+ I AG+SA
Sbjct: 598 NSVKAK-------VQVSNTGKRDAEEVVQLYIAKDNRSEDEPASSLRGFRRLKILAGKSA 650
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
V + A + + ++ S+L G++T++ +
Sbjct: 651 SVEIELPAS-AFETINAEGASVLIPGSYTVIAADA 684
>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
Length = 713
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 239/772 (30%), Positives = 386/772 (50%), Gaps = 117/772 (15%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
+AK +V++MT+ EK+ Q+ A + RL +P Y +W+EALHGV+ G
Sbjct: 13 QAKVIVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAGV------------ 60
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNI 135
AT FP I A+F++ L I + TE RA YN GLTFWSPN+
Sbjct: 61 ----ATVFPQAIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNV 116
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDP++ ++ + +++GLQ ++ LK++A KH+A
Sbjct: 117 NIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGLQ----------GQAKYLKLAATAKHFA 166
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
+ EG R FD+ V+++D+ ET++ F+ V E DV S+M +YN V+G+P
Sbjct: 167 VHS--GPEGL-RHGFDAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSE 223
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL + W+F G++VSD + + + E+HK+ D E + +KAGL+L G +
Sbjct: 224 MLLRDILHDKWSFEGHVVSDYMAPEDVHENHKYTKDAAE-TMGLAIKAGLNLVAG-HIEQ 281
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAA 375
A+ +G + E +I ++ LY +RLG F +Y + H L+ AA
Sbjct: 282 SLHEALNRGLVTEEEITNAVISLYATRVRLGMFATDNEYDAIPYEANDTKAHNNLSEIAA 341
Query: 376 RQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY-- 433
+ VLLKND G LPL ++ +A+VGP+A++ A++GNY GTP R + ++G
Sbjct: 342 EKSFVLLKND-GVLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLG 400
Query: 434 -SKVINYAPG-------CADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
++Y+ G A+ + + + AI AA+++D V V GLD ++E E
Sbjct: 401 DDVRVHYSIGSGVFQDHAAEPLAKADERESEAIIAAEHSDVIVAVLGLDSTIEGEEGDAG 460
Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
D+ +L LPG Q +L+ ++ K PV +++ S ++ ++ +N+P +++I+ +
Sbjct: 461 NSQGAGDKPNLSLPGRQRQLLERLLAVGK-PVVVLLASGSSLQLDGLENHPNLRAIMQIW 519
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
YPG GG A+ADV+FG +P G+LP+T+Y+ ++P N GRTY++
Sbjct: 520 YPGARGGLAVADVLFGTVSPSGKLPVTFYKNT------DNLPAFEDYNMAGRTYRYMTEE 573
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFGYGL+Y+ +V + D++ K
Sbjct: 574 ALYPFGYGLTYS--------------------------------------SVELSDLQVK 595
Query: 657 DYK--FTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
Y+ T + ++N G D EVV VY K Q+ G++RVF+ G + F
Sbjct: 596 SYEETATATVTIQNTGNFDTDEVVQVYVKDLESEFAVPNAQLKGFKRVFLGKGSKQTITF 655
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGE--------GVGGVSFPLQLNLN 757
+ + ++ D ++ + S I VG + GV PLQ LN
Sbjct: 656 DLR-PQDFEVFDEQGHNFIDSNRFEISVGVSQPDARSIALTGVQ-PLQTELN 705
>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 449
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 182/450 (40%), Positives = 283/450 (62%), Gaps = 21/450 (4%)
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNN 361
+D++CG+Y N+T AV++ K++E++ID +L L+ + MRLG F+G+P Y ++ +
Sbjct: 1 MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+C+ +H +A EAAR GIVLLKN N LPL+ +LA++GP+A+ + ++GNY G PC
Sbjct: 61 VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ +P G Y K Y PGC+ + C +++ I AI AK AD V+V GLD + E E
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVAC-SSAAIDQAIKIAKEADQVVLVMGLDQTQEREE 179
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DRVDL+LPG Q ELI VA AAK PV LV++ G VDI+FAK + I ILW GYPGE
Sbjct: 180 HDRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEA 239
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVY 599
GG A+A++IFG +NPGGRLP+TWY ++ K+P T M +R P + +PGRTY+F+ G V+
Sbjct: 240 GGIALAEIIFGNHNPGGRLPVTWYPQDFTKVPMTDMRMRPQPSSGYPGRTYRFYKGKKVF 299
Query: 600 PFGYGLSYTQFKYKVAS-SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK---C 655
FGYGLSY+ + Y++ S + + ++ DQ+ N P I +++ C
Sbjct: 300 EFGYGLSYSNYSYELVSVTQNKISLRSSIDQKAE--------NSSPIGYKTISEIEEELC 351
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSK--PPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
+ KF+ + V+N G+M G V+++++ PG +G IK++I ++ V + AG++A++ +
Sbjct: 352 ERSKFSVTVRVKNQGEMTGKHPVLLFARQDKPG-SGGPIKKLIAFQSVKLNAGENAEIEY 410
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGE 743
+N C+ L + ++ G+ +LVG+
Sbjct: 411 KVNPCEHLSRANEDGLMVMEEGSQYLLVGD 440
>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 704
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 220/619 (35%), Positives = 327/619 (52%), Gaps = 67/619 (10%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
+A LV +M L EK + + + RLG+P Y WWSEALHGV+ G
Sbjct: 8 KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGV------------ 55
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNI 135
AT FP I A F+E +I ++TEARA YN G+T W+PNI
Sbjct: 56 ----ATVFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNI 111
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDPY+ R + ++ GLQ E Y K +AC KH+A
Sbjct: 112 NIFRDPRWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHHY--------WKAAACAKHFA 163
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
+ E R HFD+ V+++D+ ET++ FE V +G V+ +M +YNRVNG P C
Sbjct: 164 VHSGPEEE---RHHFDAVVSKKDLYETYLPAFEAAVTKGKVAGMMGAYNRVNGEPACGSK 220
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL ++ +W F GY+VSDC +I+ H + T ++ A + G L+CG+ Y +
Sbjct: 221 VLLQDILKEEWGFDGYVVSDCWAIRDFHTEH-MVTHTATESAALAINNGCQLNCGNTYLH 279
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG-KNNICNPQHIELAAEA 374
+ A ++G + E I S + L + M+LG FD + +Y + + N C H ++A +
Sbjct: 280 M-LQAYKEGLVTEETITKSAQKLMAIRMKLGLFDKNCEYNKIPYEVNDCKV-HRDIALDV 337
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY- 433
AR+ +VLLKN NG LPLN K + ++GP AN+ + GNY GT RYT+ ++G Y
Sbjct: 338 ARRSMVLLKN-NGILPLNLKQTKAIGVIGPTANSRTVLQGNYFGTASRYTTFLEGIQDYV 396
Query: 434 --SKVINYAPGCADI------VCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
+ + YA GC + N + A+ A+ +D ++ GLD S+E E
Sbjct: 397 GDAARVYYAEGCHLFKNSISGLSWENDRLSEALIVAEQSDVVILCLGLDASIEGEQGDTG 456
Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
D+ DL L G Q L+ +V K P L++ S A+ I+ A+ ++IL
Sbjct: 457 NAFAAGDKSDLNLIGRQQLLLEEVLKIGK-PTILILSSGSAMAIHTAQE--YCEAILETW 513
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
YPG+ GG+A+A ++FG+Y+P G+LPIT+Y+ +P + GRTY++
Sbjct: 514 YPGQSGGKALAQLLFGEYSPSGKLPITFYKTT------EELPDFRDYSMAGRTYRYMKNE 567
Query: 597 VVYPFGYGLSYTQFKYKVA 615
+YPFGYGL+Y + + K A
Sbjct: 568 ALYPFGYGLNYAKVEVKDA 586
>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
Length = 698
Score = 362 bits (929), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 238/749 (31%), Positives = 370/749 (49%), Gaps = 107/749 (14%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA++LVERM LP+ + Q+ A + LG+P Y WW+E LHG + G
Sbjct: 5 QRAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSGT----------- 53
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
AT FP I + F+ + VSTE RA YNL GLT WSPN
Sbjct: 54 -----ATVFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPN 108
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR ET GEDPY+ R A+ ++RGLQ + LK ++C KH+
Sbjct: 109 VNIFRDPRWGRGQETFGEDPYLTARLAVAFIRGLQ----------GEGPVLKTASCVKHF 158
Query: 195 AAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
AA+ G + R F++ V ++D++ET++ F V E +VM +Y+ +N P C
Sbjct: 159 AAHS-----GPEPLRHGFNAVVGKKDLEETYLPAFASAVKEAKADAVMGAYSALNDEPCC 213
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
A L+ +T+R W F G +SDC +I+ +HK + +E++ A LK G DL CG
Sbjct: 214 ASSFLMEETLRLRWGFEGMYISDCWAIRDFHLNHK-VTKNEEESAALALKRGCDLACGCE 272
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
Y + A Q+G I I + + +LG FD Y LG ++ + +H LA
Sbjct: 273 YQSLEK-AFQKGLITREQIKKAAIRVMTTRFKLGQFDQGTAYDTLGLESLDSDEHAALAF 331
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA 432
EA+ + +VLLKND LPL + LA++GP+A++ +A+ GNY GT RY + ++G
Sbjct: 332 EASCRSLVLLKND-ALLPLKKEAVSCLAVIGPNADSRQALWGNYHGTSSRYVTILEGLRD 390
Query: 433 Y---SKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE--- 480
Y S I Y+ G + + +++ + A+ AK +D V+ GL+ +VE E
Sbjct: 391 YVGSSTRILYSEGSNLTKNKVERLAKDDDRLSEAVFMAKASDVVVLCLGLNETVEGEMHD 450
Query: 481 ------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
D+ DL LP Q +L+ VA+ K P+ +V++S G++D + +K+++
Sbjct: 451 DGNGGWAGDKDDLRLPLCQRKLLKAVAETGK-PIIVVLLSGGSLDPEI-EQYANVKALIQ 508
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
YPG+EGG+AIA +++G P G+LP+T+Y+A P+T L RTY++ D
Sbjct: 509 AWYPGQEGGKAIAHLLYGALCPSGKLPVTFYKAEAKLPPFTDYSL------IRRTYRYCD 562
Query: 595 GP-VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
P V+YPFG+GLSY F + ++++ + T + AA ++
Sbjct: 563 DPDVLYPFGFGLSYASFSFCLSAAQE--------------------TEQNGVAATVL--- 599
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
V N +D VV +Y G + G + V + AG+ ++ F
Sbjct: 600 ------------VRNTSALDARTVVQLYLAMEGKDLPPHPVLCGMKSVHLKAGEETQITF 647
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
+ K V N G +T+ G
Sbjct: 648 ILEE-KQFTAVQEDGNRYAVRGGYTLYAG 675
>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
Length = 786
Score = 362 bits (929), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 265/773 (34%), Positives = 367/773 (47%), Gaps = 104/773 (13%)
Query: 45 AYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE---VPGATSFPTVILTTASF 101
A G RLGLP Y WWSE LHGV+ PG F++ ATSF I ASF
Sbjct: 8 ALGASRLGLPKYAWWSEGLHGVA-------GSPGVKFNTTGYPFSYATSFANAINLGASF 60
Query: 102 NESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYA 161
++ L ++G +STEARA N G GL +W+PN+N +DPRWGR ETPGEDP + Y
Sbjct: 61 DDDLVYEVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYV 120
Query: 162 INYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQE 221
+ GL+ E V K+ A CKHYAAYDL+ W G R+ F++ VT QD+ E
Sbjct: 121 KAILAGLEGNETVR----------KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSE 170
Query: 222 TFILPFEMCVNEGDVSSVMCSYNRV-----------------NGIPTCADPKLLNQTIRG 264
++ PF+ C + V S+MCSYN + P CA P L+ +R
Sbjct: 171 YYLPPFQQCARDSKVGSIMCSYNALTIRDMASGKPDEEINLTTAQPACAKPYLMT-ILRD 229
Query: 265 DWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT--MG 319
WN+ + YI SDC++I + + + T +A A KAG D C + T +G
Sbjct: 230 HWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVG 289
Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFD---------------GSPQYKNLGKNNICN 364
A Q + EA IDT+LR LY L+R GY D SP Y L ++
Sbjct: 290 AYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNT 349
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
P ELA +A +GIVLLKN LPL+ K +AL+G ANAT M G Y G P Y
Sbjct: 350 PSTQELALRSATEGIVLLKNAGSLLPLDFSG-KKVALIGHWANATGTMRGPYSGIPPFYH 408
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
+P+ + +YA G ++ A+ AA+ AD + G D +V +E DR
Sbjct: 409 NPLYAAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDR 468
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
+ P Q +L++++A K P+ +VI VD + NN + SILWVGYPG+ GG
Sbjct: 469 ESIAWPETQMQLLSELAGLGK-PL-VVIQLGDQVDDSSLLNNGNVSSILWVGYPGQSGGT 526
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVN-------------------- 583
A+ DV+ GK P GRLP+T Y YV ++P T M LRP N
Sbjct: 527 AVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNYSSSSNLEQEVSVQGRGSLT 586
Query: 584 ------------NFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQC 631
+ PGRTYK++ PV+ PFGYGL YT F ++ S +
Sbjct: 587 IQPRSTPGNKTLSSPGRTYKWYSSPVL-PFGYGLHYTTFNVSLSLSSSNASSSSSSPSFS 645
Query: 632 RDINYTVGTNKPPCAAVLIDDVK-CKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAG 689
T PC A +D + + N G VV+++ S G
Sbjct: 646 IPSLLT------PCTATHLDLCPFSPSANSALSVSITNTGTHTSDYVVLLFLSGEFGPKP 699
Query: 690 THIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
+K ++ Y+RV I G++ V + ++ VD N++L G + +V
Sbjct: 700 YPLKTLVSYKRVKDIKPGETVTVKDVPVSLGAISRVDGDGNTVLYPGTYRFVV 752
>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
18P13]
Length = 697
Score = 361 bits (927), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 221/623 (35%), Positives = 338/623 (54%), Gaps = 80/623 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + L ERA+DL +R+T+ E+ Q+ A +PRLG+P Y WW+E LHGV+ G
Sbjct: 9 YLNPSLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAGT-- 66
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+F+ +L +IG+ +TEARA +
Sbjct: 67 --------------ATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIY 112
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+PNIN+ RDPRWGR ET GEDP++ R + +V+G+Q + + L
Sbjct: 113 KGLTLWAPNINLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQ----------GEGKVL 162
Query: 186 KISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K +AC KH+A + G + R FD++V+ +D++E+++ F V E V VM +Y
Sbjct: 163 KAAACAKHFAVHS-----GPEALRHSFDAQVSPKDLEESYLPAFHALVAEAKVEGVMGAY 217
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRVNG P+CA P L+++ + W F GY VSDC +IQ + H + E A A L+
Sbjct: 218 NRVNGEPSCASPMLMDKLHQ--WGFAGYFVSDCWAIQDFHKHHGVTKNVTESA-ALALRT 274
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
G DL+CG+ Y + + A+++G I ADI + + +RLG FD P + + I
Sbjct: 275 GCDLNCGNTYL-YVLAALEEGLIDAADIRRACIRVLRTRIRLGLFDPEPHFAACTYDTIA 333
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+P H ++ A + +VLLKND G LPL+ + +A++GP+A++ A+ GNY GT RY
Sbjct: 334 SPAHKAVSLSCAEKSMVLLKND-GILPLDLSKLHAIAVIGPNADSRAALEGNYCGTADRY 392
Query: 424 TSPMDGFY-AYSKVINYAPGCADIVCQNNSMIPAAID-------AAKNADATVIVAGLDL 475
+ ++G A+ ++YA GC + S + A D AA+ +D ++ GLD
Sbjct: 393 VTFLEGIQDAFPGRVHYAQGC-HLYKDRTSNLAMADDRYAEALAAAEASDVVILCLGLDA 451
Query: 476 SVEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
++E E D+ DL LP Q +L+ K+ K PV LV+ + A+ N
Sbjct: 452 TLEGEEGDTGNEFSSGDKADLRLPPPQCKLLEKLHAVGK-PVILVLAAGSAL-------N 503
Query: 527 PKIK--SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNN 584
P+I ++L YPG+ GG+A+A ++FGK +P G+LP+T+YE +T ++
Sbjct: 504 PEISCNAVLQAWYPGQCGGQALAHILFGKVSPSGKLPVTFYETAEQLPDFTDYSMQ---- 559
Query: 585 FPGRTYKFFDGPVVYPFGYGLSY 607
RTY++ V+YPFGYGL+Y
Sbjct: 560 --NRTYRYARNNVLYPFGYGLTY 580
>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
Length = 691
Score = 361 bits (926), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 225/623 (36%), Positives = 335/623 (53%), Gaps = 72/623 (11%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L ERA+ L + MT E+ Q+ A V RLG+P Y WW+E +HG++ G
Sbjct: 4 YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A F++ L KK + S EARA YN +
Sbjct: 62 --------------ATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIY 107
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+PNIN+ RDPRWGR ET GEDPY+ + + VRGLQ D + +
Sbjct: 108 KGLTLWAPNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQ----------GDGKVI 157
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K +AC KH+A + + R FD++ +DM+ET++ FE V E V SVM +YNR
Sbjct: 158 KAAACAKHFAVH---SGPEAIRHSFDAKANAKDMEETYLPAFEALVKEAKVESVMGAYNR 214
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG P CA L+++ +W F GY VSDC +I+ E+H + E + A LKAG
Sbjct: 215 VNGEPACASNFLMDKL--KEWEFDGYFVSDCWAIRDFHENHMVTANAIE-STAMALKAGC 271
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
D++CG Y N + A+++G + + DI T+ L +RLG FD +Y ++ + +
Sbjct: 272 DVNCGCTYQNLLV-ALEKGAVTKEDIRTACVHLMRTRIRLGMFDKKTEYDDIPYDKVACK 330
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H ++ E A + +V+L+N NG LP++T KT+A++GP+A++ A+ GNY G RYT+
Sbjct: 331 EHKAISLECAEKSLVMLEN-NGILPVDTSKYKTIAVIGPNADSRTALEGNYNGLSDRYTT 389
Query: 426 PMDGFY-AYSKVINYAPGC------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
++G + + +A GC + Q A+ AAK AD T++ GLD ++E
Sbjct: 390 FLNGIQDRFDGRVIFAEGCHLYKDRVSNLAQAGDRYAEAVAAAKFADMTILCLGLDATIE 449
Query: 479 AE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
E D+ L LP Q EL+ K+ K PV V+ + A++ K
Sbjct: 450 GEEGDTGNEFSSGDKNGLTLPPPQRELVKKIMAVGK-PVVTVVCAGSAIN-----TESKP 503
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFPGR 588
+++ YPG EGG+A+A+V+FG +P G+LP+T+YE + K+P +T ++ GR
Sbjct: 504 DALIHAFYPGAEGGKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK------GR 556
Query: 589 TYKFFDGPVVYPFGYGLSYTQFK 611
TY++ V+YPFGYGL+Y K
Sbjct: 557 TYRYTTENVLYPFGYGLTYGSVK 579
>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
Length = 761
Score = 359 bits (922), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 263/801 (32%), Positives = 374/801 (46%), Gaps = 149/801 (18%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ Y D P RAK L+ +++L EK + + V RLG+ Y WWSEALHGV+
Sbjct: 27 QEISYTDKSQPAELRAKALLPKLSLEEKAGLVQYNSPAVERLGIKAYNWWSEALHGVARN 86
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
G AT FP I ASF+ + + VS EAR +
Sbjct: 87 G----------------SATVFPQPIGMAASFDVEKIETVFTAVSDEARVKNRIAAEDGR 130
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
AGL+FW+PNIN+ RDPRWGR +ET GEDPY++G+ + VRGLQ D D
Sbjct: 131 VYQYAGLSFWTPNINIFRDPRWGRGMETYGEDPYLMGQLGMAVVRGLQG--------DPD 182
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
+ LK AC KHYA + E N R FD++V+E+D++ET++ F+ V + V VM
Sbjct: 183 ADVLKTHACAKHYAVH--SGLESN-RHRFDAQVSERDLRETYLPAFKDLVTKAGVKEVMT 239
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVAR 299
+YNR G P A L+ + +R +W + G +VSDC +I E H F+ T E+A A
Sbjct: 240 AYNRFRGYPCAASEYLVQKILREEWGYKGLVVSDCWAIPDFFEPGRHGFVA-TGEEAAAL 298
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK 359
+ GLD++CG ++ A+ QG + E D+D +L + RLG DG + +L
Sbjct: 299 AVANGLDVECGSTFSKIP-AAIDQGLLKEEDLDRNLLRVLTERFRLGEMDGESPWDDLDP 357
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ P+H L+ + AR+ +VLL+N NG LPL G + +AL+GP+A+ + GNY
Sbjct: 358 AIVEGPEHRALSLDIARETMVLLRN-NGVLPLKAG--EKIALIGPNADDAQMQWGNYNPV 414
Query: 420 PC-------------------RYTSPMDGFY-----AYSKVI------------NYAPGC 443
P R +D Y AY+ +I YA
Sbjct: 415 PKSTITLLQAMQARVPGLVYDRACGILDAEYAPQGSAYANLIGASEAQLEAAARRYAVSV 474
Query: 444 ADIVC-------QNNSMIPAAIDAA-----KNADATVIVAGLDLSVEAE----------G 481
DI Q S +PA +AA + D V G+ +E E G
Sbjct: 475 NDIKNYIRRDEEQRRSFMPALDEAAVLKKLEGVDVVVFAGGISPRLEGEEMRVQVPGFSG 534
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR D+ LPG Q L+ + DA K V LV S A I +IL YPG+E
Sbjct: 535 GDRTDIELPGVQRRLLKALHDAGK-KVVLVNFSGCA--IGLVPETESCDAILQAWYPGQE 591
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPF 601
GG AIADV+FG NP G+LP+T+Y+ N ++P N G TY++F G +YPF
Sbjct: 592 GGTAIADVLFGDVNPSGKLPVTFYK-NVDQLPDVED-----YNMEGHTYRYFRGEPLYPF 645
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
GYGLSYT F + PK VK K+
Sbjct: 646 GYGLSYTSFAF---GEPK---------------------------------VKGKN---- 665
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
+I+V N G + G+EVV +Y + P +K + + RV + AGQ+ KV ++ L
Sbjct: 666 LEIDVTNTGSVAGTEVVQLYVRKPDDTAGPVKTLRAFRRVSVPAGQTVKVSIPLDKETFL 725
Query: 722 KIVDNAANSLLASGAHTILVG 742
+ + + G + +L G
Sbjct: 726 WWSEKDQDMVPVRGRYELLCG 746
>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
FGSC 2508]
Length = 788
Score = 359 bits (921), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 263/777 (33%), Positives = 371/777 (47%), Gaps = 110/777 (14%)
Query: 45 AYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSE---VPGATSFPTVILTTASF 101
A G R+GLP Y WWSE LHGV+ PG F++ ATSF I ASF
Sbjct: 8 ALGASRIGLPKYAWWSEGLHGVA-------GSPGVTFNTTGYPFSYATSFANAINLGASF 60
Query: 102 NESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYA 161
++ L ++G +STEARA N G GL +W+PN+N +DPRWGR ETPGEDP + Y
Sbjct: 61 DDDLVYEVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYV 120
Query: 162 INYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQE 221
+ GL+ E V K+ A CKHYAAYDL+ W G R+ F++ VT QD+ E
Sbjct: 121 KAMLAGLEGNETVR----------KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSE 170
Query: 222 TFILPFEMCVNEGDVSSVMCSYNRV-----------------NGIPTCADPKLLNQTIRG 264
++ PF+ C + V S+MCSYN + P CA+ L+ +R
Sbjct: 171 YYLPPFQQCARDSKVGSIMCSYNALTIRDMAGGNPDEIINLTTAQPACANTYLMT-ILRD 229
Query: 265 DWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT--MG 319
WN+ + YI SDC++I + + + T +A A KAG D C + T +G
Sbjct: 230 HWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVG 289
Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFD---------------GSPQYKNLGKNNICN 364
A Q + EA IDT+LR LY L+R GY D SP Y L ++
Sbjct: 290 AYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNT 349
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYT 424
P ELA +A +GIVLLKN LPL+ + K +AL+G ANAT M G Y G P Y
Sbjct: 350 PSTQELALRSATEGIVLLKNSGSLLPLDFSSGKKVALIGHWANATGTMRGPYSGIPPFYH 409
Query: 425 SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
+P+ + +YA G ++ A+ AA+ AD + G D +V +E DR
Sbjct: 410 NPLYAAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDR 469
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
+ P Q +L++++A K P+ +VI VD +F N + SILWVGYPG+ GG
Sbjct: 470 ESIAWPKAQMKLLSELAGLGK-PL-VVIQLGDQVDDSFLLENGNVSSILWVGYPGQSGGT 527
Query: 545 AIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNF------------------ 585
A+ DV+ GK P GRLP+T Y YV ++P T M LRP N+
Sbjct: 528 AVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNHSSSTSSSSNPEEEVSVQGS 587
Query: 586 ------------------PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
PGRTYK++ PV+ PFGYGL YT F +V + L
Sbjct: 588 GSLTIQPRSTPGNKTLSSPGRTYKWYSNPVL-PFGYGLHYTTF---------NVSLSLSS 637
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVK-CKDYKFTFQIEVENMGKMDGSEVVMVY-SKPP 685
+ ++++ + PC A +D I + N G V +++ S
Sbjct: 638 NASSPSPSFSIPSLLTPCTATHLDLCPFSPSANSALSISITNTGTHTSDYVALLFLSGEF 697
Query: 686 GIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
G +K ++ Y+RV I G++ V + ++ VD N++L G + V
Sbjct: 698 GPKPYPLKTLVSYKRVKDIKPGETVTVKDVPVSLGAISRVDGDGNTVLYPGTYRFAV 754
>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
Length = 750
Score = 357 bits (917), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 387/774 (50%), Gaps = 104/774 (13%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM----GDLAYGVPRLGLPLY 56
RF L FP+ + LP R DL+ R+T+ + + Q G G+ RLG+
Sbjct: 26 RFAPSSHALDSFPFRNVSLPIETRLNDLISRLTIEDAINQTVARYGKFTPGIERLGIKPI 85
Query: 57 EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTE 116
E+ +E L GV RR N AT FP + ASF+ L +++ VS E
Sbjct: 86 EYITECLRGV----RREN-------------ATGFPQALGLAASFSRDLMQRVATAVSVE 128
Query: 117 ARAMYN-------LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
RA YN G G+T +SP IN++R P WGR ET GEDPY+ G A YV GLQ
Sbjct: 129 VRAFYNHDIQRETYGAHGITCFSPVINILRHPLWGRNQETYGEDPYLSGELASQYVSGLQ 188
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
D R L++SA CKH+ A+ + +F FD+++ E+D+Q TF+ F+
Sbjct: 189 G---------DDPRYLRVSAGCKHFDAHGGPDTIPVRKFGFDAKIEERDLQMTFLPAFKK 239
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
C+ +VMCS+N +NG+P+CA+ +LL +R W + G++VSD +++ I H +
Sbjct: 240 CI-AAKPYNVMCSFNSINGVPSCANKRLLTDVLRAQWGYEGFVVSDDAAVEYIFTEHHY- 297
Query: 290 NDTKEDAVARVLKAGLDLD-CGDY---YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRL 345
N + E A +K+G +++ G + Y T A+ + I + ++ ++R +++ L
Sbjct: 298 NSSFETAAVEAIKSGCNMELVGKFDPSYWQLT-KALNEHLITKDELMENVRPVFLTRFLL 356
Query: 346 GYFD--GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
G FD + + K+ + + +H LA EAA + VLLKND LPL ++KT+A+VG
Sbjct: 357 GEFDPPALNPFNQITKDVVLSAEHQRLALEAAVKSFVLLKNDRNFLPLLKNSLKTVAVVG 416
Query: 404 PHANATKAMIGNY--EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQN--NSMIPAAID 459
P +N T +IG+Y + P +P+ G + + +A GC++ C + + + AA+D
Sbjct: 417 PMSNYTDGLIGDYSTDTDPSLILTPLHGIKKLAPNVQFASGCSNSTCTDYRATDVAAAVD 476
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAV 518
A+ + G VEAE DR D++LPG Q +L+ A G PV L++ + G +
Sbjct: 477 GAQ---VVFVALGTGFIVEAENNDRSDIVLPGAQLQLLKDAVYHANGRPVVLLLFNGGPL 533
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIF---GKYNPGGRLPITWYEANYVKIP-Y 574
D+ FA+ I SI+ +P G AI ++ G +P GRLP+TW A ++P
Sbjct: 534 DVTFAQLTSGIVSIVECFFPAMMTGEAIYRMLINNEGISSPAGRLPLTW-PAYLNQVPNI 592
Query: 575 TSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDI 634
T ++ GRTY+++ +YPFGYGLSYTQFKY S K +++ K Q+ R
Sbjct: 593 TDYTMK------GRTYRYYTEDPLYPFGYGLSYTQFKY---SDLKVTPLEVTKGQEIR-- 641
Query: 635 NYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV-VMVYSKPPGIAGTHIK 693
+++V N+G D EV ++V T I
Sbjct: 642 ---------------------------VKVKVTNIGLYDADEVRIIVVQAYVSWPKTEIP 674
Query: 694 ----QVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
Q++ ++R+ IA+G+S V T+ A L++ N + G T+ +G
Sbjct: 675 VPRWQLVAFDRIHIASGKSETVELTIEA-SLLEVWQNPETGFDILEGEMTLYIG 727
>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
Length = 805
Score = 355 bits (912), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 252/768 (32%), Positives = 381/768 (49%), Gaps = 86/768 (11%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPR-----LGLPLYEWWSEALHG 65
+ P+C+ L +R +DL+ R+ L EK + A PR +GLP Y W + +HG
Sbjct: 34 ELPFCNTSLSTADRVEDLLSRLPLQEKATLL--TARASPRGNMSSIGLPEYNWGANCVHG 91
Query: 66 V-SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
V S G TN P TSFP + A F+ + + Q + E RA++ G
Sbjct: 92 VQSTCG--TNCP------------TSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEG 137
Query: 125 ---------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
+ GL WSPNIN+ RDPRWGR ETP EDP V +Y + Y RGLQ+
Sbjct: 138 ATENYKGGPHLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQE----- 192
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
+ D R L+ KHYAAY +N+ G +R FD+ V+ D +T+ F V +G+
Sbjct: 193 -GKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGN 251
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
VMCSYN VNGIP CA+ +L+ +RG F GY+ SD +++ I + H + D++ +
Sbjct: 252 AKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYA-DSQCE 310
Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQ 353
A + AG D++ G Y V ++ E +D +LR + LG FD
Sbjct: 311 AARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFDPIDDQP 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
Y N+ + + L+ A R+ +V+L+N+ LPL G LA++GPHA + + ++
Sbjct: 371 YWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQKG--VKLAVLGPHAKSKRGLL 428
Query: 414 GNYEGTPCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKN 463
GNY G C +P+D A + N +A GC I + + A+ AAK
Sbjct: 429 GNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTTFAEGCG-ISGNSTAGFEKAVAAAKE 487
Query: 464 ADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
ADA V+ G+D S+E E DR ++ LP Q +L+ +V A G T+V++ G V I
Sbjct: 488 ADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRVH--AVGRPTVVVLINGGV-IGAE 544
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPV 582
+ + +++ YPG G RA+ADV+FG NP G+LP+T Y ++YV ++ SM +
Sbjct: 545 EIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSMDMTA- 603
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
PGRTY++F G V+PFG+GLSYT F V S N + +N
Sbjct: 604 --HPGRTYRYFKGEPVFPFGWGLSYTTFSLSVDSG----------------TNSSSHSNN 645
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-----PGIAGTHIKQVIG 697
+ + D T + V+N G++ G EVV+ + +P G A +Q+
Sbjct: 646 AAFSGGEVSDTA----NVTISVVVKNDGEVAGDEVVLAFFRPVNSNVTGPATLLNEQLFD 701
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
Y+RV + S +V FT+ +L + D N G++ ++V GV
Sbjct: 702 YQRVSLGPLDSTEVSFTIER-STLALPDEEGNLASFPGSYEVIVSNGV 748
>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
Length = 452
Score = 355 bits (910), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 188/456 (41%), Positives = 269/456 (58%), Gaps = 16/456 (3%)
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNN 361
+D++CG Y + A+QQGKI E DI+ +L L+ V MRLG F+G P+ Y ++G +
Sbjct: 1 MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGA--LPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+C +H +LA EAA+ GIVLLKND GA LPL+ N+ +LA++G +AN + GNY G
Sbjct: 61 VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120
Query: 420 PCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
PC +P+ Y K ++ GC C N + IP A+ AA +AD+ V+ GLD E
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAAC-NVTTIPEAVQAASSADSVVLFMGLDQDQER 179
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E DR+DL LPG Q LI VA+AAK PV LV++ G VD++FAK NPKI +ILW GYPG
Sbjct: 180 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 239
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR--PVNNFPGRTYKFFDGPV 597
E GG AIA V+FG++NPGGRLP+TWY ++ ++P T M +R P +PGRTY+F+ GP
Sbjct: 240 EAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRFYRGPT 299
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
V+ FGYGLSY+++ ++ A+ P + + T G I C
Sbjct: 300 VFNFGYGLSYSKYSHRFATKPPPT----SNVAGLKAVEATAG-GMASYDVEAIGSETCDR 354
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGI---AGTHIKQVIGYERVFIAAGQSAKVGFT 714
KF + V+N G MDG V+V+ + P +G Q+IG++ + + A Q+A V F
Sbjct: 355 LKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEFE 414
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
++ CK ++ G+H ++VGE +SF
Sbjct: 415 VSPCKHFSRATEDGRKVIDQGSHFVMVGEDEFEMSF 450
>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
Length = 748
Score = 354 bits (909), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 252/764 (32%), Positives = 379/764 (49%), Gaps = 106/764 (13%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQM---GDLAYG----VPRLGLPLYEWWSEALH 64
FP+ D LP+ R +DL+ R+T E V QM G L G +PRLG+ Y W +E L
Sbjct: 27 FPFRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWNTECLR 86
Query: 65 GVSFIGRRTNSPPGTHFDSEVPG-ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
G D+E PG AT+FP + A+F+ L ++ +TE RA +N
Sbjct: 87 G----------------DAEAPGWATAFPQALGLAAAFSPELVYRVANATATEVRAKHNS 130
Query: 124 --------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
+ GL+ +SP +N++R P WGR ET GEDPY+ A ++V+GLQ
Sbjct: 131 FVAAGRYDDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQG----- 185
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
R +K SA CKH++ + R FD++V E+D TF+ F+ CV G
Sbjct: 186 ----QHPRYIKASAGCKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFLPQFQACVRAGS 241
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
S MCSYNR+NG+P CA+ KLL +RG+W F GY+VSD +++ I+ H++ + E
Sbjct: 242 YS-FMCSYNRINGVPACANKKLLTDILRGEWGFEGYVVSDEGAVELILLGHRYTHTFLET 300
Query: 296 AVARVLKAGLDLDCGDYYTNFTM----GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS 351
A+A V AGL+L+ N A+ G I + +R L+ +RLG FD
Sbjct: 301 AIASV-NAGLNLELSYGMRNNVFMHIPKALAMGNITLEMLRDRVRPLFYTRLRLGEFDPP 359
Query: 352 PQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
Y L + + + +H L+ EAA + VLLKN LPL + K LA+VGP A+
Sbjct: 360 AMNPYNALELSVVQSSEHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKRLAVVGPFADNP 419
Query: 410 KAMIGNYEGTP-CRYT-SPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADAT 467
+ + G+Y P +Y +P G +++A GC + C S +A + AD
Sbjct: 420 RVLFGDYAPVPEPQYIYTPRRGLQTLPANVSFAAGCREPRCWVYSRDEVE-NAVRGADVV 478
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAKNN 526
++ G + VE E +DR DL LPG Q +L+ AA G PV L++ +AG +D+++A+ +
Sbjct: 479 LVCLGTGIDVEMEARDRKDLSLPGHQLQLLQDAVRAAAGHPVILLLFNAGPLDVSWAQLH 538
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKY--NPGGRLPITWYEANYVKIPYTSMPLRPVNN 584
+ +IL +P + G AIA V+ GK +P GRLP TW A ++P P+ N
Sbjct: 539 DGVGAILACFFPAQATGLAIASVLLGKQGASPAGRLPATW-PAGMHQVP-------PMEN 590
Query: 585 F--PGRTYKFF--DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
+ GRTY+++ + P +YPFGYGLSYT F Y+ D+ L
Sbjct: 591 YTMEGRTYRYYGQEAP-LYPFGYGLSYTTFHYR--------DLVLS-------------- 627
Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK--PPGIAGTHIKQVIGY 698
PP + C + + + +EN G D EVV +Y + P + Q++ +
Sbjct: 628 --PPVLPI------CAN--LSVSVVLENTGPRDSEEVVQLYLRWEQPSVPVPRW-QLVAF 676
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
RV + AG + K+ F + A + + L GA T+ G
Sbjct: 677 RRVAVPAGGATKLSFGVTAAQRAVWMQQWH---LEPGAFTLFAG 717
>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 748
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 254/773 (32%), Positives = 369/773 (47%), Gaps = 112/773 (14%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGD-------LAYGVPRLGLPLYEWWSEAL 63
+FP+ D LP ER KD+V++++L + V+QM A G+P+ + Y+W +E L
Sbjct: 26 EFPFRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECL 85
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
G D ATSFP I ASFN L K++ + E RA
Sbjct: 86 SG----------------DVNAGDATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTA 129
Query: 124 G--------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
+ GL+ WSP +N++RDPRWGR ET GEDPY+ G +V GLQ
Sbjct: 130 AVLNGSYAFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQG----- 184
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D + +A CKH+ + R FD+ VT D + TF+ F+ CV G
Sbjct: 185 ----DDPTYVIANAGCKHFDVHGGPEDTPLPRASFDANVTMIDWRMTFLPQFKACVEAGA 240
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLND---- 291
+S +MCSYNR+NG+P CA+ KLL +R +WNF GY+VSD +++ IV H + D
Sbjct: 241 LS-LMCSYNRINGVPACANKKLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAPDFVTA 299
Query: 292 -TKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF-- 348
L+ G G + AV++G ++ + ++ L+ V +LG F
Sbjct: 300 AADAANAGTCLEDGNSEGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDP 359
Query: 349 -DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA---LPLNTGNIKTLALVGP 404
D + Y N+ + I + +HI+L+ +AA + IVL+KNDN LPL + K +VGP
Sbjct: 360 PDNNNPYANIPLSIIQSDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAADDFKKACVVGP 419
Query: 405 HANATKAMIGNYEGTPCR--YTSPMDGFYAY---SKVINYAPGCAD-IVCQNNSMIPAAI 458
M G+Y T +P+ G S ++NY GC D C+
Sbjct: 420 FIENADTMFGDYSPTMMTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPACEIYDGYKVRT 479
Query: 459 DAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAA-KGPVTLVIMSAGA 517
A + D ++ AGL +E EG D D+ LPG Q L+ A+ P+ L++ +A
Sbjct: 480 -ACEGVDLVIVTAGLSRYLEHEGHDISDIYLPGHQMSLLTDAESASGSAPIILLLFNANP 538
Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP---- 573
+DI++AK+NP+ +IL YPG+E G AIA+V+ G YNP GRLP TW A+ ++P
Sbjct: 539 LDISYAKSNPRFAAILEAYYPGQEAGVAIANVLTGSYNPAGRLPNTW-PASLDQVPDMID 597
Query: 574 YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
YT RTY++F +YPFGYGLS+T F Y D
Sbjct: 598 YT---------MKERTYRYFTQEPLYPFGYGLSFTTFNYS-------------------D 629
Query: 634 INY--TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH 691
+N T TN AV + V N G MDG EV Y K +A
Sbjct: 630 LNVASTANTNGEGSIAV--------------SVTVMNTGTMDGDEVTQAYVKWDNVAEAP 675
Query: 692 IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS--LLASGAHTILVG 742
Q++G R FI+ GQS V FT+ + L++ N + + G +++ VG
Sbjct: 676 NIQLVGVSRKFISKGQSITVSFTIKP-EQLQVWINGDDGKWSIPGGTYSLFVG 727
>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
Length = 722
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 239/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R K L+++MTL EK Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDPY+ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---------GDHPAYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E DV SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEADVQSVMTAYNAFNGVPPSGSR 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL + +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVKQGLISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---F 430
AA + +VLLKN+N LPL+ K++A+VGP A+ +G Y G P + + G
Sbjct: 383 AAVKSVVLLKNEN-LLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSITLLKGVKDL 439
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+NY G I +S++ A K D ++ G D + E D + LP
Sbjct: 440 MGKRGKVNYLNG---IGASRDSIVA----AVKGVDVVLVALGSDEKMARENHDMTSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ + + LV S + +A + I +I+ YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKAIYQ-VNPRIVLVFHSGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y+ +P + + + GRTY++ G +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYSFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F++ + ++ D QC +E+ N
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
G++ G EVV VY S+ T+ +K+++ +++V +A+G+ KV FT+ A + L + ++
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
+L SG +T+ +G G G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710
>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 833
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 274/807 (33%), Positives = 378/807 (46%), Gaps = 136/807 (16%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD+ P+RA LVE++T+ EK+ + D + G PRLGLP Y WWSE LHGV+
Sbjct: 37 CDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHGVA------- 89
Query: 75 SPPGTHFDSE---VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
PG F++ ATSF VI A+ ++ L ++G +STEARA G GL +W
Sbjct: 90 GSPGVVFNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFGGLDYW 149
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
+PNIN +DPRWGR ETPGEDP + Y V GL+ V K+ A C
Sbjct: 150 TPNINPYKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEGNGTVR----------KVIATC 199
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC---------- 241
KH+AAYDL+ W G R+ FD+ V+ QD+ E ++ PF+ C + V S+MC
Sbjct: 200 KHFAAYDLERWRGLTRYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSIMCRYVSFFLPPF 259
Query: 242 ----------------------SYNRVNGIPTCADPKLLNQTIRGDWNF---HGYIVSDC 276
SYN +NG P CA L+ +R WN+ + YI SDC
Sbjct: 260 PSFPRLVTRQSGNQVDIVDNFRSYNALNGTPACASTYLMTNILRDHWNWTNHNNYITSDC 319
Query: 277 DSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAVQQGKIAEADID 332
++IQ + + + T +A A AG D C YT+ +GA Q ++E+ ID
Sbjct: 320 NAIQDFLPDNHNFSQTPAEAAAAAYIAGTDTVCEVSGWPPYTD-VVGAYNQSLLSESVID 378
Query: 333 TSLRFLYIVLMRLGYFD-GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
T+LR LY L+R GY D G P + K +P LPL
Sbjct: 379 TALRRLYEGLIRAGYLDHGRPASSSPDKAPFSSPDF---------------------LPL 417
Query: 392 N-TGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQN 450
+ TG KT+AL+G ANAT+ + G Y G P Y +PM YA G
Sbjct: 418 DLTG--KTVALIGHWANATRTIRGPYSGLPPFYHNPMYAVRQLKLSFYYANGPVVNSTDA 475
Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTL 510
++ AA+ AA++AD + G D +V +E DR + P Q LI K+A K V
Sbjct: 476 DTWTAAAMLAAESADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIEKLAQVGKPMV-- 533
Query: 511 VIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV 570
VI VD NN I SILWVGYPG+ GG A+ DV+ GK GRLP+T Y A YV
Sbjct: 534 VIQLGDQVDDTPLLNNKNISSILWVGYPGQSGGTAVFDVLTGKKASAGRLPVTQYPAGYV 593
Query: 571 -KIPYTSMPLRPVNNF----------------------------------PGRTYKFFDG 595
++P T M LRP N+ PGRTYK++
Sbjct: 594 DEVPLTEMGLRPFNHSSSTTSSDVSQSGVEEGNGLTIQTRSTRGNKTLSSPGRTYKWYPR 653
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
PV+ PFGYGL YT F ++ S S + D I + + C A+ +D
Sbjct: 654 PVL-PFGYGLHYTPFNISLSLS-TSSNASSTTDNTSISIRSLLTSQT--CTAIHLDLCPF 709
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF-IAAGQSAKVG- 712
F + + N G V +++ S G +K ++GY+RV I G++ VG
Sbjct: 710 S----PFSVSITNTGSHTSDYVALLFLSGKFGPKPDPLKTLVGYKRVKDIKPGETRVVGG 765
Query: 713 --FTMNACKSLKIVDNAANSLLASGAH 737
+N ++ VD N++L G +
Sbjct: 766 EDIPVN-LAAVARVDGNGNTVLYPGTY 791
>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
Length = 722
Score = 351 bits (900), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 240/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R + L+++MTL EKV Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDP++ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E + SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL+ +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA + +VLLKND LPLN IK++A+VGP A+ +G Y G P S + G
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ G V N M +A A K AD ++ G D + E D + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ K+ + LV + + +A + I +I+ YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKKIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y+ +P + + + GRTY++ G +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F++ + ++ D QC +E+ N
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
G++ G EVV VY S+ T+ +K+++ +++V +A+G+ KV FT+ A + L + ++
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
+L SG +T+ +G G G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710
>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
Length = 722
Score = 351 bits (900), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 240/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R + L+++MTL EKV Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDP++ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E + SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL+ +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA + +VLLKND LPLN IK++A+VGP A+ +G Y G P S + G
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ G V N M +A A K AD ++ G D + E D + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ K+ + LV + + +A + I +I+ YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKKIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y+ +P + + + GRTY++ G +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F++ + ++ D QC +E+ N
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
G++ G EVV VY S+ T+ +K+++ +++V +A+G+ KV FT+ A + L + ++
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
+L SG +T+ +G G G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710
>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
bacterium]
Length = 780
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 252/809 (31%), Positives = 368/809 (45%), Gaps = 149/809 (18%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
++ + LS PY D LP ERAKDLV R+TL EK + V LG+ Y WWSEAL
Sbjct: 36 AVTLSLSAQPYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEAL 95
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ G AT FP I ASF+E L ++ VS EAR Y +
Sbjct: 96 HGVARNG----------------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKI 139
Query: 124 GN--------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
G+TFW+PNIN+ RDPRWGR +ET GEDPY+ G+ + VRGLQ
Sbjct: 140 AKESGHIGQYQGVTFWTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQG----- 194
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
SDS LK AC KHYA + W +R +D+ V+E+D++ET++ F+ V + +
Sbjct: 195 ---PSDSPVLKAHACAKHYAVHSGPEW---NRHSYDAEVSERDLRETYLPAFKDLVTKAN 248
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTKE 294
V VM +YNR G P A L+N +RG+W + G I SDC +++ V+ +
Sbjct: 249 VQEVMTAYNRFRGEPCGASDYLINTILRGEWGYKGLITSDCWAVEDFYVQGRHGYSPDVA 308
Query: 295 DAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY 354
A A + AG+D +CG Y + AV++G + E D+D +L L+ +LG D +
Sbjct: 309 SAAAAAVHAGVDTECGQAYRHIPE-AVERGLLDEKDLDRNLIRLFTARYQLGEMDDISLW 367
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
+L + + P+H+ L+ + A++ +VLL+N G LPL +++ +ALVGP+ + + G
Sbjct: 368 DDLPASILEGPEHLALSRKMAQESMVLLQNKGGILPL-APDVR-VALVGPNGDDREMQWG 425
Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQ-------NNSM-------------- 453
NY P R + D I Y GC + + NN +
Sbjct: 426 NYNPVPGRTVTLYDALKERFPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKSREEMEAI 485
Query: 454 ------------------------------IPAAIDAAKNADATVIVAGLDLSVEAE--- 480
+ + + + D + G+ E E
Sbjct: 486 ARQYAIGVQDILNYVRRQERMQASFLPELDVQSVLKELEGIDVVIFAGGISPRFEGEEMP 545
Query: 481 -------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
G DR D+ LP Q +L+ + DA K V LV S A I +IL
Sbjct: 546 VNLPGFKGGDRTDIQLPQVQRDLMKALHDAGK-KVILVNFSGCA--IGLVPETESCDAIL 602
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
YPGEEGG AI DV+FG NP G+LP+T+Y + +P + G TY++F
Sbjct: 603 QAWYPGEEGGLAITDVLFGDVNPSGKLPVTFYRS------VEDLPDFENYDMKGHTYRYF 656
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
G ++PFGYGLSY+ F+YK
Sbjct: 657 KGKPLFPFGYGLSYSTFRYK---------------------------------------- 676
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
+ K + I V+N GK + +EVV VY + G +K + + RV I AG++ KV
Sbjct: 677 RAKVRNNSLIIPVKNTGKREATEVVQVYVRRKGDPDGPVKTLRAFRRVTIPAGKTVKVCI 736
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
+ L + A + + G + +L G
Sbjct: 737 PLEDETFLWWSEEAQDMVPLPGKYELLYG 765
>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
Length = 722
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 242/742 (32%), Positives = 372/742 (50%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R + L+++MTL EKV Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDP++ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E + SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL+ +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA + +VLLKND LPLN IK++A+VGP A+ +G Y G P S + G
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ G V N M +A A K AD ++ G D + E D + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ ++ + LV + + +A + I +I+ YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKEIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y+ +P + + + GRTY++ G +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F++ D Q G + A+L +C +E+ N
Sbjct: 604 FEF-------------DNIQ---------GNDTLQSDAIL----QC-------SVELSNS 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
G++ G EVV VY S+ T+ +K+++ +++V +A+G+ KV FT+ A + L + ++
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
+L SG +T+ +G G G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710
>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
Length = 757
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 257/761 (33%), Positives = 366/761 (48%), Gaps = 93/761 (12%)
Query: 7 VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM-----GDLAYGVPRLGLPLYEWWSE 61
V+ DFP+ D L + +RA DLV R+TL E Q G + RLG+ Y W +E
Sbjct: 15 VQSYDFPFQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITE 74
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
L G + N+ AT++P I ASF+E L + + +S E RA +
Sbjct: 75 CLAG------QVNT-----------NATAYPQPIGMAASFSEELLFNVSRDISYEVRAHW 117
Query: 122 NLGNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEG 173
N A GL+ +SP IN++R P WGR ET GEDP + G A ++VRGLQ
Sbjct: 118 NANRAVGKYSTKVGLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQG--- 174
Query: 174 VEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNE 233
D R L+ +A CKH+ + RF FD++V +D + TF+ F+MCV+
Sbjct: 175 ------DDPRYLRANAGCKHFDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQFKMCVDA 228
Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
G S +MCSYNR+NGIP CA+ +LL R +W FHGYIVSD +I I E H + N T
Sbjct: 229 GSYS-LMCSYNRINGIPACANKQLLTDITRDEWGFHGYIVSDSGAISNIKEQHHYTNSTV 287
Query: 294 EDAVARVLKAGLDLDCG---DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
VA +KAG +L+ G + Y + A++QG + E +I ++R L +RLG FD
Sbjct: 288 ATVVA-AIKAGTNLELGGGSNMYYPKQLDAMKQGLLTEKEIRDNVRPLLYTRLRLGEFDP 346
Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
Y +G + I +P+H E A +AA G VLLKN N LP+ K LA+VGP NA
Sbjct: 347 EAMVDYNKIGVDVIQSPEHREQAVKAAYMGFVLLKNHNNLLPIKKQYSK-LAIVGPFTNA 405
Query: 409 TKAMIGNYEG-TPCRYTSPM-DGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADA 466
T + G Y ++TS + +G A GC + C + + A AD
Sbjct: 406 TSELFGTYSSEVNLKFTSTIFEGLSPLGGSTRSANGCTNSAC-SGYVRDDVETAVAGADL 464
Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAKN 525
++ G E+EG DR L L G Q +++ + G PV LV+++AG +DI +AK
Sbjct: 465 VIVALGSGQRFESEGNDRAYLDLHGHQLDILKDAVFFSNGAPVILVLINAGPLDITWAKL 524
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIF---GKYNPGGRLPITWYEANYVKIPYTSMPLRPV 582
+P + +IL GYP + G A+ + + P GRL TW N ++P +
Sbjct: 525 DPGVTAILSCGYPAQSTGEALRRSLTMSEPQAAPAGRLQATW-PLNLDQVPKITD----- 578
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
GRTY+++ G +YPFG+GLSYT F Y S SV T G N
Sbjct: 579 YTMQGRTYRYYVGEPLYPFGFGLSYTSFSYTRLSISPSV--------------ITQGDN- 623
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERV 701
T ++ ++N G D EVV VY P K + + R
Sbjct: 624 -----------------VTVEVCLKNTGSYDSDEVVQVYMSWPQTPFPLPKWTLAAFARP 666
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
FI+AGQ+ V + A + + + A G T+ G
Sbjct: 667 FISAGQTICVKSVIRADQMAVWLSDDAGFGFVPGVMTVYAG 707
>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 742
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 246/743 (33%), Positives = 371/743 (49%), Gaps = 111/743 (14%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGD-------LAYGVPRLGLPLYEWWSEALH 64
FP+ + L +R KD+V+ +TL E V+QM A G+PRL + Y+W +E L
Sbjct: 24 FPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIPRLHINPYQWGTECLS 83
Query: 65 GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
G G ATSFP I ASFN L K++ + E RA +
Sbjct: 84 GNVSAG----------------DATSFPMPIGMAASFNYDLLKRVTNATAYEVRAKHAAA 127
Query: 125 --------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY 176
+ GL+ WSP +N++RDPRWGR ET GEDPY+ G YV GLQ
Sbjct: 128 VKDGSYAFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGLQG------ 181
Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
++SR + +A CKH+ + RF FD++V+ +D + TF+ F+ CV G +
Sbjct: 182 ---NNSRYIIANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFLPQFKACVEAGAL 238
Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
S +MCSYNR+NG+P CA+ LL +R +W+F GY+VSD +++ IV H + D + A
Sbjct: 239 S-LMCSYNRINGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVIEHHYAPDFMK-A 296
Query: 297 VARVLKAGLDLDCGDYYTNF------TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD- 349
A AG L+ G+ F + AV+ ++ + ++ L+ V M+LG FD
Sbjct: 297 AADAANAGTCLEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLFYVRMKLGEFDP 356
Query: 350 -GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA----LPLNTGNIKTLALVGP 404
+ Y N+ + I + HI L+ +AA + IVL+KND+G LP+ T +K +VGP
Sbjct: 357 PDNNPYANIPLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPI-TNEVKKACMVGP 415
Query: 405 HANATKAMIGNYEGTPCR--YTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAID 459
++ + + G+Y T R + + G + +NYA GC D N
Sbjct: 416 FSDDPEVLFGDYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGPACRNYDSAKVRS 475
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAK-GPVTLVIMSAGAV 518
A + ++ AGL +E+EGKD D+ LPG Q +L+ A+K V L++ +A +
Sbjct: 476 ACDGVELIIVTAGLSKHLESEGKDLSDINLPGHQLDLMQDAEAASKNASVILILFNASPL 535
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSM 577
DI +AK +P+I IL YPG+ G+AIA+V+ G+YNP GRLP TW A+ ++P T+
Sbjct: 536 DIRYAKTDPRIVGILEAYYPGQTAGKAIANVLTGEYNPSGRLPNTW-PASLDQVPGITNY 594
Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
++ RTY++F +YPFGYGLSYT F Y N
Sbjct: 595 TMKE------RTYRYFTQEPLYPFGYGLSYTTFHYS---------------------NLN 627
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-----SKPPGIAGTHI 692
+ + A +I + V N G MDG+EV VY S P +
Sbjct: 628 ISSTATASGAGMI----------AVSVLVTNTGSMDGTEVTQVYVWCNISYAPKL----- 672
Query: 693 KQVIGYERVFIAAGQSAKVGFTM 715
Q++G + FI+ G++ +V F++
Sbjct: 673 -QLVGVNKDFISKGKTLEVSFSI 694
>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
Length = 722
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 239/742 (32%), Positives = 370/742 (49%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R + L+++MTL EKV Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDP++ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E + SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL+ +R +W F G++VSDC +I + H+ +N + E+A A + +G DL+CG Y
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVN-SLEEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA + +VLLKND LPLN IK++A+VGP A+ +G Y G P S + G
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ G V N M +A A K AD ++ G D + E D + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ ++ + LV + + +A + I +I+ YPG+E GRA+A+++
Sbjct: 493 EGQEKLLKEIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y+ +P + + + GRTY++ G +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F++ + ++ D QC +E+ N
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
G++ G EVV VY S+ T+ +K+++ +++V +A+G+ KV FT+ A + L + ++
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
+L SG +T+ +G G G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710
>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
Length = 722
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 239/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R + L+++MTL EKV Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDP++ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E + SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL+ +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA + +VLLKND LPLN IK++A+VGP A+ +G Y G P S + G
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ G V N M +A A K AD ++ G D + E D + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ ++ + LV + + +A + I +I+ YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKEIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y+ +P + + + GRTY++ G +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F++ + ++ D QC +E+ N
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
G++ G EVV VY S+ T+ +K+++ +++V +A+G+ KV FT+ A + L + ++
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
+L SG +T+ +G G G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710
>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
Length = 722
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 239/742 (32%), Positives = 368/742 (49%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R + L+++MTL EKV Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDP++ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E + SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL+ +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA + +VLLKND LPLN IK++A+VGP A+ +G Y G P S + G
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ G V N M +A A K AD ++ G D + E D + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q + + K+ + LV + + +A + I +I+ YPG+E GRA+A+++
Sbjct: 493 EEQEKFLKKIYQ-VNPRIVLVFHTGNPLTSEWADTH--ILAIMQAWYPGQEAGRALANLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y+ +P + + + GRTY++ G +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F++ + ++ D QC +E+ N
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
G++ G EVV VY S+ T+ +K+++ +++V +A+G+ KV FT+ A + L + ++
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
+L SG +T+ +G G G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710
>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
Length = 722
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 239/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R + L+++MTL EKV Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDP++ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E + SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL+ +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA + +VLLKND LPLN IK++A+VGP A+ +G Y G P S + G
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ G V N M +A A K AD ++ G D + E D + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ ++ + LV + + +A + I +I+ YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKEIYQ-VNPRIALVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y+ +P + + + GRTY++ G +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F++ + ++ D QC +E+ N
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
G++ G EVV VY S+ T+ +K+++ +++V +A+G+ KV FT+ A + L + ++
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
+L SG +T+ +G G G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710
>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
Length = 722
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 239/742 (32%), Positives = 368/742 (49%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R + L+++MTL EKV Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDP++ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E + SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL+ +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+E ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVEQGLISEVAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA + +VLLKND LPLN IK++A+VGP A+ +G Y G P S + G
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ G V N M +A A K AD ++ G D + E D + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ K+ + LV + + +A + I +I+ YPG+E GRA+A+++
Sbjct: 493 EEQEKLLKKIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y+ +P + + + GRTY++ G +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F++ + ++ D QC +E+ N
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
G++ G EVV VY S+ T+ +K+++ +++V +A+G+ KV FT+ A + L + ++
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
+L SG +T+ +G G G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710
>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
Length = 722
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 238/742 (32%), Positives = 369/742 (49%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R + L+++MTL EKV Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GE+P++ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEEPHLTSRLGVAFVKGLQ---------GDHPTYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E + SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSH 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL+ +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY 433
AA + +VLLKND LPLN IK++A+VGP A+ +G Y G P S + G
Sbjct: 383 AAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKG---- 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ G V N M +A A K AD ++ G D + E D + LP
Sbjct: 436 ---VKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ ++ + LV + + +A + I +I+ YPG+E GRA+A+++
Sbjct: 493 EGQEKLLKEIYQ-VNPRIVLVFHTGNPLTSEWADTH--IPAIMQAWYPGQEAGRALANLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y+ +P + + + GRTY++ G +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYKTE------EQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F++ + ++ D QC +E+ N
Sbjct: 604 FEFDNIQGNDT--LQPDAILQC-------------------------------SVELSNS 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
G++ G EVV VY S+ T+ +K+++ +++V +A+G+ KV FT+ A + L + ++
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
+L SG +T+ +G G G++
Sbjct: 690 KWRML-SGKYTLFIGSGQPGLA 710
>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 347 bits (890), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 245/769 (31%), Positives = 371/769 (48%), Gaps = 91/769 (11%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQM-GDLAYGVPRLGLPLYEWWSEALHGVSF 68
S +P+CD KL +R KDLV R++ + Q+ + + +GLP Y W + A+HG+
Sbjct: 105 SSYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNAIHGMQN 164
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG-NAG 127
D + P TSFP +A+FN SL K +G+ + E RA YN + G
Sbjct: 165 TACLA--------DGQCP--TSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNG 214
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
L WSP IN RDPRWGR +E+PGE P+V G+Y Y GLQ+ + +Y +
Sbjct: 215 LDTWSPTINPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQNGDDKDY--------TQA 266
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
KH+ AY +++++ R+ +++ V+E D+ +T+ +E V VMCSYN +N
Sbjct: 267 VVTLKHWVAYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKPLGVMCSYNSLN 326
Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
G+PTC +P L +R DW F GYI SD DSI I H + ++ A L G D+
Sbjct: 327 GVPTCGNPA-LTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVL-ATRDGLLGGCDI 384
Query: 308 DCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNP 365
D GD Y + AV Q + + +D +L Y + LG FD + Y + + +
Sbjct: 385 DSGDTYADNLEAAVNQSLVNRSAVDAALTNSYRMRFNLGLFDPNVTNAYDRISADEVGMS 444
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
E + AAR+ + LLKND LP TG K +A++G +N+ + ++GNY G C
Sbjct: 445 SSQETSLLAARKSMTLLKNDGQTLPFATG--KKVAVIGKSSNSAEDILGNYVGPIC---- 498
Query: 426 PMDGF----YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
P F Y V G A + + + I AI A +AD V+ + EG
Sbjct: 499 PSGAFDCVQTLYQGVAAANQGGATTLSDDVADINTAIQLAMDADQVVLTIS-NYGQAGEG 557
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
KDR + L Q EL+ V K P +V+++ G + +++ K+ + ++IL PG
Sbjct: 558 KDRTYIGLDTDQQELVAAVLKVGK-PTAIVMLNGGLISLDWIKD--EAQAILVAFAPGVH 614
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVK-IPYTSMPLRPV-------------NNFPG 587
GG+A+A+ IFG NPGG+LP+T Y ++YV + + +M ++ V + PG
Sbjct: 615 GGQAVAETIFGANNPGGKLPVTMYASDYVNDVDFLNMSMQAVAVLHLMNVNGERDDTGPG 674
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
R+YK++ G +YPF YGLSYT F + +P
Sbjct: 675 RSYKYYTGEPLYPFAYGLSYTTFNLSWSPAPP---------------------------- 706
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP--------PGIAGTHIKQVIGYE 699
+ T+ V N G + G EVV + KP P IK++ G++
Sbjct: 707 --MTTFTSTLRSTTYTATVTNTGSVGGDEVVFAFYKPKSESLKTLPVGNPVPIKEIFGFQ 764
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
RV + GQS +V F +NA ++L V + L SG I + G G V
Sbjct: 765 RVALGPGQSTQVTFELNA-ETLAQVTLDGHRELHSGEFEIELTRGHGEV 812
>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
Length = 691
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 247/756 (32%), Positives = 370/756 (48%), Gaps = 118/756 (15%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L ERA+ L + MT E+ Q+ A + RLG+P Y WW+E +HG++ G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A F++ L K+ + S EARA YN
Sbjct: 62 --------------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIY 107
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+PNIN+ RDPRWGR ET GEDPY+ + VRGLQ D + +
Sbjct: 108 KGLTLWAPNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQ----------GDGKVM 157
Query: 186 KISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K +AC KH+A + G + R FD++ +DM+ET++ FE V E V SVM +Y
Sbjct: 158 KAAACAKHFAVHS-----GPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAY 212
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRVNG P CA L+ + +W F GY VSDC +I+ E H + E A A LKA
Sbjct: 213 NRVNGEPACASDYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESA-AMALKA 269
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
G D++CG Y N + A+ +G I + I T+ L +RLG FD + ++ + +
Sbjct: 270 GCDVNCGCTYQNL-LAALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSKVA 328
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+H ++ E A + +VLLKN NG LPL+ KT+A++GP+A++ A+ GNY G RY
Sbjct: 329 CAEHKAVSLECAEKSLVLLKN-NGILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRY 387
Query: 424 TSPMDGFY-AYSKVINYAPGCA------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
T+ ++G + + +A GC + Q A+ AAKNAD ++ GLD +
Sbjct: 388 TTFLNGIQDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDAT 447
Query: 477 VEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
+E E D+ L LP Q L+ K+ K PV V+ + A++ ++ P
Sbjct: 448 IEGEEGDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAIN---TESQP 503
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFP 586
+++ YPG EGG+A+A+V+FG +P G+LP+T+YE + K+P +T ++
Sbjct: 504 --DALIHAFYPGAEGGKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK------ 554
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++ +++PFGYGL+Y
Sbjct: 555 GRTYRYTTDNILFPFGYGLTY--------------------------------------G 576
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V ++ V+ KD K + VEN G+ +V+ +Y K + G++RV + G
Sbjct: 577 GVKVNAVEYKDGKAV--VSVENSGRAT-EDVIELYLKDYCEQAVPNVSLCGFKRVKLGEG 633
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ A V + K+ VDN + T+L G
Sbjct: 634 EKATVEIAIPE-KAFTAVDNNGVRKVFGSKFTLLAG 668
>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 805
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 246/773 (31%), Positives = 376/773 (48%), Gaps = 90/773 (11%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAY---GVPRLGLPLYEWWSEALHGVSF 68
FP+CDA L ER +DL+ R+ L EKV + A + +GLP Y W + +HGV
Sbjct: 34 FPFCDASLSTSERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGANCVHGV-- 91
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG---- 124
S GT+ ATSFP + A F+ + Q + E RA++ G
Sbjct: 92 -----QSTCGTNC------ATSFPNPVNLGAIFDPQAVFDMAQVIGWELRALWLEGAREN 140
Query: 125 -----NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
+ GL WSPNIN+ RDPRWGR +ETP EDP V +Y + Y RGLQ+
Sbjct: 141 YAAGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTRGLQE--------G 192
Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
D R L+ KHYAAY ++++G DR F+++V+ D +T++ F V EG V
Sbjct: 193 KDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAQVSRYDFADTYLPAFHASVVEGKAKGV 252
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MCSYN VNG+P CA+ +L + +R F GYI SD +I+ I + E
Sbjct: 253 MCSYNSVNGMPMCANEQLNTKLLREALGFDGYITSDSGAIEGIYRQRHYTKSLCEAGRLA 312
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNL 357
++ +G D++ G Y V G++ E +D ++R + LG FD Y ++
Sbjct: 313 IM-SGTDVNSGSVYKKCLADLVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 371
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ + + +L+ E R+ IVLL+N LPL G K LA++GPHA A +A++GNY
Sbjct: 372 APSEVGKTESKQLSLELTRKSIVLLQNHGNVLPLRKG--KKLAVIGPHAKAKRALLGNYL 429
Query: 418 GTPCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADAT 467
G C +P++ A + N YA G I + + AA AA+ ADA
Sbjct: 430 GQMCHGDYLEVGCVQTPLEAITAANGASNTVYAKGSG-INDTSTADFDAAEAAARGADAV 488
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
V+ G+D S+E E DR ++ +P Q +L+ +V A K P +V+ + G V +
Sbjct: 489 VLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVGAE--ELIL 545
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
+ YPG G +A++D++FG P G+LP+T Y +NY+ M + +PG
Sbjct: 546 HTDGVAEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYIN--SVDMKSMSMTKYPG 603
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
R+Y+++ V+PFG+GLSYT+F + LD + P
Sbjct: 604 RSYRYYKEVPVFPFGWGLSYTKFT-----------LALDGEM--------------PDDP 638
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-----PGIAGTHIKQVIGYERVF 702
++I +D T + V N G + G EVV + +P G A +Q+ Y RV
Sbjct: 639 IVI----TRDLDQTVTVIVSNDGDLVGDEVVFAFFRPLNVNATGDAALLNEQLFDYRRVS 694
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG-VSFPLQL 754
+ Q K+ F + +L +VD++ N G + +++ GV V+F + L
Sbjct: 695 LRPTQYRKLTFRIQQ-STLAMVDDSGNKASFPGFYEVIITNGVHERVTFAIHL 746
>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
Length = 690
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 230/727 (31%), Positives = 354/727 (48%), Gaps = 112/727 (15%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L ERA+D+ +R++ EK +Q A RLG Y WWSE LHGV+ G
Sbjct: 6 YLDEALSDLERAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAGT-- 63
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A F++ + G+T S EARA YN +A
Sbjct: 64 --------------ATMFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIY 109
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT WSPN+N+ RDPRWGR ET GEDPY+ + Y +GLQ D + L
Sbjct: 110 KGLTLWSPNVNIFRDPRWGRGQETYGEDPYLTSCLGVAYAKGLQ----------GDGKVL 159
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ +AC KH+A + + R FD++ +DM ET+I FE V + V SVM +YNR
Sbjct: 160 RTAACAKHFAVH---SGPEATRHEFDAKANMKDMTETYIAAFEALVKDAKVESVMGAYNR 216
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG P CA ++N+ +W F G+ VSDC +I+ +H + T ++ A LK G
Sbjct: 217 VNGEPACASDFVMNKL--EEWGFDGHFVSDCWAIRDFHTNHG-VTKTAPESAALALKKGC 273
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP 365
DL+CG+ Y + + A +G I E D+ S L +RLG FD S +Y L + +
Sbjct: 274 DLNCGNTYLHL-LAAFNEGLINEEDLRRSCIKLMRTRVRLGMFDKSTEYDGLDYDIVACD 332
Query: 366 QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTS 425
+H E + + + +VLLKN NG LPL+ KT+ ++GP+A++ A+ GNY G Y +
Sbjct: 333 EHKEFSLRCSERSMVLLKN-NGILPLDGSKYKTIGVIGPNADSVPALEGNYNGKADEYIT 391
Query: 426 PMDGFY-AYSKVINYAPG-------CADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
+ G A+ + Y G C + ++ + A I + + + LD ++
Sbjct: 392 FLSGIREAHDGRVLYTEGSHLYKDRCMGLALPDDRLSEAEI-ITRTLRCSGSLCWLDATI 450
Query: 478 EAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
E E D+ DL LP Q +L+ V AKG +++ +AG+ IN +
Sbjct: 451 EGEEGDTGNEFSSGDKNDLRLPESQRKLVKTV--MAKGKPVIIVTAAGSA-INVEAD--- 504
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR 588
+++ YPG+ GGRA+A+++FGK +P G+LP+T+YE + +P + R
Sbjct: 505 CDALIQAWYPGQLGGRALANILFGKVSPSGKLPVTFYE------DASKLPDFSDYSMKNR 558
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
TY++ +G +++PFGYGL+Y++ +C ++++ G
Sbjct: 559 TYRYSEGNILFPFGYGLTYSE-------------------TECSELSFENGVA------- 592
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
++V N G +VV +Y K + G++RV + AG+S
Sbjct: 593 --------------TVKVTNTGSRFTEDVVQIYIKGYSENAVPNHSLCGFKRVALDAGES 638
Query: 709 AKVGFTM 715
V T+
Sbjct: 639 RIVQITL 645
>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
Length = 695
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 226/720 (31%), Positives = 352/720 (48%), Gaps = 106/720 (14%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
E A +V +MTL EK+ Q+ A + RL +P Y +W+E LHGV+ G
Sbjct: 10 EEAIKIVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGV----------- 58
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP I A+F+ L K I + +S E RA YN GLTFWSPN
Sbjct: 59 -----ATVFPQAIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPN 113
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
IN+ RDPRWGR ET GEDP++ + + +++GLQ + + L+++AC KH+
Sbjct: 114 INLFRDPRWGRGQETYGEDPFLTAQIGVAFIKGLQ----------GEGKYLRLAACTKHF 163
Query: 195 AAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCAD 254
A + + DR +FD+ V +D+ E ++ F+ + E DV S M +YN +NG P C +
Sbjct: 164 AVH---SGPEADRHYFDAVVNPKDLNEFYLPQFKAAIEEADVESFMGAYNAINGQPACVN 220
Query: 255 PKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYT 314
+L+ +T+ G W F G++VSD +++ + E+H + T + +A +K G +L C +
Sbjct: 221 EELIAKTLLGKWGFEGHVVSDYAALEDVHENHHY-TQTAAETMALAMKIGTNL-CAGKIS 278
Query: 315 NFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEA 374
+ AV +G + E +I S+ LY +RLG F Y + + +H L+ +A
Sbjct: 279 DALFEAVGKGLVTETEITASVVKLYTTHVRLGMFAEDNDYDTIPYEVNASAEHEMLSLKA 338
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF---Y 431
A + +VLLKNDN LPL+ IK++A++GP A A+ GNY GT Y + + G
Sbjct: 339 AEKSMVLLKNDN-FLPLSQSEIKSVAVIGPTARNIGALEGNYAGTANHYETFVSGIQQAL 397
Query: 432 AYSKVINYAPGC---AD----IVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE---- 480
+ + YA GC AD + + N AI AA++AD V+ GLD ++E E
Sbjct: 398 SNQARVTYALGCHLYADHAESSLSRANERESEAIIAAEHADIAVLCVGLDPTIEGEQGDA 457
Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
D+ L LPG Q LI KV + K V LV+ S A+ + + + +K+I+
Sbjct: 458 GNVYGSGDKPSLSLPGQQKRLIEKVLETGK-TVILVLTSGSALSLEGLEKHTGVKAIIQA 516
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
YPG GG A+A+++ GK +P G+LP+T+ + +P + RTY+
Sbjct: 517 WYPGAHGGTALANILLGKVSPSGKLPVTFCKDT------QGLPDFSDYSMAERTYQNTQL 570
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
V+YPFGYGL+Y + K + +DD+
Sbjct: 571 EVLYPFGYGLTYGHAEIKT---------------------------------LQLDDL-- 595
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
T + EN G D EV+ VY K ++I ++R+ + ++ V +
Sbjct: 596 -----TLSVTAENKGDYDIEEVIQVYVKINSEFAPKNHKLIAFKRIALPKNETVTVKIEL 650
>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
Length = 691
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 246/756 (32%), Positives = 369/756 (48%), Gaps = 118/756 (15%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L ERA+ L + MT E+ Q+ A + RLG+P Y WW+E +HG++ G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A F++ L K+ + S EARA YN
Sbjct: 62 --------------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIY 107
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+PNIN+ RDPRWGR ET GEDPY+ + VRGLQ D + +
Sbjct: 108 KGLTLWAPNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQ----------GDGKVM 157
Query: 186 KISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K +AC KH+A + G + R FD++ +DM+ET++ FE V E V SVM +Y
Sbjct: 158 KAAACAKHFAVHS-----GPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAY 212
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRVNG P CA L+ + +W F GY VSDC +I+ E H + E A A LKA
Sbjct: 213 NRVNGEPACASDYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESA-AMALKA 269
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
G D++CG Y N + A+ +G I + I T+ L +RLG FD + ++ + +
Sbjct: 270 GCDVNCGCTYQNL-LAALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSKVA 328
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+H ++ E A + +VLLKN NG LPL+ KT+A++GP+A++ A+ GNY G RY
Sbjct: 329 CAEHKAVSLECAEKSLVLLKN-NGILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRY 387
Query: 424 TSPMDGFY-AYSKVINYAPGCA------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
T+ ++G + + +A GC + Q A+ AAKNAD ++ GLD +
Sbjct: 388 TTFLNGIQDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDAT 447
Query: 477 VEAE---------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
+E E D+ L LP Q L+ K+ K PV V+ + A++ ++ P
Sbjct: 448 IEGEEGDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAIN---TESQP 503
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFP 586
+++ YPG EG +A+A+V+FG +P G+LP+T+YE + K+P +T ++
Sbjct: 504 --DALIHAFYPGAEGSKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK------ 554
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++ +++PFGYGL+Y
Sbjct: 555 GRTYRYTTDNILFPFGYGLTY--------------------------------------G 576
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V ++ V+ KD K + VEN G+ +V+ +Y K + G++RV + G
Sbjct: 577 GVKVNAVEYKDGKAV--VSVENSGRAT-EDVIELYLKDYCEQAVPNVSLCGFKRVKLGEG 633
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ A V + K+ VDN + T+L G
Sbjct: 634 EKATVEIAIPE-KAFTAVDNNGVRKVFGSKFTLLAG 668
>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
Length = 722
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 238/742 (32%), Positives = 361/742 (48%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R K L+++MTL EK Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDPY+ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---------GDHPAYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E V SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSR 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL + +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVKQGLISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---F 430
AA + +VLLKN+N LPL+ K++A+VGP A+ +G Y G P + + G
Sbjct: 383 AAVKSVVLLKNEN-LLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSVTLLKGVKDL 439
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+NY G I +S++ A K D ++ G D + E D + LP
Sbjct: 440 MGKRGKVNYLNG---IGASRDSIVA----AVKGVDVVLVALGSDEKMARENHDMTSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ + + LV S + +A + I +I+ YPG+E GRA+AD++
Sbjct: 493 EEQEKLLKAIYQ-VNPRIVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y A +P + + + GRTY++ +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYRAE------DQLPDILDFDMWKGRTYRYMKEDPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F + S +K QC +E+ N
Sbjct: 604 FGFDGIQG--SDTLKSGARLQC-------------------------------SVELSNT 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
GK G EVV VY S+ T+ +K+++ +++V +A G+ +V F + + L + +N
Sbjct: 631 GKWTGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNI-PPRELSVWEN- 688
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
N + +G +T+ +G G G++
Sbjct: 689 GNWRMLTGKYTLFIGSGQPGLA 710
>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 722
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 238/742 (32%), Positives = 361/742 (48%), Gaps = 92/742 (12%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R K L+++MTL EK Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 DLLQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE---- 108
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K++ +STEAR Y GLT+WSP I
Sbjct: 109 ------------VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTI 156
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDPY+ R + +V+GLQ LK A KH+
Sbjct: 157 NMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---------GDHPAYLKTVATIKHFV 207
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N E N+RF S++ + + E + +E CV E V SVM +YN NG+P
Sbjct: 208 A----NNEENNRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSR 263
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL + +R +W F G++VSDC +I + H+ +N E+A A + +G DL+CG Y
Sbjct: 264 WLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL-EEAAALGVNSGCDLECGTTYKE 322
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAE 373
+ AV+QG I+EA ID +L + +LG FD Y + K + + ELA E
Sbjct: 323 KLVQAVKQGLISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYE 382
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---F 430
AA + +VLLKN+N LPL+ K++A+VGP A+ +G Y G P + + G
Sbjct: 383 AAVKSVVLLKNEN-LLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSVTLLKGVKDL 439
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+NY G I +S++ A K D ++ G D + E D + LP
Sbjct: 440 MGKRGKVNYLNG---IGASRDSIVA----AVKGVDVVLVALGSDEKMARENHDMTSIYLP 492
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
Q +L+ + + LV S + +A + I +I+ YPG+E GRA+AD++
Sbjct: 493 EEQEKLLKAIYQ-VNPRIVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLL 549
Query: 551 FGKYNPGGRLPITWYEANYVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
FG NP G+LP+T Y A +P + + + GRTY++ +Y FG+GLSYT
Sbjct: 550 FGNENPSGKLPMTIYRAE------DQLPDILDFDMWKGRTYRYMKEDPLYGFGHGLSYTS 603
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F + S +K QC +E+ N
Sbjct: 604 FGFDGIQG--SDTLKSGTTLQC-------------------------------SVELSNT 630
Query: 670 GKMDGSEVVMVY-SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
GK G EVV VY S+ T+ +K+++ +++V +A G+ +V F + + L + +N
Sbjct: 631 GKWTGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNI-PPRELSVWEN- 688
Query: 728 ANSLLASGAHTILVGEGVGGVS 749
N + +G +T+ +G G G++
Sbjct: 689 GNWRMLTGKYTLFIGSGQPGLA 710
>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 742
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 248/751 (33%), Positives = 368/751 (49%), Gaps = 96/751 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D P +R DL++R TL EK Q+ GVPRLGLP++ W++ LHGV
Sbjct: 38 YRDMSRPIEDRITDLIKRFTLQEKAMQLNHTNRGVPRLGLPMWGGWNQTLHGVW------ 91
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
S+ P T FP A+++ L + +S EARA+YN G
Sbjct: 92 ---------SKQP-TTLFPIPTAMGATWDPELVHTVADAMSDEARALYNAHAEGPRTPHG 141
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
L + SP IN+ RDPRWGR+ E EDP + GR + YVRGLQ D + LK+
Sbjct: 142 LVYRSPVINISRDPRWGRIQEVFSEDPLLTGRMGVAYVRGLQ---------GDDLQHLKL 192
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
+A KH+A +++ + R H ++ V E+++ E ++ + + E SVM SYN +N
Sbjct: 193 AATVKHFAVNNVE----SGRQHLNADVDERNLFEFWLPHWRAAIMEAHAQSVMSSYNAIN 248
Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI--------VESHKFLNDTKEDAVAR 299
G+P + LL +R W F G++ D ++ + E + ++ A A
Sbjct: 249 GMPDAVNHWLLTDVLRKKWGFDGFVTDDLGAVALLSGTRATNTSEPGQHFSEDPVVAAAA 308
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNL 357
++AG D D ++ TN + AVQ+G + E D+D +LR + V RLG +D + +Y +
Sbjct: 309 AIRAGNDSDDVEFETNLPL-AVQRGLLTEKDVDGALRNVLRVGFRLGAYDPPQASKYSRI 367
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
G + + + H +L+ A + + LL N LPL +K++A++GP A GNY
Sbjct: 368 GMDVVRSQAHRDLSQRVAEESMTLLLNRRQFLPLQRDQVKSVAVIGP-AGGEAYETGNYY 426
Query: 418 GTPCRYTSPMDGFYAY--SKV-INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
GTP TS +G A S V + Y G + ++ I A + A+ +D V+ G +
Sbjct: 427 GTPAVKTSVTEGLRALLGSGVKVEYEKGAGYVDLADDKEIERAANLARKSDVVVLCLGTN 486
Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
L VEAEG+DR DL LPG Q L+ V AA V LV+M+AG + + +A ++ + +IL
Sbjct: 487 LQVEAEGRDRRDLNLPGAQQRLLEAVY-AANPKVALVLMNAGPLGVTWAHDH--VPAILS 543
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
YPGE GG AIA +FG NPGG LP T Y AN +P P + G TY++F
Sbjct: 544 AWYPGELGGAAIARTLFGLNNPGGHLPYTVY-ANLDGVP----PQNEYDVSRGYTYQYFK 598
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD-INYTVGTNKPPCAAVLIDDV 653
G +YPFG+GLSYT F Y KL Q D N TV
Sbjct: 599 GVPLYPFGHGLSYTHFDYS----------KLKVTQTSGDHANVTV--------------- 633
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVG 712
FT N G+ G+EV +YS + ++ + G+ERV + G+S V
Sbjct: 634 -----SFTLT----NTGQSAGAEVTQLYSHQVKSSEVQPLRTLRGFERVTLQPGESKAVA 684
Query: 713 FTMNACKSLKIVDNAANSL-LASGAHTILVG 742
++ +L D A ++ + GA +VG
Sbjct: 685 ISI-PTSALGWYDTAVHNFRVEPGAFNFMVG 714
>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
Length = 722
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 234/746 (31%), Positives = 367/746 (49%), Gaps = 109/746 (14%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
++AK ++ ++TL EK+ Q+ A G+ RLG+ Y W +EALHGV GR
Sbjct: 33 QKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------- 81
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
AT FP I A+F+ + +IG ++TE RA + + AGLTFW+PN
Sbjct: 82 -----ATVFPQPINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPN 136
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR +ET GEDP++ G +V+G+Q D LK +AC KH+
Sbjct: 137 VNIFRDPRWGRGMETYGEDPFLTGTLGTAFVKGMQ---------GDDPFYLKAAACGKHF 187
Query: 195 AAYDLDNWEGNDRFHFDSRV--TEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
A + G +R + V T++D+ ET++ F+M V +G V S+M +Y R+ G
Sbjct: 188 AVHS-----GPERTRHTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAYQRLYGESCS 242
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
LL +R DW F G++VSDC ++ + E HK + E AVA +KAGL+L+CG+
Sbjct: 243 GSKYLLTDILRKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAE-AVAFAIKAGLNLECGNS 301
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQYKNLGKNNICNPQHIEL 370
A+QQ I E D+D +L L + ++LG D + Y ++ I + + ++
Sbjct: 302 MRTMK-DAIQQKLITEKDLDKALLPLMMTRLKLGILQPDAACPYNEFPESVIGSEANRKI 360
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A +AA + +VLLKN NG LP+ +I+TL + GP A ++GNY G RY++ ++G
Sbjct: 361 AEQAAEESMVLLKN-NGVLPI-AKDIRTLFVTGPGATDAYYLMGNYFGLSNRYSTYLEGI 418
Query: 431 ---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL---------DLSVE 478
+ +NY G V +N + + ++ ++ A+ ++++ G D
Sbjct: 419 VGKVSNGTSVNYKQGFMQ-VFKNLNDVNWSVSESRGAEVSILIMGNSGNTEGEEGDAIAS 477
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
AE DRV+L LP Q E + +V+ + +V+ +D+ + W YP
Sbjct: 478 AERGDRVNLRLPDSQMEYLREVSKDRTNKLVVVLTGGSPIDVKEITELADAVVMAW--YP 535
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
G+EGG A+A+++FG N GRLP+T+ E+ +P + GRTYK+ ++
Sbjct: 536 GQEGGVALANLLFGDANFSGRLPVTFPESA------DRLPAFDDYSMKGRTYKYMTDNIL 589
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
YPFGYGLSY++ Y A+ K + K
Sbjct: 590 YPFGYGLSYSKVTYSNAAVTK---------------------------------MPTKTT 616
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNA 717
T ++V N G M EVV VY PG T I+ +IG++RV K+ +
Sbjct: 617 PMTVYVDVTNNGDMPVDEVVQVYLSTPGAGNTSPIESLIGFKRV--------KIYPHITV 668
Query: 718 CKSLKIVDNAANSLLASGAHTILVGE 743
K +I ++ A G +L GE
Sbjct: 669 TKDFQIPMELLETVQADGTSKLLKGE 694
>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
Length = 674
Score = 343 bits (879), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 220/722 (30%), Positives = 361/722 (50%), Gaps = 108/722 (14%)
Query: 51 LGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIG 110
+ +P Y +W+EALHGV+ G AT FP I A+F++ L +I
Sbjct: 1 MNIPEYNYWNEALHGVARAGV----------------ATVFPQAIGLAATFDDHLINEIA 44
Query: 111 QTVSTEARAMYNLGNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAI 162
+ TE RA YN GLTFWSPN+N+ RDPRWGR ET GEDP++ ++ +
Sbjct: 45 DVIGTEGRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGV 104
Query: 163 NYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQET 222
+++GLQ ++ LK++A KH+A + EG R FD+ V+++D+ ET
Sbjct: 105 AFIKGLQ----------GQAKYLKLAATAKHFAVHS--GPEGL-RHGFDAVVSDKDLYET 151
Query: 223 FILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI 282
++ F+ V E DV S+M +YN V+G+P LL + W+F G++VSD + + +
Sbjct: 152 YLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLKDILHDKWSFEGHVVSDYMAPEDV 211
Query: 283 VESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVL 342
E+HK+ D E + +KAGL+L G + A+ +G + E +I ++ LY
Sbjct: 212 HENHKYTKDAAE-TMGLAIKAGLNLVAG-HIEQSLHEALDRGLVTEEEITNAVISLYATR 269
Query: 343 MRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALV 402
+RLG F +Y + H L+ AA + VLLKND G LPL ++ +A+V
Sbjct: 270 VRLGMFATDNEYDAIPYEANDTKAHNNLSEIAAEKSFVLLKND-GVLPLRKETMEAIAVV 328
Query: 403 GPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVINYAPG-------CADIVCQNNS 452
GP+A++ A++GNY GTP R + ++G ++Y+ G A+ + + +
Sbjct: 329 GPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGLFQDHAAEPLAKADE 388
Query: 453 MIPAAIDAAKNADATVIVAGLDLSVEAE---------GKDRVDLLLPGFQTELINKVADA 503
A+ AA+++D V V GLD ++E E D+ +L LPG Q +L+ ++
Sbjct: 389 RESEAVIAAEHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERLLAV 448
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K PV +++ S ++ ++ +N+P +++I+ + YPG GG A+ADV+FG +P G+LP+T
Sbjct: 449 GK-PVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVLFGAVSPSGKLPVT 507
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y+ ++P N GRTY++ +YPFGYGL+Y+
Sbjct: 508 FYK------NVDNLPAFEDYNMAGRTYRYMTDEALYPFGYGLTYS--------------- 546
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVY 681
+V + D++ K Y+ T + ++N G D EVV VY
Sbjct: 547 -----------------------SVELSDLQVKSYEDTATVTATIQNTGNFDTDEVVQVY 583
Query: 682 SKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
K G Q+ G++RV++ G + F + + ++ D + + S I
Sbjct: 584 VKDLGSEFAVPNAQLKGFKRVYLGKGAKQTITFDLR-PQDFEVFDAQGRNFIDSDRFEIS 642
Query: 741 VG 742
VG
Sbjct: 643 VG 644
>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 763
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 232/638 (36%), Positives = 326/638 (51%), Gaps = 75/638 (11%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVP--RLGLPLYEWWSEALHGVSFI 69
+P+ + L + ER DLV R+TL E V QM + P RLG+ Y W SE LHGV
Sbjct: 26 YPFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWNSECLHGVV-- 83
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN------- 122
P G AT+FP I ASF+ L + + + E RA +N
Sbjct: 84 -----PPDGL--------ATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYVQRGV 130
Query: 123 -LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
+ GL+ +SP IN+ R P WGR ET GEDP+++G YVRGLQ
Sbjct: 131 YQEHTGLSCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQG---------DH 181
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
R + +A CKH+ + RF FD++V E+D Q TF+ F CV G V SVMC
Sbjct: 182 PRYVLANAGCKHFDVHGGPEDIPVSRFSFDAKVFERDWQMTFLPAFHECVKAG-VYSVMC 240
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
SYNR+N +P CA+ +LL +R +W F GY+VSD +++ I+ SH + D+ D VA +
Sbjct: 241 SYNRINEVPACANTRLLTDILRKEWGFDGYVVSDEGAVEFIMTSHHY-TDSIVDTVASAV 299
Query: 302 KAGLDLD----CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---Y 354
AG +LD GD AV GKI E + ++ L+ MRLG FD P+ Y
Sbjct: 300 NAGCNLDLAFPVGDGMYIKIGDAVTAGKIKEKTVVERVKPLFYTRMRLGEFD-PPELNPY 358
Query: 355 KNLGKNNICNPQHIELAAEAARQG-----IVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
NL + + + +H ELA +AA Q VLLK + LPL+T + LA++GP A+
Sbjct: 359 ANLNLSVVQSEEHRELAVKAALQSFVLLNFVLLKREGRVLPLDT-LVNKLAVIGPFADNP 417
Query: 410 KAMIGNYEGTPCR--YTSPMDGFYAYSKVINYAPGCADIVCQN--NSMIPAAIDAAKNAD 465
+ G+Y P + +P G ++ PGC C + M+ AA+ AD
Sbjct: 418 SYLFGDYSPNPDKEFVVTPCKGLSNAARDTRCTPGCLTAPCTTYFSEMVKAAV---TGAD 474
Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAK 524
V+ G + +EAE DR DL LPG Q +L+ V A G P+ L++ +AG +DI +A
Sbjct: 475 LIVVCLGTGVKIEAEFVDRSDLSLPGKQFQLLQDVVKYANGKPIILLLFNAGPLDIVWAV 534
Query: 525 NNPKIKSILWVGYPGEEGGRAIADVIF-------GKYNPGGRLPITWYEANYVKIPYTSM 577
NP I+ I+ +P + G A+ + G NPGGRLPITW P +
Sbjct: 535 ENPAIQVIVACFFPSQATGDALYRMFMNTHGVDTGNGNPGGRLPITW--------PRSMN 586
Query: 578 PLRPVNNFP--GRTYKFFDGPVVYPFGYGLSYTQFKYK 613
+ P+ N+ GRTY++F+G ++PFGYGLSY F Y
Sbjct: 587 QVPPMTNYTMEGRTYRYFNGDPLFPFGYGLSYGSFSYS 624
>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 338 bits (868), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 240/773 (31%), Positives = 373/773 (48%), Gaps = 86/773 (11%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAY---GVPRLGLPLYEWWSEALHGVSF 68
F +C+A L ER +DL+ R+ L EKV + A + +GLP Y W + +HGV
Sbjct: 35 FAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGANCVHGV-- 92
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG---- 124
S GT+ ATSFP + A F+ + Q V E RA++ G
Sbjct: 93 -----QSTCGTNC------ATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAREN 141
Query: 125 -----NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
+ GL WSPNIN+ RDPRWGR +ETP EDP V +Y + Y +GLQ+
Sbjct: 142 YATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQE--------G 193
Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
D R L+ KHYAAY ++++G DR F++ V+ D +T++ FE V G V
Sbjct: 194 KDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVHGKAKGV 253
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MCSYN VNG+P CA+ +L ++ +R F GYI SD +I I + E
Sbjct: 254 MCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLCEAGRLA 313
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNL 357
+L +G D++ G Y V G++ E +D ++R + LG FD Y ++
Sbjct: 314 IL-SGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 372
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
N + + +L+ + +R+ IVLL+N LPL G K LA++GPHA A +A++GNY
Sbjct: 373 APNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKG--KKLAVIGPHAAAKRALLGNYL 430
Query: 418 GTPCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADAT 467
G C +P++ + N YA G + I + + A AA+ A+
Sbjct: 431 GQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKG-SGINDTSTAGFDEAEAAARKAETV 489
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
V+ G+D S+E E DR ++ +P Q +L+ +V A K P +V+ + G V +
Sbjct: 490 VLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVGAE--ELIL 546
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
++ YPG G +A++D++FG P G+LP+T Y +NYV M + +PG
Sbjct: 547 HTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYVT--SVDMKSMSMTKYPG 604
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
R+Y+++ V+PFG+GLSYT+F + SS D P
Sbjct: 605 RSYRYYKEVPVFPFGWGLSYTRFTMALDSSSGVTD---------------------PSEP 643
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-----PGIAGTHIKQVIGYERVF 702
+++ + T + + N G + G EVV + +P G A +Q+ Y RV
Sbjct: 644 IVV----TRQLDQTVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQLFDYRRVS 699
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG-VSFPLQL 754
+ Q K+ F + +L +VD++ N G + +++ GV V+F + L
Sbjct: 700 LRPTQYRKLKFRIQQ-STLAMVDDSGNQASFPGFYEVIITNGVHERVTFAIHL 751
>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
Length = 748
Score = 338 bits (867), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 243/738 (32%), Positives = 364/738 (49%), Gaps = 115/738 (15%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQM--------GDLAYGVPRLGLPLYEWWSE 61
S+FP+ + L + ER DLV R+TL + VQQ+ G A + LG+ Y+W +E
Sbjct: 22 SNFPFQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENLGIGPYQWNTE 81
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
L G D E ATSFP I A+F++ L + + +TE RA +
Sbjct: 82 CLRG----------------DVEAGNATSFPQAIGLAAAFSKDLIFNVSKAAATEVRAKH 125
Query: 122 N--------LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEG 173
N + GL+ +SP +N++R P WGR ET GEDPY+ G YA +V+GLQ
Sbjct: 126 NDFVKRGIFTDHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQG--- 182
Query: 174 VEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNE 233
D D R ++ +A CKH+ A+ R FD++V+ +D++ TF+ F+ CV
Sbjct: 183 -----DHD-RYIQANAGCKHFDAHGGPEDIPESRMGFDAKVSMRDLRLTFLPAFQKCVQA 236
Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
G S+MCSYN +NG+P C++ L+ +RG+WNF GY+VSD +I+ + H + N++
Sbjct: 237 G-AYSLMCSYNSINGVPACSNKLLMMDILRGEWNFTGYVVSDEGAIENQISFHHYYNNS- 294
Query: 294 EDAVARVLKAGLDLDCGDYYTN---FTMG-AVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
EDA A + AG +L+ T +G AV+ GK+ E+ + ++ L+ MRLG FD
Sbjct: 295 EDAAAGSVNAGCNLELSGNLTEPVFMKIGDAVKSGKLEESVVRNRVKPLFYTRMRLGEFD 354
Query: 350 GSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDN--------GALPLNTGNIKT 398
P+ Y ++ + I + +H L+ AA + +VLLK + G P +
Sbjct: 355 -PPEMNPYSSVNLSVIQSEEHRNLSLTAAAKSLVLLKRPSKFSKRHLIGGFP-----SER 408
Query: 399 LALVGPHANATKAMIGNYEGT--PCRYTSPMDGFYAYSKVINYAPGCAD-IVCQNNSMIP 455
+A++GP AN T + G+Y T P +P+ G + +NYA GC D C N S
Sbjct: 409 MAVIGPMANNTDQIFGDYSPTTDPRFVKTPLKGLTELNFSMNYAAGCVDGTRCLNYSQDD 468
Query: 456 AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSA 515
A AD V+ G +E+E DR D++LPG Q +L+ V V L++ SA
Sbjct: 469 VKT-ALVGADLVVVCLGTGKDLESENVDRKDMMLPGKQLQLLQDVVSMTNKAVYLLVFSA 527
Query: 516 GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF---GKYNPGGRLPITWYEANYVKI 572
G V+I +A+ + ++ IL YP + G AI + G++NP GRLP TWY
Sbjct: 528 GPVNITWAQESERVLIILQCFYPAQSAGDAITQALIMRDGRFNPAGRLPYTWYR------ 581
Query: 573 PYT-SMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY-KVASSPKSVDIKLDKDQQ 630
YT +P + +TY++F G +YPFGYGLSY+ F + K+ PK
Sbjct: 582 -YTEQIPEMTDYSMARKTYRYFTGVPLYPFGYGLSYSTFVFSKLYFLPK----------- 629
Query: 631 CRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGT 690
V P Q+ V N G DG EV+ VY K
Sbjct: 630 -------VNAGDPNVV----------------QVRVFNEGPFDGDEVLQVYIKWMSTKER 666
Query: 691 HIK-QVIGYERVFIAAGQ 707
+ Q++ +ERVFI + Q
Sbjct: 667 MPRVQLVAFERVFIRSQQ 684
>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 338 bits (866), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 240/773 (31%), Positives = 372/773 (48%), Gaps = 86/773 (11%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAY---GVPRLGLPLYEWWSEALHGVSF 68
F +C+A L ER +DL+ R+ L EKV + A + +GLP Y W + +HGV
Sbjct: 35 FAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGANCVHGV-- 92
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG---- 124
S GT+ ATSFP + A F+ + Q V E RA++ G
Sbjct: 93 -----QSTCGTNC------ATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAREN 141
Query: 125 -----NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD 179
+ GL WSPNIN+ RDPRWGR +ETP EDP V +Y + Y +GLQ+
Sbjct: 142 YATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQE--------G 193
Query: 180 SDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSV 239
D R L+ KHYAAY ++++G DR F++ V+ D +T++ FE V G V
Sbjct: 194 KDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVHGKAKGV 253
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MCSYN VNG+P CA+ +L ++ +R F GYI SD +I I + E
Sbjct: 254 MCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLCEAGRLA 313
Query: 300 VLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNL 357
+L +G D++ G Y V G++ E +D ++R + LG FD Y ++
Sbjct: 314 IL-SGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 372
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
N + + +L+ + +R+ IVLL+N LPL G K LA++GPHA A +A++GNY
Sbjct: 373 APNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKG--KKLAVIGPHAAAKRALLGNYL 430
Query: 418 GTPCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADAT 467
G C +P++ + N YA G + I + A AA+ A+
Sbjct: 431 GQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKG-SGINDTSTGGFDEAEAAARKAETV 489
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
V+ G+D S+E E DR ++ +P Q +L+ +V A K P +V+ + G V +
Sbjct: 490 VLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFNGGVVGAE--ELIL 546
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
++ YPG G +A++D++FG P G+LP+T Y +NYV M + +PG
Sbjct: 547 HTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYVT--SVDMKSMSMTKYPG 604
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
R+Y+++ V+PFG+GLSYT+F + SS D P
Sbjct: 605 RSYRYYKEVPVFPFGWGLSYTRFTMALDSSSGVTD---------------------PSEP 643
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-----PGIAGTHIKQVIGYERVF 702
+++ + T + + N G + G EVV + +P G A +Q+ Y RV
Sbjct: 644 IVV----TRQLDQTVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQLFDYRRVS 699
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG-VSFPLQL 754
+ Q K+ F + +L +VD++ N G + +++ GV V+F + L
Sbjct: 700 LRPTQYRKLKFRIQQ-STLAMVDDSGNQASFPGFYEVIITNGVHERVTFAIHL 751
>gi|359473580|ref|XP_003631325.1| PREDICTED: protein BRASSINOSTEROID INSENSITIVE 1-like [Vitis
vinifera]
Length = 785
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 171/321 (53%), Positives = 224/321 (69%), Gaps = 14/321 (4%)
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEAD 330
YIVSDC ++ IV++ +LN++K DAVA+ L+AGLDL+CG YYT+ +V GK+++ +
Sbjct: 10 YIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYE 69
Query: 331 IDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALP 390
+D +L+ +Y++LMR+GYFDG P Y++LG +IC HIELA EAARQGIVLLKND LP
Sbjct: 70 LDRALKNIYVLLMRVGYFDGIPAYESLGLKDICAADHIELAREAARQGIVLLKNDYEVLP 129
Query: 391 LNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQN 450
L G K L LVGPHANAT+ MIGNY G P +Y SP++ F A V YA GC D C N
Sbjct: 130 LKPG--KKLVLVGPHANATEVMIGNYAGLPYKYVSPLEAFSAIGNV-TYATGCLDASCSN 186
Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTL 510
++ A +AAK A+ T+I G DLS+EAE DRVD LLPG QTELI +VA+ + GPV L
Sbjct: 187 DTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVIL 246
Query: 511 VIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGG------RLPITW 564
V++S +DI FAKNNP+I +ILWVG+PGE+GG AIADV+FGKYNP +L +W
Sbjct: 247 VVLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNPDTIPEWLWKLDFSW 306
Query: 565 YEAN----YVKIPYTSMPLRP 581
+ + Y K+P S+ P
Sbjct: 307 LDLSKNQLYGKLP-NSLSFSP 326
>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 769
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 224/638 (35%), Positives = 330/638 (51%), Gaps = 60/638 (9%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPR-----LGLPLYEWWSEALHG 65
+ P+C+ L +R +DL+ R+ L EK + A PR +GLP Y W + +HG
Sbjct: 33 ELPFCNTSLSTADRVEDLLSRLPLQEKATLL--TARASPRGNMSSIGLPEYNWGANCVHG 90
Query: 66 V-SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
V S G TN P TSFP + A F+ + + Q + E RA++ G
Sbjct: 91 VQSTCG--TNCP------------TSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEG 136
Query: 125 ---------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
+ GL WSPNIN+ RDPRWGR ETP EDP V +Y + Y RGLQ+
Sbjct: 137 ATENYKGGPHLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQE----- 191
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
+ D R L+ KHYAAY +N+ G +R FD+ V+ D +T+ F V +G+
Sbjct: 192 -GKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGN 250
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
VMCSYN VNGIP CA+ +L+ +RG F GY+ SD +++ I + H + D++ +
Sbjct: 251 AKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYA-DSQCE 309
Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQ 353
A + AG D++ G Y V ++ E +D +LR + LG FD
Sbjct: 310 AARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFDPIDDQP 369
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
Y N+ + + L+ A R+ +V+L+N+ LPL G LA++GPHA + + ++
Sbjct: 370 YWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQKG--VKLAVLGPHAKSKRGLL 427
Query: 414 GNYEGTPCR--------YTSPMDGFYAYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKN 463
GNY G C +P+D A + N +A GC I + + A+ AAK
Sbjct: 428 GNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTTFAEGCG-ISGNSTAGFEKAVAAAKE 486
Query: 464 ADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
ADA V+ G+D S+E E DR ++ LP Q +L+ +V A G T+V++ G V I
Sbjct: 487 ADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRV--HAVGRPTVVVLINGGV-IGAE 543
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPV 582
+ + +++ YPG G RA+ADV+FG NP G+LP+T Y ++YV ++ SM +
Sbjct: 544 EIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSMDMTA- 602
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
PGRTY++F G V+PFG+GLSYT F V S S
Sbjct: 603 --HPGRTYRYFKGEPVFPFGWGLSYTTFSLSVDSGTNS 638
>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 745
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 238/711 (33%), Positives = 350/711 (49%), Gaps = 102/711 (14%)
Query: 2 FESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM---GDLAYG----VPRLGLP 54
F I LSDFP+ + LP+ +R +DLV R+ L E V QM G + G + RL +
Sbjct: 15 FSLISTILSDFPFRNTSLPWNKRVEDLVGRLKLEEIVLQMSRGGRYSNGPAPPIDRLNIG 74
Query: 55 LYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
Y W +E L G D ATSFP A+F+ L K+I +
Sbjct: 75 PYSWNTECLRG----------------DLSAGPATSFPQAFGLAATFDAVLIKQIANATA 118
Query: 115 TEARAMYNL--------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVR 166
E RA YN + GL+ +SP IN+ R P WGR+ ET GEDPY+ G A ++V
Sbjct: 119 YEVRAKYNNYTKHKEYGDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASFVT 178
Query: 167 GLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILP 226
GLQ + R + +A CKH+ AY + R FD++V+++D++ TF+
Sbjct: 179 GLQG---------NHPRYVTANAGCKHFDAYAGPENIPSSRSTFDAKVSDRDLRMTFLPA 229
Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESH 286
F C+ G S+MCSYN +NG+P CA+ KLL +R +WNF GY++SD +++ + ++H
Sbjct: 230 FHECIQAG-TYSLMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAH 288
Query: 287 KFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM----GAVQQGKIAEADIDTSLRFLYIVL 342
+ D + A+A V +GL+L+ T+ M AV+QG + + + L+
Sbjct: 289 HYTKDMLDTAIACV-NSGLNLELSSNLTDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTR 347
Query: 343 MRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTL 399
MRLG FD P+ Y L + I + +H EL+ +AA + VLLKN+N LPL I L
Sbjct: 348 MRLGEFD-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKL 405
Query: 400 ALVGPHANATKAMIGNYEGTPCRYT-SPMDGFYAYSKV-INYAPGCADIVCQ--NNSMIP 455
A+VGP + + G+ T +P G +++ +A GC C +
Sbjct: 406 AVVGPFGDNPIEIYGSKSPDVSNLTVTPRYGLSKIARLATTFASGCLSPACTEYDPKSTK 465
Query: 456 AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI-NKVADAAKGPVTLVIMS 514
AID D V+ G VE E DR +L LPG Q L+ + V AA PV L++ +
Sbjct: 466 QAID---RVDMVVVCLGTGNEVENEAHDRSELTLPGQQLRLLQDAVTFAADKPVILLLFN 522
Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGK--YNPGGRLPITWYEANYVKI 572
AG +DI +A +NP I I+ +P + G A+ + NPGGRLPITW
Sbjct: 523 AGPLDITWAVSNPAIPVIVECFFPAQTTGTALYHLFVNSPGSNPGGRLPITW-------- 574
Query: 573 PYTSMPLRPVNNFP--GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQ 630
P + + P+ ++ GRTY++F+G ++PFGYGLSYT F Y
Sbjct: 575 PKSMSQVPPMEDYTMEGRTYRYFNGDPLFPFGYGLSYTTFHYS----------------- 617
Query: 631 CRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
D+ T T PC+++ ID + +EN G + G EV Y
Sbjct: 618 --DLLITPSTPIKPCSSINID------------VFLENTGDVTGDEVTQFY 654
>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
Length = 745
Score = 335 bits (860), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 233/770 (30%), Positives = 379/770 (49%), Gaps = 107/770 (13%)
Query: 7 VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------VPRLGLPLYEW 58
+ + D+P+ + LP+ R KDLV+R+T+ E V QM G VPRLG+ + W
Sbjct: 21 LHVQDYPFRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSW 80
Query: 59 WSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEAR 118
+E L G + G ATSFP + A+F+ + + S E R
Sbjct: 81 NTECLRGDVYAG----------------NATSFPQALGLAATFSTEVICDVASATSIEVR 124
Query: 119 AMYNL--------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
A +N + G++ +SP IN++R P WGR ET GEDP++ G A +V+ LQ
Sbjct: 125 AKFNDYQRRKIYGDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQG 184
Query: 171 VEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMC 230
D ++ +A CKH+ + RF FD++V+E+D + TF+ F+ C
Sbjct: 185 ---------DDPTYIRANAGCKHFDVHGGPENIPVSRFSFDAKVSERDWRLTFLPAFKRC 235
Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
V G S +MCS+NR+NG+P C + +LL +R +W F GY+VSD ++I+ I+ H + N
Sbjct: 236 VQAGSYS-LMCSFNRINGVPACGNKRLLTDILRTEWGFTGYVVSDQEAIENIMTYHHYTN 294
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTN----FTMGAVQQGKIAEADIDTSLRFLYIVLMRLG 346
++ D A +KAG +L+ + + A++ GK+ + D+ S+ L+ MRLG
Sbjct: 295 NSV-DTAALCVKAGCNLELSTNEVKPTYFYIIDALKAGKLDKEDLVKSVSPLFYTRMRLG 353
Query: 347 YFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP 404
FD Y + + I + +H ++ AA + VLLKN G LP+ T T++++GP
Sbjct: 354 EFDPPDHNPYNFIDLSVIQSEEHRAISLNAAMKSFVLLKNKGGFLPI-TKLFDTISVLGP 412
Query: 405 HANATKAMIGNY--EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQ--NNSMIPAAIDA 460
A+ IG+Y + P T+P+ G SK + YA GC D C N + I A+++
Sbjct: 413 MADNKYQQIGSYAPDVMPSYTTTPLQGLSKLSKRVQYAAGCNDNACSKYNRTEIQRAVNS 472
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI-NKVADAAKG-PVTLVIMSAGAV 518
+D + G +E E DR + LPG Q +L+ + + +AKG P+ L++ + G V
Sbjct: 473 ---SDIFFVCLGTGPMIENEDHDRASMELPGQQAQLLKDAIMFSAKGVPIVLLLFNGGPV 529
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIF---GKYNPGGRLPITW--YEANYVKIP 573
+I +A + ++ +I+ +P +E G A+ V+ NP GRLP TW Y+ +
Sbjct: 530 NITWADRSDRVVAIMECFFPAQETGEAVLRVVTNTGNSSNPAGRLPYTWPKYQDQIPSMV 589
Query: 574 YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
SM GRTY++F G +YPFGYGLSY+ F + A
Sbjct: 590 NYSM--------EGRTYRYFHGDPLYPFGYGLSYSTFNFTNA------------------ 623
Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-I 692
++ + + T ++EV N G DG EV+ VY K T I
Sbjct: 624 ---------------WMNPIISQGQDLTVRVEVCNEGPTDGDEVIQVYLKWLDTNETMPI 668
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
Q++G+ERV + A ++ T+ A +++ + + + + G + + +G
Sbjct: 669 HQLVGFERVSLRAKETLSWLITVRA-ENMAVWNESRGFYIEPGRYRLYIG 717
>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
Length = 1202
Score = 335 bits (860), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 267/806 (33%), Positives = 373/806 (46%), Gaps = 126/806 (15%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+PYCD LP R DL R T+ E + QMG +A VPRLGLP + EALHGV
Sbjct: 341 YPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALNYGGEALHGVWSTCA 400
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-------- 123
P T FP ASF+ LW+ +G EARA++
Sbjct: 401 AGRCP------------TQFPAPHAMGASFDRDLWRAVGAASGLEARALFRWNQRHNASD 448
Query: 124 ------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYH 177
G GLTF++PN+N+ RDPRWGR+ E P EDP + G Y +VRG Q G +
Sbjct: 449 CARSLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQ---GDGAY 505
Query: 178 RDSDSRPLKISACCKHYAAYDLD---------NWEG-------NDRFHFDSRVTEQDMQE 221
R ++ A KH+A Y+L+ +W G NDR FD+RV+ +D +E
Sbjct: 506 RVAN-------AVVKHFAVYNLEVDVEDTPPADWCGSAACAPPNDRHSFDARVSPRDFEE 558
Query: 222 TFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQT 281
T++ PF ++ MCSYN VNG P C D LL +RG NF G + +DC +++
Sbjct: 559 TYVGPFVA-PVAAGAAAAMCSYNAVNGEPACTDGALLRGALRGALNFTGVLATDCGALED 617
Query: 282 IVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIV 341
V HK E A A + AG+D +CG T+ A+ G + + L L
Sbjct: 618 AVARHKRYATEAEAAAA-AIAAGVDSNCGKVLTSALPEALAAGLVRPDALRPPLERLLEA 676
Query: 342 LMRLGY---FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKT 398
+RLG +D + + +P H LA AAR+G+VLL+N N LPL+ T
Sbjct: 677 RLRLGLLDDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLLQNPNQILPLD--GRGT 734
Query: 399 LALVGPHANATKAMIGNYEGTPC--RYTSPMDGFYAY---SKVINYAPGCADIVCQNNSM 453
LA++GP+ANA+ ++ Y GTP SP+ A KV+ YA GC + +
Sbjct: 735 LAVIGPNANASMNLLSGYHGTPPPDLLRSPLQELEARWRGGKVV-YAVGC-NASGAATAA 792
Query: 454 IPAAIDAAKNADATVIVAGL------------DLSV----EAEGKDRVDLLLPGFQTELI 497
+ A+D AK AD V+ GL D + EAE DR L LPG Q L
Sbjct: 793 LDEAVDLAKTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAESVDRTSLKLPGAQEALF 852
Query: 498 NKVADAAKG-PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
+K+ K V + ++SAGAVD +FAK+ ++L GY GE GG A+AD + G YNP
Sbjct: 853 SKIWALGKPVAVAVFLVSAGAVDASFAKDK---AALLLAGYGGEFGGVAVADALLGAYNP 909
Query: 557 GGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP---FGYGLSYTQFKYK 613
GG L T + P+ M +RP PGRTY+F D V P FG+GLSYT F
Sbjct: 910 GGALTATMLPDAGLP-PFRDMAMRPSAASPGRTYRFLDERRVAPLWRFGFGLSYTAFAVS 968
Query: 614 VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMD 673
+A G + P + F + V N+G +
Sbjct: 969 LA-----------------------GPTRVP-----------RRAATRFSVVVRNVGAVS 994
Query: 674 GSEVVMVYSKPPGIAGTHIKQVIGYERVF-IAAGQSAKVGFTMNACKSLKIVDNAANSLL 732
G VV + G ++++ + RV +A S KV + +SL +VD A
Sbjct: 995 GDVVVACFVAAVGRPDAPLRELFDFARVRDLAPAASTKVSMELRP-RSLSLVDEAGVRST 1053
Query: 733 ASGAHTILVGEGVGGVSFPLQLNLNH 758
+GA+ + G + ++L H
Sbjct: 1054 TAGAYDVRCSAGRVADTEDIRLTTAH 1079
>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 722
Score = 335 bits (860), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 229/744 (30%), Positives = 365/744 (49%), Gaps = 102/744 (13%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
++AK ++ ++TL EK+ Q+ A G+ RLG+ Y W +EALHGV GR
Sbjct: 33 QKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------- 81
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN--------AGLTFWSPN 134
AT FP I A+F+ + ++IG ++TE RA + + AGLTFW+PN
Sbjct: 82 -----ATVFPQPISLGATFDPEIVQQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPN 136
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
+N+ RDPRWGR +ET GEDP++ G +V+G+Q +D LK +AC KH+
Sbjct: 137 VNIFRDPRWGRGMETYGEDPFLTGVLGTAFVKGMQ---------GNDPFYLKAAACGKHF 187
Query: 195 AAYDLDNWEGNDRFHFDSRV--TEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
A + G +R + V T+ D+ ET++ F+M V +G V S+M +Y R+ G
Sbjct: 188 AVHS-----GPERTRHTANVEPTKHDLYETYLPAFKMLVQQGKVESIMGAYQRLYGESCS 242
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY 312
LL +R DW F G++VSDC ++ + E HK + E AVA +KAGL+L+CG+
Sbjct: 243 GSKYLLTDILRKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAE-AVAFAIKAGLNLECGNS 301
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF--DGSPQYKNLGKNNICNPQHIEL 370
A++Q I E D+D +L L + ++LG D + Y ++ I + + +
Sbjct: 302 MRTMK-DALKQKLITEKDLDKALLPLMMTRLKLGILQPDVACPYNEFPESVIGSIDNRNI 360
Query: 371 AAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF 430
A AA + +VLLKND G LP+ +I+TL + GP A ++GNY G RY++ ++G
Sbjct: 361 AQRAAEESMVLLKND-GVLPI-AKDIRTLFVTGPGATDAYYLMGNYFGLSDRYSTYLEGI 418
Query: 431 ---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL---------DLSVE 478
+ +NY G V +N + + ++ ++ A+ ++I+ G D
Sbjct: 419 VGKVSNGTSVNYKQGFMQ-VFKNLNDVNWSVSESRGAEVSIIIMGNSGNTEGEEGDAIAS 477
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
+E DRVDL LP Q + + +V+ + +V+ +D+ + W YP
Sbjct: 478 SERGDRVDLRLPEPQMQYLREVSKDRTNKLVVVLTGGSPIDVKEITELADAVVMAW--YP 535
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVV 598
G+EGG A+A+++FG N GRLP+T+ E +P + GRTYK+ ++
Sbjct: 536 GQEGGVALANLLFGDANFSGRLPVTFPETT------DKLPSFDDYSMKGRTYKYMTDNIL 589
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
YPFGYGLSY + Y A+ K + K
Sbjct: 590 YPFGYGLSYGKVAYGNATVTK---------------------------------LPTKHS 616
Query: 659 KFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNA 717
T +++ N G M EVV VY S P + I+ ++ ++RV IA + F +
Sbjct: 617 SMTVSVDLSNDGNMPVDEVVQVYLSTPSAGVTSPIESLVAFKRVKIAPHATVTTDFEI-P 675
Query: 718 CKSLKIVDNAANSLLASGAHTILV 741
+ L+ V S L G + +++
Sbjct: 676 VERLETVQEDGTSKLLKGEYRVMI 699
>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
Length = 732
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 236/773 (30%), Positives = 376/773 (48%), Gaps = 102/773 (13%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLY-EWWSEALHGVSFIGRR 72
+ + ++ R DL+ R+TL +K Q + V G + + W++ LHGV +
Sbjct: 33 FLNQEMSMEARVADLMSRLTLEQKAQLLNHRGKTVVVDGFSIRADQWNQCLHGVKWTEPT 92
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------- 123
TN FPT I A+++ L ++ +S EARA+YN
Sbjct: 93 TN----------------FPTSIALGATWDTELIHRVATVISDEARAIYNGWKQDPEFRG 136
Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
+ GL + SP IN+ R+P WGR+ E GEDPY GR + YV+GLQ DS
Sbjct: 137 EHKGLIYRSPVINISRNPYWGRINEIFGEDPYHTGRMGVAYVKGLQG---------DDSH 187
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK+++ KHYA +++ DR ++V E+ + E ++ F+ C+ EG SVM SY
Sbjct: 188 YLKLASTLKHYAVNNVEV----DRMKLSAQVPERMLYEYWLPHFKDCIVEGKAQSVMASY 243
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +NG+P + LL ++ W G++VSD ++T+VE H + E+AV R + A
Sbjct: 244 NAINGVPNNINKLLLTDILKNQWGHEGFVVSDLGGVKTMVEGHHQRQISCEEAVGRSIMA 303
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G D + Y + A+++G + E ++ +LR + +V RLG FD S Y + +
Sbjct: 304 GCDFSDAE-YEKYIPDALRKGYLTEERLNDALRRVLLVRFRLGEFDDFKSVPYSRISPDV 362
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
I +H L+ EAAR+ IVLLKN+ LP++ IK +A++GP+A+ GNY G P
Sbjct: 363 IGCKEHRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVAVIGPYADLFNQ--GNYGGVPK 420
Query: 422 RYTSPMDGF---YAYSKVINYAPGC--ADIVCQNNSMIP----------AAIDAAKNADA 466
+P+ G + + Y G + + IP A++ A+N+D
Sbjct: 421 DPVTPLQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIPPRFDKEAEMKKAVEMARNSDV 480
Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
+ G +E EG+DR L+LPG Q EL+ V + K V +V+MSAG V + K N
Sbjct: 481 VFLFVGTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNK-KVVVVLMSAGPVAVPEVKKN 539
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
I ++L +PG+EGG AIADV+FG YNPGG+LP T Y ++ ++P T +
Sbjct: 540 --IPAVLQAWWPGDEGGNAIADVLFGDYNPGGKLPYTMYASDE-QVPSTD----EYDISK 592
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
G TY + ++ FG+GLSY++F Y D+++
Sbjct: 593 GFTYMYLKKKPLFAFGHGLSYSKFHYS--------DLQIS------------------SP 626
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAA 705
V ++D + ++V+NMGK G EVV +Y + K++ G++R+ +
Sbjct: 627 VVSVNDT------VSVVLKVKNMGKRTGEEVVQLYVRDVKAKVVRPTKELRGFKRIALQP 680
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVGEGVGGVSFPLQLNLN 757
+ ++ M KSL D + L G+ IL+G + +L +N
Sbjct: 681 NEEQEIRL-MLPVKSLAFYDESIGDFLVEPGSFEILLGSASDDIRLQSKLIVN 732
>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 179/483 (37%), Positives = 284/483 (58%), Gaps = 30/483 (6%)
Query: 275 DCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTS 334
DCD++ + K+ T EDAVA LK+G+ Y N+T AV++ K+ ++ID +
Sbjct: 229 DCDAVNVLHVEQKYAK-TPEDAVADALKSGIS-----YLRNYTKSAVEKKKVTVSEIDRA 282
Query: 335 LRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
L L+ MRLG F+G P Y ++G + +C+ +H LA EAA GIVLLKN + LPL
Sbjct: 283 LHNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVLLKNADRLLPL 342
Query: 392 NTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNN 451
+ I +LA++GP+A+ + ++GNY G C+ + ++G Y +Y GC ++ C +
Sbjct: 343 SKSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEKGCNNVSC-TS 401
Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
+ ++ A+ D ++V GLD S E E DR+DL+LPG Q LI VA AAK P+ LV
Sbjct: 402 AAKKKPVEMAQTEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAKAAKRPIVLV 461
Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP---GGRLPITWYEAN 568
++ +D+ FAKNN KI SILW GYPG+ G A+A +IFG++NP GGRLP+TWY +
Sbjct: 462 LLGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPGNAGGRLPMTWYPQD 521
Query: 569 YVKIPYTSMPLR--PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS-SPKSVDIKL 625
+ K+P T M +R P PGRTY+F++G V+ FGYGLSY+ + Y AS + +++K
Sbjct: 522 FTKVPMTDMRMRPQPSTGNPGRTYRFYEGEKVFEFGYGLSYSDYSYTFASVAQNQLNVKD 581
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDV---KCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
+QQ N L+ D+ +C++ KF + V+N G+M G V++++
Sbjct: 582 SSNQQPE--------NSETPGYKLVSDIGEEQCENIKFKVTVSVKNEGQMAGKHPVLLFA 633
Query: 683 K--PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
+ PG G IK+++G++ V + AG+ ++ + ++ C+ L + ++ G+ +L
Sbjct: 634 RHAKPG-KGRPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILL 692
Query: 741 VGE 743
VG+
Sbjct: 693 VGD 695
Score = 209 bits (531), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 101/195 (51%), Positives = 128/195 (65%), Gaps = 10/195 (5%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ +C LP RA+DLV R+T EK Q+ D + +PRLG+P YEWWSE LHG+ F+ R
Sbjct: 42 YDFCKTTLPISRRAEDLVSRLTFEEKATQLVDTSPAIPRLGIPAYEWWSEGLHGIGFLTR 101
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-AGLTF 130
+ F+ + ATSFP VILT ASF+ +W +IGQ V EARA+YN G GL F
Sbjct: 102 VQQGI--SFFNRTIQHATSFPQVILTAASFDAHIWYRIGQ-VGKEARALYNAGQVTGLGF 158
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ--DVEGVEYHRDSDSRPLKIS 188
W+PN+N+ RDPRWGR ETPGEDP VVG+Y ++VRG+Q EG D L+ S
Sbjct: 159 WAPNVNIFRDPRWGRGQETPGEDPLVVGKYGASFVRGVQGDSFEGESTLGDH----LQAS 214
Query: 189 ACCKHYAAYDLDNWE 203
ACCKHY A+DLDNW+
Sbjct: 215 ACCKHYTAHDLDNWD 229
>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
Length = 549
Score = 332 bits (851), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 207/520 (39%), Positives = 292/520 (56%), Gaps = 37/520 (7%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+D CD K PERA LV+ + + EK+Q + D++ G RLGLP Y WWSEALHGV+
Sbjct: 33 LADNTVCDPKATPPERAAALVKALNIEEKLQNLVDMSKGAERLGLPAYAWWSEALHGVAA 92
Query: 69 I-GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
G R N G F S ATSF I +A+F++ L K+ T+STEARA N G AG
Sbjct: 93 SPGVRFNRTAGGRFSS----ATSFANSITLSAAFDDELVYKVADTISTEARAFANAGLAG 148
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
L +W+PNIN +DPRWGR ETPGEDP + Y + GL+ D K+
Sbjct: 149 LDYWTPNINPYKDPRWGRGHETPGEDPVRIKGYVKALLAGLEG---------DDPSIRKV 199
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
A CKHYAAYDL+ W+G R FD+ V+ QD+ E ++ PF+ C + V S MCSYN +N
Sbjct: 200 VATCKHYAAYDLERWQGTTRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNALN 259
Query: 248 GIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLN----DTKEDAVARV 300
G P CA L++ +R W + + YI SDC++IQ + K+ N T+ +A A
Sbjct: 260 GTPACASTYLMDDILRKHWGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAAVA 319
Query: 301 LKAGLDLDC----GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQ 353
+AG D C YT+ +GA Q ++E IDT+L+ LY L+R+GYFD GSP
Sbjct: 320 YQAGTDTVCEVPGWPPYTD-VIGAYNQTLLSEEVIDTALKRLYEGLVRVGYFDPASGSP- 377
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA-- 411
Y+++G ++ P+ ELA ++ G+VLLKND G LPLN + KT+AL+G AN+T
Sbjct: 378 YRSIGWEDVNTPEAQELALQSGTDGLVLLKND-GTLPLNLED-KTVALIGFWANSTNGGR 435
Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPG-CADIVCQN--NSMIPAAIDAAKNADATV 468
++G Y G P SP+D + +YA G A+ + Q + + A++ AK ++ +
Sbjct: 436 ILGGYSGFPPYIHSPVDAAEKLNLTYHYASGPLAENITQAAIDDWVAKALEPAKKSNVIL 495
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPV 508
G D S+ AE DR + P Q +I ++ + P
Sbjct: 496 YFGGTDTSIAAEDLDRDSIAWPEIQLAVIEALSALRQAPA 535
>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
Length = 731
Score = 331 bits (849), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 243/765 (31%), Positives = 375/765 (49%), Gaps = 106/765 (13%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPE----KVQQMGDLAYGVPRLGLPLYEWWSEALHG 65
+ FP+ D L + +R DLV+R+T+ E V Q G V RLG+ Y++ +E + G
Sbjct: 18 AKFPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINECITG 77
Query: 66 VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-- 123
V R NS T+FP I ASF+ L + Q ++ E R YN
Sbjct: 78 V----RWENS-------------TAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEV 120
Query: 124 -----GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
G+ G+ ++P IN++R P WGR ET GEDP++ G+ ++ +V+GLQ
Sbjct: 121 KSQIYGHRGVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQG-------- 172
Query: 179 DSDSRPLKISACCKHYAAYDLDNWEGN---DRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
R ++ S CKH+ D+ N N RF FD++V+E+D + TF+ F+ CV G
Sbjct: 173 -DHPRYIQASGGCKHF---DVHNGPENIPVSRFGFDAKVSERDWRMTFLPQFKTCVEAGS 228
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
++ +MCSYNR+NG+P CA+ KLL +R +W F+GY++SD +I+ IV HK+ T +
Sbjct: 229 IN-IMCSYNRINGVPACANKKLLTDILRKEWGFNGYVISDSGAIENIVYHHKY-TKTLAE 286
Query: 296 AVARVLKAGLDLD------CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
A A +KAG +++ G Y N + AV+Q I+E ++ +L+ MR G FD
Sbjct: 287 AAADSVKAGCNVELTGATGSGVAYFNL-LNAVKQNLISEEELRENLKKPMYSRMRQGEFD 345
Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
+ + + + + +H +LA +A+ VL+KN N LPL LA++GP A+
Sbjct: 346 PVDMNPFTKIDMSVVLSQEHQDLAVKASAMSFVLMKNLNRVLPLKK-RFDRLAIIGPFAD 404
Query: 408 ATKAMIGNY--EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAID-AAKNA 464
+ + G+Y P ++P +G + + YA GC D C N P AI+ A K A
Sbjct: 405 NAETLFGDYIPNWDPKFVSTPYEGLKSLGDDVRYASGCDDPSCTNYD--PKAIEKAVKGA 462
Query: 465 DATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAK-GPVTLVIMSAGAVDINFA 523
+ G+ ++E EG DR DL LPG+Q +++ ++ P+ LV+ +AG VD+ +
Sbjct: 463 QFVFVCLGVGSNLEREGHDRADLDLPGYQLQILKDAEFFSREAPLVLVLFNAGPVDLTWP 522
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYN---PGGRLPITWYEANYVKIPYTSMPLR 580
K +P++ I+ YP G+A+ V+ + P RLP TW A ++P +
Sbjct: 523 KLSPEVDGIIECFYPAMGTGKALYQVVTATGDDGVPAARLPSTW-PAQLHQVPSITD--- 578
Query: 581 PVNNFPGRTYKFFD-GPVVYPFGYGLSYTQFKYKVAS-SPKSVDIKLDKDQQCRDINYTV 638
N G TY++FD G +YPFGYGLSYT F Y+ S SP SV
Sbjct: 579 --YNMTGHTYRYFDGGDPLYPFGYGLSYTSFHYQTVSVSPTSV---------------RA 621
Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIG 697
G N T ++V N G + EV VY S ++G
Sbjct: 622 GGN------------------VTVTVQVLNRGPYNADEVTQVYMSWMEATVPVPRWTLVG 663
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++R QS+ + F ++A + VD A + G I G
Sbjct: 664 FKRHRHTVNQSSSLSFVVSAEQMAVWVDEATGFQVQPGKMLIYAG 708
>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1283
Score = 331 bits (848), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 239/751 (31%), Positives = 366/751 (48%), Gaps = 110/751 (14%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDL--AYGVPRLGLPLYEWWSEALHGVSFIGR 71
Y + +P ER DL+ R+TL EKV Q+ D + G+ RL +P +E LHG S+
Sbjct: 72 YLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGSKGIARLKIPAM-LKTEGLHGQSY--- 127
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
G+T FP I ++F+ L +++G+ + EA+A NL W
Sbjct: 128 -------------ATGSTIFPHGINMGSTFDTELIQEVGKATAIEAKAA-NL----RVSW 169
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
SP ++V RD RWGRV ET GEDPY+VGR + +++G Q + AC
Sbjct: 170 SPVLDVARDARWGRVEETYGEDPYLVGRIGVAWIKGFQGEH--------------MFACP 215
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+A + R D ++++ M+ + PF + E + VM +Y NG+P
Sbjct: 216 KHFAGH---GQPVGGRDSHDYGLSDRVMRNIHLAPFRDVIKEANAFGVMAAYGLWNGVPD 272
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+LL + +R +W F G++VSDC + I + T E+A A ++AG+D++CG
Sbjct: 273 NGSKELLQKILREEWGFEGFVVSDCSGPENIQRKQSVVG-TMEEAAAMAVRAGVDIECGS 331
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC---NPQHI 368
Y AV++G I E+++D +LR ++ MRLG FD P +N+ N + P+H
Sbjct: 332 AYKKALASAVKKGIIKESELDANLRRVFRAKMRLGLFD-RPSIENMVWNKLPEYDTPEHR 390
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSP 426
LA + A + VLLKN+N LPL+ NIKT+A++GP NA + G+Y P + S
Sbjct: 391 ALARKVAVKSTVLLKNENNLLPLDK-NIKTIAVIGP--NADQGQTGDYSAKYAPGQIISV 447
Query: 427 MDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD--------- 474
++G + S + YA GC + + + A++ AK ADA ++V G +
Sbjct: 448 LEGVKNHVSPSTKVLYAQGCTQL-DMDTTGFAEAVNIAKQADAVILVVGDNSNRHENGNK 506
Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
S E D L +PG Q +LI K +A PV LV+++ + + N I+SIL
Sbjct: 507 KSTTGENVDGATLEIPGVQRQLI-KAVEATGKPVVLVLVNGKPFTLTWEDEN--IESILE 563
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
YPGEEGG A AD+IFG NP GRLPI+ + + P +PL GR Y ++D
Sbjct: 564 TWYPGEEGGNATADIIFGDENPSGRLPIS-----FPRHP-GQLPLWYNYETSGRNYDYYD 617
Query: 595 GPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
P +Y FG+GLSYT F+Y ++ T + P V +D
Sbjct: 618 MPFTPLYRFGHGLSYTTFRYS-------------------NLKATTKSGDPGFVTVSVD- 657
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
+EN GK G EV +Y + T + + G++RVF+ G+ V
Sbjct: 658 -------------IENTGKRPGEEVAQLYITDLVASVNTAVIDLKGFKRVFLKPGEKKTV 704
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F +N L +++ +L +G + VG
Sbjct: 705 TFELNPY-LLSLLNPDMKRVLEAGKFRMHVG 734
>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
Length = 273
Score = 329 bits (843), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 155/251 (61%), Positives = 186/251 (74%), Gaps = 15/251 (5%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+FP+C KLP R DL+ RMTL EKV + + A VPRLG+ YEWWSEALHGVS +G
Sbjct: 38 NFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIKGYEWWSEALHGVSNVG 97
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF 130
PGT F PGATSFP VI T ASFN +LW+ IG+ VS EARAM+N G AGLT+
Sbjct: 98 ------PGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGGVAGLTY 151
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
WSPN+N+ RDPRWGR ETPGEDP V G+YA +YVRGLQ +D LK++AC
Sbjct: 152 WSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQ---------GNDGDRLKVAAC 202
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+ AYDLDNW G DRFHF+++V++QDM++TF +PF MCV EG V+SVMCSYN+VNGIP
Sbjct: 203 CKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPFRMCVKEGKVASVMCSYNQVNGIP 262
Query: 251 TCADPKLLNQT 261
TCADPKLL +T
Sbjct: 263 TCADPKLLKKT 273
>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
Length = 569
Score = 328 bits (841), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 202/555 (36%), Positives = 283/555 (50%), Gaps = 70/555 (12%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD P +RA LV+ M EK+Q + + GV RLGLP Y WWSEALHGV+
Sbjct: 65 CDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSEALHGVA------- 117
Query: 75 SPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PG F S P ATS P IL A+F++ L +K+G + TEARA N ++G+ FW+
Sbjct: 118 GAPGIQFSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGNGNHSGIDFWT 177
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN +DPRWGR ETPGED + Y +RGL EG + R +I A CK
Sbjct: 178 PNINPFKDPRWGRGSETPGEDTLRLKGYVAALLRGL---EGNKAQR-------RIIATCK 227
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
HYAA DL++W G R FD++++ QD+ E ++ PF+ C + V S MCSYN VNG+P C
Sbjct: 228 HYAANDLESWNGVTRHDFDAKISMQDLAEYYLQPFQQCARDSKVGSFMCSYNSVNGVPAC 287
Query: 253 ADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
A+ LL +R WN+ + Y+ SDC+++Q I +H + + T A AG D C
Sbjct: 288 ANKYLLQTILRDHWNWTSENQYVTSDCEAVQDISLNHHYAS-TNAAGTALAFNAGTDSSC 346
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHI 368
GYFDGS Y +LG +++ PQ
Sbjct: 347 ----------------------------------EAGYFDGSKALYSSLGWSDVNTPQAQ 372
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+LA +A GIV+LKND G LPL + +A++G A+ + + G Y G +P+
Sbjct: 373 QLALQATVDGIVMLKND-GTLPLKLDSKSKVAMIGFWASDSSKLQGGYSGKAPYLRTPV- 430
Query: 429 GFYAYSKVINYAPGCADIVCQNNS-----MIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA ++ + + P A Q ++ A+ AA +D + GLD S AEG D
Sbjct: 431 --YA-AQQLGFTPNVATGPVQQSASATDNWTTNALAAASKSDYILYFGGLDTSAAAEGVD 487
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R L P Q LI K+ +A G ++I +D N + SILW +PG++GG
Sbjct: 488 RTSLEWPSAQLALIKKL--SALGKPLIIIQEGDQMDNTPLLTNKGVSSILWASWPGQDGG 545
Query: 544 RAIADVIFGKYNPGG 558
A+ +I G +P G
Sbjct: 546 PAVMQIISGAKSPAG 560
>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Taeniopygia guttata]
Length = 685
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 233/698 (33%), Positives = 352/698 (50%), Gaps = 102/698 (14%)
Query: 48 VPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPG-ATSFPTVILTTASFNESLW 106
+PRLG+ Y W +E L G D E PG AT+FP + A+F+ L
Sbjct: 9 IPRLGIAPYNWNTECLRG----------------DGEAPGWATAFPQALGLAAAFSPELI 52
Query: 107 KKIGQTVSTEARAMYNLGNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVG 158
++ +TE RA +N A GL+ +SP +N++R P WGR ET GEDP++ G
Sbjct: 53 YRVANATATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSG 112
Query: 159 RYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR-FHFDSRVTEQ 217
A ++V+GLQ R +K SA CKH++ + G++ + V E+
Sbjct: 113 ELARSFVQGLQG---------PHPRYVKASAGCKHFSVHG-----GHENILLYLLTVLER 158
Query: 218 DMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCD 277
D + TF+ F+ CV G S MCSYNR+NG+P CA+ KLL +RG+W F GY+VSD
Sbjct: 159 DWRMTFLPQFQACVRAGSYS-FMCSYNRINGVPACANKKLLTDILRGEWGFDGYVVSDEG 217
Query: 278 SIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM----GAVQQGKIAEADIDT 333
+++ I+ H + E AVA V AG +L+ N A+ G I +
Sbjct: 218 AVELIMLGHHYTRSFLETAVASV-NAGCNLELSYGMRNNVFMRIPEALAMGNITLQMLRD 276
Query: 334 SLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
+R L+ MRLG FD Y +L + + +P+H L+ EAA + VLLKN G LPL
Sbjct: 277 RVRPLFYTRMRLGEFDPPAMNPYSSLDLSVVQSPEHRNLSLEAAVKSFVLLKNVRGTLPL 336
Query: 392 NTGNIKT--LALVGPHANATKAMIGNYEGTP-CRYT-SPMDGFYAYSKVINYAPGCADIV 447
++ + LA+VGP A+ + + G+Y P RY +P G +++A GC++
Sbjct: 337 KAQDLSSQHLAVVGPFADNPRVLFGDYAPVPEPRYIYTPRRGLEMLGANVSFAAGCSEPR 396
Query: 448 CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG- 506
CQ S + AD ++ G + VE E KDR DL LPG Q EL+ AA G
Sbjct: 397 CQRYSRA-ELVKVVGAADVVLVCLGTGVDVETEAKDRSDLSLPGHQLELLQDAVQAAAGR 455
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGK--YNPGGRLPITW 564
PV L++ +AG +D+++A+ + + +IL +P + G AIA V+ G+ +P GRLP TW
Sbjct: 456 PVILLLFNAGPLDVSWAQAHDGVGAILACFFPAQATGLAIARVLLGEAGASPAGRLPATW 515
Query: 565 YEANYVKIPYTSMPLRPVNNF--PGRTYKFF--DGPVVYPFGYGLSYTQFKYKVASSPKS 620
A ++P P+ N+ GRTY+++ + P +YPFGYGLSYT F+Y+
Sbjct: 516 -PAGMHQVP-------PMENYTMEGRTYRYYGQEAP-LYPFGYGLSYTTFRYR------- 559
Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
D+ L PP + C + + + +EN G D EVV +
Sbjct: 560 -DLVLS----------------PPVLPL------CAN--LSVSVVLENTGLRDSEEVVQL 594
Query: 681 YSKPPGIAGTHIK-QVIGYERVFIAAGQSAKVGFTMNA 717
Y + + + Q++ + RV + AG+ AK+ F + A
Sbjct: 595 YLRWEHSSVPVPRWQLVAFRRVAVPAGREAKLSFQVLA 632
>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 323
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 150/290 (51%), Positives = 198/290 (68%), Gaps = 16/290 (5%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+CD L +RA DLV R+T EK+ Q+GD A GVPRLG+P Y+WW+EALHG++ G+
Sbjct: 46 FCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLGVPGYKWWNEALHGLATSGK-- 103
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLTFWS 132
G HFD+ V ATSFP V+LT A+F++ LW +IGQ + EARA++N+G A GLT WS
Sbjct: 104 ----GLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREARALFNVGQAEGLTIWS 159
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PN+N+ RDPRWGR ETPGEDP V RYA+ +VRG+Q +S S L+ SACCK
Sbjct: 160 PNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG--------NSSSSLLQTSACCK 211
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H AYDL++W G R+ F +RVTEQD+++TF PF CV E S VMC+Y +NG+P C
Sbjct: 212 HATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYTAINGVPAC 271
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
A+ LL T+RGDW GY+ SDCD++ + ++ ++ T EDAVA LK
Sbjct: 272 ANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYA-PTPEDAVAVSLK 320
>gi|389636381|ref|XP_003715843.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351648176|gb|EHA56036.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|440480767|gb|ELQ61414.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 517
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 187/502 (37%), Positives = 270/502 (53%), Gaps = 29/502 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L PERA LVE +++ EK+Q + + G PR+GLP Y WWSEALHGV++
Sbjct: 35 LSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEALHGVAY 94
Query: 69 IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
PGT+F + E +TS+P +L A F+++L +KIG + EARA N G
Sbjct: 95 A-------PGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGW 147
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
AG +W+PN+N +DPRWGR ETPGED + RYA RGL E R
Sbjct: 148 AGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQRR------- 200
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
I + CKHYA D ++W G R F++++T QD+ E ++ PF+ C + V S+MC+YN
Sbjct: 201 -IISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNA 259
Query: 246 VNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
VNG+P+CA+ LL +R W + + Y+ SDC+++ + +H + T A +
Sbjct: 260 VNGVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYA-PTNAAGTAICFE 318
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNN 361
AG+D C ++ GA QG + E +D +L LY L+R GYFDG Y +L +
Sbjct: 319 AGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAMYADLDWQH 378
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ + + LA +AA +G+VLLKN NG LPL+ +A++G A+A + + G Y G
Sbjct: 379 VNSAEAQSLALQAAVEGMVLLKN-NGTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAH 437
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNS---MIPAAIDAAKNADATVIVAGLDLSVE 478
SP F A ++ ++ NN+ A++AA AD + GLD S
Sbjct: 438 HLYSP--AFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASGADYILYFGGLDTSAA 495
Query: 479 AEGKDRVDLLLPGFQTELINKV 500
E DR DL P Q L+ V
Sbjct: 496 GETLDRTDLDWPEAQLTLVKVV 517
>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
Length = 534
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 195/565 (34%), Positives = 296/565 (52%), Gaps = 93/565 (16%)
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVN--------- 232
RP +I+A CKH AAY L+ DRF+F + + D + T++ F+ CV+
Sbjct: 7 RP-RIAATCKHLAAYSLET----DRFNFSADGIDRTDWEGTYLPAFDACVHAERFLLEHY 61
Query: 233 ----------EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI 282
+ VMCSYN ++G+P CADP LL +R DWNF G +VSDC ++ I
Sbjct: 62 NASGGGGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWAVDNI 121
Query: 283 VESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVL 342
+H+F+ + E+AV L++G+DLDCG+ + +F A + + E DID +L L+ VL
Sbjct: 122 HSNHRFVA-SYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLFRVL 180
Query: 343 MRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN-----DNGALPLNTGNIK 397
M LGYFD + + K++ +H +LA EAA Q IVLLKN + G LPL+ K
Sbjct: 181 MDLGYFDETDEPD--AKSSDDEMEHDQLALEAALQSIVLLKNGINEDEPGPLPLSLAKHK 238
Query: 398 TLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAA 457
+AL GP A+ ++GNY G P +P+ G + + + VC +
Sbjct: 239 EIALFGPLADNQTVLLGNYHGLPSTIVTPLMGLAKMGVEVAFRQRAS--VCDFH------ 290
Query: 458 IDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG---PVTLVIMS 514
AT++V GLD S+EAE +DR LLLP Q +LI ++ +K PV LV++S
Sbjct: 291 -----GESATILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLPVVLVVVS 345
Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIP 573
G VD++ KN+ I +++ + YPG+ GG A+A V++G YNP G+L T Y +Y+ ++
Sbjct: 346 GGMVDLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNPSGKLVGTMYPESYLNEVS 405
Query: 574 YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
M +RP FPGRT++++ G V+YPFGYGLSYT F+Y
Sbjct: 406 LHDMRMRPDGKFPGRTHRYYRGDVIYPFGYGLSYTSFRYA-------------------- 445
Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-- 691
+ + GT K + V N G MDGS V+++ P
Sbjct: 446 MEFLGGTVK---------------------VTVSNSGSMDGSVAVLLFHSAPQAGNEQEP 484
Query: 692 IKQVIGYERVFIAAGQSAKVGFTMN 716
+ +IG+E+++++ G S V F ++
Sbjct: 485 FRSLIGFEKIYVSVGDSQLVSFDVS 509
>gi|440476402|gb|ELQ45004.1| beta-xylosidase, partial [Magnaporthe oryzae Y34]
Length = 515
Score = 319 bits (817), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 186/499 (37%), Positives = 269/499 (53%), Gaps = 29/499 (5%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
LS CD L PERA LVE +++ EK+Q + + G PR+GLP Y WWSEALHGV++
Sbjct: 35 LSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEALHGVAY 94
Query: 69 IGRRTNSPPGTHF---DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
PGT+F + E +TS+P +L A F+++L +KIG + EARA N G
Sbjct: 95 A-------PGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGW 147
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
AG +W+PN+N +DPRWGR ETPGED + RYA RGL E R
Sbjct: 148 AGFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQRR------- 200
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
I + CKHYA D ++W G R F++++T QD+ E ++ PF+ C + V S+MC+YN
Sbjct: 201 -IISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNA 259
Query: 246 VNGIPTCADPKLLNQTIRGDWNF---HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
VNG+P+CA+ LL +R W + + Y+ SDC+++ + +H + T A +
Sbjct: 260 VNGVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYA-PTNAAGTAICFE 318
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNN 361
AG+D C ++ GA QG + E +D +L LY L+R GYFDG Y +L +
Sbjct: 319 AGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAMYADLDWQH 378
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ + + LA +AA +G+VLLKN NG LPL+ +A++G A+A + + G Y G
Sbjct: 379 VNSAEAQSLALQAAVEGMVLLKN-NGTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAH 437
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNS---MIPAAIDAAKNADATVIVAGLDLSVE 478
SP F A ++ ++ NN+ A++AA AD + GLD S
Sbjct: 438 HLYSP--AFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASGADYILYFGGLDTSAA 495
Query: 479 AEGKDRVDLLLPGFQTELI 497
E DR DL P Q L+
Sbjct: 496 GETLDRTDLDWPEAQLTLV 514
>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 901
Score = 318 bits (816), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 185/448 (41%), Positives = 254/448 (56%), Gaps = 37/448 (8%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D + + +RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 33 PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 91 --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHAR 136
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ EG +++ P
Sbjct: 137 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEP 195
Query: 185 L-KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ A KH+A + DR HFD+R +++D+ ET++ FE V EG V +VM +Y
Sbjct: 196 YRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAY 252
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRV G A LL +R W F GY+VSDC +I I + HK + T+E A A +K
Sbjct: 253 NRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKH 311
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G +L+CG+ Y AV+QG I EA IDT+L+ L MRLG FD G + + +
Sbjct: 312 GTELECGEEYATLP-AAVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASV 370
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 371 NQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPA 429
Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIV 447
+ + G A + + YA G AD+V
Sbjct: 430 APVTVLQGIRAAAPNAQVLYARG-ADLV 456
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 151/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++AD V V GL VE E G DR DL LP Q +L+ + A
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGR 686
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y ++LD
Sbjct: 745 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 790
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ T+ T D T + V+N G+ G EVV +Y P
Sbjct: 791 RT--------TIAT----------------DGSLTATVTVKNTGQRAGDEVVQLYLHPLA 826
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
K++ G++R+ + G+ ++GFT+NA +L++ D + + GA+ + +G
Sbjct: 827 PQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIG 884
>gi|256393466|ref|YP_003115030.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
gi|256359692|gb|ACU73189.1| glycoside hydrolase family 3 domain protein [Catenulispora
acidiphila DSM 44928]
Length = 1343
Score = 318 bits (816), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 248/794 (31%), Positives = 367/794 (46%), Gaps = 119/794 (14%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQM-GDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
Y D + ERA DLV RMTLPEK Q+ + A +PRLG+ Y +WSE HGV+ +G
Sbjct: 49 YLDTHYSFAERAADLVSRMTLPEKAAQLQTNSAPAIPRLGVQEYTYWSEGQHGVNTLGAD 108
Query: 73 TNSPPGTHFDSEVPG---ATSFPTVILTTASFNESLWKKIGQTVSTEARAMY-------- 121
+N +V G ATSFP T S++ +L K VS E R
Sbjct: 109 SNR-------GDVTGGVHATSFPVNFAATMSWDPALTYKETTAVSDEVRGFLDKSLWGTG 161
Query: 122 --NLGNAG-----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
NLG + LTFW+PN+N+ RDP WGR E+ GEDPY+ A +V G Q G
Sbjct: 162 QNNLGPSASDYGALTFWAPNVNMDRDPLWGRTNESFGEDPYLTSTMAGAFVDGYQ---GQ 218
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
+ LK++A KHY+ L+N E + R S T+ ++++ + F V +
Sbjct: 219 SMTGQQQTPYLKVAATAKHYS---LNNIE-DSRHTGSSDTTDANIRDYYTKQFASLVRDA 274
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI--VESHKFL--- 289
VS +M SYN VNG P+ AD +++ ++ + F GY SDC +I + SH +
Sbjct: 275 HVSGIMTSYNAVNGTPSPADTYTVDELLQATYGFAGYTTSDCGAIGDVYGAASHGWAPPG 334
Query: 290 ---NDTK--EDAVAR-----------VLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADI 331
N T +A R ++AG L+C G+ A+ G ++ +
Sbjct: 335 WTSNGTSWTNNATGRQISAAAGGQAFAIRAGTQLNCAGGEMTAQNISAAIDLGLLSNGVV 394
Query: 332 DTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA- 388
D +L L+ V M G FD G Y + K+ I +P H LA + A IVLL+ NGA
Sbjct: 395 DATLTRLFTVRMETGEFDPAGKVGYTKITKDQIESPAHQALAEQVAANDIVLLQ--NGAV 452
Query: 389 -------LPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAP 441
LP++ ++ +VG AN K +G Y G P + + G A + N +
Sbjct: 453 SGTSAKLLPVDPAKTDSVVIVGDLAN--KVTLGGYSGEPTHEVNAVQGITAAVQAANPSA 510
Query: 442 GCADIVCQNNSMI--PAAIDAA-----KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQT 494
C + I PA+ AA K+A ++VAG DLSV E DR L LPG
Sbjct: 511 TVTFDACGTGTQITTPASCSAATQAAIKSASLVLVVAGSDLSVADEANDRSTLALPGNYD 570
Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
LI++V+ LV+ + G DI A+ + +I++ GY G+ G A+A V+FG+
Sbjct: 571 SLISQVSALGNPRTALVMQADGPYDIQDAQKD--FPAIVFSGYNGQSQGTALAQVLFGQQ 628
Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFP-GRTYKFFDGPVVYPFGYGLSYTQFKYK 613
NP G L TWY + P + L P GRTY++F G YPFGYG SY+ F Y
Sbjct: 629 NPAGHLDFTWYSGDSQLAPMDNYGLTPSQTGGLGRTYQYFTGTPTYPFGYGQSYSSFAYS 688
Query: 614 -VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKM 672
V P++ + D +V+N G +
Sbjct: 689 HVQVGPQNTN---------------------------------ADGTVHVSFDVKNTGTV 715
Query: 673 DGSEVVMVYSKPPGIAGTH---IKQVIGYERV-FIAAGQSAKVGFTMNACKSLKIVDNAA 728
G+ V +Y+ PPG AGT+ +Q+ G+++ + GQS + ++ +++
Sbjct: 716 AGTTVAQLYAAPPG-AGTNDTTREQLAGFQKTNTLKPGQSQHISLSVKVSSLSTWDESSL 774
Query: 729 NSLLASGAHTILVG 742
++A GA+ VG
Sbjct: 775 KQVVADGAYQFRVG 788
>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 412
Score = 318 bits (816), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 157/297 (52%), Positives = 198/297 (66%), Gaps = 13/297 (4%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
P+C+ KLP +RA DLV RMT EK Q+GD+A GVPRLG+P Y+WW+EALHGV+ G+
Sbjct: 96 LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGK 155
Query: 72 RTNSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-GLT 129
G H D V ATSFP V+LT ASFN++LW +IGQ EARA YN+G A GLT
Sbjct: 156 ------GIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYNIGQAEGLT 209
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSPN+N+ RDPRWGR ETPGEDP V RYA +VRGLQ G + S L SA
Sbjct: 210 MWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQ---GSSSNTKSVPPVLLTSA 266
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
CCKH AYDL++W+G R+ F + VT QD+ +TF PF CV +G S VMC+Y VNG+
Sbjct: 267 CCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGV 326
Query: 250 PTCADPKLLNQTIRGDWNFHG-YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
P+CA+ LL +T RG W G Y+ +DCD++ +I+ + +F T ED VA LKAG+
Sbjct: 327 PSCANADLLTKTFRGSWGLDGRYVAADCDAV-SIMRNSQFYRPTAEDTVATTLKAGM 382
>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 902
Score = 318 bits (816), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 184/447 (41%), Positives = 255/447 (57%), Gaps = 37/447 (8%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D + + +RA DLV RMTL EK QM + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LGN 125
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 92 -------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 138
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ EG + +++ P
Sbjct: 139 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGEPY 197
Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + DR HFD+R +++D+ ET++ FE V +G V +VM +YN
Sbjct: 198 RKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYN 254
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R W F GY+VSDC +I I + HK + T+E A A +K G
Sbjct: 255 RVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 313
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
+L+CG+ Y+ AV QG I EA IDT+L+ L MRLG FD G + + +
Sbjct: 314 TELECGEEYSTLP-AAVHQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVN 372
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 373 QSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAA 431
Query: 423 YTSPMDGFYAYS--KVINYAPGCADIV 447
+ + G A + + YA G AD+V
Sbjct: 432 PVTVLQGIRAAAPNAQVLYARG-ADLV 457
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 94/298 (31%), Positives = 145/298 (48%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++AD V V GL VE E G DR DL LP Q +L+ + K
Sbjct: 629 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 688 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y ++LD
Sbjct: 746 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 791
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ D T + V+N G+ G EVV +Y P
Sbjct: 792 R------------------------TTIAADGSLTATVTVKNTGQRAGDEVVQLYLHPLT 827
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K++ G++R+ + G+ + FT++A +L+I D + GA+ + +G
Sbjct: 828 PQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRIYDAQRKAYAVDPGAYEVQIG 885
>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 914
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 184/447 (41%), Positives = 257/447 (57%), Gaps = 37/447 (8%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D++ + +RA DLV RMTL EKV QM + A +PRLG+P Y+WW+E LHGV+ G
Sbjct: 34 YLDSQRTFAQRADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG--- 90
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------N 125
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 91 -------------GATVFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRY 137
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+G+Q EG + +++
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGMQG-EGADAPKNAQGETY 196
Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + + ++R HFD+R +++D+ ET++ FE V EG V +VM +YN
Sbjct: 197 RKLDATAKHFAVH---SGPESERHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYN 253
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
R+ G A LL +R W FHGY+VSDC +I I ++HK + T+E A A +K G
Sbjct: 254 RLFGESASASKFLLRDVLRERWGFHGYVVSDCWAIVDIWKNHKIVA-TREQAAALAVKNG 312
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
L+CG Y AVQQG I E DID +LR L MRLG FD G ++ L +
Sbjct: 313 TQLECGQEYATLP-AAVQQGLIGETDIDAALRTLMTARMRLGMFDPPGQLRWAQLPISVN 371
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+P+H LA AR+ +VLLKND G LPL+ K +A++GP A+ T A++GNY GTP
Sbjct: 372 QSPEHDALARRTARESLVLLKND-GLLPLSRAKHKRIAVIGPTADDTMALLGNYYGTPAT 430
Query: 423 YTSPMDGFYAYSKVIN--YAPGCADIV 447
+ + G A + + YA G AD+V
Sbjct: 431 PVTILQGIRAAAPDADVLYARG-ADLV 456
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 147/284 (51%), Gaps = 56/284 (19%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A+ AD V V GL VE E G DR DL LP Q EL+ ++ K
Sbjct: 628 ALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALSATGK- 686
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+ADV+FG NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTTGSALAIDWAQEH--VPAILLAWYPGQRGGSAVADVLFGDTNPGGRLPVTFYK 744
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
A+ + +R GRTY++F G +YPFG+GLSYTQF Y D++LD
Sbjct: 745 ASETLPAFDDYAMR------GRTYRYFAGTPLYPFGHGLSYTQFAYS--------DLRLD 790
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ + D + + ++V N G G EVV +Y P
Sbjct: 791 RRK------------------------VAADGQLSATLKVTNTGTRAGDEVVQLYLHP-- 824
Query: 687 IAGTH---IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
+A T IK++ G++R+ +A G+S V FT++ L+I D A
Sbjct: 825 LAPTRARAIKELRGFQRIALAPGESRDVHFTISPQTDLRIYDEA 868
>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 317 bits (812), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 229/743 (30%), Positives = 353/743 (47%), Gaps = 110/743 (14%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQM--GDLAYGVP-----RLGLPLYEWWSEAL 63
++P+ + LP+ R DLV R+TL EK+QQ+ G A P RLG+ + W SE +
Sbjct: 33 EYPFRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVWGSECV 92
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
G +G N P GT +FP + A+F+ +L K+ T++ E RA N
Sbjct: 93 TG---LGTDGNDPHGT----------AFPQPLGMAATFDPALLKRAAGTIALELRAQRNF 139
Query: 124 G--------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
+ GL+ WSP +N+ R P WGR ET GE P + A ++V G+Q
Sbjct: 140 DRENGVVKFHHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQG----- 194
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNE 233
+ +R +A CKH LD + G D R+ FD+ V++ D+ TF++ FE C
Sbjct: 195 ----NHTRYYAAAAACKH-----LDVYGGPDNLRYVFDADVSQADLTGTFLMAFEECAAA 245
Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
G V MCSYN + G+P CA+ + + R W F GY+VSD ++ I ESH + +
Sbjct: 246 G-VMGYMCSYNSIRGVPACANYRTMTFFAREQWGFEGYVVSDQGAVFRITESHNYTANQT 304
Query: 294 EDAVARVLKAGLDLDCGD-----YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
AVA L AG D++ D Y N ++ A+ A ID S+ L+ V MRLG F
Sbjct: 305 LGAVA-ALNAGCDMEDSDDAQHVAYYNLSL-ALDLKLTDMATIDASVSRLFYVRMRLGEF 362
Query: 349 DGSPQ---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLN-TGNIKTLALVGP 404
D P+ +++L + + +P H+E+A + A IVLLKN N LPL+ + L+GP
Sbjct: 363 D-PPENDPWRSLNMSIVSSPAHVEMARDVATASIVLLKNQNETLPLSAAAKNASYCLLGP 421
Query: 405 HANATKAMIGNY--EGTPCRYTSPMDGFYA------YSKVINYAPGCADIVCQNNSMIPA 456
A+ M+G Y G+ + G A + Y GC C
Sbjct: 422 FADNADLMMGKYSPHGSTNVTVTYRAGLAAALQNASQTASFQYLEGCTGPFCDGLDTAAV 481
Query: 457 AIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA--AKGPVTLVIMS 514
+ D ++ G VE+E DR ++ PG Q L+ V +A K + L++ +
Sbjct: 482 TTFIQQGCDTVLLAVGTSYHVESESLDRSNMSFPGAQPTLVQTVLEALGTKQRLVLLVST 541
Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
AG VD+ + + ++ +IL + Y G+ G A+AD++ G+ +P GRLP +W P
Sbjct: 542 AGPVDLAALEQDTRVAAILDLIYLGQTAGTALADILLGETSPSGRLPFSW--------PN 593
Query: 575 TSMPLRPVNNFP--GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
+ P++++ GRTY+F V++PFGYGLSYTQF ++P +
Sbjct: 594 KVSDVPPIDDYTMQGRTYRFAQADVLFPFGYGLSYTQFNLSHLAAPYIL----------- 642
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
P C A+ + + V N G++ G+ + VY + P G I
Sbjct: 643 ----------PVCQALRLS------------VNVTNTGRLSGAIPLQVYVEWPNAVGGPI 680
Query: 693 KQVIGYERVFIAAGQSAKVGFTM 715
+Q+ RVF+ A S V ++
Sbjct: 681 RQLATTTRVFVDAASSKTVQLSI 703
>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 901
Score = 316 bits (810), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 184/448 (41%), Positives = 253/448 (56%), Gaps = 37/448 (8%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D + + RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 33 PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 91 --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHAR 136
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ EG +++ P
Sbjct: 137 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEP 195
Query: 185 L-KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ A KH+A + DR HFD+R +++D+ ET++ FE V +G V +VM +Y
Sbjct: 196 YRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAY 252
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRV G A LL +R W F GY+VSDC +I I + HK + T+E A A +K
Sbjct: 253 NRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKH 311
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G +L+CG+ Y AV+QG I EA IDT+L+ L MRLG FD G + + +
Sbjct: 312 GTELECGEEYATLP-AAVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASV 370
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 371 NQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPA 429
Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIV 447
+ + G A + + YA G AD+V
Sbjct: 430 APVTVLQGIRAAAPKAQVLYARG-ADLV 456
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 151/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++AD V V GL VE E G DR DL LP Q +L+ + A
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGR 686
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y ++LD
Sbjct: 745 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 790
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ T+ T D T + V+N G+ G EVV +Y P
Sbjct: 791 RT--------TIAT----------------DGSLTATVTVKNTGQRAGDEVVQLYLHPLA 826
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
K++ G++R+ + G+ ++GFT+NA +L++ D + + GA+ + +G
Sbjct: 827 PQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIG 884
>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 902
Score = 315 bits (808), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 183/447 (40%), Positives = 255/447 (57%), Gaps = 37/447 (8%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D + + +RA DLV RMTL EK QM + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LGN 125
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 92 -------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 138
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GL+ EG + +++ P
Sbjct: 139 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLRG-EGADAPKNAQGEPY 197
Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + DR HFD+R +++D+ ET++ FE V +G V +VM +YN
Sbjct: 198 RKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYN 254
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R W F GY+VSDC +I I + HK + T+E A A +K G
Sbjct: 255 RVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 313
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
+L+CG+ Y+ AV+QG I EA IDT+L L MRLG FD G + + +
Sbjct: 314 TELECGEEYSTLP-AAVRQGLIDEAQIDTALTTLMTARMRLGMFDPPGQLPWSTIPASVN 372
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 373 QSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAA 431
Query: 423 YTSPMDGFYAYS--KVINYAPGCADIV 447
+ + G A + + YA G AD+V
Sbjct: 432 PVTVLQGIRAAAPNAQVLYARG-ADLV 457
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 94/298 (31%), Positives = 144/298 (48%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A +AD V V GL VE E G DR DL LP Q +L+ + K
Sbjct: 629 ALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 688 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y ++LD
Sbjct: 746 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 791
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ D T + V+N G+ G EVV +Y P
Sbjct: 792 R------------------------TTIAADGSLTATVTVKNTGQRAGDEVVQLYLHPLT 827
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K++ G++R+ + AG+ + F ++A +L+I D + GA+ + +G
Sbjct: 828 PQRERAGKELHGFQRITLQAGEQRALHFILDAKNALRIYDAQRKAYAVDPGAYEVQIG 885
>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 901
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 185/448 (41%), Positives = 251/448 (56%), Gaps = 37/448 (8%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D + + RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 33 PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 91 --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHAR 136
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ EG +++ P
Sbjct: 137 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEP 195
Query: 185 L-KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ A KH A + DR HFD+R +++D+ ET++ FE V EG V +VM +Y
Sbjct: 196 YRKLDATAKHLAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAY 252
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRV G A LL +R W F GY+VSDC +I I + HK + T+E A A +K
Sbjct: 253 NRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKH 311
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G +L+CG+ Y AV+QG I EA IDT+L+ L MRLG FD G + + +
Sbjct: 312 GTELECGEEYATLP-AAVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASV 370
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+P H LA AR+ +VLLKND G LPL+ K +A++GP A+ T A++GNY GTP
Sbjct: 371 NQSPAHDALARRTARESLVLLKND-GLLPLSRAKFKRIAVIGPTADDTMALLGNYYGTPA 429
Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIV 447
+ + G A + + YA G AD+V
Sbjct: 430 APVTVLQGIRAAAPNAQVLYARG-ADLV 456
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 150/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++AD V V GL VE E G DR DL LP Q +L+ + A
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGR 686
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y ++LD
Sbjct: 745 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 790
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ T+ T D + V+N G+ G EVV +Y P
Sbjct: 791 RT--------TIAT----------------DGSLAATVTVKNTGQRAGDEVVQLYLHPLA 826
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
K++ G++R+ + G+ ++GFT+NA +L++ D + + GA+ + +G
Sbjct: 827 PQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIG 884
>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 896
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 192/460 (41%), Positives = 256/460 (55%), Gaps = 45/460 (9%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D +LP+ RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 39 PYLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGVARAG-- 96
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
GAT FP I A+F+ L ++ +S EARA ++ A
Sbjct: 97 --------------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKR 142
Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ +G Y
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG-PYR------- 194
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KHYA + + DR HFD +E+D+ ET++ F+ V EG V++VM +YN
Sbjct: 195 -KLDATAKHYAVH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGHVAAVMGAYN 250
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RVNG A + L +R DW F GYIVSDC +I+ I ++HK + T E A A +K G
Sbjct: 251 RVNGESASASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHG 308
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
DLDCGD Y AV+ G I EA IDTSL+ L MRLG FD + + + +
Sbjct: 309 TDLDCGDTYAALPK-AVRAGLIDEATIDTSLKRLMTTRMRLGMFDPPAKVAWAQIPASVN 367
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+PQH LA AR+ +VLLKND G LPL +K +A+VGP A+ +++GNY GTP
Sbjct: 368 QSPQHDALARRTARESLVLLKND-GLLPLKP-TLKRIAVVGPTADDPMSLLGNYYGTPAA 425
Query: 423 YTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
+ + G A + YA G + + + A IDA
Sbjct: 426 PVTILQGIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 98/298 (32%), Positives = 149/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+DAA+NA+ V V GL VE E G DR D LP Q EL+ + A
Sbjct: 623 AVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGT 681
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ +++A+ + + +IL YPG+ GG A+ DV+FG+ +PGGRLPIT+Y+
Sbjct: 682 PVVAVLTTGSALAVDWAQQH--VPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPITFYK 739
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ +R GRTY++F G +YPFG+GLSYTQF Y D++LD
Sbjct: 740 EAERLPAFDDYAMR------GRTYRYFTGTALYPFGHGLSYTQFAYS--------DLRLD 785
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ T+G D ++V N GK G EVV +Y P
Sbjct: 786 RT--------TLGA----------------DGTLRATLKVRNTGKRAGDEVVQLYLHPLD 821
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
K++ G++R+ + G+ +V FT+ A +L+I D + + GA+ + +G
Sbjct: 822 PKRERAGKELRGFQRMTLQPGEQREVAFTLKAADALRIYDEQRKTYAVDPGAYEVQIG 879
>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 902
Score = 315 bits (807), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 184/447 (41%), Positives = 249/447 (55%), Gaps = 35/447 (7%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D + + +RA DLV RMTL EK QM + A +PRLG+ Y+WW+EALHGV+ G
Sbjct: 34 PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 92 --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHER 137
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +VRGLQ G
Sbjct: 138 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESY 197
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + DR HFD+R +++D+ ET++ FE V +G V +VM +YN
Sbjct: 198 RKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYN 254
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R W F GY+VSDC +I I + HK + T+E A A +K G
Sbjct: 255 RVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 313
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
+L+CG+ Y+ AV QG I EA IDTSL+ L MRLG FD G + + +
Sbjct: 314 TELECGEEYSTLP-AAVHQGLIEEAQIDTSLQTLMTARMRLGMFDPPGQLPWSKIPASVN 372
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 373 QSPAHDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAA 431
Query: 423 YTSPMDGFYAYS--KVINYAPGCADIV 447
+ + G A + + YA G AD+V
Sbjct: 432 PVTVLQGIRAAAPNAQVLYARG-ADLV 457
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 98/298 (32%), Positives = 150/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++A+ V V GL VE E G DR DL LP Q EL+ + K
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 688 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ ++P GRTY++F G +YPFG+GLSYTQF Y D++LD
Sbjct: 746 ES------EALPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYS--------DLRLD 791
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
++ TV D FT + V+N G+ G EV +Y P
Sbjct: 792 RN--------TV----------------AADGSFTATVTVKNTGQRAGDEVAQLYLHPLT 827
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K++ G++RV + G+ ++ F +NA ++L+I D + GA+ + +G
Sbjct: 828 PQRERAGKELRGFQRVALHPGEQRELRFPINAKEALRIYDEQRKTYTVDPGAYEVQIG 885
>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 902
Score = 315 bits (807), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 184/447 (41%), Positives = 249/447 (55%), Gaps = 35/447 (7%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D + + +RA DLV RMTL EK QM + A +PRLG+ Y+WW+EALHGV+ G
Sbjct: 34 PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 92 --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHER 137
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +VRGLQ G
Sbjct: 138 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESY 197
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + DR HFD+R +++D+ ET++ FE V +G V +VM +YN
Sbjct: 198 RKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYN 254
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R W F GY+VSDC +I I + HK + T+E A A +K G
Sbjct: 255 RVYGESASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 313
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
+L+CG+ Y+ AV QG I EA IDTSL+ L MRLG FD G + + +
Sbjct: 314 TELECGEEYSTLP-AAVHQGLIEEAQIDTSLQTLMTARMRLGMFDPPGQLPWSKIPASVN 372
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 373 QSPAHDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAA 431
Query: 423 YTSPMDGFYAYS--KVINYAPGCADIV 447
+ + G A + + YA G AD+V
Sbjct: 432 PVTVLQGIRAAAPNAQVLYARG-ADLV 457
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 98/298 (32%), Positives = 150/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++A+ V V GL VE E G DR DL LP Q EL+ + K
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 688 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ ++P GRTY++F G +YPFG+GLSYTQF Y D++LD
Sbjct: 746 ES------EALPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYS--------DLRLD 791
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
++ TV D FT + V+N G+ G EV +Y P
Sbjct: 792 RN--------TV----------------AADGSFTATVTVKNTGQRAGDEVAQLYLHPLT 827
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K++ G++RV + G+ ++ F +NA ++L+I D + GA+ + +G
Sbjct: 828 PQRERAGKELRGFQRVALHPGEQRELSFPINAKEALRIYDEQRKTYTVDPGAYEVQIG 885
>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 901
Score = 315 bits (807), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 184/448 (41%), Positives = 254/448 (56%), Gaps = 37/448 (8%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D + + RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 33 PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 91 --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHAR 136
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD-SR 183
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ EG + +++ R
Sbjct: 137 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGER 195
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ A KH+A + DR HFD+R +++D+ ET++ FE V +G V +VM +Y
Sbjct: 196 YRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAY 252
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRV G A LL +R W F GY+VSDC +I I + HK + T+E A A +K
Sbjct: 253 NRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKH 311
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G +L+CG+ Y AV+QG I EA IDT+L+ L MRLG FD G + + +
Sbjct: 312 GTELECGEEYATLP-AAVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASV 370
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 371 NQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPA 429
Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIV 447
+ + G A + + YA G AD+V
Sbjct: 430 APVTVLQGIRAAAPNAQVLYARG-ADLV 456
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 151/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++AD V V GL VE E G DR DL LP Q +L+ + K
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 686
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTAGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y ++LD
Sbjct: 745 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 790
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ T+ T D T + V+N G+ G EVV +Y P
Sbjct: 791 RT--------TIAT----------------DGSLTATVTVKNTGQRAGDEVVQLYLHPLA 826
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
K++ G++R+ + G+ ++GFT+NA +L++ D + + GA+ + +G
Sbjct: 827 PQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYDEQRKAYGVDPGAYEVQIG 884
>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
purpuratus]
Length = 623
Score = 314 bits (805), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 208/614 (33%), Positives = 319/614 (51%), Gaps = 69/614 (11%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGD-------LAYGVPRLGLPLYEWWSEA 62
S P+ + LP+ +R DL+ R+ + + Q+ A + RL + Y W +E
Sbjct: 28 SQLPFWNQSLPWDQRLDDLLSRLKVDDMTYQLARGGADPNGPAPAIGRLQIGKYVWNTEC 87
Query: 63 LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
L G D++ AT+FP + +A+F+ L ++ E RA YN
Sbjct: 88 LRG----------------DAQAGNATAFPQALGLSAAFSRDLLFEVANATGYEVRAKYN 131
Query: 123 L--------GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
+ GL +SP IN++R P WGR ET GEDPY+ G A ++V GLQ
Sbjct: 132 YYLQKGDFNNHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQG---- 187
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
+ R L +A CKH+AAY + RF FD++V+++D+Q TF F+ C+ G
Sbjct: 188 -----NHPRYLLTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFPAFKECIKAG 242
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
SVMCSYN VNGIP CA+ LLN +R +W F GY+VSD +++ +H + +
Sbjct: 243 -TYSVMCSYNSVNGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELAHNYTTSYLD 301
Query: 295 DAVARVLKAGLDLDCGDY---YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS 351
A+ + LKAG +LD G ++ AV+ G + D+ S+ L+ +RLG FD
Sbjct: 302 TAI-KSLKAGCNLDLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRLRLGEFD-P 359
Query: 352 PQYKNLGKNN----ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
P + K N + +P+H E+A +AA + VL+KND LP+ G I TLA+VGP AN
Sbjct: 360 PDHNPYVKLNVDQVVESPEHQEIALKAALKSFVLVKNDGSTLPIE-GTIHTLAVVGPFAN 418
Query: 408 ATKAMIGNYEGTP-CRY-TSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNAD 465
+K + G+Y P R+ T+ ++G + +A GC C ++A AD
Sbjct: 419 NSKLLFGDYAPNPDPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQ-QGVLNAVTGAD 477
Query: 466 ATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG-PVTLVIMSAGAVDINFAK 524
V+ G + +E+EG DR D+LLPG Q +L+ A A G PV L++ +AG ++I +A
Sbjct: 478 VVVVCLGTGIELESEGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLFNAGPLNITWAL 537
Query: 525 NNPKIKSILWVGYPGEEGGRAIADVIFGK---YNPGGRLPITWYEANYVKIPYTSMPLRP 581
++P +++I+ +P + G A+ ++F NPGGRLP TW P T + P
Sbjct: 538 SSPSVQAIVECFFPAQATGVAL-RMMFQNAPGANPGGRLPSTW--------PATVAQIPP 588
Query: 582 VNNFP--GRTYKFF 593
+ N+ GRTY++F
Sbjct: 589 MENYSMDGRTYRYF 602
>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 754
Score = 314 bits (805), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 241/749 (32%), Positives = 360/749 (48%), Gaps = 112/749 (14%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGD-------LAYGVPRLGL 53
F S KV +FP+ + LP ER +DLV R+T+ E + Q+ A + RLG+
Sbjct: 14 HFASSKVTSEEFPFRNFSLPIEERLEDLVNRLTIEEVILQLSRGGVRDNGPAPAITRLGI 73
Query: 54 PLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTV 113
Y+W +E L G + G AT FP I A+F++ L K+ +TV
Sbjct: 74 GPYQWNTECLRGYAMNG----------------DATCFPQPIGLAATFDQGLIYKMAKTV 117
Query: 114 STEARAMYNL----GN----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYV 165
+ EARA +N GN GL+ +SP IN++R P WGR ET GEDP + A YV
Sbjct: 118 ALEARAKHNNFTKNGNFGDHTGLSCFSPVINILRHPLWGRNQETYGEDPVLTSLMARAYV 177
Query: 166 RGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFIL 225
GLQ D L +A CKH+ AY RF F + V++ D+ TF
Sbjct: 178 TGLQ----------GDEIYLPATAVCKHFVAYGGPENIPTTRFSFSANVSDHDIGTTFYP 227
Query: 226 PFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES 285
F CV+ G VMCSYN +NG+P+CA+P +L T+R ++F GY+VSD ++++ I
Sbjct: 228 AFRECVHAG-AQGVMCSYNAINGVPSCANP-MLETTLRKKFHFDGYVVSDENALENIDLY 285
Query: 286 HKFLNDTKEDAVARVLKAGLDLDCGDY-YTN---FTMGAVQQGKIAEADIDTSLRFLYIV 341
F +K + A L AG+DL+ + TN AV+QG + EA + S + L+
Sbjct: 286 FNF-TKSKLETAAVALNAGVDLELTGFGKTNRYSLLNQAVEQGLVTEAALRRSAKRLFRT 344
Query: 342 LMRLGYFDGSPQYK---NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKT 398
M LG FD P++ N+ + + + H + A E A + VLLKND G LPL K
Sbjct: 345 RMALGEFD-PPEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKND-GILPLKQLYDK- 401
Query: 399 LALVGPHANATKAMIGNY--EGTPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMI 454
+++VGP N ++A+ G+Y E ++SP+ + S V + GC V NN +
Sbjct: 402 VSIVGPFINNSEALTGDYPAEFNLKYFSSPLFAANSLSSSGVARFTTGC---VGTNNQNL 458
Query: 455 PAAI--------DAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG 506
P + +D ++ G VEAE DR D+ LPG Q +LI V A G
Sbjct: 459 PICATYNSTNVKEVVTGSDIVLVTLGTGRGVEAESNDRRDINLPGKQLQLIQDVVKYANG 518
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV +V+ +AG +D+++ N +++ + + G A+ +V+ G NP GRLP TW
Sbjct: 519 PVIVVLFNAGPLDVSWVMGN--TAAVIACHFSAQMTGEAMLEVLTGVVNPAGRLPNTWPA 576
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY-KVASSPKSVDIKL 625
+ P T + RTY++ ++PFGYGLSYT+F Y P ++
Sbjct: 577 SMEQVPPMTDYSMHE------RTYRYSTSSPLFPFGYGLSYTKFWYLDAVVEPTTI---- 626
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
Q+C + P VLI +N G +DG EVV +Y
Sbjct: 627 ---QRC----------QIPVVRVLI----------------QNTGHLDGEEVVQIYMTSK 657
Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGF 713
++Q++ ++RV I AG+ +
Sbjct: 658 KKRDRELLRQLVAFQRVPIKAGEEVSISL 686
>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 343
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 149/335 (44%), Positives = 219/335 (65%), Gaps = 11/335 (3%)
Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
MIGNY G C YT+P+ G Y+K ++ + GC D+ C N AA AA++ADAT++V
Sbjct: 1 MIGNYAGVACGYTTPLQGIRRYAKTVHLS-GCNDVFCNGNQQFNAAEVAARHADATILVM 59
Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
GLD S+EAE +DR LLLPG+Q EL+++VA A++GP LV+MS G +D++FAKN+P+I +
Sbjct: 60 GLDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGA 119
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGR 588
ILWVGYPG+ GG AIADV+FG NPGG+LP+TWY +Y+ K+P T+M +R P +PGR
Sbjct: 120 ILWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHDYLAKVPMTNMGMRADPSRGYPGR 179
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
TY+F+ GPVV+PFG+G+SYT F + + +P+ V + L R N T +N A+
Sbjct: 180 TYRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASLHVSR--NTTGASN-----AI 232
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
+ C+ I+V+N G MDG+ ++V+S PPG + KQ+IG+E+V + G
Sbjct: 233 RVSHANCEALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWSTQKQLIGFEKVHLVTGSQ 292
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+V ++ CK L +VD + +G H + +G+
Sbjct: 293 KRVKIDIHVCKHLSVVDRFGIRRIPNGEHYLYIGD 327
>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 904
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 186/458 (40%), Positives = 258/458 (56%), Gaps = 39/458 (8%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY + + +RA DLV RMTL EK QM + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 36 PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 94 --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHAR 139
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD-SR 183
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ EG + +++ R
Sbjct: 140 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGSDAPKNAQGER 198
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ A KH+A + DR HFD+R +++D+ ET++ FE V +G V +VM +Y
Sbjct: 199 YRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAY 255
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRV G A LL +R W F GY+VSDC +I + + HK + T+E A A +
Sbjct: 256 NRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTH 314
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G +L+CG+ Y+ AV QG I EA IDT+L+ L MRLG FD G + + +
Sbjct: 315 GTELECGEEYSTLP-AAVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASV 373
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 374 NQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPA 432
Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIVCQNNSMIPAA 457
+ + G A + + YA G AD+V N PAA
Sbjct: 433 APVTVLQGIRAAAPNAQVLYARG-ADLVEGRND--PAA 467
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 147/298 (49%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++AD V V GL VE E G DR DL LP Q EL+ + K
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ +++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTAGSALAVDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 747
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ ++P GRTY++F G +YPFG+GLSYTQF Y D++LD
Sbjct: 748 ES------ETLPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYS--------DLRLD 793
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ D T + V+N G+ G EVV +Y P
Sbjct: 794 R------------------------STLTADGALTATVAVKNTGQRAGDEVVQLYLHPLK 829
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K++ G++R+ + GQ ++ FT+NA +L+I D + GA+ + +G
Sbjct: 830 PQRERAGKELRGFQRLALQPGQQRELRFTINAKDALRIYDAQRKAYTVDPGAYEVQIG 887
>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 904
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 186/458 (40%), Positives = 258/458 (56%), Gaps = 39/458 (8%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY + + +RA DLV RMTL EK QM + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 36 PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 94 --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHAR 139
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD-SR 183
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ EG + +++ R
Sbjct: 140 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGSDAPKNAQGER 198
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ A KH+A + DR HFD+R +++D+ ET++ FE V +G V +VM +Y
Sbjct: 199 YRKLDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAY 255
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRV G A LL +R W F GY+VSDC +I + + HK + T+E A A +
Sbjct: 256 NRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTH 314
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G +L+CG+ Y+ AV QG I EA IDT+L+ L MRLG FD G + + +
Sbjct: 315 GTELECGEEYSTLP-AAVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASV 373
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 374 NQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPA 432
Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIVCQNNSMIPAA 457
+ + G A + + YA G AD+V N PAA
Sbjct: 433 APVTVLQGIRAAAPNAQVLYARG-ADLVEGRND--PAA 467
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 98/298 (32%), Positives = 147/298 (49%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++AD V V GL VE E G DR DL LP Q EL+ + K
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 747
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ ++P GRTY++F G +YPFG+GLSYTQF Y D++LD
Sbjct: 748 ES------ETLPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYS--------DLRLD 793
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ D T + V+N G+ G EVV +Y P
Sbjct: 794 R------------------------STLTADGALTATVAVKNTGQRAGDEVVQLYLHPLK 829
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K++ G++R+ + GQ ++ FT+NA +L+I D + GA+ + +G
Sbjct: 830 PQRERAGKELRGFQRLALQPGQQRELRFTINAKDALRIYDAQRKAYTVDPGAYEVQIG 887
>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 904
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 186/458 (40%), Positives = 256/458 (55%), Gaps = 39/458 (8%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D + +RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 36 PYLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 93
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 94 --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHAR 139
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ EG +++ P
Sbjct: 140 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEP 198
Query: 185 L-KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ A KH+A + E R HFD+R +++D+ ET++ FE V +G V +VM +Y
Sbjct: 199 YRKLDATAKHFAVHSGPEAE---RHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAY 255
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NRV G A LL +R W F GY+VSDC +I + + HK + T+E A A +
Sbjct: 256 NRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTH 314
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G +L+CG+ Y+ AV QG I EA IDT+L+ L MRLG FD G + + +
Sbjct: 315 GTELECGEEYSTLP-AAVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASV 373
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 374 NQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPA 432
Query: 422 RYTSPMDGFYAYS--KVINYAPGCADIVCQNNSMIPAA 457
+ + G A + + YA G AD+V N PAA
Sbjct: 433 APVTVLQGIRAAAPNAQVLYARG-ADLVEGRND--PAA 467
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 147/298 (49%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++AD V V GL VE E G DR DL LP Q EL+ + K
Sbjct: 631 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 689
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 747
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ ++P GRTY++F G +YPFG+GLSYTQF Y D++LD
Sbjct: 748 ES------ETLPAFDDYTMHGRTYRYFGGTPLYPFGHGLSYTQFAYS--------DLRLD 793
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ D T + V+N G+ G EVV +Y P
Sbjct: 794 R------------------------STLTADGALTATVAVKNTGQRAGDEVVQLYLHPLK 829
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K++ G++R+ + G+ ++ FT+NA +L+I D + GA+ + +G
Sbjct: 830 PQRERAGKELRGFQRLALQPGEQRELRFTINATDALRIYDAQRKAYTVDPGAYEVQIG 887
>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 901
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 182/447 (40%), Positives = 249/447 (55%), Gaps = 35/447 (7%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D + + RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 33 PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 91 --------------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHER 136
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ G R
Sbjct: 137 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPKNAQGERY 196
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + DR HFD+ +++D+ ET++ FE V +G V +VM +YN
Sbjct: 197 RKLDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKDGKVDAVMGAYN 253
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R W F GY+VSDC +I I + HK + T+E A A +K G
Sbjct: 254 RVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 312
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
+L+CG+ Y+ AV+QG I EA IDT+L+ L MRLG FD G + + +
Sbjct: 313 TELECGEEYSTLP-AAVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSQIPASVN 371
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+P H LA AR+ +VLLKND G LPL+ +K +A++GP A+ T A++GNY GTP
Sbjct: 372 QSPAHDALARRTARESLVLLKND-GLLPLSRARLKRIAVIGPTADDTMALLGNYYGTPAA 430
Query: 423 YTSPMDGFYAYS--KVINYAPGCADIV 447
+ + G A + + YA G AD+V
Sbjct: 431 PVTVLQGIRAAAPNAQVLYARG-ADLV 456
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 151/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++A+ V V GL VE E G DR DL LP Q +L+ + K
Sbjct: 628 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALHATGK- 686
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 687 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y ++LD
Sbjct: 745 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 790
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ T+ T D T + V+N G+ G EVV +Y P
Sbjct: 791 RT--------TIAT----------------DGSLTATVTVKNTGQRAGDEVVQLYLHPLT 826
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K++ G++R+ + G+ ++GFT+NA +L++ D + + GA+ + +G
Sbjct: 827 PQRERAGKELHGFQRIALTPGEQRELGFTINAKDALRLYDEQRKAYVVDPGAYEVQIG 884
>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
usitatus Ellin6076]
gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 850
Score = 313 bits (801), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 178/462 (38%), Positives = 260/462 (56%), Gaps = 46/462 (9%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S P+ D L RA DLV RMTL EKV QM + A +PRLG+P Y+WW+EALHGV+
Sbjct: 22 SQLPFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRLGIPAYDWWNEALHGVARA 81
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG----- 124
G AT FP I A+++ +L +I +T+STEARA YN
Sbjct: 82 GL----------------ATVFPQAIGLAATWDATLMHRIAETISTEARAKYNEAIRNDD 125
Query: 125 ---NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLTFWSPNIN+ RDPRWGR ET GEDP++ R A+ +++G+Q D
Sbjct: 126 HSRYRGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQG---------ED 176
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
K+ A KHYA + + R FD + + +D+ +T++ F + E S+MC
Sbjct: 177 PHYYKVIATAKHYAVHSGPE---SSRHQFDVKPSPRDLADTYLPAFRASIVEARADSLMC 233
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+YNRV+GIP CA LL + +RG+W F G++VSDC ++ I H + D + +
Sbjct: 234 AYNRVDGIPACASTDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQPDAASASAV-AV 292
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGK 359
KAG DL CG+ Y + AV+ G I E +I+ SL L++ +LG FD + + N+
Sbjct: 293 KAGTDLTCGNEYRAL-VDAVKTGLITEPEINRSLERLFVARFKLGMFDPPERVPFSNIPY 351
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ + + H ++A EAAR+ IVLLKND G LPL + +IK +A++GP A+ +A++GNY G
Sbjct: 352 SEVDSAGHRKIALEAARKSIVLLKND-GTLPLKS-SIKKIAVIGPAADDAEALLGNYNGF 409
Query: 420 PCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAI 458
+P+ G +A + YA G A+ Q+ + +PA++
Sbjct: 410 SSLQVTPLAGIEHQWAGKAEVRYALG-ANYTAQSQAPLPASV 450
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 105/347 (30%), Positives = 160/347 (46%), Gaps = 71/347 (20%)
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVIN 438
I L + + A P TG + L L HA +IG P R G A ++V+
Sbjct: 535 IFLEERELTADPPPTGRGRPLLL---HAQ----LIGG-RAYPIRVEYSASGPAASAQVL- 585
Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLL 488
+AP A ++ AAI+A NAD T+ GL+ S+E E G DR +L
Sbjct: 586 WAPPDAPLLA-------AAIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLE 638
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
LP Q +LI + A A PV +V+ S AV +NFA + ++L Y GEE G AIAD
Sbjct: 639 LPEPQEKLI-EAAIATGKPVVVVLASGSAVAMNFAAQH--ASALLETWYNGEETGTAIAD 695
Query: 549 VIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYT 608
+ G NP GRLP+T+Y + P+ ++ GRTY++F+G +Y FG+GLSY+
Sbjct: 696 TLAGINNPSGRLPVTFYRSVDQLPPFEEYAMK------GRTYRYFNGDALYSFGFGLSYS 749
Query: 609 QFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVEN 668
+F+Y + T + ++ V+ N
Sbjct: 750 KFQYSA-----------------------LKTRRAGSGTIVASRVR-------------N 773
Query: 669 MGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
++G EVV +Y G G I+ + G++R+ + G+S +V F +
Sbjct: 774 ASSIEGDEVVQLYVNGSGADGDPIRSLRGFQRIHLRPGESREVHFPL 820
>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 913
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 181/447 (40%), Positives = 258/447 (57%), Gaps = 37/447 (8%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D + + +RA DLV RMTL EK QM + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------N 125
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 94 -------------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARY 140
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ EG + +++
Sbjct: 141 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGDAY 199
Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + + DR HFD+ +++D+ ET++ FE V EG V +VM +YN
Sbjct: 200 RKLDATAKHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYN 256
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R W F GY+VSDC +I I ++HK + T+E A A + G
Sbjct: 257 RVYGESASASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVA-TREQAAALAVNNG 315
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNI 362
+L+CG+ Y+ AV++G I+EAD+D +L+ L MRLG FD + ++ + +
Sbjct: 316 TELECGEEYSTLP-AAVRKGLISEADVDKALQKLMYSRMRLGMFDPPDTLRWAQIPLSAN 374
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+P+H LA AR+ +VLLKND G LPL+ G IK +A++GP A+ T A++GNY GTP
Sbjct: 375 QSPEHDALARRTARESLVLLKND-GVLPLSRGKIKRIAVIGPTADDTMALLGNYYGTPAA 433
Query: 423 YTSPMDGFY--AYSKVINYAPGCADIV 447
+ + G A + YA G AD+V
Sbjct: 434 PVTVLQGIREAAPDAEVLYARG-ADLV 459
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 148/298 (49%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+DAA+ AD V V GL VE E G DR DL LP Q EL+ + K
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQGTGK- 689
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+ADV+FG NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y D++LD
Sbjct: 748 ESEKLPAFDDYAMR------GRTYRYFAGTALYPFGHGLSYTQFAYS--------DLRLD 793
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ + D ++V+N G+ G EVV +Y P
Sbjct: 794 RSKL------------------------ATDGSLHATLKVKNTGQRAGDEVVQLYLHPLS 829
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K++ G++R+ + G++ +V F ++ L++ D A + + G + + VG
Sbjct: 830 PQRERARKELRGFQRIALQPGETREVSFAISPQTDLRLYDEARKAYVVDPGDYELQVG 887
>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 913
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 182/447 (40%), Positives = 257/447 (57%), Gaps = 37/447 (8%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D + + +RA DLV RMTL EK QM + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------N 125
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 94 -------------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARY 140
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ EG + +++
Sbjct: 141 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGEAY 199
Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + + DR HFD+ +++D+ ET++ FE V EG V +VM +YN
Sbjct: 200 RKLDATAKHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYN 256
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R W F GY+VSDC +I I ++HK + T+E+A A +K G
Sbjct: 257 RVYGESASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVA-TREEAAALAVKHG 315
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
+L+CG Y+ AV++G I+EAD+D +L+ L MRLG FD + + + +
Sbjct: 316 TELECGAEYSTLP-SAVRKGLISEADVDKALQKLMYSRMRLGMFDPPEKLAWAQIPLSAN 374
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+P+H LA AR+ +VLLKND G LPL+ IK +A+VGP A+ T A++GNY GTP
Sbjct: 375 QSPEHDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAA 433
Query: 423 YTSPMDGFY--AYSKVINYAPGCADIV 447
+ + G A + YA G AD+V
Sbjct: 434 PVTVLQGIREAAPDAEVLYARG-ADLV 459
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 141/282 (50%), Gaps = 52/282 (18%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+DAA+ AD V V GL VE E G DR DL LP Q L+ + K
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+ADV+FG NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y D++LD
Sbjct: 748 ESETLPAFDDYAMR------GRTYRYFAGTPLYPFGHGLSYTQFAYS--------DLRLD 793
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ + D + ++V+N G+ G EVV +Y +P
Sbjct: 794 RSKLA------------------------ADGRLHATLKVKNTGQRAGDEVVQLYLQPLS 829
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
K + G++R+ + G++ +V F ++ L++ D A
Sbjct: 830 PQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEA 871
>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 876
Score = 310 bits (793), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 187/459 (40%), Positives = 254/459 (55%), Gaps = 45/459 (9%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D + P+ RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 19 PYLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 76
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
GAT FP I A+F+ L ++ +S EARA ++ A
Sbjct: 77 --------------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKR 122
Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ +G Y
Sbjct: 123 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG-PYR------- 174
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + + DR HFD +E+D+ ET++ F+ V EG V++VM +YN
Sbjct: 175 -KLDATAKHFAVH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGKVAAVMGAYN 230
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RVNG A + L +R DW F GYIVSDC +I+ I ++HK + T E A A +K G
Sbjct: 231 RVNGESASASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHG 288
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
DLDCGD Y AV+ G I EA IDT+L+ L MRLG FD + + + +
Sbjct: 289 TDLDCGDTYAALP-AAVRAGLIDEATIDTALKRLMTTRMRLGMFDPPAKVPWAQIPASAN 347
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+PQH LA AR+ +VLLKND G LPL +K +A++GP A+ +++GNY GTP
Sbjct: 348 QSPQHDALARRTARESLVLLKND-GVLPLKP-TLKRIAVIGPTADDPMSLLGNYYGTPAA 405
Query: 423 YTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAID 459
+ + G A + YA G + + + A ID
Sbjct: 406 PVTILQGIRDAAPQAQVIYARGSDLVEGREDPNAAAPID 444
Score = 142 bits (357), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 160/335 (47%), Gaps = 60/335 (17%)
Query: 427 MDGFYAYSKVINYAPGCADIVCQNNSMIPAA-------IDAAKNADATVIVAGLDLSVEA 479
++G AY + Y D + +P A +DAA++A+ V V GL VE
Sbjct: 566 LEGGKAYDLRVEYYEATRDAGVRLAWRMPGAKPPLQEAVDAARDAEVVVFVGGLTGDVEG 625
Query: 480 E----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
E G DR D LP Q EL+ + A PV V+ + A+ I++A+ + +
Sbjct: 626 EEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGTPVVAVLTTGSALAIDWAQQH--V 682
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
+IL YPG+ GG A+ DV+FG+ +PGGRLP+T+Y+ + +R GRT
Sbjct: 683 PAILLAWYPGQRGGSAVGDVLFGQASPGGRLPVTFYKEAERLPAFDDYAMR------GRT 736
Query: 590 YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
Y++F G +YPFG+GLSYTQF Y D++LD+ TV
Sbjct: 737 YRYFQGKPLYPFGHGLSYTQFAYS--------DLRLDRT--------TV----------- 769
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQS 708
D T + ++N G+ G EVV +Y P +K++ G +R+ + G+
Sbjct: 770 -----AADGTLTATVTLKNTGQRAGDEVVQLYLHPLKPQRERALKELHGLQRITLQPGEQ 824
Query: 709 AKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
++ FT+ A +L+I D + + GA+ + +G
Sbjct: 825 RQLRFTIKAQDALRIYDEQRKAYAVDPGAYEVQIG 859
>gi|389794400|ref|ZP_10197553.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
gi|388432423|gb|EIL89432.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
Length = 902
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 176/418 (42%), Positives = 239/418 (57%), Gaps = 42/418 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D + +RA DLV MTL EK QM + A +PRLG+ Y+WW+E LHGV+ G+
Sbjct: 47 YRDLSRSFHDRAADLVAHMTLEEKAAQMQNTAPAIPRLGVAAYDWWNEGLHGVARAGQ-- 104
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-------- 125
AT FP I A+F+ L ++ +S EARA YN
Sbjct: 105 --------------ATVFPQAIGLAATFDVPLMHEVATAISDEARAKYNEFQRKGSHGRY 150
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT+WSPNIN+ RDPRWGR ET GEDPY+ R + +V GLQ + Y
Sbjct: 151 EGLTYWSPNINIFRDPRWGRGQETYGEDPYLTERMGVAFVTGLQG-DNPTYR-------- 201
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K+ A KH+A + + DR HFD +E+D+ ET++ F+ V E DV +VM +YNR
Sbjct: 202 KLDATAKHFAVH---SGPEADRHHFDVHPSERDLYETYLPAFQTLVQEADVDAVMSAYNR 258
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
VNG P P+LL Q +R DW F GY+VSDC +++ I + HK + DT E A A +K G+
Sbjct: 259 VNGEPATGSPRLLGQILRKDWGFKGYVVSDCGAVEDIYKHHKVV-DTVEAASALAVKNGV 317
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
DLDCG Y + AV G I E++ID +L L MRLG FD + + + ++ +
Sbjct: 318 DLDCGTEYAAL-VKAVHDGLIKESEIDAALTRLMQARMRLGMFDPASKVPWSDVPYSVNQ 376
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+PQH LA AAR+ +VLLKND G LPL+ +IK +A++GP A+ A++GNY GTP
Sbjct: 377 SPQHDALARRAARESMVLLKND-GVLPLSK-DIKHIAVIGPTADDVMALVGNYHGTPA 432
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 140/289 (48%), Gaps = 57/289 (19%)
Query: 468 VIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA 517
V GL VE E G DR DL LP Q +L+ + K PV LV+ S A
Sbjct: 642 VFAGGLTSDVEGEEMKVNYPGFAGGDRTDLRLPATQRKLLEALQATGK-PVVLVLTSGSA 700
Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
+ +++A N + ++L YPG+ GG A+ADV+FGK +P GRLP+T+Y+A+ +
Sbjct: 701 LAVDWA--NQHLPAVLLAWYPGQRGGNAVADVLFGKADPAGRLPVTFYKAS------EKL 752
Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
P GRTY++F G +YPFGYGLSYT+F Y D+KLD ++
Sbjct: 753 PAFDDYRMDGRTYRYFKGEPLYPFGYGLSYTKFTY--------ADLKLDHNK-------- 796
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI---KQ 694
+G N K ++V N GK G EVV +Y + G+ H K
Sbjct: 797 IGKND----------------KLHVTVKVHNAGKRAGDEVVQLYLR--GVGTPHERSNKD 838
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDN-AANSLLASGAHTILVG 742
+ G +R+ + GQ+ V F ++ L+ D A + +G + + +G
Sbjct: 839 LRGIQRITLQPGQTRDVSFDVSPATDLRYYDTKKAAYAVDAGRYEVQIG 887
>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 913
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 182/447 (40%), Positives = 257/447 (57%), Gaps = 37/447 (8%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D + + +RA DLV RMTL EK QM + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--------N 125
GAT FP I A+F+ L ++ +S EARA ++
Sbjct: 94 -------------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARY 140
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ E V+ +++
Sbjct: 141 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EDVDVPKNAQGEAY 199
Query: 186 -KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + + DR HFD+ +++D+ ET++ FE V EG V +VM +YN
Sbjct: 200 RKLDATAKHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYN 256
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R W F GY+VSDC +I I ++HK + T+E+A A +K G
Sbjct: 257 RVYGESASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVA-TREEAAALAVKHG 315
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
+L+CG Y+ AV++G I+EAD+D +L+ L MRLG FD + + + +
Sbjct: 316 TELECGAEYSTLPT-AVRKGLISEADVDNALQKLMYSRMRLGMFDPPEKLAWAQIPLSAN 374
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+P+H LA AR+ +VLLKND G LPL+ IK +A+VGP A+ T A++GNY GTP
Sbjct: 375 QSPEHDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAA 433
Query: 423 YTSPMDGFY--AYSKVINYAPGCADIV 447
+ + G A + YA G AD+V
Sbjct: 434 PVTVLQGIREAAPDAEVLYARG-ADLV 459
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 148/298 (49%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+DAA+ AD V V GL VE E G DR DL LP Q L+ + K
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+ADV+FG NPGGRLP+T+Y+
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y D++LD
Sbjct: 748 ESETLPAFDDYAMR------GRTYRYFAGTALYPFGHGLSYTQFAYS--------DLRLD 793
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ + D + ++V+N G+ G EVV +Y +P
Sbjct: 794 RSKLA------------------------ADGRLHATLKVKNTGQRAGDEVVQLYLQPLS 829
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K + G++R+ + G++ +V F ++ L++ D A + + G + + VG
Sbjct: 830 PQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKAYVVDPGDYELQVG 887
>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
Length = 859
Score = 308 bits (788), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 222/741 (29%), Positives = 358/741 (48%), Gaps = 93/741 (12%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+R DL+ MTL EK+ G + RLG+P +EW+ EALHG+
Sbjct: 34 DRVNDLLANMTLEEKISYCGSRIPEIKRLGIPYFEWYGEALHGIISWN------------ 81
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPR 142
T FP I A++N L + +S EARA+ N G + +SP +N+ RDPR
Sbjct: 82 -----CTQFPQNIAMGATWNPDLMFDVATAISNEARALKNAGKKEVMMFSPTVNMARDPR 136
Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
WGR E EDP+++ A YVRG+Q +D + +K KHY A +++
Sbjct: 137 WGRNGECYAEDPHLMSEMARMYVRGMQ---------GNDPKYVKTVTTVKHYVANNVE-- 185
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
R S + ++D+ E + ++ C+ + + + +M + N +NGIP A L+N +
Sbjct: 186 --TKREWIHSNIGKKDLYEYYFPAYKTCIVDEEATGIMTALNGLNGIPCSAHDWLVNGVL 243
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM---- 318
R +W F GY+++D ++Q + + K+ + + A + KAG+D +C + N
Sbjct: 244 RNEWGFKGYVIADWAAVQGLEKRMKYASSQAQAAAMAI-KAGVDQEC---FRNKVRQAPM 299
Query: 319 -----GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELA 371
A+QQG I E ++D +++ L + G FD Y + + + H +LA
Sbjct: 300 VQALPDALQQGLITEKELDVTVKRLLRLRFMTGDFDDPSLNPYSAIPTSVLECDAHKQLA 359
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA Q IVLLKND LPL ++K++A++GP A+ + +G Y G P SP+DG
Sbjct: 360 LKAAEQSIVLLKND-AVLPLKK-DLKSIAMIGPFAD--RCWMGIYSGHPKSKVSPLDGIK 415
Query: 432 AYSKV-INYAPGCADIVCQNNSM-IPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
AY+ +++A GC +++ I A+ AK ++ ++V G D + E DR + L
Sbjct: 416 AYTNAKVSFAQGCEVTAKEDDEQKIAEAVALAKKSEQVILVVGNDETTSTENTDRKSIKL 475
Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
PG Q +LI K A V LV++ +G + + + N I I+ G+E G A+A V
Sbjct: 476 PGNQHQLI-KAVQAVNKNVILVLVPSGPTAVTWEQKN--IPGIVCAWPNGQEQGTALAKV 532
Query: 550 IFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
+FG NPGG+L TWY+++ + + N RTY +F G +YPFGYGLSYT
Sbjct: 533 LFGDVNPGGKLNATWYQSDKDLPNFHDYKMAGGN----RTYMYFKGKPLYPFGYGLSYTN 588
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F D+ ++K ++ +Y T + +V N
Sbjct: 589 FTIS--------DVSINKKT-----------------------LQANEY-VTVKAKVNNT 616
Query: 670 GKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAA 728
G + G EVV VY + T +K + G++R+ +AAG S V + ++ +
Sbjct: 617 GAVAGDEVVQVYIRDVKSKEKTPLKALKGFQRISVAAGASKWVEIKI-PYEAFSHYNTKK 675
Query: 729 NSLL-ASGAHTILVGEGVGGV 748
+L+ A G ILVG +
Sbjct: 676 EALMVAKGEFEILVGNASDAI 696
>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
Length = 868
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 181/442 (40%), Positives = 246/442 (55%), Gaps = 45/442 (10%)
Query: 15 CDAKLP-YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
DA+ P RA LV +MTLPEKV QM + A +PRLG+P Y+WWSE LHG++ G
Sbjct: 22 VDARTPDAHSRAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGLHGIARNGY-- 79
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
AT FP I AS++ SL +G +STEARA +N +G
Sbjct: 80 --------------ATVFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRAHGLF 125
Query: 128 --LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
LT WSPNIN+ RDPRWGR ET GEDPY+ G+ A+ +VRG+Q D P
Sbjct: 126 QGLTLWSPNINIFRDPRWGRGQETYGEDPYLTGQLAVAFVRGIQG--------DDPQHPR 177
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
I A KH+ A+ G D F D V+ D+++T++ F V +G SVMC+YN
Sbjct: 178 AI-ATPKHFVAHSGPE-AGRDSFDVD--VSPHDLEDTYLPAFRTAVVDGHAGSVMCAYNA 233
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
++G P CA+ LL+ +R DW F GY+VSDCD++ I H F D + +VA V +AG
Sbjct: 234 LHGTPACANAGLLDTRLRKDWGFAGYVVSDCDAVGDIASYHYFKPDDVQASVAAV-QAGT 292
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNIC 363
DLDCG Y + AV+QG IAE+ +D SL L+ RLG G+ Y +G + I
Sbjct: 293 DLDCGHTYASLAQ-AVRQGDIAESALDASLVRLFTARYRLGELGSRGNDPYARIGADQID 351
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
+P H +LA +AA + +VLLKN + LPL+ G LA++GP A+A + + NY GT
Sbjct: 352 SPAHRKLALQAALESLVLLKNAHSTLPLHAG--MRLAVIGPDADALETLEANYHGTARHP 409
Query: 424 TSPMDGFYAY--SKVINYAPGC 443
+P+ G A + + YA G
Sbjct: 410 VTPLQGLRARFGADHVAYAQGA 431
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 102/298 (34%), Positives = 146/298 (48%), Gaps = 56/298 (18%)
Query: 462 KNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
+ADA V GL VE E G DR D+ LP Q L+ + A A+ P+ +V
Sbjct: 596 HDADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKPLIVV 654
Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK 571
++S AV +++A+ + +IL YPG+ GG AIA V+ G YNPGGRLP+T+Y +
Sbjct: 655 LLSGSAVALDWAQQH--ADAILAAWYPGQAGGTAIAQVLAGDYNPGGRLPVTFYRSTRDL 712
Query: 572 IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQC 631
PY S ++ GRTY++FDG +YPFGYGLSYT+F Y A + + +K Q
Sbjct: 713 PPYVSYAMQ------GRTYRYFDGRPLYPFGYGLSYTRFTY-AAPTLSAATLKAGGTLQV 765
Query: 632 RDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAG 689
EV N G+ G EVV VY + P +A
Sbjct: 766 -------------------------------SAEVRNAGQRAGDEVVQVYLDTPPSPLAP 794
Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGG 747
H ++G+ R+ +AAG+ V FT+ A + L VD A + G + + +G G G
Sbjct: 795 RH--ALVGFRRIHLAAGEQRLVRFTL-APRQLSSVDAAGARAVEPGQYRVFIGAGQPG 849
>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
Length = 893
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 177/452 (39%), Positives = 254/452 (56%), Gaps = 47/452 (10%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ DA L R DLV R+T EK+ QM + + RLG+P Y WW+E+LHGV+ G+
Sbjct: 43 YPFRDASLSVDARVDDLVSRLTTTEKIAQMFNDTPAIERLGIPAYNWWNESLHGVARAGK 102
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGN---- 125
AT +P I ++F+E L ++ ++S E RA Y+ L
Sbjct: 103 ----------------ATVYPQAIGLASTFDEDLMLRVATSISDEGRAKYHDFLSKDVRT 146
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLTFWSPNIN+ RDPRWGR ET GEDP++ GR AIN+V+G+Q + +S
Sbjct: 147 IYGGLTFWSPNINIFRDPRWGRGQETYGEDPFLTGRMAINFVKGIQG-------ENDNSD 199
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK A KHYA + + R D T +D+ ET++ F M + E +V S+MC+Y
Sbjct: 200 YLKAVATIKHYAVH---SGPEKTRHSDDYHPTRKDLFETYLPAFRMAIAETNVQSLMCAY 256
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-FLNDTKEDAVARVLK 302
NRV+G P C + +L+ + +RGD F+GY+VSDC +I ES + D+ +A A +K
Sbjct: 257 NRVDGAPACGNNELMQEILRGDMGFNGYVVSDCGAIADFYESRSHHVVDSPAEAAAWAVK 316
Query: 303 AGLDLDCGD----YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKN 356
+G DL+CGD YTN A+QQG I E ID +++ L+ ++LG FD + Y
Sbjct: 317 SGTDLNCGDSHGNTYTNLHY-ALQQGLITEDYIDIAVKRLFKARIKLGMFDEQDRVPYSE 375
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
+G + + +P+H+ L EAA + IVLLKN NG LPL G +A++GP+A ++GNY
Sbjct: 376 IGMDVVGSPKHLALTQEAAEKSIVLLKN-NGVLPLKAG--VKVAVIGPNAVDEDVLVGNY 432
Query: 417 EGTPCRYTSPMDGFYAYSKVIN--YAPGCADI 446
G P + P++G N YAPG A I
Sbjct: 433 HGVPVKPVLPLEGIVNRVGEANVFYAPGSAQI 464
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/302 (28%), Positives = 135/302 (44%), Gaps = 54/302 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAEGK----------DRVDLLLPGFQTELINKVADAAKG 506
A+ AA+ AD + + G+D +E E DR + LP QT L+ ++ K
Sbjct: 620 ALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQLKATGK- 678
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV +V S A+ +N+ + K+ +IL YPGE G A+A++++G +P GRLP+T+Y+
Sbjct: 679 PVVMVNFSGSAMALNW--ESEKLDAILQAFYPGEATGTALANILWGDVSPSGRLPVTFYK 736
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + RTYKF+ G +Y FG+GL Y F Y
Sbjct: 737 G------VDDLPAFNDYHMENRTYKFYRGEPLYAFGHGLGYVDFAYN------------- 777
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPP 685
N V A+ I + V N GKM +V VY S
Sbjct: 778 --------NLVVANTAEAGKALPI------------AVSVTNTGKMQAEDVAQVYISLLD 817
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
A T I+ + ++R +AAG+S ++ F + A + L +D+ + +G + VG G
Sbjct: 818 APANTPIRDLKAFKRTKLAAGESTELEFNLPA-RVLTYIDDNGKTQTYTGRVEVTVGSGQ 876
Query: 746 GG 747
G
Sbjct: 877 KG 878
>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 896
Score = 305 bits (780), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 189/460 (41%), Positives = 251/460 (54%), Gaps = 45/460 (9%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D P RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 39 PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG----- 127
GAT FP I A+F+ L ++ +S EARA ++ AG
Sbjct: 97 --------------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKR 142
Query: 128 ---LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
LTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ +G Y
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG-PYR------- 194
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KHYA + + DR HFD +E+D+ ET++ F+ V EG V++VM +YN
Sbjct: 195 -KLDATAKHYAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYN 250
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RVNG A + L +R DW F GYIVSDC +I+ I ++HK + T E A A +K G
Sbjct: 251 RVNGESASASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHG 308
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
DLDCGD Y AV+ G I EA ID SL L +RLG FD + + + +
Sbjct: 309 TDLDCGDTYAALP-AAVRAGLIDEATIDRSLTRLMAARLRLGMFDPPAKVPWAQIPASAN 367
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+PQH LA AR+ +VLLKND G LPL +K +A+VGP A+ +++GNY GTP
Sbjct: 368 QSPQHDALARRTARESLVLLKND-GLLPLKP-TLKRIAVVGPTADDPMSLLGNYYGTPAA 425
Query: 423 YTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
+ + G A + YA G + + + A IDA
Sbjct: 426 PVTILQGIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 149/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+DAA+NAD V V GL VE E G DR D LP Q EL+ + A
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGT 681
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+ DV+FG+ +PGGRLPIT+Y+
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++FDG +YPFG+GL+YTQF Y +++LD
Sbjct: 740 EDERLPAFDDYAMR------GRTYRYFDGKPLYPFGHGLAYTQFAYS--------NLRLD 785
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ TV D + V+N G+ G EVV +Y P
Sbjct: 786 RT--------TV----------------AADGTLRATVSVKNTGQRAGDEVVQLYLHPLN 821
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
K++ G++R+ + G+ +V F + ++L+I D + + GA+ + +G
Sbjct: 822 PQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYDEQRKAYAVDPGAYELQIG 879
>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
Length = 898
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 173/428 (40%), Positives = 239/428 (55%), Gaps = 42/428 (9%)
Query: 3 ESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEA 62
++I K + Y D + ERA DLV RMTL EKV QM + A +PRLG+P Y+WW+EA
Sbjct: 32 KTIAAKQTQPLYLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEA 91
Query: 63 LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
LHGV+ G AT FP I A+F+ +L +S EARA YN
Sbjct: 92 LHGVARAGE----------------ATVFPQAIGLAATFDPALLHHEATAISDEARAKYN 135
Query: 123 LGN--------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
GLTFWSPN N+ RDPRWGR ET GEDPY+ R + +VRGL+
Sbjct: 136 DFQRRGMRGRYEGLTFWSPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGLEG---- 191
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
D K+ A KH+A + + ++R FD +E+D+ ET++ F+ V +G
Sbjct: 192 -----DDPTYQKLDATAKHFAVH---SGPESERHRFDVHPSERDLHETYLPAFQALVQQG 243
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
V +VM +YNRV+G+P A +LL +R DW F GY+VSDCD++ I + HK + T E
Sbjct: 244 GVDAVMGAYNRVDGVPATASHRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVV-PTAE 302
Query: 295 DAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSP 352
A A + G DL+CG Y + AV G + E IDT++ L + RLG FD G
Sbjct: 303 QAAALAVNNGDDLNCGTTYATL-VKAVHDGLVNEHTIDTAVTRLMLARFRLGMFDPPGRV 361
Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
+ L + + +PQH LA A++ +VLLKND G LPL+ N++ +A++GP A+ A+
Sbjct: 362 PWSTLPMSVVQSPQHDALALRTAQESMVLLKND-GLLPLSH-NVRRIAVIGPTADNVTAL 419
Query: 413 IGNYEGTP 420
+GNY GTP
Sbjct: 420 LGNYHGTP 427
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 96/306 (31%), Positives = 155/306 (50%), Gaps = 57/306 (18%)
Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKV 500
S AA+DAA++AD + GL +E E G DR L LP Q +L+ +
Sbjct: 620 KSPFEAALDAARHADVVIFAGGLSSDLEGEEMPVDYPGFAGGDRTTLALPATQRKLLQAL 679
Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
K PV LV+ + A+ I++AK + + +IL YPG++GG A+AD +FG +P GRL
Sbjct: 680 QVTGK-PVVLVLTTGSALAIDWAKQH--LPAILLAWYPGQDGGHAVADALFGNVDPAGRL 736
Query: 561 PITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
P+T+Y++ P+ ++ GRTY++F G ++PFG+GLSYT+F Y
Sbjct: 737 PVTFYKSARQLPPFDDYAMK------GRTYRYFTGQPLFPFGFGLSYTRFAYS------- 783
Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
D++LD+D T+G + + + V+N G+ G EVV +
Sbjct: 784 -DLQLDRD--------TLGPSD----------------RMRISLRVKNTGQRAGDEVVQL 818
Query: 681 YSKPPGIAGTH---IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGA 736
Y +P + H IK + G++R+ + G+ V F ++ LK D A ++ +A G
Sbjct: 819 YLRP--LRAPHARAIKSLRGFQRISLKPGEERSVSFDISPQTDLKYYDVAHHAYAVAPGR 876
Query: 737 HTILVG 742
+ + VG
Sbjct: 877 YQVQVG 882
>gi|295135996|ref|YP_003586672.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294984011|gb|ADF54476.1| putative beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 796
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 225/730 (30%), Positives = 346/730 (47%), Gaps = 118/730 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL EA+HG +G T FPT I +++N L KK+
Sbjct: 126 RLGIPLL-LEEEAMHGHMAVG-----------------TTVFPTAIGQASTWNPDLIKKM 167
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
++ E RA T + P I++ R+PRW RV ET GEDPY++ + V G Q
Sbjct: 168 AHVIAKEIRA-----QGSNTAYGPIIDIAREPRWSRVEETFGEDPYLIAEMGKSMVTGFQ 222
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR-FHFDSRVTEQDMQETFILPFE 228
H ++A KH+AAY + N H R D+ + ++ P +
Sbjct: 223 G-----SHESDLKSNEHVAATLKHFAAYGVSEGGHNGAAVHIGQR----DLFQNYMYPVK 273
Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
V+ G V SVM +Y+ ++G+P+ A LL ++ W F G+++SD SI+ ++ H
Sbjct: 274 EAVDNG-VMSVMTAYSSIDGVPSTAHKNLLTNILKEKWGFKGFVISDLASIEGLLGDHHI 332
Query: 289 LNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
+ DT+EDA A + AG+D+D G + Y + + AV GK+AE ID ++R + V +LG
Sbjct: 333 V-DTEEDAAAMAMNAGVDVDLGGNGYDDALIDAVNAGKVAEERIDEAVRRILTVKFKLGL 391
Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
F+ + + + N +HIELA E ARQ I +LKN++ LPLN ++ +A++G +A+
Sbjct: 392 FENPYANEKQAEKIVRNSEHIELAREVARQSITMLKNEDNILPLNK-ELQNIAVIGSNAD 450
Query: 408 ATKAMIGNYEG--TPCRYTSPMDGFYAY--SKVINYAPGCADIVCQNNSMIPAAIDAAKN 463
+G+Y + + ++G + I Y G A + + IPAA++AAKN
Sbjct: 451 MQYNQLGDYTAPQSEENIITVLEGIQHKMPNANIEYVKGTA-VRDTTQTNIPAAVEAAKN 509
Query: 464 ADATVIVAG----LDLSVE----------------------AEGKDRVDLLLPGFQTELI 497
A+ ++V G D E EG DR L L G Q EL+
Sbjct: 510 AEVAIVVLGGSSARDFKTEYLETGAATISSKEDQVLSDMESGEGYDRSTLNLMGKQLELL 569
Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
V A P LV++ + +N+ N + IL YPG+EGG AIADVIFG +NP
Sbjct: 570 QAVV-ATGTPTVLVLIKGRPLLLNWPAEN--VPVILDAWYPGQEGGSAIADVIFGDFNPA 626
Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNNFPGRT-YKFFDGPVVYPFGYGLSYTQFKY---K 613
GRLP++ V +P+ FP R Y D +YPFGYGLSY++FKY K
Sbjct: 627 GRLPVS------VPKSLGQIPVYYNYWFPNRRDYVETDAKPLYPFGYGLSYSEFKYSDLK 680
Query: 614 VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMD 673
VA+S K ++ K +++ N K+D
Sbjct: 681 VATSGKG-----------------------------------RNTKIEISLKISNTSKVD 705
Query: 674 GSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL 732
G EV+ +Y + + +KQ+ +ERV I AG++ V F + K L + D +
Sbjct: 706 GDEVIQLYIRDMVSTVLSPVKQLRAFERVSIKAGETKTVQFEL-LPKELSLFDTEMKQKV 764
Query: 733 ASGAHTILVG 742
+G +++G
Sbjct: 765 QAGEFKLMIG 774
>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 701
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 232/762 (30%), Positives = 353/762 (46%), Gaps = 145/762 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPR-----LGLPLYEWWSEALHGV-S 67
+C+ LP R +DL+ R+ L EK + A PR +GLP Y W + +HGV S
Sbjct: 34 FCNTSLPVSARVEDLLARLPLDEKAILL--TARASPRGNMSSIGLPEYNWGANCVHGVRS 91
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG 127
G TN P TSFP + N S+ ++
Sbjct: 92 TCG--TNCP------------TSFPNPV------NLSIHRR------------------- 112
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
RDPRWGR ETP EDP V +Y + Y +GLQ EG + D R L+
Sbjct: 113 -----------RDPRWGRNTETPSEDPLVNSKYGVAYTKGLQ--EG----KHEDPRYLQA 155
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
KHY AY +N+ G +R F++ V+ D +T+ F + +G+ VMCSYN VN
Sbjct: 156 VVTLKHYVAYSYENYGGGNRKTFNAIVSPYDFADTYFPAFRSSIVDGNAKGVMCSYNSVN 215
Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
G+P CA+ +L N+ +RG F GYI SD +I+ I + ++ T+ +A + AG D+
Sbjct: 216 GVPACANNELENKLLRGMLGFDGYITSDSGAIEAISDWLHYV-PTRCEAARLAILAGTDV 274
Query: 308 DCGD--YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD---GSPQYKNLGKNNI 362
+ G Y V+ ++ +D LR + LG FD P +K + N++
Sbjct: 275 NSGRGFGYMACLKELVESNQLDVKVVDDVLRHTLKLRFELGLFDPIEDQPYWK-VTPNDV 333
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+L+ + AR+ IVLL+N+ LPL G LA+VGPHA A +A++GNY G C
Sbjct: 334 NTDAAKKLSLDLARKSIVLLQNNQPVLPLRRG--VKLAVVGPHAQAKRALLGNYLGQMCH 391
Query: 423 --------YTSPMDGFYAYS--KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
+P + A + YA GC ++ + + A+ A + A+A V+ G
Sbjct: 392 GDYNEVGCIKTPFEAVSASNGDSSTTYALGC-NVTGNSTAGFVEAVKAVQGAEAVVLFLG 450
Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
+D SVEAE +DR ++ LP Q +L+ +V K P +V+M+ G + + ++
Sbjct: 451 IDKSVEAEVRDRNNIDLPAIQVQLLQRVRAVGK-PTVVVLMNGGVLTAEDIIG--QTDAL 507
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
+ YPG G +A+ D++FG NPGG+LP+T Y ++YV M V +PGR+Y++
Sbjct: 508 VEAFYPGFFGAQAMTDILFGDANPGGKLPVTMYRSDYVNT--VDMKSMNVTAYPGRSYRY 565
Query: 593 FDGPVVYPFGYGLSYTQFKYK----VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
F G V+PFG+GLSYT F K A++ KSV + N T+
Sbjct: 566 FKGEPVFPFGWGLSYTSFSLKADDATATTAKSVSATM---------NTTI---------- 606
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-----PGIAGTHIKQVIGYERVFI 703
VV Y +P G A KQ+ Y RV +
Sbjct: 607 ---------------------------SVVFAYFRPIKTDASGPATLLNKQLFDYRRVTL 639
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
+S ++ F + +L +VD N + G++ I++ GV
Sbjct: 640 KPSESTRLSFEVQR-STLALVDEEGNLVSFPGSYDIIITNGV 680
>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 896
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 189/460 (41%), Positives = 250/460 (54%), Gaps = 45/460 (9%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D P RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 39 PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG----- 127
GAT FP I A+F+ L ++ +S EARA ++ AG
Sbjct: 97 --------------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKR 142
Query: 128 ---LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
LTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ +G Y
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG-PYR------- 194
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KHYA + + DR HFD +E+D+ ET++ F+ V EG V++VM +YN
Sbjct: 195 -KLDATAKHYAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYN 250
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RVNG A + L +R DW F GYIVSDC +I+ I ++HK + T E A A +K G
Sbjct: 251 RVNGESASASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHG 308
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
DLDCGD Y AV+ G I EA ID SL L +RLG FD + + +
Sbjct: 309 TDLDCGDTYAALP-AAVRAGLIDEATIDRSLTRLMAARLRLGMFDPPAKVPWAQTPASAN 367
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+PQH LA AR+ +VLLKND G LPL +K +A+VGP A+ +++GNY GTP
Sbjct: 368 QSPQHDALARRTARESLVLLKND-GLLPLKP-TLKRIAVVGPTADDPMSLLGNYYGTPAA 425
Query: 423 YTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
+ + G A + YA G + + + A IDA
Sbjct: 426 PVTILQGIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 149/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+DAA+NAD V V GL VE E G DR D LP Q EL+ + A
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGT 681
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+ DV+FG+ +PGGRLPIT+Y+
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++FDG +YPFG+GL+YTQF Y +++LD
Sbjct: 740 EDERLPAFDDYAMR------GRTYRYFDGKPLYPFGHGLAYTQFAYS--------NLRLD 785
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ TV D + V+N G+ G EVV +Y P
Sbjct: 786 RT--------TV----------------AADGTLRATVSVKNTGQRAGDEVVQLYLHPLN 821
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
K++ G++R+ + G+ +V F + ++L+I D + + GA+ + +G
Sbjct: 822 PQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYDEQRKAYAVDPGAYELQIG 879
>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
Length = 890
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 178/444 (40%), Positives = 248/444 (55%), Gaps = 46/444 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D LP+ ERA DLV +MTL EKV QM A + RLG+P Y WW+E LHGV G
Sbjct: 40 YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGVGRAGI-- 97
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN---- 125
AT FP I A +++ +I VS EARA ++ G
Sbjct: 98 --------------ATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIY 143
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFW+PNIN+ RDPRWGR +ET GEDP++ G A++Y++GLQ D R L
Sbjct: 144 QGLTFWTPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQ---------GDDDRYL 194
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K+ A KH+ + DR HFD+R + +D T+ F+ + E V SVMC+YNR
Sbjct: 195 KLVATSKHFLVHSGPE---PDRHHFDARTSARDSLMTYTPHFKKTIQEAGVYSVMCAYNR 251
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES-HKFLNDTKEDAVARVLKAG 304
NG+P C K + +R +W F GYIVSDC ++ + H + T E+A A +KAG
Sbjct: 252 YNGLPCCGS-KPVENLLRNEWGFKGYIVSDCWAVADFYKKGHHEVVPTVEEAAAMAVKAG 310
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNN 361
DL+CG+ Y + AV+QG ++E +ID ++ L +RLG FD P+ Y N+ +
Sbjct: 311 TDLNCGNSYPAL-VDAVKQGLVSEEEIDVLVKRLMEARLRLGMFD-PPEMVPYTNIPYSV 368
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ + +H ELA AAR+ +VLLKNDN LPL+ N+K +A++GP+AN ++ NY G P
Sbjct: 369 VDSKEHRELALIAARKSMVLLKNDNNTLPLDK-NVKNVAVIGPNANNLDVLLANYNGYPS 427
Query: 422 RYTSPMDGFYAY--SKVINYAPGC 443
+P+DG + + YA GC
Sbjct: 428 NPVTPLDGIRQKLPNANVQYALGC 451
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 92/269 (34%), Positives = 137/269 (50%), Gaps = 52/269 (19%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
AI A +D ++ GL ++E E G DRVD+ LP QT+L+ + K
Sbjct: 610 AIQIAAASDVVLMFMGLSPNLEGEEMPVNVPGFSGGDRVDIKLPQIQTDLVKAIMSLGK- 668
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV LV+++ A+ IN+ N + +IL YPG+ GG AIADV+FG YNP GRLP+T+Y+
Sbjct: 669 PVVLVLLNGSALAINWEAEN--VPAILEAWYPGQAGGTAIADVLFGDYNPAGRLPVTFYK 726
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ T +P + GRTY++F G ++PFGYGLSYT FKY P D
Sbjct: 727 S------VTQLPPFEDYSMDGRTYQYFKGEALFPFGYGLSYTSFKYDNLVVP-------D 773
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K + +++ T ++V N G DG EVV +Y P
Sbjct: 774 KLEAGKEV--------------------------TVHVDVTNTGNRDGDEVVQLYVSHPD 807
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
+ I+ + G++R+ + AG++ V FT+
Sbjct: 808 VESAPIRSLQGFDRIALKAGETKTVSFTL 836
>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
Length = 896
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 189/460 (41%), Positives = 249/460 (54%), Gaps = 45/460 (9%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY D P RA DLV RMTL EK QM + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 39 PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
GAT FP I A+F+ L ++ +S EARA ++ A
Sbjct: 97 --------------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEHKR 142
Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R + +V+GLQ +G Y
Sbjct: 143 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQG-PYR------- 194
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KHYA + DR HFD +E+D+ ET++ F+ V EG V++VM +YN
Sbjct: 195 -KLDATAKHYAVHSGPE---ADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYN 250
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RVNG A + L +R DW F GYIVSDC +I+ I ++HK + T E A A +K G
Sbjct: 251 RVNGESASASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHG 308
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
DLDCGD Y AV+ G I EA ID SL L +RLG FD + + +
Sbjct: 309 TDLDCGDTYAALP-AAVRAGLIDEATIDRSLTRLMAARLRLGMFDPPAKVPWAQTPASAN 367
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+PQH LA AR+ +VLLKND G LPL +K +A+VGP A+ +++GNY GTP
Sbjct: 368 QSPQHDALARRTARESLVLLKND-GLLPLKP-TLKRIAVVGPTADDPMSLLGNYYGTPAA 425
Query: 423 YTSPMDGFY--AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
+ + G A + YA G + + + A IDA
Sbjct: 426 PVTILQGIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 150/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+DAA+NAD V V GL VE E G DR D LP Q EL+ + A
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQAL-QATGT 681
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+ DV+FG+ +PGGRLPIT+Y+
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++FDG +YPFG+GL+YTQF Y +++LD
Sbjct: 740 EDERLPAFDDYAMR------GRTYRYFDGKPLYPFGHGLAYTQFAYS--------NLRLD 785
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ TV D + V+N G+ G EVV +Y P
Sbjct: 786 RT--------TV----------------AADGTLRATVWVKNTGQRAGDEVVQLYLHPLN 821
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
K++ G++R+ + G+ +V FT+ ++L+I D + + GA+ + +G
Sbjct: 822 PQRERARKELRGFQRITLQPGEHREVSFTITPREALRIYDEQRKAYAVDPGAYELQIG 879
>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 850
Score = 300 bits (769), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 176/429 (41%), Positives = 244/429 (56%), Gaps = 37/429 (8%)
Query: 32 MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSF 91
MTL EK QM + A +PRLG+P Y+WW+EALHGV+ G GAT F
Sbjct: 1 MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------------GATVF 44
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYN--------LGNAGLTFWSPNINVVRDPRW 143
P I A+F+ L ++ +S EARA ++ GLTFWSPNIN+ RDPRW
Sbjct: 45 PQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRW 104
Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL-KISACCKHYAAYDLDNW 202
GR ET GEDP++ R + +V+GLQ EG + +++ P K+ A KH+A +
Sbjct: 105 GRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGEPYRKLDATAKHFAVHSGPE- 162
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
DR HFD+R +++D+ ET++ FE V +G V +VM +YNRV G A LL +
Sbjct: 163 --ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQDVL 220
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQ 322
R W F GY+VSDC +I I + HK + T+E A A +K G +L+CG+ Y+ AV+
Sbjct: 221 RQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECGEEYSTLP-AAVR 278
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
QG I EA IDT+L L MRLG FD G + + + +P H LA AR+ +V
Sbjct: 279 QGLIDEAQIDTALTTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHDALARRTARESLV 338
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS--KVIN 438
LLKND G LPL+ +K +A++GP A+ T A++GNY GTP + + G A + +
Sbjct: 339 LLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVL 397
Query: 439 YAPGCADIV 447
YA G AD+V
Sbjct: 398 YARG-ADLV 405
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 94/298 (31%), Positives = 145/298 (48%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+D A++AD V V GL VE E G DR DL LP Q +L+ + K
Sbjct: 577 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 635
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV V+ + A+ I++A+ + + +IL YPG+ GG A+AD +FG NPGGRLP+T+Y+
Sbjct: 636 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 693
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ + +R GRTY++F G +YPFG+GLSYTQF Y ++LD
Sbjct: 694 ESETLPAFDDYAMR------GRTYRYFGGTPLYPFGHGLSYTQFAYS--------GLRLD 739
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ D T + V+N G+ G EVV +Y P
Sbjct: 740 R------------------------TTIAADGSLTATVTVKNTGQRAGDEVVQLYLHPLT 775
Query: 687 IAGTHI-KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
K++ G++R+ + G+ + FT++A +L+I D + GA+ + +G
Sbjct: 776 PQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRIYDAQRKAYAVDPGAYEVQIG 833
>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
Length = 614
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 205/631 (32%), Positives = 315/631 (49%), Gaps = 65/631 (10%)
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
SPNIN+ RDPRWGR E P EDP + G + Y GLQ E DSR K+
Sbjct: 11 SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQGE--------DSRYTKVVVTL 62
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+ AY L++ +G R +FD++V+ + +T+ F V EG+ VMCSYN +NG PT
Sbjct: 63 KHWDAYSLEDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVMEGNAKGVMCSYNALNGRPT 122
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
C P LL + +R W F GY+ SD +I+ I H + + A + D+D G
Sbjct: 123 CTHP-LLTKVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASAAVAAALRDGRCDMDSGA 181
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIE 369
Y + + AV G+ + D+D +L + LG FD Y + ++I +
Sbjct: 182 VYHDALLDAVNSGECSMDDVDRALYNTLKLRFELGLFDPIEDQPYWRINASSINTTYAQD 241
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR------Y 423
L + + ++LL+N N ALP G + +A++GPH NA +A++GNY G C
Sbjct: 242 LNMKITLESMILLQNHNNALPFKKG--RKVAVIGPHINAQEALVGNYLGQLCPDDSFDCI 299
Query: 424 TSPMDGFYAYSKVINY--APGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
TSP+ A + + N A G + C + S I A++ AK+AD V++ G++ ++EAE
Sbjct: 300 TSPLAAIEAINGMSNTVSAMGSGVLACTDAS-IQEAVNVAKDADYVVLLIGINDTIEAES 358
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR + LP Q +L +A K V+++ G + I K ++ +I+ GYPG
Sbjct: 359 NDRTSIDLPQCQHKLTAAIAHLNK-TTAAVLINGGMLAIEQEKK--QLPAIIEAGYPGFY 415
Query: 542 GGRAIADVIFGKYNP-GGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
GG AIA IFG N GG+LP T Y A+Y+ KI + M + N PGR+Y+++ G ++
Sbjct: 416 GGAAIAKTIFGDNNHLGGKLPYTVYPADYIHKINMSDMEM---TNSPGRSYRYYTGQPLW 472
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFG+GL+YT F + S + G+N
Sbjct: 473 PFGFGLAYTTFSVQSPGPSAST--------------FATGSNT----------------S 502
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKP---PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
F+ + V N GK G VV VY P P + + KQ+I +ERV + Q V ++
Sbjct: 503 FSLPVHVVNTGKRTGDTVVQVYMAPVSLPHRSFSLKKQLIAFERVHLTPNQRLGVTIPLS 562
Query: 717 ACKSLKIVDNAANSLLAS-GAHTILVGEGVG 746
A +VD +++++ G++ ++V +GV
Sbjct: 563 A-DVFNMVDPVTGNVVSTPGSYRLVVSDGVA 592
>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
Length = 860
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 170/462 (36%), Positives = 259/462 (56%), Gaps = 52/462 (11%)
Query: 2 FESIKVKLSDFP-YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
F S+ + + P Y + LP+ ERA+DL++R++L EK+ M + + RLG+P Y WW+
Sbjct: 10 FLSVNLFAQNIPGYLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWN 69
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGV+ GR AT FP I A+++ L +I +S EARA
Sbjct: 70 EALHGVARNGR----------------ATVFPMPIGLAATWDRDLIYRIADVISNEARAK 113
Query: 121 YNLG--------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE 172
YN G++ W+PNIN+ RDPRWGR +ET GEDPY+ G A+++++GLQ
Sbjct: 114 YNSALKKNQRGIYQGISLWAPNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQG-- 171
Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
D + LK A KH A + E R HF++ V+ D+ ET++ F+ +
Sbjct: 172 -------QDKKYLKTIATPKHLAVHSGPEPE---RHHFNALVSNYDLNETYLPHFKKSIM 221
Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDT 292
+G SVMC+YNR+ G C LL +R W F G +VSDC ++ I SHK + D+
Sbjct: 222 KGKAYSVMCAYNRLRGKACCGHDTLLTDILRNKWGFEGIVVSDCWAVYDIFNSHKIV-DS 280
Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
E A A + +G DL+CG+ + + A + G I E +ID++LR + + +LG FD P
Sbjct: 281 PEKAAALAVSSGTDLECGNTFLSLK-NAYRDGLITEKEIDSALRRVLLARFKLGMFD-PP 338
Query: 353 Q---YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
+ Y + ++ + N + E+A EAAR+ IVLLKNDN LPL++ +I +A++GP+A+
Sbjct: 339 EIVSYSQIDESYLDNSYNREIALEAARKSIVLLKNDNKLLPLDS-SINKIAVIGPNADNL 397
Query: 410 KAMIGNYEGTPCRYTSPM--------DGFYAYSKVINYAPGC 443
++++GNY G P Y +P+ +G Y K ++APG
Sbjct: 398 ESLLGNYHGFPSEYITPLQAIRRVLKNGEVFYEKGCDFAPGV 439
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 140/296 (47%), Gaps = 53/296 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A A +DA ++ GL +E E G DR+ L LP Q +LI K+ K
Sbjct: 591 AYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKIHSTGK- 649
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV LV+++ G + + N I +IL YPG+ GGRAI DVI+GKYNP G+LP+T Y+
Sbjct: 650 PVILVLLNGGPISTVWESEN--IPAILEAWYPGQAGGRAITDVIWGKYNPSGKLPVTIYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ P+ + + GRTY++F G V+YPFG+GL+YT DI +
Sbjct: 708 SENDLPPFENYDME------GRTYRYFKGEVLYPFGWGLNYT-------------DITIS 748
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
N + N ++K D ++++N G + G E V +Y+K
Sbjct: 749 --------NIELSAN----------EIKDND-TIRVVVKLKNNGNLAGEETVQLYTKALK 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
T IK + G+E++ + G V F ++ VD + G + I+VG
Sbjct: 790 DNRT-IKTLRGFEKIKLEPGTEGMVEFYLSKSDLAVWVDGLGFETMP-GVYEIIVG 843
>gi|296081549|emb|CBI20072.3| unnamed protein product [Vitis vinifera]
Length = 333
Score = 298 bits (762), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 156/334 (46%), Positives = 209/334 (62%), Gaps = 14/334 (4%)
Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
MIGNYEGTP +YT+P+ G A Y PGC+++ C + I A A ADATV++
Sbjct: 1 MIGNYEGTPGKYTTPLQGLTALVAT-TYLPGCSNVAC-GTAQIDEAKKIAAAADATVLIV 58
Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
G+D S+EAEG+DRV++ LPG Q LI +VA A+KG V LV+MS G DI+FAKN+ KI S
Sbjct: 59 GIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITS 118
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGR 588
ILWVGYPGE GG AIADVIFG YNP GRLP TWY +YV K+P T+M +R P + +PGR
Sbjct: 119 ILWVGYPGEAGGAAIADVIFGFYNPSGRLPTTWYPQSYVDKVPMTNMNMRPDPASGYPGR 178
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
TY+F+ G +Y FG GLSYTQF + + +PKSV I +++ C + C +V
Sbjct: 179 TYRFYTGETIYTFGDGLSYTQFNHHLIQAPKSVSIPIEEGHSC---------HSSKCKSV 229
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
C++ F + V N G + GS V ++S PP + + K ++G+E+VF+ A
Sbjct: 230 DAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSVHNSPQKHLLGFEKVFVTAKAE 289
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
A V F ++ CK L IVD +A G H + VG
Sbjct: 290 ALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVG 323
>gi|147826476|emb|CAN72807.1| hypothetical protein VITISV_033721 [Vitis vinifera]
Length = 236
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 138/215 (64%), Positives = 164/215 (76%), Gaps = 6/215 (2%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
R+ + + + F +CD L Y ERAKDLV RMTL EKV Q A GV RLGLP Y WWS
Sbjct: 23 RYALLGLDMKSFAFCDKSLSYEERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWS 82
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHG+S +G PG FD +PGATSFPTVIL+TA+FN++LWK +G+ VSTE RAM
Sbjct: 83 EALHGISNLG------PGVFFDETIPGATSFPTVILSTAAFNQTLWKTLGRVVSTEGRAM 136
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
YNLG+AGLTFWSPNINVVRD RWGR ET GEDP++VG +A+NYVRGLQDVEG E D
Sbjct: 137 YNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTDL 196
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVT 215
+SRPLK+S+CCKHYAAYD+D+W DR FD+RV+
Sbjct: 197 NSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVS 231
>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 889
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 175/446 (39%), Positives = 243/446 (54%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA DLV M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 39 QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N AGLT WSP
Sbjct: 88 -----ATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSP 142
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++ GLQ D P I A KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQG--------DDLDHPRTI-ATPKH 193
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + + R FD V+ +D++ T+ F + EG +VMC+YN ++G P CA
Sbjct: 194 LAVH---SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACA 250
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
L+N +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 251 ADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 309
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N QH LA
Sbjct: 310 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALA 368
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKN+ LPLN G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 369 LQAAAESIVLLKNNANTLPLNAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 426
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 427 QRFGAQQVSYAQG-APLAAGVPGMIP 451
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 138/280 (49%), Gaps = 49/280 (17%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 741
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT+F Y Q G+
Sbjct: 742 GRTYRYFKGEPLFPFGYGLSYTRFAYDAP--------------QLSSTAVQAGS------ 781
Query: 647 AVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
T Q+ V N G G EV VY + P + ++ ++G++RV +A
Sbjct: 782 --------------TLQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 827
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
AG+ + F ++A ++L VD + + +G +T+ VG G
Sbjct: 828 AGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866
>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
Length = 754
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 216/691 (31%), Positives = 348/691 (50%), Gaps = 106/691 (15%)
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
+T+FP I +++N L + T+ ++ R + G+ SP ++V RDPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI------GVNQCLSPVLDVCRDPRWGRC 154
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
ET GEDPY+V + Y+ GLQ ++ A KH+AA+ N
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG-------------ETQLVATAKHFAAHGFPEGGRNI 201
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
+ H +R +++ETF+ PFE+ V G V S+M +Y+ ++G+P +P+LL +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQE 257
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
W F G +VSD D I+ + HK ++ E A+ L++G+D++ D Y + A+++
Sbjct: 258 WGFDGIVVSDYDGIRQLEAIHKVASNKMEAAIL-ALESGVDIEFPTIDCYGEPLVTAIKE 316
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
G ++EA ID ++ + + RLG D ++ + + + ELA +AAR+ IVLLK
Sbjct: 317 GLVSEAIIDRAVERVLRIKERLGLLDNPFVDESAVPERLDDRKSRELALKAARESIVLLK 376
Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
N+N LPL+ NI +A++GP+AN + M+G+Y T + + G
Sbjct: 377 NENNMLPLSK-NINKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKKVGE 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
KV+ YA GC DI ++ AI+ AK AD + V +GL LS
Sbjct: 436 GKVL-YAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFKKY 493
Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
V EG DR L L G Q EL+ ++ K P+ LV+++ + ++ N +K+I+
Sbjct: 494 QAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSPIINY--VKAIIE 550
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGEEGG AIAD+IFG YNP GRLPIT+ + + + Y+ P ++F R Y
Sbjct: 551 AWFPGEEGGNAIADIIFGDYNPSGRLPITFPMDTGQIPLYYSRKP----SSF--RPYVML 604
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
++ FGYGLSYTQF+Y + +PK V P + +
Sbjct: 605 HSSPLFTFGYGLSYTQFEYSNLEVTPKEVG---------------------PLSYI---- 639
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
T ++V+N+G M+G EVV +Y SK +K++ G+ +V + G+ +V
Sbjct: 640 --------TILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLKPGEKRRV 691
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + ++L DN ++ G + IL+G
Sbjct: 692 KFAL-PMEALAFYDNFMRLVVEKGEYQILIG 721
>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
Length = 754
Score = 295 bits (756), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 216/691 (31%), Positives = 348/691 (50%), Gaps = 106/691 (15%)
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
+T+FP I +++N L + T+ ++ R + G+ SP ++V RDPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI------GVNQCLSPVLDVCRDPRWGRC 154
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
ET GEDPY+V + Y+ GLQ ++ A KH+AA+ N
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG-------------ETQLVATAKHFAAHGFPEGGRNI 201
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
+ H +R +++ETF+ PFE+ V G V S+M +Y+ ++G+P +P+LL +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQE 257
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
W F G +VSD D I+ + HK ++ E A+ L++G+D++ D Y + A+++
Sbjct: 258 WGFDGIVVSDYDGIRQLEAIHKVASNKMEAAIL-ALESGVDIEFPTIDCYGEPLVTAIKE 316
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
G ++EA ID ++ + + RLG D ++ + + + ELA +AAR+ IVLLK
Sbjct: 317 GLVSEAIIDRAVERVLRIKERLGLLDNPFVDESAVPERLDDRKSRELALKAARESIVLLK 376
Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
N+N LPL+ NI +A++GP+AN + M+G+Y T + + G
Sbjct: 377 NENNMLPLSK-NINKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKKVGE 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
KV+ YA GC DI ++ AI+ AK AD + V +GL LS
Sbjct: 436 GKVL-YAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFKKY 493
Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
V EG DR L L G Q EL+ ++ K P+ LV+++ + ++ N +K+I+
Sbjct: 494 QAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLINGRPLVLSPIINY--VKAIIE 550
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGEEGG AIAD+IFG YNP GRLPIT+ + + + Y+ P ++F R Y
Sbjct: 551 AWFPGEEGGNAIADIIFGDYNPSGRLPITFPMDTGQIPLYYSRKP----SSF--RPYVML 604
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
++ FGYGLSYTQF+Y + +PK V P + +
Sbjct: 605 HSSPLFTFGYGLSYTQFEYSNLEVTPKEVG---------------------PLSYI---- 639
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
T ++V+N+G M+G EVV +Y SK +K++ G+ +V + G+ +V
Sbjct: 640 --------TILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLKPGEKRRV 691
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + ++L DN ++ G + IL+G
Sbjct: 692 KFAL-PMEALAFYDNFMRLVVEKGEYQILIG 721
>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 889
Score = 295 bits (756), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 176/446 (39%), Positives = 247/446 (55%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA DLV M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 39 QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL----GN-----AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N GN AGLT WSP
Sbjct: 88 -----ATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGNDHKRYAGLTIWSP 142
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++ GLQ E +++ R A KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQG-EDLDHPR--------TIATPKH 193
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + + R FD V+ +D++ T+ F + EG +VMC+YN ++G P CA
Sbjct: 194 LAVH---SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACA 250
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
L+N +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 251 ADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 309
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N QH LA
Sbjct: 310 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALA 368
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKN+ LPLN G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 369 LQAAAESIVLLKNNANTLPLNAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 426
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 427 QRFGAQQVSYAQG-APLAAGVPGMIP 451
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 138/280 (49%), Gaps = 49/280 (17%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 741
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT+F Y Q G+
Sbjct: 742 GRTYRYFKGEPLFPFGYGLSYTRFAYDAP--------------QLSSTAVQAGS------ 781
Query: 647 AVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
T Q+ V N G G EV VY + P + ++ ++G++RV +A
Sbjct: 782 --------------TLQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 827
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
AG+ + F ++A ++L VD + + +G +T+ VG G
Sbjct: 828 AGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866
>gi|431798021|ref|YP_007224925.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
gi|430788786|gb|AGA78915.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
Length = 906
Score = 295 bits (756), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 171/431 (39%), Positives = 242/431 (56%), Gaps = 44/431 (10%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
DF + D + + ER LV++M+L EKV QM + + +PRL +P Y WW+E LHGV+ G
Sbjct: 49 DFSFLDMEKNFEERVDILVDQMSLEEKVSQMMNASPAIPRLKVPEYNWWNECLHGVARAG 108
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNA-- 126
AT FP I ASF+++L K IG +S EARA ++ + N
Sbjct: 109 Y----------------ATVFPQSISVAASFDKNLMKDIGSVISDEARAKHHEFIRNGKR 152
Query: 127 ----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
GL FWSPNIN+ RDPRWGR ET GEDPY+ G A ++ GLQD SD
Sbjct: 153 GIYTGLDFWSPNINIFRDPRWGRGHETYGEDPYLTGELASQFIEGLQD---------SDG 203
Query: 183 RPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVM 240
+ LK A KH+A + G + R FD V+++D+ ET++ F V E V S+M
Sbjct: 204 KYLKTIATSKHFAVHS-----GPEPLRHTFDVDVSDRDLYETYLPAFRKTVKEAKVYSIM 258
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
+YNR G LLNQ +R W F GY+VSDC +IQ I HK + E A V
Sbjct: 259 GAYNRFRGESCSGHDFLLNQLLREQWGFEGYVVSDCGAIQDIHTGHKIASTAAEAAAIGV 318
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLG 358
G DL+CG+YYT+ T AV +G I+E +ID +++ L++ RLG FD + Y +
Sbjct: 319 -SGGCDLNCGNYYTHLTE-AVAEGLISEEEIDIAVKRLFLARFRLGMFDPEEAVSYAQIP 376
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+C+ H LA +AA++ +VLLKN LPL+ IK +A++GP+A+ ++++GNY G
Sbjct: 377 FGIVCSEAHNTLARQAAQKSMVLLKNQKNLLPLSVDKIKRIAVIGPNADNVESLLGNYHG 436
Query: 419 TPCRYTSPMDG 429
P + + +DG
Sbjct: 437 IPKKPVTFLDG 447
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 156/313 (49%), Gaps = 54/313 (17%)
Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAEGKD----------RVDLLLPGFQTELINKVA 501
S I A+ AK+AD V+V GL +E E D R + LP Q L+ V
Sbjct: 615 SKIDEAVAMAKSADLAVVVLGLSQRLEGESMDVVTPGFDRGDRTAITLPAQQEALLKAVK 674
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
+ K PV LV+ + A+ IN+AK N + +I+ GYPGEEGG A+ADV+FG YNP GRLP
Sbjct: 675 ETGK-PVILVLNAGSAMAINWAKEN--VDAIISAGYPGEEGGNALADVVFGDYNPAGRLP 731
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
IT+Y++ P+ ++ GRTY++F+G +YPFGYGLSYT+F YK P V
Sbjct: 732 ITYYQSVEDLPPFEDYDMK------GRTYRYFEGKPLYPFGYGLSYTRFSYKDLEVPAKV 785
Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
+ D V+ + V N+G G EVV +Y
Sbjct: 786 NAG--------------------------DPVQI-------SVTVTNIGSRAGDEVVQLY 812
Query: 682 SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
+ I+Q+ G++R+ + G+S V FT++A + L +++ + ++ G +I
Sbjct: 813 LNDKEASTMRPIRQLEGFQRIHLKPGESKVVNFTLSA-RQLSMINGESKRVIEEGVFSIH 871
Query: 741 VGEGVGGVSFPLQ 753
VG G LQ
Sbjct: 872 VGGEQPGFDGKLQ 884
>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 886
Score = 295 bits (756), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 176/446 (39%), Positives = 244/446 (54%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N SL +++G VSTEARA +N AGLT WSP
Sbjct: 85 -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ E +++ R A KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG-EDLDHPR--------TIATPKH 190
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 247
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
+ A+++G + EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 307 RDLGT-AIERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 366 LQAAAESIVLLKNDANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 90/282 (31%), Positives = 138/282 (48%), Gaps = 45/282 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 686
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 687 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 738
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT+F Y D Q T+ P
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS---TTTLQAGNP--- 779
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV +AAG
Sbjct: 780 -------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 827 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 867
>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 886
Score = 295 bits (755), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 174/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGH----------- 84
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N +L +++G VSTEARA +N AGLT WSP
Sbjct: 85 -----ATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 139
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + A KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ D++ T+ F + +G SVMC+YN ++G P CA
Sbjct: 191 LAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACA 247
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+++G + EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 307 RELGT-AIERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPLN G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 366 LQAAAESIVLLKNDANTLPLNAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ + YA G A + MIP
Sbjct: 424 QRFGAQQVRYAQG-APLAAGVPGMIP 448
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 95/296 (32%), Positives = 141/296 (47%), Gaps = 52/296 (17%)
Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
+DA V GL VE E G DR D+ LP Q L+ + A A+ P+ +V+
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 672
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
MS AV +N+AK + W YPG+ GG AIA ++ G NPGGRLP+T+Y +
Sbjct: 673 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLP 730
Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
Y S ++ GRTY++F G ++PFGYGLSYT+F Y D Q
Sbjct: 731 AYVSYDMK------GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS- 770
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
+ T+ P V N G G EV VY + P + +
Sbjct: 771 --STTLQAGNP----------------LQVTTTVRNTGTHAGDEVAQVYLQYPDRPQSPL 812
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ ++G++RV +AAG+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 813 RSLVGFQRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 867
>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 886
Score = 295 bits (755), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 176/446 (39%), Positives = 244/446 (54%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N SL +++G VSTEARA +N AGLT WSP
Sbjct: 85 -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ E +++ R A KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG-EDLDHPR--------TIATPKH 190
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 247
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
+ A+++G + EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 307 RDLGT-AIERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 366 LQAAAESIVLLKNDANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 138/282 (48%), Gaps = 45/282 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 686
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG A+A ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 687 ADAIVAAW--YPGQSGGTAMARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 738
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT+F Y D Q T+ P
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS---TTTLQAGNP--- 779
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV +AAG
Sbjct: 780 -------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 827 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 867
>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.14.25]
gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.27]
Length = 755
Score = 295 bits (755), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 224/700 (32%), Positives = 351/700 (50%), Gaps = 116/700 (16%)
Query: 85 VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWG 144
V AT+FP I ++++ L +++ T+ +A+ + N L SP ++V RDPRWG
Sbjct: 98 VKTATAFPQAIGLASTWDPDLIREVSSTIRYQAKLIGT--NQCL---SPVLDVCRDPRWG 152
Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
R ET GED Y+V + YV+GLQ ++ A KH+AA+ EG
Sbjct: 153 RCEETYGEDQYLVASIGLAYVKGLQGEN-------------ELIATVKHFAAHGFP--EG 197
Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
R V ++++E F+ PFE+ + G SVM +Y+ ++GIP ++ +LL + +R
Sbjct: 198 G-RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQ 256
Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD-----LDCGDYYTNFTMG 319
+W F G +VSD D+I+ + HK + KE A+ L+AG+D +DC + +
Sbjct: 257 EWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAIL-ALEAGVDTEFPNIDC---FGEPLLE 312
Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGI 379
AV++G I+E+ ID ++ + + +LG F+ +N + N + ELA + AR+ I
Sbjct: 313 AVKEGLISESIIDRAVERVLRIKEKLGLFNNHYINENNVPEKLDNSKSRELALDVARKSI 372
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGFYA 432
VLLKNDN LPLN NI T+A++GP+AN + ++G+Y T + ++G
Sbjct: 373 VLLKNDN-ILPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVVTVLEGI-- 428
Query: 433 YSKVIN-----YAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------- 476
KV N YA GC DI ++ AI+ AK D + V +GL LS
Sbjct: 429 MRKVSNNTNVLYAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGK 487
Query: 477 --------VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
V EG DR L LPG Q EL+ ++ K P+ LV+++ + ++ N +
Sbjct: 488 DEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFN--E 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMP--LRPVNNF 585
+ +I+ +PGEEGG AIADVIFG YNP GRLPI++ + + I Y P LRP
Sbjct: 545 VNAIIDAWFPGEEGGNAIADVIFGDYNPSGRLPISFPIDTGQIPIYYNRKPSSLRP---- 600
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
Y ++PFGYGLSYT+FKY + +PK V+
Sbjct: 601 ----YVMMKSKPLFPFGYGLSYTEFKYSNLEVTPKEVN---------------------- 634
Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFI 703
K +EVEN+GK +G E V +Y SK IK++ G+ +V++
Sbjct: 635 -----------SSGKIKISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVYL 683
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+ K+ F++ ++L D ++ +G + IL+G+
Sbjct: 684 KPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGK 722
>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 886
Score = 295 bits (755), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 177/446 (39%), Positives = 241/446 (54%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N SL +++G VSTEARA +N AGLT WSP
Sbjct: 85 -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + A KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCA 247
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD+I + + H F D +VA LKAG DL+CG Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAIDDMTQFHYFRPDNAGSSVA-ALKAGHDLNCGHAY 306
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 307 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNVAHRALA 365
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 366 LQAAAESIVLLKNDANTLPLRAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 139/282 (49%), Gaps = 45/282 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 686
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 687 ADAIMAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 738
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT+F Y +P+ L + I
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTLQAGNPLQVI------------ 783
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV +AAG
Sbjct: 784 -----------------ATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 827 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 867
>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.4]
Length = 755
Score = 295 bits (754), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 224/700 (32%), Positives = 351/700 (50%), Gaps = 116/700 (16%)
Query: 85 VPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWG 144
V AT+FP I ++++ L +++ T+ +A+ + N L SP ++V RDPRWG
Sbjct: 98 VKTATAFPQAIGLASTWDPDLIREVSSTIRYQAKLIGT--NQCL---SPVLDVCRDPRWG 152
Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
R ET GED Y+V + YV+GLQ ++ A KH+AA+ EG
Sbjct: 153 RCEETYGEDQYLVASIGLAYVKGLQGEN-------------ELIATVKHFAAHGFP--EG 197
Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
R V ++++E F+ PFE+ + G SVM +Y+ ++GIP ++ +LL + +R
Sbjct: 198 G-RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQ 256
Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD-----LDCGDYYTNFTMG 319
+W F G +VSD D+I+ + HK + KE A+ L+AG+D +DC + +
Sbjct: 257 EWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAIL-ALEAGVDTEFPNIDC---FGEPLLE 312
Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGI 379
AV++G I+E+ ID ++ + + +LG F+ +N + N + ELA + AR+ I
Sbjct: 313 AVKEGLISESIIDRAVERVLRIKEKLGLFNDHYINENNVPEKLDNSKSRELALDVARKSI 372
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGFYA 432
VLLKNDN LPLN NI T+A++GP+AN + ++G+Y T + ++G
Sbjct: 373 VLLKNDN-ILPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLEGI-- 428
Query: 433 YSKVIN-----YAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------- 476
KV N YA GC DI ++ AI+ AK D + V +GL LS
Sbjct: 429 MRKVSNNTNVLYAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGK 487
Query: 477 --------VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
V EG DR L LPG Q EL+ ++ K P+ LV+++ + ++ N +
Sbjct: 488 DEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFN--E 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMP--LRPVNNF 585
+ +I+ +PGEEGG AIADVIFG YNP GRLPI++ + + I Y P LRP
Sbjct: 545 VNAIIDAWFPGEEGGNAIADVIFGDYNPSGRLPISFPIDTGQIPIYYNRKPSSLRP---- 600
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
Y ++PFGYGLSYT+FKY + +PK V+
Sbjct: 601 ----YVMMKSKPLFPFGYGLSYTEFKYSNLEVTPKEVN---------------------- 634
Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFI 703
K +EVEN+GK +G E V +Y SK IK++ G+ +V++
Sbjct: 635 -----------SSGKIKISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVYL 683
Query: 704 AAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+ K+ F++ ++L D ++ +G + IL+G+
Sbjct: 684 KPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGK 722
>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 886
Score = 295 bits (754), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 174/446 (39%), Positives = 239/446 (53%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 36 QRAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGH----------- 84
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N +L +++G VSTEARA +N AGLT WSP
Sbjct: 85 -----ATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 139
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + A KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ D++ T+ F + +G SVMC+YN ++G P CA
Sbjct: 191 LAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACA 247
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+++G + EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 307 RELGT-AIERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPLN G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 366 LQAAAESIVLLKNDANTLPLNAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ + YA G A + MIP
Sbjct: 424 QRFGAQQVRYAQG-APLAAGVPGMIP 448
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 95/296 (32%), Positives = 142/296 (47%), Gaps = 52/296 (17%)
Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
+DA V GL VE E G DR D+ LP Q L+ + A A+ P+ +V+
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 672
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
MS AV +N+AK + W YPG+ GG AIA ++ G NPGGRLP+T+Y +
Sbjct: 673 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLP 730
Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
Y S ++ GRTY++F G ++PFGYGLSYT+F Y +P+ L Q
Sbjct: 731 AYVSYDMK------GRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTAL---QAGN 778
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
+ T V N G G EV VY + P + +
Sbjct: 779 PLQVTT--------------------------TVRNTGTRAGDEVAQVYLQYPDRPQSPL 812
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ ++G++RV +AAG+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 813 RSLVGFQRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 867
>gi|319788503|ref|YP_004147978.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
gi|317467015|gb|ADV28747.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
suwonensis 11-1]
Length = 916
Score = 295 bits (754), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 184/482 (38%), Positives = 268/482 (55%), Gaps = 43/482 (8%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
P+ D L + ERA LV RMTL EK QM + + + RLGLP Y+WW+EALHGV+ G
Sbjct: 49 PWLDTSLSFEERAAALVSRMTLEEKAAQMQNDSPAIERLGLPAYDWWNEALHGVARAG-- 106
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN--- 125
GAT FP I ASF+ L ++ +S EARA ++ G
Sbjct: 107 --------------GATVFPQAIGMAASFDVPLMDQVSAAISDEARAKHHDFLRKGEHGR 152
Query: 126 -AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDP++ R +++VRGLQ ++ + + D +
Sbjct: 153 YQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTTRMGVSFVRGLQGMD-PQTGQPLDPKY 211
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + + DR FD ++QD+ +T++ FE V E DV +VM +YN
Sbjct: 212 RKLDATAKHFAVH---SGPEADRHTFDVHPSKQDLYDTYLPAFESLVKEADVYAVMGAYN 268
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G LL T+R DW F GY++SDC +I I ++HK + +T E+A A +K G
Sbjct: 269 RVYGESASGSKFLLLDTLRRDWGFDGYVMSDCWAIVDIWKNHKIV-ETPEEAAALAVKNG 327
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNI 362
+L+CG Y + AV++G I+EA++D +L L++ M LG FD Q + + +
Sbjct: 328 TELNCGSTYADHLPVAVKKGLISEAELDDALTRLFVARMELGMFDPPEQVRWAQVPYSVN 387
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+ +H LA + A++ +VLLKND G LPL+ +I+ LA+VGP A+ T A++GNY GTP
Sbjct: 388 QSAEHDALARKMAQESLVLLKND-GVLPLSK-DIRRLAVVGPTADDTMALLGNYYGTPAD 445
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
+ + G + APG + + ++ D A AT ++ L EA
Sbjct: 446 PVTILRG------IREAAPGVDVVYARGVDLVEGRDDPA----ATPLIEPQYLRPEAGST 495
Query: 483 DR 484
+R
Sbjct: 496 ER 497
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 102/312 (32%), Positives = 155/312 (49%), Gaps = 55/312 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A++AA +ADA V V GL VE E G DR D+ LP Q +L+ V K
Sbjct: 643 ALEAANSADAVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDIRLPATQQKLLEAVHATGK- 701
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV +V+ + A+ I++A+ N + IL YPG+ GG A+ + +FG YNPGGRLP+T+Y
Sbjct: 702 PVVMVLTTGSALGIDWARRN--VPGILVAWYPGQRGGTAVGEALFGDYNPGGRLPVTFYS 759
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
A+ P+ ++ RTY++F G ++PFG+GLSYT F Y +KLD
Sbjct: 760 ADEKLPPFDDYAMKE------RTYRYFTGQPLFPFGHGLSYTSFGYS--------GLKLD 805
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+ + A D+V T + V+N GK G EVV +Y P
Sbjct: 806 RKR-----------------AGAGDEV-------TVSVTVKNQGKRAGDEVVQLYLAPVK 841
Query: 687 IAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVGEG 744
+K++ G++RV + G+S V F++ + L++ D AA G + + VG
Sbjct: 842 PQRERALKELRGFQRVHLQPGESRTVTFSIVPERDLRVYDEAAGRYTVDPGRYEVQVGAS 901
Query: 745 VGGV--SFPLQL 754
+ S PL++
Sbjct: 902 SADIRASVPLEV 913
>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
Length = 349
Score = 295 bits (754), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 145/313 (46%), Positives = 192/313 (61%), Gaps = 26/313 (8%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+ +C+ L P RA L+ +TL EK++Q+ D A G+PR G+P YEWWSE+LHG++ G
Sbjct: 39 SYTFCNQSLSVPTRAHSLISLLTLEEKIKQLSDNASGIPRFGIPPYEWWSESLHGIAING 98
Query: 71 RRTNSPPGTHFD-SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
PG F V AT FP VI++ A+FN +LW IG ++ EARAM+N+G +GLT
Sbjct: 99 ------PGVSFTIGPVSAATGFPQVIISAAAFNRTLWFLIGSAIAIEARAMHNVGQSGLT 152
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ-----------------DVE 172
FW+PN+N+ RDPRWGR ETPGEDP + YAI +V+G Q E
Sbjct: 153 FWAPNVNIFRDPRWGRGQETPGEDPMLTSAYAIEFVKGFQGGNWKSGVSGSGSGRYGFGE 212
Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
D L +SACCKH AYDL+ W R+ F++ VTEQD+++T+ PF C+
Sbjct: 213 KRMLRDDDGDDGLMLSACCKHLTAYDLEKWGNFSRYSFNAVVTEQDLEDTYQPPFRSCIE 272
Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDT 292
EG S +MCSYN VNG+P CA LL Q R +W F GYIVSDCD++ TI E + + +
Sbjct: 273 EGKASCLMCSYNEVNGVPACAREDLL-QKAREEWGFEGYIVSDCDAVATIFEYQNY-SKS 330
Query: 293 KEDAVARVLKAGL 305
EDAVA LKAG+
Sbjct: 331 AEDAVAIALKAGM 343
>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 888
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 185/477 (38%), Positives = 254/477 (53%), Gaps = 50/477 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 38 QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 86
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N AGLT WSP
Sbjct: 87 -----ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 141
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D P I A KH
Sbjct: 142 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLEHPRTI-ATPKH 192
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ +D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 193 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 249
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 250 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAS-LKAGHDLNCGTAY 308
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELA 371
A+++G++ EA +D SL L+ RLG +Y LG +I N + LA
Sbjct: 309 RALGT-AIERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALA 367
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKN N LPL G LA++GP+A+A A+ NY+GT + +P+ G
Sbjct: 368 LQAAAESIVLLKNANATLPLKAGT--RLAVIGPNADALAALEANYQGTSSQPVTPLLGLR 425
Query: 432 AY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-SVEAEGKDRV 485
++ + YA G A + MIP + A +D + G +VE EG RV
Sbjct: 426 QRFGAQQVRYAQG-APLAAGVPGMIP---ETALRSDGKPGLRGEYFDNVELEGAPRV 478
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 89/287 (31%), Positives = 142/287 (49%), Gaps = 45/287 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 630 VEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 688
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA + G NPGGRLP+T+Y + PY S ++
Sbjct: 689 ADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 740
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT+F Y+ +P+ + + Q + T
Sbjct: 741 GRTYRYFKGEALFPFGYGLSYTRFAYE---TPR---LSVTTLQAGSPLQVTT-------- 786
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G+ G EV VY + P + ++ ++G++RV + G
Sbjct: 787 ------------------TVRNTGERAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQPG 828
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
+ + FT++A ++L VD ++ +G + + VG G P Q
Sbjct: 829 EQRTLTFTLDA-RALSDVDRTGTRVVEAGDYRLFVGGGQPDTGAPGQ 874
>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 888
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 185/477 (38%), Positives = 254/477 (53%), Gaps = 50/477 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 38 QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 86
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N AGLT WSP
Sbjct: 87 -----ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 141
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D P I A KH
Sbjct: 142 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLEHPRTI-ATPKH 192
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ +D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 193 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 249
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 250 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAS-LKAGHDLNCGTAY 308
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELA 371
A+++G++ EA +D SL L+ RLG +Y LG +I N + LA
Sbjct: 309 RALGT-AIERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALA 367
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKN N LPL G LA++GP+A+A A+ NY+GT + +P+ G
Sbjct: 368 LQAAAESIVLLKNANATLPLKAGT--RLAVIGPNADALAALEANYQGTSSQPVTPLLGLR 425
Query: 432 AY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-SVEAEGKDRV 485
++ + YA G A + MIP + A +D + G +VE EG RV
Sbjct: 426 QRFGAQQVRYAQG-APLAAGVPGMIP---ETALRSDGKPGLRGEYFDNVELEGAPRV 478
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 133/287 (46%), Gaps = 45/287 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 630 VEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 688
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA + G NPGGRLP+T+Y + PY S ++
Sbjct: 689 ADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 740
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT F Y Q G+
Sbjct: 741 GRTYRYFKGEALFPFGYGLSYTSFAYDAP--------------QLSSTTLQAGS------ 780
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV + G
Sbjct: 781 ------------PLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQPG 828
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
+ + FT++A ++L VD + +G + + VG G P Q
Sbjct: 829 EQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGGQPDTGAPGQ 874
>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 886
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 175/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N SL +++G VSTEARA +N AGLT WSP
Sbjct: 85 -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + A KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCA 247
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 307 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 366 LQAAAESIVLLKNDANTLPLRAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448
Score = 132 bits (331), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 90/282 (31%), Positives = 138/282 (48%), Gaps = 45/282 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAPQQTLLER-AKASGKPLVVVLMSGSAVALNWAKTH 686
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 687 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 738
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT+F Y D Q T+ P
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS---TTTLQAGNP--- 779
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV +AAG
Sbjct: 780 -------------LQVTATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 827 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 867
>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 886
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 175/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N SL +++G VSTEARA +N AGLT WSP
Sbjct: 85 -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + A KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCA 247
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 307 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 366 LQAAAESIVLLKNDANTLPLRAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 91/282 (32%), Positives = 139/282 (49%), Gaps = 45/282 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKMH 686
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + PY S ++
Sbjct: 687 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 738
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT+F Y D Q T+ P
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS---TTTLQAGNP--- 779
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV +AAG
Sbjct: 780 -------------LQVTATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 827 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 867
>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 888
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 185/477 (38%), Positives = 254/477 (53%), Gaps = 50/477 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 38 QRAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 86
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N AGLT WSP
Sbjct: 87 -----ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 141
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D P I A KH
Sbjct: 142 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLEHPRTI-ATPKH 192
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ +D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 193 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 249
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 250 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAS-LKAGHDLNCGTAY 308
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELA 371
A+++G++ EA +D SL L+ RLG +Y LG +I N + LA
Sbjct: 309 RALGT-AIERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALA 367
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKN N LPL G LA++GP+A+A A+ NY+GT + +P+ G
Sbjct: 368 LQAAAESIVLLKNANATLPLKAGT--RLAVIGPNADALAALEANYQGTSSQPVTPLLGLR 425
Query: 432 AY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-SVEAEGKDRV 485
++ + YA G A + MIP + A +D + G +VE EG RV
Sbjct: 426 QRFGAQQVRYAQG-APLAAGVPGMIP---ETALRSDGKPGLRGEYFDNVELEGAPRV 478
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 133/287 (46%), Gaps = 45/287 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 630 VEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 688
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA + G NPGGRLP+T+Y + PY S ++
Sbjct: 689 ADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 740
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT F Y Q G+
Sbjct: 741 GRTYRYFKGEALFPFGYGLSYTSFAYDAP--------------QLSSTTLQAGS------ 780
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV + G
Sbjct: 781 ------------PLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLQPG 828
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
+ + FT++A ++L VD + +G + + VG G P Q
Sbjct: 829 EQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGGQPDTGAPGQ 874
>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 874
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 174/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 24 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 72
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N AGLT WSP
Sbjct: 73 -----ATVFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 127
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + A KH
Sbjct: 128 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 178
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ +D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 179 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACA 235
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 236 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 294
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 295 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 353
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 354 LQAAAESIVLLKNDANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 411
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 412 QRFGAQQVSYAQG-APLAAGVPGMIP 436
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 87/282 (30%), Positives = 136/282 (48%), Gaps = 45/282 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 616 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 674
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 675 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 726
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++ FGYGLSYT+F Y Q G++
Sbjct: 727 GRTYRYFKGEPLFAFGYGLSYTRFAYDAP--------------QLSTTTLQAGSS----- 767
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV +AAG
Sbjct: 768 -------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 814
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 815 EQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 855
>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 889
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 174/446 (39%), Positives = 242/446 (54%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA DLV M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 39 QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N AGLT WSP
Sbjct: 88 -----ATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSP 142
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++ GLQ D P I A KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQG--------DDLDHPRTI-ATPKH 193
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + + R FD V+ +D++ T+ F + EG +VMC+YN ++G P CA
Sbjct: 194 LAVH---SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACA 250
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
L+N +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 251 ADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 309
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N QH LA
Sbjct: 310 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALA 368
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKN+ LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 369 LQAAAESIVLLKNNANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 426
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 427 QRFGAQQVSYAQG-APLAAGVPGMIP 451
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 137/280 (48%), Gaps = 49/280 (17%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 741
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT F Y Q G+
Sbjct: 742 GRTYRYFKGEPLFPFGYGLSYTCFAYDAP--------------QLSSTAVQAGS------ 781
Query: 647 AVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
T Q+ V N G G EV VY + P + ++ ++G++RV +A
Sbjct: 782 --------------TLQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLA 827
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
AG+ + F ++A ++L VD + + +G +T+ VG G
Sbjct: 828 AGEQRTLTFNLDA-RALSDVDPSGQRAVEAGNYTLFVGGG 866
>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 889
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 174/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 39 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N AGLT WSP
Sbjct: 88 -----ATVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 142
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + A KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 193
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ +D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 194 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACA 250
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 251 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 309
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 310 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 368
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 369 LQAAAESIVLLKNDANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 426
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 427 QRFGAQQVSYAQG-APLAAGVPGMIP 451
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 87/282 (30%), Positives = 136/282 (48%), Gaps = 45/282 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 741
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++ FGYGLSYT+F Y Q G++
Sbjct: 742 GRTYRYFKGEPLFAFGYGLSYTRFAYDAP--------------QLSTTTLQAGSS----- 782
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV +AAG
Sbjct: 783 -------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 829
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 830 EQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 870
>gi|167038437|ref|YP_001666015.1| glycoside hydrolase family 3 [Thermoanaerobacter pseudethanolicus
ATCC 33223]
gi|320116830|ref|YP_004186989.1| glycoside hydrolase family 3 domain-containing protein
[Thermoanaerobacter brockii subsp. finnii Ako-1]
gi|166857271|gb|ABY95679.1| glycoside hydrolase, family 3 domain protein [Thermoanaerobacter
pseudethanolicus ATCC 33223]
gi|319929921|gb|ADV80606.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
brockii subsp. finnii Ako-1]
Length = 784
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 230/804 (28%), Positives = 378/804 (47%), Gaps = 133/804 (16%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGD--------------------LAYGV---PR 50
Y D+ +R +DL+++MT+ EKV Q+ ++YG+ R
Sbjct: 5 YLDSTQSVEKRVEDLLQQMTIEEKVAQLNSIWVYEILDDMKFSFDKAKRLMSYGIGQITR 64
Query: 51 LG----LPLYEWWSEALHGVSFI--GRRTNSPPGTHFDS----EVPGATSFPTVILTTAS 100
LG L E A F+ R P H +S GAT FP I ++
Sbjct: 65 LGGASNLSPRETVRIANQIQKFLIENTRLGIPALIHEESCSGYMAKGATIFPQTIGVAST 124
Query: 101 FNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRY 160
+N + +K+ + + +A+ +P +++ RDPRWGR ET GEDPY+V R
Sbjct: 125 WNNEIVEKMASVIREQMKAV-----GARQALAPLLDITRDPRWGRTEETFGEDPYLVMRM 179
Query: 161 AINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQ 220
++Y+RGLQ ++S I A KH+ Y N EG + + + E++++
Sbjct: 180 GVSYIRGLQ----------TESLKEGIVATGKHFVGYG--NSEGGMNWA-PAHIPERELR 226
Query: 221 ETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQ 280
E F+ PFE V E +SS+M Y+ ++G+P KLLN +R DW F G +VSD +I
Sbjct: 227 EVFLYPFEAAVKEAKLSSIMPGYHELDGVPCHKSKKLLNDILRKDWGFEGIVVSDYFAIS 286
Query: 281 TIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAEADIDTSLRFL 338
+ E H +D K+ A L+AG+D++ DYY ++ G+I ++ +++ +
Sbjct: 287 QLYEYHHVTSD-KKGAAKLALEAGVDVELPSTDYYGLPLRELIESGEIDIDFVNEAVKRV 345
Query: 339 YIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKT 398
+ LG F+ + + ELA + A++ IVLLKN+N LPL ++K+
Sbjct: 346 LKIKFELGLFENPYINEEKAVEIFDTNEQRELAYKIAQESIVLLKNENNLLPLKK-DLKS 404
Query: 399 LALVGPHANATKAMIGNYEGTPCR-------------YTSPM-------DGFYAYSKVIN 438
+A++GP+A++ + MIG+Y PC + +P+ D + V+
Sbjct: 405 IAVIGPNADSIRNMIGDY-AYPCHIESLLEMRETDNVFNTPLPESLEAKDIYVPIVTVLQ 463
Query: 439 -------------YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----LDLSVEAE 480
YA GC D++ + A++ AK AD V+V G D E
Sbjct: 464 GIKAKVSSNTEVLYAKGC-DVLNNSKDGFKEAVEIAKQADVAVVVVGDKSGLTDGCTSGE 522
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
+DR DL LPG Q ELI + + PV +V+++ + I++ KI +I+ PGE
Sbjct: 523 SRDRADLNLPGVQEELIKAIYETGT-PVIVVLINGRPMSISWIAE--KIPAIIEAWLPGE 579
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
EGGRA+ADVIFG YNPGG+LPI+ ++ + + Y P +++ G + P +Y
Sbjct: 580 EGGRAVADVIFGDYNPGGKLPISIPQSVGQLPVYYYHKPSGGRSHWKGDYVELSTKP-LY 638
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFGYGLSYT+F Y N + K V +D
Sbjct: 639 PFGYGLSYTEFSY---------------------TNLNISNRK----------VSLRDRM 667
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNAC 718
++++N G + G EVV +Y ++ T +K++ G++R+ + AG+ V F + +
Sbjct: 668 VEISVDIKNTGTLKGDEVVQLYIHQEALSVTRPVKELKGFKRITLDAGEEKTVIFKL-SI 726
Query: 719 KSLKIVDNAANSLLASGAHTILVG 742
+ L D ++ G +++G
Sbjct: 727 EQLGFYDENMEYVVEPGRVDVMIG 750
>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 886
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 175/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 36 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 84
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N SL +++G VSTEARA +N AGLT WSP
Sbjct: 85 -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 139
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + A KH
Sbjct: 140 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 190
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 191 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCA 247
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 248 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 306
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 307 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 365
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 366 LQAAAESIVLLKNDANTLPLRAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 423
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 424 QRFGAQQVSYAQG-APLAAGVPGMIP 448
Score = 132 bits (333), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 90/282 (31%), Positives = 140/282 (49%), Gaps = 45/282 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 628 VEGEELRIDVPGFDGGDRNDIALPAAQQTLLER-AKASGKPLVVVLMSGSAVALNWAKTH 686
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 687 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 738
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT+F Y +P+ L Q + T
Sbjct: 739 GRTYRYFKGEPLFPFGYGLSYTRFAY---DAPQLSTTTL---QAGNPLQVTA-------- 784
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV +AAG
Sbjct: 785 ------------------TVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 826
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 827 EQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 867
>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 874
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 175/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 24 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 72
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N SL +++G VSTEARA +N AGLT WSP
Sbjct: 73 -----ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSP 127
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + A KH
Sbjct: 128 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 178
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 179 IAVHSGPE---PGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCA 235
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 236 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 294
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 295 RELGT-AIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 353
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 354 LQAAAESIVLLKNDANTLPLRAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 411
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 412 QRFGAQQVSYAQG-APLAAGVPGMIP 436
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 91/282 (32%), Positives = 139/282 (49%), Gaps = 45/282 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 616 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKMH 674
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + PY S ++
Sbjct: 675 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 726
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT+F Y D Q T+ P
Sbjct: 727 GRTYRYFKGEPLFPFGYGLSYTRFAY-------------DAPQLS---TTTLQAGNP--- 767
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV +AAG
Sbjct: 768 -------------LQVTATVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 814
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 815 EQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGGQPGT 855
>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 889
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 174/446 (39%), Positives = 241/446 (54%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 39 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N AGLT WSP
Sbjct: 88 -----ATVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 142
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + A KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQG---------EDLNHPRTIATPKH 193
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + + R FD V+ +D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 194 IAVH---SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACA 250
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 251 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 309
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+ +G++ EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 310 RELGT-AIARGEVDEALLDQSLVRLFATRYRLGELEAPRKDPYARLGAKDVDNAAHRALA 368
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 369 LQAAAESIVLLKNDANTLPLKAGT--RLAVIGPNADALAALEANYQGTSSAPVTPLLGLR 426
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ ++YA G A + MIP
Sbjct: 427 QRFGAQQVSYAQG-APLAAGVPGMIP 451
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 87/282 (30%), Positives = 136/282 (48%), Gaps = 45/282 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + Y S ++
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPAYVSYDMK------ 741
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++ FGYGLSYT+F Y Q G++
Sbjct: 742 GRTYRYFKGEPLFAFGYGLSYTRFAYDAP--------------QLSTTTLQAGSS----- 782
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EV VY + P + ++ ++G++RV +AAG
Sbjct: 783 -------------LQVTTTVRNTGARAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLAAG 829
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ + F ++A ++L VD + + +G +T+ VG G G
Sbjct: 830 EQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGGQPGT 870
>gi|386718620|ref|YP_006184946.1| glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
gi|384078182|emb|CCH12773.1| Glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
Length = 897
Score = 292 bits (748), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 178/447 (39%), Positives = 246/447 (55%), Gaps = 45/447 (10%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
P+ D + +RA LV +MTL EK QM + A + RLG+P Y+WW+E LHGV+ G+
Sbjct: 37 PWLDVSASFEQRAASLVAQMTLDEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
AT FP I A+F+ L ++ T+S EARA ++
Sbjct: 96 ---------------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLRQGAHGR 140
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPN+N+ RDPRWGR ET GEDPY+ R + +VRGLQ + V
Sbjct: 141 YQGLTFWSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYR-------- 192
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH A + DR HFD+R + +D+ +T++ FE V EGDV +VM +YN
Sbjct: 193 -KLDATAKHLAVHSGPE---ADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYN 248
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R DW F GY+VSDC +I I + H + T+E A A ++ G
Sbjct: 249 RVYGESASASRFLLRDVLRRDWGFKGYVVSDCWAIVDIWKHHHIVT-TREAAAALAVRNG 307
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN 364
+L+CG Y AV+QG I+EA+ID ++ L+ MRLG FD + + N
Sbjct: 308 TELECGQEYATLP-SAVRQGLISEAEIDDAVTRLFTARMRLGMFDPPERVRWARIPASVN 366
Query: 365 --PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
P H LA +AA+ +VLLKND G LPL+ +IK +A+VGP A+ T A++GNY GTP
Sbjct: 367 QAPSHDALALKAAQASLVLLKND-GILPLSR-DIKRIAVVGPTADDTMALLGNYFGTPAA 424
Query: 423 YTSPMDGFYAYSK--VINYAPGCADIV 447
+ + G +K + YA G D+V
Sbjct: 425 PVTILQGIREAAKGVEVRYARGV-DLV 450
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 94/298 (31%), Positives = 149/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+DAA+ AD V V GL VE E G DR DL LP Q L+ + K
Sbjct: 622 ALDAAREADVVVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHATGK- 680
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV +V+ A+ +++A+++ + +IL YPG+ GG A+ +FG NP GRLP+T+Y+
Sbjct: 681 PVVMVLTGGSAIAVDWAQSH--LPAILMSWYPGQRGGTAVGQALFGDVNPAGRLPVTFYK 738
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
A+ ++P GRTY++F G +YPFG+GLSYT+F Y ++LD
Sbjct: 739 AS------EALPAFDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGT--------LRLD 784
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPP 685
G+ + D + ++V N G G EVV +Y +
Sbjct: 785 -----------AGSLR-------------ADGRLGVAVDVTNAGTRSGDEVVQLYVRREH 820
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA-ANSLLASGAHTILVG 742
+G ++++ G++R+ +A G+ V FT+ A ++L+ D A A + GA+ + VG
Sbjct: 821 AGSGDAVQELRGFQRIHLAPGEHRTVTFTLEAAQALRHYDEARAAYEVRPGAYEVRVG 878
>gi|408824590|ref|ZP_11209480.1| Glucan 1,4-beta-glucosidase [Pseudomonas geniculata N1]
Length = 897
Score = 292 bits (748), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 177/447 (39%), Positives = 246/447 (55%), Gaps = 45/447 (10%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
P+ D + +RA LV +MTL EK QM + A + RLG+P Y+WW+E LHGV+ G+
Sbjct: 37 PWLDVSASFEQRAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
AT FP I A+F+ L ++ T+S EARA ++
Sbjct: 96 ---------------ATVFPQAIGLAATFDVPLMGQVAATISDEARAKHHQFLREGAHGR 140
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPN+N+ RDPRWGR ET GEDPY+ R + +VRGLQ + V
Sbjct: 141 YQGLTFWSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYR-------- 192
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH A + DR HFD+R + +D+ +T++ FE V EGDV +VM +YN
Sbjct: 193 -KLDATAKHLAVHSGPE---ADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYN 248
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R DW F GY+VSDC +I I + H+ + T+E A A ++ G
Sbjct: 249 RVYGESASASRFLLRDVLRRDWGFKGYVVSDCWAIVDIWKHHRIVT-TREAAAALAVRNG 307
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN 364
+L+CG Y AV+QG I+EA+ID ++ L+ MRLG FD + + N
Sbjct: 308 TELECGQEYATLP-SAVRQGLISEAEIDDAVTRLFTARMRLGMFDPPERVRWARIPASVN 366
Query: 365 --PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
P H LA +AA+ +VLLKND G LPL+ N + +A+VGP A+ T A++GNY GTP
Sbjct: 367 QAPAHDALALKAAQASLVLLKND-GILPLSR-NTRRIAVVGPTADDTMALLGNYFGTPAA 424
Query: 423 YTSPMDGFYAYSK--VINYAPGCADIV 447
+ + G +K + YA G D+V
Sbjct: 425 PVTILQGIREAAKGVEVRYARGV-DLV 450
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 137/287 (47%), Gaps = 53/287 (18%)
Query: 468 VIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA 517
V V GL VE E G DR DL LP Q L+ + K PV +V+ A
Sbjct: 633 VFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHGTGK-PVVMVLTGGSA 691
Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
+ +++A+ + + +IL YPG+ GG A+ +FG NP GRLP+T+Y+A +M
Sbjct: 692 IAVDWAQAH--LPAILMSWYPGQRGGTAVGQALFGDVNPSGRLPVTFYKAG------EAM 743
Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
P GRTY++F G +YPFG+GLSYT+F Y ++LD D
Sbjct: 744 PAFDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGT--------LRLDADSL------- 788
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVI 696
D + ++V N G G EVV +Y + +G ++++
Sbjct: 789 -----------------RADGRLGVAVDVANTGTRSGDEVVQLYVRREHAGSGDAVQELR 831
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNA-ANSLLASGAHTILVG 742
G++RV +A G+ V FT+ A ++L+ D A A + GA+ + VG
Sbjct: 832 GFQRVQLAPGERRTVTFTLEAAQALRHYDEARAAYAVQPGAYEVRVG 878
>gi|374313710|ref|YP_005060140.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358755720|gb|AEU39110.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 883
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 166/460 (36%), Positives = 247/460 (53%), Gaps = 51/460 (11%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D LP +RA DLV R+TL EK Q+ A G+PRLG+P Y++WSE LHG++ G
Sbjct: 35 LPYQDTTLPAEQRAADLVGRLTLDEKAAQLVTSAPGIPRLGVPAYDFWSEGLHGIARSGY 94
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
AT FP + A+F+E L +IG+ +STEARA YN A
Sbjct: 95 ----------------ATLFPQAVGMAATFDEPLLHQIGEVISTEARAKYNDAVAHDLRS 138
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLT WSPNIN+ RDPRWGR ET GEDP++ R +V GLQ D
Sbjct: 139 IFYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARLGTAFVEGLQG---------DDPN 189
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
+ KH+A + + ++R F++ + D+ +T++ F + EG S+MC+Y
Sbjct: 190 YYRAIGTPKHFAVH---SGPESERHRFNADPSPHDLWDTYLPAFRATIVEGKAGSIMCAY 246
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARVL 301
N + G P CA LL++ +R DW F G++ SDC +I E H + D ++ +V +
Sbjct: 247 NAIEGKPACASDLLLDEVLRKDWAFKGFVTSDCGAIDNFFEKDGHHYSKDAEQASVDGI- 305
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGK 359
+AG D +CG Y N AV++G I E+++D LR L++ +LG FD Q Y ++
Sbjct: 306 RAGTDTNCGGTYRNLA-SAVRKGMIQESELDVPLRRLFLARFKLGLFDPPSQVKYASMPI 364
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ H ELA +AAR+ +VLLKN++ LPL+ +KT+A++GP+A++ ++ GNY
Sbjct: 365 TENMSSSHTELALQAAREAVVLLKNEHHTLPLD-ARVKTIAVIGPNASSLISLEGNYNAI 423
Query: 420 PCRYTSPMDGF--------YAYSKVINYAPGCADIVCQNN 451
P +DG Y++ YA G A ++ +
Sbjct: 424 PKNPVMQVDGIAREFRDAKVLYAQGSPYAEGVALVIPRTQ 463
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 143/298 (47%), Gaps = 52/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A++A K ADA V GL +E E G DR DL+LP Q +L+ + A A+
Sbjct: 606 AMEAVKQADAVVAFVGLSPELEGEEMDVHIPGFSGGDRTDLVLPAAQQQLL-EAAKASGK 664
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
P+ +V+++ A+ +N+A+ + +IL YPG+ G +AIA+ + GK NP GRLP+T+Y
Sbjct: 665 PLVVVLLNGSALAVNWAQEH--ADAILEAWYPGQAGAQAIAETLSGKNNPSGRLPVTFYR 722
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ P+T + RTY++F G +Y FGYGLSY+ F Y A K +LD
Sbjct: 723 SVNDLPPFTDYAM------ANRTYRYFKGKPLYEFGYGLSYSTFSYSNAHLSKE---RLD 773
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
R + +V+N + G EV +Y PP
Sbjct: 774 AGDTLR-----------------------------VEADVKNTSTLAGDEVAELYLTPPQ 804
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
++ + G+E V + GQS V FT++ + L VD + +G +++ VG G
Sbjct: 805 NGVYPLRSLEGFEHVHLLPGQSKHVSFTLDP-RQLSEVDEKGIRAVRAGVYSVTVGGG 861
>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
Length = 868
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 177/469 (37%), Positives = 246/469 (52%), Gaps = 54/469 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S + D+P+ + LP ER DL++R+T EKV QM + + RLG+P Y+WW+EAL
Sbjct: 18 SCSQRQEDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEAL 77
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ G+ AT FP I A+F++ + VS EARA Y+
Sbjct: 78 HGVARAGK----------------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQ 121
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ R + V+GLQ
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQG----- 176
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D + K AC KHYA + W +R FD VT +D+ +T++ FE V EG+
Sbjct: 177 ----DDPKYFKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGN 229
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE------SHKFL 289
V VMC+YNR G P C+ KLL +R W + I+SDC +I E H+
Sbjct: 230 VQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERTPRHETH 289
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
D E A A + G DL+CG+ Y + A++ GKI+E D+D SLR L LG FD
Sbjct: 290 PDA-ESASADAVLNGTDLECGNSYRAL-VKALKDGKISENDLDVSLRRLLKGRFELGMFD 347
Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
Q Y + N + +P+H+ A E A + +VLLKN N LPL + I+ +A+VGP+A
Sbjct: 348 PDEQVPYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAA 406
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
+ + NY G P + ++G ++VI Y GC AD V Q+
Sbjct: 407 DSTMLWANYNGFPTHTVTILEGIRNKVPDTEVI-YELGCNHAADFVIQD 454
Score = 132 bits (331), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 142/315 (45%), Gaps = 55/315 (17%)
Query: 442 GCADIVCQNNSMIPA--AIDAAKNADATVIV---------AGLDLSVEAEG---KDRVDL 487
G AD+ Q + P A AAK DA VIV G ++ V EG DR ++
Sbjct: 579 GSADLNFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNI 638
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LP Q E++ K A PV V+ + A+ +N+ + N I +IL Y G+E G A+A
Sbjct: 639 ELPKVQQEMV-KALKATGKPVVYVLCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVA 695
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
D++FG YNP GRLP+T+Y++ +P + GRTY++ +YPFGYGLSY
Sbjct: 696 DILFGDYNPSGRLPVTFYKS------IDQLPDFEDYSMKGRTYRYMTETPLYPFGYGLSY 749
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T F Y+ A K K+ KDQ T ++
Sbjct: 750 TNFAYRNA---KLSSGKIAKDQSV-----------------------------TLTFDIA 777
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N GKMDG EV +Y K P IK + + RV + AG S +V + DN
Sbjct: 778 NTGKMDGDEVAQIYIKNPNDPEGPIKALKAFLRVHVKAGDSQEVNIELAPETFHSFNDNT 837
Query: 728 ANSLLASGAHTILVG 742
+ G + IL G
Sbjct: 838 QTMEVRPGKYQILYG 852
>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
Length = 788
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 238/811 (29%), Positives = 373/811 (45%), Gaps = 163/811 (20%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y + K P +R +DL+ +MTL EK QM L YG R+ LP W +E G+ I
Sbjct: 43 YENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQNNWKTEVWKDGIGNI 101
Query: 70 GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
N P H +++ +P AT F
Sbjct: 102 DEEHNGLGAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYF 161
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L +IG+ + EA A+ G T +SP +++ +DPRWGR +ET
Sbjct: 162 PAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRCVETY 215
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + LQ + A KH+A Y + + +
Sbjct: 216 GEDPYLVGELGKQMITSLQK--------------YNLVATPKHFAVYSIPIGGRDGKTRT 261
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ +I PF M E VM SYN +G P L + +R +W F G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ I HK + DT ED +A+ + AGL++ T+FT AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIELAAEAARQG 378
GKI++ +D + + + RLG FD Y+ GK + + +H ++ EAARQ
Sbjct: 376 DNGKISQETLDKRVAEILRIKFRLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYA--- 432
+VLLKN+ LPL+ +I+++A++GP+AN +I CRY +P+ Y
Sbjct: 434 LVLLKNETNLLPLSK-SIRSIAVIGPNANEQTQLI-------CRYGPANAPIKTVYQGIK 485
Query: 433 ----YSKVINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGL 473
+++VI Y GC DI+ + ++ AI AAK A+ V+V G
Sbjct: 486 ELLPHAEVI-YKKGC-DIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGG 543
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
+ E + R L LPG Q EL+ V K PV LV++ A IN+A + + +IL
Sbjct: 544 NELTVREDRSRTSLNLPGRQEELLKAVCATGK-PVILVMLDGRASSINYAAAH--VPAIL 600
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGE G+A+A+ +FG YNPGGRL +T + + +IP+ + P +P ++ T +
Sbjct: 601 HAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSSTSVY- 657
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
+YPFG+GLSYT F Y + SP ++ D C+
Sbjct: 658 --GALYPFGHGLSYTTFTYSDLHISPSHQGVQGDIHVSCK-------------------- 695
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
++N GK+ G EVV +Y + T+ K + G+ER+ + AG+ V
Sbjct: 696 -------------IKNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTV 742
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + + L + D N + G+ +++G
Sbjct: 743 HFRLRP-QDLGLWDKNMNFRVEPGSFKVMLG 772
>gi|160901716|ref|YP_001567297.1| glycoside hydrolase family 3 protein [Petrotoga mobilis SJ95]
gi|160359360|gb|ABX30974.1| glycoside hydrolase family 3 domain protein [Petrotoga mobilis
SJ95]
Length = 777
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 237/814 (29%), Positives = 389/814 (47%), Gaps = 161/814 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDL-AYGVPRLGLPLYEWWSEAL-HGVSFIGR 71
Y + P ER +DL+E+MTL EK+ Q+G +Y + G +E L G+ I R
Sbjct: 4 YKNPDKPIEERIEDLLEQMTLDEKIAQLGSFWSYELLDNGNFSFEKAQNLLKEGIGQITR 63
Query: 72 ---------------------------RTNSPPGTHFDS----EVPGATSFPTVILTTAS 100
R P H + GAT FP +I ++
Sbjct: 64 PGGATGFSPKKTAELANKIQKFLLTETRLGIPAFMHEECLSGYMTRGATIFPQMIGAAST 123
Query: 101 FNESLWKKIGQTVSTEARAMYNLG-NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
+ L +++ ++ + +A LG + GL SP ++V RDPRWGR ET GEDPY++ +
Sbjct: 124 WEPPLIERMTTSIRNQMKA---LGIHQGL---SPVVDVTRDPRWGRTEETFGEDPYLIAK 177
Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVT 215
+ YV+GLQ SD I A KH+ Y + NW + +
Sbjct: 178 MGVAYVKGLQ----------SDDLKNGIVATLKHFVGYGVSEGGMNWA-------PAHIP 220
Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
E++++ETF+ PFE + EG V SVM +Y+ ++GIP A LL + +R +W F G +VSD
Sbjct: 221 ERELKETFLFPFEAAIKEGKVKSVMNAYHEIDGIPCGASETLLRRILREEWGFDGIVVSD 280
Query: 276 CDSIQTIVESHKF-LNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGAVQQGKIAEADID 332
+I +++E HK LN KE+A + LKAG+D++ D Y A++ G+ +EA ID
Sbjct: 281 YFAINSLMEYHKIALN--KEEAAIKALKAGIDVELPSFDCYKEPLKNAIENGEFSEAFID 338
Query: 333 TSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQHIELAAEAARQGIVLLKNDNGALP 390
S+R + + +G F+ Y +L K +N+ P+ +LA E A++ IVLLKND G +P
Sbjct: 339 KSVRNILRLKFEMGLFENP--YVDLEKVPDNLDTPEDRKLAYEIAKKSIVLLKND-GIVP 395
Query: 391 LNTGN-IKTLALVGPHANATKAMIGNY-------------------EGT-------PCR- 422
L + IK +A++GP+AN+ + + G+Y EG P +
Sbjct: 396 LKKNSKIKKVAVIGPNANSARNLTGDYTYLTHLETLKQGAFGTSAMEGITFSESELPIKT 455
Query: 423 -YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG------LDL 475
Y S + + +YA GC +I N MI A++ A+N+D ++V G LD
Sbjct: 456 IYESLKEKLEKLNVETSYAKGC-EINDDNKEMIKEAVELAENSDVALLVLGDKSGLTLDC 514
Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
+ E +D L+LPG Q +L+ V + PV +V+++ +++ N + +I
Sbjct: 515 TT-GESRDSSTLILPGVQLDLLKSVINTGT-PVIVVLVNGRPYSLDWVSKN--VSAIFEA 570
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
PGEEGG A+AD+I G +P G+LPI++ + + Y P + + G + D
Sbjct: 571 WLPGEEGGNALADIILGDESPSGKLPISFPRHVGQIPVYYNHKPSGGRSQWWG---DYTD 627
Query: 595 GPV--VYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
P +YPFG+GLSYTQF+Y ++ ++ + V I +D
Sbjct: 628 SPAKPLYPFGHGLSYTQFEYGNLQIENNDRIVKISMD----------------------- 664
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQS 708
V+N+G+ G E+V +Y + T +K++ G++RV + +
Sbjct: 665 ----------------VKNIGEETGDEIVQLYMNDEVASVTRPVKELKGFQRVTLKPSEK 708
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++ F + ++L + + L+ G ++VG
Sbjct: 709 KRIIFNL-PIETLALYNEKMEFLVEKGYFKVMVG 741
>gi|383643328|ref|ZP_09955734.1| glycoside hydrolase family 3 [Sphingomonas elodea ATCC 31461]
Length = 799
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 229/720 (31%), Positives = 343/720 (47%), Gaps = 106/720 (14%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P+ EALHG+ PGATSFP I +SF+ L + I
Sbjct: 145 RLGIPML-MHEEALHGLV-----------------APGATSFPQSIALASSFDPKLVENI 186
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
+ EARA A L +P ++V RDPRWGR+ ET GEDPY+V + + +RG Q
Sbjct: 187 FSMAAKEARAR----GANLVL-APVVDVARDPRWGRIEETYGEDPYLVTQMGLAAIRGFQ 241
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
G SD K+ KH + N + + E+ ++E F PFE
Sbjct: 242 ---GTTMPLKSD----KVFITLKHMTGHGQPE---NGTNVGPASLGERTLREDFFPPFEA 291
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V V SVM SYN ++GIP+ A+ LL +RG+W F G +VSD +I+ ++ H
Sbjct: 292 AVKTLPVMSVMASYNEIDGIPSHANKWLLTDVLRGEWGFQGAVVSDYFAIRELITRHHLF 351
Query: 290 NDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
D K DA R L AG+D++ G+ YT+ V+QG++++ +ID ++R + + G
Sbjct: 352 KDPK-DAAQRALDAGVDVETPDGEAYTHLVQ-LVKQGRVSQGEIDNAVRRVLRMKFEGGL 409
Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
F+ L P+ I L+ +AAR+ IVLLKN G LPL+ IK +A++G HA
Sbjct: 410 FENPYPEVKLAAARTNTPEAIALSRQAARESIVLLKNAQGLLPLDARGIKRMAVIGTHAK 469
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSK---VINYAPG---------CADIVCQ-----N 450
T IG Y P S ++G A K ++YA G D V Q N
Sbjct: 470 DTP--IGGYSDLPNHVVSVLEGMQAEGKGKFAVDYAEGIRITNHREWSKDAVAQVPASVN 527
Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAA 504
+ + A++ AKNAD V+V G + +V E D L LPG Q +L ++
Sbjct: 528 DQLRAQALETAKNADVVVLVLGGNEAVSREAWADNHLGDSETLDLPGPQDQLAKELIALG 587
Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
K PV +++++ +N+ K +++ Y GE+ G AIADV+FG+YNPGG+LP++
Sbjct: 588 K-PVVVILLNGRPYAVNYLAE--KAPALIEGWYLGEQTGNAIADVVFGRYNPGGKLPVSV 644
Query: 565 YEA-NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+ + I Y P R Y F D +YPFGYGLSYT F S+P+
Sbjct: 645 ARSVGQLPIYYNKKP------SARRGYLFGDTSPLYPFGYGLSYTTFDI---SAPR---- 691
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
+GT + I D K + +++V N GK+ G EVV ++
Sbjct: 692 --------------LGT-----PTIGIAD------KASVEVDVTNTGKVAGDEVVQLFVH 726
Query: 684 PPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ T + ++ +ERV + G+ V F + L + ++ ++ G TI G
Sbjct: 727 DDEASVTRPVIELKRFERVTLKPGEKKTVRFELT-PDDLALWNSQMRHVVEPGTFTISSG 785
>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 788
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 238/811 (29%), Positives = 373/811 (45%), Gaps = 163/811 (20%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y + K P +R +DL+ +MTL EK QM L YG R+ LP W +E G+ I
Sbjct: 43 YENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQNNWKTEVWKDGIGNI 101
Query: 70 GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
N P H +++ +P AT F
Sbjct: 102 DEEHNGLGAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYF 161
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L +IG+ + EA A+ G T +SP +++ +DPRWGR +ET
Sbjct: 162 PAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRCVETY 215
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + LQ + A KH+A Y + + +
Sbjct: 216 GEDPYLVGELGKQMITSLQK--------------YNLVATPKHFAVYSIPIGGRDGKTRT 261
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ +I PF M E VM SYN +G P L + +R +W F G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ I HK + DT ED +A+ + AGL++ T+FT AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIELAAEAARQG 378
GKI++ +D + + + RLG FD Y+ GK + + +H ++ EAARQ
Sbjct: 376 DNGKISQETLDKRVAEILRIKFRLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYA--- 432
+VLLKN+ LPL+ +I+++A++GP+AN +I CRY +P+ Y
Sbjct: 434 LVLLKNETNLLPLSK-SIRSIAVIGPNANEQTQLI-------CRYGPANAPIKTVYQGIK 485
Query: 433 ----YSKVINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGL 473
+++VI Y GC DI+ + ++ AI AAK A+ V+V G
Sbjct: 486 ELLPHTEVI-YKKGC-DIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGG 543
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
+ E + R L LPG Q EL+ V K P+ LV++ A IN+A + I +IL
Sbjct: 544 NELTVREDRSRTSLNLPGRQEELLKAVCATGK-PIILVMLDGRASSINYAAAH--IPAIL 600
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGE G+A+A+ +FG YNPGGRL +T + + +IP+ + P +P ++ T +
Sbjct: 601 HAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSSTSVY- 657
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
+YPFG+GLSYT F Y + SP ++ D C+
Sbjct: 658 --GALYPFGHGLSYTTFTYSDLHISPSHQGVQGDIHVSCK-------------------- 695
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
++N GK+ G EVV +Y + T+ K + G+ER+ + AG+ V
Sbjct: 696 -------------IKNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTV 742
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + + L + D N + G+ +++G
Sbjct: 743 HFRLRP-QDLGLWDKNMNFRVELGSFKVMLG 772
>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
Length = 349
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 156/356 (43%), Positives = 219/356 (61%), Gaps = 18/356 (5%)
Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADA 466
+ T MIGNY G C YT+P+ G Y++ I+ A GC D+ C N + AA AA+ ADA
Sbjct: 1 DVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADA 59
Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
TV+V GLD S+EAE DR LLLPG Q EL+++VA A++GP LV+MS G +D+ FAKN+
Sbjct: 60 TVLVMGLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKND 119
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVN 583
P+I +I+WVGYPG+ GG AIADV+FG NPGG+LP+TWY NYV +P T M +R P
Sbjct: 120 PRISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPAR 179
Query: 584 NFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD---INYTVGT 640
+PGRTY+F+ GPVV+PFG GLSYT F + +A P V + L + + ++ V
Sbjct: 180 GYPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKAVRV 239
Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYER 700
+ C A+ DV ++V+N G MDG+ ++V++ PP KQ++G+ +
Sbjct: 240 SHADCNALSPLDV---------HVDVKNTGSMDGTHTLLVFTSPPDGKWASSKQLMGFHK 290
Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
+ IAAG +V ++ CK L +VD + G H + +G+ VS LQ NL
Sbjct: 291 IHIAAGSEKRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVS--LQTNL 344
>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 874
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 174/446 (39%), Positives = 242/446 (54%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRL +P YEWWSE LHG++ G
Sbjct: 24 QRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNGY----------- 72
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N +L +++G VSTEARA +N AGLT WSP
Sbjct: 73 -----ATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 127
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + P I A KH
Sbjct: 128 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLNHPRTI-ATPKH 178
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ +DM+ T+ F + +G SVMC+YN ++G P CA
Sbjct: 179 IAVHSGPE---PGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACA 235
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 236 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 294
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+++G++ EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 295 RELGT-AIERGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 353
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKN LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 354 LQAAAESIVLLKNTATTLPLKAGT--RLAVIGPNADALAALEANYQGTSATPITPLLGLR 411
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
+ ++ + YA G A + MIP
Sbjct: 412 QHFGAQQVRYAQG-APLAAGVPGMIP 436
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/299 (31%), Positives = 137/299 (45%), Gaps = 52/299 (17%)
Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
+DA V GL VE E G DR D+ LP Q L+ + A A+ P+ +V+
Sbjct: 602 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 660
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
MS AV +N+AK N W YPG+ GG AIA + G NPGGRLP+T+Y +
Sbjct: 661 MSGSAVALNWAKANADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLP 718
Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
Y S ++ GRTY++F G ++PFGYGLSYT F Y R
Sbjct: 719 AYVSYDMK------GRTYRYFKGEPLFPFGYGLSYTSFAYDAP----------------R 756
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
T+ P V N G G EV VY + P + +
Sbjct: 757 LSTRTLQAGNP----------------LQVTTTVRNTGSRAGDEVAQVYLQYPDRPQSPL 800
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
+ ++G++RV + G+ ++ FT++A ++L VD + + +G + + VG G G P
Sbjct: 801 RSLVGFQRVHLKPGEQRELTFTLDA-RALSDVDRSGQRAVEAGEYRVFVGGGQPGTGAP 858
>gi|94969405|ref|YP_591453.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
gi|94551455|gb|ABF41379.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
Length = 902
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 173/445 (38%), Positives = 241/445 (54%), Gaps = 46/445 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y DA P ERA DLV+RMTL EK Q+ D A +PRLG+P Y+ WSEALHGV+ G
Sbjct: 38 YRDATRPANERAHDLVQRMTLDEKAAQLEDWATAIPRLGVPDYQTWSEALHGVARAGH-- 95
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL----GNA--- 126
AT FP I A+++ + K++G +STEAR YN GN
Sbjct: 96 --------------ATVFPQAIGMAATWDTEMVKQMGDVISTEARGKYNEAQREGNHRIF 141
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDP++ G+ I ++ G+Q D+
Sbjct: 142 WGLTFWSPNINIFRDPRWGRGQETYGEDPFLTGKMGIAFIDGVQG---------PDAAHP 192
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K A KH+A + + R FD +V+ +D++ET++ F V +G V SVMC+YN
Sbjct: 193 KAVATSKHFAVHSGPE---SLRHGFDVKVSPRDLEETYLAAFRATVTDGHVKSVMCAYNA 249
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
V+G+ CA+ LL + ++ W F G++VSDC +I + + HK D A A L AG
Sbjct: 250 VDGMGACANKMLLEEHLKQAWGFKGFVVSDCGAIMDVTQGHKNAPDIVH-AAAISLAAGT 308
Query: 306 DLDCGDYYTNFTM--GAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
DL C + F AV++G + E + + LY LG FD GS + +
Sbjct: 309 DLSCSIWEPGFNTLADAVRKGLVTEDMVTRAAERLYAARFELGMFDEPGSNPNDKIDMSQ 368
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ + +H A +AA + IVLLKND G LPL N KT+A++GP A ++ GNY G P
Sbjct: 369 VASEEHRAEALKAAEESIVLLKND-GLLPLK--NAKTIAVIGPTAELLASLEGNYNGQPV 425
Query: 422 RYTSPMDGFYAY--SKVINYAPGCA 444
R +P+DG ++ + YA G +
Sbjct: 426 RPVTPLDGIVKQFGAENVRYAQGSS 450
Score = 108 bits (271), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 131/263 (49%), Gaps = 45/263 (17%)
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
G DR + LP Q +L+ + A K PV +V +S AV +N+A N +IL YPG
Sbjct: 659 GGDRTSIDLPATQEKLLEALGAAGK-PVVVVNLSGSAVALNWA--NQHAGAILQAWYPGV 715
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVY 599
EGG AIA + G+ NP GRLP+T+Y A+ +P +T ++ RTY+++ G ++
Sbjct: 716 EGGTAIAKTLAGESNPAGRLPVTFY-ASVQDLPAFTEYAMK------NRTYRYYAGKPLW 768
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
FG+GLSY+ FKY ++KL A+ +D K
Sbjct: 769 GFGFGLSYSTFKYG--------EVKL--------------------ASTSVDAGKS---- 796
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
T + V N ++ G EVV Y K P G ++G++RV + G+S +V ++ +
Sbjct: 797 LTATVTVTNTSQVAGDEVVEAYLKTPQKGGPS-HSLVGFQRVPLNPGESREVAIEVSP-R 854
Query: 720 SLKIVDNAANSLLASGAHTILVG 742
SL VD++ + +G + + +G
Sbjct: 855 SLSAVDDSGKRSILAGEYRLSIG 877
>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 889
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 177/446 (39%), Positives = 241/446 (54%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 39 QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------- 87
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N AGLT WSP
Sbjct: 88 -----ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 142
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D P I A KH
Sbjct: 143 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLDHPRTI-ATPKH 193
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ +D++ T+ F + +G SVMC+YN ++G P CA
Sbjct: 194 IAVHSGPE---PGRHSFDVDVSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACA 250
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 251 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAS-LKAGHDLNCGYAY 309
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+++G++ EA +D SL L+ RLG + + Y LG +I N + LA
Sbjct: 310 RALGT-AIERGEVDEALLDQSLVRLFAARYRLGELEAPHKDPYATLGAKDIDNTANRALA 368
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA Q IVLLKND LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 369 LKAAAQSIVLLKNDANTLPLKAG--ARLAVIGPNADALAALEANYQGTSSTPVTPLLGLR 426
Query: 432 AYSKV--INYAPGCADIVCQNNSMIP 455
V ++YA G A + MIP
Sbjct: 427 QRFGVHQVSYAQG-APLAAGVPGMIP 451
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/284 (31%), Positives = 137/284 (48%), Gaps = 49/284 (17%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR D+ LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 631 VEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTH 689
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA ++ G NPGGRLP+T+Y + PY S ++
Sbjct: 690 ADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK------ 741
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT F Y Q G+
Sbjct: 742 GRTYRYFKGEPLFPFGYGLSYTSFAYGAP--------------QLSSTTLQAGS------ 781
Query: 647 AVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
T Q+ V N G G EV VY + P + ++ ++G++RV +
Sbjct: 782 --------------TLQVTTTVRNTGTRAGDEVAQVYLQYPDRPQSPLRSLVGFQRVHLK 827
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
G+ + FT++A ++L VD + +G +T+ VG G G
Sbjct: 828 PGEQRTLTFTLDA-RALSDVDRTGQRAVEAGDYTLFVGGGQRGT 870
>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.G.57.14]
gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.N.15.51]
gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.G.57.14]
gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.N.15.51]
Length = 754
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 212/691 (30%), Positives = 348/691 (50%), Gaps = 106/691 (15%)
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
+T+FP I +++N L I + ++ R + G+ SP ++V +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
ET GEDPY+V + Y+ GLQ D+ ++ A KH+AA+ N
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ----------GDN---QLVATAKHFAAHGFPEGGRNI 201
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
+ H +R +++ETF+ PFE+ V G V S+M +Y+ ++GIP +P+LL +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
W F G +VSD D I+ + H+ ++ E A+ L++G+D++ D Y + A+++
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHRVASNKMEAAIL-ALESGVDIEFPTIDCYGEPLVNALKE 316
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
G + E+ ID ++ + + RLG D +N + + + ELA + AR+ IVLLK
Sbjct: 317 GLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIVLLK 376
Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
N+N LPL + N+ +A++GP+AN + M+G+Y T + + G
Sbjct: 377 NENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGE 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
SKV+ YA GC DI ++ AI+ A+ AD + + +GL LS
Sbjct: 436 SKVL-YAKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFKKY 493
Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
V EG DR L LPG Q EL+ ++ K P+ LV+++ + ++ N +K+++
Sbjct: 494 QAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIE 550
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGEEGG AIADVIFG YNPGGRLPIT+ + + + Y P ++F R Y
Sbjct: 551 AWFPGEEGGNAIADVIFGDYNPGGRLPITFPMDTGQIPLYYNRKP----SSF--RPYVML 604
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
++ FGYGLSYTQF+Y + +PK +G N
Sbjct: 605 RSSPLFTFGYGLSYTQFEYSNLEVTPKE-----------------IGPNS---------- 637
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
I+V+N+GKM+G +VV +Y SK +K++ G+ ++ + G+ +V
Sbjct: 638 ------NIAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRV 691
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + ++L D+ ++ G + +L+G
Sbjct: 692 KFIL-PTEALAFYDSFMRLVVEKGEYQLLIG 721
>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
HVE10/4]
Length = 754
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 213/691 (30%), Positives = 348/691 (50%), Gaps = 106/691 (15%)
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
+T+FP I +++N L I + ++ R + G+ SP ++V +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
ET GEDPY+V + Y+ GLQ D+ ++ A KH+AA+ N
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ----------GDN---QLVATAKHFAAHGFPEGGRNI 201
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
+ H +R +++ETF+ PFE+ V G V S+M +Y+ ++GIP +P+LL +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
W F G +VSD D I+ + H+ ++ E A+ L++G+D++ D Y+ + A+ +
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHRVASNKMEAAIL-ALESGVDIEFPTIDCYSEPLVNALTE 316
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
G + E+ ID ++ + + RLG D +N + + + ELA + AR+ IVLLK
Sbjct: 317 GLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIVLLK 376
Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
N+N LPL + N+ +A++GP+AN + M+G+Y T + + G
Sbjct: 377 NENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKKVGE 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
SKV+ YA GC DI ++ AI+ A+ AD + V +GL LS
Sbjct: 436 SKVL-YAKGC-DIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFKKY 493
Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
V EG DR L LPG Q EL+ ++ K P+ LV+++ + ++ N +K+++
Sbjct: 494 QAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAVIE 550
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGEEGG AIADVIFG YNPGGRLPIT+ + + + Y P ++F R Y
Sbjct: 551 AWFPGEEGGNAIADVIFGDYNPGGRLPITFPMDTGQIPLYYNRKP----SSF--RPYVML 604
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
++ FGYGLSYTQF+Y + +PK +G N
Sbjct: 605 RSSPLFTFGYGLSYTQFEYSNLEVTPKE-----------------IGPNS---------- 637
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
I+V+N+GKM+G +VV +Y SK +K++ G+ ++ + G+ +V
Sbjct: 638 ------NIAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRV 691
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + ++L D+ ++ G + +L+G
Sbjct: 692 KFIL-PTEALAFYDSFMRLVVEKGEYQLLIG 721
>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
Length = 888
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 174/446 (39%), Positives = 240/446 (53%), Gaps = 46/446 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRLG+P YEWW+E LHG++ G
Sbjct: 38 QRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNGY----------- 86
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L +++G VSTEARA +N AGLT WSP
Sbjct: 87 -----ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 141
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D P I A KH
Sbjct: 142 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLEHPRTI-ATPKH 192
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ +D++ T+ F + EG SVMC+YN ++G P CA
Sbjct: 193 IAVHSGPE---PGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACA 249
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 250 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGTAY 308
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELA 371
A+++G++ EA +D SL L+ RLG +Y LG +I N + LA
Sbjct: 309 RALGT-AIERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALA 367
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA + IVLLKN N LPL LA++GP+A+A A+ NY+GT + +P+ G
Sbjct: 368 LQAAAESIVLLKNANATLPLKAST--RLAVIGPNADALAALEANYQGTSSQPVTPLLGLR 425
Query: 432 AY--SKVINYAPGCADIVCQNNSMIP 455
++ + YA G A + MIP
Sbjct: 426 QRFGAQQVRYAQG-APLAAGVPGMIP 450
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/301 (31%), Positives = 139/301 (46%), Gaps = 52/301 (17%)
Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
+DA V GL VE E G DR D+ LP Q L+ + A A+ P+ +V+
Sbjct: 616 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVL 674
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
MS AV +N+AK + W YPG+ GG AIA + G NPGGRLP+T+Y +
Sbjct: 675 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLP 732
Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
PY S ++ GRTY++F G ++PFGYGLSYT+F Y+ R
Sbjct: 733 PYVSYDMK------GRTYRYFKGEALFPFGYGLSYTRFAYETP----------------R 770
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
T+ P V N G+ G EV VY + P + +
Sbjct: 771 LSATTLQAGSP----------------LQVTTTVRNTGERAGDEVAQVYLQYPERPQSPL 814
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPL 752
+ ++G++RV + G+ + FT++A ++L VD + +G + + VG G P
Sbjct: 815 RSLVGFQRVHLQPGEQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGGQPDTGAPG 873
Query: 753 Q 753
Q
Sbjct: 874 Q 874
>gi|380509734|ref|ZP_09853141.1| beta-glucosidase-related glycosidase [Xanthomonas sacchari NCPPB
4393]
Length = 883
Score = 290 bits (741), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 179/447 (40%), Positives = 246/447 (55%), Gaps = 45/447 (10%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
P+ D + RA LV +MTL EK QM + A + RLG+P Y+WW+EALHGV+ G+
Sbjct: 23 PWQDTSASFEARAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEALHGVARAGQ- 81
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------G 124
AT FP I A+F+ L ++ T+S EARA ++
Sbjct: 82 ---------------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLREGAHGR 126
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFWSPNIN+ RDPRWGR ET GEDPY+ R + +V+GLQ + V
Sbjct: 127 YQGLTFWSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVQGLQGDDPVYR-------- 178
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ A KH+A + DR HFD+R +++D+ +T++ FE V EG V +VM +YN
Sbjct: 179 -KLDATAKHFAVHSGPE---ADRHHFDARPSKRDLYDTYLPAFEALVKEGKVDAVMGAYN 234
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
RV G A LL +R DW F GY+VSDC +I I + H L ++E A A +K G
Sbjct: 235 RVYGESASASQFLLRDVLRRDWGFTGYVVSDCWAIVDIWK-HHHLAPSREAAAALAVKNG 293
Query: 305 LDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN 364
+L+CG Y AV+QG I EA+ID ++ L+ MRLG FD + + N
Sbjct: 294 TELECGQEYATLP-AAVRQGLIGEAEIDDAVTRLFTARMRLGMFDPPERVRWARIPASVN 352
Query: 365 --PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
P H LA +AA++ +VLLKND G LPL+ +K +A+VGP A+ T A++GNY GTP
Sbjct: 353 QVPAHDALALQAAQESLVLLKND-GVLPLSR-TLKRIAVVGPTADDTMALLGNYFGTPAA 410
Query: 423 YTSPMDGFYAYSKVI--NYAPGCADIV 447
+ + G +K I YA G D+V
Sbjct: 411 PVTILQGIRDAAKGIEVRYARGV-DLV 436
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 150/298 (50%), Gaps = 53/298 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+DAA+NAD V V GL VE E G DR DL LP Q L+ + K
Sbjct: 608 ALDAARNADVVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDLRLPAPQRALLEALHATGK- 666
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV +V+ A+ +++A+ + + +IL YPG+ GG A+ +FG+ NP GRLP+T+Y
Sbjct: 667 PVVMVLTGGSALAVDWAQAH--LPAILMSWYPGQRGGTAVGQALFGEVNPAGRLPVTFYR 724
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
A+ ++P GRTY++F G +YPFG+GLSYT+F Y KL
Sbjct: 725 AD------QALPAFDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYG----------KLH 768
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
D A + DD + K Q+EV N GK G EV +Y +
Sbjct: 769 LD-----------------APRIADDGRLK-----LQVEVANTGKRAGDEVAQLYVRRLA 806
Query: 687 IAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
A +Q + G++RV +A G+ + F ++A ++L+ D+A + ++ +G + + +G
Sbjct: 807 AAPGDAQQTLRGFQRVHLAPGERRTLTFELDAQQALRQYDDARGAYVVPAGRYEVRIG 864
>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
Length = 868
Score = 289 bits (740), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 178/469 (37%), Positives = 241/469 (51%), Gaps = 54/469 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S K D+P+ + LP ER DL+ R+T EK+ QM ++ + RLG+P Y+WW+EAL
Sbjct: 18 SCSEKQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ GR AT FP I A+F+++ + VS EARA Y+
Sbjct: 78 HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQ 121
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ + + RGLQ
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D K AC KHYA + W +R FD T +D+ ET++ FE V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGD 229
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
V VMC+YNR G P C+ KLL +R W + I+SDC +I K +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
E A A + G DL+CG Y A+ GKI+E D+D SLR L LG FD
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348
Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ Y + + + +P+HI A + AR+ IVLLKN N LPL+ NIK +A+VGP+A
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407
Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
+ + NY G P + + ++G +KV N Y GC AD V +
Sbjct: 408 STMLWANYNGFPTKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 51/297 (17%)
Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
A K+AD V V G+ ++ V+AEG DR ++ +P Q E++ + K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
PV V+ + A+ +N+ N + +IL Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
++ +P + GRTY++ +YPFGYGLSYT F YK A KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
KD+ + +N+ T ++ N GKMDG EV +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+K + ++RV + AG V + DN + G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852
>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
Length = 868
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 178/469 (37%), Positives = 241/469 (51%), Gaps = 54/469 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S K D+P+ + LP ER DL+ R+T EK+ QM ++ + RLG+P Y+WW+EAL
Sbjct: 18 SCSEKQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ GR AT FP I A+F+++ + VS EARA Y+
Sbjct: 78 HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQ 121
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ + + RGLQ
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D K AC KHYA + W +R FD T +D+ ET++ FE V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGD 229
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
V VMC+YNR G P C+ KLL +R W + I+SDC +I K +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
E A A + G DL+CG Y A+ GKI+E D+D SLR L LG FD
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348
Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ Y + + + +P+HI A + AR+ IVLLKN N LPL+ NIK +A+VGP+A
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407
Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
+ + NY G P + + ++G +KV N Y GC AD V +
Sbjct: 408 STMLWANYNGFPTKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 139/297 (46%), Gaps = 51/297 (17%)
Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
A K+AD V V G+ ++ V+AEG DR ++ +P Q E++ + K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
PV V+ + A+ +N+ N + +IL Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
++ +P + GRTY++ +YPFGYGLSYT F YK A KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
KD+ + +N+ T ++ N GKMDG EV +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+K + ++RV + AG + V + DN + G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSAQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852
>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
Length = 868
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 176/469 (37%), Positives = 246/469 (52%), Gaps = 54/469 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S + D+P+ + LP ER DL++R+T EKV QM + + RLG+P Y+WW+EAL
Sbjct: 18 SCSQRQEDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEAL 77
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ G+ AT FP I A+F++ + VS EARA Y+
Sbjct: 78 HGVARAGK----------------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQ 121
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ R + V+GLQ
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQG----- 176
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D + K AC KHYA + W +R FD VT +D+ +T++ FE V EG+
Sbjct: 177 ----DDPKYFKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGN 229
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE------SHKFL 289
V VMC+YNR G P C+ KLL +R W + I+SDC +I E H+
Sbjct: 230 VQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERTPRHETH 289
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
D E A A + G DL+CG+ Y + A++ GKI+E D+D SLR L LG FD
Sbjct: 290 PDA-ESASADAVLNGTDLECGNSYRAL-VKALKDGKISENDLDVSLRRLLKGRFELGMFD 347
Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
+ Y + N + +P+H+ A E A + +VLLKN N LPL + I+ +A+VGP+A
Sbjct: 348 PDERVPYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAA 406
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
+ + NY G P + ++G ++VI Y GC AD V Q+
Sbjct: 407 DSTMLWANYNGFPTHTVTILEGIRNKVPDTEVI-YELGCNHAADFVIQD 454
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 101/315 (32%), Positives = 142/315 (45%), Gaps = 55/315 (17%)
Query: 442 GCADIVCQNNSMIPA--AIDAAKNADATVIV---------AGLDLSVEAEG---KDRVDL 487
G AD+ Q + P A AAK DA VIV G ++ V EG DR ++
Sbjct: 579 GSADLNFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNI 638
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
LP Q E++ K A PV V+ + A+ +N+ + N I +IL Y G+E G A+A
Sbjct: 639 ELPKVQQEMV-KALKATGKPVVYVLCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVA 695
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
D++FG YNP GRLP+T+Y++ +P + GRTY++ +YPFGYGLSY
Sbjct: 696 DILFGDYNPSGRLPVTFYKS------IDQLPDFEDYSMKGRTYRYMTETPLYPFGYGLSY 749
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T F Y+ A K K+ KDQ T ++
Sbjct: 750 TNFAYRNA---KLSSGKIAKDQSV-----------------------------TLTFDIA 777
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N GKMDG E+ +Y K P IK + + RV + AG S +V + DN
Sbjct: 778 NTGKMDGDEIAQIYIKNPNDPEGPIKALKAFLRVHVKAGDSQEVNIELAPETFHSFNDNT 837
Query: 728 ANSLLASGAHTILVG 742
+ G + IL G
Sbjct: 838 QTMEVRPGKYQILYG 852
>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
REY15A]
Length = 754
Score = 289 bits (739), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 213/691 (30%), Positives = 348/691 (50%), Gaps = 106/691 (15%)
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
+T+FP I +++N L I + ++AR + G+ SP ++V +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQARLV------GVNQCLSPVLDVCKDPRWGRC 154
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
ET GEDPY+V + Y+ GLQ D+ ++ A KH+AA+ N
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ----------GDN---QLVATAKHFAAHGFPEGGRNI 201
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
+ H +R +++ETF+ PFE+ V G V S+M +Y+ ++GIP +P+LL +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
W F G +VSD D I+ + H+ ++ E A+ L++G+D++ D Y+ + A+ +
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHRVASNKMEAAIL-ALESGVDIEFPTIDCYSEPLVNALTE 316
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
G + E+ ID ++ + + RLG D +N + + + ELA + AR+ IVLLK
Sbjct: 317 GLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIVLLK 376
Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
N+N LPL + N+ +A++GP+AN + M+G+Y T + + G
Sbjct: 377 NENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKKVGE 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
SKV+ YA GC DI ++ AI+ A+ AD + V +GL LS
Sbjct: 436 SKVL-YAKGC-DIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFKKY 493
Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
V EG DR L LPG Q EL+ ++ K P+ LV+++ + ++ N +K+++
Sbjct: 494 QAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSPIIN--YVKAVIE 550
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGEEGG AIADVIFG YNP GRLPIT+ + + + Y P ++F R Y
Sbjct: 551 AWFPGEEGGNAIADVIFGDYNPSGRLPITFPMDTGQIPLYYNRKP----SSF--RPYVML 604
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
++ FGYGLSYTQF+Y + +PK +G N
Sbjct: 605 RSSPLFTFGYGLSYTQFEYSNLEVTPKE-----------------IGPNS---------- 637
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
I+V+N+GKM+G +VV +Y SK +K++ G+ ++ + G+ +V
Sbjct: 638 ------NIAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRV 691
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + ++L D+ ++ G + +L+G
Sbjct: 692 KFIL-PTEALAFYDSFMRLVVEKGEYQLLIG 721
>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
Length = 868
Score = 289 bits (739), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 177/469 (37%), Positives = 243/469 (51%), Gaps = 54/469 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S K D+P+ + +LP ER DL+ R+T EK+ QM ++ + RLG+P Y+WW+EAL
Sbjct: 18 SCSEKQQDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ GR AT FP I A+F+++ + VS EARA Y+
Sbjct: 78 HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQ 121
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ + + RGLQ
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D K AC KHYA + W +R F++ T +D+ ET++ FE V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGD 229
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
V VMC+YNR G P C+ KLL +R W + I+SDC +I K +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
E A A + G DL+CG Y A+ GKI+E D+D SLR L LG FD
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348
Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ Y + + + +P+HI A + AR+ IVLLKN N LPL+ NIK +A+VGP+A
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407
Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
+ + NY G P + + ++G +KV N Y GC AD V +
Sbjct: 408 STMLWANYNGFPSKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 51/297 (17%)
Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
A K+AD V V G+ ++ V+AEG DR ++ +P Q E++ + K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
PV V+ + A+ +N+ N + +IL Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
++ +P + GRTY++ +YPFGYGLSYT F YK A KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
KD+ + +N+ T ++ N GKMDG EV +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+K + ++RV + AG V + DN + G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852
>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
Length = 868
Score = 289 bits (739), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 177/469 (37%), Positives = 243/469 (51%), Gaps = 54/469 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S K D+P+ + +LP ER DL+ R+T EK+ QM ++ + RLG+P Y+WW+EAL
Sbjct: 18 SCSEKQQDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ GR AT FP I A+F+++ + VS EARA Y+
Sbjct: 78 HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQ 121
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ + + RGLQ
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D K AC KHYA + W +R F++ T +D+ ET++ FE V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGD 229
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
V VMC+YNR G P C+ KLL +R W + I+SDC +I K +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
E A A + G DL+CG Y A+ GKI+E D+D SLR L LG FD
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348
Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ Y + + + +P+HI A + AR+ IVLLKN N LPL+ NIK +A+VGP+A
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407
Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
+ + NY G P + + ++G +KV N Y GC AD V +
Sbjct: 408 STMLWANYNGFPSKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 51/297 (17%)
Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
A K+AD V V G+ ++ V+AEG DR ++ +P Q E++ + K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
PV V+ + A+ +N+ N + +IL Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
++ +P + GRTY++ +YPFGYGLSYT F YK A KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
KD+ + +N+ T ++ N GKMDG EV +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+K + ++RV + AG V + DN + G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852
>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
8503]
gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
Length = 868
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 177/469 (37%), Positives = 242/469 (51%), Gaps = 54/469 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S K D+P+ + LP ER DL+ R+T EK+ QM ++ + RLG+P Y+WW+EAL
Sbjct: 18 SCSEKQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ GR AT FP I A+F+++ + VS EARA Y+
Sbjct: 78 HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQ 121
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ + + RGLQ
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D K AC KHYA + W +R F++ T +D+ ET++ FE V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGD 229
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
V VMC+YNR G P C+ KLL +R W + I+SDC +I K +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
E A A + G DL+CG Y A+ GKI+E D+D SLR L LG FD
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348
Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ Y + + + +P+HI A + AR+ IVLLKN N LPL+ NIK +A+VGP+A
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407
Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
+ + NY G P + + ++G +KV N Y GC AD V +
Sbjct: 408 STMLWANYNGFPTKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 51/297 (17%)
Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
A K+AD V V G+ ++ V+AEG DR ++ +P Q E++ + K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
PV V+ + A+ +N+ N + +IL Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
++ +P + GRTY++ +YPFGYGLSYT F YK A KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
KD+ + +N+ T ++ N GKMDG EV +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+K + ++RV + AG V + DN + G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVRPGKYQILYG 852
>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
Length = 868
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 177/469 (37%), Positives = 242/469 (51%), Gaps = 54/469 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S K D+P+ + LP ER DL+ R+T EK+ QM ++ + RLG+P Y+WW+EAL
Sbjct: 18 SCSEKQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEAL 77
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ GR AT FP I A+F+++ + VS EARA Y+
Sbjct: 78 HGVARAGR----------------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQ 121
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ + + RGLQ
Sbjct: 122 YQKDKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG----- 176
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D K AC KHYA + W +R F++ T +D+ ET++ FE V EGD
Sbjct: 177 ----DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGD 229
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHK-----FLN 290
V VMC+YNR G P C+ KLL +R W + I+SDC +I K +
Sbjct: 230 VQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETH 289
Query: 291 DTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG 350
E A A + G DL+CG Y A+ GKI+E D+D SLR L LG FD
Sbjct: 290 PDAESASADAVLNGTDLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDP 348
Query: 351 SPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ Y + + + +P+HI A + AR+ IVLLKN N LPL+ NIK +A+VGP+A
Sbjct: 349 DERVPYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAAD 407
Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKVIN----YAPGC---ADIVCQN 450
+ + NY G P + + ++G +KV N Y GC AD V +
Sbjct: 408 STMLWANYNGFPSKTVTIVEGI--RNKVPNAEVIYELGCNHTADFVVTD 454
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 51/297 (17%)
Query: 456 AAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAK 505
A K+AD V V G+ ++ V+AEG DR ++ +P Q E++ + K
Sbjct: 597 ATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALVATGK 656
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
PV V+ + A+ +N+ N + +IL Y G+EGG A+ADV+FG YNP GRLPIT+Y
Sbjct: 657 -PVVYVVCTGSALALNW--ENDHVNAILNAWYGGQEGGTAVADVLFGDYNPAGRLPITFY 713
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
++ +P + GRTY++ +YPFGYGLSYT F YK A KL
Sbjct: 714 KS------VDQLPDFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYKNA--------KL 759
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
KD+ + +N+ T ++ N GKMDG EV +Y K P
Sbjct: 760 SKDK--------IASNE----------------SVTLSFDIANTGKMDGDEVAQIYIKNP 795
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+K + ++RV + AG V + DN + G + IL G
Sbjct: 796 NDPAGPLKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEIRPGKYQILYG 852
>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 762
Score = 288 bits (736), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 217/695 (31%), Positives = 346/695 (49%), Gaps = 110/695 (15%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP I ++F L +++ + +A N+ + GL SP +++ RDPRWGR
Sbjct: 106 GATVFPQAIGMASTFEPELIRRVSDVIRQHMKAA-NV-HQGL---SPVLDIPRDPRWGRT 160
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
ET GEDPY+V R A YV+GLQ G ++ I A KH+ AY + EG
Sbjct: 161 EETFGEDPYLVSRMATEYVKGLQ---GEDWREG-------IVATVKHFTAYGIS--EGA- 207
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
R ++V E++++E F+ PFE+ + EG S+M +Y+ ++G+P + LL + +R +W
Sbjct: 208 RNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEW 267
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGAVQQG 324
F GY+VSD +++ + HK D KE AV L+AG+D++ D Y + AV++G
Sbjct: 268 GFKGYVVSDYIAVRMLENFHKVARDAKEAAVL-ALEAGIDIELPSVDCYGEPLIQAVKEG 326
Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN------ICNPQHIELAAEAARQG 378
I+E I+ S+ + LG FD NL K+ P+ +L+ E AR+
Sbjct: 327 LISEEVINASVERVLRAKFMLGLFD-----DNLEKDPKKVYEVFDKPEFRDLSREVARRS 381
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-------------------EGT 419
IVLLKND G LPL+ N+K +A++GP+A+ + + G+Y E
Sbjct: 382 IVLLKND-GTLPLSK-NLKKVAVIGPNADNPRNLHGDYSYTAHIPSIAEGLEGVKVEEKC 439
Query: 420 PCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---- 472
R S ++G + + YA GC DI+ + AI+ AK AD + V G
Sbjct: 440 VVRTVSILEGIRNKVSPETEVLYAKGC-DIISDSKDGFAEAIEMAKEADVIIAVMGEESG 498
Query: 473 -LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
+ EG DR L L G Q +L+ ++ K P+ LV+++ + + N + +
Sbjct: 499 LFHRGISGEGNDRTTLELFGVQRDLLKELHKLGK-PIVLVLINGRPQALKWEHEN--LNA 555
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYK 591
IL YPGEEGG A+ADVIFG YNP G+LPI+ + A +IP N P
Sbjct: 556 ILEAWYPGEEGGNAVADVIFGDYNPSGKLPIS-FPAVTGQIPVY------YNRKPSAFSD 608
Query: 592 FFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
+ D +YPFG+GLSYT F+Y D+K+ ++
Sbjct: 609 YIDESAKPLYPFGHGLSYTTFEYS--------DLKISPEK-------------------- 640
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFIAAGQ 707
++ ++ + FT ++N G DG EVV +Y +A +K++ G++++++ G+
Sbjct: 641 VNSLEKVEISFT----IKNTGNRDGEEVVQLYIHDQ-VASLERPVKELKGFKKIYLKPGE 695
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
S +V FT+ + L D ++ G +++G
Sbjct: 696 SKRVTFTLYP-EQLAFYDEFMRFIVEKGVFEVMIG 729
>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
CL03T12C01]
gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
CL03T12C01]
Length = 788
Score = 288 bits (736), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 244/812 (30%), Positives = 378/812 (46%), Gaps = 165/812 (20%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y + K P ER +DL+ +MTL EK QM L YG R+ LP W +E G+ I
Sbjct: 43 YENPKAPLEERVQDLLSQMTLEEKSCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGNI 101
Query: 70 GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
N P H D++ +P AT F
Sbjct: 102 DEEHNGLGTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATYF 161
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L +IG+ + EA A+ G T +SP +++ +DPRWGR +ET
Sbjct: 162 PAQCGQGATWNKELIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRCVETY 215
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG G Q + ++ H + A KH+A Y + + +
Sbjct: 216 GEDPYLVGEL------GKQMITSLQKH--------NLVATPKHFAVYSIPVGGRDGKTRT 261
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ +I PF M E VM SYN +G P L + +R +W F G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ I HK N T ED +A+ + AGL++ T+FT AV
Sbjct: 322 YVVSDSEAVEFISSKHKVAN-TYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIELAAEAARQG 378
GKI++ +D + + V LG FD Y+ GK + + +H ++ EAARQ
Sbjct: 376 ADGKISQETLDKRVAEILRVKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYA--- 432
+VLLKN+ LPL+ +++++A++GP+A+ +I CRY +P+ Y
Sbjct: 434 LVLLKNEMNLLPLSK-SLRSIAVIGPNADERTQLI-------CRYGPANAPIKTVYQGIK 485
Query: 433 ----YSKVINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGL 473
+++VI Y GC DI+ + ++ AI AAK A+ V+V G
Sbjct: 486 ERLPHTEVI-YRKGC-DIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGG 543
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
+ E + R L LPG Q EL+ V K PV LV++ A IN+A + + +IL
Sbjct: 544 NELTVREDRSRTSLNLPGRQEELLKAVCATGK-PVVLVLLDGRASSINYAAAH--VPAIL 600
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGE G+A+A+ +FG YNPGGRL +T + + +IP+ + P +P ++ T +
Sbjct: 601 HAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSSTSVY- 657
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQC--RDINYTVGTNKPPCAAVLID 651
V+YPFG+GLSYT F Y D+K+ +Q DIN
Sbjct: 658 --GVLYPFGHGLSYTTFSYG--------DLKISPLRQGVQGDIN---------------- 691
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAK 710
+ CK ++N GK+ G EVV +Y + T+ K + G+ER+ + AG+
Sbjct: 692 -ISCK---------IKNTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQM 741
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V F + + L + D N + G +++G
Sbjct: 742 VHFRLRP-QDLGLWDKNMNFRVEPGKFKVMIG 772
>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
Length = 798
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 233/799 (29%), Positives = 370/799 (46%), Gaps = 139/799 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y D+ P R ++L+ +MTL EK QM L YG R+ LP W +E G+ I
Sbjct: 53 YEDSYAPLEARVQNLLSQMTLEEKSCQMATL-YGSGRVLNDALPSDNWKNEVWKDGIGNI 111
Query: 70 GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
N P H ++ +P AT F
Sbjct: 112 DEEHNGLGSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATYF 171
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPG 151
P A++N+ L +IG+ EAR LG + +SP +++ +DPRWGR +ET G
Sbjct: 172 PAQCGQGATWNKELIAQIGEA---EAREASVLGYTNI--YSPILDIAQDPRWGRCVETYG 226
Query: 152 EDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD 211
EDPY G+ + LQ K+ + KH+A Y + + + D
Sbjct: 227 EDPYHAGQMGKQMILSLQKN--------------KLVSTPKHFAVYSIPVGGRDGKTRTD 272
Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGY 271
V ++M+ ++ PF + +E VM SYN +G P L + +R +W F GY
Sbjct: 273 PHVAPREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGY 332
Query: 272 IVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAVQ 322
+VSD ++++ I H+ N EDAVA+ + AGL++ T+FT AV+
Sbjct: 333 VVSDSEAVEFISTKHQVANGY-EDAVAQAVNAGLNIR-----THFTPPADFILPLRSAVK 386
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEAARQGIVL 381
+GKI++ ++ + + V LG FD + I + P+H +LA EAARQ +VL
Sbjct: 387 KGKISQETLNQRVAEILRVKFWLGLFDNPYRGDEKRAGQIVHSPEHQQLALEAARQSLVL 446
Query: 382 LKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVINY 439
LKN++ LPL+ +I+++A++GP+A+ + +I Y T+ +G + Y
Sbjct: 447 LKNEHQTLPLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVY 505
Query: 440 APGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
GC DI+ Q M+ AI+AAK A+ TV+V G + E + R
Sbjct: 506 KKGC-DIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSR 564
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
L LPG Q EL+ K+ K PV LV++ A INFA + + +I+ +PGE GG+
Sbjct: 565 TSLDLPGRQKELLKKICQLGK-PVVLVMIDGRASSINFAATH--VPAIIHAWFPGEFGGQ 621
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
AIA+ +FG YNPGGRL +T + + +IP+ + P +P ++ T + +YPFG+G
Sbjct: 622 AIAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSETSVY---GALYPFGHG 676
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSYT F+Y D+ + +Q N ++
Sbjct: 677 LSYTTFQYS--------DLAISPSKQGVQGNISISCT----------------------- 705
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLKI 723
++N+G+ +G EVV +Y + + T QV+ G+ER+ + S V F + + L I
Sbjct: 706 -IKNIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QELGI 763
Query: 724 VDNAANSLLASGAHTILVG 742
D N + G +++G
Sbjct: 764 WDKQMNFTVEPGMFKVMIG 782
>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
L.S.2.15]
Length = 754
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 211/691 (30%), Positives = 347/691 (50%), Gaps = 106/691 (15%)
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
+T+FP I +++N L I + ++ R + G+ SP ++V +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
ET GEDPY+V + Y+ GLQ D+ ++ A KH+AA+ N
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ----------GDN---QLVATAKHFAAHGFPEGGRNI 201
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
+ H +R +++ETF+ PFE+ V G V S+M +Y+ ++GIP +P+LL +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
W F G +VSD D I+ + H+ ++ E A+ L++G+D++ D Y + A+++
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHRVASNKMEAAIL-ALESGVDIEFPTIDCYGEPLVNALKE 316
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
G + E+ ID ++ + + RLG D +N + + + ELA + AR+ IVLLK
Sbjct: 317 GLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIVLLK 376
Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
N+N LPL + N+ +A++GP+AN + M+G+Y T + + G
Sbjct: 377 NENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGE 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
SKV+ YA GC DI ++ AI+ A+ AD + + +GL LS
Sbjct: 436 SKVL-YAKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSKEEFKKY 493
Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
V EG DR L LPG Q EL+ ++ K P+ LV+++ + ++ N +K+++
Sbjct: 494 QAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIE 550
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGEEGG AIADVIFG YNP GRLPIT+ + + + Y P ++F R Y
Sbjct: 551 AWFPGEEGGNAIADVIFGDYNPSGRLPITFPMDTGQIPLYYNRKP----SSF--RPYVML 604
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
++ FGYGLSYTQF+Y + +PK +G N
Sbjct: 605 RSSPLFTFGYGLSYTQFEYSNLEVTPKE-----------------IGPNS---------- 637
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
I+V+N+GKM+G +VV +Y SK +K++ G+ ++ + G+ +V
Sbjct: 638 ------NIAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRV 691
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + ++L D+ ++ G + +L+G
Sbjct: 692 KFIL-PTEALAFYDSFMRLVVEKGEYQLLIG 721
>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
L.D.8.5]
Length = 754
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 211/691 (30%), Positives = 347/691 (50%), Gaps = 106/691 (15%)
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
+T+FP I +++N L I + ++ R + G+ SP ++V +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
ET GEDPY+V + Y+ GLQ D+ ++ A KH+AA+ N
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQ----------GDN---QLVATAKHFAAHGFPEGGRNI 201
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
+ H +R +++ETF+ PFE+ V G V S+M +Y+ ++GIP +P+LL +R +
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
W F G +VSD D I+ + H+ ++ E A+ L++G+D++ D Y + A+++
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHRVASNKMEAAIL-ALESGVDIEFPTIDCYGEPLVNALKE 316
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
G + E+ ID ++ + + RLG D +N + + + ELA + AR+ IVLLK
Sbjct: 317 GLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIVLLK 376
Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-------PCRYTSPMDGF---YAY 433
N+N LPL + N+ +A++GP+AN + M+G+Y T + + G
Sbjct: 377 NENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKKVGE 435
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV----AGLDLS------------- 476
SKV+ YA GC DI ++ AI+ A+ AD + + +GL LS
Sbjct: 436 SKVL-YAKGC-DIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFKKY 493
Query: 477 --VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
V EG DR L LPG Q EL+ ++ K P+ LV+++ + ++ N +K+++
Sbjct: 494 QAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLINGRPLVLSSIIN--YVKAVIE 550
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGEEGG AIADVIFG YNP GRLPIT+ + + + Y P ++F R Y
Sbjct: 551 AWFPGEEGGNAIADVIFGDYNPSGRLPITFPMDTGQIPLYYNRKP----SSF--RPYVML 604
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
++ FGYGLSYTQF+Y + +PK +G N
Sbjct: 605 RSSPLFTFGYGLSYTQFEYSNLEVTPKE-----------------IGPNS---------- 637
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
I+V+N+GKM+G +VV +Y SK +K++ G+ ++ + G+ +V
Sbjct: 638 ------NIAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHLKPGEKRRV 691
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + ++L D+ ++ G + +L+G
Sbjct: 692 KFIL-PTEALAFYDSFMRLVVEKGEYQLLIG 721
>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
CL02T00C15]
gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
CL02T12C06]
gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
CL02T00C15]
gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
CL02T12C06]
Length = 788
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 242/811 (29%), Positives = 376/811 (46%), Gaps = 163/811 (20%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y + K P ER +DL+ +MTL EK QM L YG R+ LP W +E G+ I
Sbjct: 43 YENPKAPLEERVQDLLSQMTLEEKSCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGNI 101
Query: 70 GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
N P H D++ +P AT F
Sbjct: 102 DEEHNGLGTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATYF 161
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPG 151
P A++N+ L +IG+ + EA A+ +SP +++ +DPRWGR +ET G
Sbjct: 162 PAQCGQGATWNKELIARIGEVEAKEAVAL-----EYTNIYSPILDIAQDPRWGRCVETYG 216
Query: 152 EDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD 211
EDPY+VG G Q + ++ H + A KH+A Y + + + D
Sbjct: 217 EDPYLVGEL------GKQMITSLQKH--------NLVATPKHFAVYSIPVGGRDGKTRTD 262
Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGY 271
V ++M+ +I PF M E VM SYN +G P L + +R +W F GY
Sbjct: 263 PHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGY 322
Query: 272 IVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAVQ 322
+VSD ++++ I HK N T ED +A+ + AGL++ T+FT AV
Sbjct: 323 VVSDSEAVEFISSKHKVAN-TYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAVA 376
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIELAAEAARQGI 379
GKI++ +D + + V LG FD Y+ GK + + +H ++ EAARQ +
Sbjct: 377 DGKISQETLDKRVAEILRVKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQSL 434
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYA---- 432
VLLKN+ LPL+ +++++A++GP+A+ +I CRY +P+ Y
Sbjct: 435 VLLKNEMNLLPLSK-SLRSIAVIGPNADERTQLI-------CRYGPANAPIKTVYQGIKE 486
Query: 433 ---YSKVINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLD 474
+++VI Y GC DI+ + ++ AI AAK A+ V+V G +
Sbjct: 487 RLPHTEVI-YRKGC-DIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGN 544
Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
E + R L LPG Q EL+ V K PV LV++ A IN+A + + +IL
Sbjct: 545 ELTVREDRSRTSLNLPGRQEELLKAVCATGK-PVVLVLLDGRASSINYAAAH--VPAILH 601
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD 594
+PGE G+A+A+ +FG YNPGGRL +T + + +IP+ + P +P ++ T +
Sbjct: 602 AWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSSTSVY-- 657
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQC--RDINYTVGTNKPPCAAVLIDD 652
V+YPFG+GLSYT F Y D+K+ +Q DIN
Sbjct: 658 -GVLYPFGHGLSYTTFSYG--------DLKISPLRQGVQGDIN----------------- 691
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
+ CK ++N GK+ G EVV +Y + T+ K + G+ER+ + AG+ V
Sbjct: 692 ISCK---------IKNTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMV 742
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + + L + D N + G +++G
Sbjct: 743 HFRLRP-QDLGLWDKNMNFRVEPGKFKVMIG 772
>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
43184]
gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
Length = 868
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 173/469 (36%), Positives = 247/469 (52%), Gaps = 54/469 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S + D+P+ + LP ER DL++R+T EK+ QM + + RLG+P Y+WW+EAL
Sbjct: 18 SCSQRQEDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEAL 77
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ G+ AT FP I A+F++ + VS EARA Y+
Sbjct: 78 HGVARAGK----------------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQ 121
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ R + V+GLQ
Sbjct: 122 YQKNKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQG----- 176
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D + K AC KHYA + W +R FD VT +D+ +T++ FE V +G+
Sbjct: 177 ----DDPKYFKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGN 229
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE------SHKFL 289
V VMC+YNR G P C+ KLL +R W + I+SDC +I + H+
Sbjct: 230 VQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERTPRHETH 289
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
D E A A + G DL+CG+ Y + A+++GKI+E D+D SLR L LG FD
Sbjct: 290 PDA-ESASADAVLNGTDLECGNSYKAL-IKALKEGKISENDLDVSLRRLLKGRFELGMFD 347
Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
+ Y + N + +P+H+ A E A + +VLLKN N LPL + I+ +A+VGP+A
Sbjct: 348 PDERVPYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAA 406
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
+ + NY G P + ++G ++VI Y GC AD V Q+
Sbjct: 407 DSTMLWANYNGFPTHTVTILEGIRNKVPDTEVI-YELGCNHAADFVIQD 454
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 101/315 (32%), Positives = 141/315 (44%), Gaps = 55/315 (17%)
Query: 442 GCADIVCQNNSMIPA--AIDAAKNADATVIV---------AGLDLSVEAEG---KDRVDL 487
G AD+ Q + P A AAK DA VIV G ++ V EG DR ++
Sbjct: 579 GSADLNFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNI 638
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
+P Q E++ K A PV V+ + A+ +N+ N I +IL Y G+E G A+A
Sbjct: 639 EIPKVQQEMV-KALKATGKPVVYVLCTGSALALNWEDAN--IDAILNAWYGGQEAGTAVA 695
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
D++FG YNP GRLP+T+Y++ +P + GRTY++ +YPFGYGLSY
Sbjct: 696 DILFGDYNPSGRLPVTFYKS------IDQLPDFEDYSMKGRTYRYMTETPLYPFGYGLSY 749
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T F Y+ A K K+ KDQ T ++
Sbjct: 750 TNFAYRNA---KLSSGKITKDQSV-----------------------------TLTFDIA 777
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N GKMDG EV +Y K P IK + + RV + AG S +V + DN
Sbjct: 778 NTGKMDGDEVAQIYIKNPNDPEGPIKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNT 837
Query: 728 ANSLLASGAHTILVG 742
+ G + IL G
Sbjct: 838 QTMEVRPGKYQILYG 852
>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
Length = 863
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 168/430 (39%), Positives = 231/430 (53%), Gaps = 41/430 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
P+ + LP ER +DLV R+TL EKV M D + VPRLG+ Y WW+EALHGV G
Sbjct: 22 LPFNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIKQYNWWNEALHGVGRAGL 81
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
AT FP I A+F++ K++ + VS EARA Y+
Sbjct: 82 ----------------ATVFPQAIGMAATFDDCAVKQVFECVSDEARAKYHHSENKDGSE 125
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLTFW+PN+N+ RDPRWGR ET GEDPY+ R + VRGLQ S+S+
Sbjct: 126 RYRGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQG--------PSESK 177
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KHYA + W +R FD ++ +D+ ET++ F+ V +G V VMC+
Sbjct: 178 YDKLHACAKHYALHSGPEW---NRHRFDVENISPRDLWETYLPAFKALVQQGGVKEVMCA 234
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTKEDAVARVL 301
YNR G P C +LL +R +W F G +VSDC +I ++ H + TKE AVA +
Sbjct: 235 YNRFEGEPCCGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHSTKESAVAAAV 294
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
KAG DLDCG Y + AV++G I E ID SL L LG D + ++
Sbjct: 295 KAGTDLDCGVDYQSLEK-AVEKGIITEKQIDVSLSRLLKARFELGLMDEEHLVSWSDIPY 353
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ + +H A E AR+ + LLKN NG LPL+ + + ++GP+AN + M GNY G
Sbjct: 354 TVVDSEKHRAKALEVARKSMTLLKNKNGTLPLSK-HCGKIVVIGPNANDSIMMWGNYNGF 412
Query: 420 PCRYTSPMDG 429
P + ++G
Sbjct: 413 PSHTVTILEG 422
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/296 (28%), Positives = 133/296 (44%), Gaps = 53/296 (17%)
Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
+A+A V V G+ VE E G DR + LP Q +L+ ++ K P+ L++
Sbjct: 599 DAEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELYKTGK-PIILIL 657
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
S A I + +I+ YPG+ GG A+ADV+FG YNP GRLP+T+Y+
Sbjct: 658 CSGSA--IGLSAEVDLADAIIQAWYPGQAGGTAVADVLFGDYNPAGRLPVTFYKTT---- 711
Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
+P N GRTY++F G ++PFGYGLSYT F+ A K +
Sbjct: 712 --EQLPDFEDYNMQGRTYRYFKGEALFPFGYGLSYTSFEIGKAQLSK------------K 757
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
I+ N + ++N G+ DG EV+ VY + +
Sbjct: 758 RIHANESVN--------------------LDLWIKNTGERDGEEVIQVYIRKLKDKEGPL 797
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVGEGVGG 747
K + ++RV + +G+ ++ + S + D N + + +G + +L G G
Sbjct: 798 KTLRAFKRVHVKSGEKKQISIHL-PNDSFEFFDPEFNVMRVMAGEYEVLYGTSSEG 852
>gi|1749831|emb|CAA91219.1| beta-xylo-glucosidase [Thermoanaerobacter brockii]
Length = 730
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 206/698 (29%), Positives = 341/698 (48%), Gaps = 100/698 (14%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP I +++N + +K+ + + +A+ +P +++ RDPRWGR
Sbjct: 57 GATIFPQTIGVASTWNNEIVEKMASVIREQMKAV-----GARQALAPLLDITRDPRWGRT 111
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
ET GEDPY+V R ++Y+RGLQ ++S I A KH+ Y N EG
Sbjct: 112 EETFGEDPYLVMRMGVSYIRGLQ----------TESLKEGIVATGKHFVGYG--NSEGGM 159
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
+ + + E++++E F+ PFE V E +SS+M Y+ ++G+P KLLN +R DW
Sbjct: 160 NWA-PAHIPERELREVFLYPFEAAVKEAKLSSIMPGYHELDGVPCHKSKKLLNDILRKDW 218
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQG 324
F G +VSD +I + E H +D K+ A L+AG+D++ DYY ++ G
Sbjct: 219 GFEGIVVSDYFAISQLYEYHHVTSD-KKGAAKLALEAGVDVELPSTDYYGLPLRELIESG 277
Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
+I ++ +++ + + LG F+ + + ELA + A++ IVLLKN
Sbjct: 278 EIDIDFVNEAVKRVLKIKFELGLFENPYINEEKAVEIFDTNEQRELAYKIAQESIVLLKN 337
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR-------------YTSPM---- 427
+N LPL ++K++A++GP+A++ + MIG+Y PC + +P+
Sbjct: 338 ENNLLPLKK-DLKSIAVIGPNADSIRNMIGDY-AYPCHIESLLEMRETDNVFNTPLPESL 395
Query: 428 ---DGFYAYSKVIN-------------YAPGCADIVCQNNSMIPAAIDAAKNADATVIVA 471
D + V+ YA GC D++ + A++ AK AD V+V
Sbjct: 396 EAKDIYVPIVTVLQGIKAKVSSNTEVLYAKGC-DVLNNSKDGFKEAVEIAKQADVAVVVV 454
Query: 472 G-----LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
G D E +DR DL LPG Q ELI + + PV +V+++ + I++
Sbjct: 455 GDKSGLTDGCTSGESRDRADLNLPGVQEELIKAIYETGT-PVIVVLINGRPMSISWIAE- 512
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNF 585
KI +I+ PGEEGGRA+ADVIFG YNPGG+LPI+ ++ + + Y P +++
Sbjct: 513 -KIPAIIEAWLPGEEGGRAVADVIFGDYNPGGKLPISIPQSVGQLPVYYYHKPSGGRSHW 571
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
G + P +YPFGYGLSYT+F Y N + K
Sbjct: 572 KGDYVELSTKP-LYPFGYGLSYTEFSY---------------------TNLNISNRK--- 606
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIA 704
V +D ++++N G + G EVV +Y ++ T +K++ G++R+ +
Sbjct: 607 -------VSLRDRMVEISVDIKNTGTLKGDEVVQLYIHQEALSVTRPVKELKGFKRITLD 659
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
AG+ V F + + + L D ++ G +++G
Sbjct: 660 AGEEKTVIFKL-SIEQLGFYDENMEYVVEPGRVDVMIG 696
>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
Length = 863
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 171/448 (38%), Positives = 242/448 (54%), Gaps = 46/448 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +PY D KL +RA DL++R+TL EKV M + + +PRLG+ YEWW+EALHGV+
Sbjct: 24 SKYPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
G AT FP I ASFN+ L ++ VS EARA N
Sbjct: 84 GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQ 127
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLT W+PN+N+ RDPRWGR ET GEDPY+ GR + VRGLQ E EY
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
K+ AC KH+A + W +R F++ + +D+ ET++ F+ V + V VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
C+YNR G P C +LL Q +R DW F G +V+DC +I + K ++T DA
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
A + +G DL+CG + + T AV++G I+E I+TS++ L LG + + + N+
Sbjct: 295 ADAVLSGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNSTHPWSNI 353
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ I P+H ELA + A + +VLL+N+N LPLN +A++GP+AN + GNY
Sbjct: 354 PFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411
Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGC 443
G P + ++G A I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVC 439
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/296 (32%), Positives = 138/296 (46%), Gaps = 53/296 (17%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
++ ++AD + G+ +E E G DR ++ LP Q E++ + K
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
V V S A+ I N +IL YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAIVPETQN--CDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
Y ++ GRTY+F +YPFGYGLSYT+F Y A+ +S KL K
Sbjct: 711 MQQLPDYEDYSMK------GRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLTK 761
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
++ A+L I V N+G+ DG EVV VY P
Sbjct: 762 GEK----------------AILT-------------IPVSNVGQRDGEEVVQVYICRPDD 792
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
K + G++RV IA G++ V + S + D A N++ +G + IL G
Sbjct: 793 KEGPQKTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYG 847
>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
Length = 866
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 173/469 (36%), Positives = 247/469 (52%), Gaps = 54/469 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S + D+P+ + LP ER DL++R+T EK+ QM + + RLG+P Y+WW+EAL
Sbjct: 16 SCSQRQEDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEAL 75
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ G+ AT FP I A+F++ + VS EARA Y+
Sbjct: 76 HGVARAGK----------------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQ 119
Query: 124 GNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ R + V+GLQ
Sbjct: 120 YQKNKEYDRYKGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQG----- 174
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
D + K AC KHYA + W +R FD VT +D+ +T++ FE V +G+
Sbjct: 175 ----DDPKYFKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGN 227
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE------SHKFL 289
V VMC+YNR G P C+ KLL +R W + I+SDC +I + H+
Sbjct: 228 VQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERTPRHETH 287
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
D E A A + G DL+CG+ Y + A+++GKI+E D+D SLR L LG FD
Sbjct: 288 PDA-ESASADAVLNGTDLECGNSYKAL-IKALKEGKISENDLDVSLRRLLKGRFELGMFD 345
Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
+ Y + N + +P+H+ A E A + +VLLKN N LPL + I+ +A+VGP+A
Sbjct: 346 PDERVPYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAA 404
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
+ + NY G P + ++G ++VI Y GC AD V Q+
Sbjct: 405 DSTMLWANYNGFPTHTVTILEGIRNKVPDTEVI-YELGCNHAADFVIQD 452
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 101/315 (32%), Positives = 141/315 (44%), Gaps = 55/315 (17%)
Query: 442 GCADIVCQNNSMIPA--AIDAAKNADATVIV---------AGLDLSVEAEG---KDRVDL 487
G AD+ Q + P A AAK DA VIV G ++ V EG DR ++
Sbjct: 577 GSADLNFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNI 636
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
+P Q E++ K A PV V+ + A+ +N+ N I +IL Y G+E G A+A
Sbjct: 637 EIPKVQQEMV-KALKATGKPVVYVLCTGSALALNWEDAN--IDAILNAWYGGQEAGTAVA 693
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
D++FG YNP GRLP+T+Y++ +P + GRTY++ +YPFGYGLSY
Sbjct: 694 DILFGDYNPSGRLPVTFYKS------IDQLPDFEDYSMKGRTYRYMTETPLYPFGYGLSY 747
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T F Y+ A K K+ KDQ T ++
Sbjct: 748 TNFAYRNA---KLSSGKITKDQSV-----------------------------TLTFDIA 775
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNA 727
N GKMDG EV +Y K P IK + + RV + AG S +V + DN
Sbjct: 776 NTGKMDGDEVAQIYIKNPNDPEGPIKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNT 835
Query: 728 ANSLLASGAHTILVG 742
+ G + IL G
Sbjct: 836 QTMEVRPGKYQILYG 850
>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
Length = 863
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 171/448 (38%), Positives = 242/448 (54%), Gaps = 46/448 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +PY D KL +RA DL++R+TL EKV M + + +PRLG+ YEWW+EALHGV+
Sbjct: 24 SKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
G AT FP I ASFN+ L ++ VS EARA N
Sbjct: 84 GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQ 127
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLT W+PN+N+ RDPRWGR ET GEDPY+ GR + VRGLQ E EY
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
K+ AC KH+A + W +R F++ + +D+ ET++ F+ V + V VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
C+YNR G P C +LL Q +R DW F G +V+DC +I + K ++T DA
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
A + +G DL+CG + + T AV++G I+E I+TS++ L LG + + + N+
Sbjct: 295 ADAVLSGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNSTHPWSNI 353
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ I P+H ELA + A + +VLL+N+N LPLN +A++GP+AN + GNY
Sbjct: 354 PFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411
Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGC 443
G P + ++G A I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVC 439
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/296 (32%), Positives = 138/296 (46%), Gaps = 53/296 (17%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
++ ++AD + G+ +E E G DR ++ LP Q E++ + K
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
V V S A+ I N +IL YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAIVPETQN--CDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
Y ++ GRTY+F +YPFGYGLSYT+F Y A+ +S KL K
Sbjct: 711 MQQLPDYEDYSMK------GRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLTK 761
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
++ A+L I V N+G+ DG EVV VY P
Sbjct: 762 GEK----------------AILT-------------IPVSNVGQRDGEEVVQVYICRPDD 792
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
K + G++RV IA G++ V + S + D A N++ +G + IL G
Sbjct: 793 KEGPQKTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYG 847
>gi|403253118|ref|ZP_10919422.1| xylosidase [Thermotoga sp. EMP]
gi|402811565|gb|EJX26050.1| xylosidase [Thermotoga sp. EMP]
Length = 778
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 241/811 (29%), Positives = 378/811 (46%), Gaps = 152/811 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYGVPRLG 52
Y D P R +DL+ RMTL EKV Q+G L G+ ++
Sbjct: 4 YRDPSQPIEVRVRDLLSRMTLEEKVAQLGSVWGYELIDERGKFNKEKAKELLKNGIGQIT 63
Query: 53 LP---LYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
P EA V+ I R R P H + G T+FP I +
Sbjct: 64 RPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMAS 123
Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
+++ L +K+ + + R + + GL +P ++V RDPRWGR ET GE PY+V R
Sbjct: 124 TWDPDLIEKMTTAIREDMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVAR 178
Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDM 219
++YV+GLQ G + + + A KH+A Y EG + + + E++
Sbjct: 179 MGVSYVKGLQ---GEDIKKG-------VVATVKHFAGYSAS--EGGKNWA-PTNIPEREF 225
Query: 220 QETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSI 279
+E F+ PFE V E +V SVM SY+ ++G+P A+ KLL +R DW F G +VSD ++
Sbjct: 226 KEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVSDYFAV 285
Query: 280 QTIVESHKFLNDTKEDAVARVLKAGLDLD-----CGDYYTNFTMGAVQQGKIAEADIDTS 334
+ + + H+ + K +A L+AG+D++ C Y + V++G I+EA ID +
Sbjct: 286 KVLEDYHRIARN-KSEAARLALEAGIDVELPKTECYQYLKDL----VEKGIISEALIDEA 340
Query: 335 LRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG 394
+ + ++ LG F+ Y + K I N H ++A E AR+ I+LLKND G LPL
Sbjct: 341 VARVLMLKFMLGLFENP--YVEVEKAKIEN--HRDIALEIARKSIILLKND-GILPLQKN 395
Query: 395 NIKTLALVGPHANATKAMIGNYE----------------GTPC----------------- 421
K +AL+GP+A + ++G+Y G P
Sbjct: 396 --KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEHM 453
Query: 422 -RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG------LD 474
S +D F YA GC ++ ++ S AI+ AK +D ++V G LD
Sbjct: 454 KSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTLD 512
Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
+ E +D +L LPG Q EL+ +VA K PV LV+++ + + K+ +IL
Sbjct: 513 CTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPYSLKNVVD--KVNAILQ 568
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
V PGE GGRAI D+I+GK NP G+LPI++ A + + + P +++ G
Sbjct: 569 VWLPGEAGGRAIVDIIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGDYVDES 628
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
P ++PFG+GLSYT+F+Y + PK V PP V+I
Sbjct: 629 TKP-LFPFGHGLSYTKFEYSNLRIEPKEV---------------------PPAGEVVI-- 664
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKV 711
+++VEN+G DG EVV +Y + T +K++ G++RV + A + V
Sbjct: 665 ----------KVDVENIGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKTV 714
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F ++ L D ++ G ++VG
Sbjct: 715 VFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744
>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
Length = 801
Score = 286 bits (731), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 233/799 (29%), Positives = 370/799 (46%), Gaps = 139/799 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y D+ P R ++L+ +MTL EK QM L YG R+ LP W +E G+ I
Sbjct: 56 YEDSCAPLEVRVQNLLSQMTLEEKSCQMATL-YGSGRVLNDALPSDNWKNEVWKDGIGNI 114
Query: 70 GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
N P H ++ +P AT F
Sbjct: 115 DEEHNGLGSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATYF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPG 151
P A++N+ L +IG+ EAR LG + +SP +++ +DPRWGR +ET G
Sbjct: 175 PAQCGQGATWNKELIAQIGEA---EAREASVLGYTNI--YSPILDIAQDPRWGRCVETYG 229
Query: 152 EDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD 211
EDPY G+ + LQ K+ + KH+A Y + + + D
Sbjct: 230 EDPYHAGQMGKQMILSLQKN--------------KLVSTPKHFAVYSIPVGGRDGKTRTD 275
Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGY 271
V ++M+ ++ PF + +E VM SYN +G P L + +R +W F GY
Sbjct: 276 PHVAPREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGY 335
Query: 272 IVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAVQ 322
+VSD ++++ I H+ N EDAVA+ + AGL++ T+FT AV+
Sbjct: 336 VVSDSEAVEFISTKHQVANGY-EDAVAQAVNAGLNIR-----THFTPPADFILPLRSAVK 389
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEAARQGIVL 381
+GKI++ ++ + + V LG FD + I + P+H +LA EAARQ +VL
Sbjct: 390 KGKISQETLNQRVAEILRVKFWLGLFDNPYRGDEKRAGQIVHSPEHQQLALEAARQSLVL 449
Query: 382 LKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVINY 439
LKN++ LPL+ +I+++A++GP+A+ + +I Y T+ +G + Y
Sbjct: 450 LKNEHQTLPLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVY 508
Query: 440 APGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
GC DI+ Q M+ AI+AAK A+ TV+V G + E + R
Sbjct: 509 KKGC-DIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSR 567
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
L LPG Q EL+ K+ K PV LV++ A INFA + + +I+ +PGE GG+
Sbjct: 568 TSLDLPGRQEELLKKICQLGK-PVVLVMIDGRASSINFAATH--VPAIIHAWFPGEFGGQ 624
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
AIA+ +FG YNPGGRL +T + + +IP+ + P +P ++ T + +YPFG+G
Sbjct: 625 AIAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSETSVY---GALYPFGHG 679
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSYT F+Y D+ + +Q N ++
Sbjct: 680 LSYTTFQYS--------DLVISPSKQGVQGNISISCT----------------------- 708
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLKI 723
++N+G+ +G EVV +Y + + T QV+ G+ER+ + S V F + + L I
Sbjct: 709 -IKNIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QELGI 766
Query: 724 VDNAANSLLASGAHTILVG 742
D N + G +++G
Sbjct: 767 WDKQMNFTVEPGMFKVMIG 785
>gi|15837447|ref|NP_298135.1| family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
gi|9105751|gb|AAF83655.1|AE003924_1 family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
Length = 882
Score = 285 bits (730), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 170/435 (39%), Positives = 234/435 (53%), Gaps = 45/435 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+ A LV +MTL EK+ Q + A +PRLG+P Y+WWSE LHG++ G
Sbjct: 32 QHAAALVAKMTLQEKITQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY----------- 80
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L + +G STEARA +NL AGLT WSP
Sbjct: 81 -----ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLAGGPGKDHPRYAGLTLWSP 135
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDPY+ G+ A++++RGLQ + P I A KH
Sbjct: 136 NINIFRDPRWGRGMETYGEDPYLTGQLAVSFIRGLQG--------NIPDHPRTI-ATPKH 186
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+A + + R FD V+ D++ T+ F + +G SVMC+YN ++G P CA
Sbjct: 187 FAVH---SGPEPGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACA 243
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +R DW F+G++VSDCD+I + H F D A A LK+G DL+CG+ Y
Sbjct: 244 SDWLLNTRLRNDWGFNGFVVSDCDAIDDMTRFHFFRQDNAS-ASAAALKSGNDLNCGNTY 302
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
+ A+ +G I EA +D +L L+ RLG Y +G +I P H LA
Sbjct: 303 RDLNQ-AIARGDIDEALLDQALIRLFAARQRLGTLQPREHDPYATIGIKHIDTPAHRALA 361
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA Q +VLLKN LPL G TLA++GP A++ A+ NY+GT +P+ G
Sbjct: 362 LQAAVQSLVLLKNSGNTLPLTPGT--TLAVLGPDADSLTALEANYQGTSSTPVTPLTGLR 419
Query: 432 AY--SKVINYAPGCA 444
+ I+YA G +
Sbjct: 420 TRFGAAKIHYAQGAS 434
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 95/295 (32%), Positives = 137/295 (46%), Gaps = 52/295 (17%)
Query: 460 AAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVT 509
A +ADA V GL VE E G DR + LP Q L+ V K P+
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665
Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
+V+MS AV +N+A+++ +IL YPG+ GG AIA + G NPGGRLP+T+Y +
Sbjct: 666 VVLMSGSAVALNWAQHH--ANAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYRSTQ 723
Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
PY S + GRTY++F G +YPFGYGLSYTQF Y+ +
Sbjct: 724 DLPPYISY------DMTGRTYRYFKGQPLYPFGYGLSYTQFTYEAPQLSTAT-------- 769
Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
+K D T V N G G EVV +Y +PP
Sbjct: 770 -----------------------LKAGD-TLTVTAHVRNTGTRAGDEVVQLYLEPPHSPQ 805
Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
++ ++G++RV + G+S + FT++ + L V + +G + + VG G
Sbjct: 806 APLRNLVGFKRVTLRPGESRLLTFTLD-TRQLSSVQQTGQRSVEAGHYHLFVGGG 859
>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
Length = 863
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 172/449 (38%), Positives = 241/449 (53%), Gaps = 46/449 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +PY D KL +RA DL++R+TL EKV M + + +PRLG+ YEWW+EALHGV+
Sbjct: 24 SKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
G AT FP I ASFN+ L ++ VS EARA N
Sbjct: 84 GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQ 127
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLT W+PN+N+ RDPRWGR ET GEDPY+ GR + VRGLQ E EY
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQGPEDAEYD---- 183
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
K+ AC KH+A + W +R F++ + +D+ ET++ F+ V + V VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
C+YNR G P C +LL Q +R DW F G +V+DC +I + K ++T DAV
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAVHAS 294
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
A + G DL+CG + + T AV++G I+E I+TS++ L LG + + + N+
Sbjct: 295 ADAVLNGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNI 353
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ I P+H ELA + A + +VLL+N N LPLN +A++GP+AN + GNY
Sbjct: 354 PYSVIDCPKHKELALKMAHESLVLLQNKNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411
Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGCA 444
G P + ++G A I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVCG 440
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/296 (32%), Positives = 135/296 (45%), Gaps = 53/296 (17%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
++ KNAD + G+ +E E G DR ++ LP Q E++ + K
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
V V S A+ I +IL YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
+P + GRTY+F +YPFGYGLSYT+F Y A+ +S KL+K
Sbjct: 711 ------IQQLPDYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLNK 761
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
+ K I V N+G+ DG EVV VY P
Sbjct: 762 GE-----------------------------KAILTIPVSNVGQRDGEEVVQVYICRPDD 792
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
K + G++RV IA G++ V + S + D A N++ SG + IL G
Sbjct: 793 KEGPQKTLRGFQRVNIAKGKTQNVSIEL-PYDSFEWFDTATNTIRPLSGTYKILYG 847
>gi|336412679|ref|ZP_08593032.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
3_8_47FAA]
gi|335942725|gb|EGN04567.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
3_8_47FAA]
Length = 735
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 222/755 (29%), Positives = 351/755 (46%), Gaps = 87/755 (11%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
Y DAK P +R DL+ RMTL EKV Q+ G VP +G +Y
Sbjct: 30 YKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89
Query: 57 --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
++ + R P +D+ T +P + S+N L ++ +
Sbjct: 90 INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G +A VRG
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +++AC KHY Y G D + + ++ Q + +T++LP+EM V G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P A+P ++ + ++ W G+IVSD +++ + ++ L TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
DA AGL++D + Y V++GK+ A +D S+R + V RLG F+
Sbjct: 311 DAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K+ PQ + +AA+ A + +VLLKNDN LPL N K +A+VGP A ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKRIAVVGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G DG A + YA GC + S A+D + +D +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGEAELRYAMGCKP-QGNDRSGFAGALDVVRWSDVVI 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G L+ E R + LP Q EL+ ++ +A K P+ LV+ + +++N + P
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVLSNGRPLELN--RMEPL 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPG 587
+IL + PG G R++A ++ G+ NP G+L IT+ Y + I Y R +
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAITFPYSTGQIPIYYNR---RKSGRWHQ 601
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
YK Y FGYGLSYT+F+Y V +P S +K
Sbjct: 602 GFYKDITSDPFYSFGYGLSYTEFQYGVV-TPSSTTVK----------------------- 637
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAG 706
+ K + ++ V N+GK DG+E V + P + T +K++ +E+ FI G
Sbjct: 638 --------RGEKLSVEVTVTNVGKRDGAETVHWFISDPYCSITRPVKELKHFEKQFIKVG 689
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
++ F ++ + L VD L +G + I V
Sbjct: 690 ETRTFRFDVDLERDLGFVDGNGKRFLEAGEYNIWV 724
>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
Length = 888
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 179/484 (36%), Positives = 251/484 (51%), Gaps = 54/484 (11%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D +LP RA DLV RMTL EK +Q+G A +PRLG+P Y WW+E LHGV+ G
Sbjct: 38 YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAGI-- 95
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-----GNA-- 126
AT FP I A+++ + + TE RA Y G+
Sbjct: 96 --------------ATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDW 141
Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLT WSPNIN+ RDPRWGR ET GEDPY+ GR + ++RGLQ D
Sbjct: 142 YRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQG---------QDPNF 192
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K A KHYA + ++R D + D+++T++ F V EG V +VMC+YN
Sbjct: 193 FKTIATAKHYAVHSGPE---SNRHREDVHPSAYDLEDTYLPAFRAAVTEGKVQAVMCAYN 249
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN-DTKEDAVARVLKA 303
V+G+P CA L++Q +R DW F G++VSDC + I T E+ + R L A
Sbjct: 250 AVDGVPACASEDLMDQRLRRDWGFSGHVVSDCGAAANIYREDSLAYVKTPEEGITRALNA 309
Query: 304 GLDLDCGDYYTNF------TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YK 355
G+DL CGDY ++ T+ AV++G + E +D +L L+ +RLG FD + +
Sbjct: 310 GMDLVCGDYRADWNTEAEATVSAVRKGMLDETVLDGALVRLFADRIRLGLFDPPAEVPFS 369
Query: 356 NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
+ P+H ++ E A+ + LLKND G LPL G + +A+VGP+A++ A+IGN
Sbjct: 370 KITAAQNDTPEHRAMSLEMAKASMTLLKND-GVLPLK-GEPRRIAVVGPNADSVDALIGN 427
Query: 416 YEGTPCRYTSPMDGFYA-YSKV-INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
Y GTP + + G A + K + YA G +V + +P DA ADA GL
Sbjct: 428 YYGTPSNPVTVLAGIRARFPKAEVVYAEGTG-LVGPASLPVP---DAVLCADAACRTKGL 483
Query: 474 DLSV 477
V
Sbjct: 484 KQEV 487
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 98/291 (33%), Positives = 141/291 (48%), Gaps = 54/291 (18%)
Query: 465 DATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
D V V GL VE E G DR L LP Q +L+ ++ K PV LV+M+
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRLHATGK-PVVLVLMN 671
Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
A+ +N+A N + +I+ YPG EGG A+A ++ G Y+P GRLP+T+Y + P+
Sbjct: 672 GSALSVNWADAN--LPAIVEAWYPGGEGGHAVAQLLAGDYSPAGRLPVTFYRSAGDLPPF 729
Query: 575 TSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS-SPKSVDIKLDKDQQCRD 633
++ GRTY++F G V+YPFGYGLSYT+F Y S +SV
Sbjct: 730 ADYAMK------GRTYRYFGGEVLYPFGYGLSYTRFSYGAPQLSARSV------------ 771
Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIK 693
D + T +V N G MDG EVV +Y PG GT I+
Sbjct: 772 ---------------------SADGEITVTTQVTNTGGMDGEEVVQLYVSHPGRDGTPIR 810
Query: 694 QVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
+ G++R+ + G++ V FT+ + L +VD N + G + VG G
Sbjct: 811 ALQGFQRIGLKRGETRPVSFTLKD-RQLSVVDAEGNRRVEPGRVEVWVGGG 860
>gi|224536087|ref|ZP_03676626.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522306|gb|EEF91411.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
DSM 14838]
Length = 791
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 229/821 (27%), Positives = 378/821 (46%), Gaps = 153/821 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P ER DL+ +MTL EK+ QM L YG R+ LP W W + +
Sbjct: 47 YEDPSAPMEERVNDLLSQMTLEEKICQMATL-YGSGRVLEDALPEEHWKQALWKDGIGNI 105
Query: 64 ----HGVSFIGRRTNSPPGTHFDSE--------------VP--------------GATSF 91
+G+ G + P H ++ +P AT F
Sbjct: 106 DEEHNGLGTFGSEYSFPYNKHVKAKHEIQRWFVEETRLGIPVDFTNEGIRGLCHDRATFF 165
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P+ +++N+ L +IG+ + EA A+ G T +SP +++ +DPRWGR +E
Sbjct: 166 PSQSGQGSTWNKELIARIGEVEAKEAIAL------GYTNIYSPILDICQDPRWGRSVECY 219
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG+ G Q ++ ++ HR + + KH+A Y + + +
Sbjct: 220 GEDPYLVGQL------GKQMIQSLQKHR--------LVSTVKHFAVYSIPVGGRDGKTRT 265
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V+ ++M+ ++ PF E VM SYN +G P + L + +R ++ F G
Sbjct: 266 DPHVSPREMRTLYLEPFRRAFCEAGALGVMSSYNDYDGEPITSSHHFLTEILRQEYGFKG 325
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ I H +++ E VA+ + AGL++ T+FT A+
Sbjct: 326 YVVSDSEAVEFITTKHHVVSNEVE-GVAQAVNAGLNIR-----THFTKPEDFVLPLRQAI 379
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEAARQGIV 380
++GK++ I++ + + + LG FD + + I + +H ++A EAARQ +V
Sbjct: 380 KEGKVSPETINSRVADILRIKFWLGLFDNPYRGDEKQEEKIVHCKEHQQVALEAARQSLV 439
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYAYSK-- 435
LLKN+N LPL +K++A++GP+AN +I CRY +P+ Y K
Sbjct: 440 LLKNENQLLPLKK-TVKSVAVIGPNANEQTQLI-------CRYGPANAPIKTVYQGIKEL 491
Query: 436 ----VINYAPGCADI--------------VCQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
+ Y GC I + M+ A+ AA+NA+ V+V G
Sbjct: 492 LPETEVVYRKGCEIIDSHFPESEILPFEKTTEEQQMLDEAVAAARNAEVVVLVLGGSELT 551
Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
E + R L LPG Q EL+ + K P LV++ A IN+A N I +IL +
Sbjct: 552 VREDRSRTSLDLPGHQQELMQAIHATGK-PTVLVLLDGRAATINYA--NQYIPAILHAWF 608
Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV 597
PGE G A+A+ +FG YNPGGRL +T + + +IP+ + P +P ++ P T +
Sbjct: 609 PGEFAGTAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDEPCETAVY---GA 663
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
+YPFGYGLSYT+F YK ++++ ++Q TV
Sbjct: 664 LYPFGYGLSYTKFSYK--------NLQITPEEQGPQGEITVSC----------------- 698
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
EV N+G G EVV +Y + T++K + G+ER+ + G++ KV F +
Sbjct: 699 -------EVTNIGDRTGDEVVQLYLRDEVSSVTTYMKVLRGFERITLNPGETKKVTFILT 751
Query: 717 ACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
+ L + D ++ G +++G + + N+
Sbjct: 752 P-QDLGLWDKNNKFVVEPGMFKVMIGAASTDIRLEGKFNIK 791
>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
Length = 861
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 172/445 (38%), Positives = 240/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H+ D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASADA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++AG DL+CG Y + AV+ G I E +ID SL+ L LG D P + + +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWAEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+ +AD + G+ S+E E G DR D+ LP Q +L+ + A K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A KL
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEA--------KLS 753
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ + N I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
+ ++RV I AG++ V + ++ + D +N++ G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDAESNTMRPLEGTYELLYG 845
>gi|170288668|ref|YP_001738906.1| glycoside hydrolase family 3 protein [Thermotoga sp. RQ2]
gi|170176171|gb|ACB09223.1| glycoside hydrolase family 3 domain protein [Thermotoga sp. RQ2]
Length = 778
Score = 285 bits (729), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 244/817 (29%), Positives = 378/817 (46%), Gaps = 162/817 (19%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYGVPRL 51
PY D P R +DL+ RMTL EKV Q+G L G+ ++
Sbjct: 3 PYRDPSQPIEVRVRDLLSRMTLEEKVAQLGSVWGYELIDERGKFSREKAKELLKNGIGQV 62
Query: 52 GLP---LYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTT 98
P EA V+ I R R P H + G T+FP I
Sbjct: 63 TRPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMA 122
Query: 99 ASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVG 158
++++ L +K+ + + R + + GL +P ++V RDPRWGR ET GE PY+V
Sbjct: 123 STWDPDLIEKMTTAIREDMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVA 177
Query: 159 RYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRV 214
R ++YV+GLQ G + + + A KH+A Y NW + +
Sbjct: 178 RMGVSYVKGLQ---GEDIKKG-------VVATVKHFAGYSASEGGKNWAPTN-------I 220
Query: 215 TEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVS 274
E++ +E F+ PFE V E +V SVM SY+ ++G+P A+ KLL +R DW F G +VS
Sbjct: 221 PEREFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVS 280
Query: 275 DCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-----CGDYYTNFTMGAVQQGKIAEA 329
D +++ + + H+ D K +A L+AG+D++ C Y + V++G I+EA
Sbjct: 281 DYFAVKVLEDYHRIARD-KSEAARLALEAGIDVELPKTECYQYLKDL----VEKGIISEA 335
Query: 330 DIDTSL-RFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA 388
ID ++ R L + M LG F+ Y + K I H ++A E AR+ I+LLKND G
Sbjct: 336 LIDEAVARVLRLKFM-LGLFENP--YVEVEKAKI--ESHRDIALEIARKSIILLKND-GI 389
Query: 389 LPLNTGNIKTLALVGPHANATKAMIGNYE----------------GTPC----------- 421
LPL+ K +AL+GP+A + ++G+Y G P
Sbjct: 390 LPLSKE--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKK 447
Query: 422 -------RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-- 472
S +D F YA GC ++ ++ S AI+ AK +D ++V G
Sbjct: 448 SIEEHMKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDK 506
Query: 473 ----LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
LD + E +D +L LPG Q EL+ +VA K PV LV+++ + + K
Sbjct: 507 SGLTLDCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPYSLKNVVD--K 562
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPG 587
+ +IL V PGE GGR+I D+I+GK NP G+LPI++ A + + + P +++ G
Sbjct: 563 VNAILQVWLPGEAGGRSIVDIIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHG 622
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
P ++PFG+GLSYT+F+Y + PK V PP
Sbjct: 623 DYVDESTKP-LFPFGHGLSYTKFEYSNLRIEPKEV---------------------PPAG 660
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAA 705
V+I +++VEN G DG EVV +Y + T +K++ G++RV + A
Sbjct: 661 EVVI------------KVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKA 708
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ V F ++ L D ++ G ++VG
Sbjct: 709 KEKKTVVFRLH-MDVLAYYDRDMKLVVEPGEFRVMVG 744
>gi|160884133|ref|ZP_02065136.1| hypothetical protein BACOVA_02110 [Bacteroides ovatus ATCC 8483]
gi|423291392|ref|ZP_17270240.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
CL02T12C04]
gi|156110475|gb|EDO12220.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392663392|gb|EIY56942.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
CL02T12C04]
Length = 735
Score = 285 bits (728), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 222/760 (29%), Positives = 357/760 (46%), Gaps = 97/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
Y DAK P +R DL+ RMTL EK+ Q+ G VP +G +Y
Sbjct: 30 YKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89
Query: 57 --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
++ + R P +D+ T +P + S+N L ++ +
Sbjct: 90 INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G +A VRG
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +++AC KHY Y G D + + ++ Q + +T++LP+EM V G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P A+P ++ + ++ W G+IVSD +++ + ++ L TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
DA AGL++D + Y + V++GK+ A +D S+R + V RLG F+
Sbjct: 311 DAARYAFNAGLEMDMMSHAYDRYLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K+ PQ + +AA+ A + +VLLKNDN LPL N K +A+VGP A ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKKIAVVGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G DG A + YA GC + S A+D A+ +D +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKP-QGNDRSGFAGALDVARWSDVVI 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G L+ E R + LP Q EL+ ++ +A K P+ LV+ + +++N + P
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVLSNGRPLELN--RMEPL 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
+IL + PG G R++A ++ G+ NP G+L +T+ PY++ +P+
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596
Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
GR ++ F + +YPFG+GLSYT+FKY GT
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
P V D K + ++ V N G DG+E V + P + T +K++ +E+
Sbjct: 631 PSATKVKRGD------KLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
FI AG++ F ++ + V+ L +G + ILV
Sbjct: 685 FIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
Length = 861
Score = 285 bits (728), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 171/445 (38%), Positives = 237/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E Y
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYD------ 181
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 182 --KLHACAKHFAVHSGPEW---NRHSFDAENIAPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H+ D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASAAA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++ G DL+CG Y + AV+ G I E +ID SL+ L LG D P + + +
Sbjct: 296 VRTGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWAEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT N+K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-NLK-IAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+ +AD + G+ S+E E G DR D+ LP Q +L+ + A K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A KL
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ + N I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ ++RV I AG++ V + ++ + D +N++ G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDAESNTMRPLEGTYELLYG 845
>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
Length = 777
Score = 285 bits (728), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 240/808 (29%), Positives = 374/808 (46%), Gaps = 157/808 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSE--------- 61
Y D P ER KDL+ +M + EK QM L YG R+ LP +W SE
Sbjct: 32 YEDPSAPIEERVKDLLSQMNMDEKTCQMATL-YGSGRVLADALPTEKWKSEIWKDGIGNI 90
Query: 62 -----------------------ALHGVS--FIGRRTNSPPGTHFDSEVPG-----ATSF 91
A+H + F+ P + + G AT F
Sbjct: 91 DEEHNGLGKFGSEYAFPYAKHVKAIHDIQRWFVEETRLGIPVDFTNEGIRGVCHEKATFF 150
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P +++N+ L +IG+ + EA A+ G T +SP +++ +DPRWGR +E
Sbjct: 151 PAQCGQGSTWNKELIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRAVECY 204
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG+ G Q ++ ++ H K+ A KH+A Y + +
Sbjct: 205 GEDPYLVGQL------GKQMIQSLQKH--------KLVATPKHFAVYSIPVGGRDGGTRT 250
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P + L Q +R +W F G
Sbjct: 251 DPHVAPREMRTLYLEPFRVAFQEAGALGVMSSYNDYDGEPITGSYRFLTQILRQEWGFKG 310
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD D+++ I HK + D E+AV + + AGL++ TNF+ A+
Sbjct: 311 YVVSDSDAVEFISSKHK-VADNNEEAVVQSVNAGLNV-----RTNFSSPAGFIKPLRSAI 364
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICN-PQHIELAAEAARQG 378
+GK+++A ID + + V LG FD Y+ GK + I + +H +A EAARQ
Sbjct: 365 AKGKVSQATIDQRVSEILYVKFWLGLFDNP--YRGDGKLADKIVHCKEHQAVALEAARQS 422
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYAYSK 435
IVLLKN + LPL +K++A++GP+A+ K +I CRY +P+ Y K
Sbjct: 423 IVLLKNQDNLLPLQK-TLKSVAVIGPNADEQKELI-------CRYGPSNAPIKTVYKGIK 474
Query: 436 ------VINYAPGCA--------------DIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
+ Y GC DI + ++ AI+AAK+A+ ++V G
Sbjct: 475 EALPGAKVVYKKGCEIVDPHFPESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGGSE 534
Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
E + R L LPG Q EL+ V K P LV++ A IN+AK + +IL
Sbjct: 535 VTVREERSRTSLDLPGRQEELLKAVCKLGK-PTILVMIDGRASSINYAKKY--VPAILHA 591
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
+PGE G+A+A+ IFG NPGG+L +T + + +IP+ + P +P ++ T G
Sbjct: 592 WFPGEFCGQAVAETIFGDNNPGGKLAVT-FPKSVGQIPF-AFPFKPGSDSGCGTS--VTG 647
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
++PFG+GLSYT F+Y ++K+ +QQ +G K C
Sbjct: 648 -ALFPFGHGLSYTTFEYN--------NLKISPEQQG-----VLGEVKVSCT--------- 684
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
V+N GK G EVV +Y + T++K + G+ER+ + + KV FT
Sbjct: 685 ----------VKNTGKRPGDEVVQLYLRDEISSVTTYVKILRGFERITLQPNEEKKVTFT 734
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
++ + L I D + G +++G
Sbjct: 735 LSP-QDLAIWDKNMKFQVEPGTFKVMIG 761
>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
Length = 730
Score = 285 bits (728), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 232/749 (30%), Positives = 340/749 (45%), Gaps = 127/749 (16%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV---- 66
D+P+ D LP ER DL+ MTL EKV MG G+PRL + Y SE HGV
Sbjct: 26 DYPFQDPDLPNEERIDDLITCMTLEEKVDLMG-FVPGIPRLDVK-YTRISEGYHGVAQGG 83
Query: 67 -SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--- 122
S G+R +P T FP A+++ +L ++ +TE R +Y
Sbjct: 84 PSNWGKRNPTP-----------TTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPK 132
Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
+GL +PN ++ RDPRWGR E GEDP++ G A + GL
Sbjct: 133 YQRSGLVVMAPNADLARDPRWGRTEEVYGEDPFLTGTLAAAFASGLA---------GDHP 183
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
R LK ++ KH+ L N +DRF S E+ +E + PFEM + +G S+M +
Sbjct: 184 RYLKATSLLKHF----LANSNEDDRFFSSSDFDERLWREYYAKPFEMAIRDGGARSMMAA 239
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
YN +NG P P +L + G+W G I +D + +V HK D A A +K
Sbjct: 240 YNAINGTPAHVHP-MLRDIVMGEWGLDGTICTDGGGLAHLVNQHKTYPDLPT-ATAACIK 297
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGK 359
AG++L D +T + AV+Q + EA+ID +R + + LG D P+ Y N+G
Sbjct: 298 AGINLFL-DNHTQAALDAVEQSLVTEAEIDDVIRGRIRLFLDLGLLD-PPELVPYSNIGH 355
Query: 360 NNICNPQHI----ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
P + E R+ IVLLKN+N LPL+ I ++A+VGP AN T ++
Sbjct: 356 EPGLEPWELPETHAFVREVTRKSIVLLKNENNILPLDPSKINSVAIVGPLANTT--LLDW 413
Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNN---SMIPAAIDAAKNADATVIVAG 472
Y GTP P DG Y+ N P + +N M A++ A + D ++V G
Sbjct: 414 YSGTPPYAIPPRDGIEGYA---NSGPFPSPAKFGSNWVADMSDTALEVAASRDVAIVVVG 470
Query: 473 LD---------LSVEAEGK---DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
++ +EGK DR +++L Q E I KV A P T+V++ +
Sbjct: 471 NHPESNAGWGVVTSPSEGKEAVDRQEIILQPDQEEFIQKV--YAANPNTIVVLVS----- 523
Query: 521 NF-------AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP 573
NF A+N P +I+ + + +E G A+ADV+FG YNPGG+ TW P
Sbjct: 524 NFPYAMPWAAENAP---AIVHITHASQEQGNALADVLFGDYNPGGKTVQTW--------P 572
Query: 574 YTSMPLRPVNNFP---GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQ 630
+ L P+ ++ GRTY + YPFGYGLSYT F+
Sbjct: 573 KSLDQLPPMMDYDIRRGRTYMYSQHEPQYPFGYGLSYTTFE------------------- 613
Query: 631 CRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAG 689
+ + D T ++ V N G+ DG EVV +Y + P
Sbjct: 614 --------------LSKLKAPKKLKADATATIKVRVANTGERDGDEVVQLYVRYPNSKVE 659
Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNAC 718
KQ+ G++RV + AG+S + A
Sbjct: 660 RPSKQLKGFQRVTVPAGKSVTGEIPLKAA 688
>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
Length = 861
Score = 285 bits (728), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 172/445 (38%), Positives = 239/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++AG DL+CG Y + AV+ G I E +ID SL+ L LG D P + + +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 130/297 (43%), Gaps = 53/297 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+ +AD + G+ S+E E G DR D+ LP Q +L+ + A K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A KL
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ + N I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
+ ++RV I AG++ V + + + D +N++ G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTGV-NFEWFDVESNTMRPLEGTYELLYG 845
>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
Length = 861
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 172/445 (38%), Positives = 239/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++AG DL+CG Y + AV+ G I E +ID SL+ L LG D P + + +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 83/297 (27%), Positives = 128/297 (43%), Gaps = 53/297 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+ +AD + G+ S+E E G DR D+ LP Q N + K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKA 647
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 648 GKKVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A KL
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ + N I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQCDGEEVVQVYLRRPG 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
+ ++RV I AG++ V + ++ + D +N++ G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPLEGTYELLYG 845
>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 861
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 172/445 (38%), Positives = 239/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++AG DL+CG Y + AV+ G I E +ID SL+ L LG D P + + +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 83/297 (27%), Positives = 127/297 (42%), Gaps = 53/297 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+ +AD + G+ S+E E G DR D+ LP Q N + K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKA 647
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 648 GKKVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A KL
Sbjct: 708 N------VNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEA--------KLS 753
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ + N I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ ++RV I AG++ V + + + D +N++ G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTGV-NFEWFDAESNTMRPLEGTYELLYG 845
>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 762
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 214/690 (31%), Positives = 347/690 (50%), Gaps = 100/690 (14%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP I ++F L +++ + RA N+ + GL SP +++ RDPRWGR
Sbjct: 106 GATVFPQAIGMASTFEPELIRRVSDVIRQHMRAA-NV-HQGL---SPVLDIPRDPRWGRT 160
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
ET GEDPY+V R A YV+GLQ G ++ I A KH+ AY +
Sbjct: 161 EETFGEDPYLVSRMAAEYVKGLQ---GEDWREG-------IIATVKHFTAYGISE---GA 207
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
R ++V E++++E F+ PFE+ + EG S+M +Y+ ++G+P + LL + +R +W
Sbjct: 208 RNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEW 267
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGAVQQG 324
F GY+VSD +I+ + H+ D KE AV L+AG+D++ D Y + AV++G
Sbjct: 268 GFKGYVVSDYIAIRMLENFHRVAKDAKEAAVL-ALEAGIDIELPSVDCYGEPLIQAVKEG 326
Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEAARQGIVLLK 383
I+E I+ S+ + LG FDG + +I + P+ EL+ E AR+ IVLLK
Sbjct: 327 LISEEVINASVERVLRAKFMLGLFDGDLEKDPKKVYDIFDKPEFRELSREVARRSIVLLK 386
Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNY-------------------EGTPCRYT 424
ND G LPL+ NI+T+A++GP+A+ + + G+Y E R
Sbjct: 387 ND-GILPLSK-NIRTVAVIGPNADNPRNLHGDYSYTAHIPSVSETLEGVKIPEECAVRTV 444
Query: 425 SPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----LDL 475
S ++G A ++V+ YA GC +I+ + AI+ AK AD + V G
Sbjct: 445 SILEGIKNKVSAETQVL-YAKGC-EILSDSKEGFDEAIEIAKRADVIIAVMGEESGLFHR 502
Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
+ EG DR L L G Q +L+ ++ K P+ LV+++ + + N + +IL
Sbjct: 503 GISGEGNDRTTLELFGIQRDLLRELHKLGK-PIVLVLVNGRPQALKWEHEN--LNAILEA 559
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYTSMPLRPVNNFPGRTYKFFD 594
YPGEEGG A+ADVIFG YNP G+LPI++ V + Y P ++ + K
Sbjct: 560 WYPGEEGGDAVADVIFGDYNPSGKLPISFPAVTGQVPVYYNRKP-SAFTDYVEESAK--- 615
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+YPFG+GLSYT F+Y ++K+ ++ ++ ++
Sbjct: 616 --PLYPFGHGLSYTTFEYS--------NLKIHPEK--------------------VNALE 645
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIGYERVFIAAGQSAKVG 712
+ FT ++N G +G EVV +Y +A +K++ G++++ + G+S +V
Sbjct: 646 KVEISFT----IKNTGVREGEEVVQLYVHDQ-VASLERPVKELKGFKKIHLKPGESKRVT 700
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + + L D ++ G I++G
Sbjct: 701 FILYP-EQLAFYDEFMRFVVEKGIFEIMIG 729
>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 788
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 236/811 (29%), Positives = 371/811 (45%), Gaps = 163/811 (20%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y + K P +R +DL+ +MTL EK QM L YG R+ LP W +E G+ I
Sbjct: 43 YENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGNI 101
Query: 70 GRRTNS----------PPGTHFDSE--------------VP--------------GATSF 91
N P H D++ +P AT F
Sbjct: 102 DEEHNGLGAFKSEYSFPYAKHVDAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYF 161
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L +IG+ + EA A+ G T +SP +++ +DPRWGR +ET
Sbjct: 162 PAQCGQGATWNKKLIARIGEVEAKEAVAL------GYTNIYSPILDIAQDPRWGRCVETY 215
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + LQ + A KH+A Y + + +
Sbjct: 216 GEDPYLVGELGKQMITSLQK--------------YNLVATPKHFAVYSIPIGGRDGKTRT 261
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ +I PF M E VM SYN +G P L + +R +W F G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ I HK + DT ED +A+ + AGL++ T+FT AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN---ICNPQHIELAAEAARQG 378
GKI++ +D + + + LG FD Y+ GK + + +H ++ EAARQ
Sbjct: 376 DDGKISQETLDKRVAEILRIKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---TSPMDGFYA--- 432
+VLLKN+ LPL+ +I+++A++GP+A+ +I CRY +P+ Y
Sbjct: 434 LVLLKNETHLLPLSK-SIRSIAVIGPNADEQTQLI-------CRYGPANAPIKTVYQGIK 485
Query: 433 ----YSKVINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGL 473
+++VI Y GC DI+ + ++ I AAK A+ V+V G
Sbjct: 486 ELLPHAEVI-YKKGC-DIIDPHFPESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGG 543
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
+ E + R L LPG Q EL+ V K PV LV++ A IN+A + + +IL
Sbjct: 544 NELTVREDRSRTSLNLPGRQEELLKAVCATGK-PVILVMLDGRASSINYAAAH--VPAIL 600
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
+PGE G+A+A+ +FG YNPGGRL +T + + +IP+ + P +P ++ T +
Sbjct: 601 HAWFPGEFCGQAVAEALFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDESSSTSVY- 657
Query: 594 DGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
+YPFG+GLSYT F Y + SP ++ D C+
Sbjct: 658 --GALYPFGHGLSYTTFTYSDLHISPSHQGVQGDIHVSCK-------------------- 695
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
++N GK+ G EVV +Y + T+ K + G+ER+ + AG+ V
Sbjct: 696 -------------IKNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTV 742
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F + + L + D N + G+ +++G
Sbjct: 743 HFRLRP-QDLGLWDKNMNFRVEPGSFKVMLG 772
>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 863
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 171/449 (38%), Positives = 240/449 (53%), Gaps = 46/449 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +PY D KL +RA DL++R+TL EKV M + + +PRLG+ YEWW+EALHGV+
Sbjct: 24 SKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
G AT FP I ASFN+ L ++ VS EARA N
Sbjct: 84 GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQ 127
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLT W+PN+N+ RDPRWGR ET GEDPY+ GR + VRGLQ E EY
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
K+ AC KH+A + W +R F++ + +D+ ET++ F+ V + V VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
C+YNR G P C +LL Q +R DW F G +V+DC +I + K ++T DA
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
A + G DL+CG + + T AV++G I+E I+TS++ L LG + + + N+
Sbjct: 295 ADAVLNGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNI 353
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ I P+H ELA + A + +VLL+N N LPLN +A++GP+AN + GNY
Sbjct: 354 PYSVINCPKHKELALKMAHESLVLLQNKNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411
Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGCA 444
G P + ++G A I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVCG 440
Score = 125 bits (314), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 97/296 (32%), Positives = 135/296 (45%), Gaps = 53/296 (17%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
++ KNAD + G+ +E E G DR ++ LP Q E++ + K
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
V V S A+ I +IL YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
+P + GRTY+F +YPFGYGLSYT+F Y A+ +S KL K
Sbjct: 711 ------IQQLPDYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLAK 761
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
+ K I V N+G+ DG EVV VY P
Sbjct: 762 GE-----------------------------KAILTIPVSNVGQRDGEEVVQVYICRPDD 792
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
G K + G++RV IA G++ V + S + D A N++ SG + IL G
Sbjct: 793 KGGPQKTLRGFQRVNIAKGKTQNVNIEL-PYDSFEWFDTATNTIRPLSGTYKILYG 847
>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
Length = 863
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 171/449 (38%), Positives = 240/449 (53%), Gaps = 46/449 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +PY D KL +RA DL++R+TL EKV M + + +PRLG+ YEWW+EALHGV+
Sbjct: 24 SKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
G AT FP I ASFN+ L ++ VS EARA N
Sbjct: 84 GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQ 127
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLT W+PN+N+ RDPRWGR ET GEDPY+ GR + VRGLQ E EY
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
K+ AC KH+A + W +R F++ + +D+ ET++ F+ V + V VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
C+YNR G P C +LL Q +R DW F G +V+DC +I + K ++T DA
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
A + G DL+CG + + T AV++G I+E I+TS++ L LG + + + N+
Sbjct: 295 ADAVLNGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNI 353
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ I P+H ELA + A + +VLL+N N LPLN +A++GP+AN + GNY
Sbjct: 354 PYSVINCPKHKELALKMAHESLVLLQNKNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411
Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGCA 444
G P + ++G A I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVCG 440
Score = 125 bits (315), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 97/296 (32%), Positives = 135/296 (45%), Gaps = 53/296 (17%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
++ KNAD + G+ +E E G DR ++ LP Q E++ + K
Sbjct: 594 LNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
V V S A+ I +IL YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAI--VPETQSCDAILQAWYPGQAGGTAVADVLFGNYNPAGRLPITFYKS 710
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
+P + GRTY+F +YPFGYGLSYT+F Y A+ +S KL K
Sbjct: 711 ------IQQLPDYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLAK 761
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
+ K I V N+G+ DG EVV VY P
Sbjct: 762 GE-----------------------------KAILTIPVSNVGQRDGEEVVQVYICRPDD 792
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
G K + G++RV IA G++ V + S + D A N++ SG + IL G
Sbjct: 793 KGGPQKTLRGFQRVNIAKGKTQNVNIEL-PYDSFEWFDTATNTIRPLSGTYKILYG 847
>gi|15642851|ref|NP_227892.1| xylosidase [Thermotoga maritima MSB8]
gi|418046013|ref|ZP_12684107.1| Beta-glucosidase [Thermotoga maritima MSB8]
gi|4980564|gb|AAD35170.1|AE001694_6 xylosidase [Thermotoga maritima MSB8]
gi|351675566|gb|EHA58726.1| Beta-glucosidase [Thermotoga maritima MSB8]
Length = 778
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 237/789 (30%), Positives = 366/789 (46%), Gaps = 159/789 (20%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYGVPRLG 52
Y D P R +DL+ RMTL EKV Q+G L G+ ++
Sbjct: 4 YRDPSQPIEVRVRDLLSRMTLEEKVAQLGSVWGYELIDERGKFSREKAKELLKNGIGQIT 63
Query: 53 LP---LYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
P EA V+ I R R P H + G T+FP I +
Sbjct: 64 RPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMAS 123
Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
+++ L +K+ V + R + + GL +P ++V RDPRWGR ET GE PY+V R
Sbjct: 124 TWDPDLIEKMTTAVREDMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVAR 178
Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVT 215
++YV+GLQ G + + + A KH+A Y NW + +
Sbjct: 179 MGVSYVKGLQ---GEDIKKG-------VVATVKHFAGYSASEGGKNWAPTN-------IP 221
Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
E++ +E F+ PFE V E +V SVM SY+ ++G+P A+ KLL +R DW F G +VSD
Sbjct: 222 EREFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVSD 281
Query: 276 CDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-----CGDYYTNFTMGAVQQGKIAEAD 330
+++ + + H+ D K +A L+AG+D++ C Y + V++G I+EA
Sbjct: 282 YFAVKVLEDYHRIARD-KSEAARLALEAGIDVELPKTECYQYLKDL----VEKGIISEAL 336
Query: 331 IDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALP 390
ID ++ + + LG F+ Y + K I H ++A E AR+ I+LLKND G LP
Sbjct: 337 IDEAVTRVLRLKFMLGLFENP--YVEVEKAKI--ESHRDIALEIARKSIILLKND-GILP 391
Query: 391 LNTGNIKTLALVGPHANATKAMIGNYE----------------GTPC------------- 421
L K +AL+GP+A + ++G+Y G P
Sbjct: 392 LQKN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSI 449
Query: 422 -----RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---- 472
S +D F YA GC ++ ++ S AI+ AK +D ++V G
Sbjct: 450 EEHMKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSG 508
Query: 473 --LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
LD + E +D +L LPG Q EL+ +VA K PV LV+++ + + K+
Sbjct: 509 LTLDCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPYSLKNVVD--KVN 564
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRT 589
+IL V PGE GGRAI D+I+GK NP G+LPI++ A + + + P +++ G
Sbjct: 565 AILQVWLPGEAGGRAIVDIIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGDY 624
Query: 590 YKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
P ++PFG+GLSYT+F+Y + PK V PP V
Sbjct: 625 VDESTKP-LFPFGHGLSYTKFEYSNLRIEPKEV---------------------PPAGEV 662
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQ 707
+I +++VEN+G DG EVV +Y + T +K++ G++RV + A +
Sbjct: 663 VI------------KVDVENIGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKE 710
Query: 708 SAKVGFTMN 716
V F ++
Sbjct: 711 KKTVVFRLH 719
>gi|333381842|ref|ZP_08473521.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829771|gb|EGK02417.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
BAA-286]
Length = 861
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 176/449 (39%), Positives = 242/449 (53%), Gaps = 43/449 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY DA L ERA+DL+ R+TL EKV MGD + V RLG+ + WWSEALHGV+ G
Sbjct: 21 MPYKDANLTPEERAQDLLSRLTLKEKVGLMGDNSIEVTRLGVKKFAWWSEALHGVANQG- 79
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-------- 123
G T FP I ASFN+ L + +S EARA ++
Sbjct: 80 ---------------GVTVFPEPIGMAASFNDELLYHVFDAISDEARARFHFREKKGDER 124
Query: 124 -GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
+ GL+ W+PN+N+ RDPRWGR ET GEDPY+ R I+ V GLQ + +Y
Sbjct: 125 RQDNGLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGISVVNGLQGPKDAKYK----- 179
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KHYA + W N + + + + ET++ F++ V + DVS VMC+
Sbjct: 180 ---KLLACAKHYAVHSGPEW--NRHVLNLNNLDNRHLWETYMPAFQVLVQKADVSQVMCA 234
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
Y+R + P C + LL + +R +W F +VSDC +I SHK +D AV VL
Sbjct: 235 YHRQDDDPCCGNNHLLKRILRDEWGFKRMVVSDCGAIADFYTSHKVSSDALHSAVKGVL- 293
Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
AG D++CG YT + AV +G I EADID S+ L RLG FD + + N+
Sbjct: 294 AGTDVECGFGYTYHELVDAVSRGLIYEADIDKSVLRLLTERFRLGDFDDNSIVPWANIPD 353
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
I +H LA E ARQ + LL+N N LPL++ K +A++GP+A+ K M GNY G
Sbjct: 354 TIINCKKHQALALEMARQSMTLLQNKNNILPLSSK--KKIAVIGPNADDAKLMWGNYNGI 411
Query: 420 PCRYTSPMDGFYAYS-KVINYAPGCADIV 447
P + + ++G + + K I Y GC DIV
Sbjct: 412 PVKTVTILEGIKSIAGKDIFYEKGC-DIV 439
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 118/268 (44%), Gaps = 52/268 (19%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
+D K+ D V G+ +E E G DR D+ LP Q I + A K
Sbjct: 593 VDRLKDIDVVVFAGGISGELEGEEMPIEMPGFKGGDRTDIELPASQRNCIKALKKAGK-- 650
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
+++++ I + ++IL Y G+ GG+AIA+V+FGKYNP G+LPIT+Y+
Sbjct: 651 -RVIMVNCSGSAIGLMPESESCEAILQAWYGGQSGGQAIAEVLFGKYNPSGKLPITFYKN 709
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
+P + GRTY++ + ++PFGYGLSYT F A++ S+ K +
Sbjct: 710 ------IDQLPDFEEYDMKGRTYRYLEDKPLFPFGYGLSYTTFDIGRATA-SSISAKAGE 762
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
K I V+N GK GSE V VY K
Sbjct: 763 -------------------------------KIKLVIPVKNTGKRTGSETVQVYVKKVD- 790
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTM 715
+G IK + ++R+ + S + F +
Sbjct: 791 SGGPIKTLRSFKRIELPPNVSQDLTFEL 818
>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
Length = 885
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 171/441 (38%), Positives = 233/441 (52%), Gaps = 46/441 (10%)
Query: 28 LVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPG 87
LV +MT EK+ Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 40 LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNGE---------------- 83
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSPNINVV 138
AT FP I A++N L +G STEARA +NL AGLT WSPNIN+
Sbjct: 84 ATVFPQAIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIF 143
Query: 139 RDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYD 198
RDPRWGR +ET GEDPY+ GR A+ ++ GLQ D + P I A KH A +
Sbjct: 144 RDPRWGRGMETYGEDPYLTGRLAVGFIHGLQG--------DDPAHPRTI-ATPKHLAVH- 193
Query: 199 LDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLL 258
+ R FD V+ D + T+ F + +G SVMC+YN ++G P CA L+
Sbjct: 194 --SGPEPGRHGFDVDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSLHGTPACAADWLI 251
Query: 259 NQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM 318
+ +RGDW F G++VSDCD+I + + H + D + A LKAG DL+CG Y +
Sbjct: 252 DGRVRGDWGFKGFVVSDCDAIDDMTQFHYYRPDNAGSSAA-ALKAGHDLNCGTAYRELGI 310
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEAAR 376
A +G+ EA +D SL L+ RLG + Y LG +I + H LA +AA+
Sbjct: 311 -AFDRGEADEALLDRSLVRLFAARYRLGELQPRRNDPYARLGARDIDSAAHRALALQAAQ 369
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--S 434
Q +VLLKN N LPL G LA++GP+A+A A+ NY+GT + +P+ G +
Sbjct: 370 QSLVLLKNANATLPLRPG--LRLAVLGPNADALAALEANYQGTSVQPVTPLQGLRTRFGA 427
Query: 435 KVINYAPGCADIVCQNNSMIP 455
+ YA G A + MIP
Sbjct: 428 AQVAYAQG-APLAAGVPGMIP 447
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 92/287 (32%), Positives = 139/287 (48%), Gaps = 45/287 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR DL LP Q L+ + A A+ P+ +V+MS AV +N+A+ +
Sbjct: 627 VEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAEQH 685
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA + G NPGGRLP+T+Y + PY S ++
Sbjct: 686 ADAIIAAW--YPGQSGGTAIAQALAGDINPGGRLPVTFYRSTKDLPPYVSYDMK------ 737
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYTQF Y +P+ L Q
Sbjct: 738 GRTYRYFKGEPLFPFGYGLSYTQFAY---DAPQLSTTTLQAGQ----------------- 777
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EVV VY + P A + ++ ++G++RV + G
Sbjct: 778 ------------PLQVSTTVRNTGARAGDEVVQVYLQYPQRAQSPLRSLVGFQRVHLQPG 825
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQ 753
++ + F ++A + L VD + + +G + + VG G G P Q
Sbjct: 826 EARTLSFALDA-RQLSDVDRSGQRAVEAGDYRLFVGGGQPGTGAPGQ 871
>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
Length = 861
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 169/445 (37%), Positives = 239/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ Y+WW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVALMQNASPAIPRLGIKEYDWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
AT FP I ASFN+SL ++ VS EAR + +
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFDAVSDEARVKSRIFSENGVLK 127
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E +Y
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQLGMAVVRGLQGPENGKYD------ 181
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ +T +D+ ET++ F+ V + DV VMC+
Sbjct: 182 --KLHACAKHFAVHSGPEW---NRHSFDAENITPRDLWETYLPAFKDLVQKADVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
+ +G DL+CG Y + AV+ G I E ID SL+ L LG D P + + +
Sbjct: 296 VLSGTDLECGGEYGSLA-DAVKAGLIDEKQIDVSLKRLLTARFELGEMDEQPAWAEIPAS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H +LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 TLNSKEHQDLALRMARESLVLLQNKNDILPLNT-DLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ + + Y PGC
Sbjct: 413 GHTVTLLEAVRSKLPEGQVMYEPGC 437
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A++ K+AD + G+ S+E E G DR D+ LP Q +L+ + A K
Sbjct: 591 AVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQRDLLKALKKAGK- 649
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I + ++IL YPG+ GG AI DV+FG YNP GRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPESNTCEAILQGWYPGQAGGTAIVDVLFGDYNPAGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A K+
Sbjct: 708 DA------GQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEADLSKN------ 755
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
T+G T I V N G+ DG EVV VY +
Sbjct: 756 ----------TIGDGG----------------TVTLTIPVSNAGQRDGDEVVQVYLRCMA 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ ++RV I AG++ +V + +S + D A N++ G + +L G
Sbjct: 790 DKEGPHYTLRAFKRVHIPAGETKQVTIPLT-YESFEWFDTATNTVHPLKGTYELLYG 845
>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 909
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 172/440 (39%), Positives = 235/440 (53%), Gaps = 50/440 (11%)
Query: 31 RMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATS 90
+MT EKV Q + A +PRLG+P YEWW+E LHG++ G AT
Sbjct: 67 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------------ATV 110
Query: 91 FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSPNINVVRDP 141
FP I A++N +L +++G STEARA +NL AGLT WSPNIN+ RDP
Sbjct: 111 FPQAIGLAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDP 170
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RWGR +ET GEDPY+ G+ A+ ++RGLQ D + P I A KH A +
Sbjct: 171 RWGRGMETYGEDPYLTGQLAVGFIRGLQG--------DDLTHPRTI-ATPKHLAVHSGPE 221
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
R FD V+ D++ T+ F + +G +VMC+YN ++G P CA LLN
Sbjct: 222 ---PGRHGFDVDVSPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTPACAADWLLNGR 278
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAV 321
+RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y + A+
Sbjct: 279 LRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDLGK-AI 336
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQ 377
+G EA +D SL L+ RLG PQ Y LG ++ + H LA +AA+Q
Sbjct: 337 ARGDADEAVLDQSLVRLFAARYRLGEL--QPQRKDPYARLGAKDVDSAAHRALALQAAQQ 394
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
IVLL+N N LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 395 SIVLLQNRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAA 452
Query: 438 N--YAPGCADIVCQNNSMIP 455
N YA G A + + MIP
Sbjct: 453 NLRYAQG-APLAAGVSGMIP 471
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 139/285 (48%), Gaps = 45/285 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR DL LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 651 VEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQH 709
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA V+ G NPGGRLP+T+Y + Y S ++
Sbjct: 710 ADAIVAAW--YPGQSGGTAIAQVLAGDVNPGGRLPVTFYRSTKDLPAYVSYDMK------ 761
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++ FG GLSYT+F Y ++P Q G N
Sbjct: 762 GRTYRYFKGEPLFAFGSGLSYTRFTY---AAP-----------QLSATTLQAGAN----- 802
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
+ +V N G G EVV VY +PP A + ++ ++G++RV + G
Sbjct: 803 -------------LQVRTQVRNSGTRAGDEVVQVYLQPPQGAQSPLRTLVGFQRVTLQPG 849
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
++ +VGF + + L VD A + G + + VG G G P
Sbjct: 850 EAREVGFELTP-RQLSDVDRAGQRAVQPGDYRVFVGGGQPGTGAP 893
>gi|182413194|ref|YP_001818260.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
gi|177840408|gb|ACB74660.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
PB90-1]
Length = 859
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 231/817 (28%), Positives = 379/817 (46%), Gaps = 139/817 (17%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----------- 58
PY D+ P R +DL+ RM+L EK Q+ L YG PR+ P W
Sbjct: 60 PYEDSSRPIDARIEDLLARMSLEEKTAQLTTL-YGFPRVLKDERPTSAWREAMWKDGIGN 118
Query: 59 -----------------------WS---EALHGVS--FIGRRTNSPPGTHFDSEVPG--- 87
WS AL+ V FI + P + + G
Sbjct: 119 IDEHLNGNTGWTNNLADPVHDLPWSLHARALNEVQRWFIEQTRLGIPVDFTNEGIRGLLH 178
Query: 88 --ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWG 144
ATSFP + ++++ +L ++IG+ EARA+ G T +SP +++ RDPRWG
Sbjct: 179 SKATSFPAELAVASTWDPALVREIGRITGREARAL------GYTNIYSPVLDLARDPRWG 232
Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
R +ET GEDP++VG + VRGLQ E+ + + KH+A Y +
Sbjct: 233 RTIETYGEDPFLVGTLGVEQVRGLQ----AEH----------VVSTLKHFAVYSIPKGGR 278
Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
+ D + T +++Q F+ PF + E VM SYN +G+P L++ +RG
Sbjct: 279 DGEARTDPQATWREVQTIFLEPFRRAIREAGALGVMASYNDYDGVPVEGSALFLSEILRG 338
Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA---- 320
W F GY+VSD +++ I H+ + T DA+ + ++AGL++ TNFT A
Sbjct: 339 QWGFRGYVVSDSAAVEFIHSKHR-VAPTPADAIRQAVEAGLNI-----RTNFTPPAAYAE 392
Query: 321 -----VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEA 374
V+ GK+A A ID +R + V +LG FD + + P+H+ +A A
Sbjct: 393 PLRQLVRDGKLAMATIDARVRDVLRVKFQLGLFDRPYVADPAAADRVVRAPEHLVVAQRA 452
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA-- 432
R+ IVLLKN+ LPL+ ++ + + GP A+ A Y + +P+ G A
Sbjct: 453 GREAIVLLKNEPALLPLDRAKLQRVLVAGPLADDAHAWWSRYGAQRLDFVTPLPGLRAKL 512
Query: 433 -YSKVINYAPG---------CADIV-----CQNNSMIPAAIDAAKNADATVIVAGLDLSV 477
+ + YA G +D++ + + I AA+ AA+N D + V G +
Sbjct: 513 GAAVEVRYAKGVEAKDAAWPASDVLKDPPSAEVRAGIEAAVAAAQNVDVIIAVLGETDEL 572
Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
E R+ L LPG+Q EL+ + K P+ LV+ + + + +A + + +I+ + +
Sbjct: 573 CRESSSRISLALPGYQQELLEALHATGK-PLVLVLSNGRPLSVVWAARH--VPAIVELWF 629
Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV 597
PGE+GG A+A V+ G NP GRLPIT + + ++PY + P P + R + +G
Sbjct: 630 PGEDGGAALAAVLLGDANPSGRLPIT-FPQSVGQLPY-NFPAHPGSQ--ARDFGQVEG-S 684
Query: 598 VYPFGYGLSYTQFKYK-VASSPKSVDIKLD----------KDQQCRDINYTVGTNKPPCA 646
++PFG+GLSYT F+Y + +P+ + + + R Y+V T
Sbjct: 685 LFPFGHGLSYTTFRYSDLRITPERIPVDGFGAAGGGDPGLRGSASRATPYSVSTVP---- 740
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAA 705
+FT +V N G G EVV +Y + T+ + G+ RV +A
Sbjct: 741 ------------EFTITCDVTNTGTRAGDEVVQLYLRDDYSSVTTYDIALRGFARVTLAP 788
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
G++ V FT++ L++ + + ++ G T+++G
Sbjct: 789 GETKPVTFTLHRAH-LELYNRDGDWVVEPGRFTVMLG 824
>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 861
Score = 283 bits (723), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 171/445 (38%), Positives = 239/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H+ D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETYPD-KEHASAGA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++AG DL+CG Y + AV+ G I E +ID SL+ L LG D + + +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+ +AD + G+ S+E E G DR D+ LP Q +L+ + A K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A KL
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ + N I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ ++RV I AG++ V + ++ + D +N++ G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPLEGTYELLYG 845
>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
Length = 861
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 170/445 (38%), Positives = 237/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E +Y
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDTKYD------ 181
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 182 --KLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAAA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++ G DL+CG Y + AV+ G I E +ID SL+ L LG D P + + +
Sbjct: 296 VRTGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPAS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 83/297 (27%), Positives = 128/297 (43%), Gaps = 53/297 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+ +AD + G+ S+E E G DR D+ LP Q N + K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKA 647
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 648 GKKVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A KL
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEA--------KLS 753
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ + N I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ ++RV I AG++ V + ++ + D +N++ G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPLEGTYELLYG 845
>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 866
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 165/452 (36%), Positives = 241/452 (53%), Gaps = 41/452 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+PY + +L ERA+DL R+TL EK + M + + +PRLG+P +EWWSEALHG++ G
Sbjct: 23 YPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
AT FP AS+++ L ++ S EA A NL
Sbjct: 82 ---------------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIK 126
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD---- 179
G++ W+PNIN+ RDPRWGR ET GEDPY+ R + V GLQ G + RD
Sbjct: 127 RYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQ---GQPFRRDMRPF 183
Query: 180 -SDSRPLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVS 237
R K AC KHYA + W +R FD R+ E+D+ ET++ F+ V EG+V
Sbjct: 184 TERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVR 240
Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-ESHKFLNDTKEDA 296
VMC+Y R++G P C + + L+Q +RG+W ++G +VSDC +I E H + +T +A
Sbjct: 241 EVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEA 300
Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
A ++AG D++CG Y AV+QG I+ IDTS+ L +G FD +
Sbjct: 301 SAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPW 359
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
K G I + H LA + AR+ + LL+N N LPL+ ++ +A++GP+AN + + G
Sbjct: 360 KLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWG 418
Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADI 446
NY G P T+ + G + + GC I
Sbjct: 419 NYTGYPISTTTILKGIRSKVPAARFVEGCGYI 450
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/328 (31%), Positives = 150/328 (45%), Gaps = 67/328 (20%)
Query: 432 AYSKVINYAPGCADIVCQ----NNSMIPAAIDAAKNADATVIV---------AGLDLSVE 478
AY + Y A VCQ S I A+ AA+ DA V+V G ++ V+
Sbjct: 573 AYRVRVEYVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVD 632
Query: 479 A---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
A +G DR + LP Q E+I + A K V V S GAV + ++L
Sbjct: 633 APGFKGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNCSGGAVAL--VPEAEACDAVLQA 689
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
Y GE GG+A+ADV+FG YNP G+LP+T+Y+++ +P GRTY++F G
Sbjct: 690 WYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD------ADLPDFLDYRMTGRTYRYFRG 743
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
++PFG+GLSYT F + P+ Y G
Sbjct: 744 TPLFPFGFGLSYTSFAF---GKPR----------------YENG---------------- 768
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
+EV N GK DG+EVV VY K P A +K + G+ R+ + AG+ +V M
Sbjct: 769 -----MLYVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRGFARIDLKAGERRRVEIAM 823
Query: 716 NACKSLKIVDNAANSL-LASGAHTILVG 742
+ + D AN++ + G H ++VG
Sbjct: 824 PR-ERFEGWDATANTMRVKPGNHLLMVG 850
>gi|297736784|emb|CBI25985.3| unnamed protein product [Vitis vinifera]
Length = 241
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 133/212 (62%), Positives = 162/212 (76%)
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDP+ V YA++YVRGLQDVEG E D +SRPLK+S+ KH+AAYDLDNW DR HF
Sbjct: 9 GEDPFTVSVYAVSYVRGLQDVEGTENTTDLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHF 68
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
++RV+EQDM ETF+ PFE CV EGDVS VMCS+N +NGIP CADP+L TIR +WN HG
Sbjct: 69 NARVSEQDMAETFLRPFEACVREGDVSGVMCSFNNINGIPPCADPRLFKGTIRDEWNLHG 128
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEAD 330
YIVSDC SI+TIVE KFL+ T E+AVA LKAGLDL+CG YY + AV G++ + D
Sbjct: 129 YIVSDCWSIETIVEDQKFLDVTGEEAVALNLKAGLDLECGHYYNDSPASAVMAGRVGQHD 188
Query: 331 IDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
+D SL LY+VLMRLG+FDG P +LGK++I
Sbjct: 189 LDQSLSNLYVVLMRLGFFDGIPALASLGKDDI 220
>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
plebeius DSM 17135]
Length = 912
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 231/809 (28%), Positives = 367/809 (45%), Gaps = 142/809 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y D K P ER +DL+ +MT+ EK QM L YG R+ LP +W ++ G+ I
Sbjct: 18 YEDPKAPLNERIEDLLSQMTVEEKTCQMVTL-YGYQRVLKDSLPTPDWKNQLWKDGIGAI 76
Query: 70 GRRTNS---------------PPGTH----------FDSE----VPG------------- 87
N+ P H F E +P
Sbjct: 77 DEHLNAFRGWGVPPMQNELVWPASNHAWALNEVQRFFVEETRLGIPADFTNEGIRGVENY 136
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L ++IG EAR + G T ++P ++V RD RWGR
Sbjct: 137 IATNFPTQLALGHTWNRELIRQIGYITGREARLL------GYTNVYAPILDVGRDQRWGR 190
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I +GLQ +++++ KH+ AY +
Sbjct: 191 YEEVYGESPYLVAELGIAMGKGLQT-------------DMQVASTAKHFIAYSNNKGARE 237
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ PF + E + VM SYN +G P + L Q +RG
Sbjct: 238 GFARVDPQMSWREVENIHAYPFTRVIQEAGILGVMSSYNDYDGFPIQSSYYWLTQRLRGT 297
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + HK D KE AV + ++AGL++ C + Y +
Sbjct: 298 MGFRGYVVSDSDAVEYLYSKHKTAKDMKE-AVRQSVEAGLNVRCTFRSPESYVLPLRELI 356
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK-NLGKNNICNPQHIELAAEAARQGIV 380
Q+G ++ ID +R + V G FD Q L + + H ++A +A+R+G+V
Sbjct: 357 QEGGLSMETIDNRVRDILRVKFLTGLFDTPYQTDLALADKEVNSEAHQQVALQASREGLV 416
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
LLKN N LPL+ IK +A+ GP+A+ + +Y T+ ++G K +
Sbjct: 417 LLKNANNLLPLDKSQIKRIAVCGPNADEASFALTHYGPVAVEVTTVLEGIKQQVKEGTKV 476
Query: 438 NYAPGC---------ADIV-----CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
Y GC ++I+ + + I A+D K +D V+V G + E K
Sbjct: 477 TYTKGCDLVDANWPESEIISYPLTAEEKTEIQKAVDNVKESDVAVVVLGGGIRTCGENKS 536
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R L LPG Q +L+ + K PV LV+++ + IN+A + + +IL YPG +GG
Sbjct: 537 RTSLDLPGHQQQLLEAIVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSQGG 593
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG-------RTYKFFDGP 596
AIA+ +FG YNPGG+L +T + +IP+ + P +P + G +GP
Sbjct: 594 TAIAEALFGDYNPGGKLTVT-FPKTVGQIPF-NFPAKPASQVDGGQTPGMKGNQSRINGP 651
Query: 597 VVYPFGYGLSYTQFKYK--VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+YPFGYGLSYT F+Y SSP V T+K P V
Sbjct: 652 -LYPFGYGLSYTTFEYSNLQLSSP-------------------VITDKEPVT------VT 685
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGF 713
CK ++N G G EVV +Y++ T+ K + G+ERV + G++ KV F
Sbjct: 686 CK---------IKNTGTRSGDEVVQLYTRDVISSVTTYEKNLRGFERVHLEPGETKKVSF 736
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
+ + ++++ + ++ G I++G
Sbjct: 737 QL-LPRDFQLLNKDNHWVVEPGMFQIMIG 764
>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 865
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 170/425 (40%), Positives = 233/425 (54%), Gaps = 42/425 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FPY + L +RA DL+ER+TL EKV M + + +PRLG+ Y+WW+EALHGV G
Sbjct: 25 FPYQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGVGRAGI 84
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN-- 125
AT FP I ASF++ L K+ VS EARA Y GN
Sbjct: 85 ----------------ATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLK 128
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLTFW+PNIN+ RDPRWGR ET GEDPY+ R + VRGLQ + ++Y
Sbjct: 129 RYQGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPDNMKYD------ 182
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KHYA + W +R F++ + +D+ ET++ F+ V E DV VMC+
Sbjct: 183 --KLHACAKHYAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKALVQEADVKEVMCA 237
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G IVSDC +I H+ D KE A A
Sbjct: 238 YNRFEGEPCCGSNRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDHETHPD-KETASAGA 296
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
+ +G DL+CG+ Y + AVQ+G I E ID S++ L LG D + ++ +
Sbjct: 297 VLSGTDLECGNNYKSLPE-AVQKGLIDEKQIDISVKRLLTARFELGEMDEHVCWDSIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + H +LA E AR+ IVLL+N N LPL ++K +AL+GP+AN + GNY G P
Sbjct: 356 VVDSKAHKDLALEIARKSIVLLQNRNNILPLKE-DMK-IALIGPNANDSVMQWGNYNGFP 413
Query: 421 CRYTS 425
++
Sbjct: 414 SHTST 418
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 94/303 (31%), Positives = 138/303 (45%), Gaps = 59/303 (19%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
+ A+ID K AD V G+ S+E E G DR + LP Q LI+++
Sbjct: 591 LQASIDKVKAADVIVFAGGISPSLEGEEMPVNAEGFKGGDRTTIELPAIQRRLISELKKL 650
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIK---SILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
K P+ V S AV + P+ K +IL YPG+ GG A+ADV+FG YNP G+L
Sbjct: 651 GK-PIIFVNYSGSAVGLE-----PESKICDAILQAWYPGQAGGTAVADVLFGDYNPSGKL 704
Query: 561 PITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
P+T+Y+ +P + GRTY++ +Y FG+GLSYT F Y A+ +
Sbjct: 705 PVTFYKHT------DQLPDFQDYSMKGRTYRYMTESPLYSFGHGLSYTNFTYGPATLSQQ 758
Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
T+ K + T I V+N G DG EVV V
Sbjct: 759 ----------------TISQGK----------------EVTLTIPVQNTGNYDGEEVVQV 786
Query: 681 YSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTI 739
Y G + ++RV IA GQ A V FT+++ ++ + D N++ + G + +
Sbjct: 787 YLSCSGDKEGPSHTLRAFKRVHIAKGQRANVSFTLDS-ETFQWFDTNTNTMRMVEGNYEL 845
Query: 740 LVG 742
L G
Sbjct: 846 LYG 848
>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 863
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 169/448 (37%), Positives = 241/448 (53%), Gaps = 46/448 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +PY D KL +RA DL++R+TL EKV M + + +PRLG+ YEWW+EALHGV+
Sbjct: 24 SKYPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
G AT FP I ASFN+ L ++ VS EARA N
Sbjct: 84 GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQ 127
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLT W+PN+N+ RDPRWGR ET GEDPY+ GR + VRGLQ E EY
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
K+ AC KH+A + W +R F++ + +D+ ET++ F+ V + V VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
C+YNR G P C +LL Q +R DW F G +V+DC +I + K ++T DA
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
A + +G DL+CG + + T AV++ I+E I+TS++ + LG + + + N+
Sbjct: 295 ADAVLSGTDLECGGNFKSIT-DAVKKDLISEEKINTSVKRVLKARFELGEMNSTHPWSNI 353
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ I P+H ELA + A + +VLL+N+N LPLN +A++GP+AN + GNY
Sbjct: 354 PFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQ--MKVAVIGPNANDSVMQWGNYN 411
Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGC 443
G P + ++G A I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVC 439
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 135/296 (45%), Gaps = 53/296 (17%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
++ ++AD + G+ +E E G DR ++ LP Q E++ + K
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
V V S A+ I N +IL YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAIVPETQN--CDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
Y ++ GRTY+F +YPFGYGLSYT+F Y A+ +S KL K
Sbjct: 711 MQQLPDYEDYSMK------GRTYRFMTETPLYPFGYGLSYTRFSYGKATLNQS---KLTK 761
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
+ K I V N+G+ DG EVV VY P
Sbjct: 762 GE-----------------------------KAILTIPVSNVGQRDGEEVVQVYICRPDD 792
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
K + G++RV IA G++ V + S + D A N++ +G + IL G
Sbjct: 793 KEGPQKTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYG 847
>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 866
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 165/452 (36%), Positives = 241/452 (53%), Gaps = 41/452 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+PY + +L ERA+DL R+TL EK + M + + +PRLG+P +EWWSEALHG++ G
Sbjct: 23 YPYQNLQLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
AT FP AS+++ L ++ S EA A NL
Sbjct: 82 ---------------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIK 126
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD---- 179
G++ W+PNIN+ RDPRWGR ET GEDPY+ R + V GLQ G + RD
Sbjct: 127 RYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQ---GQPFRRDMRPF 183
Query: 180 -SDSRPLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVS 237
R K AC KHYA + W +R FD R+ E+D+ ET++ F+ V EG+V
Sbjct: 184 TERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVR 240
Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-ESHKFLNDTKEDA 296
VMC+Y R++G P C + + L+Q +RG+W ++G +VSDC +I E H + +T +A
Sbjct: 241 EVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEA 300
Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
A ++AG D++CG Y AV+QG I+ IDTS+ L +G FD +
Sbjct: 301 SAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPW 359
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
K G I + H LA + AR+ + LL+N N LPL+ ++ +A++GP+AN + + G
Sbjct: 360 KLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWG 418
Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADI 446
NY G P T+ + G + + GC I
Sbjct: 419 NYTGYPISTTTILKGIRSKVPAARFVEGCGYI 450
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/328 (31%), Positives = 149/328 (45%), Gaps = 67/328 (20%)
Query: 432 AYSKVINYAPGCADIVCQ----NNSMIPAAIDAAK--NADATVIVAGLDLSVEAE----- 480
AY + Y A VCQ S I A+ AA+ +AD V V G+ +E E
Sbjct: 573 AYRVRVEYVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVD 632
Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
G DR + LP Q E+I + A K V V S GAV + ++L
Sbjct: 633 APGFNGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNCSGGAVAL--VPEAEACDAVLQA 689
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
Y GE GG+A+ADV+FG YNP G+LP+T+Y+++ +P GRTY++F G
Sbjct: 690 WYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD------ADLPDFLDYRMTGRTYRYFRG 743
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
++PFG+GLSYT F V +P+ + KL
Sbjct: 744 TPLFPFGFGLSYTSF---VFGTPRYENGKL------------------------------ 770
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
+EV N GK DG+EVV VY K P A +K + G+ R+ + AG+ +V M
Sbjct: 771 -------YVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRGFARIDLKAGERRRVEIAM 823
Query: 716 NACKSLKIVDNAANSL-LASGAHTILVG 742
+ + D N++ + G H ++VG
Sbjct: 824 PR-ERFEGWDATTNTMRVKPGNHLLMVG 850
>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
XB1A]
Length = 861
Score = 282 bits (721), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 171/445 (38%), Positives = 239/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H+ D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASAGA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++AG DL+CG Y + AV+ G I E +ID SL+ L LG D + + +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+ +AD + G+ S+E E G DR D+ LP Q +L+ + A K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A KL
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ + N I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ ++RV I AG++ V + ++ + D +N++ G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPLEGTYELLYG 845
>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
Length = 870
Score = 282 bits (721), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 166/428 (38%), Positives = 239/428 (55%), Gaps = 47/428 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + ER DLV R+TL EKV Q+ D + + RL +P Y WW+EALHGV+ G+
Sbjct: 34 YLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWNEALHGVARAGK-- 91
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+F+E L ++G +S E RA ++ A
Sbjct: 92 --------------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLAENNRSMY 137
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT+WSPNIN+ RDPRWGR ET GEDPY+ R A+N++ GLQ + EY L
Sbjct: 138 TGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQG-DNTEY--------L 188
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K A KHYA + R D +++D+ ET++ F+ + + V+SVMC+YN
Sbjct: 189 KSVATLKHYAVHSGPEVS---RHSDDYTASKKDLAETYLPAFKDVIAQTKVASVMCAYNS 245
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI--VESHKFLNDTKEDAVARVLKA 303
VNG P C + +L+ +R ++NF GYIVSDC +I V+SH +N T+ A A LK
Sbjct: 246 VNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVN-TEAKAAAMALKT 304
Query: 304 GLDLDCGDYYTN---FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG 358
G DL+CGD++ N + AV++G + E D+D +L+ L +LG FD Y +
Sbjct: 305 GTDLNCGDHHGNTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFDNPENVPYSDTS 364
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ + + +H+ L EAA++ +VLLKN+ LPL GN K +AL+GP+A+ ++GNY G
Sbjct: 365 IDIVGSNKHLALTQEAAKKSLVLLKNEQ-VLPL-KGNEK-VALIGPNADNEAILLGNYNG 421
Query: 419 TPCRYTSP 426
P +P
Sbjct: 422 MPIVPITP 429
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/328 (28%), Positives = 148/328 (45%), Gaps = 58/328 (17%)
Query: 431 YAYSKVINYAPGCADIVCQN-NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK------- 482
+ +S VIN P + +N S+ A++ A AD V V G+ ++E E
Sbjct: 574 FWHSNVIN--PTASLTWLKNPQSLTQQALNNANEADVIVFVGGISANLEGEEMPLQIDGF 631
Query: 483 ---DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
DR ++ LP Q L+ K+ K P+ LV MS A+ +N+ N I +I+ YPG
Sbjct: 632 SHGDRTNINLPKSQLNLLKKLKQTGK-PIVLVNMSGSAMALNWENEN--IDAIIQGFYPG 688
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
E G A+ +++G+Y+P G+LPIT+Y++ + +P + RTYK+++G V+Y
Sbjct: 689 EAAGSALVSLLYGEYSPSGKLPITFYKS------VSDLPDFKDYSMKNRTYKYYEGEVLY 742
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFG+GLSY FKYK + + D D+N T
Sbjct: 743 PFGFGLSYADFKYK--------NTRHSIDAGSGDLNLTT--------------------- 773
Query: 660 FTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
+ N +VV VY S P T KQ++G++ + + + FT+
Sbjct: 774 -----TITNQSSFSADDVVQVYVSMPDAPIKTPNKQLVGFKHITLKNESKNDIKFTIPKN 828
Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVG 746
K L ++ ++ G I VG G G
Sbjct: 829 K-LSYINEQGIAVAYKGRLIITVGSGQG 855
>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 853
Score = 282 bits (721), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 161/421 (38%), Positives = 237/421 (56%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ R+T+ EK+ + + G+PRLG+ Y +EALHGV GR
Sbjct: 30 YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 87
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L K++ +S EARA +N + G
Sbjct: 88 --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 133
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G +V+GLQ D R
Sbjct: 134 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------DDPR 184
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E + FEMCV EG +S+M +Y
Sbjct: 185 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 240
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A ++A
Sbjct: 241 NALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQA 299
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + + A +Q +++ADID++ + M+LG FDG+ + Y + +
Sbjct: 300 GLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPS 359
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H ++A +AAR+ IVLLKN N LPLN +K++A+VG NA K G+Y G P
Sbjct: 360 VIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAP 417
Query: 421 C 421
Sbjct: 418 V 418
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 147/303 (48%), Gaps = 55/303 (18%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + + V V G++ S+E EG+DR D+ LP Q E + ++ P +V++ AG+ +
Sbjct: 600 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 657
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
+N+ + I +I+ YPGE+GG A+ADV+FG YNP GRLP+T+Y++ +P
Sbjct: 658 AVNWMDEH--IPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS------LDELP 709
Query: 579 -LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
+ GRTYK+F G V+YPFGYGLSY+ FKY
Sbjct: 710 AFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYS------------------------ 745
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAG-THIKQ 694
D+K KD T + ++N GK G EV VY + P G IK+
Sbjct: 746 --------------DLKVKDGANTISVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKE 791
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSFPLQ 753
+ G+ R+ + +G+S V ++ + L+ D + GA I+VG +
Sbjct: 792 LKGFRRIPLKSGESRVVDIELDK-EQLRYWDAGLGQFIVPQGAFDIMVGASSKDIRLQTV 850
Query: 754 LNL 756
+NL
Sbjct: 851 INL 853
>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
Length = 596
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 204/635 (32%), Positives = 308/635 (48%), Gaps = 84/635 (13%)
Query: 128 LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
+T+WSPN+N+ RDPRWGR ET GEDPY+ YVRGLQ +D LK
Sbjct: 1 MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQG---------NDPFFLKA 51
Query: 188 SACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+AC KHYA + G + R F++ +++D+ ET++ FE V E V +VM +YNR
Sbjct: 52 AACAKHYAVH-----SGPEALRHEFNASPSKRDLFETYLPAFEALVKEAKVEAVMGAYNR 106
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
V G LL +R W F G++VSDC ++ I HK D E A A LK+GL
Sbjct: 107 VYGESASGSFFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHKIAKDVAE-ASAIALKSGL 165
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF---DGSPQYKNLGKNNI 362
+L+CG + A+++ I E D+D +L L + ++LG D SP YKN+ + I
Sbjct: 166 NLNCGGSFHALKE-ALERKLITEVDLDNALMPLMMTRLKLGNLTDDDESP-YKNISDSVI 223
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+ H +A E A++ +VLLKN+N LPL ++KT+ + GP+A T M+GNY G R
Sbjct: 224 ASYTHAMVAREVAQKSMVLLKNNNHTLPLKK-DVKTIFVTGPYAADTYVMMGNYYGVSPR 282
Query: 423 YTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPA--AIDAAKNADATVIVAGL---- 473
+ + G A INY G I+ +M PA + + A+ ++V GL
Sbjct: 283 SNTFLQGIAAKVSGGTSINYKIG---ILPTTPNMNPADWTVGEVRAAEVAIVVIGLSGID 339
Query: 474 -----DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
D + D+ +L LP Q + + ++ + VI +D+
Sbjct: 340 EGEEGDAIASSHRGDKQNLKLPEHQLKFLRDISRNRWNKLVTVITGGSPIDLEEVSELSD 399
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR 588
+ W YPG+EGG A+ D++FG + GR+P+T+ I +P N GR
Sbjct: 400 AVIMAW--YPGQEGGMALGDLLFGDVSFSGRMPVTF------PINSDWLPAFEDYNMQGR 451
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
TYK+ ++YPFGYGL+Y Y S K ++ K D Q+
Sbjct: 452 TYKYMTDNIMYPFGYGLTYGDVSY---SDVKILNPKYDGKQEIH---------------- 492
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG--THIKQVIGYERVFIAAG 706
Q + N G + EVV +Y PG AG T I +IG++RV + +
Sbjct: 493 -------------VQATLRNNGNNEVEEVVQLYLSAPG-AGVITPISSLIGFKRVTLESH 538
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
S V F + + ++++ + +LL G +TI+V
Sbjct: 539 LSQTVEFIIKPDQLKMVMEDGSKNLL-KGKYTIIV 572
>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
Length = 850
Score = 281 bits (720), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 161/421 (38%), Positives = 237/421 (56%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ R+T+ EK+ + + G+PRLG+ Y +EALHGV GR
Sbjct: 27 YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 84
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L K++ +S EARA +N + G
Sbjct: 85 --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 130
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G +V+GLQ D R
Sbjct: 131 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------DDPR 181
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E + FEMCV EG +S+M +Y
Sbjct: 182 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 237
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A ++A
Sbjct: 238 NALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQA 296
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + + A +Q +++ADID++ + M+LG FDG+ + Y + +
Sbjct: 297 GLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPS 356
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H ++A +AAR+ IVLLKN N LPLN +K++A+VG NA K G+Y G P
Sbjct: 357 VIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAP 414
Query: 421 C 421
Sbjct: 415 V 415
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 147/303 (48%), Gaps = 55/303 (18%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + + V V G++ S+E EG+DR D+ LP Q E + ++ P +V++ AG+ +
Sbjct: 597 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 654
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
+N+ + I +I+ YPGE+GG A+ADV+FG YNP GRLP+T+Y++ +P
Sbjct: 655 AVNWMDEH--IPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS------LDELP 706
Query: 579 -LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
+ GRTYK+F G V+YPFGYGLSY+ FKY
Sbjct: 707 AFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFKY------------------------- 741
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAG-THIKQ 694
D+K KD T + ++N GK G EV VY + P G IK+
Sbjct: 742 -------------SDLKVKDGANTVSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKE 788
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSFPLQ 753
+ G+ R+ + +G+S V ++ + L+ D + GA I+VG +
Sbjct: 789 LKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGRFIVPQGAFDIMVGASSKDIRLQTV 847
Query: 754 LNL 756
+NL
Sbjct: 848 INL 850
>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
Length = 468
Score = 281 bits (720), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 166/428 (38%), Positives = 238/428 (55%), Gaps = 47/428 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + ER DLV R+TL EKV Q+ D + + RL +P Y WW+EALHGV+ G+
Sbjct: 34 YLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWNEALHGVARAGK-- 91
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL--------GN 125
AT FP I A+F+E L ++G +S E RA ++
Sbjct: 92 --------------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLEENNRSMY 137
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT+WSPNIN+ RDPRWGR ET GEDPY+ R A+N++ GLQ + EY L
Sbjct: 138 TGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQG-DNAEY--------L 188
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K A KHYA + + R D +E+D+ ET++ F+ + + V+SVMC+YN
Sbjct: 189 KSVATLKHYAVH---SGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQTKVASVMCAYNS 245
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI--VESHKFLNDTKEDAVARVLKA 303
VNG P C + +L+ +R ++NF GYIVSDC +I V+SH +N T A A LK
Sbjct: 246 VNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVN-TGAKAAAMALKT 304
Query: 304 GLDLDCGDYYTN---FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLG 358
G DL+CGD++ N + AV++G + E D+D +L+ L +LG FD Y +
Sbjct: 305 GTDLNCGDHHGNTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFDNPENVPYSDTS 364
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ + + +H+ L EAA++ +VLLKN+ LPL GN K +AL+GP+A+ ++GNY G
Sbjct: 365 IDVVGSNKHLALTQEAAQKSLVLLKNEQ-VLPLK-GNEK-IALIGPNADNEAILLGNYNG 421
Query: 419 TPCRYTSP 426
P +P
Sbjct: 422 MPIVPITP 429
>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
Length = 861
Score = 281 bits (720), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 171/445 (38%), Positives = 238/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++AG DL+CG Y + AV+ G I E +ID SL+ L LG D + + +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 127/285 (44%), Gaps = 52/285 (18%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+ +AD + G+ S+E E G DR D+ LP Q +L+ + A K
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A KL
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEA--------KLS 753
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ + N I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
+ ++RV I AG++ V ++ +S + D A N++
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAISLTH-ESFEWFDEATNTM 833
>gi|336404202|ref|ZP_08584900.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
gi|335943530|gb|EGN05369.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
Length = 735
Score = 281 bits (720), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 222/760 (29%), Positives = 355/760 (46%), Gaps = 97/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
Y DAK P +R DL+ RMTL EK+ Q+ G VP +G +Y
Sbjct: 30 YKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89
Query: 57 --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
++ + R P +D+ T +P + S+N L ++ +
Sbjct: 90 INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G +A VRG
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +++AC KHY Y G D + + ++ Q + +T++LP+EM V G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P A+P ++ + ++ W G+IVSD +++ + ++ L TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
DA AGL++D + Y V++GK+ A +D S+R + V RLG F+
Sbjct: 311 DAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K+ PQ + +AA+ A + +VLLKNDN LPL N K +A+VGP A ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKKIAVVGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G DG A + YA GC + S A+D A+ +D +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKP-QGNDRSGFAGALDVARWSDVVI 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G L+ E R + LP Q EL+ ++ +A K PV LV+ + +++N + P
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVLSNGRPLELN--RMEPL 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
+IL + PG G R++A ++ G+ NP G+L +T+ PY++ +P+
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596
Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
GR ++ F + +YPFG+GLSYT+FKY GT
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
P V D K + ++ V N G DG+E V + P + T +K++ +E+
Sbjct: 631 PSATKVKRGD------KLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
I AG++ F ++ + V+ L +G + ILV
Sbjct: 685 LIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
Length = 866
Score = 281 bits (720), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 165/452 (36%), Positives = 240/452 (53%), Gaps = 41/452 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+PY + +L ERA+DL R+TL EK + M + + +PRLG+P +EWWSEALHG++ G
Sbjct: 23 YPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
AT FP AS+++ L + S EA A NL
Sbjct: 82 ---------------FATVFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLARKSGDIK 126
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRD---- 179
G++ W+PNIN+ RDPRWGR ET GEDPY+ R + V GLQ G + RD
Sbjct: 127 RYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQ---GQPFRRDMRPF 183
Query: 180 -SDSRPLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVS 237
R K AC KHYA + W +R FD R+ E+D+ ET++ F+ V EG+V
Sbjct: 184 TERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVR 240
Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-ESHKFLNDTKEDA 296
VMC+Y R++G P C + + L+Q +RG+W ++G +VSDC +I E H + +T +A
Sbjct: 241 EVMCAYQRIDGSPCCGNTRYLHQILRGEWEYNGLVVSDCGAISDFYREGHHHVVETPAEA 300
Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
A ++AG D++CG Y AV+QG I+ IDTS+ L +G FD +
Sbjct: 301 SAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPW 359
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
K G I + H LA + AR+ + LL+N N LPL+ ++ +A++GP+AN + + G
Sbjct: 360 KLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWG 418
Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGCADI 446
NY G P T+ + G + + GC I
Sbjct: 419 NYTGYPISTTTILKGIRSKVPAARFVEGCGYI 450
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/328 (31%), Positives = 150/328 (45%), Gaps = 67/328 (20%)
Query: 432 AYSKVINYAPGCADIVCQ----NNSMIPAAIDAAKNADATVIV---------AGLDLSVE 478
AY + Y A VCQ S I A+ AA+ DA V+V G ++ V+
Sbjct: 573 AYRVRVEYVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVD 632
Query: 479 A---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
A +G DR + LP Q E+I + A K V V S GAV + ++L
Sbjct: 633 APGFKGGDRTSIELPEAQREVIRLLRQAGK-LVVFVNCSGGAVAL--VPETEACDAVLQA 689
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
Y GE GG+A+ADV+FG YNP G+LP+T+Y+++ +P GRTY++F G
Sbjct: 690 WYAGEAGGQAVADVLFGDYNPSGKLPVTFYKSD------ADLPDFLDYRMTGRTYRYFRG 743
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
++PFG+GLSYT F + P+ + KL
Sbjct: 744 IPLFPFGFGLSYTSFAF---GKPRYENGKL------------------------------ 770
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
+EV N GK DG+EVV VY K P A +K + G+ R+ + AG+ +V M
Sbjct: 771 -------YVEVTNTGKRDGAEVVQVYVKNPADADGPVKTLRGFARIDLKAGERRRVEIAM 823
Query: 716 NACKSLKIVDNAANSL-LASGAHTILVG 742
+ + D N++ + G H ++VG
Sbjct: 824 PR-ERFEGWDATTNTMRVKPGNHLLMVG 850
>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
Length = 850
Score = 281 bits (720), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 161/421 (38%), Positives = 237/421 (56%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ R+T+ EK+ + + G+PRLG+ Y +EALHGV GR
Sbjct: 27 YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 84
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L K++ +S EARA +N + G
Sbjct: 85 --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 130
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G +V+GLQ D R
Sbjct: 131 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------DDPR 181
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E + FEMCV EG +S+M +Y
Sbjct: 182 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 237
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A ++A
Sbjct: 238 NALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQA 296
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + + A +Q +++ADID++ + M+LG FDG+ + Y + +
Sbjct: 297 GLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPS 356
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H ++A +AAR+ IVLLKN N LPLN +K++A+VG NA K G+Y G P
Sbjct: 357 VIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAP 414
Query: 421 C 421
Sbjct: 415 V 415
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/303 (31%), Positives = 147/303 (48%), Gaps = 55/303 (18%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + + V V G++ S+E EG+DR D+ LP Q E + ++ P +V++ AG+ +
Sbjct: 597 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 654
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
+N+ + I +I+ YPGE+GG A+ADV+FG YNP GRLP+T+Y++ +P
Sbjct: 655 AVNWMDEH--IPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS------LDELP 706
Query: 579 -LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
+ GRTYK+F G V+YPFGYGLSY+ FKY
Sbjct: 707 AFDDYDITQGRTYKYFKGDVLYPFGYGLSYSSFKYS------------------------ 742
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGIAG-THIKQ 694
D+K KD T + ++N GK G EV VY + P G IK+
Sbjct: 743 --------------DLKVKDGANTVSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIKE 788
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSFPLQ 753
+ G+ R+ + +G+S V ++ + L+ D + GA I++G +
Sbjct: 789 LKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGQFIVPQGAFDIMIGASSKDIRLQTV 847
Query: 754 LNL 756
+NL
Sbjct: 848 INL 850
>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
Length = 861
Score = 281 bits (720), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 171/445 (38%), Positives = 238/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++AG DL+CG Y + AV+ G I E +ID SL+ L LG D + + +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 82/291 (28%), Positives = 129/291 (44%), Gaps = 53/291 (18%)
Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
+AD + G+ S+E E G DR D+ LP Q +L+ + K +V
Sbjct: 597 DADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGK---KVVF 653
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 654 INYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK------ 707
Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
+P + GRTY++ ++PFG+GLSYT F Y A KL K+ +
Sbjct: 708 DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLSKNTIAK 759
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
N I V N+G+ DG EVV VY + PG
Sbjct: 760 GEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPGDKEGPR 795
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ ++RV I AG++ V ++ ++ + D +N++ G + +L G
Sbjct: 796 YTLRAFKRVHIPAGKTESVAISLTG-ENFEWFDVESNTMRPLEGTYELLYG 845
>gi|225873995|ref|YP_002755454.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225792796|gb|ACO32886.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 896
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 168/432 (38%), Positives = 234/432 (54%), Gaps = 45/432 (10%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
P+ + P +R +LV +MTL E+ QM + A +PRLG+P Y WWSE LHG++ G
Sbjct: 38 PWDNPNQPIQKRVHELVSQMTLQEEAAQMMNTAPAIPRLGVPAYNWWSEGLHGIARSGY- 96
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
AT FP I +A+F+ + ++G TVSTEARA YN
Sbjct: 97 ---------------ATVFPQAIGMSATFDPAAIHQMGTTVSTEARAKYNWAIRHDIHSI 141
Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLT W+PNIN+VRDPRWGR ET GEDP++ G A YV GLQ ++ +
Sbjct: 142 YFGLTLWAPNINIVRDPRWGRGQETYGEDPFLTGTMAAEYVSGLQ---------GNNPKY 192
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
LK A KH++ Y N + R ++ + DMQ+T++ F M + +G S+MCSYN
Sbjct: 193 LKTVATPKHFSVY---NGPESMRHKINANPSAHDMQDTYLAAFRMAITKGHADSMMCSYN 249
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARVLK 302
V G+P+CA+ KLL +RG W F GYI SDC +I +H + D A + VL
Sbjct: 250 AVYGVPSCAN-KLLADVVRGKWGFDGYITSDCGAISDFYRPGAHGYSPDAVHAAASAVL- 307
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
AG D DCG Y +VQQG I++A ID ++ L+ RLG FD Y ++ +
Sbjct: 308 AGTDTDCGTGYKVLPQ-SVQQGLISKAAIDRAVERLFTARFRLGMFDPKADVPYNSIPYS 366
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + H A E A + +VLLKN+ G LPL N +T+A+VGP+A ++ GNY P
Sbjct: 367 VVDSAAHRAQALEDASKSMVLLKNEGGILPLR--NARTIAVVGPNAANLNSIEGNYNAIP 424
Query: 421 CRYTSPMDGFYA 432
+ P+DG A
Sbjct: 425 SHPSLPVDGIEA 436
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 86/263 (32%), Positives = 135/263 (51%), Gaps = 42/263 (15%)
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
+G DR L LP Q +L++ + K PV LV+++ A+ I++AK + ++ IL YPG
Sbjct: 652 DGGDRTRLSLPQTQQDLLHALVATGK-PVVLVLLNGSALSIDWAKQH--VQGILEAWYPG 708
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
E GG AI + + G+ +PGG+LPIT+Y + P+T ++ GRTY+++ G ++
Sbjct: 709 EAGGEAIGETLSGQNDPGGKLPITFYTSVKDLPPFTDYSMK------GRTYRYYTGKPLF 762
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFGYGLSYT F+Y R + +P
Sbjct: 763 PFGYGLSYTTFEYS----------------HVRLSTSNLKAGEP---------------- 790
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
T + EV+N G + G V VY PP +K++ G++RV +A GQS ++ FT+N +
Sbjct: 791 LTVEAEVKNTGHVAGDAVTEVYVTPPQNGVNPLKELKGFDRVHLAPGQSRQLTFTLNP-R 849
Query: 720 SLKIVDNAANSLLASGAHTILVG 742
L +VD A + G ++I VG
Sbjct: 850 DLSLVDEAGKRSVQPGVYSIFVG 872
>gi|440733337|ref|ZP_20913088.1| beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440362904|gb|ELQ00083.1| beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 895
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 175/452 (38%), Positives = 240/452 (53%), Gaps = 53/452 (11%)
Query: 31 RMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATS 90
+MT EKV Q + A +PRLG+P YEWW+E LHG++ G AT
Sbjct: 53 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------------ATV 96
Query: 91 FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSPNINVVRDP 141
FP I A++N +L +++G STEARA +NL AGLT WSPNIN+ RDP
Sbjct: 97 FPQAIGLAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDP 156
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RWGR +ET GEDPY+ G+ A+ ++ GLQ D + P I A KH A +
Sbjct: 157 RWGRGMETYGEDPYLTGQLAVGFIHGLQG--------DDLTHPRTI-ATPKHLAVHSGPE 207
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
R FD V+ D++ T+ F + +G SVMC+YN ++G P CA LLN
Sbjct: 208 ---PGRHGFDVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGR 264
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAV 321
+RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y + A+
Sbjct: 265 LRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDLGK-AI 322
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQ 377
+G EA +D SL L+ RLG PQ Y LG ++ + H LA +AA+Q
Sbjct: 323 ARGDADEALLDQSLVRLFAARYRLGEL--QPQRKDPYAQLGAKDVDSAAHRALALQAAQQ 380
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
IVLL+N N LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 381 SIVLLQNRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAA 438
Query: 438 N--YAPGCADIVCQNNSMIPAAIDAAKNADAT 467
N YA G A + + MIP + A ++D T
Sbjct: 439 NVRYAQG-APLAAGVSGMIP---ETALHSDGT 466
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 139/285 (48%), Gaps = 45/285 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR DL LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 637 VEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQH 695
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA V+ G NPGGRLP+T+Y + Y S ++
Sbjct: 696 ADAIVAAW--YPGQSGGTAIAQVLAGDVNPGGRLPVTFYRSTKDLPAYVSYDMK------ 747
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++ FG GLSYT+F Y ++P Q G N
Sbjct: 748 GRTYRYFKGEPLFAFGSGLSYTRFTY---AAP-----------QLSATTLQAGAN----- 788
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
+ +V N G G EVV VY +PP A + ++ ++G++RV + G
Sbjct: 789 -------------LQVRTQVSNSGTRAGDEVVQVYLQPPQGAQSPLRTLVGFQRVTLQPG 835
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
++ +VGF + + L VD A + G + + VG G G P
Sbjct: 836 EAREVGFELTP-RQLSDVDRAGQRAVQPGDYRVFVGGGQPGTGAP 879
>gi|347736643|ref|ZP_08869226.1| xylosidase/arabinosidase [Azospirillum amazonense Y2]
gi|346919803|gb|EGY01181.1| xylosidase/arabinosidase [Azospirillum amazonense Y2]
Length = 775
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 232/729 (31%), Positives = 351/729 (48%), Gaps = 123/729 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P+ + E LHG IG TSFP I +S++ L +++
Sbjct: 121 RLGIPVL-FHEEGLHGYPAIG-----------------PTSFPQAIAQASSWDPDLIREV 162
Query: 110 GQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGL 168
V+ E R G++ SP ++V RDPRWGR+ ET GEDPY+ G + V+GL
Sbjct: 163 DSVVAREIRVR------GVSLVLSPVVDVARDPRWGRIEETFGEDPYLAGEMGVAAVQGL 216
Query: 169 QDVEGVEYHRDSDSRPL---KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFIL 225
Q DS PL K+ A KH + N + V E+ ++E F
Sbjct: 217 Q----------GDSLPLADGKVFATLKHLTGHGQPESGTN---VGPASVGERTLREMFFP 263
Query: 226 PFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES 285
PFE ++ +V +VM SYN ++G+P+ + LL+ +RG+W + G I+SD +I +V
Sbjct: 264 PFEQVIHRTNVRAVMASYNEIDGVPSHVNTWLLHDILRGEWGYKGSIISDYSAIDQLVSI 323
Query: 286 HKFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
H + D A+ R ++AG+D D G+ Y + +V+ GKI E ID ++R + +
Sbjct: 324 HHVVPDLPSAAI-RAIQAGVDADLPDGESYASLA-DSVRAGKIKEEVIDRAVRRILELKF 381
Query: 344 RLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
+ G F+ + + N + +A +AA++ +VLLKND G LPL+ +KTLA++G
Sbjct: 382 QAGLFEHPYADADKAEALTANGEARAVALKAAQKSVVLLKND-GVLPLDMAKVKTLAVIG 440
Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKV-INYAPGC---------ADIV---- 447
P NA KA +G Y G P + S +DG A ++V + YA G D V
Sbjct: 441 P--NAAKAHLGGYSGEPKQTVSILDGIKAKVGARVKVTYAEGVRITKDDDWYGDTVELAD 498
Query: 448 -CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKV 500
+N +I A+ AK AD V+V G + EG DR L L G Q +L +
Sbjct: 499 PAENARLIQQAVAVAKTADHIVLVIGDNEQTSREGWANNHLGDRDSLDLVGQQNDLAKAL 558
Query: 501 ADAAKGPVTLVIMSA---GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
K PV +V+ + VD+ A+ N ++ W Y G+EGG A+ADV+FG NPG
Sbjct: 559 FALGK-PVVVVLQNGRPLSVVDVA-ARANALVEG--W--YLGQEGGTAMADVLFGDVNPG 612
Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNNFPG--RTYKFFDGPVVYPFGYGLSYTQFKYKVA 615
G+LP+T V +P+ N P R Y F ++PFGYGLSYT F
Sbjct: 613 GKLPVT------VARSVGQLPMF-YNKKPSARRGYLFDTTDPLFPFGYGLSYTTFD---V 662
Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGS 675
SP+ + P A KD T ++V N GK G
Sbjct: 663 GSPR--------------------LSTPTIA---------KDGAITVAVDVRNTGKRAGD 693
Query: 676 EVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
EVV +Y + T +K++ G++R+ +A G+S V FT++ K+L + + ++
Sbjct: 694 EVVQLYLHQQVASVTRPVKELKGFQRITLAPGESRTVTFTVDG-KALALWNQDMKRVVEP 752
Query: 735 GAHTILVGE 743
GA I+VG+
Sbjct: 753 GAFDIMVGD 761
>gi|320105647|ref|YP_004181237.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319924168|gb|ADV81243.1| glycoside hydrolase family 3 domain protein [Terriglobus saanensis
SP1PR4]
Length = 885
Score = 281 bits (719), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 179/445 (40%), Positives = 237/445 (53%), Gaps = 48/445 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L P RA+DLV RMTL EK QM + A + RLG+P Y++WSE LHGV+ G
Sbjct: 30 YLDPTLSPPARARDLVHRMTLEEKTAQMINTAPAIDRLGVPAYDFWSEGLHGVARSGY-- 87
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+++E L +IG VSTEARA YN
Sbjct: 88 --------------ATLFPQAIGMAATWDEPLMHEIGTVVSTEARAKYNDAVQHGVHSIY 133
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT WSPNIN+ RDPRWGR ET GEDP++ R +VRG+Q D
Sbjct: 134 FGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARMGTAFVRGIQG---------DDPNYF 184
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ A KH+A + + R F+ V++ D+ +T++ F + EG S+MC+YNR
Sbjct: 185 RTIATPKHFAVHSGPE---STRHTFNVDVSQHDLWDTYLPAFRSTIIEGKADSIMCAYNR 241
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARVLKA 303
++G P CA LL Q +RGDW F G++ SDC +I H F + KEDA A +KA
Sbjct: 242 IDGQPACASDLLLKQILRGDWGFRGFVTSDCGAIDDFYTKIGHHFSKE-KEDASAAGVKA 300
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
G D CG Y T AV+ G I E ++D SL L+ +RLG FD + Y L
Sbjct: 301 GTDTACGKTYLGLT-SAVKSGLITEHEMDISLERLFEARIRLGLFDDPARMPYARLTMAE 359
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ +P H LA AAR+ IVLLKN N LPL+ +K +A++GP+A + A+ GNY
Sbjct: 360 VNSPAHRALALRAARESIVLLKNANNLLPLH--GVKNIAVIGPNAASLDALEGNYNAIAR 417
Query: 422 RYTSPMDGFYAY---SKVINYAPGC 443
P+DG A +KV+ YA G
Sbjct: 418 DPAMPVDGIAAAFPGAKVV-YAQGA 441
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 91/288 (31%), Positives = 132/288 (45%), Gaps = 58/288 (20%)
Query: 468 VIVAGLDLSVEAEGK------------DRVDLLLPGFQTELINKVADAAKGPVTLVIMSA 515
V+VA + LS E EG+ DR D+ LP Q EL+ V K P+ +V+M+
Sbjct: 621 VVVAFVGLSPELEGEEMPIKVKGFAGGDRTDIELPQTQLELLRAVKATGK-PLIVVLMNG 679
Query: 516 GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYT 575
A+ A + + ++L YPGE G +AIA+ + GK NP GRLP+T+Y
Sbjct: 680 SAI----ALKDSETDALLEAWYPGEAGAQAIAETLAGKNNPSGRLPLTFYSN------ID 729
Query: 576 SMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY-KVASSPKSVDIKLDKDQQCRDI 634
+P + RTY++F G +Y FG GLSYT F+Y KV+ S + D
Sbjct: 730 QLPAFDDYSMANRTYRYFKGQPLYAFGGGLSYTTFRYGKVSLSATHLHAGED-------- 781
Query: 635 NYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQ 694
T + EV N GK+ G EV VY PP +
Sbjct: 782 -------------------------LTVEAEVTNTGKVAGDEVAQVYLTPPQTSIAPRFA 816
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++GY+RV + GQS + FT++ + L VD ++G + I VG
Sbjct: 817 LVGYQRVHLLPGQSKPMRFTLHP-RELSQVDAQGVRAASAGHYEIKVG 863
>gi|424792251|ref|ZP_18218496.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422797157|gb|EKU25539.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 909
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 172/440 (39%), Positives = 235/440 (53%), Gaps = 50/440 (11%)
Query: 31 RMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATS 90
+MT EKV Q + A +PRLG+P YEWW+E LHG++ G AT
Sbjct: 67 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------------ATV 110
Query: 91 FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSPNINVVRDP 141
FP I A++N +L +++G STEARA +NL AGLT WSPNIN+ RDP
Sbjct: 111 FPQAIGLAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDP 170
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RWGR +ET GEDPY+ G+ A+ ++ GLQ D + P I A KH A + +
Sbjct: 171 RWGRGMETYGEDPYLTGQLAVGFIHGLQG--------DDLTHPRTI-ATPKHLAVH---S 218
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
R FD V+ D++ T+ F + +G SVMC+YN ++G P CA LLN
Sbjct: 219 GPEPGRHGFDVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGR 278
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAV 321
+RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y + A+
Sbjct: 279 LRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDLGK-AI 336
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIELAAEAARQ 377
+G EA +D SL L+ RLG PQ Y LG ++ + H LA +AA+Q
Sbjct: 337 ARGDADEALLDKSLVRLFAARYRLGEL--QPQRKDPYARLGAKDVDSAAHRALALQAAQQ 394
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
IVLL+N N LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 395 SIVLLQNRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAA 452
Query: 438 N--YAPGCADIVCQNNSMIP 455
N YA G A + + MIP
Sbjct: 453 NVRYAQG-APLAAGVSGMIP 471
Score = 132 bits (331), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 136/285 (47%), Gaps = 45/285 (15%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR DL LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 651 VEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQH 709
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
W YPG+ GG AIA V+ G NPGGRLP+T+Y + Y S ++
Sbjct: 710 ADAIVAAW--YPGQSGGTAIAQVLAGDVNPGGRLPVTFYRSTKDLPAYVSYDMK------ 761
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++ FG GLSYT+F Y A + ++ Q R
Sbjct: 762 GRTYRYFKGEPLFAFGSGLSYTRFTY-AAPQLSATTLQAGAHLQVR-------------- 806
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
+V N G G EVV VY + P A + ++ ++G++RV + G
Sbjct: 807 -----------------TQVRNSGTRAGDEVVQVYLEFPQRAQSPLRTLVGFQRVTLQPG 849
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
++ V F + A + L VD A + G + + VG G G P
Sbjct: 850 EARDVSFEL-APRQLSDVDRAGQRAVQPGDYRVFVGGGQPGTGAP 893
>gi|262405981|ref|ZP_06082531.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|345510488|ref|ZP_08790055.1| beta-glucosidase [Bacteroides sp. D1]
gi|262356856|gb|EEZ05946.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|345454434|gb|EEO48987.2| beta-glucosidase [Bacteroides sp. D1]
Length = 735
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 223/760 (29%), Positives = 354/760 (46%), Gaps = 97/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
Y DAK P +R DL+ RMTL EKV Q+ G VP +G +Y
Sbjct: 30 YKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89
Query: 57 --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
++ + R P +D+ T +P + S+N L ++ +
Sbjct: 90 INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G +A VRG
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +++AC KHY Y G D + + ++ Q + +T++LP+EM V G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
++M S+N ++G+P A+P ++ + ++ W G+IVSD +++ + ++ L TK+
Sbjct: 254 -APTLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
DA AGL++D + Y V++GK+ A +D S+R + V RLG F+
Sbjct: 311 DAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K+ PQ + +AA+ A + +VLLKNDN LPL N K +A+VGP A ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKKIAVVGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G DG A + YA GC + S A+D A+ +D +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKP-QGNDRSGFAGALDVARWSDVVI 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G L+ E R + LP Q EL+ ++ +A K PV LV+ + +++N + P
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVLSNGRPLELN--RMEPL 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
+IL + PG G R++A ++ G+ NP G+L +T+ PY++ +P+
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596
Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
GR ++ F + +YPFG+GLSYT+FKY GT
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
P V D K + ++ V N G DG+E V + P + T +K++ +E+
Sbjct: 631 PSATKVKRGD------KLSAEVTVTNTGSRDGAETVHWFISDPYCSITRPVKELKHFEKQ 684
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
I AG++ F ++ + V+ L +G + ILV
Sbjct: 685 LIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|222099590|ref|YP_002534158.1| Beta-mannanase [Thermotoga neapolitana DSM 4359]
gi|2429092|gb|AAB70867.1| beta-xylosidase [Thermotoga neapolitana]
gi|221571980|gb|ACM22792.1| Beta-mannanase [Thermotoga neapolitana DSM 4359]
Length = 778
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 238/815 (29%), Positives = 371/815 (45%), Gaps = 160/815 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYGVPRLG 52
Y D P R KDL+ RMTL EK+ Q+G L G+ ++
Sbjct: 4 YRDPSQPVEVRVKDLLSRMTLEEKIAQLGSVWGYELIDERGKFKREKAKDLLKNGIGQIT 63
Query: 53 LP---LYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
P EA V+ I R R P H + G T+FP I +
Sbjct: 64 RPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMAS 123
Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
+++ L +K+ + + R + + GL +P ++V RDPRWGR ET GE PY+V R
Sbjct: 124 TWDPDLIEKMTAAIREDMRKLG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVAR 178
Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVT 215
++YV+GLQ ++ + A KH+A Y NW + +
Sbjct: 179 MGVSYVKGLQ----------GENIKEGVVATVKHFAGYSASEGGKNWAPTN-------IP 221
Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
E++ +E F+ PFE V E V SVM SY+ ++G+P A+ +LL +R DW F G +VSD
Sbjct: 222 EREFREVFLFPFEAAVKEARVLSVMNSYSEIDGVPCAANRRLLTDILRKDWGFEGIVVSD 281
Query: 276 CDSIQTIVESHKFLNDTKEDAVARVLKAGLDL-----DCGDYYTNFTMGAVQQGKIAEAD 330
++ + E H+ D E A L+AG+D+ DC + + V++G + E+
Sbjct: 282 YFAVNMLGEYHRIAKDKSESA-RLALEAGIDVELPKTDCYQHLKDL----VEKGIVPESL 336
Query: 331 IDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALP 390
ID ++ + + LG F+ Y ++ K I H +LA E AR+ I+LLKND G LP
Sbjct: 337 IDEAVSRVLKLKFMLGLFENP--YVDVEKAKI--ESHRDLALEIARKSIILLKND-GTLP 391
Query: 391 LNTGNIKTLALVGPHANATKAMIGNYE----------------GTPC------------- 421
L K +AL+GP+A + ++G+Y G P
Sbjct: 392 LQKN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSI 449
Query: 422 -----RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---- 472
S +D F YA GC ++ ++ S AI+ AK +D ++V G
Sbjct: 450 EEHMKSIPSVLDAFKEEGIDFEYAKGC-EVTGEDRSGFKEAIEVAKRSDVAIVVVGDRSG 508
Query: 473 --LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
LD + E +D +L LPG Q EL+ ++A K PV LV+++ + + ++
Sbjct: 509 LTLDCTT-GESRDMANLKLPGVQEELVLEIAKTGK-PVVLVLITGRPYSLKNLVD--RVN 564
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRT 589
+IL V PGE GGRAI DVI+GK NP G+LPI++ A + + + P +++ G
Sbjct: 565 AILQVWLPGEAGGRAIVDVIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGDY 624
Query: 590 YKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
P ++PFG+GLSYT+F+Y + PK V P V
Sbjct: 625 VDESTKP-LFPFGHGLSYTRFEYSNLRIEPKEV---------------------PSAGEV 662
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQ 707
+I +++VEN+G MDG EVV +Y + T +K++ G++RV + A +
Sbjct: 663 VI------------KVDVENVGDMDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKE 710
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V F ++ L D ++ G ++VG
Sbjct: 711 KKTVVFRLH-TDVLAYYDRDMKLVVEPGEFRVMVG 744
>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
Length = 771
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 216/727 (29%), Positives = 334/727 (45%), Gaps = 124/727 (17%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ EA HG IG AT+FPT +++N L +++
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------------ATTFPTAPGQASTWNPELIERM 161
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G+ ++ E R G + P +++VRDPRW R E+ GED Y+ R YVRG
Sbjct: 162 GKVIAAEIRL-----QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGTG 216
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ S SR + KH+ AY N + + E++++ET++ PFE
Sbjct: 217 SGD------LSQSR--HALSTLKHFIAYGASEGGQNGGSNL---LGERELRETYLPPFEA 265
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V G SVM +YN V+GIP A+ ++L +RG+W F G++VSD SI+ + E+H
Sbjct: 266 AVKAG-ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVA 324
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
+E AV + L+AG+D D A + G +AEA+ID ++ + + +G F+
Sbjct: 325 GSVREAAV-QALRAGVDADLKGGAFASLREAAEAGDVAEAEIDRAVERVLALKFEMGLFE 383
Query: 350 GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
+P + H ELA EAARQ + LL+N +G LPL+ ++ +A++GP+A+
Sbjct: 384 -NPYIDEAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNI 442
Query: 410 KAMIGNYEGTPCRYTSPMDG---FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADA 466
+G+Y + DG +V+ Y+ GC + + S I AA+ AA+ DA
Sbjct: 443 YNQLGDYTAQQTAANTVRDGLEKLLGRDRVV-YSRGCT-VRGGDRSEIAAAVSAARGTDA 500
Query: 467 TVIVAG----LDLSVE-------------------AEGKDRVDLLLPGFQTELINKVADA 503
V+V G D E EG DR L L G Q EL+ ++ A
Sbjct: 501 AVVVIGGSSARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRI-KA 559
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
P+ +V ++ +D+ A + W YPG GG A+A+ I G+ NP GRLPIT
Sbjct: 560 TGTPLIVVCIAGRPLDLRRASEQADALLMAW--YPGARGGDAVAETILGRNNPAGRLPIT 617
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
A +IP RP N+ Y +YPFGYGLSY+ F+Y + +S D
Sbjct: 618 IPRAEG-QIPVYYNKKRPANH----DYTDLTAAPLYPFGYGLSYSTFEYGSLEARQSGDN 672
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-- 681
L +V C+ + N +G EVV +Y
Sbjct: 673 VL--------------------------EVSCR---------IRNTSDREGDEVVQLYIS 697
Query: 682 ------SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASG 735
+PP +Q+ G+ R+ +A G+ +V FT+ ++L ++D ++ G
Sbjct: 698 DMVASTVRPP-------RQLGGFRRIRLAPGEQRQVSFTLGD-EALALIDPQGRRVVEKG 749
Query: 736 AHTILVG 742
I VG
Sbjct: 750 DFVIAVG 756
>gi|148269983|ref|YP_001244443.1| glycoside hydrolase family 3 protein [Thermotoga petrophila RKU-1]
gi|147735527|gb|ABQ46867.1| glycoside hydrolase, family 3 domain protein [Thermotoga petrophila
RKU-1]
Length = 778
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 244/812 (30%), Positives = 375/812 (46%), Gaps = 154/812 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYG---VP 49
Y D P R +DL+ RMTL EK Q+G L G V
Sbjct: 4 YRDPSQPIEVRVRDLLSRMTLEEKAAQLGSVWGYELIDERGKFSREKAKELLKNGIGQVT 63
Query: 50 RLGLPLYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
R G EA V+ I R R P H + G T+FP I +
Sbjct: 64 RPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMAS 123
Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
+++ L +K+ + + R + + GL +P ++V RDPRWGR ET GE PY+V R
Sbjct: 124 TWDPDLIEKMTTAIREDMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVAR 178
Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDM 219
++YV+GLQ G + + + A KH+A Y EG + + + E++
Sbjct: 179 MGVSYVKGLQ---GEDIKKG-------VVATVKHFAGYSAS--EGGKNWA-PTNIPEREF 225
Query: 220 QETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSI 279
+E F+ PFE V E +V SVM SY+ ++G+P A+ KLL +R DW F G +VSD ++
Sbjct: 226 KEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFKGIVVSDYFAV 285
Query: 280 QTIVESHKFLNDTKEDAVARVLKAGLDLD-----CGDYYTNFTMGAVQQGKIAEADIDTS 334
+ + + H+ D K +A L+AG+D++ C Y + V++G I+EA ID +
Sbjct: 286 KVLEDYHRIARD-KSEAARLALEAGIDVELPKTECYQYLKDL----VEKGIISEALIDEA 340
Query: 335 L-RFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNT 393
+ R L + M LG F+ Y + K I H ++A + AR+ I+LLKND G LPL
Sbjct: 341 VARVLRLKFM-LGLFENP--YVEVEKAKI--ESHKDIALDIARKSIILLKND-GILPLQK 394
Query: 394 GNIKTLALVGPHANATKAMIGNYE----------------GTPC---------------- 421
K +AL+GP+A + ++G+Y G P
Sbjct: 395 N--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452
Query: 422 --RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG------L 473
S +D F YA GC ++ ++ S AI+ AK +D ++V G L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
D + E +D +L LPG Q EL+ +VA K PV LV+++ + + K+ +IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPYSLKNVVD--KVNAIL 567
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKF 592
V PGE GGRAI D+I+GK NP G+LPI++ A + + + P +++ G
Sbjct: 568 QVWLPGEAGGRAIVDIIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGDYVDE 627
Query: 593 FDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
P ++PFG+GLSYT+F+Y + PK V PP V+I
Sbjct: 628 STKP-LFPFGHGLSYTKFEYSNLRIEPKEV---------------------PPAGEVVI- 664
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAK 710
+++VEN G DG EVV +Y + T +K++ G++RV + A +
Sbjct: 665 -----------KVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V F ++ L D ++ G ++VG
Sbjct: 714 VVFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744
>gi|312794525|ref|YP_004027448.1| glycoside hydrolase family 3 domain-containing protein
[Caldicellulosiruptor kristjanssonii 177R1B]
gi|312181665|gb|ADQ41835.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
kristjanssonii 177R1B]
Length = 770
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 212/710 (29%), Positives = 350/710 (49%), Gaps = 113/710 (15%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP I +F+ + +++ + + T+ +A+ +P I+V RD RWGRV
Sbjct: 102 GATVFPQSIGVACTFDNEIVEELAKVIRTQMKAV-----GAHQALAPLIDVARDARWGRV 156
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NW 202
ET GEDPY+V A++YV+GLQ D I A KH+ Y + NW
Sbjct: 157 EETFGEDPYLVANMAVSYVKGLQ----------GDDIKDGIVATGKHFVGYAMSEGGMNW 206
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
+ E++++E ++ PFE+ V + S+M +Y+ ++GIP A+ KLL
Sbjct: 207 A-------PVHIPERELREVYLYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTDIA 259
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGA 320
RG+W F G VSD ++ +++ HK + T E+A A L AGLD++ + +T + A
Sbjct: 260 RGEWGFDGIYVSDYSGVKNLLDYHKSVK-TYEEAAALSLWAGLDIELPKIECFTEEFIKA 318
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC-NPQHIELAAEAARQGI 379
+++GK A +D +++ + + RLG FD +P K G + N + +L+ + A++ +
Sbjct: 319 LKEGKFDMALVDAAVKRVLEMKFRLGLFD-NPYIKTEGVVELFDNKEQRQLSRKVAQESM 377
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYA------- 432
VLLKND+ LPL+ ++K +A++GP+AN+ + ++G+Y P + + ++ F+
Sbjct: 378 VLLKNDS-FLPLSK-DLKKIAVIGPNANSVRNLLGDY-SYPA-HIATLEMFFIKEDRGVG 433
Query: 433 -----YSKVIN-------------------YAPGCADIVCQNNSMIPAAIDAAKNADATV 468
VIN YA GC D+ Q+ S A AA+ ADA +
Sbjct: 434 NEEEFVKNVINMKSIFEAIKDKVSSNTEVVYAKGC-DVNSQDKSGFEEAKKAAEGADAVI 492
Query: 469 IV----AGLDLS-VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINF 522
+V AGL L E +DR L LPG Q +L+ ++ P T+V++ G V +++
Sbjct: 493 LVVGDKAGLRLDCTSGESRDRASLRLPGVQEDLVKEIVSV--NPNTVVVLVNGRPVALDW 550
Query: 523 AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRP 581
N +K++L +PGEEG A+AD++FG YNPGG+L I++ + V + Y P
Sbjct: 551 IMEN--VKAVLEAWFPGEEGADAVADILFGDYNPGGKLAISFPRDVGQVPVYYGHKPSGG 608
Query: 582 VNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
+ + G + P++ PFGYGLSYT F+YK N+ +
Sbjct: 609 KSCWHGDYVEMSTKPLL-PFGYGLSYTTFEYK---------------------NFAIEKE 646
Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYER 700
K D +EVEN GK +G E+V +Y++ T +K++ GY+R
Sbjct: 647 KIGM-----------DESIKVSVEVENTGKYEGDEIVQLYTRKEEYLVTRPVKELKGYKR 695
Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
V + G+ KV F + D N ++ G +++G + F
Sbjct: 696 VHLKPGEKKKVVFELYP-DLFAFYDYDMNRVVTPGVVEVMIGASSEDIKF 744
>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
Length = 771
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 225/760 (29%), Positives = 345/760 (45%), Gaps = 131/760 (17%)
Query: 22 PERAKDLVERMTLPEKV-----QQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSP 76
P KDL R LP ++ +M A RLG+PL+ EA HG IG
Sbjct: 89 PWTQKDL--RTGLPPQLAARLANRMQRYAVQHSRLGIPLF-LAEEAPHGHMAIG------ 139
Query: 77 PGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNIN 136
AT+FPT +++N L +++G+ ++ E R G + P ++
Sbjct: 140 -----------ATTFPTAPGQASTWNPELIERMGKVIAAEIRL-----QGGHICYGPVLD 183
Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
+VRDPRW R E+ GED Y+ R YVRG + S SR + KH+ A
Sbjct: 184 IVRDPRWSRTEESYGEDCYLTARIGEAYVRGTGSGD------LSQSR--HALSTLKHFIA 235
Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
Y N + + E++++ET++ PFE V G SVM +YN V+GIP A+ +
Sbjct: 236 YGASEGGQNGGSNL---LGERELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRR 291
Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
+L +RG+W F G++VSD SI+ + E+H +E AV + L+AG+D D
Sbjct: 292 MLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREAAV-QALRAGVDADLKGGAFAS 350
Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
A + G +AEA+ID ++ + + +G F+ +P + H ELA EAAR
Sbjct: 351 LREAAEAGDVAEAEIDRAVERVLALKFEMGLFE-NPYIDEAAAAEVGCAAHSELALEAAR 409
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---FYAY 433
Q + LL+N +G LPL+ ++ +A++GP+A+ +G+Y + DG
Sbjct: 410 QSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLEKLLGR 469
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE----------- 478
+V+ Y+ GC + + S I AA+ AA+ DA V+V G D E
Sbjct: 470 DRVV-YSRGCT-VRGGDRSEIAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAAKAAH 527
Query: 479 --------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIK 530
EG DR L L G Q EL+ ++ A P+ +V ++ +D+ A
Sbjct: 528 DEVRDMECGEGFDRATLALLGEQEELLRRI-KATGTPLIVVCIAGRPLDLRRASEQADAL 586
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
+ W YPG GG A+A+ I G NP GRLPIT A +IP RP N+ Y
Sbjct: 587 LMAW--YPGARGGDAVAETILGHNNPAGRLPITIPRAEG-QIPVYYNKKRPANH----DY 639
Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+YPFGYGLSY+ F+Y + +S D L
Sbjct: 640 TDLTAAPLYPFGYGLSYSTFEYGSLEARQSGDNVL------------------------- 674
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--------SKPPGIAGTHIKQVIGYERVF 702
+V C+ + N +G EVV +Y +PP +Q+ G+ R+
Sbjct: 675 -EVSCR---------IRNTSDREGDEVVQLYISDMVASTVRPP-------RQLGGFRRIR 717
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+A G+ +V FT+ ++L ++D ++ G I VG
Sbjct: 718 LAPGEQRQVSFTLGD-EALSLIDPQGRRVVEKGDFVIAVG 756
>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
Length = 939
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 233/808 (28%), Positives = 369/808 (45%), Gaps = 140/808 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
Y D R +DL+ +MTL EK QM L YG R+ LP EW
Sbjct: 50 YEDPNATLDARIEDLLSQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGAI 108
Query: 59 --------------------WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
W + H + FI P + + G
Sbjct: 109 DEHLNGFQQWGLPPSDNPNVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVESY 168
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L ++G EAR + G T ++P ++V RD RWGR
Sbjct: 169 RATNFPTQLGLGHTWNRKLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 222
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRG+Q H +++A KH+ AY +
Sbjct: 223 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QVAATGKHFVAYSNNKGARE 269
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + PF+ + E + VM SYN +GIP L + +RG+
Sbjct: 270 GMARVDPQMSPREVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIPIQGSYYWLTKRLRGE 329
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 330 MGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
++G ++E I+ +R + V +G FD Q G + + ++ +A +A+R+ ++
Sbjct: 389 KEGGLSEDIINDRVRDILRVKFLIGLFDAPYQTDLAGADKEVEKAENEAVALQASRESLI 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
LLKN+N LPL+ NIKT+A+ GP+AN + +Y + ++G ++ +
Sbjct: 449 LLKNENNVLPLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAEV 508
Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
YA GC D+V N + I A++ A+ AD V+V G E K
Sbjct: 509 LYAKGC-DLVDANWPESELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENK 567
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +G
Sbjct: 568 SRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILETWYPGSKG 624
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G A+ADV+FG YNPGG+L +T + + +IP+ + P +P + G DG +
Sbjct: 625 GTAVADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRVNG 682
Query: 598 -VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+YPFGYGLSYT F+Y + SPK +
Sbjct: 683 SLYPFGYGLSYTTFEYSNIEISPK---------------------------------MMT 709
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+ K T + +V N GK G EVV +Y + T+ K + G+ERV + G++ +V F
Sbjct: 710 ANQKATVRCKVTNTGKRAGDEVVQLYIRDMLSSVTTYEKNLAGFERVHLQPGETKEVTFI 769
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
++ K L+++D ++ G +I+VG
Sbjct: 770 LDR-KHLELLDKHMEWVVEPGDFSIMVG 796
>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 861
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 170/445 (38%), Positives = 238/445 (53%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +R +DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G++G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E D+R
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DAR 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 180 YDKLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++AG DL+CG Y + AV+ G I E +ID SL+ L LG D + + +
Sbjct: 296 VRAGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 129/291 (44%), Gaps = 53/291 (18%)
Query: 463 NADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVI 512
+AD + G+ S+E E G DR D+ LP Q +L+ + A K +V
Sbjct: 597 DADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK---KVVF 653
Query: 513 MSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKI 572
++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 654 INYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK------ 707
Query: 573 PYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
+P + GRTY++ ++PFG+GLSYT F Y A KL K+ +
Sbjct: 708 DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLSKNTIAK 759
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI 692
N I V N+G+ DG EVV VY + PG
Sbjct: 760 GEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPGDKEGPR 795
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ ++RV I AG++ V + ++ + D +N++ G + +L G
Sbjct: 796 YTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPLEGTYELLYG 845
>gi|182415162|ref|YP_001820228.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
gi|177842376|gb|ACB76628.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
PB90-1]
Length = 747
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 223/744 (29%), Positives = 340/744 (45%), Gaps = 100/744 (13%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV--- 66
+ P+ D +LP +R DL+ RMTL EK+ M + VPRLG+ E HGV
Sbjct: 30 TGLPFQDPELPAEQRIDDLIGRMTLEEKIDCMA-MRAAVPRLGVKGSRH-IEGYHGVAQG 87
Query: 67 --SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-- 122
S GRR + T FP A+++ L +++ + EAR ++
Sbjct: 88 GPSNWGRRNPT-----------ATTQFPQAYGLGATWDPELIRQVAAQEAEEARYLFQSP 136
Query: 123 -LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
AGL +PN ++ RDPRWGR E GEDP+ G A +VRGLQ D
Sbjct: 137 RYDRAGLIVRAPNADLARDPRWGRTEEVYGEDPFHAGTLATAFVRGLQ---------GDD 187
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
R K + KH+ L N + R S +E+ +E + PFEM + +G ++M
Sbjct: 188 PRYFKAVSLVKHF----LANSNEDGRESSSSNFSERQWREYYAKPFEMAIVDGGAPALMA 243
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+YN VNG P P +L + +W +G + +D ++ +VE H D A A +
Sbjct: 244 AYNAVNGTPAHVHP-MLRDIVMAEWKLNGILCTDGGGLRLLVEKHHAFPDLP-SAAAACV 301
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGK 359
KAG++ D + + AV +G I E D+D +LR L+ V ++LG D + Y +G+
Sbjct: 302 KAGIN-HFLDRHKDAVTEAVARGSITERDLDAALRGLFRVSLKLGLLDPDERVPYAAIGR 360
Query: 360 NN----ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
N P L + ++ IVLLKN LPL+ +KT+ALVGP N +
Sbjct: 361 NGEAEPWLRPDTQALVRKVTQRSIVLLKNSGALLPLDRTKVKTVALVGPLVNTV--LPDW 418
Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD- 474
Y GTP P G + G V M AA++ A+ ++ ++ G D
Sbjct: 419 YGGTPPYTVPPSIG-------VEKVAGEGVKVGWLADMGDAAVELARTSEIAIVCVGNDP 471
Query: 475 --------LSVEAEGK---DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
+ +EGK DR DL LP Q + I +V A P T+V++ + NF
Sbjct: 472 ISAGGWELVRTPSEGKEAVDRKDLALPRDQEKFIRRV--LAANPRTIVVLIS-----NFP 524
Query: 524 KNNP----KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
P + +I+ + + +E G A+ DV++G+ NP G+L TW P + L
Sbjct: 525 YAMPWVVKHVPAIVHLTHASQELGHALGDVLWGEVNPDGKLAQTW--------PKSLKQL 576
Query: 580 RPVNNFP---GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
P+ ++ GRTY++F G +PFG+GLSYT F L + D+
Sbjct: 577 PPMMDYDLTHGRTYQYFKGEPQFPFGFGLSYTTF-------------NLSNLRVGLDVAR 623
Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQV 695
VG A + + +EV N G G EVV VY++ P +KQ+
Sbjct: 624 HVGAGAETPAESPAPRTFAPNAILSIAVEVTNTGTRAGDEVVQVYARYPHSKVSRPLKQL 683
Query: 696 IGYERVFIAAGQSAKVGFTMNACK 719
G++R+ +AAG++A V + A +
Sbjct: 684 CGFQRISVAAGETAHVRLQLPASR 707
>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
17393]
gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 943
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 231/808 (28%), Positives = 368/808 (45%), Gaps = 140/808 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
Y D R +DL+ +MTL EK QM L YG R+ LP EW
Sbjct: 53 YEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGAI 111
Query: 59 --------------------WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
W + H + FI P + + G
Sbjct: 112 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGIESY 171
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWGR
Sbjct: 172 RATNFPTQLGLGHTWNRELIRQVGLITGREARIL------GYTNVYAPILDVGRDQRWGR 225
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRG+Q H +++A KH+ AY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QVAATGKHFVAYSNNKGARE 272
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + PF+ + E + VM SYN +G+P L +RG+
Sbjct: 273 GMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGE 332
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
++G ++E I+ +R + V +G FD Q G + + ++ LA +A+R+ +V
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKAENESLALQASRESLV 451
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
LLKN+N LPL+ N+K +A+ GP+A+ + +Y T+ ++G S+ +
Sbjct: 452 LLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKSEGKAEV 511
Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
Y GC D+V N + I A++ A+ AD V+V G E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVVLGGGQRTCGENK 570
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +G
Sbjct: 571 SRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKG 627
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G A+ADV+FG YNPGG++ +T + + +IP+ + P +P + G DG +
Sbjct: 628 GTAVADVLFGDYNPGGKMTVT-FPKSVGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRVNG 685
Query: 598 -VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+Y FGYGLSYT F+Y + SPK V
Sbjct: 686 ALYSFGYGLSYTTFEYSGIEISPK---------------------------------VIT 712
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+ K T + +V N GK G EVV +Y + T+ K + G+ER+ + G++ +V FT
Sbjct: 713 PNQKATVRCKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFT 772
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
++ K L+++D ++ G +I+VG
Sbjct: 773 LDR-KQLELLDKHMEWVVEPGDFSIMVG 799
>gi|423293434|ref|ZP_17271561.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
CL03T12C18]
gi|392678377|gb|EIY71785.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
CL03T12C18]
Length = 735
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 220/760 (28%), Positives = 355/760 (46%), Gaps = 97/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
Y DAK P +R DL+ RMTL EK+ Q+ G VP +G +Y
Sbjct: 30 YKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89
Query: 57 --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
++ + R P +D+ T +P + S+N L ++ +
Sbjct: 90 INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G +A VRG
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +++AC KHY Y G D + + ++ Q + +T++LP+EM V G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P A+P ++ + ++ W G+IVSD +++ + ++ L TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
DA AGL++D + Y V++GK+ A +D S+R + V RLG F+
Sbjct: 311 DAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K+ PQ + +AA+ A + +VLLKN+N LPL N K +A+VGP A ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNNNQILPLT--NKKKIAVVGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G DG A + YA GC + S A+D A+ +D +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKP-QGNDRSGFAGALDVARWSDVVI 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G L+ E R + LP Q EL+ ++ +A K P+ LV+ + +++N + P
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVLSNGRPLELN--RMEPL 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
+IL + PG G R++A ++ G+ NP G+L +T+ PY++ +P+
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596
Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
GR ++ F + +YPFG+GLSYT+FKY GT
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
P V D K + ++ V N G DG+E V + P + T +K++ +E+
Sbjct: 631 PSATKVKRGD------KLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
FI G++ F ++ + V+ L +G + ILV
Sbjct: 685 FIKVGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
Length = 957
Score = 279 bits (714), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 230/766 (30%), Positives = 365/766 (47%), Gaps = 113/766 (14%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM--GDLAYGVPRLGLPLYEW 58
+ E V L + Y + LP R +DL+ MT+ +K++ + G G+P LG+P
Sbjct: 157 KAEIANVPLKERAYMNPNLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK 216
Query: 59 WSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEAR 118
EA+HG S+ G+ GAT FP I A++N+ L + + E
Sbjct: 217 -VEAIHGFSY---------GS-------GATIFPQSIGMGATWNKRLIEAAAMAIGDETV 259
Query: 119 AMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
+ NA + WSP ++V +D RWGR ET GEDP +V +++G Q
Sbjct: 260 S----ANA-VQAWSPVLDVAQDARWGRCEETYGEDPVLVTEIGGAWIKGYQ--------- 305
Query: 179 DSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSS 238
S+ L + KH+AA+ R D ++E++M+E ++PF + S
Sbjct: 306 ---SKGLMTTP--KHFAAH---GAPLGGRDSHDIGLSEREMREIHLVPFRDIYKKYKYQS 357
Query: 239 VMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
+M SY+ G+P +LL +R +W F G+IVSDC +I + + K +A
Sbjct: 358 IMMSYSDFLGVPVAKSKELLKGILRDEWGFDGFIVSDCGAIGNLTARKHYTAVDKVEAAR 417
Query: 299 RVLKAGLDLDCGDYYTN-FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
+ L AG+ +CGD Y + + A ++G++ D+D + + L L R G F+ +P K L
Sbjct: 418 QALAAGIATNCGDTYNDPDVIAAAKRGELNMDDLDFTCKTLLRTLFRNGLFENNP-CKPL 476
Query: 358 GKNNIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
N I +P+H LA + A++ IVLL+N LPL+ ++KT+A++GP A+ +
Sbjct: 477 DWNKIYPGWNSPEHQALARKTAQESIVLLENKGNILPLSK-SLKTIAVIGPGADNLQPGD 535
Query: 414 GNYEGTPCRYTSPMDGFYA---YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
+ P + S + G A S + Y GC I + I A+ AA+NAD V+V
Sbjct: 536 YTSKPQPGQLKSVLTGIKAAVNSSTKVLYEEGCRFIGTEGTD-IAKAVKAAENADVAVLV 594
Query: 471 AGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
G + EA E D L+LPG Q +L+ V K PV L++ + +++
Sbjct: 595 LGDCSTSEALKGITNTSGENHDLATLILPGEQQKLLEAVCKTGK-PVVLILQAGRPYNLS 653
Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
+A N + + W+ PG+EGG A ADV+FG YNP GRLP+T+ P + L
Sbjct: 654 YAAENCQAVLVNWL--PGQEGGYATADVLFGDYNPAGRLPMTF--------PRDAAQLPL 703
Query: 582 VNNF--PGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
NF GR Y + D P +Y FGYGLSYT F Y ++I L+K+ +N T
Sbjct: 704 YYNFKTSGRVYDYVDMPYYPLYQFGYGLSYTSFNY------SDLNISLEKNGNV-SVNAT 756
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVI 696
V N GK+ G EVV +Y + T + ++
Sbjct: 757 ----------------------------VTNTGKVAGDEVVQLYITDMYASVKTRVMELK 788
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++RV++ G+S KV F + + L ++++ + ++ G I+VG
Sbjct: 789 DFDRVYLNPGESKKVSFVLTPYQ-LSLLNDEMDRVVEKGLFKIMVG 833
>gi|281412136|ref|YP_003346215.1| glycoside hydrolase family 3 domain protein [Thermotoga
naphthophila RKU-10]
gi|281373239|gb|ADA66801.1| glycoside hydrolase family 3 domain protein [Thermotoga
naphthophila RKU-10]
Length = 778
Score = 279 bits (714), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 243/816 (29%), Positives = 374/816 (45%), Gaps = 162/816 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYG---VP 49
Y D P R +DL+ RMTL EK Q+G L G V
Sbjct: 4 YRDPSQPIEVRVRDLLSRMTLEEKAAQLGSVWGYELIDERGKFSREKAKELLKNGIGQVT 63
Query: 50 RLGLPLYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
R G EA V+ I R R P H + G T+FP I +
Sbjct: 64 RPGGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMAS 123
Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
+++ L +K+ + + R + + GL +P ++V RDPRWGR ET GE PY+V R
Sbjct: 124 TWDPDLIEKMTTAIREDMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVAR 178
Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVT 215
++YV+GLQ G + + + A KH+A Y NW + +
Sbjct: 179 MGVSYVKGLQ---GEDIKKG-------VVATVKHFAGYSASEGGKNWAPTN-------IP 221
Query: 216 EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSD 275
E++ +E F+ PFE V E +V SVM SY+ ++G+P A+ KLL +R DW F G +VSD
Sbjct: 222 EREFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFKGIVVSD 281
Query: 276 CDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-----CGDYYTNFTMGAVQQGKIAEAD 330
+++ + + H+ D K +A L+AG+D++ C Y + V++G I+EA
Sbjct: 282 YFAVKVLEDYHRIARD-KSEAARLALEAGIDVELPKTECYQYLKDL----VEKGIISEAL 336
Query: 331 IDTSL-RFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGAL 389
ID ++ R L + M LG F+ Y + K I H ++A + AR+ I+LLKND G L
Sbjct: 337 IDEAVARVLRLKFM-LGLFENP--YVEVEKAKI--ESHKDIALDIARKSIILLKND-GIL 390
Query: 390 PLNTGNIKTLALVGPHANATKAMIGNYE----------------GTPC------------ 421
PL K +AL+GP+A + ++G+Y G P
Sbjct: 391 PLQKN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKS 448
Query: 422 ------RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG--- 472
S +D F YA GC ++ ++ S AI+ AK +D ++V G
Sbjct: 449 IEEHMKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKS 507
Query: 473 ---LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
LD + E +D +L LPG Q EL+ +VA K PV LV+++ + + K+
Sbjct: 508 GLTLDCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPYSLKNVVD--KV 563
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGR 588
+IL V PGE GGR+I D+I+GK NP G+LPI++ A + + + P +++ G
Sbjct: 564 NAILQVWLPGEAGGRSIVDIIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGD 623
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
P ++PFG+GLSYT+F+Y + PK V PP
Sbjct: 624 YVDESTKP-LFPFGHGLSYTKFEYSNLRIEPKEV---------------------PPAGE 661
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAG 706
V+I +++VEN G DG EVV +Y + T +K++ G++RV + A
Sbjct: 662 VVI------------KVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAK 709
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ V F ++ L D ++ G ++VG
Sbjct: 710 EKKTVVFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744
>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
Length = 942
Score = 279 bits (714), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 231/808 (28%), Positives = 368/808 (45%), Gaps = 140/808 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
Y D R +DL+ +MTL EK QM L YG R+ LP EW
Sbjct: 53 YEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGAI 111
Query: 59 --------------------WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
W + H + FI P + + G
Sbjct: 112 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVESY 171
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L ++IG EAR + G T ++P ++V RD RWGR
Sbjct: 172 RATNFPTQLGLGHTWNRELIRQIGLITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRG+Q H +++A KH+ AY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QVAATGKHFVAYSNNKGARE 272
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + PF+ + E + VM SYN +G+P L +RG+
Sbjct: 273 GMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGE 332
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
++G ++E I+ +R + V +G FD Q G + + ++ LA +A+R+ +V
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLAGADKEVEKAENESLALQASRESLV 451
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
LLKN+N LPL+ N+K +A+ GP+A+ + +Y T+ ++G ++ +
Sbjct: 452 LLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEV 511
Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
Y GC D+V N + I A++ A+ AD V+V G E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENK 570
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +G
Sbjct: 571 SRSSLELPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKG 627
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G A+ADV+FG YNPGG+L +T + + +IP+ + P +P + G DG +
Sbjct: 628 GTAVADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRVNG 685
Query: 598 -VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+Y FGYGLSYT F+Y + SPK V
Sbjct: 686 ALYSFGYGLSYTTFEYSDIEISPK---------------------------------VIT 712
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+ K T + +V N GK G EVV +Y + T+ K + G+ER+ + G++ +V FT
Sbjct: 713 PNQKATVRCKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFT 772
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
++ K L+++D ++ G +I++G
Sbjct: 773 LDR-KQLELLDKHMEWVVEPGDFSIMIG 799
>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
Length = 877
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 173/465 (37%), Positives = 245/465 (52%), Gaps = 50/465 (10%)
Query: 2 FESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSE 61
+ + ++ Y D L RA DLV RMTL EK Q+G A +PRLG+P Y WW+E
Sbjct: 11 LDPVPADVAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWNE 70
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
LHGV+ G AT FP I A+++E + +G VSTE RA Y
Sbjct: 71 GLHGVARAGV----------------ATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKY 114
Query: 122 ------NLGN---AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE 172
+ G GLT WSPNIN+ RDPRWGR ET GEDPY+ R I Y+ GLQ
Sbjct: 115 VERVHPDGGTDWYRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQ--- 171
Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
+D + K A KH+A + ++R D ++ D+++T++ F V
Sbjct: 172 ------GNDPKFFKTVATSKHFAVHSGPE---SNRHKEDVYPSKFDLEDTYLPAFRATVT 222
Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-ESHKFLND 291
EG SVMC YN V G+P CA L+ + +R +W F G++VSDC + I E
Sbjct: 223 EGKAYSVMCVYNAVYGVPGCASDFLMEEKLRQNWGFPGFVVSDCGAAANIFREDALHYTK 282
Query: 292 TKEDAVARVLKAGLDLDCGDYYTNFT------MGAVQQGKIAEADIDTSLRFLYIVLMRL 345
T E+ VA LKAG+DL CGDY + + AV+ G++ A +D +L L+ +RL
Sbjct: 283 TAEEGVAVGLKAGMDLICGDYRNKMSTEVQPIINAVKAGQLPIAVVDQALVRLFEGRIRL 342
Query: 346 GYFD--GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
G FD S + ++ ++ P H +A + A++ +VLLKND G LPL KT+A++G
Sbjct: 343 GMFDPPASLPFAHITADDSDTPAHHAVALDMAKKSMVLLKND-GLLPLKA-EPKTIAVIG 400
Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGCADI 446
P+A++ A++GNY G P + + +DG A + I YA G I
Sbjct: 401 PNADSLDALVGNYYGKPSKPVTVLDGIRARFPTAKIVYAEGTGLI 445
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 146/310 (47%), Gaps = 70/310 (22%)
Query: 453 MIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVAD 502
M A+D AK AD V V GL VE E G DR + LP Q +L+ KV
Sbjct: 587 MAGQAVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKVIG 646
Query: 503 AAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPI 562
K P LV+MS A+ +N+A + + +I+ YPG EGG A+A +I G Y+P GRLP+
Sbjct: 647 TGK-PTVLVLMSGSALGVNWADKH--VPAIIEAWYPGGEGGHAVAQLIAGDYSPAGRLPV 703
Query: 563 TWYEANYVKIPYTSMPLRPVNNFPG--------RTYKFFDGPVVYPFGYGLSYTQFKYKV 614
T+Y R V+ PG RTY++F+G V+YPFG+GLSYT F Y
Sbjct: 704 TFY--------------RSVDALPGFSDYTMKNRTYRYFNGEVLYPFGHGLSYTTFAY-- 747
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
++PK + T ++V N G MD
Sbjct: 748 -ANPKVSAASVAAGSSV-----------------------------TVSVDVSNSGAMDS 777
Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
EVV +Y PG GT I+ + G++RV + G++ V F ++ ++L +VD + +
Sbjct: 778 DEVVQLYVSHPG--GTAIRSLQGFQRVSLKKGETKTVQFKLDD-RALSVVDEHGGRKVQA 834
Query: 735 GAHTILVGEG 744
G + +G G
Sbjct: 835 GQVDLWIGGG 844
>gi|383115356|ref|ZP_09936112.1| hypothetical protein BSGG_2769 [Bacteroides sp. D2]
gi|313695234|gb|EFS32069.1| hypothetical protein BSGG_2769 [Bacteroides sp. D2]
Length = 735
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 222/755 (29%), Positives = 347/755 (45%), Gaps = 87/755 (11%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
Y DAK P +R DL+ RMTL EKV Q+ G VP +G +Y
Sbjct: 30 YKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89
Query: 57 --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
++ + R P +D+ T +P + S+N L ++ +
Sbjct: 90 INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G +A VRG
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +I+AC KHY Y G D + + ++ Q + +T++LP+EM V G
Sbjct: 198 -YQGDDMSAENRIAACLKHYIGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P A+ + ++ W G+IVSD +++ + ++ L TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANHYTMTAILKERWKHDGFIVSDWGAVEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
DA AGL++D + Y V++GK+ A +D S+R + V RLG F+
Sbjct: 311 DAAWYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K+ PQ + +AA+ A + +VLLKNDN LPL N K +A+VGP A ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKRIAVVGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G DG A + YA GC + S A+D + +D +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGEAELRYAMGCKP-QGNDRSGFAGALDVVRWSDVVI 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G L+ E R + LP Q EL+ ++ +A K P+ LV+ + +++N + P
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVLSNGRPLELN--RMEPL 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPG 587
+IL + PG G R++A ++ G+ NP G+L IT+ Y + I Y R +
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAITFPYSTGQIPIYYNR---RKSGRWHQ 601
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
YK Y FGYGLSYT+F+Y V +P S +K
Sbjct: 602 GFYKDITSDPFYSFGYGLSYTEFQYGVV-TPSSTTVK----------------------- 637
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAG 706
+ K + ++ V N GK DG+E V + P + T +K++ +E+ FI G
Sbjct: 638 --------RGEKLSVEVTVTNAGKRDGAETVHWFISDPYCSITRPVKELKHFEKQFIKVG 689
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
++ F ++ + L VD L +G + I V
Sbjct: 690 ETRTFRFDVDLERDLGFVDGNGKRFLEAGEYNIWV 724
>gi|6006601|emb|CAB56857.1| beta-mannanase [Thermotoga neapolitana]
Length = 821
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 237/813 (29%), Positives = 370/813 (45%), Gaps = 160/813 (19%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGD---------------------LAYGVPRLGLP 54
D P R KDL+ RMTL EK+ Q+G L G+ ++ P
Sbjct: 49 DPSQPVEVRVKDLLSRMTLEEKIAQLGSVWGYELIDERGKFKREKAKDLLKNGIGQITRP 108
Query: 55 ---LYEWWSEALHGVSFIGR------RTNSPPGTHFDSEVP----GATSFPTVILTTASF 101
EA V+ I R R P H + G T+FP I +++
Sbjct: 109 GGSTNLEPQEAAELVNEIQRFLVEETRLGIPAMIHEECLTGYMGLGGTNFPQAIAMASTW 168
Query: 102 NESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYA 161
+ L +K+ + + R + + GL +P ++V RDPRWGR ET GE PY+V R
Sbjct: 169 DPDLIEKMTAAIREDMRKLG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMG 223
Query: 162 INYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVTEQ 217
++YV+GLQ ++ + A KH+A Y NW + + E+
Sbjct: 224 VSYVKGLQ----------GENIKEGVVATVKHFAGYSASEGGKNWAPTN-------IPER 266
Query: 218 DMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCD 277
+ +E F+ PFE V E V SVM SY+ ++G+P A+ +LL +R DW F G +VSD
Sbjct: 267 EFREVFLFPFEAAVKEARVLSVMNSYSEIDGVPCAANRRLLTDILRKDWGFEGIVVSDYF 326
Query: 278 SIQTIVESHKFLNDTKEDAVARVLKAGLDL-----DCGDYYTNFTMGAVQQGKIAEADID 332
++ + E H+ D E A L+AG+D+ DC + + V++G + E+ ID
Sbjct: 327 AVNMLGEYHRIAKDKSESA-RLALEAGIDVELPKTDCYQHLKDL----VEKGIVPESLID 381
Query: 333 TSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLN 392
++ + + LG F+ Y ++ K I H +LA E AR+ I+LLKND G LPL
Sbjct: 382 EAVSRVLKLKFMLGLFENP--YVDVEKAKI--ESHRDLALEIARKSIILLKND-GTLPLQ 436
Query: 393 TGNIKTLALVGPHANATKAMIGNYE----------------GTPC--------------- 421
K +AL+GP+A + ++G+Y G P
Sbjct: 437 KN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEE 494
Query: 422 ---RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG------ 472
S +D F YA GC ++ ++ S AI+ AK +D ++V G
Sbjct: 495 HMKSIPSVLDAFKEEGIDFEYAKGC-EVTGEDRSGFKEAIEVAKRSDVAIVVVGDRSGLT 553
Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
LD + E +D +L LPG Q EL+ ++A K PV LV+++ + + ++ +I
Sbjct: 554 LDCTT-GESRDMANLKLPGVQEELVLEIAKTGK-PVVLVLITGRPYSLKNLVD--RVNAI 609
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYK 591
L V PGE GGRAI DVI+GK NP G+LPI++ A + + + P +++ G
Sbjct: 610 LQVWLPGEAGGRAIVDVIYGKVNPSGKLPISFPRSAGQIPVFHYVKPSGGRSHWHGDYVD 669
Query: 592 FFDGPVVYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
P ++PFG+GLSYT+F+Y + PK V P V+I
Sbjct: 670 ESTKP-LFPFGHGLSYTRFEYSNLRIEPKEV---------------------PSAGEVVI 707
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSA 709
+++VEN+G MDG EVV +Y + T +K++ G++RV + A +
Sbjct: 708 ------------KVDVENVGDMDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKK 755
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V F ++ L D ++ G ++VG
Sbjct: 756 TVVFRLH-TDVLAYYDRDMKLVVEPGEFRVMVG 787
>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
Length = 863
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 173/447 (38%), Positives = 243/447 (54%), Gaps = 45/447 (10%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
P+ ++ LP ERA+DL++R+TL EKV M D + +PRLG+ Y WW+EALHGV G
Sbjct: 22 LPFNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYNWWNEALHGVGRAGL 81
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------ 125
AT FP I A+F++ ++ + VS EARA Y+
Sbjct: 82 ----------------ATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHHSENKEGSE 125
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLTFW+PN+N+ RDPRWGR ET GEDPY+ + + VRGLQ S+S+
Sbjct: 126 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQG--------PSESK 177
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KHYA + W +R FD ++ +D+ ET++ F+ V +G V VMC+
Sbjct: 178 YDKLHACAKHYALHSGPEW---NRHSFDVDSISPRDLWETYLPAFKALVQQGGVKEVMCA 234
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTKEDAVARVL 301
YNR G P C +LL +R +W F G +VSDC +I ++ H + TKE AVA +
Sbjct: 235 YNRFEGEPCCGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHPTKEAAVAAAV 294
Query: 302 KAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
KAG DLDCG DYY AV++G I E ID SL L LG D + ++
Sbjct: 295 KAGTDLDCGVDYYA--LQKAVEEGIITEKQIDVSLFRLLKARFELGLMDEEHLVSWSDIP 352
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ + +H E A E AR+ + LLKND+G LPL+ + +A++GP+AN + M GNY G
Sbjct: 353 YTVVDSEKHREKALEMARKSMTLLKNDHGTLPLSK-HCGKIAVIGPNANDSVMMWGNYNG 411
Query: 419 TPCRYTSPMDGFYAY--SKVINYAPGC 443
P + ++G ++ I Y GC
Sbjct: 412 FPSHTVTILEGITHKLGAEQIIYDKGC 438
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 90/296 (30%), Positives = 139/296 (46%), Gaps = 55/296 (18%)
Query: 460 AAKNADATVIV--AGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
AA+ DA VIV G+ VE E G DR + LP Q +L+ ++ K P
Sbjct: 594 AARVGDAEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK-P 652
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
V L++ S A I + +I+ Y G+ GG A+ADV+FG YNP GRLP+T+Y+A
Sbjct: 653 VILILCSGSA--IGLSAEVDLADAIIQAWYLGQAGGTAVADVLFGDYNPAGRLPVTFYKA 710
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
+P + GRTY++F+G ++PFGYGLSYT F+ A K
Sbjct: 711 T------EQLPDFEDYSMQGRTYRYFEGEALFPFGYGLSYTSFEIGKARLSK-------- 756
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
++ R+ N +V + ++ VEN GK+DG EV+ +Y +
Sbjct: 757 -KRIRE-NESV----------------------SLKLTVENTGKLDGDEVIQIYIRKLQD 792
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
+K + ++R + AG+ V F + D +N++ + G + IL G
Sbjct: 793 KEGPLKTLRAFKRFHLRAGEKKDVTFHLQN-DHFNFFDTESNTMRVMPGEYEILYG 847
>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 861
Score = 278 bits (712), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 169/445 (37%), Positives = 235/445 (52%), Gaps = 44/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +RA+DL+ R+TL EKV M + + +PRLG+ YEWW+EALHGV G
Sbjct: 24 LPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL-GNAG--- 127
AT FP I ASFN+SL ++ S EAR + G +G
Sbjct: 84 ----------------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLK 127
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFW+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ E Y
Sbjct: 128 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYD------ 181
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + +D+ ET++ F+ V + V VMC+
Sbjct: 182 --KLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R +W + G +VSDC +I +H D KE A A
Sbjct: 237 YNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAAA 295
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
++ G DL+CG Y + AV+ G I E +ID SL+ L LG D + + +
Sbjct: 296 VRTGTDLECGSEYASLA-DAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTS 354
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H LA AR+ +VLL+N N LPLNT ++K +A++GP+AN + GNY G P
Sbjct: 355 VLNSKEHQALALRMARESLVLLQNKNNILPLNT-HLK-VAVMGPNANDSVMQWGNYNGIP 412
Query: 421 CRYTSPMDGFYAY--SKVINYAPGC 443
+ ++ A I Y PGC
Sbjct: 413 AHTVTLLEAVRAKLPEGQIIYEPGC 437
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 131/297 (44%), Gaps = 53/297 (17%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
A+ +AD + G+ S+E E G DR D+ LP Q +L+ + A K
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGK- 649
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ I ++IL YPG+ GG AI D ++G+YNPGGRLP+T+Y+
Sbjct: 650 --KVVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+P + GRTY++ ++PFG+GLSYT F Y A KL
Sbjct: 708 ------DVNQLPDFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEA--------KLS 753
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ + N I V N+G+ DG EVV VY + PG
Sbjct: 754 KNTIAKGEN------------------------VVLTIPVSNVGQRDGEEVVQVYLRRPG 789
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ ++RV I AG++ V + ++ + D +N++ G + +L G
Sbjct: 790 DKEGPRYTLRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPLEGTYELLYG 845
>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 942
Score = 278 bits (712), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 230/808 (28%), Positives = 366/808 (45%), Gaps = 140/808 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
Y D R +DL+ +MTL EK QM L YG R+ LP EW
Sbjct: 53 YEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGAI 111
Query: 59 --------------------WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
W + H + FI P + + G
Sbjct: 112 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVESY 171
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWGR
Sbjct: 172 RATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRG+Q +++A KH+ AY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQHSH-------------QVAATGKHFVAYSNNKGARE 272
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + PF+ + E + VM SYN +G+P L +RG+
Sbjct: 273 GMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRGE 332
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
++G ++E I+ +R + V +G FD Q G + + ++ LA +A+R+ +V
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLAGADKEVEKAENESLALQASRESLV 451
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
LLKN+N LPL+ N+K +A+ GP+A+ + +Y T+ ++G ++ +
Sbjct: 452 LLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEV 511
Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
Y GC D+V N + I A++ A+ AD V+V G E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENK 570
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ V K PV LV+++ + IN+A + + IL YPG +G
Sbjct: 571 SRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPVILEAWYPGSKG 627
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G A+ADV+FG YNPGG+L +T + + +IP+ + P +P + G DG +
Sbjct: 628 GTAVADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRVNG 685
Query: 598 -VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+Y FGYGLSYT F+Y + SPK V
Sbjct: 686 ALYSFGYGLSYTTFEYSDIEISPK---------------------------------VIT 712
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+ K T + +V N GK G EVV +Y + T+ K + G+ER+ + G++ +V FT
Sbjct: 713 PNQKATVRCKVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFT 772
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
++ K L+++D ++ G +I+VG
Sbjct: 773 LDR-KQLELLDKHMEWVVEPGDFSIMVG 799
>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 851
Score = 278 bits (711), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 162/421 (38%), Positives = 233/421 (55%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ R+T+ EK+ + + G+PRLG+ Y +EALHGV GR
Sbjct: 28 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 85
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L +++ +S EARA +N + G
Sbjct: 86 --------------FTVFPQAIGLAATWNPELQRRVATVISDEARARWNELDQGRAQKEQ 131
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G +V+GLQ D
Sbjct: 132 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------DDPH 182
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E + FEMCV EG +S+M +Y
Sbjct: 183 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAY 238
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK+L TKE A LKA
Sbjct: 239 NALNDVPCTLNAWLLQKVLRKDWGFQGYVVSDCGGPALLVNAHKYLK-TKEAAATLSLKA 297
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + A +Q +++ADID++ + M+LG FDG + Y + +
Sbjct: 298 GLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDGVERNPYTKISPS 357
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H ++A +AARQ IVLLKN LPLN +K++A+VG NA K G+Y G P
Sbjct: 358 VIGSKEHQQIALDAARQCIVLLKNQKNMLPLNASKLKSIAVVG--INAGKCEFGDYSGAP 415
Query: 421 C 421
Sbjct: 416 V 416
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 87/301 (28%), Positives = 148/301 (49%), Gaps = 51/301 (16%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
A + + + V G++ S+E EG+DR D+ LP Q E + ++ + +++++ ++
Sbjct: 598 AVRECETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSN-MIVILVAGSSLA 656
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
IN+ + + +I+ YPGE+GG A+A+V+FG YNP GRLP+T+Y++ ++P P
Sbjct: 657 INWMDEH--VPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-LDELP----PF 709
Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
+ GRTYK+F G V+YPFGYGLSY+ FKY
Sbjct: 710 DDYDITKGRTYKYFKGEVLYPFGYGLSYSSFKY--------------------------- 742
Query: 640 TNKPPCAAVLIDDVKCKDY--KFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVI 696
D++ KD + ++N GK +G EV VY + P G +K++
Sbjct: 743 -----------SDLRVKDEADEVAVSFRLKNTGKRNGDEVTQVYVRIPETGGIVPVKELK 791
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSFPLQLN 755
G+ RV + +G+S +V +N + L+ D + G I+VG + ++
Sbjct: 792 GFRRVPLKSGESRRVEIRLNK-EQLRYWDVGKGQFVVPKGTFDIMVGASSKDIRLQTVID 850
Query: 756 L 756
L
Sbjct: 851 L 851
>gi|298479985|ref|ZP_06998184.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
gi|298273794|gb|EFI15356.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
Length = 735
Score = 278 bits (711), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 222/760 (29%), Positives = 354/760 (46%), Gaps = 97/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLY-- 56
Y DAK P +R DL+ RMTL EKV Q+ G VP +G +Y
Sbjct: 30 YKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGEEVKKVPSEIGSLIYFD 89
Query: 57 --EWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
++ + R P +D+ T +P + S+N L ++ +
Sbjct: 90 INPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G +A VRG
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +++AC KHY Y G D + + ++ Q + +T++LP+EM V G
Sbjct: 198 -YQGDDMSAENRMAACLKHYVGYGASE-AGRDYVY--TEISAQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P A+P ++ + ++ W G+IVSD +++ + ++ L TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
DA AGL++D + Y V++GK+ A +D S+R + V LG F+
Sbjct: 311 DAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFCLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K+ PQ + +AA+ A + +VLLKNDN LPL N K +A+VGP A ++
Sbjct: 371 PVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLT--NKKKIAVVGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G DG A + YA GC + S A+D A+ +D +
Sbjct: 429 GSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKP-QGNDRSGFAGALDVARWSDVVI 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G L+ E R + LP Q EL+ ++ +A K PV LV+ + +++N + P
Sbjct: 488 VCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVLSNGRPLELN--RMEPL 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
+IL + PG G R++A ++ G+ NP G+L +T+ PY++ +P+
Sbjct: 545 CDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596
Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
GR ++ F + +YPFG+GLSYT+FKY GT
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
P V D K + ++ V N G DG+E V + P + T +K++ +E+
Sbjct: 631 PSATKVKRGD------KLSAEVTVTNTGSRDGAETVHWFISDPYCSITRPVKELRHFEKQ 684
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
I AG++ F ++ + V+ L +G + ILV
Sbjct: 685 LIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 853
Score = 278 bits (711), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 163/420 (38%), Positives = 232/420 (55%), Gaps = 45/420 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y +A P ER DL+ R+T+ EK+ + + G+PRLG+ Y +EALHGV GR
Sbjct: 29 YKNANAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 86
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L K+I +S EARA +N + G
Sbjct: 87 --------------FTVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQGRNQKEQ 132
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G +V+GLQ D
Sbjct: 133 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------DDPH 183
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E + FEMCV EG +S+M +Y
Sbjct: 184 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAY 239
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A +KA
Sbjct: 240 NALNNVPCTLNSWLLQKVLRRDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIKA 298
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + + A +Q +EADID++ + M+LG FDG + Y + +
Sbjct: 299 GLDLECGDDVYDEYLLNAYKQYMASEADIDSAAYHVLTARMKLGLFDGVERNPYAKISPS 358
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H +A AAR+ IVLLKN LPLN +K++A+VG NA K G+Y G P
Sbjct: 359 VIGSKEHQTVALNAARECIVLLKNQKNMLPLNVKKLKSIAVVG--INAGKCEFGDYSGAP 416
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 94/299 (31%), Positives = 155/299 (51%), Gaps = 46/299 (15%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
A + + V V G++ S+E EG+DR D+ LP Q E + ++ + LV+++ ++
Sbjct: 599 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IILVLVAGSSLA 657
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
+N+ N + +I+ YPGE+GG A+A+V+FG YNP GRLP+T+Y++ ++P
Sbjct: 658 VNW--ENEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-LEQLP----AF 710
Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
+ GRTY++F V+YPFGYGLSYT FKY ++K+D + ++++T
Sbjct: 711 DDYDITKGRTYQYFKKDVLYPFGYGLSYTTFKYS--------NLKVDDAGKTVNVSFT-- 760
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGT--HIKQVIG 697
++N GK G EV VY + P IAG+ I+Q+ G
Sbjct: 761 --------------------------LKNTGKRAGDEVAQVYVRLPEIAGSTQAIRQLKG 794
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
+ RV + AG+S KV T++ + + A ++ G+ T +VG G + NL
Sbjct: 795 FRRVALKAGESRKVEITLDKEQLRYWDEKQACFVVPQGSFTFMVGASSGDIRLENTTNL 853
>gi|340616359|ref|YP_004734812.1| xylosidase/arabinosidase [Zobellia galactanivorans]
gi|339731156|emb|CAZ94420.1| Xylosidase/arabinosidase, family GH3 [Zobellia galactanivorans]
Length = 801
Score = 278 bits (711), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 232/810 (28%), Positives = 365/810 (45%), Gaps = 147/810 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y D P R +DL+ +MTL EK QM L YG R+ LP +W ++ G+ I
Sbjct: 44 YEDPTRPVDLRIEDLLSQMTLEEKSCQMATL-YGFGRVLKDELPTPDWKNQIWKDGIGNI 102
Query: 70 GRRTNS-------------PPGTH----------------------FDSE-VPG-----A 88
+ N+ PP H F +E + G A
Sbjct: 103 DEQLNNLAYHPSAVTDKAWPPSNHIKALNTIQEFFVEDTRLGIPVDFTNEGIRGLCHEKA 162
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
TSFP+ + A++N++L KIG EAR + G T +SP +++ RDPRWGRV+
Sbjct: 163 TSFPSQLGVGATWNKNLVGKIGHITGKEARLL------GYTNVYSPILDIARDPRWGRVV 216
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GEDPY+VG V+G+Q K+ + KH+A Y +
Sbjct: 217 ECYGEDPYLVGELGYQMVKGIQQE--------------KVVSTPKHFAIYSAPKGGRDGD 262
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D+ +TE+++ ++ PF+ + + VM SYN NG+P + LN +R DW
Sbjct: 263 ARTDAHITERELFSLYLHPFKRAIKDAGAMGVMSSYNDYNGVPVSSSKYFLNDILREDWG 322
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM--------- 318
F GY+VSD +++ I + H D K DAV + + AGL++ T+FTM
Sbjct: 323 FKGYVVSDSRAVEFIADKHHVAKDRK-DAVRQAVLAGLNV-----RTDFTMPEDFILPVR 376
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQHIELAAEAAR 376
V++G + A ID +R + V G FD +P K + + + P++ E+A +A+
Sbjct: 377 ELVKEGGLDMATIDDRVRDILRVKFWQGLFD-APYGKQMKEADKTVGKPEYQEVAYQASL 435
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF---YAY 433
+ IVLLKN+ LPL+ K++ + GP+A A + Y + S DG +
Sbjct: 436 ESIVLLKNEENILPLDFSKYKSVLVTGPNAKAINHSVSRYGPSHIDVVSVFDGIKEKFPK 495
Query: 434 SKVINYAPGCA-------DIVCQN-------NSMIPAAIDAAKNADATVIVAGLDLSVEA 479
I Y GC D N S I A+ AK ++V G D
Sbjct: 496 DVEIKYTKGCVFFDENWPDSELMNTPPTEAEQSEIDKAVAMAKTVGLAIVVLGDDEETVG 555
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E + R L LPG Q +L+ ++ PV +V+++ + IN+ + + I+ + G
Sbjct: 556 ESRSRTSLDLPGNQQKLVEEIYKTGT-PVIVVLINGRPMTINWV--DKYVPGIVEGWFQG 612
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF------PGRTYKFF 593
+ GG AIADV+ G YNPGG+LP++ + ++P + P +P P + K
Sbjct: 613 KFGGSAIADVLVGSYNPGGKLPVS-FPKTVGQLP-MNFPSKPGAQADQPAKGPNGSGKTR 670
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
G +YPFGYGLSYT F+Y N + +N +
Sbjct: 671 VGGFLYPFGYGLSYTTFEY---------------------TNLKIRSN-----------I 698
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
K +++ N GK G E+V +Y S + KQ+ G+ER+ + AG++ V
Sbjct: 699 KNGLGDVVVSVDITNSGKRKGDEIVQLYFSDETSSVTVYEKQLRGFERISLEAGETKTVN 758
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
FT+ + + L + + +L G+ TI++G
Sbjct: 759 FTL-SPEDLSLYNRQMEFVLEPGSFTIMIG 787
>gi|94970273|ref|YP_592321.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
gi|94552323|gb|ABF42247.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
Length = 881
Score = 278 bits (710), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 168/464 (36%), Positives = 244/464 (52%), Gaps = 56/464 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + L +RA DLV RMT+ EKV Q+ + + VPRL +P Y+WWSEALHGV+
Sbjct: 30 YLNPSLAPEKRAADLVHRMTVEEKVSQLTNDSRAVPRLNVPDYDWWSEALHGVA------ 83
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN-------- 125
PG T +P + A+F+ +++ + + E R + G
Sbjct: 84 -----------QPGVTEYPQPVALAATFDNDKVQRMARFIGIEGRIKHEEGMKDGHSDIF 132
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GL FW+PNIN+ RDPRWGR ET GEDP++ R + YV+GLQ + Y L
Sbjct: 133 QGLDFWAPNINIFRDPRWGRGQETYGEDPFLTARMGVAYVKGLQGDDPKYY--------L 184
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
IS KHYA + + R D +V++ D +T++ F V E SVMC+YN
Sbjct: 185 AIS-TPKHYAVH---SGPETTRHFADVKVSKHDELDTYLPAFRATVTEAKAGSVMCAYNS 240
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
+NG P C + LL +RG WNF GY+VSDC++I I HKF T+ +A A ++ G+
Sbjct: 241 INGQPACVNEFLLQDQLRGKWNFQGYVVSDCEAIINIYRDHKF-TKTQAEASALAVQRGM 299
Query: 306 DLDCGDY--------YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---Y 354
D +C D+ Y + A +QG + E++IDT+L L+ M+LG FD P+ Y
Sbjct: 300 DNECVDFGKQKDDHDYRPY-FDAYKQGILKESEIDTALVRLFTARMKLGMFD-PPEMVPY 357
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
+ + + +H ELA A + +VLLKND G LPL +K +A++GP A T+ ++G
Sbjct: 358 SKIDPKELESAEHRELARTLANESMVLLKND-GTLPLKKSGLK-IAVIGPLAEQTRYLLG 415
Query: 415 NYEGTPCRYTSPMDGFYAY--SKVINYAPGCADIVCQNNSMIPA 456
NY GTP S ++G A I + G + QN +P+
Sbjct: 416 NYNGTPSHTVSVLEGLRAEFPDAQITFERGT-QFLDQNGEAVPS 458
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 101/300 (33%), Positives = 154/300 (51%), Gaps = 52/300 (17%)
Query: 455 PAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAA 504
PAA+ AAKNAD + V G+ +E E G DR L LP + +L+ ++ A
Sbjct: 601 PAAVTAAKNADVVIAVLGITSDLEGEEMPVSEEGFNGGDRTSLDLPKPEQQLLESISAAG 660
Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
K PV LV+ + A+ +N+A+ + +IL YPGEEGG AIA + GK NP GRLP+T+
Sbjct: 661 K-PVVLVLSNGSALSVNWAQQH--ANAILEGWYPGEEGGTAIAQTLSGKNNPAGRLPVTF 717
Query: 565 YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIK 624
Y P+ ++ GRTY++F+G +YPFGYGLSYT F Y+ + PK+
Sbjct: 718 YTGTEQLPPFEDYAMK------GRTYRYFEGKPLYPFGYGLSYTTFSYRDLALPKA---- 767
Query: 625 LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP 684
+ P T Q+ V N GK++G EV +Y
Sbjct: 768 ------------PLNAGDP----------------VTAQVTVTNTGKVEGDEVAQLYLSF 799
Query: 685 PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
P IAG ++ + G+ R+ + AG+S + F + + L +V+ A + ++A G +++ VG G
Sbjct: 800 PNIAGAPLRALRGFRRIHLKAGESQTIKFELKD-RDLSMVNEAGDPIIAEGEYSVSVGGG 858
>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 763
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 216/691 (31%), Positives = 335/691 (48%), Gaps = 100/691 (14%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP + +++N L++ I + V+ E RA G +SP ++VVRDPRWGR
Sbjct: 123 GATVFPVPLTIGSTWNTELFRSISRAVAAETRA-----QGGSATYSPVLDVVRDPRWGRT 177
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
ET GEDP++V +A+ V+GLQ E ++ H + A KH+A Y N
Sbjct: 178 EETFGEDPHLVTEFAVAAVQGLQG-ERLDSH-------TSLLATLKHFAGYGASEGGRNG 229
Query: 207 R-FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
H R ++ E +LPF V G + SVM +YN ++G+P + LL +R
Sbjct: 230 APVHMGLR----ELHEVDLLPFRKAVEAGAL-SVMTAYNEIDGVPCTSSGYLLQDVLREA 284
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQG 324
W F G++++DC +I + H E A A+ LKAG+D++ G + A++QG
Sbjct: 285 WGFDGFVITDCGAIHMLACGHNTAGSGVE-AAAQSLKAGVDMEMSGTMFRAHLHQALEQG 343
Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
I E D++ + + + RLG FD + I +HI LA +AA +GIVLLKN
Sbjct: 344 LITEEDLNRAAGRVLELKFRLGLFDRPYVDPAWAEQVIGCKEHIALAYQAAAEGIVLLKN 403
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGF---YAYSKVINY 439
+ LPL++ + T+A++GP+A+A +G+Y P + + +DG S+V+ Y
Sbjct: 404 EGNLLPLDSSS-GTIAVIGPNAHAPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVL-Y 461
Query: 440 APGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA--------- 479
APGC I + P A+ A+ AD V+V G +DL A
Sbjct: 462 APGC-RIQGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGHAES 520
Query: 480 -----EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
EG DR L L G Q EL+ ++ K PV +V ++ + + + I SI+
Sbjct: 521 DMECGEGIDRSTLTLMGVQLELLQELHKLGK-PVIVVYINGRPITEPWIDEH--IPSIVE 577
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
YPG+EGG AIAD++FG NP GRLP++ E + Y + R G+ Y
Sbjct: 578 AWYPGQEGGSAIADMLFGDINPSGRLPLSIPKEVGQLPNSYNARRTR------GKRYLET 631
Query: 594 DGPVVYPFGYGLSYTQFKY-KVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
D YPFG+GLSYT+F+Y ++ P V I +
Sbjct: 632 DLAPRYPFGFGLSYTEFRYGRLTVEPAVVPIGGEA------------------------- 666
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKV 711
T +I+V N G DG+EVV +Y + T ++ + G+ +VF+ AG++ +V
Sbjct: 667 --------TVRIDVTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEV 718
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
FT+ + + L+++ ++ G I VG
Sbjct: 719 TFTIGS-EQLELIGLDLKPVVEPGEFRIQVG 748
>gi|386819249|ref|ZP_10106465.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
gi|386424355|gb|EIJ38185.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
Length = 878
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 161/430 (37%), Positives = 242/430 (56%), Gaps = 46/430 (10%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ + +LP ER DL+ R+T+ EK+ Q+ + + RLG+P Y WW+E+LHGV+ G
Sbjct: 24 YPFQNTELPEDERVNDLINRLTVDEKIAQLLYQSPAIERLGIPAYNWWNESLHGVARAGY 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN-- 125
AT FP I AS+++ L ++ +S EARA ++ G
Sbjct: 84 ----------------ATVFPQSITIAASWDDELVAEVANVISDEARAKHHEYLRRGQHD 127
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLTFWSPNIN+ RDPRWGR ET GEDPY+ G YV+GLQ ++++
Sbjct: 128 IYQGLTFWSPNINIFRDPRWGRGHETYGEDPYLTGVLGTEYVKGLQ---------GNNAK 178
Query: 184 PLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
LK+ A KH+A + G + R FD +++D+ ET++ F V +G+V S+M
Sbjct: 179 YLKVVATAKHFAVHS-----GPEPLRHEFDVAPSQRDLWETYLPAFRTLVKDGNVYSIMT 233
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+YNR+ G A L + +R W F+GY+VSDC +I + ++H D E A A +
Sbjct: 234 AYNRIYGEAASASNSLYS-ILRDKWGFNGYVVSDCGAIADMWKTHHVAKDAAE-ASAMAV 291
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGK 359
K G DL+CG+ Y T A+Q G I EAD+D +L L +LG FD + Y +
Sbjct: 292 KEGCDLNCGNSYEKLT-DALQDGLITEADLDVALHRLMRARFKLGMFDSDEKVPYAKIPF 350
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ NP+H LA +AA++ IVLLKN+N LPL + N+K +A++GP+A+ +++ GNY G
Sbjct: 351 SVNNNPKHKVLALKAAQKSIVLLKNENAILPL-SKNLKNIAVIGPNADNIQSLWGNYNGM 409
Query: 420 PCRYTSPMDG 429
P + ++G
Sbjct: 410 PKNPVTVLEG 419
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 102/302 (33%), Positives = 150/302 (49%), Gaps = 54/302 (17%)
Query: 452 SMIPAAIDAAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVA 501
+ + A+ AA +D V+ GL ++ VE EG DR L LP Q EL+ +V
Sbjct: 587 NQLEKAVLAANKSDVVVLALGLNERLEGEEMKVEVEGFADGDRTSLNLPKKQVELMKEVV 646
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K PV LV+++ A+ IN+A N I +I+ GYPG+EGG AIA+V+FG YNP GRLP
Sbjct: 647 ATGK-PVVLVLLNGSALSINWASEN--IPAIISAGYPGQEGGNAIANVLFGDYNPAGRLP 703
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
+T+Y++ +P N GRTYK+F +YPFGYGLSYT+FKY P +
Sbjct: 704 VTYYKS------VDDLPPFEDYNMDGRTYKYFKKEPLYPFGYGLSYTKFKYSNLEIPLEI 757
Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
I N+P ++V N G DG EVV +Y
Sbjct: 758 KI-----------------NEP----------------IKVSVQVANEGDFDGDEVVQLY 784
Query: 682 SK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
+ G I +++G++R+ + G KV FT+ + L +++ ++ G +I
Sbjct: 785 VRDEEGSTPRPICELVGFKRIHLKKGARQKVEFTIQP-RELAMINKDDKFVIEPGWFSIS 843
Query: 741 VG 742
VG
Sbjct: 844 VG 845
>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Bacteroides faecis MAJ27]
Length = 804
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 223/726 (30%), Positives = 344/726 (47%), Gaps = 120/726 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG T FPT I +A+++ +L +++
Sbjct: 142 RLGIPVF-LAEEAPHGHMAIG-----------------TTVFPTGIGMSATWSPTLIEEV 183
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G+ ++ E R+ + P +++ RDPRW RV ET GEDP + GR V GL
Sbjct: 184 GKAIAKEIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVTGLG 238
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
SR A KH+ AY + EG ++ S V +D+ E F+ PF
Sbjct: 239 S--------GDLSREHATIATLKHFLAYAVP--EGGQNGNYAS-VGARDLHENFLPPFRE 287
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
+ G +S VM SYN ++GIP A+ LL Q +R +W F G++VSD SI+ I ESH F+
Sbjct: 288 AIEAGALS-VMTSYNSIDGIPCTANHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 345
Query: 290 NDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
T E+A + L AG+D+D GD + N + AV+ GK+ E I+ ++ + + +G F
Sbjct: 346 ASTMEEAAVQALSAGVDIDLGGDAFMNL-LQAVRSGKLDETQINAAVDRILRMKFEMGLF 404
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + N +H++LA + A+ +VLL+N N LPL+ IK +A+VGP+A+
Sbjct: 405 EHPYVNPKTTTKMVRNKEHVKLARKVAQSSVVLLENKNSILPLSK-KIKRVAVVGPNADN 463
Query: 409 TKAMIGNY----EGTPCRYTSPMDGFYAYSKV----INYAPGCADIVCQNNSMIPAAIDA 460
M+G+Y E R + +DG SK+ + Y GCA I + I A++A
Sbjct: 464 RYNMLGDYTAPQEDKDIR--TVLDG--VISKLSPSRVEYVRGCA-IRDTTVNEIAEAVEA 518
Query: 461 AKNADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELI 497
A ++ + V G + + EG DR L L G Q +L+
Sbjct: 519 AHRSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLL 578
Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
N + K P+ +V + +D +A ++L YPG+ GG AIADV+FG YNP
Sbjct: 579 NALKTTGK-PLIVVYIEGRPLDKVWASECA--DALLTASYPGQAGGDAIADVLFGDYNPA 635
Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASS 617
GRLP++ + +IP P N+ Y +Y FGYGLSYT F+Y
Sbjct: 636 GRLPVS-VPRSVGQIPVYYNKKAPRNH----DYVEMAASPLYGFGYGLSYTTFEYS---- 686
Query: 618 PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV 677
D+++ T K PC F +V+N G DG EV
Sbjct: 687 ----DLQI--------------TQKSPC-------------HFEVSFKVKNTGNYDGEEV 715
Query: 678 VMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGA 736
+Y K + +KQ+ +ER F+ G+ ++ FT+ K L I+D + ++ +G
Sbjct: 716 AQLYLKDEYASVVQPLKQLKHFERFFLRKGEEKEILFTLTE-KDLSIIDRSMKRVVETGD 774
Query: 737 HTILVG 742
I++G
Sbjct: 775 FRIMIG 780
>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
Length = 854
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 164/421 (38%), Positives = 230/421 (54%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D K P +R DL+ R+T+ EK+ + + G+PRL +P Y +E+LHGV GR
Sbjct: 30 YLDEKAPTHDRIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGVVRPGR-- 87
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I + +N L KI +S EAR +N G
Sbjct: 88 --------------FTVFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQR 133
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDPY+ G +VRGLQ D R
Sbjct: 134 FTDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGILGTAFVRGLQG---------DDPR 184
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E + FEMCV +G +S+M +Y
Sbjct: 185 YLKIVSTPKHFAA----NNEEHNRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMSAY 240
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P A+P LL + +R DW F+GY+VSDC +V + K++ TKE A +KA
Sbjct: 241 NAINDVPCTANPWLLTKVLRHDWGFNGYVVSDCGGPSLLVSAMKYVK-TKEAAATLSIKA 299
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
GLDL+CG D Y + A Q ++ ADIDT+ + M LG FD Y + +
Sbjct: 300 GLDLECGDDVYMQPLLNAYNQYMVSRADIDTAAYRVLRARMHLGLFDDPDLNPYNKISPS 359
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H +LA EAARQ IVLLKN+N LPLN +K++A+VG NA + G+Y G P
Sbjct: 360 VVGSAEHKQLALEAARQSIVLLKNNNRTLPLNPKKVKSIAVVG--INAGNSEFGDYSGIP 417
Query: 421 C 421
Sbjct: 418 A 418
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 152/313 (48%), Gaps = 51/313 (16%)
Query: 449 QNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPV 508
Q M A A + + + V G++ ++E EG+DR D+ LP Q E I ++ P
Sbjct: 589 QRLDMYGEAGKAVRECEQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKV--NPN 646
Query: 509 TLVIMSAGA-VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
+V++ AG+ + IN+ + + +I+ YPGE+GG A+A+V+FG+YNPGGRLP+T+Y +
Sbjct: 647 IVVVLVAGSSLAINWMDEH--VPAIVNAWYPGEQGGTAVAEVLFGEYNPGGRLPVTYYNS 704
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
+IP + GRTY++F G +YPFGYGLSYT F YK
Sbjct: 705 -LEEIP----SFDDYDITKGRTYQYFKGKPLYPFGYGLSYTTFAYK-------------- 745
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-- 685
N + N + K +F E++N G+MDG EV VY K P
Sbjct: 746 -------NLQINDN-------------GNNIKVSF--ELKNTGRMDGDEVSQVYVKIPSS 783
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEG 744
GI IK++ G++R + G + V + L+ D+A + + G + ++G
Sbjct: 784 GIF-MPIKELKGFQRSTLKKGATKNVEINIRK-DLLRYWDDATETFITPKGEYEFMIGTS 841
Query: 745 VGGVSFPLQLNLN 757
+ LN
Sbjct: 842 SQDIQLTKSFTLN 854
>gi|395802372|ref|ZP_10481625.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
gi|395435613|gb|EJG01554.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
Length = 745
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 236/781 (30%), Positives = 355/781 (45%), Gaps = 141/781 (18%)
Query: 28 LVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
L+ +MTL EKV + G+ + GV RLG+P + L I R +P G D
Sbjct: 53 LISQMTLEEKVGMLHGNSMFANAGVKRLGIPELKMADGPLGVREEISRDNWAPAGWTNDF 112
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
AT +P A++N + G ++ E RA SP IN+VR P
Sbjct: 113 ----ATYYPAGGALAATWNAEMAHTFGTSLGEELRA-----RDKDMLLSPAINMVRTPLG 163
Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
GR E EDP++ + A+ + GLQ+ + + AC KHYAA +N E
Sbjct: 164 GRTYEYMSEDPFLNKKIAVPLIVGLQEKD--------------VMACVKHYAA---NNQE 206
Query: 204 GNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIR 263
N F D ++ E+ ++E ++ FE V E S+M +YN+ G C + +LN+ +R
Sbjct: 207 TNRDF-VDVQIDERTLREIYLPAFEASVKEAKAYSIMGAYNKFRGEYLCENDYMLNKILR 265
Query: 264 GDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD-------YYTNF 316
+W F G +VSD ++ + A+ LK GLD++ G + +
Sbjct: 266 DEWGFKGVVVSDWAAVHS---------------TAKSLKNGLDIEMGTPKPFNEFFLADK 310
Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
+ AV+ G+++E +ID ++ + VL ++ G + K +I H + A + A
Sbjct: 311 LIVAVKSGEVSEKEIDLHVKRILRVLFQVKAMGGGER----AKGSIATEAHYQDAYKIAA 366
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC-RYTSPMDGF---YA 432
+ IVLLKN+N ALPL +K++A++G +A A+ G G R +P++G
Sbjct: 367 EAIVLLKNENNALPLQLDGVKSIAVIGNNATKKNALGGFGAGVKTKREVTPLEGLKNRLP 426
Query: 433 YSKVINYAPGCADIVCQNN--------------------SMIPAAIDAAKNADATVIVAG 472
S INYA G + + N + + A+DAAKN+D +I AG
Sbjct: 427 SSVKINYAEGYLERYDKKNRGNLGNITANGPVTIDELDPAKVQEAVDAAKNSDVAIIFAG 486
Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINFAKNNPKIKS 531
+ E E DR DL LP Q ELI KV A P T+V+M AGA DIN + + K +
Sbjct: 487 SNRDYETEASDRRDLHLPFGQEELIKKV--LAVNPKTIVVMIAGAPFDIN--EVSKKSSA 542
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT-- 589
++W + G EGG A+ADVI GK NP G+LP T + I P N+FPG
Sbjct: 543 LVWSWFNGSEGGNALADVILGKVNPSGKLPWT------MPIALKDSPAHATNSFPGDKAV 596
Query: 590 ---------YKFFDGPVV---YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
Y++FD V YPFGYGLSYT F A K DK +
Sbjct: 597 NYAEGLLIGYRWFDTKNVAPLYPFGYGLSYTSFALDNA--------KTDKTSYAQ----- 643
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI- 696
+DV ++V+N GK+DG EVV +Y+ T Q +
Sbjct: 644 -------------NDV------IEVTVDVKNTGKVDGKEVVQLYTSKSDSKITRAAQELK 684
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVGEGVGGVSFPLQLN 755
G+++ + AG S KV + K L D A+ + G +TI +G + +Q+
Sbjct: 685 GFKKAEVKAGSSTKVTIKV-PVKELAYYDVASKKWTVEPGKYTIKLGTSSRDIKKEIQVT 743
Query: 756 L 756
+
Sbjct: 744 V 744
>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
Length = 854
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 160/452 (35%), Positives = 241/452 (53%), Gaps = 42/452 (9%)
Query: 5 IKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
+ +K FPY + L ERA DL R+TL EK + M + + +PRLG+P +EWWSEALH
Sbjct: 16 LPMKAQQFPYQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALH 75
Query: 65 GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
G+ G AT FP + +S++++L +K+ VS E R
Sbjct: 76 GIGRNGF----------------ATVFPITMGMASSWDDALLQKVFDAVSDEGRVKAQQA 119
Query: 125 N--------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY 176
GL+FW+PNIN+ RDPRWGR ET GEDPY+ R + VRGLQ
Sbjct: 120 KRSGTIKRYQGLSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQG------ 173
Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGD 235
SDS+ K+ AC KH+A + W +R F+ + E+D+ ET++ F+ V +GD
Sbjct: 174 --PSDSKYRKLLACAKHFAVHSGPEW---NRHTFNVEDLPERDLWETYLPAFKALVQQGD 228
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES-HKFLNDTKE 294
V+ VMC+Y R++G P C + + L +R +WN+ G +VSDC ++ + H ++
Sbjct: 229 VAEVMCAYQRIDGQPCCGNNRFLKSILRNEWNYQGMVVSDCWAVPDFWKKGHHEVSPDAT 288
Query: 295 DAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-- 352
A A+ + +G D++CG Y+N AV+ G I EAD+D S+R L LG FD
Sbjct: 289 HASAKAVLSGTDVECGSDYSNLPE-AVRAGIIKEADVDVSVRRLLEARFALGDFDPDELV 347
Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
+ + ++ + + H +LA + AR+ +VLL+N N LPL K + +VG +A + M
Sbjct: 348 PWTKISESVVASKAHKQLALDMARKSMVLLQN-NDILPLKRSGQK-IVVVGANAIDSTMM 405
Query: 413 IGNYEGTPCRYTSPMDGFYAYSKVINYAPGCA 444
GNY G P + + + G S + + PGC
Sbjct: 406 WGNYSGYPTQTVTILQGLQTKSDQVTFIPGCG 437
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/321 (28%), Positives = 146/321 (45%), Gaps = 61/321 (19%)
Query: 433 YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GK 482
Y +V + A DI S + AD + V G+ +E E G
Sbjct: 568 YVQVTSLAMIKFDITHTGLSTPQDIVRKTAGADVVIFVGGISPRLEGEEMEVSDPGFKGG 627
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DR + LP Q E+I +++A + +V ++ I + ++ +IL YPGE+G
Sbjct: 628 DRTTIELPQAQREVIKALSEAGR---RIVFVNCSGSAIALTPESQRVDAILQAWYPGEQG 684
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
G A+ADV+FG YNP G+LP+T+Y+ + +P GRTY++F ++PFG
Sbjct: 685 GTAVADVLFGDYNPSGKLPVTFYKND------AQLPDFLDYRMAGRTYRYFKETPLFPFG 738
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
YGLSYTQF Q R IN V
Sbjct: 739 YGLSYTQFTIG----------------QPRYINNQV------------------------ 758
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
Q+ V N GK DG EVV VY + A IK + G++RV + G++ +V ++ +S +
Sbjct: 759 QVSVSNTGKRDGDEVVQVYIRRTDDAAGPIKTLRGFQRVSLKVGETKQVSVSL-PRESFE 817
Query: 723 IVDNAANSL-LASGAHTILVG 742
D ++N++ + G + ++VG
Sbjct: 818 WWDASSNTMRVIPGNYEVMVG 838
>gi|333379224|ref|ZP_08470948.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
22836]
gi|332885492|gb|EGK05741.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
22836]
Length = 745
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 223/757 (29%), Positives = 354/757 (46%), Gaps = 103/757 (13%)
Query: 27 DLVERMTLPEKVQQMGDLAYGVPRLGLPLYEW---------------------WSEALHG 65
DL+ RMTL EK+ Q G + P + ++ +L
Sbjct: 37 DLLRRMTLEEKIGQTVLYTSGYDVITGPTVDPNYKEYLKKGMVGGIFNAVGADYTRSLQK 96
Query: 66 VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
++ R P +D T FP + + S++ ++ + ++EA A
Sbjct: 97 IAVEETRLGIPLIFGYDVIHGQRTIFPIPLAESCSWDLEAMERSARIAASEATA------ 150
Query: 126 AGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
G+ + ++P +++ RDPRWGRV E GED Y+ A V+G Q D+ S
Sbjct: 151 EGINWIYAPMVDISRDPRWGRVAEGAGEDVYLGSLIAAARVKGFQG--------DNLSAV 202
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
+ AC KHYAAY G D D + E + T++ PF+ ++ G ++M S+N
Sbjct: 203 NTVVACVKHYAAYGA-TMAGRDYNTVDMSLNE--LWNTYLPPFKAALDAG-CGTIMTSFN 258
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
+NGIP + LL +R WNF+G++V+D SI ++ H + ND K A + AG
Sbjct: 259 DLNGIPATGNKYLLKDILRDKWNFNGFVVTDYTSINEMI-PHGYANDEKHSA-EIAMNAG 316
Query: 305 LDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNN 361
+D+D G Y N +++GK++E D+ + R + + +LG F+ +Y N K +
Sbjct: 317 VDMDMQGGVYMNHLKTLIEEGKVSEKDVTEAARAILKIKYKLGLFEDPYRYCDANREKTD 376
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
I P + E A + AR+ +VLLKND LPL K +AL+GP ++G +
Sbjct: 377 ILTPANKEAARDMARKSMVLLKNDKQTLPLKEN--KRVALIGPLVKDKYEILGCWSAMGN 434
Query: 422 RYTSPM---DGFYAY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
R T P+ DG I+YA GC DI ++ A+ A +D V+V G +
Sbjct: 435 RDTIPVSVYDGLVEAIGKDKISYAKGC-DIQSEDTKGFAEAVRVASASDVVVMVMGEFHN 493
Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
+ E R +L LPG Q +L+ + K PV LV+M+ + IN+ K+N + +IL
Sbjct: 494 MSGENNSRTNLSLPGVQVDLLKAIKKTGK-PVVLVLMNGRPLTINWEKDN--LDAILEAW 550
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY------TSMPLRPVNNFPGRTY 590
+PG GG AIADV+ GKYNP G+L +T + N +IP T P P N P Y
Sbjct: 551 FPGTMGGAAIADVLTGKYNPSGKLTMT-FPQNVGQIPLFYNHKNTGRPYDP--NVPQFAY 607
Query: 591 --KFFD--GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
+++D +YPFGYGLSYT F Y D+ L +
Sbjct: 608 GSRYWDVSNEPLYPFGYGLSYTTFTYS--------DLTLSSKEI---------------- 643
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAA 705
K+ +++ N G+ DG EVV +Y++ G +K++ G+++VF+ A
Sbjct: 644 --------TKENPLKVSVKLTNSGEYDGEEVVQLYTRDLVGSVTRPVKELKGFKKVFLKA 695
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
G+S + FT+ + L+ ++ + G + VG
Sbjct: 696 GESKVIDFTL-SVNDLRFYNSQLEYVYEPGDFHLFVG 731
>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
12145]
Length = 866
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 161/434 (37%), Positives = 231/434 (53%), Gaps = 43/434 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ D +LP+ +R DL++R+T+ EKV M D++ + RLG+ Y WW+EALHGV+ G
Sbjct: 24 YPFQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEALHGVARAGL 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
AT FP I ASF+ + VS EARA +N +
Sbjct: 84 ----------------ATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQGSYG 127
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLT W+P IN+ RDPRWGR +ET GEDPY+ + V+GLQ +Y
Sbjct: 128 RYEGLTMWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPSNGKYD------ 181
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDS-RVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R FD+ + ++D+ ET++ FE V E V VMC+
Sbjct: 182 --KLHACAKHFAVHSGPEW---NRHSFDAANIKQRDLYETYLPAFEALVKEAKVQEVMCA 236
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARV 300
YNR G P C +LL Q +R W F G +V+DC +I + +HK D + A V
Sbjct: 237 YNRFEGDPCCGSDRLLQQILRKKWGFEGIVVADCGAIADFFKENAHKTHPDAASASAAAV 296
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
+G DLDCG Y T AV++G I E DID S+R L + RLG D + +
Sbjct: 297 Y-SGTDLDCGSSYKALTE-AVKKGLIEEKDIDVSVRRLLMARFRLGEMDDQSLVPWSKIS 354
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
N + + H ++A + AR+ I LL+N N LPL +G +K +A++GP+A + GNY G
Sbjct: 355 YNVVASKAHNQIALDMARKSITLLQNKNNILPLKSGGLK-IAVMGPNAQDSVMQWGNYNG 413
Query: 419 TPCRYTSPMDGFYA 432
TP + ++G A
Sbjct: 414 TPANTITILEGIKA 427
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 93/310 (30%), Positives = 142/310 (45%), Gaps = 53/310 (17%)
Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQT 494
DI + + I +I AD V V G+ S+E E G DR D+ LP Q
Sbjct: 584 DIGYKEEANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKLPGFRGGDRTDIQLPTIQR 643
Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
+ + + +A K ++ ++ I A ++I+ YPG+ GG+A+ADV+FGKY
Sbjct: 644 QFVKALKEAGK---RVIFINCSGSPIGLADEMANSEAIVQAWYPGQAGGQAVADVLFGKY 700
Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
NP GRLPIT+Y T +P + GRTY++ ++PFGYGLSYTQF+Y
Sbjct: 701 NPSGRLPITFYRDT------TQLPDFENYDMAGRTYRYMQDKPLFPFGYGLSYTQFQY-- 752
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
+P +L V + V N GK G
Sbjct: 753 -GNP-----------------------------ILNQQVITNGQTIQLTVPVTNTGKRSG 782
Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LA 733
EVV VY + G A +K + + R+ AGQ+ +V F + K L+ + + ++ +
Sbjct: 783 DEVVQVYLRKKGDATGPVKTLRDFRRLSFNAGQTQQVVFKITP-KQLEWWNEQSKAMQVQ 841
Query: 734 SGAHTILVGE 743
SG + +LVG+
Sbjct: 842 SGDYELLVGK 851
>gi|304406707|ref|ZP_07388362.1| glycoside hydrolase family 3 domain protein [Paenibacillus
curdlanolyticus YK9]
gi|304344240|gb|EFM10079.1| glycoside hydrolase family 3 domain protein [Paenibacillus
curdlanolyticus YK9]
Length = 733
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 222/766 (28%), Positives = 366/766 (47%), Gaps = 108/766 (14%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGV----PRLGLPLYEWWSEALH----GVSFI---GR 71
E+A+ L+ +MTL +KV QM +G P G ++ E + G F
Sbjct: 22 EQAEQLLSKMTLEDKVGQMTQFDWGYNPINPETGESEHDLIIELIRQGKVGSIFNLSGAA 81
Query: 72 RTNSPPG---THFDSEVPGA----------TSFPTVILTTASFNESLWKKIGQTVSTEAR 118
N G H + ++P T FP + A++N + ++ STEA
Sbjct: 82 EANELQGLIEQHTELKIPMVIGRDVIHGYRTVFPIPLAMAAAWNPEVARQTSAAASTEA- 140
Query: 119 AMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
L + ++P I+V RDPRWGR+ E+ GEDPY+ Y +V G Q
Sbjct: 141 ----LTDGVTWVFAPMIDVSRDPRWGRIAESIGEDPYLTAAYGRAWVEGSQ--------- 187
Query: 179 DSDSRPLKISACC-KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVS 237
D+ P + +A C KH+A Y + G D D ++++++++ + PF+ V G +S
Sbjct: 188 -IDNGPGRATASCPKHFAGYGMAE-AGRDYNTVD--LSDRELRDIILPPFQDAVEAGALS 243
Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV 297
+M S+N +NGIP CA+ LL +R +W F G + SD +++ ++ N+ E+A
Sbjct: 244 -IMASFNEINGIPACANEYLLKTILRDEWGFEGVVASDYNALVELIVHGVAANE--EEAC 300
Query: 298 ARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKN 356
+ AG D+D +T V+ G++ E+ +D S+R + + ++LG + S +
Sbjct: 301 EMTVLAGCDMDMHSGIFTRQLPKLVRAGRVPESVVDDSVRRILAMKIKLGLLEQSK--SD 358
Query: 357 LGKNNICNP---QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
+ ++ P +++ELA EAARQ IVLL+N LPL+ ++A++GP A+ +
Sbjct: 359 VSQSAATQPLKSEYVELAREAARQSIVLLQNKEQVLPLSKAG-ASIAVIGPLADNATDPL 417
Query: 414 GNY--EGTPCRYTSPMDGFY---AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G + +G + ++G A I YA GC DI + AA++AA+++D V
Sbjct: 418 GCWALDGRSDEVVTALEGIRQAAAEGTSIRYAQGC-DIDSDSEEGFEAALEAARSSDVVV 476
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
++ G ++ E + R L LPG Q L+ VA K P+ VI+S + FA +
Sbjct: 477 MLLGESATMSGESRSRAALDLPGKQRALVEAVAKLGK-PIVAVILSGRP--LTFAWLPEQ 533
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP---YTSMPLRPVNNF 585
+I+ + G + G AIADV+FG +NP GRLP+T + N +IP Y RP
Sbjct: 534 ASAIVQAWHLGVQSGNAIADVLFGDFNPSGRLPVT-FPQNVGQIPIYHYRKKTGRP---- 588
Query: 586 PGRTYK--FFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
P Y + D +YPFGYGL+YT+F+Y + KS ++G
Sbjct: 589 PAGAYSSYYIDSTTEPLYPFGYGLTYTEFEYGAIQTSKS----------------SIGA- 631
Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYER 700
D + + + N+G + G EVV Y + + T +K+++ + +
Sbjct: 632 ---------------DEQLDVTVSIRNVGNLAGEEVVQCYVRDEVASVTQPLKRLVAFRK 676
Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVG 746
V +AAG+S V FT+ A + L I+D + G T+ +G G
Sbjct: 677 VKVAAGESVDVTFTIGAAE-LAILDKHMKRTVEPGDFTLWIGPSAG 721
>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 894
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 163/450 (36%), Positives = 236/450 (52%), Gaps = 52/450 (11%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + LP RA+DLV RMTL EK Q+ + A +PRL +P Y WWSEALHGV+
Sbjct: 39 YLNPSLPPVVRARDLVSRMTLKEKASQLVNAARAIPRLKVPAYNWWSEALHGVA------ 92
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
V G T FP I A+F+ ++ + TE R +Y
Sbjct: 93 -----------VNGTTEFPEPIGLGATFDVPAIHEMAVDIGTEGRVVYEENEKDGSSKIF 141
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GL FW+PN+N+ RDPRWGR ET GEDP++ G+ + +V G+Q + +Y+R
Sbjct: 142 HGLDFWAPNLNIFRDPRWGRGQETYGEDPFLTGKMGVAFVSGMQG-DNPKYYR------- 193
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ A KH+ D+ + R D V+ D +T+ F + +G SVMCSYN
Sbjct: 194 -VIATPKHF---DVHSGPEPTRHFADVDVSLHDQLDTYEPAFRAAIMQGHADSVMCSYNA 249
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
+NG P CA+ L +RG W F GY+VSDCD++ I HK+ T A A ++ G+
Sbjct: 250 INGQPACANQFTLQHQLRGAWGFKGYVVSDCDAVHDIYSGHKY-RPTLAQAAAISMERGM 308
Query: 306 DLDCGDY--------YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYK 355
D DC D+ Y + + AVQQG +++ +DT+L L+ ++LG FD G Y
Sbjct: 309 DNDCADFAQPKGDDDYKAY-IDAVQQGYLSQQAMDTALVRLFTARIKLGLFDPKGMDPYA 367
Query: 356 NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
+ + + +P H A + A + +VLLKND G LPL G++ ++A+VGP A+ T ++GN
Sbjct: 368 DTPHSELNSPAHRAYARKLADESMVLLKND-GTLPLKPGSVHSIAVVGPLADQTAVLLGN 426
Query: 416 YEGTPCRYTSPMDGFYAY--SKVINYAPGC 443
Y G P S ++G A + I Y PG
Sbjct: 427 YNGVPTHTVSFLEGLRAEYPNTKITYVPGT 456
Score = 125 bits (314), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 149/305 (48%), Gaps = 52/305 (17%)
Query: 450 NNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINK 499
+N+ PAA+ AAK AD + V G+ +E E G DR +L +P + L+
Sbjct: 609 DNTPSPAAVAAAKKADVVIAVVGITSKLEGEEMPVDQPGFLGGDRTNLQMPEPEEALVEA 668
Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
VA K PV +V+M+ A+ +N+ + ++L Y GEEGG AIAD + GK +P GR
Sbjct: 669 VAKTGK-PVVVVLMNGSALAVNWISQH--ANAVLEAWYSGEEGGAAIADTLSGKNDPAGR 725
Query: 560 LPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPK 619
LP+T+Y++ +P + RTY++F G +YPFGYGLSYT F+Y S P
Sbjct: 726 LPVTFYKS------VNQLPNFEDYSMENRTYRYFKGKPLYPFGYGLSYTTFRYSDLSIPH 779
Query: 620 SVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVM 679
+ TV +P A+ V N GK+ G EVV
Sbjct: 780 A----------------TVDAGQPVEASA----------------TVTNTGKVAGDEVVQ 807
Query: 680 VYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
+Y K P + G + G++R+ + GQS +V F + + L +V ++A G +T+
Sbjct: 808 LYLKFPKVDGAPDIALRGFQRIHLEPGQSQQVHFELKK-RDLSMVTALGQIIVAQGDYTL 866
Query: 740 LVGEG 744
+G G
Sbjct: 867 SIGGG 871
>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 904
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 166/421 (39%), Positives = 228/421 (54%), Gaps = 43/421 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +MT EK+ Q + A +PRLG+P YEWWSE LHG++ G
Sbjct: 54 DRATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGE----------- 102
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA---------GLTFWSP 133
AT FP I AS+N L +G STEARA +NL GLT WSP
Sbjct: 103 -----ATVFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKNHARYGGLTIWSP 157
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDPY+ G+ A+ ++ GLQ D + P I A KH
Sbjct: 158 NINIFRDPRWGRGMETYGEDPYLTGQLAVGFIHGLQG--------DDPTHPRTI-ATPKH 208
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + + + R FD V+ D + T+ F + EG SVMC+YN ++GIP CA
Sbjct: 209 LAVH---SGPESGRHGFDVDVSPHDFEATYSPAFRAAIVEGHAGSVMCAYNALHGIPACA 265
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
L++ +RG+W F G++VSDCD+I + + H + + A LKAG DL+CG Y
Sbjct: 266 ADWLIDGRVRGNWGFKGFVVSDCDAIDDMTQFH-YYRADNAGSAAAALKAGHDLNCGYAY 324
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
+ A+ +G+ EA +D SL L+ RLG + Y LG +I +P H LA
Sbjct: 325 RDLGT-ALDRGEAEEAMLDRSLVRLFAARYRLGELQPRSKDPYARLGAKDIDSPTHRALA 383
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA+Q +VLL+N N LPL G LA++GP+A+A A+ NY+GT +P+ G
Sbjct: 384 LQAAQQSLVLLQNRNDTLPLRPG--LRLAVIGPNADALAALEANYQGTSVAPVTPLQGLR 441
Query: 432 A 432
A
Sbjct: 442 A 442
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/278 (32%), Positives = 139/278 (50%), Gaps = 45/278 (16%)
Query: 470 VAGLDLSVEA---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V G +L ++ +G DR DL LP Q L+ + A A+ P+ +V+MS AV +N+AK +
Sbjct: 646 VEGEELRIDVPGFDGGDRNDLSLPAAQQALLER-AKASGKPLIVVLMSGSAVALNWAKQH 704
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
+IL YPG+ GG AIA + G NPGGRLP+T+Y + PY S ++
Sbjct: 705 --ADAILAAWYPGQSGGTAIAQALAGDINPGGRLPVTFYRSTKDLPPYVSYDMK------ 756
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY++F G ++PFGYGLSYT F Y ++P+ L Q D + T
Sbjct: 757 GRTYRYFKGEALFPFGYGLSYTHFAY---TAPQLSSTTL----QAGDTLHVTTT------ 803
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAG 706
V N G G EVV VY + P A + ++ ++G++RV + G
Sbjct: 804 -------------------VRNTGARAGDEVVQVYLQYPPRAQSPLRALVGFQRVSLQPG 844
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
++ + F + + L VD + + +G + + VG G
Sbjct: 845 EARTLSFALEP-RQLSDVDRSGQRAVEAGDYRLFVGGG 881
>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
Length = 853
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 161/421 (38%), Positives = 232/421 (55%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ R+T+ EK+ + + G+PRLG+ Y +EALHGV GR
Sbjct: 30 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 87
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L K++ +S EARA +N + G
Sbjct: 88 --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 133
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G +V GLQ D
Sbjct: 134 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG---------DDPH 184
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E + FEMCV EG +S+M +Y
Sbjct: 185 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAY 240
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P +P LL + +R DW F GY+VSDC +V +HK++ TKE A +KA
Sbjct: 241 NALNDVPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKA 299
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + A +Q +++ADID++ + M+LG FD + Y + +
Sbjct: 300 GLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSGERNPYTKISPS 359
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H ++A +AARQ IVLLKN LPLN +K++A+VG NA K G+Y G P
Sbjct: 360 VIGSKEHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAP 417
Query: 421 C 421
Sbjct: 418 V 418
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 152/300 (50%), Gaps = 49/300 (16%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + + V V G++ S+E EG+DR D+ LP Q E + ++ P +V++ AG+ +
Sbjct: 600 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 657
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
IN+ + I +I+ YPGE+GG A+A+V+FG YNP GRLP+T+Y++ ++P P
Sbjct: 658 AINWMDEH--IPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-LDELP----P 710
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
+ GRTYK+F G V+YPFGYGLSY+ F Y D Q +D V
Sbjct: 711 FDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTY--------------SDLQVKD---GV 753
Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIG 697
G + T ++N GK +G EV VY + P G +K++ G
Sbjct: 754 G-------------------EVTVSFRLKNTGKRNGDEVAQVYVRIPETGGIVPLKELKG 794
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVD-NAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
+ RV + +G+S +V +N + L+ D ++ GA ++VG + ++L
Sbjct: 795 FRRVPLKSGESRRVEIKLNK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKDIRLQTVIDL 853
>gi|365121914|ref|ZP_09338824.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643627|gb|EHL82934.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1073
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 162/434 (37%), Positives = 236/434 (54%), Gaps = 45/434 (10%)
Query: 1 RFESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWS 60
+ S V ++P+ D L + ER KDL+ R+ + EK+ + + +PRLG+ Y +
Sbjct: 16 QISSFAVAQINYPFRDTTLSHHERIKDLLSRLNVSEKISLLRATSPAIPRLGIDKYYHGN 75
Query: 61 EALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAM 120
EALHGV G+ T FP I + +N +++ +S EAR
Sbjct: 76 EALHGVVRPGK----------------FTVFPQAIGLASMWNPDFLQEVSTAISDEARGR 119
Query: 121 YNLGNAG----------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQD 170
+N N G LTFWSP IN+ RDPRWGR ET GEDP++ G +VRGLQ
Sbjct: 120 WNELNQGKDQTAGASDLLTFWSPTINMARDPRWGRTPETYGEDPFLTGTLGTAFVRGLQG 179
Query: 171 VEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMC 230
+D + +K+ + KH+AA N E ++R ++ ++E+D++E + FE C
Sbjct: 180 ---------NDPKYIKVVSTPKHFAA----NNEEHNRASGNAVISERDLREYYFPAFEKC 226
Query: 231 VNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLN 290
+ EG SVM +YN VNGIP + LL +R DW F GY+VSDC + + IV H ++
Sbjct: 227 IKEGQAQSVMSAYNAVNGIPCTLNKWLLTDVLRDDWGFDGYVVSDCSAPEYIVSQHHYV- 285
Query: 291 DTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
DT E+A + +KAGLDL+CGD Y + A +G + ++ID++ + MRLG FD
Sbjct: 286 DTYEEAASLCIKAGLDLECGDNVYITPLLNAYNRGMVTMSEIDSAAYRVLRGRMRLGLFD 345
Query: 350 GSPQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
+ Y + + + +H ELA EAARQ +VLLKND LP+ T NIK++A+VG N
Sbjct: 346 DPNENPYNKISPSIVGCEKHRELALEAARQSLVLLKNDKDMLPIQTDNIKSIAVVG--IN 403
Query: 408 ATKAMIGNYEGTPC 421
A G+Y GTP
Sbjct: 404 AANCEFGDYSGTPV 417
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 147/298 (49%), Gaps = 50/298 (16%)
Query: 450 NNSMIPAAIDAA---KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG 506
+ S++ A DA + +D T+ V G+D ++E EG+DR + LP Q I + A
Sbjct: 726 SESLLDAYGDAGEIIRGSDLTIAVLGIDRTIEREGQDRSTIELPEDQQIFIEEAYKA--N 783
Query: 507 PVTLVIMSAGA-VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
P T+V++ AG+ + IN+ N I ++L YPGE+GG A+A+ +FG YNPGGRLP+T+Y
Sbjct: 784 PNTVVVLVAGSSLAINWIDQN--IPAVLDAWYPGEQGGTAVAEALFGDYNPGGRLPLTFY 841
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ + +R NN RTY +F+G +YPFGYGLSYT F Y + +D+
Sbjct: 842 NSLSDLPAFDDYNVR--NN---RTYMYFEGKPLYPFGYGLSYTDFAY------RGLDVTQ 890
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
D++ T + V N G DG EV VY + P
Sbjct: 891 DEEN------------------------------VTVKFFVSNTGNYDGDEVAQVYIQFP 920
Query: 686 GIAGT-HIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
T +KQ+ G++RV I+ GQ ++ + + +N + G + LVG
Sbjct: 921 DQGTTLPLKQLKGFKRVHISKGQETEITVRIPKKELRLWSENNSEFYTPEGNYIFLVG 978
>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 876
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 162/454 (35%), Positives = 240/454 (52%), Gaps = 53/454 (11%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
K +F + + L + +R DLV R+TL EKV QM + + +PRL +P Y+WW+E LHGV+
Sbjct: 25 KQKEFLFQNPDLSFEKRVDDLVNRLTLEEKVSQMLNSSPAIPRLDIPAYDWWNETLHGVA 84
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--- 124
T F T +P I A+F+++ K+ + E RA+YN
Sbjct: 85 ----------RTPFK-----VTVYPQAIAMAATFDKNSLYKMADFSALEGRAIYNKAVES 129
Query: 125 ------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
GLT+W+PNIN+ RDPRWGR ET GEDPY+ G ++V+GLQ
Sbjct: 130 GRTNERYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGVLGDSFVKGLQG-------- 181
Query: 179 DSDSRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
D + LK +AC KHYA + G + R FD VT ++ +T++ F+ V E V
Sbjct: 182 -DDPKYLKAAACAKHYAVH-----SGPEPLRHTFDVDVTPYELWDTYLPAFQKLVTESKV 235
Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
+ VMC+YN P CA L+ +R W F GY+ SDC +I ++HK D E A
Sbjct: 236 AGVMCAYNAFRTQPCCASDILMTDILRNQWKFEGYVTSDCWAIDDFFKNHKTHPDA-ESA 294
Query: 297 VARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
A + G D+DCG + AV+ GKI+E ID S++ L+++ RLG FD +Y
Sbjct: 295 SADAVFHGTDIDCGTDAYKALVQAVKDGKISEKQIDISVKRLFMIRFRLGMFDPVEMVKY 354
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
+ + N +H A + ARQ IVLL+N+N LPL + +K + ++GP+ + A++G
Sbjct: 355 AQTPTSVLENDEHKAHALKMARQSIVLLRNENKTLPL-SKKLKKIVVLGPNVDNAIAILG 413
Query: 415 NYEGTPCRYTSPMDGF---------YAYSKVINY 439
NY GTP + T+ ++G Y K +N+
Sbjct: 414 NYNGTPSKLTTVLEGIKEKVGSNTEVVYEKAVNF 447
Score = 125 bits (315), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 139/296 (46%), Gaps = 54/296 (18%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
++ K+ADA V V G+ +E E G DR +LLP QT+L+ + K P
Sbjct: 602 VNRVKDADAFVFVGGISPQLEGEEMKVNFPGFKGGDRTSILLPKIQTDLMKALKTTGK-P 660
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
+ V+M+ A+ I + N I +I Y G+ G A+ADV+FG YNP GRLP+T+Y++
Sbjct: 661 IVFVMMTGSAIAIPWEAEN--IPAIANAWYGGQAAGTAVADVLFGNYNPAGRLPVTFYKS 718
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
+ P+ + RTY++F G +Y FGYGLSYT FKY SV
Sbjct: 719 DADLSPFVDYKM------DNRTYRYFKGKPLYGFGYGLSYTTFKYDNLKIAPSV------ 766
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
+K K+ T ++V N GK+ G EVV +Y
Sbjct: 767 -------------------------IKGKNVPIT--VKVTNTGKVSGEEVVQLYVINQNT 799
Query: 688 A-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
A +K + G+ER+ + AG+S + FT+ + + L + N +G I +G
Sbjct: 800 AIKAPLKTLKGFERISLKAGKSKTITFTL-SPEDLSYITAEGNHQQYNGKIKIAIG 854
>gi|333377782|ref|ZP_08469515.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
gi|332883802|gb|EGK04082.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
Length = 727
Score = 275 bits (704), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 223/781 (28%), Positives = 361/781 (46%), Gaps = 114/781 (14%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ + L +R +L+ MT+ EK+ + GVPRLG+ SE LHG++ G
Sbjct: 24 YPFQNTSLSDEKRLDNLLSIMTIDEKINALS-TNLGVPRLGI-RNTGHSEGLHGMALGG- 80
Query: 72 RTNSPPGT---------HFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEAR---- 118
PG +V T+FP +++ L KK+ +TE R
Sbjct: 81 -----PGNWGGFKMVNYQRVPDVYPTTTFPQAYGLGETWDTELIKKVADIEATEIRYYTQ 135
Query: 119 -AMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYH 177
Y G GL +PN ++ RDPRWGR E+ GEDP++V A+ +++GLQ
Sbjct: 136 NERYTKG--GLVMRAPNADLARDPRWGRTEESFGEDPFLVSEMAVAFIKGLQ-------- 185
Query: 178 RDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVS 237
+ R K ++ KH+ A ++ + +FD+R+ E + PF + +G
Sbjct: 186 -GENPRYWKSASLMKHFLANSNEDGRDSTSSNFDNRL----FHEYYSYPFRKGIEKGGSQ 240
Query: 238 SVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV 297
+ M +YN N IP P L + IR DWNF G I +D ++ ++++HK T +
Sbjct: 241 AFMAAYNSWNEIPMTIHPIL--KKIRKDWNFKGIICTDGGALDLLIKAHKTF-PTHTEGS 297
Query: 298 ARVLKAGLDLDCGDYYTNF---TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ- 353
A ++KAG+ G + NF A+++G + EA+ID ++R + + ++LG DG
Sbjct: 298 AAIVKAGV----GQFLDNFRPYIYQALEKGMLTEAEIDKAIRGNFYIALKLGLLDGDQTK 353
Query: 354 --YKNLGKNNIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
Y ++G + N + + + +VLLKN+ LPLN GNIK +A++GP AN
Sbjct: 354 LPYAHIGVTDTVSVWRNKEIQDFVRLVTAKSVVLLKNEKKLLPLNKGNIKRIAVIGPRAN 413
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADAT 467
+ ++ Y GTP S + G + N +++ ++++ I A AA+ AD
Sbjct: 414 --EVLLDWYSGTPPYTVSILQG------IKNAVGNNVEVIYESSNEIDKAYLAAQKADIA 465
Query: 468 VIVAGLDL----------SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA 517
++ G + V ++G++ VD + E + K+ A +V++S+
Sbjct: 466 IVCVGNHVYGTDPKWKYSPVPSDGREAVDRKALSLEQEDLVKIVHKANPNTVMVLVSSFP 525
Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
IN+++ N I +IL + +E G +ADVIFG YNP GR TW ++ +P
Sbjct: 526 FAINWSQEN--IPAILHITNNSQELGNGLADVIFGNYNPAGRTNQTWVKS-IADLP---- 578
Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
P+ + GRTY + +YPFGYGLSYT F Y D+ L + N
Sbjct: 579 PMMDYDIRNGRTYMYAKEKPLYPFGYGLSYTNFTYS--------DMALSSSALSKGKNLK 630
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVI 696
V N V+N G MDG EV +Y S P IKQ+
Sbjct: 631 VSVN------------------------VKNTGDMDGEEVAQLYVSFPQSKVVRPIKQLK 666
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVGEGVGGVSFPLQLN 755
G++R+ I G+S FT++A L DN +S ++ IL+G + ++
Sbjct: 667 GFDRISIKKGESKTFEFTLSA-DDLAYWDNDKDSFVIEPETVNILIGSSSEDIRLTKEIQ 725
Query: 756 L 756
L
Sbjct: 726 L 726
>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
VPI-5482]
Length = 853
Score = 275 bits (704), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 160/421 (38%), Positives = 232/421 (55%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ R+T+ EK+ + + G+PRLG+ Y +EALHGV GR
Sbjct: 30 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 87
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L K++ +S EARA +N + G
Sbjct: 88 --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 133
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G +V GLQ D
Sbjct: 134 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG---------DDPH 184
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E + FEMCV EG +S+M +Y
Sbjct: 185 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAY 240
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P +P LL + +R DW F GY+VSDC +V +HK++ TKE A +KA
Sbjct: 241 NALNDVPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKA 299
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + A +Q +++ADID++ + M+LG FD + Y + +
Sbjct: 300 GLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSGERNPYTKISPS 359
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H ++A +AARQ +VLLKN LPLN +K++A+VG NA K G+Y G P
Sbjct: 360 VIGSKEHQQIALDAARQCVVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAP 417
Query: 421 C 421
Sbjct: 418 V 418
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 150/300 (50%), Gaps = 49/300 (16%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + + V V G++ S+E EG+DR D+ LP Q E + ++ P +V++ AG+ +
Sbjct: 600 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 657
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
IN+ + I +I+ YPGE+GG A+A+V+FG YNP GRLP+T+Y++ ++P P
Sbjct: 658 AINWMDEH--IPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-LDELP----P 710
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
+ GRTYK+F G V+YPFGYGLSY+ F Y D Q +D V
Sbjct: 711 FDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTY--------------SDLQVKDGGGEV 756
Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIG 697
T ++N GK +G EV VY + P G +K++ G
Sbjct: 757 ----------------------TVSFRLKNTGKRNGDEVAQVYVRIPETGGIVPLKELKG 794
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVD-NAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
+ RV + +G+S +V ++ + L+ D ++ GA ++VG + ++L
Sbjct: 795 FRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKDIRLQTVIDL 853
>gi|383123909|ref|ZP_09944579.1| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
gi|382983834|gb|EES66944.2| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
Length = 815
Score = 275 bits (703), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 227/723 (31%), Positives = 343/723 (47%), Gaps = 114/723 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ EA HG IG T FPT I A+++ L +++
Sbjct: 160 RLGIPLF-LAEEAPHGHMAIG-----------------TTVFPTGIGMAATWSPVLIEEV 201
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G ++ E R+ + P +++ RDPRW RV ET GEDP + GR V GL
Sbjct: 202 GNVIAKEIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVIGLG 256
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
SR A KH+ AY + EG ++ S V +D+ E F+ PF+
Sbjct: 257 S--------GDLSREYATIATLKHFLAYAVP--EGGQNGNYAS-VGTRDLHENFLPPFQE 305
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++GIP A+ LL Q +R +W F G++VSD SI+ + ESH F+
Sbjct: 306 AIDAGALS-VMTSYNSIDGIPCTANYYLLTQLLRNEWRFRGFVVSDLYSIEGVHESH-FV 363
Query: 290 NDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
T E+A +V+ AG+D+D G+ + N T AVQ GKI+EA IDT++ + + +G F
Sbjct: 364 APTIEEAAMQVVSAGVDIDLGGNAFMNLTH-AVQSGKISEAVIDTAVCRVLRMKFEMGLF 422
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + + +HI LA + A+ IVLLKN N LPLN IK +A+VGP+A+
Sbjct: 423 EHPYVNPKSATKVVRSEEHIRLAHKVAQSSIVLLKNKNSILPLNK-KIKKVAVVGPNADN 481
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKN 463
M+G+Y + +DG + SKV Y GCA I + I ++AA
Sbjct: 482 RYNMLGDYTAPQEDENIKTVLDGVISKLSPSKV-EYVRGCA-IRDTTVNEIAEVVEAASR 539
Query: 464 ADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINKV 500
++ + V G + + EG DR L L G Q +L+N +
Sbjct: 540 SEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLNAL 599
Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
K P+ +V + +D +A ++L YPG+EGG AIADV+FG YNP GRL
Sbjct: 600 KATGK-PLIVVYIEGRPLDKVWASEYA--DALLTASYPGQEGGYAIADVLFGDYNPAGRL 656
Query: 561 PITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
P++ + +IP P N+ Y +Y FGYGLSYT F+Y
Sbjct: 657 PVS-VPRSVGQIPVYYNKKAPCNH----DYVEQAASPLYTFGYGLSYTTFEYS------- 704
Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
D+++ + K PC F +V+N G DG EV +
Sbjct: 705 -DLQVIR--------------KSPCY-------------FEVSFKVKNTGSYDGEEVAQL 736
Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
Y + + ++Q+ +ER F+ G+ ++ FT+ K L I+D ++ +G I
Sbjct: 737 YLRDEYASVVQPLRQLKCFERFFLKRGEEKEIFFTLTE-KDLSIIDRNMARVVETGDFRI 795
Query: 740 LVG 742
++G
Sbjct: 796 MIG 798
>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
Length = 819
Score = 275 bits (703), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 234/758 (30%), Positives = 337/758 (44%), Gaps = 124/758 (16%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
D Y DA LP +R L + + L + + Q+ + A V + LP Y W ++ HGV
Sbjct: 68 DGTYLDASLPEADRLAWLADNVPLEDMIGQLVNAAPAVDAVDLPAYNWLNDNEHGVKGTA 127
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-----LGN 125
T P G AS++ L ++G + E+RA +N GN
Sbjct: 128 HATVYPMGASLG----------------ASWSVDLAWRVGAAIGNESRATHNGLADKSGN 171
Query: 126 A--------------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDV 171
A G+T ++PN+N+VRDPRWGR E GEDP++ A+ V GLQ
Sbjct: 172 ACGSTSTGEVVANGCGITLYAPNVNLVRDPRWGRAEEVYGEDPHLTAELAVGMVTGLQG- 230
Query: 172 EGVEYHRDSDSRPLKISACCKHYAAY-------DLDNWEGNDRFHFDSRVTEQDMQETFI 224
PL ACCKH+AA+ DL DR D+ V+ +D+ ET++
Sbjct: 231 NAEGSTSGPGGGPLVTGACCKHFAAHFAVYQNEDLPA----DRMVLDANVSSRDLWETYL 286
Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
+ CV V VNG PTCA P+LLN +R W F G++VSD D+ +V
Sbjct: 287 PVMKACV-------VRAKATHVNGKPTCAHPELLNDVLRESWGFDGFVVSDYDAWSNLVT 339
Query: 285 SHKFLNDTKEDAVARVLKAGLDLDC--GDYY-TNFTMGAVQQGKIAEADIDTSLRFLYIV 341
+HK+++ T E+A A + AG+D + GDY + AV+ G +A A + S L V
Sbjct: 340 THKYVS-TWEEAAAAGINAGMDQEGGFGDYSPVDALPDAVRNGTVAAATVRRSFERLMRV 398
Query: 342 LMRLGYFDGSPQYKNLGKNNICNPQ-----HIELAAEAARQGIVLLKNDNGALPLNTGNI 396
+RLG FD G+ C+ Q + LA EAAR+GIVL KN GALPL G
Sbjct: 399 RLRLGMFDPPASTAVYGEAYQCDYQCETAAKLALAREAAREGIVLFKNAGGALPLAKG-- 456
Query: 397 KTLALVGPHANATKAMIG--NY---EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNN 451
+ALVGP + + ++G NY +G + G A + V + A GC + C
Sbjct: 457 ARIALVGPQVDDWRVLLGAVNYAFEDGPDVAPVTIQKGLEAVANV-SVAAGCDSVACAAL 515
Query: 452 SMIPAAI--------------DAAKNADATVIVAGL-DLSVEAEGKDRVDLLLPGFQTEL 496
+ A D+ D + G D E+E DR + LPG Q L
Sbjct: 516 VDVDGAKRLAAAADATVVVLGDSFGATDGWPLCRGTRDDGCESESHDRATIELPGEQVAL 575
Query: 497 INKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
+ + A+ V +++ A + + LWV PG+ GG A+ADV+FG Y+P
Sbjct: 576 VAALRAASSRLVCVLVHGGAVALGAAADDCDAVLD-LWV--PGQMGGAALADVLFGDYSP 632
Query: 557 GGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV-VYPFGYGLSYTQFKYKVA 615
GR PIT Y A P + G TY+++ GP Y FG GLSY F Y A
Sbjct: 633 AGRSPITMYAATSDLPPMGVFDEYAGESSNGTTYRYYAGPAPTYAFGDGLSYASFSYAWA 692
Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGS 675
++P T C A+ + ++ V N G +
Sbjct: 693 AAPP--------------------TTVDACGAIRL------------RVAVTNTGSVASD 720
Query: 676 EVVMVYSK-PPGIAGTHIKQVIGYERV-FIAAGQSAKV 711
EVV VY++ P +++ ++RV IA G +A V
Sbjct: 721 EVVQVYARVPDATVPAPAIRLVAFDRVRAIAPGATATV 758
>gi|46127231|ref|XP_388169.1| hypothetical protein FG07993.1 [Gibberella zeae PH-1]
Length = 712
Score = 275 bits (703), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 222/753 (29%), Positives = 342/753 (45%), Gaps = 133/753 (17%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD ERA LV +T EKV + A G PR+GLP Y WW+EALHGV+
Sbjct: 42 CDTTASPAERAAALVSALTPREKVNNLVSNATGAPRIGLPRYNWWNEALHGVA------- 94
Query: 75 SPPGTHFDSEVP--GATSFPTVILTTASFNESLWKKIGQTVSTEARA-MYNLGNAGLTFW 131
PG ++ + P ATSFP +L ++F++ L IG+ + TEARA G+ +W
Sbjct: 95 GAPGNDYNDKPPYDSATSFPMPLLMGSTFDDDLIHDIGEVIGTEARAWNNGGWGGGVDYW 154
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK----- 186
+PN+N +DPRWGR ETPGED V RYA +E RD+ +
Sbjct: 155 TPNVNPFKDPRWGRGSETPGEDALHVSRYA----------RAMECTRDAKVGSIMCSYNA 204
Query: 187 ---ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
I AC Y +QET +
Sbjct: 205 VNGIPACANSY------------------------LQETLLR------------------ 222
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
K N T +W I SDC ++Q I + H + T +A +
Sbjct: 223 ------------KHWNWTHTNNW-----ITSDCGAMQDIWQHHNY-TKTGAEAAKAAFEN 264
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDG-SPQYKNLGKNNI 362
G D C T + +QG + E +D +L+ L+ L+ G+FDG ++ +L +++
Sbjct: 265 GQDSSCEYTTTKDISDSYEQGLLTEKVMDRALKRLFEGLVHTGFFDGDKSEWSSLDFDDV 324
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
+LA ++A +G VLLKNDN LPLN +++AL+G A+ + G Y G
Sbjct: 325 NTRHAQDLALQSAVRGAVLLKNDN-TLPLNIKKKESVALIGFWADDKTKLQGGYSGPAPH 383
Query: 423 YTSPMDGFYAYSKVINYAPGCADIVCQNNSMIP-----AAIDAAKNADATVIVAGLDLSV 477
+P YA +K++ A NS +P A++AAK +D V + GLD +
Sbjct: 384 VRTPA---YA-AKMLGLNTNVAWGPTLQNSSVPDNWTTNALEAAKKSDYIVYLGGLDATA 439
Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
E +DR DL P Q L+ K+++ K P+ +V + D KN + SILWV Y
Sbjct: 440 AGEERDRTDLDWPSTQLTLLKKLSNLGK-PLVVVQLGDQVDDTPLLKNK-GVNSILWVNY 497
Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLRPVNNFPGRTYKFFDGP 596
PG+EGG A+ ++I G+ P GRLP+T Y + Y ++ M LRP + PGRTY+++
Sbjct: 498 PGQEGGTAVMELITGRKGPAGRLPLTQYPSKYTEQVGMLEMELRPTKSSPGRTYRWYSDS 557
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
V+ PFG+G YT FK S + +++ + K + D Y PP
Sbjct: 558 VL-PFGFGKHYTTFKAMFKS--QKIEMNIQKILKGCDATYVDTCPLPP------------ 602
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYER---VFIAAGQSAK 710
+ V+N G+ V +V+ + G G +K + Y R + A + +
Sbjct: 603 -----IHLSVKNTGRTTSDFVSLVFIQ--GKVGPKPYPLKTLAAYSRSHDIKPRATKDVE 655
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVGE 743
+ +TM+ ++ + + ++ G +T+L+ E
Sbjct: 656 LQWTMD---NIARREKNGDLVVYPGTYTLLLDE 685
>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 756
Score = 275 bits (703), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 205/670 (30%), Positives = 335/670 (50%), Gaps = 89/670 (13%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
G+T FP I +++N L ++ + E R+ SP IN+ RDPR GR
Sbjct: 147 GSTIFPQAIGMASTWNPELIYQVATAIGKETRS-----RGIHQVLSPTINIARDPRCGRT 201
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
ET GEDPY+ R A+ Y++G+Q+ +GV A KH+ A + + G D
Sbjct: 202 EETYGEDPYLASRMAVAYIKGVQE-QGV-------------IATPKHFVANFVGDG-GRD 246
Query: 207 RF--HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
+ HF R+ ++E + F + E S+M +YN ++GIP ++ LL + +R
Sbjct: 247 SYPIHFSERL----LREIYFPAFRASIEEAGALSLMAAYNSLDGIPCSSNKWLLTRILRK 302
Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM-GAVQQ 323
+W F GY+VSD S+ ++ HK + ++K +A L+AGLD++ D + G +++
Sbjct: 303 EWGFKGYVVSDYFSVLHLMTKHK-VAESKAEAAKLSLEAGLDMELPDSDCFEEIPGLIRE 361
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIELAAEAARQGIV 380
K+++ +D ++R + V +G FD P Y + N C+ +H ELA AR+ IV
Sbjct: 362 SKLSQDTLDEAVRRVLRVKFWIGLFDNPFVDPDYAE--RINDCS-EHRELALRVARESIV 418
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKV-I 437
LLKN+ G LPLN +I+++A++GP NA +G Y G + +P++G KV +
Sbjct: 419 LLKNE-GILPLNK-DIRSIAVIGP--NAAVPRLGGYSGYGVKVVTPLEGIKNKLGDKVKV 474
Query: 438 NYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-SVEAEGKDRVDLLLPGFQTEL 496
+A GC + + S AI A+ +D ++ G + E E +DR +L LPG Q +L
Sbjct: 475 YFAEGCG-LNDTSKSGFDEAIKIAQKSDVAILFMGNSVPETEGEQRDRHNLNLPGVQEDL 533
Query: 497 INKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
I ++ + PV +V+++ A I K+++++ YPGEEGG AIADV+FG YNP
Sbjct: 534 IKEICNT-NTPVIVVLINGSA--ITMMNWIDKVQAVIEAWYPGEEGGNAIADVLFGDYNP 590
Query: 557 GGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD---GPVVYPFGYGLSYTQFKYK 613
GG+LPI++ + + + +PL + GR + D ++PFGYGLSYT FKY
Sbjct: 591 GGKLPISFPKYS------SQLPLYYNHKPSGRVDDYVDLRGNQYLFPFGYGLSYTDFKYS 644
Query: 614 VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMD 673
N + + P +D + ++EN+GK
Sbjct: 645 ---------------------NLRITPEEIP-----------RDGEVVITFDIENIGKYK 672
Query: 674 GSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL 732
G EVV +Y IK++ +ERV + G+ V F +N + L+ + ++
Sbjct: 673 GDEVVQLYLHDEFASVARPIKELKRFERVTLDVGERKTVSFKLNR-RDLEFLSMDMELVV 731
Query: 733 ASGAHTILVG 742
G +L+G
Sbjct: 732 EPGRFEVLIG 741
>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
Length = 875
Score = 275 bits (703), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 160/449 (35%), Positives = 237/449 (52%), Gaps = 49/449 (10%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FP+ + L + ER ++LV ++TL EKV QM + A +PRLG+P Y+WW+E LHGV+
Sbjct: 27 FPFQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNETLHGVARTPF 86
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG------- 124
+T T FP I A+F+++ K+ + E RA+YN
Sbjct: 87 KT---------------TVFPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELNRTK 131
Query: 125 --NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
GLT+W+PNIN+ RDPRWGR ET GEDPY+ +V+GLQ D
Sbjct: 132 ERYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQG---------DDP 182
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
+ LK +AC KHYA + + R FD VT ++ +T++ F+ V V+ VMC+
Sbjct: 183 KYLKAAACAKHYAVHSGPE---SLRHTFDVDVTPYELWDTYLPAFKKLVTNSKVAGVMCA 239
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
YN P CA L+N +R W F GY+ SDC +I ++HK D + VL
Sbjct: 240 YNAFRTQPCCASDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHPDAASASADAVLH 299
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
G D+DCG + AV+ G+I E ID S++ L+++ RLG FD +Y +
Sbjct: 300 -GTDIDCGTDAYKSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMFDPVSMVKYAQTPSS 358
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H E A + ARQ IVLLKN+ LPL + +K + ++GP+A+ + +++GNY GTP
Sbjct: 359 VLESEEHKEHALKMARQSIVLLKNEKNTLPL-SKKLKKIVVLGPNADNSISILGNYNGTP 417
Query: 421 CRYTSPMDGF---------YAYSKVINYA 440
+ T+ + G Y K IN+
Sbjct: 418 SKLTTVLQGIKEKISPETEVVYEKAINFT 446
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 149/327 (45%), Gaps = 60/327 (18%)
Query: 433 YSKVINY--APGCADIVCQNNSMIPA----AIDAAKNADATVIVAGLDLSVEAE------ 480
Y V+ Y G A++ Q + I I+ KNADA + G+ +E E
Sbjct: 569 YKLVLEYWQGEGKAEVALQTGNFIKTDFANLIERHKNADAFIFAGGISPQLEGEEMPVDA 628
Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
G DR +LLP QT L+ + + K PV +IM+ A+ + + N I +IL +
Sbjct: 629 PGFNGGDRTSILLPEVQTRLLKALQSSGK-PVVFLIMTGSAIAVPWEAEN--IPAILNIW 685
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
Y G+ G A ADVIFG YNP GRLP+T+Y+ + + + +TY++F G
Sbjct: 686 YGGQSAGTASADVIFGDYNPAGRLPVTFYKGDSDLSSFVDYKM------DNKTYRYFKGI 739
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+Y FGYGLSYT+FKY +P K+ K Q
Sbjct: 740 PLYGFGYGLSYTEFKYSGLKTPD----KIKKGQPV------------------------- 770
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTM 715
T ++V N GKM+G EV +Y P + + +K + G+ER + GQS V FT+
Sbjct: 771 ----TISVKVTNTGKMEGEEVAQLYLINPNTSIKSPLKSLKGFERFNLKPGQSTVVNFTL 826
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
+ + L V + N G I VG
Sbjct: 827 SP-EDLSYVTESGNLKPYEGKIQIAVG 852
>gi|399029098|ref|ZP_10730151.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398073120|gb|EJL64304.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 744
Score = 275 bits (702), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 221/734 (30%), Positives = 338/734 (46%), Gaps = 143/734 (19%)
Query: 28 LVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
L+ +MTL EK+ + G+ + GV RLG+P + L I R +P G D
Sbjct: 52 LISQMTLEEKIGMLHGNSMFSNGGVKRLGIPELKMADGPLGVREEISRDNWAPAGLTNDF 111
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
AT +P A++N + G ++ E RA SP IN+VR P
Sbjct: 112 ----ATYYPAGGGLAATWNAEMAHTFGNSLGEELRA-----RDKDMLLSPAINMVRSPLG 162
Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
GR E EDP++ + A+ + GLQ+ + + AC KHYAA +N E
Sbjct: 163 GRTYEYMSEDPFLNKKIAVPLIVGLQEKD--------------VMACVKHYAA---NNQE 205
Query: 204 GNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIR 263
N F D ++ E+ ++E ++ FE V E S+M +YN+ G C + +LN+ +R
Sbjct: 206 TNRDF-VDVQIDERTLREIYLPAFEASVKEAKAYSIMGAYNKFRGEYLCENDYMLNKILR 264
Query: 264 GDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD-------YYTNF 316
+W F G +VSD ++ + A+ LK GLD++ G + +
Sbjct: 265 DEWGFKGVVVSDWAAVHS---------------TAKTLKNGLDIEMGTPKPFNEFFLADK 309
Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
+ AV+ G+++EA+ID ++ + VL ++ G + K +I H + A + A
Sbjct: 310 LIAAVKSGEVSEAEIDLHVKRILRVLFQVKAMGGGER----AKGSIATEAHYQDAYKIAS 365
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC-RYTSPMDGF---YA 432
+ +VLLKNDN ALPL +K++A++G +A A+ G G R +P++G
Sbjct: 366 EAVVLLKNDNNALPLKLDGVKSIAVIGNNATKKNALAGFGAGVKTKREITPLEGLKNRLP 425
Query: 433 YSKVINYAPGCADIVCQNN--------------------SMIPAAIDAAKNADATVIVAG 472
S INYA G + + N + + A++AAKN+D +I AG
Sbjct: 426 SSIKINYAEGYLERYEEKNKGNLGNITSSGPVTIDQLDPAKLQEAVEAAKNSDVAIIFAG 485
Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINFAKNNPKIKS 531
+ E E DR DL LP Q ELI KV A P T+V+M AGA DIN + + K +
Sbjct: 486 SNRDYETEASDRRDLHLPFGQEELIKKV--LAVNPKTIVVMIAGAPFDIN--EVSKKTSA 541
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM--PLRPVNNFPGRT 589
++W + G EGG A+ADV+ GK NP G+LP T +P M P N+FPG
Sbjct: 542 LVWSWFNGSEGGNALADVLLGKVNPSGKLPWT--------MPKNLMDSPAHATNSFPGGK 593
Query: 590 -----------YKFFDGPVV---YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN 635
Y++FD + YPFG+GLSYT F + A + K+ +
Sbjct: 594 EVNYAEGILIGYRWFDTKKIAPLYPFGFGLSYTTFAFDNAKTDKT--------------S 639
Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQV 695
Y V T ++V+N GK+DG EVV +Y+ T Q
Sbjct: 640 YAVTET------------------ITVSVDVKNTGKVDGKEVVQLYASKSDSKITRAAQE 681
Query: 696 I-GYERVFIAAGQS 708
+ G+++ + AG S
Sbjct: 682 LKGFQKTDVKAGGS 695
>gi|423302093|ref|ZP_17280116.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
CL09T03C10]
gi|408471184|gb|EKJ89716.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
CL09T03C10]
Length = 1039
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 233/810 (28%), Positives = 369/810 (45%), Gaps = 144/810 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y D P R +DL+ +MTL EK QM L YG R+ LP EW ++ G+ I
Sbjct: 145 YEDPSAPIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKNQLWKDGIGAI 203
Query: 70 GRRTNS------PPG-------------------------------THFDSE-VPG---- 87
N PP T F +E + G
Sbjct: 204 DEHLNGFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 263
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L ++IG EAR + G T ++P ++V RD RWGR
Sbjct: 264 KATNFPTQLGLGHTWNRQLLRQIGLITGREARML------GYTNVYAPILDVGRDQRWGR 317
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I V+G+Q H +++A KH+ AY +
Sbjct: 318 YEEVYGESPYLVAELGIEMVKGMQ-------HNH------QVAATGKHFIAYSNNKGARE 364
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + PF+ + E + VM SYN +G P + L +RGD
Sbjct: 365 GMARVDPQMSPREVEMIHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGD 424
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 425 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNIRCTFRSPDSYVLPLRELV 483
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
++G+++E I+ +R + V +G FD Q G + + + E+A +A+R+ IV
Sbjct: 484 KEGELSEEIINDRVRDILRVKFLVGLFDHPYQTDLKGADEEVEKASNEEIALQASRESIV 543
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVI 437
LLKND LPLN IK +A+ GP+A+ + +Y TS + G +
Sbjct: 544 LLKNDKNVLPLNASTIKKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKLGGKAEV 603
Query: 438 NYAPGCADI-------------VCQN-NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
Y GC + + +N I A+ K AD V+V G E K
Sbjct: 604 LYTKGCELVDANWPESELMEYPLSENEQEEIEKAVSQTKQADVAVVVLGGGQRTCGENKS 663
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +GG
Sbjct: 664 RSSLALPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGG 720
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV------ 597
+A+ADV+FG YNPGG+L +T + +IP+ + P +P + G +G +
Sbjct: 721 KAVADVLFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGLNGNMSRVNGA 778
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD----V 653
+YPFG+GLSYT F+Y D+K+ A++ + V
Sbjct: 779 LYPFGFGLSYTTFEYS--------DLKI-------------------SPAIITPNQKTYV 811
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVG 712
CK V N GK G EVV +Y + T+ K + G+ERV + G++ ++
Sbjct: 812 TCK---------VTNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEIT 862
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
F ++ K+L++++ + ++ G T+++G
Sbjct: 863 FPIDR-KALELLNADMHWVVEPGEFTLMIG 891
>gi|380694149|ref|ZP_09859008.1| glycoside hydrolase 3 [Bacteroides faecis MAJ27]
Length = 946
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 239/807 (29%), Positives = 369/807 (45%), Gaps = 138/807 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y D R +DL+ +MTL EK QM L YG R+ LP EW ++ G+ I
Sbjct: 53 YEDPTATIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKNQLWKDGIGAI 111
Query: 70 GRRTNS------PPG-------------------------------THFDSE-VPG---- 87
N PP T F +E + G
Sbjct: 112 DEHLNGFQQWGLPPSDNENIWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L ++G EAR + G T ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRRLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRG+Q H +I+A KH+ AY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QIAATGKHFIAYSNNKGARE 272
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ T + PF+ + E + VM SYN +G P + L +RG+
Sbjct: 273 GMARVDPQMSPREVEMTHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
++G ++E I+ +R + V +G FD Q G + + + E+A +A+R+ IV
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDHPYQIDLKGADEEVEKAANEEIALQASRESIV 451
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
LLKND LPL+ I+ +A+ GP+A+ + +Y TS + G K +
Sbjct: 452 LLKNDKNILPLDASGIQKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKGKAEV 511
Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
Y GC D+V N I A+D K AD V+V G E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPLTDEEQKEIEKAVDQTKQADVAVVVLGGGQRTCGENK 570
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ VA K PV LV+++ + IN+A + + +I+ YPG +G
Sbjct: 571 SRSSLDLPGRQLDLLKAVAATGK-PVVLVLINGRPLSINWA--DKFVPAIVEAWYPGSKG 627
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G+A+ADV+FG+YNPGG+L +T + +IP+ + P +P + G +G +
Sbjct: 628 GKAVADVLFGEYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGMEGNMSRANG 685
Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFGYGLSYT F+Y S K + +QQ V CK
Sbjct: 686 ALYPFGYGLSYTTFEY---SDLKISPAIITPNQQTF--------------------VTCK 722
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
V N GK G EVV +Y + T+ K + G+ERV + G++ +V F +
Sbjct: 723 ---------VTNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLQPGETKEVTFPI 773
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
+ K+L++++ + ++ G T++VG
Sbjct: 774 DR-KALELLNADMHWVVEPGDFTLMVG 799
>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
UW101]
gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 875
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 157/433 (36%), Positives = 234/433 (54%), Gaps = 40/433 (9%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
K DF + + L + +R DLV R+TL EKV QM + + + RLG+P Y+WW+E LHGV+
Sbjct: 23 KKYDFQFQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVA 82
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG--- 124
+T T +P I A+F+++ + + E RA+YN
Sbjct: 83 RTPFKT---------------TVYPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVEL 127
Query: 125 ------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
GLT+W+PNIN+ RDPRWGR ET GEDPY+ +V+GLQ
Sbjct: 128 KRTNERYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQG-------- 179
Query: 179 DSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSS 238
D + LK +AC KHYA + + + R FD VT ++ +T++ F + E +V+
Sbjct: 180 -DDPKYLKAAACAKHYAVH---SGPESLRHTFDVDVTPYELWDTYLPAFRKLITESNVAG 235
Query: 239 VMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
VMC+YN P CA L+N +R +W F GY+ SDC +I ++HK D E A A
Sbjct: 236 VMCAYNAFRTQPCCASDILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHPDA-ESAAA 294
Query: 299 RVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKN 356
+ G D+DCG + AV+ GKI+E ID S++ L+++ RLG FD +Y
Sbjct: 295 DAVFHGTDIDCGTDAYKALVQAVKNGKISEKQIDISVKRLFMIRFRLGMFDPVSMVKYAQ 354
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
+ + + +H A + ARQ IVLLKN+ LPLN N+K + ++GP+A+ +++GNY
Sbjct: 355 TPSSVLESKEHQLHALKMARQSIVLLKNEKNILPLNK-NLKKIVVLGPNADNAISILGNY 413
Query: 417 EGTPCRYTSPMDG 429
GTP + T+ + G
Sbjct: 414 NGTPSKLTTVLQG 426
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 137/301 (45%), Gaps = 59/301 (19%)
Query: 433 YSKVINY--APGCADIVCQNNSMIPA----AIDAAKNADATVIVAGLDLSVEAE------ 480
Y V+ Y G A++ Q + + I+ KNADA + G+ +E E
Sbjct: 569 YKIVLEYWQGEGKAEVSLQTGNFVKTNFADLIEHHKNADAFIFAGGISPQLEGEEMPVDF 628
Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
G DR +L P QT+L+ + + K PV +M+ A+ I + N I +IL +
Sbjct: 629 PGFKGGDRTSILFPEVQTKLLKALQSSGK-PVVFAMMTGSAIAIPWEAEN--IPAILNIW 685
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
Y G+ G A ADVIFG YNP GRLP+T+Y+ + + +P +TY++F G
Sbjct: 686 YGGQSAGTAAADVIFGDYNPAGRLPVTFYKND------SDLPSFVDYKMDNKTYRYFKGT 739
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+Y FGYGLSYT FKY +P +K+ K Q
Sbjct: 740 PLYGFGYGLSYTSFKYSDLKTP----VKIKKGQSV------------------------- 770
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTM 715
+ ++V N GK +G EV +Y A T +K + G+ER + G++ + F +
Sbjct: 771 ----SILVKVANTGKTEGEEVAQLYLINQDTAIKTPLKSLKGFERFNLKPGENKTITFNL 826
Query: 716 N 716
+
Sbjct: 827 S 827
>gi|404487205|ref|ZP_11022392.1| hypothetical protein HMPREF9448_02853 [Barnesiella intestinihominis
YIT 11860]
gi|404335701|gb|EJZ62170.1| hypothetical protein HMPREF9448_02853 [Barnesiella intestinihominis
YIT 11860]
Length = 860
Score = 275 bits (702), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 232/815 (28%), Positives = 363/815 (44%), Gaps = 158/815 (19%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM-----------------------GDL 44
K PY + LP ER +DL+ RMT+ EK+ Q+ G+
Sbjct: 23 KAQSLPYKNKNLPIEERVEDLLNRMTVDEKIAQIRHIHSSKIFNGQELDMKKLTDWAGNT 82
Query: 45 AYGV-------------------------PRLGLPLYEWWSEALHGVSFIGRRTNSPPGT 79
++G RLG+P++ +E+LHG
Sbjct: 83 SWGFVEGFPLTGDNCAKSMYLIQKYMVEKTRLGIPIFTV-AESLHG------------AV 129
Query: 80 HFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVR 139
H GAT +P I ++FN L +K Q +S + +M SP I+VVR
Sbjct: 130 H-----DGATIYPQNIALGSTFNPELARKKTQMISDDLHSM-----GFRQVLSPCIDVVR 179
Query: 140 DPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDL 199
D RWGRV E+ GEDPY+ G + I V G Y + IS KHY +
Sbjct: 180 DLRWGRVEESYGEDPYLCGLFGIEEVSG--------YLENG------ISPMLKHYGPH-- 223
Query: 200 DNWEGNDRFHFDSRVTE---QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
GN + E +D+ E ++ PFEM V + +VM +YN N IP A
Sbjct: 224 ----GNPLSGLNLASVECGLRDLHEIYLKPFEMVVKNTGILAVMSTYNSWNHIPNSASHY 279
Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
LL +R +W F GY+ SD +I+ + H F +A + + AGLD + F
Sbjct: 280 LLTDILRDEWGFKGYVYSDWGAIEMLKTLH-FTARNSSEAAIQAISAGLDAEASSKCYPF 338
Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
G +++G+ E +DT++R + +G F+ P K +P+ ++LA A
Sbjct: 339 LKGLIEKGQFDEKILDTAVRRVLFAKFAMGLFE-DPYGKTFKNRKRHSPESVKLAKTIAD 397
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY--TSPMDGF---Y 431
+ VLLKN+N LPL+ ++K++A++GP NA + G+Y + +P+ G
Sbjct: 398 ESTVLLKNENQLLPLDAKSLKSIAIIGP--NADQVQFGDYTWSRNNKDGVTPLQGIKNRV 455
Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---------LDLSVEAEGK 482
+ I+YA GC+ + + S I A++AAKN++ VI G S EG
Sbjct: 456 NKNTAIHYAKGCS-LTSLDTSGIAEAVEAAKNSEVAVIFGGSASAALARDYKSSTCGEGF 514
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
D DL L G Q++LI +V PV LV+++ I + KNN + +IL Y GE+
Sbjct: 515 DLNDLNLTGAQSQLIREVYRTGT-PVILVLVTGKPFVIEWEKNN--LPAILVQWYAGEQA 571
Query: 543 GRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMP------LRPVN-NFPGRTYKFFD 594
G +IAD++FG+ P GRL ++ ++ + Y +P P + + PGR Y F
Sbjct: 572 GNSIADILFGEVVPSGRLTFSFPRSTGHLPVYYNYLPSDRGFYKNPGSYDSPGRDYVFSA 631
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+Y FGYGLSYT F YK ++ DKD+ ++N T+
Sbjct: 632 PSALYSFGYGLSYTSFVYK--------NLSTDKDKY--ELNDTIHAT------------- 668
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
+EV+N GK G EVV +Y + T +KQ+ ++++ +A G++ V
Sbjct: 669 ---------VEVKNTGKYTGKEVVQLYVRDKASTYVTPVKQLRDFKKIELAPGETRTVQL 719
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
+ L +VD + +G + VG+ +
Sbjct: 720 QV-PISDLYLVDEKNQRFVEAGEFILEVGQASNNI 753
>gi|371776218|ref|ZP_09482540.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 774
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 198/658 (30%), Positives = 332/658 (50%), Gaps = 82/658 (12%)
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
T+FP + S++ L +K + + EA A +G+ + ++P +++ RDPRWGR++
Sbjct: 128 TTFPIPLAEACSWDLQLMEKSARIAAEEATA------SGVAWNFAPMVDISRDPRWGRIM 181
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GEDP++ A VRG Q G++ ++D S+P + AC KH+ Y G D
Sbjct: 182 EGAGEDPFLGSLIARARVRGFQ---GIDSYKDF-SKPNTMMACAKHFVGYGAAQ-AGRDY 236
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D ++E+ + ET++ PF+ V+EG V+S M ++N +NG+P + + +R WN
Sbjct: 237 HTVD--ISERTLFETYLPPFKAAVDEG-VASFMTAFNELNGVPCTGNKYIFQDILRHQWN 293
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
F+G +V+D +IQ +V +H F D K+ A + AG+D+D + + + V++G++
Sbjct: 294 FNGMVVTDYTAIQEMV-AHGFAKDLKQ-ASKLAIDAGIDMDMISEGFVTYLKELVEEGQV 351
Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQHIELAAEAARQGIVLLKN 384
+E ID ++ + + LG FD +Y K + NPQH++ A E A++ IVLLKN
Sbjct: 352 SEKQIDVAVARILEMKFLLGLFDDPYKYCDAEREKEVLMNPQHLQAAREVAQRSIVLLKN 411
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF---YAYSKV-IN 438
+N LPL K +AL+GP +++ G + +G + + +G YA + V N
Sbjct: 412 ENNVLPLRKDIPKRVALIGPFVKERESLNGEWAIKGDRSKSVTLWEGLQEKYADTPVRFN 471
Query: 439 YAPGCA----DIVCQNNSM--------IPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
YA G + D ++ S+ A+ AK +D ++ G E R D
Sbjct: 472 YAKGTSLPLIDGATRHVSLEQGFDKSGFAEALRVAKTSDLILVAMGEHYHWSGEAASRTD 531
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
+ LPG Q EL+ ++ K P+ LV+ + +D+++ N + +I+ YPG G A+
Sbjct: 532 ITLPGNQRELLKELKKTGK-PIVLVLFNGRPLDLSWEAEN--VDAIVEAWYPGIMAGHAV 588
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPL--RPVNNFPGRTYK--FFDGP--VVY 599
ADV+ G YNP RL +T + N +IP + +M RP + YK + D P ++
Sbjct: 589 ADVLSGDYNPSARLVVT-FPRNVGQIPIFYNMKNTGRPFDENHPADYKSSYIDSPNSPLF 647
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFG+GLSYT F+Y N T+ + K LI
Sbjct: 648 PFGFGLSYTSFQYD---------------------NATISSQKLTKGGSLI--------- 677
Query: 660 FTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
++V N G +DG EVV +Y G +K++ G++++F+ G++ V FT+N
Sbjct: 678 --VSVDVTNTGNVDGEEVVQLYIHDKVGSVTRPVKELKGFKKIFLKKGETKTVEFTIN 733
>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
Length = 721
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 222/733 (30%), Positives = 340/733 (46%), Gaps = 100/733 (13%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP--LYEWWSEALHGVSFI 69
+P+ + L +R DL+ RMTL EK+Q +GD GVPRLG+P L E E LHG +
Sbjct: 24 YPFQNPALSPDQRIDDLLSRMTLQEKIQALGDDP-GVPRLGIPGALTE---EGLHGAAIG 79
Query: 70 GRRTNSPPGTHFDSE---VPGATSFPTVILTTASFNESLWKKIGQTVSTEAR-AMYNLGN 125
G H++ V T FP +++ +L +K + E R A+ +
Sbjct: 80 GP-------AHWEGRGRAVVPTTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHD 132
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GL +PN N+ RDPRWGR E+ GEDPY+VG A+ +++GLQ ++ R
Sbjct: 133 GGLIVRAPNANLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQ---------GNNPRYW 183
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ +A KH+ AY + +F R+ E + +PF M + +G + M SYN
Sbjct: 184 ETAALMKHFDAYSNEANRDGSSSNFGKRL----FYEYYSVPFRMGIEQGHSDAFMTSYNA 239
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
NGIP A+P +L + W F+G I +D ++ +V +H T +A A + AG+
Sbjct: 240 WNGIPMTANP-VLKSVVMKKWGFNGIICTDAGALSNMV-THFHYYKTMPEAAAGAVHAGI 297
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
+ D Y A+QQ + E ID L+ +Y V++RLG D S Y +G N
Sbjct: 298 N-QFLDRYQQPVEEALQQKLLTEQQIDQDLKGVYRVVLRLGLMDPSSMSPYSMIGLTNDN 356
Query: 364 N--------PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
P HI L + + IVLLKN N ALPL+ + ++A++GP AN +
Sbjct: 357 PAKGDPWDWPSHIALDRKVTDESIVLLKNQNHALPLDAKKLHSIAVIGPWANIVA--LDW 414
Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
Y GTP +P++G P + + S + AA AK +D +++ G
Sbjct: 415 YSGTPPFGVTPVEGIRQ-----RVGPDV-KVTFNDGSNLQAAAALAKQSDEAIVIIGNHP 468
Query: 476 SVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
+ +A EGK+ D E I K AA +V+ ++ ++ + +
Sbjct: 469 TCDAGWGKCALPSEGKEAFDRTALNLPDESIAKAVYAANPHTVVVLQTSFPYTTDWTQAH 528
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
I +IL + + EE G A+ADV+FG Y+P GRL TW A+ ++P P+ N
Sbjct: 529 --IPAILEMAHNSEEQGTALADVLFGDYDPAGRLAQTWV-ASIGQLP----PMMDYNIRD 581
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
GRTY + +YPFG+GLSYT FKY N + ++ P
Sbjct: 582 GRTYMYLKSKPLYPFGFGLSYTTFKYS---------------------NLRLSSHTLPAG 620
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAA 705
+ T ++V N GK +G EVV +Y K ++ + G++RV I
Sbjct: 621 G-----------QLTVSVDVTNTGKYNGDEVVQMYVKHLDSKVSRPLEALKGFDRVSIPV 669
Query: 706 GQSAKVGFTMNAC 718
GQ+ V + A
Sbjct: 670 GQTRTVTLPLKAS 682
>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 882
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 171/451 (37%), Positives = 238/451 (52%), Gaps = 52/451 (11%)
Query: 20 PYPER-AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPG 78
P PE+ A LV +MT EK+ Q + A +PRLG+P Y+WWSE LHG++ G
Sbjct: 28 PSPEQHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY------- 80
Query: 79 THFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLT 129
AT FP I AS+N L + +G STEARA +NL AGLT
Sbjct: 81 ---------ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLT 131
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSPNIN+ RDPRWGR +ET GEDPY+ G+ A++++RGLQ D+ P I A
Sbjct: 132 LWSPNINIFRDPRWGRGMETYGEDPYLTGQLAVSFIRGLQG--------DTPDHPRTI-A 182
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
KH+A + R FD V+ D++ T+ F + +G SVMC+YN ++G
Sbjct: 183 TPKHFAVHSGPE---QGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGT 239
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CA LLN +R DW F+G++VSDCD+I+ + H F D A A LK+G DL+C
Sbjct: 240 PACASDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGDDLNC 298
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQH 367
G+ Y + A+ +G I E+ +D +L L+ RLG Y +G +I P H
Sbjct: 299 GNTYRDLNQ-AIARGDIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAH 357
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
LA +AA Q +VLLKN LPL TLA++GP A++ A+ NY+GT +P+
Sbjct: 358 RALALQAAAQSLVLLKNSGNTLPLPPET--TLAVLGPDADSLTALEANYQGTSSTPVTPL 415
Query: 428 DGFYA--------YSKVINYAPGCADIVCQN 450
G Y++ + APG + +
Sbjct: 416 TGLRTRFGTAKVHYAQGASLAPGVPSTIPET 446
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 136/295 (46%), Gaps = 52/295 (17%)
Query: 460 AAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVT 509
A +ADA V GL VE E G DR + LP Q L+ V K P+
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665
Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
+V+MS AV +N+A+++ +IL YPG+ GG AIA + G NPGGRLP+T+Y +
Sbjct: 666 VVLMSGSAVALNWAQHH--ADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYRSTQ 723
Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
PY S + GRTY++F G +YPFGYGLSYTQF Y+
Sbjct: 724 DLPPYISY------DMTGRTYRYFKGQPLYPFGYGLSYTQFAYEAP-------------- 763
Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
Q G T V N G G EVV +Y +PP
Sbjct: 764 QLSTATLKAGNT------------------LTVTAHVRNTGTRAGDEVVQLYLEPPYSPQ 805
Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
++ ++G++RV + G+S + FT++A + L V + +G + + VG G
Sbjct: 806 APLRSLVGFKRVTLRPGESRLLTFTLDA-RQLSGVQQTGQRSVEAGHYHLFVGGG 859
>gi|289577460|ref|YP_003476087.1| glycoside hydrolase [Thermoanaerobacter italicus Ab9]
gi|289527173|gb|ADD01525.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
italicus Ab9]
Length = 787
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 227/816 (27%), Positives = 376/816 (46%), Gaps = 156/816 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL---------- 63
Y D K P ++ ++L+ +MT+ EK+ Q+ G+ +YE + +
Sbjct: 6 YLDPKQPVEKKVENLLAQMTIEEKIAQLS---------GIWVYEILDDMMKFSYEKANRL 56
Query: 64 --HGVSFIGR---------------------------RTNSPPGTHFDS----EVPGATS 90
HG+ I R R P H +S GAT
Sbjct: 57 MTHGIGQITRLGGASNLSPQETVKIANQIQKYLVENTRLGIPALIHEESCSGYMAKGATI 116
Query: 91 FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETP 150
FP I +++N L +K+ + + +A+ +P ++V RDPRWGR ET
Sbjct: 117 FPQTIGVASTWNPKLVEKMASVIREQMKAV-----GARQALAPLLDVTRDPRWGRTEETF 171
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+V ++Y+RGLQ +++ + A KH+ Y N EG +
Sbjct: 172 GEDPYLVMHMGVSYIRGLQ----------TENLKEGVIATGKHFVGYG--NSEGGMNWA- 218
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
+ + +++ E F+ PFE V E + S+M Y+ ++GIP +LL +R +W F G
Sbjct: 219 PAHIPMRELYEIFLYPFEAAVKEAKLGSIMPGYHELDGIPCHKSKQLLTDILRKNWGFDG 278
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAE 328
+VSD +I + E H+ ++ KE A L+AG+D++ D Y ++QG I
Sbjct: 279 IVVSDYFAINQLYEYHRLASNKKE-AAKLALEAGVDVELPSTDCYGLPIKELIEQGDIDI 337
Query: 329 ADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQ-HIELAAEAARQGIVLLKNDNG 387
++ ++R + LG F+ +P I + Q +LA + A++ IVLLKN++
Sbjct: 338 DFVNDAVRRILKAKFLLGLFE-NPYVDEKRVVEIFDTQEQRQLAYKIAQESIVLLKNESN 396
Query: 388 ALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR-------------YTSPM-DGFYA- 432
LPL +++++A++GP+A+ + MIG+Y PC + +P+ +G A
Sbjct: 397 LLPLKK-DLQSIAVIGPNADNIRNMIGDY-AYPCHIESLLEMREKDNVFNTPLPEGLEAK 454
Query: 433 -------------------YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG- 472
+KVI YA GC D++ + + A++ AK AD ++V G
Sbjct: 455 DIYVPIVSVLQGIKEKVSPKTKVI-YAKGC-DVISDDTAGFNKAVEVAKQADVAIVVVGD 512
Query: 473 ----LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
D E +DR DL LPG Q ELI V + PV +V+++ + I++ K
Sbjct: 513 RAGLTDGCTSGESRDRADLNLPGVQEELIKAVYETGT-PVIVVLINGRPMSISWIAE--K 569
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPG 587
I +I+ PGEEGGRAIADVIFG YNPGG+LPI+ + + Y P N+ G
Sbjct: 570 IPAIIEAWLPGEEGGRAIADVIFGDYNPGGKLPISIPRSVGQLPVYYYHKPSGGRTNWKG 629
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
+ P +YPFGYGLSYT+F Y N ++ K
Sbjct: 630 DYVESSTKP-LYPFGYGLSYTEFLYS---------------------NLSISHPKVATQG 667
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAG 706
+I+ +V+N+GK+ G EVV +Y ++ T +K++ G++R+ + G
Sbjct: 668 GIIE----------ISADVKNIGKVKGDEVVQLYIHREFLSVTRPVKELKGFKRITLDVG 717
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ V F +++ + L + ++ G +++G
Sbjct: 718 EQKTVIFQLSS-EQLGFYNEEMEYVVEPGRVEVMIG 752
>gi|146298537|ref|YP_001193128.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
UW101]
gi|146152955|gb|ABQ03809.1| Candidate beta-glycosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 745
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 232/767 (30%), Positives = 351/767 (45%), Gaps = 141/767 (18%)
Query: 28 LVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
L+ +MTL EK+ + G+ + GV RLG+P + L I R +P G D
Sbjct: 53 LISQMTLEEKIGMLHGNSMFANAGVKRLGIPELKMADGPLGVREEISRDNWAPAGWTNDF 112
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
AT +P A++N + G ++ E RA SP IN+VR P
Sbjct: 113 ----ATYYPAGGALAATWNAEMAHTFGTSLGEELRA-----RDKDMLLSPAINMVRTPLG 163
Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
GR E EDP++ + A+ V GLQ+ + + AC KHYAA +N E
Sbjct: 164 GRTYEYMSEDPFLNKKIAVPLVVGLQEKD--------------VMACVKHYAA---NNQE 206
Query: 204 GNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIR 263
N F D ++ E+ ++E ++ FE V E S+M +YN+ G C + +LN+ +R
Sbjct: 207 TNRDF-VDVQIDERTLREIYLPAFEATVKEAKAYSIMGAYNKFRGEYLCENDYMLNKILR 265
Query: 264 GDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD-------YYTNF 316
+W F G +VSD ++ + A+ LK GLD++ G + +
Sbjct: 266 DEWGFKGVVVSDWAAVHS---------------TAKSLKNGLDIEMGTPKPFNEFFLADK 310
Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAAR 376
+ AV+ G+++E +ID ++ + VL ++ G + K +I H + A + A
Sbjct: 311 LIAAVKSGEVSEKEIDLHVKRILRVLFQVKAMGGGER----AKGSIATEAHYQDAYKIAA 366
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC-RYTSPMDGF---YA 432
+ I+LLKN+N ALPL +K++A++G +A A+ G G R +P++G
Sbjct: 367 EAIILLKNENNALPLKLDGVKSIAVIGNNATKKNALGGFGAGVKTKREVTPLEGLKNRLP 426
Query: 433 YSKVINYAPGCADIVCQNN--------------------SMIPAAIDAAKNADATVIVAG 472
S INYA G + + N + + A++AAK +D +I AG
Sbjct: 427 SSVKINYAEGYLEKYEEKNKGNLGNITSTGPVTIDKLDPAKVQEAVEAAKKSDVAIIFAG 486
Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDINFAKNNPKIKS 531
+ E E DR DL LP Q ELI KV +A P T+V+M AGA D+N + + K +
Sbjct: 487 SNRDYETEASDRRDLHLPFGQEELIKKVIEA--NPKTIVVMIAGAPFDLN--EVSQKSSA 542
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT-- 589
++W + G EGG A+ADVI GK NP G+LP W +K P N+FPG
Sbjct: 543 LVWSWFNGSEGGNALADVILGKVNPSGKLP--WTMPKQLK----DSPAHATNSFPGDKAV 596
Query: 590 ---------YKFFDGPVV---YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
Y++FD V YPFGYGLSYT F A K DKD +
Sbjct: 597 NYAEGILIGYRWFDTKNVAPLYPFGYGLSYTTFALDNA--------KTDKDSYAQ----- 643
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI- 696
+DV ++V+N GK+DG EVV +Y+ T Q +
Sbjct: 644 -------------NDV------IEVTVDVKNTGKVDGKEVVQLYTSKSDSKITRAAQELK 684
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
G+++ + AG S K+ + K L D AA + G +TI +G
Sbjct: 685 GFKKADVKAGGSEKITIKV-PVKELAYYDVAAKKWTVEPGKYTIKLG 730
>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 172/456 (37%), Positives = 240/456 (52%), Gaps = 49/456 (10%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY + L ERA DLV R+TL EK M + + +PRLG+ Y+WW+EALHGV G
Sbjct: 25 PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------- 125
AT FP I ASFN L + VS EARA +
Sbjct: 84 ---------------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKR 128
Query: 126 -AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLT W+PNIN+ RDPRWGR ET GEDPY+ G+ + VRGLQ EG +Y
Sbjct: 129 YQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYD------- 181
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ AC KHYA + W +R F++ + +D+ ET++ F+ V + V VMC+Y
Sbjct: 182 -KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAY 237
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLND-TKEDAVARVLK 302
NR G P C +LL Q +R +W + +VSDC +I D K+ A A+ +
Sbjct: 238 NRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKDAHETDPDKQHASAKAVL 297
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
+G D++CGD Y + AV++G I E ID SL+ L LG D Q + + +
Sbjct: 298 SGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIPYS 356
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H ELA AR+ +VLL+N+ LPLN N+K +A+VGP+AN + GNY G P
Sbjct: 357 VVDSKEHRELALRMARESLVLLQNNQSLLPLNK-NLK-VAVVGPNANDSVMQWGNYNGFP 414
Query: 421 CRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
+ ++G Y S++I Y PGC +D+ Q+
Sbjct: 415 SHTITLLEGIREYLPESQII-YEPGCDLTSDVTLQS 449
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 136/300 (45%), Gaps = 53/300 (17%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
+ +D K AD + G+ +VE E G DR + LP Q+ L+ ++ A
Sbjct: 590 LKQTVDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 649
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K +V ++ I + +IL YPG+ GG AIA+V+FG YNP GRLP+T
Sbjct: 650 GK---KIVFVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y++ +P + GRTY++ ++PFG+GLSYT F+Y AS S +I
Sbjct: 707 FYKST------KQLPDFEDYSMKGRTYRYMTENPLFPFGHGLSYTTFQYGNASLNTS-EI 759
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
K D +Q T I V N GK DG EVV VY +
Sbjct: 760 K-DGEQ------------------------------VTLTIPVSNTGKYDGEEVVQVYLR 788
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
PG + ++RV IA G + V ++ ++ + D + N++ G + IL G
Sbjct: 789 HPGDKEGPSHALRAFKRVAIAKGATNNVTIPLSK-ENFEWFDTSTNTMRPIEGDYEILYG 847
>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 172/456 (37%), Positives = 240/456 (52%), Gaps = 49/456 (10%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY + L ERA DLV R+TL EK M + + +PRLG+ Y+WW+EALHGV G
Sbjct: 25 PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------- 125
AT FP I ASFN L + VS EARA +
Sbjct: 84 ---------------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKR 128
Query: 126 -AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLT W+PNIN+ RDPRWGR ET GEDPY+ G+ + VRGLQ EG +Y
Sbjct: 129 YQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYD------- 181
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ AC KHYA + W +R F++ + +D+ ET++ F+ V + V VMC+Y
Sbjct: 182 -KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKNLVQKAHVKEVMCAY 237
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLND-TKEDAVARVLK 302
NR G P C +LL Q +R +W + +VSDC +I D K+ A A+ +
Sbjct: 238 NRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVL 297
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
+G D++CGD Y + AV++G I E ID SL+ L LG D Q + + +
Sbjct: 298 SGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIPYS 356
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H ELA AR+ +VLL+N+ LPLN N+K +A+VGP+AN + GNY G P
Sbjct: 357 VVDSKEHRELALRMARESLVLLQNNQSLLPLNK-NLK-VAVVGPNANDSVMQWGNYNGFP 414
Query: 421 CRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
+ ++G Y S++I Y PGC +D+ Q+
Sbjct: 415 SHTITLLEGIREYLPESQII-YEPGCDLTSDVTLQS 449
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 136/300 (45%), Gaps = 53/300 (17%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
+ +D K AD + G+ +VE E G DR + LP Q+ L+ ++ A
Sbjct: 590 LKQTVDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 649
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K +V ++ I + +IL YPG+ GG AIA+V+FG YNP GRLP+T
Sbjct: 650 GK---KIVFVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y++ +P + GRTY++ ++PFG+GLSYT F+Y AS S +I
Sbjct: 707 FYKST------KQLPDFEDYSMKGRTYRYMTENPLFPFGHGLSYTTFQYGNASLNTS-EI 759
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
K D +Q T I V N GK DG EVV VY +
Sbjct: 760 K-DGEQ------------------------------VTLTIPVSNTGKYDGEEVVQVYLR 788
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
PG + ++RV IA G + V ++ ++ + D + N++ G + IL G
Sbjct: 789 HPGDKEGPSHALRAFKRVAIAKGATNNVTIPLSK-ENFEWFDTSTNTMRPIEGDYEILYG 847
>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
Length = 814
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 235/813 (28%), Positives = 356/813 (43%), Gaps = 153/813 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P R + L+ +MTL EKV QM + LG P+YE
Sbjct: 49 YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 102
Query: 58 ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
W LH S R +N H +P
Sbjct: 103 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 162
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 163 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 217
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q E D S + A KH+A+Y
Sbjct: 218 RWSRVEETYGEDPYLNGAMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 266
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G + SVM SYN ++G P LL
Sbjct: 267 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 325
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD ++ + E ND +A + + AG+D D G + Y + A
Sbjct: 326 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 383
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A A ID ++R + + ++G FD + + + +H LA E ARQ IV
Sbjct: 384 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 443
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN + LPL +I+TLA++GP+A+ M+G+Y +GT + +
Sbjct: 444 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 502
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI+ A+NADA V+V G D S E
Sbjct: 503 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 561
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +++ K PV LV++ + + A +
Sbjct: 562 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 620
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
W YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 621 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 672
Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+ P YPFGYGLSYT F Y D+K + T G++
Sbjct: 673 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 706
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
D + + ++N G DG EV +Y + + T KQ+ + R+ + AG+S
Sbjct: 707 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 760
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+V FT++ KSL + ++ G TI+VG
Sbjct: 761 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 792
>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
Length = 614
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 189/586 (32%), Positives = 282/586 (48%), Gaps = 73/586 (12%)
Query: 48 VPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWK 107
V R+GLP Y+W A+HGV + + D V TSFP + ++N S +
Sbjct: 72 VSRIGLPEYDWGMNAIHGVQSSCIKDD-------DGTVYCPTSFPNPVNYGFTWNYSAYL 124
Query: 108 KIGQTVSTEARAMYNLG-----------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYV 156
++G+ + E RA++ G + GL WSPNIN+ R P WGR E PGEDP++
Sbjct: 125 ELGRIIGVETRALWLAGAVEASTWSGRPHIGLDTWSPNINIARSPLWGRNQEVPGEDPFM 184
Query: 157 VGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTE 216
G++ Y GLQ D L+ KH+ AY L++ +G R +F++ V+
Sbjct: 185 NGQFGKAYTLGLQG---------DDDTYLQAIVTLKHWDAYSLEDSDGATRHNFNAIVSN 235
Query: 217 QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDC 276
+ +T+ F + V EG VMCSYN VNGIPTCA P LL +R W F GY+ SD
Sbjct: 236 FSLMDTYWPAFRVAVTEGKAKGVMCSYNAVNGIPTCAHP-LLRTVLRDLWKFDGYVSSDT 294
Query: 277 DSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLR 336
+++ I ++HK+ A A + D+D G Y + V +G D+D +LR
Sbjct: 295 GAVEDISDNHKYTPSWATAACAAIRDGQTDIDSGAVYMKSLLQGVSEGHCRMEDVDNALR 354
Query: 337 FLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA---EAAR--------QGIVLLKND 385
+ LG FD H+ LAA A+R + +VLL+N
Sbjct: 355 NTLRLRFELGLFDPVENQSYW---------HVPLAAVNTNASRATNMLHTLESMVLLQNK 405
Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR------YTSPMDGFYAY--SKVI 437
N LPL N K +AL+GPHA A + M+GNY G C SP D + + +
Sbjct: 406 NNVLPL-ASNTK-VALIGPHAKAQEDMVGNYLGQLCPDNNFDCVVSPHDALVSILGTDAV 463
Query: 438 NYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI 497
YAPG C + S I A+ A AD V++ G+D S+EAE DR + LP Q +L
Sbjct: 464 TYAPGTNVTTC-SQSHIDEAVSVATAADVAVLMLGIDESIEAESNDRKSIDLPECQHQLA 522
Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
+ + K P +V+++ G + I K + +I+ GYPG GG AIA + G+
Sbjct: 523 SAIFAVGK-PTVIVLLNGGMLAIENEKQ--QADAIIEAGYPGFYGGTAIAQTLTGQNEHL 579
Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
G +Y+ + +M + + PGRTY+++ ++ F +
Sbjct: 580 G---------DYIN--WINMSDMEMTSGPGRTYRYYKNETLWAFHF 614
>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
CL07T00C01]
gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
CL07T12C05]
gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
CL07T00C01]
gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
CL07T12C05]
Length = 805
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 235/813 (28%), Positives = 356/813 (43%), Gaps = 153/813 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P R + L+ +MTL EKV QM + LG P+YE
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 93
Query: 58 ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
W LH S R +N H +P
Sbjct: 94 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q E D S + A KH+A+Y
Sbjct: 209 RWSRVEETYGEDPYLNGAMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G + SVM SYN ++G P LL
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD ++ + E ND +A + + AG+D D G + Y + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A A ID ++R + + ++G FD + + + +H LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 434
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN + LPL +I+TLA++GP+A+ M+G+Y +GT + +
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI+ A+NADA V+V G D S E
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 552
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +++ K PV LV++ + + A +
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
W YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663
Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+ P YPFGYGLSYT F Y D+K + T G++
Sbjct: 664 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
D + + ++N G DG EV +Y + + T KQ+ + R+ + AG+S
Sbjct: 698 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 751
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+V FT++ KSL + ++ G TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|312621303|ref|YP_004022916.1| glycoside hydrolase family 3 domain-containing protein
[Caldicellulosiruptor kronotskyensis 2002]
gi|312201770|gb|ADQ45097.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
kronotskyensis 2002]
Length = 770
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 211/709 (29%), Positives = 341/709 (48%), Gaps = 111/709 (15%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP I +F+ + +++ + + T+ +A+ +P I+V RD RWGRV
Sbjct: 102 GATVFPQSIGVACTFDNEIVEELAKVIKTQMKAV-----GAHQALAPLIDVARDARWGRV 156
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NW 202
ET GEDPY+V A++YV+G+Q D I A KH+ Y + NW
Sbjct: 157 EETFGEDPYLVANMAVSYVKGIQ----------GDDIKDGIVATGKHFVGYAMSEGGMNW 206
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
+ E++++E ++ PFE+ V + S+M +Y+ ++GIP A+ KLL
Sbjct: 207 A-------PVHIPERELREVYLYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTDIA 259
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGA 320
RG+W F G VSD ++ I++ HK + T +A L AGLD++ + +T + A
Sbjct: 260 RGEWGFDGIFVSDYAGVRNILDYHKAVK-TYAEAAYISLWAGLDIELPKIECFTEEFIKA 318
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC-NPQHIELAAEAARQGI 379
+++GK A +D +++ + + RLG FD +P K G + N + EL+ + A++ +
Sbjct: 319 LKEGKFDMAVVDAAVKRVLEMKFRLGLFD-NPYIKTEGILELFDNKEQRELSRKVAQESM 377
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS----- 434
VLLKNDN LPL + ++K +A++GP+A++ + ++G+Y P + + ++ F+
Sbjct: 378 VLLKNDN-FLPL-SNDVKKIAVIGPNADSVRNLLGDY-SYPA-HIATLEMFFIKEDKGVG 433
Query: 435 -------KVIN-------------------YAPGCADIVCQNNSMIPAAIDAAKNADATV 468
KVIN YA GC D+ Q+ S A AA+ AD +
Sbjct: 434 NEEEFVRKVINIKSILEAIKDRVQNKAEVVYAKGC-DVNNQDESGFEEAKKAAQGADVVI 492
Query: 469 IV----AGLDLS-VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
+V AGL L E +DR L LPG Q +LI +V+ + +V++ +
Sbjct: 493 LVVGDKAGLRLDCTSGESRDRASLKLPGVQEKLIEEVSKVNE---NIVVVLVNGRPVALE 549
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPV 582
K K+IL +PGEEG A+ADV+FG YNPGG+L I++ + V + Y P
Sbjct: 550 GIWQKAKAILEAWFPGEEGAEAVADVLFGDYNPGGKLAISFPRDVGQVPVYYGHKPSGGK 609
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
+ + G + P + PFGYGLSYT F+YK N+ + K
Sbjct: 610 SCWHGDYVEMSTKPFL-PFGYGLSYTTFEYK---------------------NFAIEKEK 647
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
D +EVEN GK G E+V +Y++ T +K++ Y+RV
Sbjct: 648 ISM-----------DESIKISVEVENTGKYAGDEIVQLYTRKEEFLVTRPVKELKAYKRV 696
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
+ G+ KV F + D N +++ G ++VG + F
Sbjct: 697 HLKPGEKKKVVFEIFP-DQFAYYDYDMNRVISPGTVEVMVGASSEDIKF 744
>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
615]
gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
615]
Length = 805
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 235/813 (28%), Positives = 356/813 (43%), Gaps = 153/813 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P R + L+ +MTL EKV QM + LG P+YE
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 93
Query: 58 ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
W LH S R +N H +P
Sbjct: 94 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q E D S + A KH+A+Y
Sbjct: 209 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G + SVM SYN ++G P LL
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD ++ + E ND +A + + AG+D D G + Y + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A A ID ++R + + ++G FD + + + +H LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAAQLVASSEHTGLAREVARQSIV 434
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN + LPL +I+TLA++GP+A+ M+G+Y +GT + +
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI+ A+NADA V+V G D S E
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 552
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +++ K PV LV++ + + A +
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
W YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663
Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+ P YPFGYGLSYT F Y D+K + T G++
Sbjct: 664 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
D + + ++N G DG EV +Y + + T KQ+ + R+ + AG+S
Sbjct: 698 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 751
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+V FT++ KSL + ++ G TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGLFTIMVG 783
>gi|374312362|ref|YP_005058792.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358754372|gb|AEU37762.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 874
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 170/482 (35%), Positives = 246/482 (51%), Gaps = 53/482 (10%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
R +L+ +MT+ E++ Q+ D A + RLGLP Y WW+E LHG++ G
Sbjct: 38 RIDELIAKMTVSERIAQLQDRAPAIERLGLPSYNWWNEGLHGLARDGY------------ 85
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA-MYNLGN------AGLTFWSPNIN 136
AT FP I A+++ L ++G VSTEARA Y+ G GLT WSPNIN
Sbjct: 86 ----ATVFPQAIGLAATWDAPLLHEVGDVVSTEARAKFYSHGGENTPRFGGLTVWSPNIN 141
Query: 137 VVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAA 196
+ RDPRWGR ET GEDP++ +V G+Q +D LK A KH+AA
Sbjct: 142 IFRDPRWGRGQETYGEDPFLTATLGTQFVEGVQG---------NDPFYLKADATPKHFAA 192
Query: 197 YDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPK 256
+ EG D F ++ V+ D+ +T++ F +++MCSYN ++G P+CA
Sbjct: 193 HSGPE-EGRDSF--NAVVSPHDLADTYLPAFHALTTNAHAAALMCSYNEIDGTPSCASGN 249
Query: 257 LLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNF 316
L +R W F GY+VSDCD++ I H F D A A L AG+DLDCG+ Y
Sbjct: 250 NLQDLVRERWGFKGYVVSDCDAVGNIAGYHHFATDNAHGA-ADALNAGVDLDCGNTYAAL 308
Query: 317 TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNNICNPQHIELAAEA 374
+ ++ Q EA ++ +L L + +RLG D Y+++G + +P H LA A
Sbjct: 309 SK-SLDQNLTTEAKLNQALHRLLLARVRLGMLDPLSCSPYRDIGAEELDSPAHHTLALRA 367
Query: 375 ARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS 434
A + IVLLKND G LPL K ++++GP A+ K + NY GT +P+DGF +
Sbjct: 368 AEESIVLLKND-GVLPLQASTQK-VSVIGPTADMVKVLEANYHGTALHPITPLDGFRSRF 425
Query: 435 KVINYAPGCADIVCQNNSMIPAAIDA--AKNADATVIVAGLDLSVEAEGKDRVDLL-LPG 491
++YA G S++ + A +NA G ++AE D+ L P
Sbjct: 426 HDVSYAQG---------SLLAEGVSAPVPRNALRVAAAPGSSAGLQAEYFDKASLEGTPA 476
Query: 492 FQ 493
FQ
Sbjct: 477 FQ 478
Score = 123 bits (309), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 88/304 (28%), Positives = 140/304 (46%), Gaps = 58/304 (19%)
Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVA 501
+++ A+ A +D V GL +E E G DR L LP Q L++++
Sbjct: 593 ALLDQAVQTAAKSDVIVAFVGLSPDLEGEALQLRLKGFNGGDRTSLDLPEAQRTLLSRLT 652
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIK---SILWVGYPGEEGGRAIADVIFGKYNPGG 558
K PV +V+ S V + P+ K +L YPGE GG A+A ++ G NP G
Sbjct: 653 QLHK-PVIIVLTSGSGVALG-----PEAKDAAGVLEAWYPGEAGGEALAGILAGNVNPSG 706
Query: 559 RLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP 618
RLP+T+Y + +P + RTY++FDGPV++PFGYGLSY+ F+Y
Sbjct: 707 RLPVTFYRS------VDDLPAFTDYSMAHRTYRYFDGPVLFPFGYGLSYSHFQYG----- 755
Query: 619 KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVV 678
Q R + + T++P A V V N + +G+EV
Sbjct: 756 -----------QLRLSTHMLKTSEPLVAMV----------------TVHNESQREGTEVA 788
Query: 679 MVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHT 738
+Y +PP +G + G +RV + G++ ++ F + A L VD + + +G +
Sbjct: 789 ELYLQPPQASGAPRLTLQGVQRVALRPGETRELTFKL-APGQLSTVDTSGARTVRAGEYK 847
Query: 739 ILVG 742
+ VG
Sbjct: 848 LFVG 851
>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
Length = 814
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 236/813 (29%), Positives = 358/813 (44%), Gaps = 153/813 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P R + L+ +MTL EKV QM + LG P+YE
Sbjct: 49 YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 102
Query: 58 ----------------WWSEALH-GV--SFIGRRTNSPPG---THFDSEVP--------- 86
W LH G+ S R +N H +P
Sbjct: 103 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 162
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 163 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 217
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q E D S + A KH+A+Y
Sbjct: 218 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 266
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G + SVM SYN ++G P LL
Sbjct: 267 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 325
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD ++ + E ND +A + + AG+D D G + Y + A
Sbjct: 326 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 383
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A A ID ++R + + ++G FD + + + +H LA E ARQ IV
Sbjct: 384 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 443
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN + LPL +I+TLA++GP+A+ M+G+Y +GT + +
Sbjct: 444 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 502
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI+ A+NADA V+V G D S E
Sbjct: 503 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 561
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +++ K PV LV++ + + A +
Sbjct: 562 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 620
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
W YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 621 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 672
Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+ P YPFGYGLSYT F Y D+K + T G++
Sbjct: 673 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 706
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
D + + ++N G DG EV +Y + + T KQ+ + R+ + AG+S
Sbjct: 707 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 760
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+V FT++ KSL + ++ G TI+VG
Sbjct: 761 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 792
>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
17393]
gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 162/451 (35%), Positives = 245/451 (54%), Gaps = 43/451 (9%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY + L ERA DL++RMTL EK+ QM + + + RLG+P Y+WW+EALHGV+ G+
Sbjct: 24 PYRNPNLSPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYDWWNEALHGVARAGK- 82
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
AT FP I A+F+ + VS EARA Y+ G
Sbjct: 83 ---------------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERGG 127
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ + V+GLQ +Y
Sbjct: 128 YKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGNGAGKYD------- 180
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K AC KHYA + W +R FDS+ ++++D+ ET++ F+ V EG V VMC+Y
Sbjct: 181 -KAHACAKHYAVHSGPEW---NRHSFDSKNISQRDLWETYLPAFKTLVTEGKVKEVMCAY 236
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTKEDAVARVLK 302
NR G P C++ +LL + +R DW + +VSDC +I +H + + E A A +
Sbjct: 237 NRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPSAEAASADAVV 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
+G DL+CG Y++ AV++G I E I+ S+ L +LG FD + + +
Sbjct: 297 SGTDLECGGSYSSLNE-AVKKGLITEDKINESVFRLLRARFQLGMFDDDTLVSWSEIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H++ A E AR+ +VLL N N +LPL+ +I+ +A++GP+AN + + NY G P
Sbjct: 356 VVESKEHVDKALEMARKSMVLLTNKNNSLPLSK-SIRKVAVLGPNANDSVMLWANYNGFP 414
Query: 421 CRYTSPMDGFYAY--SKVINYAPGCADIVCQ 449
+ + ++G + + Y GC + Q
Sbjct: 415 TKSVTILEGIRSKLPEGAVYYEKGCDFVSTQ 445
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 91/294 (30%), Positives = 138/294 (46%), Gaps = 53/294 (18%)
Query: 468 VIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA 517
+ V GL ++E E DR ++ LP Q E++ + K PV V+ S
Sbjct: 605 IFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQEEMLKALKKTGK-PVIFVVCSGST 663
Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
+ + + N + ++L YPG++GG A+ADV+FG YNP GRLP+T+Y ++ + +
Sbjct: 664 LALPWEAEN--LDAMLEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASD------SDL 715
Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
P N RTY++F G ++PFGYGLSYT F Y A K+DK
Sbjct: 716 PDFEDYNMSNRTYRYFKGKPLFPFGYGLSYTTFDYGKA--------KVDKKS-------- 759
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIG 697
+K D T I ++N GKMDG EVV VY + P IK +
Sbjct: 760 ---------------IKTGD-SMTLTIPLKNTGKMDGDEVVQVYLRNPADKEGPIKMLRA 803
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVGEGVGGVSF 750
+ RV + AGQ+ + + A + + + A N + + G + +L G G S
Sbjct: 804 FRRVSLKAGQAENIQIELPAS-TFECFNPATNRMEILPGNYELLYGGTSDGKSL 856
>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 871
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 159/457 (34%), Positives = 241/457 (52%), Gaps = 52/457 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
++ + SD+PY + L + R DLV+RMTL EKV QM + + +PRL +P Y+WW+E L
Sbjct: 18 AVIAQTSDYPYQNYHLDFTTRVNDLVKRMTLEEKVSQMLNSSPAIPRLKIPAYDWWNEVL 77
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV+ T F T +P I A+F+ ++ + E RA++N
Sbjct: 78 HGVA----------RTPFK-----VTVYPQAIAMAATFDRQSLNQMADYAALEGRAVHNK 122
Query: 124 G---------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
GLT+W+PNIN+ RDPRWGR ET GEDP++ G +V GLQ
Sbjct: 123 ALQMRKPGEKYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGAMGSAFVSGLQ----- 177
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVN 232
+D + LK +AC KHYA + G + R F++ ++ D+ +T++ F+ V
Sbjct: 178 ----GNDPKYLKAAACAKHYAVH-----SGPEPLRHVFNADISTYDLWDTYLPAFKKLVV 228
Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDT 292
+ V+ VMC+YN P C L+ +R W F GY+ SDC I ++HK + T
Sbjct: 229 DDKVAGVMCAYNAFKTQPCCGSDLLMVDILRNQWKFSGYVTSDCGGIDDFFKNHK-THAT 287
Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
EDA + G D++CG + AV++GKI+E ID S++ L+++ RLG FD S
Sbjct: 288 AEDASTDAVLHGTDIECGTDAYKSLVAAVKEGKISETQIDISVKRLFMIRFRLGMFDPSD 347
Query: 353 --QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
+Y + + +P+H A + ARQ +VLLKN N LPL+ I+ + ++GP+A+
Sbjct: 348 VVKYAQTPVSVLESPEHQAHALKMARQSVVLLKNANHTLPLSK-TIRKIVVLGPNADNPI 406
Query: 411 AMIGNYEGTPCRYTSPMDGF--------YAYSKVINY 439
A++GNY GTP T+ G Y K +N+
Sbjct: 407 AILGNYNGTPSNLTTVYQGIRQKLPQAEVVYEKAVNF 443
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/274 (31%), Positives = 128/274 (46%), Gaps = 53/274 (19%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
+ A + +ADA V V G+ +E E G DR + LP QT L+ K A
Sbjct: 593 VAALVKRVADADAIVYVGGISPQLEGEEMQVNYPGFNGGDRTSIQLPAAQTNLM-KTLQA 651
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
PV V+M+ A+ + N I +I+ Y G+ G A+ADV+FG YNP GRLP+T
Sbjct: 652 TGKPVVFVMMTGSALATPWEAEN--IPAIVNAWYGGQAAGTAVADVLFGDYNPAGRLPVT 709
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y+++ T +P + RTY++F G +Y FGYGLSYTQFKY P +V
Sbjct: 710 FYKSD------TDLPDFTDYSMTNRTYRYFKGIPLYGFGYGLSYTQFKYDKLIVPATV-- 761
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
+ + I+ +V V N G++ G EVV +Y K
Sbjct: 762 -----KSGKAIHLSV--------------------------TVTNSGQIAGDEVVQIYMK 790
Query: 684 PPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMN 716
+K + G+ RV++ AG+ + F ++
Sbjct: 791 HHSQRIKVPLKALKGFARVYLKAGERRTLNFILS 824
>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
17393]
gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 863
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 158/427 (37%), Positives = 226/427 (52%), Gaps = 38/427 (8%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+ + +LP ER DLV R+TL EK+ QM + A + RLG+P Y WW+E LHGV+ R+
Sbjct: 26 FLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIPAYNWWNECLHGVA----RS 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
P TSFP I A+++ ++ S E RA+Y+
Sbjct: 82 PYP-----------VTSFPQAIAMAATWDTESVHQMAVYASDEGRAIYHDATRKGTPGIF 130
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT+WSPNIN+ RDPRWGR ET GEDP++ +++V+GLQ D L
Sbjct: 131 RGLTYWSPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQG---------DDPVYL 181
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K SAC KHYA + W +R +D++V D+ +T++ F+ V EG V+ VMC+YN
Sbjct: 182 KSSACAKHYAVHSGPEW---NRHTYDAKVNNHDLWDTYLPAFKELVVEGKVTGVMCAYNS 238
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G P C + L+ +R W F GY+ SDC +++ +HK D + VL G
Sbjct: 239 FFGQPCCGNDLLMMDILRNHWKFGGYVTSDCGAVEDFYNTHKTHQDAAAASADAVLH-GT 297
Query: 306 DLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNIC 363
D +CG+ AV +G I E ID SL+ L+ + RLG FD + Y N+ + +
Sbjct: 298 DCECGNGAYRALADAVLRGLITEKQIDESLKKLFEIRFRLGMFDPDDRVPYSNIPLSVLE 357
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY 423
H A + ARQ IVLLKN + LPLN IK +A+VGP+A+ ++ NY G P
Sbjct: 358 CDAHKAHALKIARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADDKSVLLANYYGYPSHI 417
Query: 424 TSPMDGF 430
T+ ++G
Sbjct: 418 TTALEGI 424
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 93/295 (31%), Positives = 141/295 (47%), Gaps = 54/295 (18%)
Query: 460 AAKNADATVIVAGL-------DLSVEAEG---KDRVDLLLPGFQTELINKVADAAKGPVT 509
A K+AD + V GL ++ VE EG DR + +P Q L+ ++ K PV
Sbjct: 595 AVKDADVIIFVGGLSAKVEGEEMGVEIEGFKRGDRTSISIPSVQQNLLKELYATGK-PVV 653
Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
V+M+ A+ + + + + +IL Y G+ GG+AIADV+FG YNP GRLP+T+Y++
Sbjct: 654 FVMMTGSALGLEW--ESAHLPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKS-- 709
Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
+P + RTY++F G VYPFGYGLSYT F+Y +KL
Sbjct: 710 ----VNDLPDFEDYSMENRTYRYFTGTPVYPFGYGLSYTTFQYS--------SLKLQPSP 757
Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
R + T ++ N GKM+G EV +Y P
Sbjct: 758 DKRSVKVTA--------------------------KITNTGKMEGEEVAQLYVSNPRDFV 791
Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
T I+ + G++R+ + G+S V F + + K L +VD + S+ G I +G G
Sbjct: 792 TPIRALKGFKRINLKPGESQTVEFVLTS-KELSVVDISGKSVPMKGKVQISLGGG 845
>gi|402307522|ref|ZP_10826545.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400378572|gb|EJP31427.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 858
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 170/478 (35%), Positives = 247/478 (51%), Gaps = 42/478 (8%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S+ PYC+ L ERA+DL+ R+TL EK + M D + +PRLG+ + WWSEAL
Sbjct: 14 SLSATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEAL 73
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
HG + +G G T FP + ASFN+ L +++ S E RA YN
Sbjct: 74 HGAANMG----------------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNR 117
Query: 123 -LGNAG-------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
+ N G L+ W+PN+N+ RDPRWGR ET GEDPY+ VRGLQ E
Sbjct: 118 RMLNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETA 177
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
+Y K+ AC KHYA + + + D V+ +D+ ET++ F+ V E
Sbjct: 178 KYR--------KLWACAKHYAVHSGPEYTRHTANVAD--VSPRDLWETYLPAFKTLVTEA 227
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
V VMC+Y R++ P C++ +LL Q +R +W F+ +VSDC ++ I +HK +D
Sbjct: 228 KVREVMCAYQRLDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVH 287
Query: 295 DAVARVLKAGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP- 352
A AG D++CG Y T+ AV++G I EA++D + L LG D
Sbjct: 288 AAAK-AAVAGTDVECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMDDPKL 346
Query: 353 -QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
++ + + + + H +LA + ARQ +VLL+N G LPL G + +A++GP+A+
Sbjct: 347 VEWSKIPASVMDSKAHRQLALDMARQSLVLLQNKGGVLPLKAGG-EPIAVIGPNADDGPM 405
Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGC--ADIVCQNNSMIPAAIDAAKNADAT 467
M GNY GTP R + +DG A K + Y GC D N+ + AID K T
Sbjct: 406 MWGNYNGTPNRTVTILDGIKARHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRKGLRGT 463
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 121/286 (42%), Gaps = 57/286 (19%)
Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
A I + V V G+ ++E E G DR ++ LP Q + + + +A K
Sbjct: 591 AIIRKLQGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK 650
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
T+V ++ I +IL Y G+EGG A++DV+FG NP G+LP+T+Y
Sbjct: 651 ---TVVFVNCSGSAIALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFY 707
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ Y +R GRTY++F P ++ FGYGLSYT F++ A +
Sbjct: 708 KRTDQLPDYEDYSMR------GRTYRYFSDP-LFAFGYGLSYTTFRFGRAHA-------- 752
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+ + + + + N G G EVV VY +
Sbjct: 753 ----------------------------EAAEGGYRLSVPLTNTGTRPGEEVVQVYIRRV 784
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
+K + + RV + AG+S V ++ KS + D + N++
Sbjct: 785 ADTNGPLKSLRAFRRVALKAGESTTVEIPLSR-KSFECFDESTNTM 829
>gi|383113360|ref|ZP_09934132.1| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
gi|382948727|gb|EFS32364.2| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
Length = 954
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 231/764 (30%), Positives = 362/764 (47%), Gaps = 119/764 (15%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
K K++D Y DA LP ER + L+ MT PE ++ +G+P G+P LY E
Sbjct: 162 KGKVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVE 218
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
A+HG S+ G+ GAT FP + A++N L +++ + E A
Sbjct: 219 AVHGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA- 261
Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
N A WSP ++V +D RWGR ET GEDP +V + +++G Q
Sbjct: 262 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------ 305
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
SR L + KH+ + R D ++E++M+E ++PF + D S+M
Sbjct: 306 SRGLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHAIRNYDCQSLMM 360
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+Y+ GIP +LL Q +R +W F+G+IVSDC +I + + K +A + L
Sbjct: 361 AYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQAL 420
Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
AG+ +CGD Y N + A + G+I ++D R + + R F+ +P K L
Sbjct: 421 AAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWK 479
Query: 361 NIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
I + H E+A +AAR+ IV+L+N LPL T N++T+A++GP A+ + G+Y
Sbjct: 480 KIYPGWNSDSHKEMARQAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDY 536
Query: 417 --EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
+ P + S + G +KV+ Y GC D + + IP A+ AA +D V+V
Sbjct: 537 TPKLLPGQLKSVLTGIKEAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVVMV 594
Query: 471 AGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
G + EA E D L+LPG Q EL+ V K PV L++ + DI
Sbjct: 595 LGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI- 652
Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
K + K+IL PG+EGG A+ADV+FG YNPGGRLP+T+ +PL
Sbjct: 653 -LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMTFPRH------VGQLPLYY 705
Query: 582 VNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
GR Y++ D +Y FG+GLSYT F+Y D+K+ +
Sbjct: 706 NFKTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------DLKIQE------------ 745
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGY 698
KP + T Q V+N+G G EV +Y + T + ++ +
Sbjct: 746 --KP-------------NGNVTVQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 790
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+R+++ G+S V F + + ++++ + ++ G I VG
Sbjct: 791 DRIYLQPGESKTVSFELTPY-DISLLNDHMDRVVEKGEFKICVG 833
>gi|299149395|ref|ZP_07042452.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512582|gb|EFI36474.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 950
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 231/764 (30%), Positives = 362/764 (47%), Gaps = 119/764 (15%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
K K++D Y DA LP ER + L+ MT PE ++ +G+P G+P LY E
Sbjct: 158 KGKVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVE 214
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
A+HG S+ G+ GAT FP + A++N L +++ + E A
Sbjct: 215 AVHGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA- 257
Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
N A WSP ++V +D RWGR ET GEDP +V + +++G Q
Sbjct: 258 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------ 301
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
SR L + KH+ + R D ++E++M+E ++PF + D S+M
Sbjct: 302 SRGLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHAIRNYDCQSLMM 356
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+Y+ GIP +LL Q +R +W F+G+IVSDC +I + + K +A + L
Sbjct: 357 AYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQAL 416
Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
AG+ +CGD Y N + A + G+I ++D R + + R F+ +P K L
Sbjct: 417 AAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWK 475
Query: 361 NIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
I + H E+A +AAR+ IV+L+N LPL T N++T+A++GP A+ + G+Y
Sbjct: 476 KIYPGWNSDSHKEMARQAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDY 532
Query: 417 --EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
+ P + S + G +KV+ Y GC D + + IP A+ AA +D V+V
Sbjct: 533 TPKLLPGQLKSVLTGIKEAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVVMV 590
Query: 471 AGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
G + EA E D L+LPG Q EL+ V K PV L++ + DI
Sbjct: 591 LGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI- 648
Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
K + K+IL PG+EGG A+ADV+FG YNPGGRLP+T+ +PL
Sbjct: 649 -LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMTFPRH------VGQLPLYY 701
Query: 582 VNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
GR Y++ D +Y FG+GLSYT F+Y D+K+ +
Sbjct: 702 NFKTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------DLKIQE------------ 741
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGY 698
KP + T Q V+N+G G EV +Y + T + ++ +
Sbjct: 742 --KP-------------NGNVTVQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 786
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+R+++ G+S V F + + ++++ + ++ G I VG
Sbjct: 787 DRIYLQPGESKTVSFELTPY-DISLLNDHMDRVVEKGEFKICVG 829
>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
CL05T00C42]
gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
CL05T12C13]
gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
CL05T00C42]
gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
CL05T12C13]
Length = 805
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 235/813 (28%), Positives = 356/813 (43%), Gaps = 153/813 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P R + L+ +MTL EKV QM + LG P+YE
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 93
Query: 58 ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
W LH S R +N H +P
Sbjct: 94 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q E D S + A KH+A+Y
Sbjct: 209 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G + SVM SYN ++G P LL
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD ++ + E ND +A + + AG+D D G + Y + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A A ID ++R + + ++G FD + + + +H LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 434
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN + LPL +I+TLA++GP+A+ M+G+Y +GT + +
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI+ A+NADA V+V G D S E
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 552
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +++ K PV LV++ + + A +
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
W YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663
Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+ P YPFGYGLSYT F Y D+K + T G++
Sbjct: 664 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
D + + ++N G DG EV +Y + + T KQ+ + R+ + AG+S
Sbjct: 698 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 751
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+V FT++ KSL + ++ G TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|336417087|ref|ZP_08597416.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
3_8_47FAA]
gi|335936712|gb|EGM98630.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
3_8_47FAA]
Length = 954
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 231/764 (30%), Positives = 362/764 (47%), Gaps = 119/764 (15%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
K K++D Y DA LP ER + L+ MT PE ++ +G+P G+P LY E
Sbjct: 162 KGKVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVE 218
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
A+HG S+ G+ GAT FP + A++N L +++ + E A
Sbjct: 219 AVHGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA- 261
Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
N A WSP ++V +D RWGR ET GEDP +V + +++G Q
Sbjct: 262 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------ 305
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
SR L + KH+ + R D ++E++M+E ++PF + D S+M
Sbjct: 306 SRGLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHAIRNYDCQSLMM 360
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+Y+ GIP +LL Q +R +W F+G+IVSDC +I + + K +A + L
Sbjct: 361 AYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQAL 420
Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
AG+ +CGD Y N + A + G+I ++D R + + R F+ +P K L
Sbjct: 421 AAGIATNCGDTYNNKEVIQAAKDGRIDMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWK 479
Query: 361 NIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
I + H E+A +AAR+ IV+L+N LPL T N++T+A++GP A+ + G+Y
Sbjct: 480 KIYPGWNSDSHKEMARQAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDY 536
Query: 417 --EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
+ P + S + G +KV+ Y GC D + + IP A+ AA +D V+V
Sbjct: 537 TPKLLPGQLKSVLTGIKEAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVVMV 594
Query: 471 AGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
G + EA E D L+LPG Q EL+ V K PV L++ + DI
Sbjct: 595 LGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI- 652
Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
K + K+IL PG+EGG A+ADV+FG YNPGGRLP+T+ +PL
Sbjct: 653 -LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMTFPRH------VGQLPLYY 705
Query: 582 VNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
GR Y++ D +Y FG+GLSYT F+Y D+K+ +
Sbjct: 706 NFKTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------DLKIQE------------ 745
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGY 698
KP + T Q V+N+G G EV +Y + T + ++ +
Sbjct: 746 --KP-------------NGNVTVQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 790
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+R+++ G+S V F + + ++++ + ++ G I VG
Sbjct: 791 DRIYLQPGESKTVSFELTPY-DISLLNDHMDRVVEKGEFKICVG 833
>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 782
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 235/813 (28%), Positives = 375/813 (46%), Gaps = 144/813 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQM-------------GDLA--------------- 45
Y D+ P PER + L+ MTL EK Q+ G++
Sbjct: 20 YKDSSKPIPERVEHLLGLMTLEEKAGQLVQPFGWQTYEHKDGEIKLTEAFKAQVKNGGVG 79
Query: 46 --YGV----PRLGLPLYEWWS--EALHGVSFIGRRT--NSPPG---------THFDSEVP 86
YGV P G+ L S E V+ I R NS G +H +
Sbjct: 80 SLYGVLRADPWTGVTLETGLSPREGTEAVNAIQRYAIENSRLGIPILIGEECSHGHMAI- 138
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP + +++N L++++ + V+ E RA G +SP ++VVRDPRWGR
Sbjct: 139 GATVFPVPLSLGSTWNVELYREMCRAVARETRA-----QGGAVTYSPVLDVVRDPRWGRT 193
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
E GED Y++ A+ V GLQ E DS ++A KH+ Y + EG
Sbjct: 194 EECFGEDAYLISEMAVASVEGLQG----ESLDGEDS----VAATLKHFVGYG--SSEGG- 242
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
R + +++ E +LPF V G +S+M +YN ++G+P + +LL+ +RG+W
Sbjct: 243 RNAGPVHMGRRELLEVDLLPFRKAVEAG-AASIMPAYNEIDGVPCTTNEELLDGVLRGEW 301
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGK 325
F G +++DC +I + H D + DA + ++AG+D++ G + + AV+ G+
Sbjct: 302 GFDGMVITDCGAIDMLASGHDVAEDGR-DAAIQAIRAGIDMEMSGVMFGKHLVEAVRSGQ 360
Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKND 385
+ E +D ++R + + RLG F+ + I + +H+ELA + A +G+VLLKN
Sbjct: 361 LEEEVLDRAVRRVLTLKFRLGLFERPYADPERAERVIGSAEHVELARQLASEGVVLLKNK 420
Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR--YTSPMDGFYA----YSKVINY 439
+G LPL + + T+A++GP+A+A +G+Y R T+ + G + + + Y
Sbjct: 421 DGVLPL-SADAGTIAVIGPNADAGYNQLGDYTSPQPRSKVTTVLGGIRSKLAETPERVLY 479
Query: 440 APGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA--------- 479
APGC I + A+ A+ AD V+V G +DL A
Sbjct: 480 APGCR-INGNSREGFDVALSCAEKADTVVMVVGGSSARDFGEGTIDLRTGASKVTDNAES 538
Query: 480 -----EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
EG DR++L L G Q ELI ++ K P+ +V ++ + + + +IL
Sbjct: 539 DMDCGEGIDRMNLSLSGVQLELIQEIHKLGK-PLVVVYINGRPIAEPWIDEHA--DAILE 595
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
YPG+EGG AIAD++FG NP GRL I+ V + Y R G+ Y
Sbjct: 596 AWYPGQEGGHAIADILFGDVNPSGRLTISIPKHVGQVPVYYHGKRSR------GKRYLEG 649
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
D YPFGYGLSYT+F Y ++KL+ D IN
Sbjct: 650 DSQPRYPFGYGLSYTEFTYN--------NLKLESDT----IN------------------ 679
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVG 712
KD +EV N+G+ G+EV+ +Y T K++ G+ ++F+ G++ V
Sbjct: 680 --KDGSTKVTVEVTNVGERAGAEVIQLYITDVASKVTRPAKELKGFRKIFLQPGETQTVE 737
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
FT+ + L+ + ++ G + VG+ V
Sbjct: 738 FTVGP-EQLQYIGQNYKPVVEPGEFRVHVGKNV 769
>gi|410634080|ref|ZP_11344720.1| beta-glucosidase [Glaciecola arctica BSs20135]
gi|410146740|dbj|GAC21587.1| beta-glucosidase [Glaciecola arctica BSs20135]
Length = 772
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 201/606 (33%), Positives = 312/606 (51%), Gaps = 72/606 (11%)
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
++P ++V RDPRWGR+ E GED Y+ A V+G Q D S+P I A
Sbjct: 176 FAPMVDVARDPRWGRISEGSGEDVYLTTAIARARVQGFQG--------DDLSQPHTILAT 227
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
KH+AAY G D D ++++++++T++ PF+ V+ G V+S M S+N +NG+P
Sbjct: 228 AKHFAAYG-QGQAGRDYHTTD--MSDRELRDTYLPPFKAAVDAG-VTSFMTSFNELNGVP 283
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC- 309
A+ LL +R +W+F G++V+D SI +V+ H F D + A +KAG+D+D
Sbjct: 284 ASANKYLLTDILRDEWSFEGFVVTDYTSINEMVK-HGFARDN-DHAGELAVKAGVDMDMQ 341
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQH 367
G Y ++ V QGK++ ID + R + + RLG F+ +Y N + I +
Sbjct: 342 GSVYFDYLANQVTQGKVSPQQIDNAARRILEMKYRLGLFEDPYRYSNEEREAQEIYKEYN 401
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSP- 426
++ A + AR+ +VLLKN+N LPL+ ++ T+A++GP A++ + +IG++ RY P
Sbjct: 402 LQAAQDVARKSMVLLKNENQQLPLSKSDL-TIAVIGPLADSKEDLIGSWSAAGDRYEKPI 460
Query: 427 --MDGFYAY----SKVINYAPGCA-DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
+ G A SKV+ YA G + + Q+NS AAI AK AD V+ G +
Sbjct: 461 TLLTGIKAKVADPSKVL-YAKGASYEFSHQDNSGFEAAIAIAKKADVIVLAMGEKWDMTG 519
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E R L PG Q L+ ++ AK P+ LV+M+ + I +A N + +IL YPG
Sbjct: 520 EATSRTSLDFPGNQLALMQQLKKLAK-PMVLVLMNGRPMTIEWADQN--VDAILEAWYPG 576
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY------TSMPLRPVNNFPGRTYKFF 593
GG AIADV+FG YNP G+LP+T + N +IP T P N ++
Sbjct: 577 TMGGPAIADVLFGDYNPSGKLPVT-FPRNVGQIPLYYNMKNTGRPYSKDNAEQKYVSRYI 635
Query: 594 DG--PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
D +Y FG+GLSYT F Y S K+V
Sbjct: 636 DSLNTPLYHFGHGLSYTTFDYSKISLNKAV------------------------------ 665
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAK 710
+ K+ K T I+V N G DG EVV +Y + G +KQ+ G++++F+ G++
Sbjct: 666 -ITAKE-KLTASIDVTNSGNYDGEEVVQLYIRDRIGSVTRPVKQLKGFKKIFLHKGETKT 723
Query: 711 VGFTMN 716
V F+++
Sbjct: 724 VSFSIS 729
>gi|409197445|ref|ZP_11226108.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 737
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 211/727 (29%), Positives = 349/727 (48%), Gaps = 99/727 (13%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGL---PLYEWWSEALHGV 66
+ +P+ +A L R DL+ RMTL EKV + VPRLG+ P E HGV
Sbjct: 38 TSYPFQNADLDMETRVDDLLSRMTLEEKVSALSTDP-SVPRLGIKGAPHIE----GYHGV 92
Query: 67 SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN---L 123
+ G +P G D VP T FP A++N L +K G+ S EAR ++ +
Sbjct: 93 AMGGPANWAPKG---DERVP-TTQFPQAYGMGATWNPELIRKAGEIESIEARYIFQNPEI 148
Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GL +PN ++ RDPRWGR E GEDP++VG + + +GLQ D +
Sbjct: 149 SKGGLVVRAPNADLGRDPRWGRTEEVLGEDPFLVGTLSTAFTKGLQ---------GDDEK 199
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
+ ++ KH+ A +N + +FD+++ E + F + EG ++ M +Y
Sbjct: 200 YWRTASLLKHFLANSNENTRDSSSSNFDTQL----FYEYYGATFRRAILEGGSNAYMTAY 255
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N VNG+P P + + W +G I +D +V +HK +D A V+KA
Sbjct: 256 NAVNGVPAHIHP-MHKEISMARWGVNGIICTDGGGYTLLVRAHKAYDDYYR-AAEGVIKA 313
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGK 359
GL+ D Y GA+ G +AE D+D L+ +Y V+++LG D PQ Y ++G+
Sbjct: 314 GLN-QFLDNYREGVWGALAHGYLAEEDLDEVLKGVYRVMIKLGQLD--PQDKVPYASIGR 370
Query: 360 NN----ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
+ +P+H E A + AR+ +VLLKN+ LPL + +A++G A+ ++
Sbjct: 371 DGKPAPWTSPEHQEAALQMARESVVLLKNEKQTLPLAGDELGKVAVIGHLADTI--LLDW 428
Query: 416 YEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
Y G P ++P+DG I G ++ ++ AA++AA AD ++V G
Sbjct: 429 YSGMPPFMSTPLDG-------IKEKMGADKVLFAPDNDYNAAVEAASQADVAIVVLGNHP 481
Query: 476 SVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
++E G++ VD E + + A LV+ S+ IN+++
Sbjct: 482 YCDSERWGDCPDPGMGREAVDRKTLRLTDEWLAQRVFEANPNTILVLQSSFPYGINWSQE 541
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
N + +I+ + + G+ G A+ADV+FG YNPGG+L TW ++ +R
Sbjct: 542 N--LPAIVHITHNGQSTGTALADVLFGDYNPGGKLTQTWPKSEEQLPDMMEYDIR----- 594
Query: 586 PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPC 645
G TY +F+G +YPFG+GLSYT F++ VD+++ G++
Sbjct: 595 KGHTYMYFNGEPLYPFGFGLSYTSFEW--------VDMEI------------TGSS---- 630
Query: 646 AAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERVFIA 704
VK + + ++++N+G++ G EV+ +Y+ P + K + G++RV +
Sbjct: 631 -------VKSNEEEVIVTVKLKNVGQVKGDEVIQLYASFPETSSRRPDKALKGFKRVTLE 683
Query: 705 AGQSAKV 711
G+S V
Sbjct: 684 PGESKNV 690
>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
Length = 769
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 216/697 (30%), Positives = 340/697 (48%), Gaps = 110/697 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P+ + E LHG++ ATS+P I A+FN L +KI
Sbjct: 110 RLGIPVI-FHEECLHGLA-----------------AKDATSYPVPIGLAATFNPELIEKI 151
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
++ +AR+ +P ++VVRDPRWGRV ET GED Y+V + I V+GLQ
Sbjct: 152 FSAIAEDARS-----RGAHQALTPVVDVVRDPRWGRVEETFGEDTYLVSQMGIASVKGLQ 206
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ + K+ A KH+AA+ N + +E+ +++TF++PF+
Sbjct: 207 GDGSLNNNN-------KVIATLKHFAAHGQPESGTN---CAPANFSERFLRDTFLMPFKE 256
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
+++ V SVM SYN ++GIP+ A+ LL + +R +WNF G++VSD +I + + +
Sbjct: 257 AIDKAGVISVMASYNEIDGIPSHANKWLLRKVLRDEWNFKGFVVSDYYAITELFHKEETV 316
Query: 290 ND----TKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
+ K +A L+AG++++ D Y N T V+ G E+DID + +
Sbjct: 317 SHGVAANKVEAAKLALEAGVNIEFPNPDCYPNLTE-MVKGGLADESDIDALVLPMLKYKF 375
Query: 344 RLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
LG FD G+ Q ELA +AAR+ I LLKN+ LPL + K +A++G
Sbjct: 376 ELGLFDNPYVEAEPGQFENKLEQDRELALQAARETITLLKNEGNLLPLK--DFKKIAVIG 433
Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGF---YAYSKVINYAPGCADIV------------- 447
P NA + ++G Y GTP YTS G + + Y+ GC V
Sbjct: 434 P--NADRTLLGGYHGTPKYYTSVYQGIKDKVGKNGEVFYSEGCKITVGGSWNDDEVILPD 491
Query: 448 -CQNNSMIPAAIDAAKNADATVIVAG--LDLSVEAEGK----DRVDLLLPGFQTELINKV 500
++ +I A+ A+ +D V+V G S EA K DR L L G Q +L+ ++
Sbjct: 492 PAEDEKLINEAVAVAQKSDVAVLVLGGNEQTSREAWNKKHLGDRPSLELVGRQNKLVEEI 551
Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
K PV +++ + I F K+N + +IL Y G+E GRA+ADV+FG YNP G+L
Sbjct: 552 LKTGK-PVVVLLFNGRPNSIGFIKDN--VPAILECWYLGQETGRAVADVLFGDYNPSGKL 608
Query: 561 PITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPK 619
P++ A ++ Y+ P R Y F D ++ FGYGLSYT+F +
Sbjct: 609 PVSIPRSAGHIPAHYSHKP------SARRGYLFDDVSPLFAFGYGLSYTKFSFD------ 656
Query: 620 SVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVM 679
+++L KD T+ D K + IEV+N G + G EVV
Sbjct: 657 --NLRLSKD--------TISA----------------DEKVSVSIEVKNEGAIAGEEVVQ 690
Query: 680 VYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTM 715
+Y + + T +K++ G+ ++ +A GQ++ V F +
Sbjct: 691 LYIRDKVSSVTRPVKELKGFRKITLAPGQTSTVVFEL 727
>gi|316980598|dbj|BAJ51947.1| putative beta-D-xylosidase [Glycyrrhiza uralensis]
Length = 285
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 133/290 (45%), Positives = 188/290 (64%), Gaps = 9/290 (3%)
Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
GLD S+EAE +DRV LLLPG Q EL+++VA A+GPV LV+MS G +D++FAKN+PKI +
Sbjct: 2 GLDQSIEAEFRDRVGLLLPGHQQELVSRVARVARGPVILVLMSGGPIDVSFAKNDPKISA 61
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYV-KIPYTSMPLR--PVNNFPGR 588
ILWVGYPG+ GG AIADVIFG NPGGRLP+TWY NY+ K+P T+M +R P +PGR
Sbjct: 62 ILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQNYLAKVPMTNMDMRPNPATGYPGR 121
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
TY+F+ GPVV+PFG+GLSYT+F + +A +PK V + Q N TV T+K AV
Sbjct: 122 TYRFYKGPVVFPFGHGLSYTRFTHSLAIAPKQVSVPFATLQAF--TNSTVSTSK----AV 175
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQS 708
+ C + F ++V+N G MDG+ ++V+SKPP + KQ++ + + ++ AG
Sbjct: 176 RVSHANCDAMEVGFHVDVKNEGSMDGTNTLLVFSKPPPGKWSATKQLVSFHKTYVPAGSK 235
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
+V ++ CK L +VD + G H + +G+ +S Q + H
Sbjct: 236 QRVKVGVHVCKHLSVVDEFGIRRIPMGEHELQIGDLKHSISVQTQEEIKH 285
>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 853
Score = 272 bits (696), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 160/421 (38%), Positives = 231/421 (54%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ R+T+ EK+ + + G+PRLG+ Y +EALHGV GR
Sbjct: 30 YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 87
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L K++ +S EARA +N + G
Sbjct: 88 --------------FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQ 133
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G +V GLQ D
Sbjct: 134 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG---------DDPH 184
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E + FEMCV EG +S+M +Y
Sbjct: 185 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAY 240
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A +KA
Sbjct: 241 NALNDVPCTLNSWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKA 299
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + A +Q +++ADID++ + M+LG FD + Y + +
Sbjct: 300 GLDLECGDDVYDGPLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDSGERNPYTKISPS 359
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H ++A +AARQ IVLLKN LPLN +K++A+VG NA K G+Y G P
Sbjct: 360 VIGSKEHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAP 417
Query: 421 C 421
Sbjct: 418 V 418
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 152/300 (50%), Gaps = 49/300 (16%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + + V V G++ S+E EG+DR D+ LP Q E + ++ P +V++ AG+ +
Sbjct: 600 AVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 657
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
+N+ + + +I+ YPGE+GG A+A+V+FG YNP GRLP+T+Y++ ++P P
Sbjct: 658 AVNWMDEH--VPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-LDELP----P 710
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
+ GRTYK+F G V+YPFGYGLSY+ F Y D Q +D
Sbjct: 711 FDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTY--------------SDLQVKDGG--- 753
Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQVIG 697
D+V T ++N GK +G EV VY + P G +K++ G
Sbjct: 754 ------------DEV-------TVSFRLKNTGKRNGDEVAQVYVRIPETGGIVPLKELKG 794
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVD-NAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
+ RV + +G+S +V ++ + L+ D ++ GA ++VG + ++L
Sbjct: 795 FRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKDIRLQTVIDL 853
>gi|410097219|ref|ZP_11292201.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
CL02T12C30]
gi|409224537|gb|EKN17469.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
CL02T12C30]
Length = 805
Score = 272 bits (696), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 225/772 (29%), Positives = 359/772 (46%), Gaps = 136/772 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDL-AYG-VPRLGLPLYEW------------- 58
Y D P +R +DL+++MT+ EK Q+G + YG V + LP EW
Sbjct: 61 YEDLSQPIDKRVEDLLKQMTVEEKTCQLGTIYGYGAVLKDTLPTDEWKTRIWKDGIGNID 120
Query: 59 ------W------------SEALHGVS--FIGRRTNSPPGTHFDSEVPG-----ATSFPT 93
W +EA++ V F+ P + + G +T FP
Sbjct: 121 EHLNGEWKRTSLDFPYSNHAEAMNKVQAFFVEETRLGIPADLTNEGIRGLKHEKSTFFPA 180
Query: 94 VILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETPGE 152
I ++++ L +IG+ EA+A+ G T +SP +++ RDPRWGR +E+ GE
Sbjct: 181 QIGQGCTWDKELIYEIGRITGEEAKAL------GYTNIYSPILDLSRDPRWGRTVESYGE 234
Query: 153 DPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDS 212
D Y+ G G Q V G++ +R + + KH+A Y + + D
Sbjct: 235 DSYLAGEL------GRQQVLGIQSNR--------VVSTPKHFAIYGIPGGGRDCYSRTDP 280
Query: 213 RVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYI 272
+ Q++ E + PF + E MCS+N NG P A L+ + +R W F GY+
Sbjct: 281 HASPQEVHELHLEPFRIAFQEAGALGTMCSHNDYNGTPVSASHYLMTELLRNQWGFKGYV 340
Query: 273 VSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL----DCGDYYTNFTMGAVQQGKIAE 328
VSD +I V+ + + DT+E+AVA L AGL++ + + + A+Q+G + E
Sbjct: 341 VSDSWAIDKNVKFYHIV-DTEEEAVASELNAGLNVRTFFEQSEVFIEALRRALQKGLVEE 399
Query: 329 ADIDTSLRFLYIVLMRLGYFDGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDN 386
+ +D +R + V LG FD P K+ L + + ++ E++ AAR+ IVLLKN+N
Sbjct: 400 STLDQRVREVLYVKFWLGLFD-DPYVKDTKLADKIVNSDKNREVSLRAARESIVLLKNEN 458
Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---FYAYSKVINYAPGC 443
LPL + +K +A++GP A+ K++ Y + + G + + YA GC
Sbjct: 459 NTLPL-SKTLKNIAVIGPQADEVKSLTSRYGSHNPNVITGLQGLKNLLGENVNLMYAKGC 517
Query: 444 ---------ADIVC-----QNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
+D++ + I A++ AK A+ +I G D E + RV+L L
Sbjct: 518 NVRDKNFPQSDVMYFELSDKEKEEIDEAVEIAKKAEVAIIYVGDDFRTIGESRSRVNLDL 577
Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
G Q EL+ V A PV LV+ + V +N+ N + +I+ YPGE G+A+A+V
Sbjct: 578 SGRQKELVRAV-QATGTPVVLVLFNGRPVTLNWEDAN--LPAIVEAWYPGEFSGQAVAEV 634
Query: 550 IFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
+FG YNPGG+L T + + +IP+ + P +P N G+ + DG +YPFGYGLSYT
Sbjct: 635 LFGDYNPGGKLSTT-FPKSVGQIPW-AFPFKP--NATGKGFARVDGE-LYPFGYGLSYTT 689
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID----DVKCKDYKFTFQIE 665
F+ +N P A + D V CK
Sbjct: 690 FEI---------------------------SNLQPSATKIADGDTLTVTCK--------- 713
Query: 666 VENMGKMDGSEVVMVYSKPPGIAGTHI-KQVIGYERVFIAAGQSAKVGFTMN 716
V+N G + G EVV +Y + + K++ G+ERV + G+ V F +N
Sbjct: 714 VKNTGSVKGDEVVQLYLNDETSSISRFEKELCGFERVALEPGEEKTVTFKVN 765
>gi|237721943|ref|ZP_04552424.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229448812|gb|EEO54603.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 792
Score = 272 bits (696), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 231/800 (28%), Positives = 362/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 48 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDACPTAGWLAEIWKDGIGNI 106
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 107 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 166
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T +SP +++ +DPRWGRV+E+
Sbjct: 167 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 220
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+ G + GLQ+ EG I A KH+A Y + +
Sbjct: 221 GEDPYLAGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 266
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 267 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 326
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 327 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 380
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H ++ +AA + +V
Sbjct: 381 DEGKVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVV 440
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+N LPL+ N K +A++GP+A K + Y + G Y + +
Sbjct: 441 LLKNENQMLPLSK-NFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 499
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 500 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFS 558
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N I +I+ +PGE G
Sbjct: 559 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYIPAIIHAWFPGEFMG 615
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G+ DG +YPFGY
Sbjct: 616 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-ALYPFGY 670
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ K +G + T
Sbjct: 671 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 698
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ V FT+ + L
Sbjct: 699 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLG 757
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 758 LWDKNNRFTVEPGSFSVMVG 777
>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
CL03T00C08]
gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
CL03T12C07]
gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
CL03T12C07]
gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
CL03T00C08]
Length = 805
Score = 272 bits (696), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 235/813 (28%), Positives = 356/813 (43%), Gaps = 153/813 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P R + L+ +MTL EKV QM + LG P+YE
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 93
Query: 58 ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
W LH S R +N H +P
Sbjct: 94 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q E D S + A KH+A+Y
Sbjct: 209 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G + SVM SYN ++G P LL
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD ++ + E ND +A + + AG+D D G + Y + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A A ID ++R + + ++G FD + + + +H LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 434
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN + LPL +I+TLA++GP+A+ M+G+Y +GT + +
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI+ A+NADA V+V G D S E
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 552
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +++ K PV LV++ + + A +
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
W YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663
Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+ P YPFGYGLSYT F Y D+K + T G++
Sbjct: 664 VEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
D + + ++N G DG EV +Y + + T KQ+ + R+ + AG+S
Sbjct: 698 ------DCRVDVTVTIQNQGTADGDEVAQLYFQDDVSSFTTPAKQLRAFSRIHLKAGESR 751
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+V FT++ KSL + ++ G TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|336412663|ref|ZP_08593016.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
3_8_47FAA]
gi|335942709|gb|EGN04551.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
3_8_47FAA]
Length = 735
Score = 272 bits (695), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 213/760 (28%), Positives = 356/760 (46%), Gaps = 97/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
Y D K P +R DL+ RMTL EKV Q+ G VP +G +Y
Sbjct: 30 YKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYFE 89
Query: 59 WSEALHGV----SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
+ AL + R P +D+ T +P + S+N L ++ +
Sbjct: 90 TNPALRNSMQKKAMEKSRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G + V+G
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +++AC KHY Y G D + + +++Q + +T++LP+EM V G
Sbjct: 198 -YQGDDLSAENRMAACLKHYVGYGASE-AGRDYVY--TEISKQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P A+P ++ + ++ W G+IVSD +I+ + ++ L TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
+A AGL++D + Y V++G+++ A +D ++R + ++ RLG F+
Sbjct: 311 EAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K PQ +++AA A + +VLLKN+N LPL + K +A++GP A ++
Sbjct: 371 PATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLT--DKKKIAVIGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G +G +A + YA GCA N A++AA+ +D V
Sbjct: 429 GSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNREGFAEALEAARWSDVVV 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G ++ E R + LP Q EL ++ A K P+ LV+++ +++N + P
Sbjct: 488 LCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELN--RLEPI 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
+IL + PG G +A ++ G+ NP G+L +T+ PY++ +P+
Sbjct: 545 SDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596
Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
GR ++ F + +YPFG+GLSYT+FKY GT
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
P V D + + ++ V N+G DG+E V + P + T +K++ +E+
Sbjct: 631 PSATKVKRGD------RLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
I AG++ F ++ + V+ L +G + ILV
Sbjct: 685 LIKAGETKTFRFDIDMERDFGFVNEDGKRFLEAGEYHILV 724
>gi|383302737|gb|AFH08276.1| hypothetical protein [uncultured bacterium]
Length = 768
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 229/790 (28%), Positives = 368/790 (46%), Gaps = 125/790 (15%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQM------------------GDLAYGVPRLGLP- 54
Y D ER +DL+ RMTL EKV QM GDL + P
Sbjct: 27 YKDPSASVSERVEDLLSRMTLEEKVGQMNQFVGIEHIKANSAVLTEGDLFNNTAQAFYPG 86
Query: 55 -----LYEWWSEALHG----------VSFIGRRTNSP----------PGTHFDSEVPGAT 89
+ W E L G + + R S H ++ P T
Sbjct: 87 ITGDTVIRWTREGLVGSFLHVLTIEEANMLQRHAMSSRLAIPILFGIDAIHGNANAPDNT 146
Query: 90 SFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLET 149
+PT I +SF+ + KI + + E RAM N TF +PN++VVRDPRWGRV ET
Sbjct: 147 VYPTNIGLASSFDPEMAYKIARQTAAEMRAM----NLHWTF-NPNVDVVRDPRWGRVGET 201
Query: 150 PGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFH 209
GEDPY++ V G + V+G + D+ P + AC KH+ + N
Sbjct: 202 FGEDPYLIS------VLGAESVKGYQGTLDT---PNDVLACIKHFVG---GGFPANGTNG 249
Query: 210 FDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFH 269
+ V+E+ ++E + PFE V G S+M S+N VNGIP ++ L+ +RG+W F
Sbjct: 250 SPTDVSERTLREVLLPPFEAGVEAG-AGSLMTSHNEVNGIPAHSNEWLMRDVLRGEWGFK 308
Query: 270 GYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAE 328
G++VSD I+ I + H+ + KE A + + AG+D+ G Y+ V++G+I E
Sbjct: 309 GFVVSDWMDIEHIYDLHRTAENLKE-AFYQSIMAGMDMHMHGIYWNELVCELVREGRIPE 367
Query: 329 ADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA 388
+ ID S+R + V RLG F+ + +P H A EAAR IVLLKND G
Sbjct: 368 SRIDESVRRILDVKFRLGIFENPYADEARTMEVRLSPGHRATALEAARNSIVLLKND-GV 426
Query: 389 LPLNTGNIKTLALVGPHANATKAMIGNYEGT--PCRYTSPMDGFYAYSKVINYAPGCADI 446
LPL+ K + + G +A+ + ++G++ + P T+ ++G + ++ D
Sbjct: 427 LPLDASKYKRVMVTGINAD-DENILGDWSASQRPENVTTILEGLREVAPDTHFE--FVDQ 483
Query: 447 VCQNNSMIPA----AIDAAKNADATVIVAG-------LDLSVEAEGKDRVDLLLPGFQTE 495
+M PA A + A++AD ++VAG L E DR D+ L G Q E
Sbjct: 484 GWNPQTMSPAQVEKAAEHARHADLNIVVAGEYMMRHRWALRTGGEDTDRSDIDLVGLQNE 543
Query: 496 LINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYN 555
LI KVA + K P L++++ + + +A N + +I+ PG GG+A+A++++G N
Sbjct: 544 LIEKVAASGK-PTILILVNGRQLGVEWAAEN--LPAIVEAWEPGMYGGQAVAEILYGTVN 600
Query: 556 PGGRLPITW-YEANYVKIPYTSMPLRPVNNF-PGRTYKFFDGPVVYPFGYGLSYTQFKYK 613
P +LP+T +++ Y P + + G++ ++PFG+GLSYT ++Y
Sbjct: 601 PSAKLPVTIPRSVGQIQMYYNHKPSLYFHPYAAGKS-----SSPLWPFGFGLSYTTYEYS 655
Query: 614 VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMD 673
D++L D+ D GT + V+N G D
Sbjct: 656 --------DLRLSSDEIAAD-----GT-------------------LDVTVRVKNTGSRD 683
Query: 674 GSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL 732
G E++ +Y + + T +K++ + RV + AG++ + FT+ K L+ +D ++
Sbjct: 684 GVEIIQLYIRDLYSSVTRPVKELKDFGRVALKAGETKDITFTITPDK-LQFLDKDLRPVV 742
Query: 733 ASGAHTILVG 742
G ++VG
Sbjct: 743 EPGEFVVMVG 752
>gi|189461690|ref|ZP_03010475.1| hypothetical protein BACCOP_02354 [Bacteroides coprocola DSM 17136]
gi|189431577|gb|EDV00562.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 499
Score = 271 bits (694), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 160/420 (38%), Positives = 231/420 (55%), Gaps = 45/420 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ R+T+ EKV + + G+ RL +P Y +EALHGV GR
Sbjct: 28 YKNEDAPLHERIMDLLSRLTVEEKVSLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-- 85
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L ++ +S EARA +N + G
Sbjct: 86 --------------FTVFPQAIGLAATWNPELQYQVATVISDEARARWNELDQGKLQKGQ 131
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDPY+ G +VRGLQ D+R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGTMGTAFVRGLQG---------DDAR 182
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK+ + KH+AA N E ++RF + +++E+ ++E ++ FE C+ +G +S+M +Y
Sbjct: 183 YLKVVSTPKHFAA----NNEEHNRFECNPQISEKQLREYYLPAFEACIKDGKAASIMSAY 238
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A +KA
Sbjct: 239 NAINNVPCTLNSWLLTKVLRHDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIKA 297
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + A +Q +++ADID++ + MRLG FD Y + +
Sbjct: 298 GLDLECGDDVYYEPLLNAYKQYMVSDADIDSTAYHVLKARMRLGLFDNGKNNPYTKISPS 357
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + H +A EAARQ IVLLKN N LPL+T +K++A+VG NA G+Y G+P
Sbjct: 358 IIGSKLHQRVALEAARQCIVLLKNHNWVLPLDTKKLKSIAVVG--INAGNCEFGDYSGSP 415
>gi|423214394|ref|ZP_17200922.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692809|gb|EIY86045.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
CL03T12C04]
Length = 800
Score = 271 bits (694), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 231/800 (28%), Positives = 361/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T +SP +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+ G + GLQ EG I A KH+A Y + +
Sbjct: 229 GEDPYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRHAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H ++ +AA + +V
Sbjct: 389 NEGKVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN N LPL+ N K +A++GP+A K + Y + G Y + +
Sbjct: 449 LLKNKNQMLPLSK-NFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G+ DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ K +G + T
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ V FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785
>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
Length = 853
Score = 271 bits (694), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 163/431 (37%), Positives = 234/431 (54%), Gaps = 45/431 (10%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S+ V + Y D P ER DL+ R+T+ EK+ + + G+PRLG+ Y +EAL
Sbjct: 20 SVAVAQTKELYKDMNAPQHERIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEAL 79
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV PG T FP I + +N L +I +S EAR +N
Sbjct: 80 HGVV--------RPGNF--------TVFPQAIGLASMWNPELLYEISTAISDEARGRWNE 123
Query: 124 GNAG----------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEG 173
N G LTFWSP +N+ RDPRWGR ET GEDP++ G+ + +V+GLQ
Sbjct: 124 LNRGKDQKGFFSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQG--- 180
Query: 174 VEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNE 233
+D R LKI + KH+AA N E ++RF + ++E++++E ++ FE C+ E
Sbjct: 181 ------NDPRYLKIVSTPKHFAA----NNEEHNRFECNPHISERNLREYYLPAFESCIKE 230
Query: 234 GDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTK 293
G S+M +YN +N +P +P LL Q +R +W F+GY+VSDC +V HK++ T
Sbjct: 231 GKAQSIMSAYNAINDVPCTLNPWLLTQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVK-TP 289
Query: 294 EDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
E A +KAGLDL+CGD Y M A +Q + +ADIDT+ + M LG FD
Sbjct: 290 EAAATLSIKAGLDLECGDNVYIEPLMNAYKQCMVTDADIDTAAYRILRARMMLGLFDDPE 349
Query: 353 Q--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
+ Y + + + +H +LA EAARQ +VLLKN+ LPLN +K++A+VG NA
Sbjct: 350 KNPYNAISPSIVGCEKHRQLALEAARQSLVLLKNEKNFLPLNPKKVKSIAVVG--INAGN 407
Query: 411 AMIGNYEGTPC 421
G+Y GTP
Sbjct: 408 CEFGDYSGTPV 418
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 102/287 (35%), Positives = 144/287 (50%), Gaps = 51/287 (17%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + D TV V G++ S+E EG+DR + LP Q I + A P T+V++ AG+ +
Sbjct: 600 AIRECDVTVAVLGINKSIEREGQDRYTIELPADQQLFIKEAYKA--NPNTVVVLVAGSSL 657
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
IN+ N I +IL YPGE+GG A+A+ +FG YNPGGRLP+T+Y + +
Sbjct: 658 AINWIDEN--IPAILNAWYPGEQGGTAVAEALFGDYNPGGRLPLTYYRSLDELPAFDDYD 715
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
++ GRTY +F+ +YPFGYGLSYT+F YK S S D
Sbjct: 716 IQ-----KGRTYMYFENKPLYPFGYGLSYTRFDYKNLKSEVSDD---------------- 754
Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVI 696
+ KFT V+N GK G EV VY + P GI +KQ+
Sbjct: 755 ----------------AVNLKFT----VKNTGKYAGDEVAQVYVRFPESGIK-VPLKQLK 793
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
G+ERV I G+SA+V ++ K L++ D SG + +VG
Sbjct: 794 GFERVHIGKGKSAQVSVSIPK-KELRLWDEKDGKFYTPSGNYIFMVG 839
>gi|332982620|ref|YP_004464061.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700298|gb|AEE97239.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 753
Score = 271 bits (694), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 217/676 (32%), Positives = 334/676 (49%), Gaps = 84/676 (12%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP I ++++ + + + + +A + GL SP ++V RDPRWGRV
Sbjct: 108 GATVFPQAIGLASTWDAEAIEAMAGVIRQQMKAAG--AHQGL---SPVLDVARDPRWGRV 162
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
ET GEDPY+V A++YVRGLQ G + + I A KH+A + EG
Sbjct: 163 EETFGEDPYLVASMAVSYVRGLQ---GQDLTK-------GIFATLKHFAGHSFS--EGG- 209
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
R V E+++ + F+ PFE V E + SVM +Y+ ++G+P A +LL +RG +
Sbjct: 210 RNCAPVHVGERELWDIFLFPFEAAVREANAKSVMNAYHDIDGVPCAASRELLTDILRGHF 269
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQG 324
F G +VSD D+I + ++H F K++A + L+AG+D++ D Y M AV++G
Sbjct: 270 GFDGIVVSDYDAIDRLRKAH-FTAGNKKEAAVQALEAGIDIELPKMDCYGQPLMDAVKEG 328
Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
I+EA I+ S+ + LG FDG + P+ E++ + AR+ IVLLKN
Sbjct: 329 MISEATINESVERVLTAKFELGLFDGVYVDVDSVPGLFETPEQREMSRDIARKSIVLLKN 388
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGN--------YEGTPCRYTSPMDGFYAY--- 433
DN LPL+ +IK++A++GP+A+ + M+G+ Y+ T + ++G
Sbjct: 389 DN-VLPLSK-DIKSIAVIGPNADNARNMLGDYAFMAHRSYDKTSVHIVTVLEGIKNKVLD 446
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSV-----EAEGKDRVDLL 488
S I YA GC DI+ + A++AA+ ADA ++V G + + E DR D+
Sbjct: 447 SCRITYAKGC-DIIDPSTDGFVEAVNAARAADAAIVVVGDNSGIFGKGTSGENDDRTDIT 505
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
LPG Q +L+ + D K PV +V+++ A +N W YPGEEGG A+AD
Sbjct: 506 LPGVQMQLVKAIKDTGK-PVIVVLINGRAFAAKELADNASALMEAW--YPGEEGGNAVAD 562
Query: 549 VIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSY 607
V+FG YNP GRLPI+ E + I Y P +N T F FGYG+SY
Sbjct: 563 VLFGDYNPAGRLPISLPCEVGQIPINYNLKPASYINYLSTETKPAF------AFGYGMSY 616
Query: 608 TQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVE 667
T F Y S +V P A K +V
Sbjct: 617 TTFGYSDLSITPAV---------------------APSAG-----------KVDISFKVT 644
Query: 668 NMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDN 726
N G++ G EVV +Y + + +K++ G++RV + G++ ++ FT+ A L D
Sbjct: 645 NAGQLAGDEVVQLYIRDEVSSIVRPVKELKGFKRVNLQPGETKEITFTLYA-DQLAFHDK 703
Query: 727 AANSLLASGAHTILVG 742
++ G I+VG
Sbjct: 704 DMRLVVEPGTFKIMVG 719
>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 941
Score = 271 bits (694), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 227/809 (28%), Positives = 360/809 (44%), Gaps = 144/809 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
Y D R +DL+++MTL EK QM L YG R+ LP EW
Sbjct: 53 YEDPAATVDARVEDLLKQMTLDEKTCQMVTL-YGYKRVLKDALPTPEWKQMLWKDGIGAI 111
Query: 59 --------------------WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
W + H + F+ P + + G
Sbjct: 112 DEHLNGFQQWGLPPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESY 171
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N +L K+G EAR + G T ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRALIHKVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRGLQ ++A KH+AAY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGLQQ---------------HVAATGKHFAAYSNNKGARE 270
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D + + +++ I PF + E + VM SYN +GIP L +R +
Sbjct: 271 GMARVDPQTSPHEVENIHIYPFRRVIKEAGLLGVMSSYNDYDGIPIQGSYYWLTTRLRDE 330
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D + V
Sbjct: 331 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLRELV 389
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQGIV 380
++G + E ++ +R + V +G FD Q G + + E +A +A+R+ +V
Sbjct: 390 KEGGLDEETVNDRVRDILRVKFLIGLFDAPYQTDLAGADKEVEKEENEAVALQASRESVV 449
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY----AYSKV 436
LLKN+N LPLN +K +A+ GP+A+ + +Y T+ + G ++V
Sbjct: 450 LLKNENSTLPLNINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEV 509
Query: 437 INYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ Y GC D+V N + I A++ A+ AD V+V G E
Sbjct: 510 L-YTKGC-DLVDANWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGEN 567
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
K R L LPG Q +L+ V K PV L++++ + +N+A + + +IL YPG +
Sbjct: 568 KSRSSLDLPGRQLQLLQAVQATGK-PVVLILINGRPLSVNWA--DKYVPAILEAWYPGSK 624
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV---- 597
GG A+AD++FG YNPGG+L +T + +IP+ + P +P + G DG +
Sbjct: 625 GGVALADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPASQIDGGKNAGPDGNMSRIN 682
Query: 598 --VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+YPFGYGLSYT F+Y + +PK V
Sbjct: 683 GALYPFGYGLSYTTFEYSNLEITPK---------------------------------VI 709
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGF 713
+ K T +++V N GK G EVV +Y++ T+ K + G+ER+ + G++ +V F
Sbjct: 710 TPNEKATVRLKVTNTGKYAGDEVVQLYTRDVLSSVTTYEKNLAGFERIHLEPGETKEVTF 769
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
++ K L+++D ++ G I+ G
Sbjct: 770 ILDR-KHLELLDADMKRVVEPGDFAIMAG 797
>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
Length = 814
Score = 271 bits (694), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 235/813 (28%), Positives = 357/813 (43%), Gaps = 153/813 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P R + L+ +MTL EKV QM + LG P+YE
Sbjct: 49 YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 102
Query: 58 ----------------WWSEALH-GV--SFIGRRTNSPPG---THFDSEVP--------- 86
W LH G+ S R +N H +P
Sbjct: 103 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 162
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 163 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 217
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q E D S + A KH+A+Y
Sbjct: 218 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 266
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G + SVM SYN ++G P LL
Sbjct: 267 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 325
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD ++ + E ND +A + + AG+D D G + Y + A
Sbjct: 326 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 383
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A A ID ++R + + ++G FD + + + +H LA E ARQ IV
Sbjct: 384 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 443
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN + LPL +I+TLA++GP+A+ M+G+Y +GT + +
Sbjct: 444 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 502
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GC + + + AI+ A+NADA V+V G D S E
Sbjct: 503 RVLYAKGCT-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 561
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +++ K PV LV++ + + A +
Sbjct: 562 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 620
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
W YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 621 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 672
Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+ P YPFGYGLSYT F Y D+K + T G++
Sbjct: 673 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 706
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
D + + ++N G DG EV +Y + + T KQ+ + R+ + AG+S
Sbjct: 707 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 760
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+V FT++ KSL + ++ G TI+VG
Sbjct: 761 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 792
>gi|297543748|ref|YP_003676050.1| glycoside hydrolase family 3 domain-containing protein
[Thermoanaerobacter mathranii subsp. mathranii str. A3]
gi|296841523|gb|ADH60039.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
mathranii subsp. mathranii str. A3]
Length = 787
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 224/816 (27%), Positives = 376/816 (46%), Gaps = 156/816 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL---------- 63
Y D K P ++ ++L+ +MT+ EK+ Q+ G+ +YE + +
Sbjct: 6 YLDPKQPVEKKVENLLAQMTIEEKIAQLS---------GIWVYEILDDMMKFSYKKANRL 56
Query: 64 --HGVSFIGR---------------------------RTNSPPGTHFDS----EVPGATS 90
HG+ I R R P H +S GAT
Sbjct: 57 MTHGIGQITRLGGASNLSPQETVKIANQIQKYLVENTRLGIPALIHEESCSGYMAKGATI 116
Query: 91 FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETP 150
FP I +++N L +K+ + + +A+ +P ++V RDPRWGR ET
Sbjct: 117 FPQTIGVASTWNPKLVEKMASVIREQMKAV-----GARQALAPLLDVTRDPRWGRTEETF 171
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+V ++Y+RGLQ +++ + A KH+ Y N EG +
Sbjct: 172 GEDPYLVMHMGVSYIRGLQ----------TENLKEGVIATGKHFVGYG--NSEGGMNWA- 218
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
+ + +++ E F+ PFE V E + S+M Y+ ++GIP +LL +R +W F G
Sbjct: 219 PAHIPMRELYEIFLYPFEAAVKEAKLGSIMPGYHELDGIPCHKSKQLLTDILRKNWGFDG 278
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAE 328
+VSD +I + E H+ ++ KE A L+AG+D++ D Y ++QG I
Sbjct: 279 IVVSDYFAINQLYEYHRLASNKKE-AAKLALEAGVDVELPSTDCYGLPIKELIEQGDIDI 337
Query: 329 ADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQ-HIELAAEAARQGIVLLKNDNG 387
++ ++R + LG F+ +P I + Q +LA + A++ IVLLKN++
Sbjct: 338 DFVNDAVRRILKAKFLLGLFE-NPYVDEKRVVEIFDTQEQRQLAYKIAQESIVLLKNESN 396
Query: 388 ALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR-------------YTSPM-DGFYA- 432
LPL +++++A++GP+A+ + MIG+Y PC + +P+ +G A
Sbjct: 397 LLPLKK-DLQSIAVIGPNADNIRNMIGDY-AYPCHIESLLEMREKDNVFNTPLPEGLEAK 454
Query: 433 -------------------YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG- 472
+KVI YA GC D++ + + A++ AK AD ++V G
Sbjct: 455 DIYVPIVSVLQGIKEKVSPKTKVI-YAKGC-DVISDDTAGFNKAVEIAKQADVAIVVVGD 512
Query: 473 ----LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
D E +DR DL LPG Q +L+ + + PV +V+++ + I ++ K
Sbjct: 513 RAGLTDGCTSGESRDRADLNLPGVQEQLVKAIYETGT-PVVVVLINGRPMSI--SRLAEK 569
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPG 587
I +I+ PGEEGGRAIADVIFG YNPGG+LPI+ + + Y P N+ G
Sbjct: 570 IPAIIEAWLPGEEGGRAIADVIFGDYNPGGKLPISIPCSVGQLPVYYYHKPSGGRTNWKG 629
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
+ P +YPFGYGLSYT+F Y N + K
Sbjct: 630 DYVESSTKP-LYPFGYGLSYTEFLYS---------------------NLNISNPK----- 662
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAG 706
V ++ ++V+N+GK+ G EVV +Y ++ T +K++ G++R+ + G
Sbjct: 663 -----VSTQEGIIEISVDVKNIGKVKGDEVVQLYIHREFLSVTRPVKELKGFKRITLDVG 717
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ V F +++ + L + ++ G +++G
Sbjct: 718 EQKTVIFQLSS-EQLGFYNEEMEYVVEPGRVEVMIG 752
>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 875
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 158/446 (35%), Positives = 237/446 (53%), Gaps = 42/446 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY +A L +RA DL+ R+TL EKV M D + +PRLG+P ++WW+EALHG+ G
Sbjct: 23 LPYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGIPQFQWWNEALHGIGRNGF 82
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
AT FP + AS++++L ++ VS EAR
Sbjct: 83 ----------------ATVFPITMAMAASWDDALLHQVFTAVSDEARVKAQQAKCTGDIK 126
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS--D 181
L+FW+PNIN+ RDPRWGR ET GEDPY+ + + VRGLQ GV Y+ +
Sbjct: 127 RYQSLSFWTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQ---GVGYNGEDLGV 183
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVSSVM 240
S+ K+ AC KH+A + W +R F+ + E+D+ ET++ F+ V EG V+ VM
Sbjct: 184 SKYRKLLACAKHFAVHSGPEW---NRHEFNIENLPERDLWETYLPAFKALVQEGKVAEVM 240
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARV 300
C+Y R++G CA + Q +R +W F G I SDC +I+ + ++ +A A+
Sbjct: 241 CAYQRIDGQACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNVSKDGAEASAKA 300
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
+ AG D++CG Y + AV++G + EADID SLR L I LG D + +
Sbjct: 301 VLAGTDVECGSEYKHLPE-AVRRGDVKEADIDRSLRRLLIARFELGDMDSDDLNAWTKIP 359
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPL------NTGNIKTLALVGPHANATKAM 412
+ + + H +LA + A + IVLL+N LPL G+ K + ++GP+AN + M
Sbjct: 360 ETVVASQAHKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVVMGPNANDSVMM 419
Query: 413 IGNYEGTPCRYTSPMDGFYAYSKVIN 438
GNY G P + +DG +K ++
Sbjct: 420 WGNYAGYPTHTVTALDGITRMAKTLS 445
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/321 (22%), Positives = 130/321 (40%), Gaps = 61/321 (19%)
Query: 433 YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GK 482
Y + Y DI + N + N + V G+ ++E E G
Sbjct: 589 YVQETGYGALNFDIKKRVNPTAEELLAQIGNTQTIIFVGGISPNLEGEEMRVNEPGFKGG 648
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
DR + LP Q +L+ + A K ++ ++ + A +IL Y GE+G
Sbjct: 649 DRTSIELPQAQRDLLAVLHKAGK---KVIFVNCSGSAMALAPELETCDAILQWWYGGEQG 705
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
G A+A +FG P G+LP+T+Y++ +P RTY++++G ++PFG
Sbjct: 706 GAALATTLFGMVAPSGKLPVTFYKST------DELPDFLDYTMKNRTYRYYEGEPLFPFG 759
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
+GL YT F ++D + K+ +
Sbjct: 760 FGLGYTTF---------NIDKPIYKNNKV------------------------------- 779
Query: 663 QIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLK 722
Q+ V+N+G G+E V VY + K + Y++V + A ++ + + KS +
Sbjct: 780 QVRVKNLGTTAGTETVQVYIRHLADKEGPKKSLRAYQQVTLNAAEAKTISIEL-PRKSFE 838
Query: 723 IVDNAANSL-LASGAHTILVG 742
D N++ + G + ++VG
Sbjct: 839 GWDVKTNTMRVVPGKYEVMVG 859
>gi|29350122|ref|NP_813625.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Bacteroides thetaiotaomicron VPI-5482]
gi|29342034|gb|AAO79819.1| periplasmic beta-glucosidase precursor, xylosidase/arabinosidase
[Bacteroides thetaiotaomicron VPI-5482]
Length = 769
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 227/728 (31%), Positives = 345/728 (47%), Gaps = 124/728 (17%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ EA HG IG T FPT I A+++ +L +++
Sbjct: 114 RLGIPLF-LAEEAPHGHMAIG-----------------TTVFPTGIGMAATWSPTLIEEV 155
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G ++ E R+ + P +++ RDPRW RV ET GEDP + GR + GL
Sbjct: 156 GNVIAKEIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMILGLG 210
Query: 170 DVE-GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFE 228
+ EY A KH+ AY + EG ++ S V +D+ E F+ PF
Sbjct: 211 SGDLSCEY---------ATIATLKHFLAYAVP--EGGQNGNYAS-VGTRDLHENFLPPFR 258
Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
++ G +S VM SYN ++G+P A+ LL Q +R +W F G++VSD SI+ + ESH F
Sbjct: 259 EAIDAGALS-VMTSYNSIDGVPCTANHYLLTQLLRNEWRFRGFVVSDLYSIEGVHESH-F 316
Query: 289 LNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
+ T E+A + + AG D+D GD + N T AVQ GKI+EA IDT++ + + +G
Sbjct: 317 VAPTIEEAAMQAVSAGADIDLGGDAFMNLTH-AVQFGKISEAVIDTAVCRVLRMKFEIGL 375
Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
F+ + + HI+LA + A+ IVLLKN+N LPLN IK +A+VGP+A+
Sbjct: 376 FEHPYVNPKTATKIVRSKDHIKLARKVAQSSIVLLKNENSILPLNK-KIKKVAVVGPNAD 434
Query: 408 ATKAMIGNYEG--TPCRYTSPMDGFYAY---SKVINYAPGCADIVCQNNSMIPAAIDAAK 462
M+G+Y + +DG + SKV Y GCA I + I A++AA
Sbjct: 435 NRYNMLGDYTAPQEDENIKTVLDGVISKLSPSKV-EYVRGCA-IRDTTVNEIAEAVEAAS 492
Query: 463 NADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINK 499
++ + V G + + EG DR L L G Q +L+
Sbjct: 493 RSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLIA 552
Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
+ K P+ +V + +D +A ++L YPG+EGG AIADV+FG YNP GR
Sbjct: 553 LKATGK-PLIVVYIEGRPLDKVWASEYA--DALLTASYPGQEGGYAIADVLFGDYNPAGR 609
Query: 560 LPITWYEANYVKIPYTSMPLRPV--NNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVA 615
LP++ IP S+ PV N R + + + +Y FGYGLSYT F+Y
Sbjct: 610 LPVS--------IP-RSVGQIPVYYNKKAPRNHDYVEQAASPLYTFGYGLSYTTFEYS-- 658
Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGS 675
D+++ K PC F +V+N G DG
Sbjct: 659 ------DLQV--------------IRKSPC-------------HFEVSFKVKNTGSYDGE 685
Query: 676 EVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
EV +Y + + ++Q+ +ER F+ G+ ++ FT+ K L I+D ++ +
Sbjct: 686 EVAQLYLRDEYASVVQPLRQLKCFERFFLKRGEEKEIFFTLTE-KDLSIIDRNMKRVVET 744
Query: 735 GAHTILVG 742
G I++G
Sbjct: 745 GDFRIMIG 752
>gi|329956938|ref|ZP_08297506.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523695|gb|EGF50787.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 944
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 231/810 (28%), Positives = 368/810 (45%), Gaps = 144/810 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D +R ++L+++MTL EK QM L YG R+ LP EW W + G+
Sbjct: 53 YEDPAATLDDRIENLLQQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKD---GI 108
Query: 67 SFIGRRTNS---------------PPGTH----------------------FDSE-VPG- 87
I N P H F +E + G
Sbjct: 109 GAIDEHLNGFQQWGLPPSDNENVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGV 168
Query: 88 ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
AT+FPT + ++N L +++G EAR + G T ++P ++V RD R
Sbjct: 169 ESYKATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 222
Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
WGR E GE PY+V I VRGLQ H +++A KH+AAY +
Sbjct: 223 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATAKHFAAYSNNKG 269
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
D +++ ++++ I PF+ + E + +M SYN +GIP L +
Sbjct: 270 AREGMSRVDPQMSPREVENIHIYPFKRVIRETGLLGIMSSYNDYDGIPVQGSYYWLTTRL 329
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
R + F GY+VSD D+++ + H D KE AV + ++AGL++ C D +
Sbjct: 330 RQEMGFRGYVVSDSDAVEYLYTKHNTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 388
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQ 377
V++G ++E I+ +R + V +G FD Q G +N E +A +A+R+
Sbjct: 389 ELVKEGGLSEEVINDRVRDILRVKFLIGLFDSPYQTDLAGADNEVEKAANEAVALQASRE 448
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK-- 435
+VLLKN + LPLN IK +A+ GP+A+ + +Y T+ ++G ++
Sbjct: 449 SVVLLKNADNTLPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIREKAQGK 508
Query: 436 -VINYAPGC---------ADIV-----CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
+ Y GC ++I+ + I A A+ AD V+V G E
Sbjct: 509 AEVLYTKGCDLVDAHWPESEIIEYPLTPDEQAEIDRAAANARQADVAVVVLGGGQRTCGE 568
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
K R L LPG Q +L+ V K PV LV+++ + +N+A + + +IL YPG
Sbjct: 569 NKSRTSLDLPGHQLKLLQAVQATGK-PVVLVLINGRPLSVNWA--DKFVPAILEAWYPGS 625
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV--- 597
+GG A+AD++FG YNPGG+L +T + +IP+ + P +P + G DG +
Sbjct: 626 KGGTAVADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPASQIDGGKNPGADGNMSRI 683
Query: 598 ---VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
+YPFGYGLSYT F+Y + SPK V
Sbjct: 684 NGALYPFGYGLSYTTFEYSDLEISPK---------------------------------V 710
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVG 712
D K T +++V N GK G EVV +Y++ T+ K + G+ER+ + G++ +V
Sbjct: 711 ITPDQKATVRLKVTNTGKRAGDEVVQLYTRDILSSITTYEKNLAGFERIRLKPGETKEVT 770
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
FT++ K L++++ ++ G I+ G
Sbjct: 771 FTLDR-KHLELLNADMKWIVEPGEFAIMAG 799
>gi|170731072|ref|YP_001776505.1| beta-glucosidase [Xylella fastidiosa M12]
gi|167965865|gb|ACA12875.1| Beta-glucosidase [Xylella fastidiosa M12]
Length = 882
Score = 271 bits (693), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 170/451 (37%), Positives = 237/451 (52%), Gaps = 52/451 (11%)
Query: 20 PYPER-AKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPG 78
P PE+ A LV +MT EK+ Q + A +PRLG+P Y+WWSE LHG++ G
Sbjct: 28 PSPEQHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY------- 80
Query: 79 THFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLT 129
AT FP I AS+N L + +G STEARA +NL AGLT
Sbjct: 81 ---------ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLT 131
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSPNIN+ RDPRWGR +ET GEDPY+ + A++++RGLQ + P I A
Sbjct: 132 LWSPNINIFRDPRWGRGMETYGEDPYLTSQLAVSFIRGLQG--------NIPDHPRTI-A 182
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
KH+A + R FD V+ D++ T+ F + +G SVMC+YN ++G
Sbjct: 183 TPKHFAVHSGPE---PGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGT 239
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P CA LLN +R DW F+G++VSDCD+I+ + H F D A A LK+G DL+C
Sbjct: 240 PACASDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGDDLNC 298
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQH 367
G+ Y + A+ +G I E+ +D +L L+ RLG Y +G +I P H
Sbjct: 299 GNTYRDLNQ-AIARGDIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAH 357
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM 427
LA +AA Q +VLLKN LPL G TLA++GP A++ A+ NY+GT +P+
Sbjct: 358 RALALQAAAQSLVLLKNSGNTLPLTPGT--TLAVLGPDADSLTALEANYQGTSSTPVTPL 415
Query: 428 DGFYA--------YSKVINYAPGCADIVCQN 450
G Y++ + APG + +
Sbjct: 416 IGLRTRFGTAKVHYAQGASLAPGVPSTITET 446
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 138/295 (46%), Gaps = 52/295 (17%)
Query: 460 AAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVT 509
A +ADA V GL VE E G DR + LP Q L+ V K P+
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665
Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
+V+MS AV +N+A+++ +IL YPG+ GG AIA + G NPGGRLP+T+Y +
Sbjct: 666 VVLMSGSAVALNWAQHH--ADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPMTFYRSTQ 723
Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
PY S + GRTY++F G +YPFGYGLSYTQF Y+ +
Sbjct: 724 DLPPYISY------DMTGRTYRYFKGQPLYPFGYGLSYTQFAYEAPQLSTAT-------- 769
Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
+K D T V N G G EVV +Y +PP
Sbjct: 770 -----------------------LKAGD-TLTVTAHVRNTGTRAGDEVVQLYLEPPHSPQ 805
Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
++ ++G++RV + G+S + FT++A + L V + +G + + VG G
Sbjct: 806 APLRNLVGFKRVTLRPGESRLLTFTLDA-RQLSSVQQTGQRSVEAGHYHLFVGGG 859
>gi|160882671|ref|ZP_02063674.1| hypothetical protein BACOVA_00625 [Bacteroides ovatus ATCC 8483]
gi|423289150|ref|ZP_17268000.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
gi|423298450|ref|ZP_17276507.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
gi|156111986|gb|EDO13731.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392662991|gb|EIY56545.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
gi|392667846|gb|EIY61351.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
Length = 1049
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 219/767 (28%), Positives = 358/767 (46%), Gaps = 100/767 (13%)
Query: 16 DAKLPYPERA----KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG- 70
++KLP+ A KDL+ RMT+ EK+ Q+ G L P E+ S++L +G
Sbjct: 328 NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386
Query: 71 ----------RRTNSPPGTHFDSEVP----------GATSFPTVILTTASFNESLWKKIG 110
R H ++P T FPT + + S++ + ++
Sbjct: 387 VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446
Query: 111 QTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
+ + E+ A AGL + ++P +++ RD RWGRV+E GED Y+ A V G Q
Sbjct: 447 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ +S + AC KH+ AY L G D D ++E+ + +T++ PF+
Sbjct: 501 ---WNLWENNS------VLACAKHWVAYGLPQ-AGRDYAPVD--MSERTLFDTYLPPFKA 548
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
C++ G V + M ++N +NGIP A P LL +RG WNF+G++VSD ++++ +V
Sbjct: 549 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVAQGVAE 607
Query: 290 NDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
+D +DA +G+D+D D Y + ++ GKI+ D+D S+ + + LG F
Sbjct: 608 DD--KDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665
Query: 349 DGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
++ N I + ++ A + A + VLLKNDN LPL N++++A+VGP A
Sbjct: 666 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724
Query: 407 NATKAMIGNY--EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
+ ++G++ G T+ + G + YA GC D ++ S A+
Sbjct: 725 DNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAVKL 783
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A +D + V G + E + R L LPG Q ELI ++ K PV +V+M+ + I
Sbjct: 784 ASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPLSI 842
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYT-SMP 578
+ N + +IL + G G AIAD++FG YNP GRL I++ V I Y
Sbjct: 843 EWVDKN--VSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPIYYNYKKS 900
Query: 579 LRPVNNFPGRTYKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
RP + T + D P +YPFGYGLSYT F Y S+P+S + + +
Sbjct: 901 GRPGDMLHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SAPQSTQKEYTRQET------ 951
Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQV 695
+ + V N G DG E V +Y + +K++
Sbjct: 952 -----------------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVKEL 988
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++++F+ AG+S V F ++ +L D A N ++ G I+ G
Sbjct: 989 KAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034
>gi|298482587|ref|ZP_07000772.1| xylosidase [Bacteroides sp. D22]
gi|336405443|ref|ZP_08586122.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
gi|295085727|emb|CBK67250.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
XB1A]
gi|298271294|gb|EFI12870.1| xylosidase [Bacteroides sp. D22]
gi|335938024|gb|EGM99918.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
Length = 800
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 231/800 (28%), Positives = 361/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T +SP +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+ G + GLQ EG I A KH+A Y + +
Sbjct: 229 GEDPYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H ++ +AA + +V
Sbjct: 389 DEGKVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN N LPL+ N K +A++GP+A K + Y + G Y + +
Sbjct: 449 LLKNKNQMLPLSK-NFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G+ DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ K +G + T
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ V FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785
>gi|153807033|ref|ZP_01959701.1| hypothetical protein BACCAC_01310 [Bacteroides caccae ATCC 43185]
gi|423219984|ref|ZP_17206480.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
CL03T12C61]
gi|149130153|gb|EDM21363.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
caccae ATCC 43185]
gi|392624247|gb|EIY18340.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
CL03T12C61]
Length = 786
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 228/799 (28%), Positives = 360/799 (45%), Gaps = 139/799 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P EW W + +
Sbjct: 42 YEDPAAPIEARVADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAEWSKEIWKDGIGNI 100
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 101 DEQANGLGKFGSELSYPYANSVKNRHEIQRWFVEQTRLGIPVDFTNEGIRGLCHNRATMF 160
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T ++P +++ +DPRWGRV+E+
Sbjct: 161 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYAPILDIAQDPRWGRVVESY 214
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+ G + GLQ EG ++A KH+A Y + +
Sbjct: 215 GEDPYLAGELGKQMILGLQ-AEG-------------LAATPKHFAVYSIPVGGRDGGTRT 260
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 261 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 320
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 321 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 374
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GKI+ +D + + V LG FD P + + N H E++ +AA + IV
Sbjct: 375 SEGKISLHTLDQRVGEILRVKFMLGLFDNPYPGDDRHPETVVHNAAHQEVSMKAALESIV 434
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+N LPL+ ++ +A++GP+A K + Y + G Y + ++
Sbjct: 435 LLKNENQMLPLSK-SLNKIAVIGPNAEEVKELTCRYGPAHAPIKTVYQGIKEYLPNAEVS 493
Query: 439 YAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
YA GC + Q +MI A++ AK +D ++V G + E R
Sbjct: 494 YAKGCNIIDKYFPESELYNVPLDTQEQAMINEAVELAKVSDIAILVLGGNEKTVREEFSR 553
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 554 TSLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIVHAWFPGEFMGN 610
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
AIA V+FG YNPGGRL +T + + ++P+ + P +P ++ GR DG V+YPFGYG
Sbjct: 611 AIAKVLFGDYNPGGRLAVT-FPKSVGQVPF-AFPFKPGSDSKGRVR--VDG-VLYPFGYG 665
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSYT F+Y K V +G + T
Sbjct: 666 LSYTTFEYSALKISKPV----------------IGPQE----------------NMTLSC 693
Query: 665 EVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
V+N GK G EVV +Y + T+ K + G+ER+ + G+ + FT+ + L +
Sbjct: 694 IVKNTGKRAGDEVVQLYIRDDFSSVTTYDKMLRGFERIHLQPGEEQTISFTLTP-QDLGL 752
Query: 724 VDNAANSLLASGAHTILVG 742
D + G+ +I++G
Sbjct: 753 WDKNNQFTVEPGSFSIMIG 771
>gi|322437617|ref|YP_004219707.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165510|gb|ADW71213.1| glycoside hydrolase family 3 domain protein [Granulicella
tundricola MP5ACTX9]
Length = 892
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 167/451 (37%), Positives = 237/451 (52%), Gaps = 51/451 (11%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY D L +R DLV RMTL EKV Q + A + RL +P Y++WSE LHG++ G
Sbjct: 32 LPYMDPALTTQQRVDDLVSRMTLEEKVSQTINSAPAISRLNVPEYDYWSEGLHGIARSGY 91
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG------- 124
AT FP I A+++ L ++IG +S EARA +N
Sbjct: 92 ----------------ATMFPQAIGMAATWDAPLLQQIGDVISIEARAKFNEAIRHNIHS 135
Query: 125 -NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLT WSPNIN+ RDPRWGR ET GEDP++ GR + +V+G+Q D
Sbjct: 136 IYYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTGRLGVAFVKGIQG---------PDPN 186
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
+ A KH+A + + + R + T D+ +T++ F + E S+MC+Y
Sbjct: 187 YFRAIATPKHFAVH---SGPESTRHSANIEPTPHDLHDTYLPAFRATITEAHADSIMCAY 243
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQ----TIVESHKFLNDTKEDAVAR 299
N V G P CA LL T+R DW F G++ SDC +I T SH D KE A A
Sbjct: 244 NAVEGSPACASKLLLQDTLRRDWGFKGFVTSDCGAIDDFYATDYPSHHTSPD-KEAAAAA 302
Query: 300 VLKAGLDLDCGDYYTNFTMG-AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKN 356
+KAG D +CG Y T+G AV++G + EA+IDT+L+ L+ +LG FD + + +
Sbjct: 303 GIKAGTDSNCGQTY--LTLGSAVKKGLVTEAEIDTALKHLFTARFQLGLFDPAAKVAFNA 360
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
+ + + +P H LA +AA + IVLLKND LP +++T+A++GP A + GNY
Sbjct: 361 IPFSEVNSPAHQALALKAAEESIVLLKNDAHTLPFKP-SVRTIAVIGPSAATLNNLEGNY 419
Query: 417 EGTPCRYTSPMDGF---YAYSKVINYAPGCA 444
P P+DG + SKV+ YA G +
Sbjct: 420 NAIPLHPVLPLDGILTQFKSSKVL-YAQGSS 449
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 79/261 (30%), Positives = 124/261 (47%), Gaps = 42/261 (16%)
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
G DR D+ LP Q +++ VA K P+ +V+++ A+ +N+A N +IL YPG+
Sbjct: 650 GGDRTDIKLPAAQQQMLEAVAATGK-PLVVVLLNGSALAVNWA--NDHAAAILEAWYPGQ 706
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
GG AIA+ + GK NP GRLP+T+Y + +P + RTY++ ++
Sbjct: 707 AGGTAIAETLAGKNNPAGRLPVTFYSS------IDQIPAFDDYSMANRTYRYSKAKPLFE 760
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYGLSYT F Y +IKL T+ P
Sbjct: 761 FGYGLSYTTFTYS--------NIKLSTQ--------TLHAGDP----------------L 788
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
T + +V N G++ G EV +Y PP A + + + + RV +A G+ V FT++ ++
Sbjct: 789 TVEADVRNTGRVAGDEVAELYLTPPHTAVSPQRALSAFTRVHLAPGELRHVTFTLDP-RT 847
Query: 721 LKIVDNAANSLLASGAHTILV 741
L VD + G +T+ V
Sbjct: 848 LSQVDEKGARAVTPGNYTLSV 868
>gi|262405837|ref|ZP_06082387.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294647798|ref|ZP_06725350.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294806192|ref|ZP_06765039.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345510348|ref|ZP_08789916.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|262356712|gb|EEZ05802.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292636706|gb|EFF55172.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294446448|gb|EFG15068.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345454537|gb|EEO48843.2| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
Length = 800
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 231/800 (28%), Positives = 361/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDLTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T +SP +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+ G + GLQ EG I A KH+A Y + +
Sbjct: 229 GEDPYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRHAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H ++ +AA + +V
Sbjct: 389 NEGKVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN N LPL+ N K +A++GP+A K + Y + G Y + +
Sbjct: 449 LLKNKNQMLPLSK-NFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G+ DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ K +G + T
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ V FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785
>gi|336412865|ref|ZP_08593218.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
3_8_47FAA]
gi|335942911|gb|EGN04753.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
3_8_47FAA]
Length = 800
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 234/800 (29%), Positives = 365/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDLSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114
Query: 64 ----HGVSFIGRR-----TNSPPGTH-----------------FDSE-VPG-----ATSF 91
+G+ G NS H F +E + G AT F
Sbjct: 115 DEQANGLGKFGSEISYSYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T +SP +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + GLQ+ EG I A KH+A Y + +
Sbjct: 229 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ ++ + + V +G FD P + + N H ++ +AA + IV
Sbjct: 389 DEGKVSLHTLNQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+N LPL+ N K +A++GP+ K + Y + G Y + +
Sbjct: 449 LLKNENQMLPLSK-NFKKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G+ DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPDSDSKGKVR--VDG-VLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ K +G + T
Sbjct: 679 GLSYTIFGYS--------DLKISKP--------VIGPQE----------------NITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ V FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785
>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 766
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 215/695 (30%), Positives = 327/695 (47%), Gaps = 105/695 (15%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP + +++N L++ + + V+ E R+ G +SP ++VVRDPRWGR
Sbjct: 123 GATVFPVPLTIGSTWNPELFRSMCRAVAAETRS-----QGGAATYSPVLDVVRDPRWGRT 177
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
ET GEDP++V +A+ V+GLQ D + A KH+A Y N
Sbjct: 178 EETFGEDPHLVAEFAVAAVQGLQG--------DRLDAEDSLLATLKHFAGYGASEGGRNG 229
Query: 207 R-FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
H R ++ E +LPF V G SVM +YN ++G+P + LL+ +R
Sbjct: 230 APVHMGLR----ELHEIDLLPFRKAVEAG-AQSVMTAYNEIDGVPCTSSRYLLHDVLREA 284
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQG 324
W F G++++DC +I + H + E+A A+ L AG+D++ G + + A++QG
Sbjct: 285 WGFDGFVITDCGAIDMLKSGHN-TAASGEEAAAQALTAGVDMEMSGSMFRVYLRQALEQG 343
Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
I E D++T++ + + RLG FD + I +HIELA A +GIVLLKN
Sbjct: 344 HITEDDLNTAVGRVLAMKFRLGLFDRPYTDPERAEKVIGCEEHIELARRVAAEGIVLLKN 403
Query: 385 DNGALPLN--TGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGFYAY------S 434
+ LPLN TG I A++GP+ANA +G+Y P + + ++G + +
Sbjct: 404 EGNVLPLNPKTGKI---AVIGPNANAPYNQLGDYTSPQPPGQIITVLEGIRRHIGEDADT 460
Query: 435 KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA---- 479
+V+ YAPGC I + + A+ A AD V+ G +DL A
Sbjct: 461 RVL-YAPGC-RIQGDSREGLSHALACAAEADVIVMAIGGSSARDFGEGTIDLRTGASVVT 518
Query: 480 ----------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
EG DR L L G Q EL+ ++ K PV +V ++ + + + I
Sbjct: 519 GLAQSDMECGEGIDRSTLHLMGVQLELLQEIHKLGK-PVVVVYINGRPITEPWIDEH--I 575
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGR 588
+IL YPG+EGG AIAD++FG NP GRL +T E + I Y + R G+
Sbjct: 576 PAILEAWYPGQEGGSAIADILFGDVNPSGRLTLTIPKEVGQLPINYNAKRTR------GK 629
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
Y D YPFGYGLSYT F Y N +V P
Sbjct: 630 RYLETDLEPRYPFGYGLSYTDFHYG---------------------NLSVEPAVIPA--- 665
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQ 707
D +I V N G DG+EVV +Y + T ++ + + +VF+ AG+
Sbjct: 666 --------DGSAAVRIVVTNTGPRDGAEVVQLYVSDLAASVTRPEKALKAFSKVFLKAGE 717
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
S +V FT+ + L+++ +++ G I VG
Sbjct: 718 SREVTFTVGP-EQLELIGPDMKAVVEPGEFRIRVG 751
>gi|393781221|ref|ZP_10369422.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
CL02T12C01]
gi|392677556|gb|EIY70973.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
CL02T12C01]
Length = 946
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 231/807 (28%), Positives = 365/807 (45%), Gaps = 138/807 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D P R +DL+ +M L EK QM L YG R+ LP EW W + + +
Sbjct: 53 YEDPTAPIDARIEDLLSQMNLNEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGMGAI 111
Query: 67 S-----------------------------------FIGRRTNSPPGTHFDSEVPG---- 87
F+ P + + G
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESY 171
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L +IG EAR + G T ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRKLIHQIGLITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRG+Q H +++A KH+ AY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QVAATGKHFIAYSNNKGARE 272
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + PF+ + E + VM SYN +G P + L +RG
Sbjct: 273 GMARVDPQMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGQ 332
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQGIV 380
Q+G ++E I+ +R + V +G FD Q G ++ + E +A +A+R+ IV
Sbjct: 392 QEGGLSEEVINDRVRDILRVKFLVGLFDAPYQTDLKGADDEVEKEENEAVALQASRESIV 451
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVI 437
LLKN+N LPL+ ++K +A+ GP+A + +Y T+ +DG +
Sbjct: 452 LLKNENNTLPLDITSVKKIAVCGPNAAEKAYALTHYGPLAVEVTTVVDGLREKLNGKAEV 511
Query: 438 NYAPGC---------ADIV-----CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
Y GC ++I+ S I A+ A+ AD V+V G E K
Sbjct: 512 LYTKGCDLVDAHWPESEIIDYPLSKDEQSEIDKAVAQAQEADVAVVVLGGGQRTCGENKS 571
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R L LPG Q +L+ V K PV LV+++ + +N+A + + +IL YPG +GG
Sbjct: 572 RSSLDLPGRQLDLLKAVQATGK-PVILVLINGRPLSVNWA--DKFVPAILEAWYPGSKGG 628
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV------ 597
AIADV+FG YNPGG+L +T + + +IP+ + P +P + G G +
Sbjct: 629 TAIADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPHKPSSQIDGGKNPGTKGDMSRVNGA 686
Query: 598 VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFGYGLSYT F+Y + SPK + P V V+CK
Sbjct: 687 LYPFGYGLSYTTFEYSDINISPKVI---------------------TPNQKV---QVRCK 722
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
V N GK G EVV +Y + T+ K + G+ER+ + G++ +V FT+
Sbjct: 723 ---------VTNTGKHAGDEVVQLYVRDLISSVTTYEKNLEGFERIHLQPGETKEVSFTL 773
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
+ K+L++++ + ++ G +I++G
Sbjct: 774 DR-KALELLNAKNDWVVEPGDFSIMLG 799
>gi|160884749|ref|ZP_02065752.1| hypothetical protein BACOVA_02738 [Bacteroides ovatus ATCC 8483]
gi|156109784|gb|EDO11529.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 800
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 233/800 (29%), Positives = 361/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTTGWSTEIWKDGIGNI 114
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L +I + + EA+A+ G T +SP +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIGEIAKVTADEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + GLQ+ EG I A KH+A Y + +
Sbjct: 229 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
YIVSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YIVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H ++ +AA + IV
Sbjct: 389 NEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+N LPL+ N +A++GP+ K + Y + G Y + +
Sbjct: 449 LLKNENQMLPLSK-NFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N I +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G+ DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ K +G + T
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ V FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785
>gi|383115617|ref|ZP_09936373.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
gi|313694979|gb|EFS31814.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
Length = 946
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 234/806 (29%), Positives = 369/806 (45%), Gaps = 136/806 (16%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y D P R +DL+++MTL EK QM L YG R+ LP EW ++ G+ I
Sbjct: 53 YEDPSAPVDARIEDLLKQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKNQLWKDGIGAI 111
Query: 70 GRRTNS------PPG-------------------------------THFDSE-VPG---- 87
N PP T F +E + G
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRELIRQVGVITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRG+Q + +++A KH+ AY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------------QDYQVAATGKHFIAYSNNKGGRE 272
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + PF+ + E + VM SYN +G P + L +RG+
Sbjct: 273 GMSRVDPQMSPREVEMVHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHNTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
++G ++E I+ +R + V +G FD Q G + + ++ E+A +A+R+ IV
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDHPYQTDLKGADEEVEKAENEEVALQASRESIV 451
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS--KV-I 437
LLKND LPL+ IK +A+ GP+A+ +G+Y TS + G + KV +
Sbjct: 452 LLKNDQDVLPLDISGIKKIAVCGPNADECSYALGHYGPLAVEVTSVLKGIQEKTDGKVEV 511
Query: 438 NYAPGCA--------------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
Y+ GC + + I A+ AK AD V+V G E K
Sbjct: 512 LYSKGCELVDANWPESELIDFPLTEEEQKEIDRAVSQAKEADVAVVVLGGGQRTCGENKS 571
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +GG
Sbjct: 572 RSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGAKGG 628
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV------ 597
+A+ADV+FG YNPGG+L +T + +IP+ + P +P + G DG +
Sbjct: 629 KAVADVLFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGMDGNMSRANGA 686
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
+Y FG+GLSYT F+Y D+ T P V CK
Sbjct: 687 LYAFGHGLSYTSFEYS-------------------DLKITPAVITPNQKTY----VTCK- 722
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
V N GK G EVV +Y + T+ K + G+ER+ + G++ +V F ++
Sbjct: 723 --------VTNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERIHLKPGETKEVFFPID 774
Query: 717 ACKSLKIVDNAANSLLASGAHTILVG 742
K+L++++ + ++ G T++VG
Sbjct: 775 R-KALELLNADMHWVVEPGDFTLMVG 799
>gi|403744211|ref|ZP_10953568.1| glycoside hydrolase family 3 domain-containing protein
[Alicyclobacillus hesperidum URH17-3-68]
gi|403122228|gb|EJY56463.1| glycoside hydrolase family 3 domain-containing protein
[Alicyclobacillus hesperidum URH17-3-68]
Length = 789
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 232/784 (29%), Positives = 352/784 (44%), Gaps = 141/784 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDL-AYGV-PRLGLPLYEWWSEALHGVSFIGR 71
Y + LP ER + L+ MT+ EK Q+ + AY V L + S HG+ I R
Sbjct: 5 YQNPNLPIEERVELLLSEMTIEEKAAQLTSVWAYEVLDDLVFSDAKAASLFEHGIGQITR 64
Query: 72 ---------------------------RTNSPPGTHFDS----EVPGATSFPTVILTTAS 100
R P H +S GAT FP I ++
Sbjct: 65 IGGATNLDPADVARLSNRIQQHLLTQTRLAIPALVHEESCSGYMAKGATCFPQSIGIAST 124
Query: 101 FNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRY 160
+++ + +KIG+ + T+ RA+ +P ++V RDPRWGRV ET GEDPY+V +
Sbjct: 125 WDQDIARKIGEVIRTQMRAV-----GAQQALAPLLDVTRDPRWGRVEETFGEDPYLVAQM 179
Query: 161 AINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NWEGNDRFHFDSRVTE 216
I YV GLQ D + A KH+ Y NW + + E
Sbjct: 180 GIGYVGGLQ----------GDDLRDGVIATGKHFVGYGASEGGMNWA-------PAHIPE 222
Query: 217 QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDC 276
++++E ++ PFE V E + S+M Y+ ++G+P + LL +T+R W F G +VSD
Sbjct: 223 RELREVYLYPFEAVVREAKLQSIMPGYHELDGVPCHHNRDLLVETLRNRWGFEGIVVSDY 282
Query: 277 DSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAEADIDTS 334
++ + E H+ D E AV V +AG+D++ D Y + AV QG++ +D
Sbjct: 283 FAVNQLFEYHQVARDKVEAAVFAV-EAGVDVELPSRDVYGQPLVEAVNQGRLRIEQVDAL 341
Query: 335 LRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTG 394
+R + RLG F+ + N N + +LA EAA + IVLLKN+ LPL
Sbjct: 342 VRRVLTAKFRLGLFERPFVDEGRAPNLFDNHEQRQLAREAAAKSIVLLKNEGNLLPLE-- 399
Query: 395 NIKTLALVGPHANATKAMIGNYEGTPCR------------YTSPM-------DGFYAYSK 435
N +A++GP+A++ + M+G+Y PC + SPM D F
Sbjct: 400 NRGKIAVIGPNADSIRNMVGDY-AYPCHIESLLEQSEDNVFHSPMPKGMKSVDDFIEMKT 458
Query: 436 VIN-------------YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----LDLSV 477
++ YA GC D++ + S I A A+ AD ++V G D
Sbjct: 459 IVQAIRDKVGDGAEVLYAKGC-DVLGDDTSGIAEAEHVARQADVAIVVVGDRAGLTDGCT 517
Query: 478 EAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGY 537
E +DR L L G Q EL+ +V A P +V++ + I + + + +IL
Sbjct: 518 TGESRDRATLTLLGAQQELVERVV-ATGTPTVVVLVGGRPLSITWIAEH--VPAILEAWL 574
Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
PGEEG AIADV+FG NP G+LPIT V I Y P +++ G + P
Sbjct: 575 PGEEGAPAIADVVFGDMNPSGKLPITIPRSVGQVPIYYGHKPSGGRSHWKGVYVDESNKP 634
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+ Y FG+GLSYT F Y+ ++ L K + I+ TV DV C
Sbjct: 635 L-YAFGHGLSYTTFAYR--------ELALSKSEI--GIHDTV-------------DVSCV 670
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTM 715
+EN G G EVV +Y T ++++ G+ RV + A ++A V F +
Sbjct: 671 ---------IENTGDRVGEEVVQLYVYDRAADVTRPVQELRGFARVHLEAKEAALVTFRL 721
Query: 716 NACK 719
+A +
Sbjct: 722 SAHQ 725
>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
Length = 882
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 167/447 (37%), Positives = 236/447 (52%), Gaps = 51/447 (11%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+ A LV +MT EK+ Q + A +PRLG+P Y+WWSE LHG++ G
Sbjct: 32 QHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY----------- 80
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N L + +G STEARA +NL AGLT WSP
Sbjct: 81 -----ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSP 135
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDPY+ + A++++RGLQ D+ P I A KH
Sbjct: 136 NINIFRDPRWGRGMETYGEDPYLTSQLAVSFIRGLQG--------DTPDHPRTI-ATPKH 186
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
+A + + R FD V+ D++ T+ F + +G SVMC+YN ++G P CA
Sbjct: 187 FAVH---SGPEQGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACA 243
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +R DW F+G++VSDCD+I+ + H F D A A LK+G DL+CG+ Y
Sbjct: 244 SDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGNDLNCGNTY 302
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
+ A+ +G I E+ +D +L L+ RLG Y +G +I P H LA
Sbjct: 303 RDLNQ-AIARGDIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAHRALA 361
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY 431
+AA Q +VLLKN LPL TLA++GP A++ A+ NY+GT +P+ G
Sbjct: 362 LQAAAQSLVLLKNSGNTLPLPPET--TLAVLGPDADSLTALEANYQGTSSTPVTPLTGLR 419
Query: 432 A--------YSKVINYAPGCADIVCQN 450
Y++ + APG + + +
Sbjct: 420 TRFGTAKVHYAQGASLAPGVPNTIPET 446
Score = 142 bits (358), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 136/295 (46%), Gaps = 52/295 (17%)
Query: 460 AAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVT 509
A +ADA V GL VE E G DR + LP Q L+ V K P+
Sbjct: 607 AVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK-PLI 665
Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
+V+MS AV +N+A+++ +IL YPG+ GG AIA + G NPGGRLP+T+Y +
Sbjct: 666 VVLMSGSAVALNWAQHH--ADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYRSTQ 723
Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
PY S + GRTY++F G +YPFGYGLSYTQF Y+
Sbjct: 724 DLPPYISY------DMTGRTYRYFKGQPLYPFGYGLSYTQFAYEAP-------------- 763
Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
Q G T V N G G EVV +Y +PP
Sbjct: 764 QLSTATLKAGNT------------------LTVTTHVRNTGTRAGDEVVQLYLEPPYSPQ 805
Query: 690 THIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
++ ++G++RV + G+S + FT++A + L V + +G + + VG G
Sbjct: 806 APLRSLVGFKRVTLRPGESRLLTFTLDA-RQLSSVQQTGQRSVEAGHYHLFVGGG 859
>gi|364284956|gb|AEW47953.1| GHF3 protein [uncultured bacterium D1_14]
gi|364284964|gb|AEW47958.1| GHF3 protein [uncultured bacterium E2_1]
Length = 752
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 221/734 (30%), Positives = 351/734 (47%), Gaps = 98/734 (13%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYG-----VPRL------GLPLYEWWSE---ALHGVSF 68
+R + L+ +MTL EK+ QM +++ V RL G L E E AL V+
Sbjct: 35 KRVESLLTKMTLEEKIGQMNQVSFSGNIEEVSRLIKNGEVGSILNEVDPERVNALQRVAI 94
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGL 128
R P D T FP + ASFN + +K + + EA ++ G+
Sbjct: 95 EESRLGIPILIGRDVIHGFKTIFPIPLGQAASFNPQIVEKGARVSAVEASSV------GV 148
Query: 129 TF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKI 187
+ ++P I++ RDPRWGR+ E+ GEDPY+ V+G Q DS + P I
Sbjct: 149 RWTFTPMIDISRDPRWGRIAESCGEDPYLTSVMGAAMVKGFQG--------DSLNNPNSI 200
Query: 188 SACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVN 247
+AC KH+ Y EG R + + +TE+ ++ ++ PFE V +G V++ M S+N +
Sbjct: 201 AACAKHFVGYGAA--EGG-RDYNTTCITERQLRNVYLPPFEAAVKQG-VATFMTSFNAND 256
Query: 248 GIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL 307
GIP+ +P +L + +R +W F G++VSD SI +V +H F D K DA + + AG+D+
Sbjct: 257 GIPSSGNPFILKKVLRDEWGFDGFVVSDWASIIEMV-AHGFCTDDK-DAAMKAVNAGVDM 314
Query: 308 DCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQ 366
+ Y Y N + K++E ID ++R + V RLG FD +P + I + +
Sbjct: 315 EMVSYTYMNHLKDLKNENKVSEETIDNAVRNILRVKFRLGLFD-NPYVDEKAPSPIYSKE 373
Query: 367 HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN--YEGTPCRYT 424
++ +A EAA Q +LLKND LP+N ++KT+A+VGP A+A +G ++G
Sbjct: 374 NLAIAKEAAIQSAILLKNDKQILPINE-SVKTIAVVGPMADAPYEQMGTWAFDGEKSMTQ 432
Query: 425 SPMDG---FYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+P+ FY + PG A +N S I A+ AA AD + G + + E
Sbjct: 433 TPLMALRQFYGDKVNFIFEPGLAYTRDKNTSGISKAVSAANRADLVLAFVGEEAILSGEA 492
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
+L L G Q++LIN +A K VT+VI + K K++L+ +PG
Sbjct: 493 HCLANLNLQGAQSDLINALAKTGKPIVTVVI---AGRPLTIGKEAELSKAVLYSFHPGTM 549
Query: 542 GGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPVNNFP------------- 586
GG AIAD++FGK P G+ P+T+ E + I Y+ RP N
Sbjct: 550 GGPAIADLLFGKAVPSGKTPVTFPKEVGQIPIYYSHYNTGRPANRNEILLDNIAVGAGQT 609
Query: 587 --GRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
G T + D +YPFG+GLSYT F+Y ++KL ++
Sbjct: 610 SLGNTSFYLDAGFDPLYPFGFGLSYTTFEYS--------NLKLSSNE------------- 648
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERV 701
+ KD + T +++N G +G+EV +Y + G +K++ + R+
Sbjct: 649 ----------LSAKD-ELTVTFDLKNTGNYEGAEVAQLYVRDMVGSVVRPVKELKRFNRI 697
Query: 702 FIAAGQSAKVGFTM 715
+ G++ V T
Sbjct: 698 TLKPGETRNVSMTF 711
>gi|423293673|ref|ZP_17271800.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
CL03T12C18]
gi|392677631|gb|EIY71047.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
CL03T12C18]
Length = 800
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 233/800 (29%), Positives = 361/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTTGWSTEIWKDGIGNI 114
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L +I + + EA+A+ G T +SP +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIGEIAKVTADEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + GLQ+ EG I A KH+A Y + +
Sbjct: 229 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
YIVSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YIVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H ++ +AA + IV
Sbjct: 389 NEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+N LPL+ N +A++GP+ K + Y + G Y + +
Sbjct: 449 LLKNENQMLPLSK-NFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YAKGC-DIIDKYFPESELNNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N I +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G+ DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ K +G + T
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ V FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNRFTVEPGSFSVMVG 785
>gi|288925400|ref|ZP_06419334.1| beta-glucosidase [Prevotella buccae D17]
gi|288337871|gb|EFC76223.1| beta-glucosidase [Prevotella buccae D17]
Length = 858
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 168/478 (35%), Positives = 246/478 (51%), Gaps = 42/478 (8%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S+ PYC+ L ERA+DL+ R+TL EK + M D + +PRLG+ + WWSEAL
Sbjct: 14 SLSATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEAL 73
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
HG + +G G T FP + ASFN+ L +++ S E RA YN
Sbjct: 74 HGAANMG----------------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNR 117
Query: 123 -LGNAG-------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
+ N G L+ W+PN+N+ RDPRWGR ET GEDPY+ VRGLQ E
Sbjct: 118 RMLNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETA 177
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
+Y K+ AC KHYA + + + D V+ +D+ ET++ F+ V E
Sbjct: 178 KYR--------KLWACAKHYAVHSGPEYTRHTANVAD--VSPRDLWETYLPAFKTLVTEA 227
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
V VMC+Y R++ P C++ +LL Q +R +W F+ +VSDC ++ I +HK +D
Sbjct: 228 KVREVMCAYQRLDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVH 287
Query: 295 DAVARVLKAGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP- 352
A AG D++CG Y T+ AV++G I EA++D + L LG D
Sbjct: 288 AAAK-AAVAGTDVECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMDDPKL 346
Query: 353 -QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
++ + + + + H +LA + ARQ +VLL+N G LPL G + +A++GP+A+
Sbjct: 347 VEWSKIPASVMDSKAHRQLALDMARQSLVLLQNKGGVLPLKAGG-EPIAVIGPNADDGPM 405
Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGC--ADIVCQNNSMIPAAIDAAKNADAT 467
M GNY GTP R + ++G K + Y GC D N+ + AID K T
Sbjct: 406 MWGNYNGTPNRTVTILNGIKVRHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRKGLRGT 463
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 121/286 (42%), Gaps = 57/286 (19%)
Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
A I + V V G+ ++E E G DR ++ LP Q + + + +A K
Sbjct: 591 AIIRKLQGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK 650
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
T+V ++ I +IL Y G+EGG A++DV+FG NP G+LP+T+Y
Sbjct: 651 ---TVVFVNCSGSAIALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFY 707
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ Y +R GRTY++F P ++ FGYGLSYT F++ A +
Sbjct: 708 KRTDQLPDYEDYSMR------GRTYRYFSDP-LFAFGYGLSYTTFRFGRARA-------- 752
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+ + + + + N G G EVV VY +
Sbjct: 753 ----------------------------EAAEGGYRLSVPLTNTGTRPGEEVVQVYIRRV 784
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
+K + + RV + AG+S V ++ KS + D + N++
Sbjct: 785 ADTNGPLKSLRAFRRVALKAGESTTVEIPLSR-KSFECFDESTNTM 829
>gi|335433420|ref|ZP_08558246.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
SARL4B]
gi|335434171|ref|ZP_08558974.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
SARL4B]
gi|334898028|gb|EGM36149.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
SARL4B]
gi|334898759|gb|EGM36857.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
SARL4B]
Length = 783
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 206/692 (29%), Positives = 331/692 (47%), Gaps = 101/692 (14%)
Query: 86 PGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGR 145
PG T FP I ++++ +L + I ++ T A+ + SP ++V RD RWGR
Sbjct: 121 PGGTIFPQSIGLASTWSPALVESITDSIRTRLDAV-----GTVQALSPVLDVSRDMRWGR 175
Query: 146 VLETPGEDPYVVGRYAINYVRGLQ-DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
V ET GEDP +VG YV GLQ D EG I A KH+AA+ + EG
Sbjct: 176 VEETYGEDPQLVGALGAAYVAGLQSDGEG-------------IDATLKHFAAHG--SGEG 220
Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
+ ++ E++++E + PFE+ + E D +VM +Y+ ++G+P + LL +RG
Sbjct: 221 G-KNRSSVQIGERELREVHLYPFEVAIQEADARAVMNAYHDIDGVPCASSEWLLTDVLRG 279
Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQ 322
+W F G++V+D S+ + E H + DT+ +A L+AGLD++ D Y AV+
Sbjct: 280 EWGFDGHVVADYFSVDLLKEEHG-IADTQREAGVAALEAGLDVELPATDCYDENLRKAVE 338
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
G+++EA +DT++R + + G FD + + ELAA AAR+ I LL
Sbjct: 339 DGELSEATVDTAVRRVLRAKIESGVFDDPYVDPDAATEPFDTDEQTELAARAARESITLL 398
Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNY---------EGTPCRYTSPMDGFYAY 433
+ND G LPL G + ++ALVGP A+ +A +G+Y E +P D A
Sbjct: 399 END-GLLPLAGGELDSVALVGPQADDGRAQVGDYTHAARFDTEEAGDFESVTPRDALEAR 457
Query: 434 SKV----INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL---------------- 473
+ + Y G A + + AA + +AD V G
Sbjct: 458 GETAGFDVEYVEG-ATMTGPSTDGFDAAEETVADADLAVACVGARSDIDFADRENPAELP 516
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI-NFAKNNPKIKSI 532
D+ E D DL LPG Q L++++A+ P+ +V +S I A++ P ++
Sbjct: 517 DVPTSGENCDVTDLELPGVQEALVDRLAE-TDTPLIVVQVSGKPHAIPEIAESVP---AL 572
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYK 591
L PG+EGG AIADV+FG+YNP G LP++ ++ + Y+ P N +
Sbjct: 573 LHAWLPGQEGGTAIADVLFGEYNPSGHLPVSVPKSVGQQPVYYSRKP-----NSANEEHV 627
Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
+ DG +Y FGYGLSYT F+Y D+++D + +GT
Sbjct: 628 YMDGEPLYSFGYGLSYTDFEYG--------DLEVDAETVA-----PMGT----------- 663
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAK 710
T + V N G + G +VV +Y + +++++G+ERV + G++ +
Sbjct: 664 --------LTASVTVTNAGDVAGDDVVQLYQHAENPSQARPVQELLGFERVHLEPGETKR 715
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V F+ +A + L D N + G + + VG
Sbjct: 716 VTFSFDATQ-LAYHDLDMNLAVEEGPYELRVG 746
>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 846
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 164/438 (37%), Positives = 236/438 (53%), Gaps = 46/438 (10%)
Query: 7 VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV 66
V ++ Y + P ER +DL+ ++T+ EK+ + + G+ R+G+ Y +EALHG+
Sbjct: 16 VSMAQDLYKNMNAPIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGNEALHGI 75
Query: 67 SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
G+ T FP I + +N L I +S EARA +N
Sbjct: 76 IRPGK----------------FTVFPQAIGLASMWNPELHHIIASVISDEARARWNELER 119
Query: 127 G----------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY 176
G LTFWSP +N+ RDPRWGR ET GEDPY+ G +V+GLQ
Sbjct: 120 GKKQKDQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQG------ 173
Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
R LK + KH+AA N E ++RF+ D+ +TE DM+E ++ FE C+ EG
Sbjct: 174 ---DHPRYLKSVSTPKHFAA----NNEEHNRFYCDAAITETDMREYYLPAFEKCIREGKA 226
Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDA 296
S+M +YN +NG+P A+ LLN+ ++ DW F+GYIVSDC + ++ H+++ T E A
Sbjct: 227 ESIMTAYNAINGVPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVK-TPEAA 285
Query: 297 VARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-- 353
+KAGLDL+CGDY + + A +Q ++ A+ID++ + MRLG FD +
Sbjct: 286 AMIAIKAGLDLECGDYVFGAPLLNAYKQYMVSTAEIDSAAYHVLRARMRLGMFDDPEKNP 345
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
Y +L + +H ELA EAARQ IVLLKN LPLN IK++A+VG NA
Sbjct: 346 YNHLSPEIVGCEKHKELALEAARQSIVLLKNQKNTLPLNAKKIKSIAVVG--INAANCEF 403
Query: 414 GNYEGTPCRY-TSPMDGF 430
G+Y GTP S +DG
Sbjct: 404 GDYSGTPVNAPVSVLDGI 421
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 144/286 (50%), Gaps = 51/286 (17%)
Query: 462 KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDI 520
+ +D + V G++ S+E EG+DR + LP Q I + A P T+V++ AG+ + +
Sbjct: 595 RESDVVIAVMGINQSIEREGQDRSSIELPKDQQIFIREAYKA--NPNTIVVLVAGSSMAV 652
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP-L 579
+ N I +I+ YPGE+GG AIA+V+FG YNP GRLP+T+Y + +P
Sbjct: 653 GWMDQN--IPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS------IEDLPAF 704
Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
N RTY +F+G +Y FGYGLSYT+F Y+ ++ + +D Q +N++
Sbjct: 705 NDYNVKNNRTYMYFEGKPLYAFGYGLSYTKFDYR--------NLNIKQDSQNITLNFS-- 754
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGY 698
V+N GK +G EV VY + P + T +KQ+ G+
Sbjct: 755 --------------------------VKNSGKYNGDEVAQVYVQFPDLGIKTPLKQLKGF 788
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGE 743
+RV I G + ++ + + L++ D+ SG + +VG+
Sbjct: 789 KRVHIKKGATEQISIEI-PKEELRLWDDQKKQFYTPSGTYNFMVGK 833
>gi|427411073|ref|ZP_18901275.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
51230]
gi|425710258|gb|EKU73280.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
51230]
Length = 791
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 215/720 (29%), Positives = 345/720 (47%), Gaps = 107/720 (14%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P+ + E LHG + +G ATSFP I +S++ ++ +++
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------------ATSFPQSIAMASSWDPAMLRQV 179
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
Q ++ E RA SP +++ RDPRWGR+ ET GEDPY+VG + V GLQ
Sbjct: 180 NQVIAREIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQ 234
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
V R +P + A KH + N + V+E++++E F PFE
Sbjct: 235 GV-----GRSRTLQPNHVFATLKHLTGHGQPESGTN---IGPAPVSERELRENFFPPFEQ 286
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V + +VM SYN ++G+P+ A+ LL+ +R +W F G +VSD ++ ++ H
Sbjct: 287 VVKRTGIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIHHIA 346
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGA-VQQGKIAEADIDTSLRFLYIVLMRLGYF 348
+ E+A R L AG+D D + + T+G V++GK++EA +D ++R + + R G F
Sbjct: 347 ANL-EEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLF 405
Query: 349 DGSPQYKNLGKNNICNPQHIE-LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
+ +P I N + LA AA++ I LLKND G LPL T+A++GP +
Sbjct: 406 E-NPYADANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPEG--TIAVIGP--S 459
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKV---INYAPGC---------ADIV-----CQN 450
A A +G Y G P S ++G A I +A G AD V +N
Sbjct: 460 AAVARLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTKSDPAEN 519
Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAA 504
+I A++AA+N D ++ G EG DR L L G Q EL + +
Sbjct: 520 RKLIAQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDALKALG 579
Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
K P+T+V+++ + K + + +IL Y GE+GG A+AD++FG NPGG+LP+T
Sbjct: 580 K-PITVVLINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVTV 636
Query: 565 -YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
A + + Y P R Y F +YPFG+GLSYT F S+P+
Sbjct: 637 PRSAGQLPLFYNMKP------SARRGYLFDTTDPLYPFGFGLSYTSFSL---SAPRLSAT 687
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
+ +GT K + ++V N G +G EVV +Y +
Sbjct: 688 R-------------IGTGG----------------KTSVSVDVRNTGAREGDEVVQLYIR 718
Query: 684 PPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ T +K++ G++RV + G+S + FT+ ++L++ ++ ++ G I+ G
Sbjct: 719 DKVSSVTRPVKELKGFQRVTLKPGESRTITFTVGP-EALQMWNDQMRRVVEPGDFEIMTG 777
>gi|254295141|ref|YP_003061164.1| glycoside hydrolase [Hirschia baltica ATCC 49814]
gi|254043672|gb|ACT60467.1| glycoside hydrolase family 3 domain protein [Hirschia baltica ATCC
49814]
Length = 897
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 178/499 (35%), Positives = 254/499 (50%), Gaps = 65/499 (13%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHG 65
+ K S+F + D L ERA DLV MTL EK QM D A +PRLGL Y WW+EALHG
Sbjct: 36 EAKSSEFRFMDPSLSPKERALDLVSHMTLEEKAAQMYDKAAAIPRLGLHEYNWWNEALHG 95
Query: 66 VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
V+ G AT FP I A+++E L ++ +S E RA ++
Sbjct: 96 VARAGH----------------ATVFPQAIGMAATWDEDLMLEVANVISDEGRAKHHFYA 139
Query: 126 --------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYH 177
GLTFWSPNIN+ RDPRWGR ET GEDPY+ GR A+N++ GLQ
Sbjct: 140 NEDVYAMYGGLTFWSPNINIFRDPRWGRGQETYGEDPYLTGRMAVNFINGLQ-------- 191
Query: 178 RDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRV-TEQDMQETFILPFEMCVNEGDV 236
D + K A KHYA + H D+ + T+ D+ ET++ F+ +E +V
Sbjct: 192 -GDDDKYFKSVATVKHYAVHS----GPEPSRHRDNYIATDADLYETYLPAFKTAFDETEV 246
Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-------------V 283
+SVMC+YN V G P C +L+ +R + F GY+VSDC +I
Sbjct: 247 ASVMCAYNAVWGDPACGSERLMKDLLREELGFDGYVVSDCGAIGDFYYDEEKKAEGTAPY 306
Query: 284 ESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMG---AVQQGKIAEADIDTSLRFLYI 340
+H + DT+ A A + G DL+CGD N AV++G I E ID S+ LY
Sbjct: 307 AAHDHV-DTRAQAAALSVNMGTDLNCGDGEGNKMDALPQAVKEGLITEETIDQSVVRLYS 365
Query: 341 VLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKT 398
L +LG +D + N+ + + +P H+E + EAAR +VLLKND G LPL
Sbjct: 366 ALFKLGMYDDPSLVPWSNISIDTVASPSHLEKSEEAARASLVLLKND-GILPLKPDT--K 422
Query: 399 LALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGC--ADIVCQNNSMI 454
+A++GP+A+ ++ NY G P + + G A ++ ++Y+ G A + N +
Sbjct: 423 VAVIGPNADNWWTLVANYYGQPTAPVTALKGIKAKIGAENVSYSVGSTIAGDIYSNYKAV 482
Query: 455 PAAIDAAKNADATVIVAGL 473
P+ KN +A +V G+
Sbjct: 483 PSNTLFHKN-EAGELVPGV 500
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 131/296 (44%), Gaps = 54/296 (18%)
Query: 468 VIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA 517
+ G+D ++E E G DR + LP Q +L+ ++ K PV LV S A
Sbjct: 634 LFFGGIDANLEGEEMGVELDGFLGGDRTHINLPAPQEKLLKELHATGK-PVVLVNFSGSA 692
Query: 518 VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSM 577
+ +N+ N + +I+ YPGE+ G AIAD+++G+++P GRLP+T+Y++ M
Sbjct: 693 MALNWEDEN--LPAIVQAFYPGEKSGTAIADLLWGEFSPSGRLPVTFYKS------LEGM 744
Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
P + RTYK+++G +YPFG+GLSYT F+Y D+KL + Y
Sbjct: 745 PAFDDYSMENRTYKYYEGEQLYPFGHGLSYTSFEYS--------DLKL-------ETAYA 789
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQV-- 695
N ++V N G E+V Y +A +V
Sbjct: 790 ANEN------------------LQVSVKVTNSGDKASREIVQAYVTRDTLANVSTPRVEL 831
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
++ + +A +S V ++ +N + G+ T+ +G G G P
Sbjct: 832 AAFDAIELAPKESQTVTLSIKPDAIGYFNENGKLTFPEDGSFTLSIGGGQPGFDAP 887
>gi|217968103|ref|YP_002353609.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217337202|gb|ACK42995.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 756
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 208/674 (30%), Positives = 333/674 (49%), Gaps = 97/674 (14%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
G+T FP I +++N L ++ + E R+ SP IN+ RDPR GR
Sbjct: 147 GSTIFPQAIGMASTWNPELIYQVATAIGKETRS-----RGIHQVLSPTINIARDPRCGRT 201
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
ET GEDPY+ R A+ Y++G+Q+ +GV A KH+AA + + G D
Sbjct: 202 EETYGEDPYLASRMAVAYIKGVQE-QGV-------------IATPKHFAANFVGDG-GRD 246
Query: 207 RF--HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
+ HF R+ ++E + F+ + E S+M +YN ++GIP ++ LL +R
Sbjct: 247 SYPIHFSERL----LREVYFPAFKASIKEAGALSLMAAYNSLDGIPCSSNKWLLTDVLRK 302
Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL-----DCGDYYTNFTMG 319
+W F GY+VSD S+ ++ HK + ++K +A L+AGLD+ DC + N G
Sbjct: 303 EWGFKGYVVSDYFSVLHLMTKHK-VAESKAEAARLALEAGLDMELPDSDCFEEMINLVKG 361
Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDG---SPQYKNLGKNNICNPQHIELAAEAAR 376
GK++E I+ ++R + V G FD P Y + N C +H ELA AR
Sbjct: 362 ----GKLSEETINEAVRRILGVKFWAGLFDNPFVDPDYAE--RVNDC-AEHRELALRVAR 414
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF---YAY 433
+ IVLLKN+ G LPL+ +I ++A++GP NA +G Y G + +P++G
Sbjct: 415 ESIVLLKNE-GILPLSK-DIGSIAVIGP--NAAVPRLGGYSGYGVKIVTPLEGIKNKMEN 470
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-SVEAEGKDRVDLLLPGF 492
I +A GC + + S AI A+ +D ++ G + E E +DR +L LPG
Sbjct: 471 KAKIYFAEGCG-LNDTSKSGFDEAIKIAQKSDVAILFVGNSVPETEGEQRDRHNLNLPGV 529
Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
Q ELI ++ + PV +V+++ A I K+++++ YPGEEGG AIADV+FG
Sbjct: 530 QEELIKEICNT-NTPVIVVLINGSA--ITMMNWIDKVQAVIEAWYPGEEGGNAIADVLFG 586
Query: 553 KYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD---GPVVYPFGYGLSYTQ 609
YNPGG+LPIT+ + + + +PL + GR + D ++PFGYGLSYT+
Sbjct: 587 DYNPGGKLPITFPKYS------SQLPLYYNHKPSGRVDDYVDLRSPQYLFPFGYGLSYTE 640
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
F+Y N + + P D + T EVEN+
Sbjct: 641 FRYS---------------------NLRITPEEIPM-----------DGEITITFEVENI 668
Query: 670 GKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAA 728
GK G EVV +Y + +K++ ++R+ +A G+ V F ++ + L+ ++
Sbjct: 669 GKYKGDEVVQLYLHDEFASVVRPVKELKRFKRITLAVGEKKTVSFKLDR-RDLEFLNIDM 727
Query: 729 NSLLASGAHTILVG 742
++ G + +G
Sbjct: 728 EPIVEPGRFEVFIG 741
>gi|344995394|ref|YP_004797737.1| glycoside hydrolase family protein [Caldicellulosiruptor
lactoaceticus 6A]
gi|343963613|gb|AEM72760.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
lactoaceticus 6A]
Length = 770
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 209/709 (29%), Positives = 338/709 (47%), Gaps = 111/709 (15%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP I +F+ + +++ + + + +A +P I+V RD RWGRV
Sbjct: 102 GATVFPQSIGVACTFDNEIVEELAKVIRIQMKA-----TGSHQALAPLIDVARDARWGRV 156
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD----NW 202
ET GEDPY+V A++YV+G+Q D I A KH+ Y + NW
Sbjct: 157 EETFGEDPYLVANMAVSYVKGIQ----------GDDIKDGIVATGKHFVGYAMSEGGMNW 206
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
+ E++++E ++ PFE+ V + S+M +Y+ ++GIP A+ KLL
Sbjct: 207 A-------PVHIPERELREVYLYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTDIA 259
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG--DYYTNFTMGA 320
RG+W F G VSD ++ I++ HK + T +A L AGLD++ + +T + A
Sbjct: 260 RGEWGFDGIYVSDYSGVRNILDYHKAVK-TYAEAAYISLWAGLDIELPKIECFTEEFIKA 318
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC-NPQHIELAAEAARQGI 379
+++GK A +D +++ + + RLG FD +P K G + N + EL+ + A++ +
Sbjct: 319 LKEGKFDMAVVDAAVKRVLEMKFRLGLFD-NPYIKTEGILELFDNKEQRELSRKVAQESM 377
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYS----- 434
VLLKNDN LPL+ ++K +A++GP+A++ + ++G+Y P + + ++ F+
Sbjct: 378 VLLKNDN-FLPLSK-DVKKIAVIGPNADSVRNLLGDY-SYPA-HIATLEMFFIKEDRGVG 433
Query: 435 -------KVIN-------------------YAPGCADIVCQNNSMIPAAIDAAKNADATV 468
KVIN YA GC D+ Q+ S A AA+ AD +
Sbjct: 434 NEEEFVRKVINMKSIFEAVKDRVQNKAEVVYAKGC-DVNTQDESGFEEAKKAAQGADVVI 492
Query: 469 IV----AGLDLS-VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
+V AGL L E +DR L LPG Q +LI +V+ + +V++ +
Sbjct: 493 LVVGDKAGLRLDCTSGESRDRASLKLPGVQEKLIEEVSKVNE---NIVVVLVNGRPVALE 549
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPV 582
K K+IL +PGEEG A+ADV+FG YNPGG+L I++ + V + Y P
Sbjct: 550 GIWQKAKAILEAWFPGEEGAEAVADVLFGDYNPGGKLAISFPRDVGQVPVYYGHKPSGGK 609
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
+ + G + P + PFGYGLSYT F+YK N+ + K
Sbjct: 610 SCWHGDYVEMSTKPFL-PFGYGLSYTTFEYK---------------------NFAIEKEK 647
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
D +EVEN GK G E+V +Y++ T +K++ Y+RV
Sbjct: 648 ISM-----------DESIKISVEVENTGKYAGDEIVQLYTRKEEFLVTRPVKELKAYKRV 696
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSF 750
+ G+ KV F + D +++ G ++VG + F
Sbjct: 697 HLKPGEKKKVVFEIFP-DQFAYYDYDMKRVISPGTVEVMVGASSEDIKF 744
>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
Length = 850
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 157/420 (37%), Positives = 231/420 (55%), Gaps = 45/420 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ R+T+ EK+ + + G+PRLG+ Y +EALHGV GR
Sbjct: 27 YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-- 84
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L +K+ +S EARA +N + G
Sbjct: 85 --------------FTVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQ 130
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G +V+GLQ D R
Sbjct: 131 FSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG---------EDPR 181
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+ A N E ++RF + +++E+ ++E + FEMCV +G +S+M +Y
Sbjct: 182 YLKIVSTPKHFVA----NNEEHNRFICNPQISEKQLREYYFPAFEMCVKKGKAASIMTAY 237
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A +KA
Sbjct: 238 NALNDVPCTLNAWLLQKVLRQDWGFRGYVVSDCGGPSLLVNAHKYVK-TKETAATLSIKA 296
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + + A +Q +++ADID++ + M+LG FD + Y + +
Sbjct: 297 GLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLAARMKLGMFDSKERNPYARISPS 356
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + H ++A +AAR+ IVLLKN LPLN +K++A+VG NA G+Y G P
Sbjct: 357 VIGSKDHQQVALDAARECIVLLKNQKNMLPLNVDKLKSIAVVG--INAGTCEFGDYSGAP 414
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/288 (32%), Positives = 144/288 (50%), Gaps = 53/288 (18%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + V V G++ S+E EG+DR D+ LP Q E + ++ P +V++ AG+ +
Sbjct: 597 AVSECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAGSSL 654
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
+N+ + I +I+ YPGE+GG A+ADV+FG YNP GRLP+T+Y++ ++P P
Sbjct: 655 AVNWMDEH--IPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS-LDELP----P 707
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
+ GRTYK+F G V+YPFGYGLSY+ FKY
Sbjct: 708 FDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFKY-------------------------- 741
Query: 639 GTNKPPCAAVLIDDVKCKDY--KFTFQIEVENMGKMDGSEVVMVYSKPPGIAG-THIKQV 695
D+K KD K T ++N G+ G EV VY + P G IK++
Sbjct: 742 ------------SDLKVKDSTDKVTVSFRLKNTGRRKGDEVAQVYVRIPETGGIVPIKEL 789
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS-LLASGAHTILVG 742
G+ RV + G+S + ++ + L+ D +L +G ++VG
Sbjct: 790 KGFRRVPLEPGESRAIDIELDK-EQLRYWDTTKEQFILPAGTFDVMVG 836
>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 854
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 161/421 (38%), Positives = 229/421 (54%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D K P ER DL+ R+T+ EK+ + + G+ RL +P Y +EALHGV GR
Sbjct: 28 YKDEKAPMHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-- 85
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L ++ +S EARA +N + G
Sbjct: 86 --------------FTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQ 131
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDPY+ G +V+GLQ D R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQG---------DDDR 182
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E ++ FE CV +G +S+M +Y
Sbjct: 183 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAY 238
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A A +KA
Sbjct: 239 NALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAAALSIKA 297
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + A +Q + +ADID++ + M LG FD Q Y +
Sbjct: 298 GLDLECGDDVYDQPLLSAYRQYMVTDADIDSAAYRVLRARMELGLFDSGEQNPYTKISPA 357
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H E+A AAR+ IVLLKN LPLN +K++A+VG NA + G+Y G P
Sbjct: 358 VIGSAEHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGSSEFGDYSGLP 415
Query: 421 C 421
Sbjct: 416 V 416
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 151/289 (52%), Gaps = 55/289 (19%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + + V V G++ S+E EG+DR D+ LP Q E + ++ P +V++ AG+ +
Sbjct: 598 AVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKV--NPNIVVVLVAGSSL 655
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
IN+ + I +I+ YPGE GG+A+A+V+FG YNPGGRLP+T+Y + ++P P
Sbjct: 656 AINWMDEH--IPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-LDELP----P 708
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCRDIN 635
+ GRTYK+F G V+YPFGYGLSYT FKY +VA + +++
Sbjct: 709 FDDYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVADGEEEINV------------ 756
Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQ 694
+FQ+ +N GK G EV VY K P IK+
Sbjct: 757 -------------------------SFQL--KNSGKYAGDEVAQVYVKLPERDEVMPIKE 789
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
+ G+ERV + +G++ KV + L+ D A + + SG +TI+VG
Sbjct: 790 LKGFERVTLKSGENKKVTLKLRK-DLLRYWDEAKDKFVCPSGDYTIMVG 837
>gi|386821036|ref|ZP_10108252.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
gi|386426142|gb|EIJ39972.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
Length = 725
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 222/729 (30%), Positives = 343/729 (47%), Gaps = 90/729 (12%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVS 67
K D+P+ + K+ +R +L+ MT+ EKV + VPRLG+ E LHG++
Sbjct: 26 KSYDYPFQNPKIATEKRVDNLLSLMTIDEKVNALSTNP-EVPRLGVK-GTGHVEGLHGLA 83
Query: 68 FIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEAR-AMYNLGNA 126
G E T+FP +++ L K+I + EAR A+ G
Sbjct: 84 LGGPAGWG----GKGKEPLPTTTFPQAYGLGETWDTELLKEIAKIEGYEARYALQKYGRG 139
Query: 127 GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLK 186
GL +PN ++ RDPRWGR E+ GED + G+ + +V+GLQ SD +
Sbjct: 140 GLVIRAPNADLARDPRWGRTEESYGEDAFFNGKMTVAFVKGLQ---------GSDKTYWQ 190
Query: 187 ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRV 246
++ KH+ A N + R + S E+ +E + LPF+M V EG + M +YN+V
Sbjct: 191 TASLMKHFLA----NSNEDGRTYTSSDFDERLWREYYALPFKMGVVEGGSRAYMAAYNKV 246
Query: 247 NGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLD 306
NGIP P L + T+ +W +G I +D + + ++ HK+ D K A +KAG++
Sbjct: 247 NGIPAMVHPMLKDITV-DEWGQNGIICTDGGAYKLLLSDHKYYKD-KYLGAAATIKAGIN 304
Query: 307 LDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS---PQYKNLGKNNIC 363
D +T GA+ G + EAD+D LR Y V+++LG D S P K + +
Sbjct: 305 QFLDD-FTEGVYGALANGYLTEADLDEVLRGNYRVMIKLGMLDSSANNPYAKIGAEADSM 363
Query: 364 NPQHIE----LAAEAARQGIVLLKNDNGA--LPLNTGNIKTLALVGPHANATKAMIGNYE 417
+P +E LA EA + IVLLKND LPL +K +A++G +A+A ++ Y
Sbjct: 364 DPWELEAHKKLALEATEKSIVLLKNDPAKRLLPLQKKKVKKIAIIGEYADAV--LLDWYS 421
Query: 418 GTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----- 472
GTP SP+ G + N +++ N+ A++ AKNAD ++ G
Sbjct: 422 GTPPYTISPLQG------IKNKVGENVEVLFAKNNADGKAVEIAKNADVAIVFIGNHPTC 475
Query: 473 ----LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
V + GK+ VD + E + K+ A + ++S+ IN+ + N
Sbjct: 476 NAGWAQCPVPSNGKEAVDRQALNSEYEDLVKLVYKANPNTVVGLISSFPYTINWTQEN-- 533
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR 588
I +I V +E G AIA+V+FG YNP GRL TW VK PL N GR
Sbjct: 534 IPAIFHVTQNSQELGTAIANVLFGAYNPAGRLTQTW-----VKDISDLPPLMDYNIRNGR 588
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
TY +F G +Y FG+GLSYT FKYK PK +
Sbjct: 589 TYMYFKGKPLYAFGHGLSYTTFKYKDMEIPKQIK-------------------------- 622
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQ 707
++ + + ++ + N G++DG EVV +Y K IK++ ++R+ I AG+
Sbjct: 623 -------ENEEVSVKVNITNAGEVDGDEVVQLYVKHINSTVERPIKELKSFKRIHIKAGE 675
Query: 708 SAKVGFTMN 716
+ V +N
Sbjct: 676 TKTVSLLLN 684
>gi|255532174|ref|YP_003092546.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
gi|255345158|gb|ACU04484.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
DSM 2366]
Length = 799
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 221/807 (27%), Positives = 366/807 (45%), Gaps = 148/807 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
Y D P R +L+ +MTL EK QM L YG R+ LP EW W
Sbjct: 48 YEDPLQPLNARIDNLLSQMTLEEKTCQMATL-YGWKRVLKDSLPTKEWKTAIWKDGIANI 106
Query: 61 -EALHGVSFIGRRTNSPPGTHFDSEVPG-------------------------------- 87
E L+G G + S T V
Sbjct: 107 DEHLNGFLTWGVTSTSELVTDIKKHVWAMNETQRFFIEQTRLGIPVDFTNEGIRGVEAYE 166
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
AT FPT + ++N +L +K+G+ EARA+ G T ++P ++V RD RWGR+
Sbjct: 167 ATGFPTQLNMGMTWNRNLIRKMGRITGQEARAL------GYTNVYAPILDVARDQRWGRL 220
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
E GEDPY+V R + G+Q+ +I++ KH+A Y +
Sbjct: 221 EEVYGEDPYLVARLGVEMTLGMQENN-------------QIASTAKHFAVYSANKGAREG 267
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
D +V+ +++++ + PF+ + E + VM SYN NGIP L Q +R D+
Sbjct: 268 LARTDPQVSPREVEDIMLYPFKKVIQEAGIMGVMSSYNDYNGIPITGSEYWLTQRLRKDF 327
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAVQ 322
F GY+VSD D+++ + H + KE AV + AGL++ D + V
Sbjct: 328 GFGGYVVSDSDALEYLYNKHHVAANLKE-AVFQAFMAGLNVRTTFRPPDSIIIYARQLVN 386
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIV 380
+G+I I++ ++ + V +LG FD P K+ + + + H +A +A+++ IV
Sbjct: 387 EGRIPIETINSRVKDVLRVKFKLGLFD-QPYVKDAAASEKLVNSIAHQAVALQASKESIV 445
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+N LPL+ ++K +A++GP+A +Y + T+ ++G + +
Sbjct: 446 LLKNNNQILPLSR-SLKKIAVIGPNAADNDYAHTHYGPLQSKSTNILEGIRNKIGADKVW 504
Query: 439 YAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC ++V +N ++I A++ A AD ++V G + E K
Sbjct: 505 YAKGC-ELVDKNWPESEIFPEDPDATAIALIEDAVNTAMKADVAIVVLGGNTKTAGENKS 563
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R L LPGFQ LI + K PV V++ + IN+ + I I++ GYPG +GG
Sbjct: 564 RTTLELPGFQLNLIKAIQKTGK-PVVAVMIGTQPMGINWI--DKYIDGIVYAGYPGVKGG 620
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD-------GP 596
A+ADV+FG YNPGG+L +T+ ++ +PL NFP + D
Sbjct: 621 IAVADVLFGDYNPGGKLTLTFPKS------VGQLPL----NFPSKPNAQTDEGELAKIKG 670
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
++YPFG+GLSYT F Y ++K+ +Q +D N ++
Sbjct: 671 LLYPFGFGLSYTTFAYS--------NLKISPIEQEKDGNISI------------------ 704
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
+++ N K++G E+V +Y + T+ K + G+ER+ + ++ + FT+
Sbjct: 705 ------SVDITNTAKLEGDEIVQLYIRDVLSTVTTYEKILRGFERISLKPNETKTLKFTL 758
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
LK+ + ++ G +++G
Sbjct: 759 -FPDDLKLWNREMQHVIEPGTFKVMIG 784
>gi|423300729|ref|ZP_17278753.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
CL09T03C10]
gi|408472616|gb|EKJ91142.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
CL09T03C10]
Length = 735
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 216/765 (28%), Positives = 351/765 (45%), Gaps = 87/765 (11%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP 49
S K K Y DAK P +R DL+ RMTL EKV Q+ G VP
Sbjct: 20 SAKDKKGGALYKDAKAPIEKRVDDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVP 79
Query: 50 -RLGLPLYEWWSEALHG----VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNES 104
+G +Y + L + R P +D+ T +P + S+N
Sbjct: 80 AEIGSLIYFETNPELRNNMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPD 139
Query: 105 LWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINY 164
L ++ + EAR + TF SP I+V RDPRWGRV E GEDPY G +
Sbjct: 140 LVEQACAVSAQEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYANGVFGAAS 194
Query: 165 VRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFI 224
VRG Y D+ S +++AC KHY Y G D + + +++Q + +T++
Sbjct: 195 VRG--------YQGDNMSAENRVAACLKHYVGYGASE-AGRDYVY--TEISKQTLWDTYL 243
Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
LP++M V G +++M S+N ++G+P A+P + + ++ W G+IVSD +I+ +
Sbjct: 244 LPYKMGVKAG-AATLMSSFNDISGVPGSANPYTMTEILKNRWRHDGFIVSDWGAIEQL-- 300
Query: 285 SHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
++ L TK++A AGL++D + Y V++GK++ A +D ++R + ++
Sbjct: 301 KNQGLAATKKEAARHAFTAGLEMDMMSHAYDRHLQELVEEGKVSMAQVDEAVRRVLLLKF 360
Query: 344 RLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
RLG F+ K PQ +++AA A + +VLLKN+N LPL + K +A++G
Sbjct: 361 RLGLFERPYTPVTTEKERFLRPQSMDIAARLAAESMVLLKNENNVLPL--ADKKKIAVIG 418
Query: 404 PHANATKAMIGNYEGTPCRYTSPM--DGF---YAYSKVINYAPGCADIVCQNNSMIPAAI 458
P A ++G++ G M DG +A + YA GC + N A+
Sbjct: 419 PMAKNGWDLLGSWRGHGKDTDVVMLYDGLAAEFAGKAELRYALGC-NTKGDNREGFAEAL 477
Query: 459 DAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAV 518
AA+ +D V+ G ++ E R + LP Q EL ++ K PV L++++ +
Sbjct: 478 GAARWSDVVVLCLGEMMTWSGENASRSSIALPQMQEELAKELKKVGK-PVVLILVNGRPL 536
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSM 577
++N + P +IL + PG G +A ++ G+ NP G+L +T+ Y + I Y
Sbjct: 537 ELN--RLEPVSDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTFPYSTGQIPIYYNR- 593
Query: 578 PLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
R YK +YPFG+GLSYT+FKY
Sbjct: 594 --RKSGRGHQGFYKDMTSDPLYPFGHGLSYTEFKY------------------------- 626
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVI 696
GT P V + K + ++ V N+G DG+E V + P + T +K++
Sbjct: 627 -GTVTPSATKV------KRGEKLSAEVTVTNIGARDGAETVHWFISDPYCSITRPVKELK 679
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
+E+ I AG++ F ++ + V+ L +G + I V
Sbjct: 680 HFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLETGEYNIHV 724
>gi|423291211|ref|ZP_17270059.1| hypothetical protein HMPREF1069_05102 [Bacteroides ovatus
CL02T12C04]
gi|392663822|gb|EIY57367.1| hypothetical protein HMPREF1069_05102 [Bacteroides ovatus
CL02T12C04]
Length = 800
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 230/800 (28%), Positives = 360/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T +SP +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTANEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+ G + GLQ EG I A KH+A Y + +
Sbjct: 229 GEDPYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H ++ +AA + IV
Sbjct: 389 NEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+N LPL+ N +A++GP+ K + Y + G Y + +
Sbjct: 449 LLKNENQMLPLSK-NFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDVAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G+ DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ K +G + T
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ V FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785
>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 867
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 160/420 (38%), Positives = 225/420 (53%), Gaps = 43/420 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
ERA DL++ +TL EKV M D + RLG+ Y WW+EALHGV+ G+
Sbjct: 35 ERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWWNEALHGVARAGQ----------- 83
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPN 134
AT FP + A+F+ + + S EARA ++ GLT W+PN
Sbjct: 84 -----ATVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFKERGERGRYQGLTMWTPN 138
Query: 135 INVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY 194
INV RDPRWGR +E GEDP++ G V+GLQ D + K+ AC KHY
Sbjct: 139 INVFRDPRWGRGMEAYGEDPFMNGVLGTAVVKGLQG--------DRSGKYDKLHACAKHY 190
Query: 195 AAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + W +R F++ + +D+ ET++ F+ V +GDV VMC+YNR G P C
Sbjct: 191 AVHSGPEW---NRHSFNAENIRPRDLHETYLPAFKKLVIDGDVRMVMCAYNRFEGEPCCG 247
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ +LL +R +W F G +VSDC +I ++H D K + VL AG DL+CGD
Sbjct: 248 NNQLLRDILRNEWGFDGVVVSDCWAINDFFNKDAHAMYPDAKTASTDAVL-AGTDLNCGD 306
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIE 369
Y + + AV+QG I E +D SLR L I LG D ++ + + + +P H E
Sbjct: 307 SYPSL-VEAVEQGLITEEQLDISLRRLLIARFELGEMDPDEEVEWSKIPHSVVSSPTHSE 365
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
+A EAAR+ + LL N NGALPL + T+A++GP+AN + GNY GTP T+ + G
Sbjct: 366 MALEAARKSMTLLMNKNGALPLKKEGL-TVAVMGPNANDSLMQWGNYNGTPATTTTILQG 424
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 129/272 (47%), Gaps = 51/272 (18%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
IP+++ +AD V +G+ +E E G DR D+ LP Q E++ + A
Sbjct: 593 IPSSVAKVADADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKALHKA 652
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K +++++ I F + +IL YPG+ GG+A+A+V+FG YNP GRLP+T
Sbjct: 653 GK---EIILVNCSGSAIGFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNPAGRLPVT 709
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y++ +P N RTY++F+G +YPFGYGLSYT F Y
Sbjct: 710 FYKS------VDQLPDFQDYNMTNRTYRYFEGEPLYPFGYGLSYTTFSY----------- 752
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
++P + I + + + ++ V N G DG EVV +Y +
Sbjct: 753 -----------------DQPELSQTSI----STEEEASLKVSVANTGDYDGEEVVQLYLQ 791
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
P + G++RVFI G++ +V F +
Sbjct: 792 KPDDTEGPSLTLRGFQRVFIPKGETVEVEFQL 823
>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
Length = 923
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 159/430 (36%), Positives = 237/430 (55%), Gaps = 42/430 (9%)
Query: 21 YPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTH 80
Y ER DL+ MT EK++Q+ + A +PRLGL Y +W+E+LHGV
Sbjct: 113 YKERLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGVL------------- 159
Query: 81 FDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRD 140
GATSFP I A+++ L ++ VS EARA+ L GLT+WSP IN+ RD
Sbjct: 160 ----AEGATSFPQAIALGATWDPRLVNRVATAVSDEARALNRLYGKGLTYWSPTINIARD 215
Query: 141 PRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD 200
PRWGR E+ EDPY++ R + +++G+Q Y+ LK A KH+ A
Sbjct: 216 PRWGRNEESYSEDPYLLSRMGVAFIKGMQGDH--PYY-------LKTVATPKHFIA---- 262
Query: 201 NWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
N E R S V +++ E ++ F+ + E S+M +YN +N +P+ A+ L+
Sbjct: 263 NNEEERRHTGSSDVDMRNLYEYYLPAFKSAIVEARAYSIMGAYNELNHVPSNANMFLMTD 322
Query: 261 TIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA 320
+R W F GY+VSDC +I ++ HKF T +AVAR + AG DL+CG Y F A
Sbjct: 323 LLRRQWGFEGYVVSDCGAIHDMLYGHKFFK-TGAEAVARSILAGCDLNCGQAYREFIKDA 381
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YKNLGKNNICNPQHIELAAEAARQ 377
+ +G + E DID++L + RLG FD P+ Y ++GK+ + + ++ LA +AAR+
Sbjct: 382 LDEGLLREKDIDSALFRVLSARFRLGEFD-PPELVPYSSIGKDKLDSKENRRLALDAARK 440
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
IVLLKN N LP++ IK++A++GP NA +A +G Y G P SP++G + +
Sbjct: 441 SIVLLKN-NDILPIDKSKIKSIAVIGP--NAREAQLGIYSGFPNVLISPLEGIKNKADSL 497
Query: 438 N----YAPGC 443
+ Y GC
Sbjct: 498 DIRVGYVKGC 507
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 131/285 (45%), Gaps = 43/285 (15%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
AA+N D ++V G+ + E DR ++ LP Q EL+ + A+ + +V+++ G V
Sbjct: 665 AAEN-DLVILVLGITPGISQEELDRKEIELPSVQRELVKQTAEVNPN-IVIVLVNGGPVA 722
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
+ A+ K W Y GE GG+A+ADV+FG YNPGG+LP T+Y + P + +
Sbjct: 723 LAGAEKYAKAIVENW--YNGEFGGQALADVLFGDYNPGGKLPQTFYASTEQLPPMSDYDI 780
Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
+NN RTY + + ++PFG+GLSYT FKY S K V L++
Sbjct: 781 --INN--PRTYMYLNEQALFPFGHGLSYTTFKY---DSLKIVSNTLNETDT--------- 824
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK-PPGIAGTHIKQVIGY 698
+ Q + N+G +G EVV +Y+ KQ+ +
Sbjct: 825 --------------------LSLQFRLTNVGNRNGDEVVQIYASCKDAKFKVPRKQLKRF 864
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
R+ + G+S + F + L N + GA IL+G
Sbjct: 865 RRLTLQTGESKVLEFKI-PVDELAFYSTYENDFVVEKGAWEILIG 908
>gi|423289665|ref|ZP_17268515.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
CL02T12C04]
gi|423298158|ref|ZP_17276217.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
CL03T12C18]
gi|392663699|gb|EIY57246.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
CL03T12C18]
gi|392667376|gb|EIY60886.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
CL02T12C04]
Length = 955
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 229/764 (29%), Positives = 363/764 (47%), Gaps = 119/764 (15%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
K +++D Y DA LP ER + L+ MT PE ++ +G+P G+P LY E
Sbjct: 163 KGEVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVE 219
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
A+HG S+ G+ GAT FP + A++N L +++ + E +
Sbjct: 220 AVHGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDET-VVA 262
Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
N A WSP ++V +D RWGR ET GEDP +V + +++G Q
Sbjct: 263 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------ 306
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
SR L + KH+ + R D ++E++M+E ++PF V D S+M
Sbjct: 307 SRGLFTTP--KHFGGHGA---PLGGRDSHDIGLSEREMREVHLVPFRHVVRNYDCQSLMM 361
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+Y+ GIP +LL Q +R +W F+G+IVSDC +I + + K +A + L
Sbjct: 362 AYSDYMGIPVAGSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQAL 421
Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
AG+ +CGD Y + + A + G+I ++D R + + R F+ +P K L N
Sbjct: 422 AAGIATNCGDTYNDKEVIQAAKDGRINMVNLDNVCRTMLATMFRNELFEKNP-CKPLDWN 480
Query: 361 NIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
I + +H E+A +AAR+ IV+L+N + LPL+ +KT+A++GP A+ + G+Y
Sbjct: 481 KIYPGWNSDRHREMARQAARESIVMLENKDNLLPLSK-TLKTIAVLGPGADDLQP--GDY 537
Query: 417 --EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
+ P + S + G A +KV+ Y GC D + + IP A+ AA +D V+V
Sbjct: 538 TPKLQPGQLKSVLSGIKAAVGKQTKVL-YEQGC-DFTTPDATNIPKAVKAASQSDVVVMV 595
Query: 471 AGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
G + EA E D L+LPG Q EL+ V K PV L++ + D+
Sbjct: 596 LGDCSTSEATNNVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVVLILQAGRPYDL- 653
Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRP 581
K + K+IL PG+EGG A ADV+FG YNPGGRLP+T+ +PL
Sbjct: 654 -LKASEMCKAILVNWLPGQEGGPATADVLFGDYNPGGRLPMTFPRH------VGQLPLYY 706
Query: 582 VNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
GR Y++ D +Y FGYGLSYT F+Y D+K+ Q+ + N V
Sbjct: 707 NFKTSGRRYEYVDMEFYPLYRFGYGLSYTSFEYS--------DLKI---QEKSNGNVMV- 754
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGY 698
Q V+N+G G EV +Y + T + ++ +
Sbjct: 755 -----------------------QATVKNVGGCAGDEVAQLYITDMYASVKTRVMELKDF 791
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
R+ + G+S V F + + ++++ + ++ G ++VG
Sbjct: 792 TRIHLQPGESKNVSFELTPY-DISLLNDRMDRVVEKGEFKVMVG 834
>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 814
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 219/727 (30%), Positives = 336/727 (46%), Gaps = 117/727 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ E HG IG T FPT I +++N L +++
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------------TTVFPTSIGQASTWNPELIRRM 190
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G+ ++TEA A + P +++ RDPRW RV ET GED Y+ G V+G Q
Sbjct: 191 GRAIATEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQ 245
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
E+ R K+ A KH+AAY W + V ++M+E PF
Sbjct: 246 G----EFPRTKG----KVIATLKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFRE 294
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V G +S VM SYN ++GIP A+ LL ++ W F G++VSD +I + E +
Sbjct: 295 AVAAGALS-VMSSYNEIDGIPCTANSNLLTGLLKKRWQFKGFVVSDLYAIGGLREHG--V 351
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
DT +A + + AG+D D G + Y + AV++G + E I+ ++ + + +G F
Sbjct: 352 ADTDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLF 411
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
D + + + + +H+ELA E ARQ I+LLKN N LPLN +KT+A++GP+A+
Sbjct: 412 DHPFVDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KMKTIAVIGPNADN 470
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAYSKVIN-----YAPGCADIVCQNNSMIPAAIDAA 461
M+G+Y + + +DG KV N YA GCA + + S AI+AA
Sbjct: 471 IYNMLGDYTAPQSESSVVTVLDGIR--QKVSNDTHIIYAKGCA-VRDSSKSGFQEAIEAA 527
Query: 462 KNADATVIVAG----LDLSVE-------------------AEGKDRVDLLLPGFQTELIN 498
+ +D V+V G D S + EG DR L L G Q ELI
Sbjct: 528 RQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIR 587
Query: 499 KVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGG 558
+V K P+ LV++ + + + ++ +I+ YPG +GG A+ADV+FG YNP G
Sbjct: 588 EVGKLNK-PIVLVLIKGRPLLLEGIE--AEVDAIVDAWYPGMQGGNAVADVLFGDYNPAG 644
Query: 559 RLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF--DGPVVYPFGYGLSYTQFKYKVAS 616
RL I+ V +P+ G K+ +G YPFGYGLSYT F Y
Sbjct: 645 RLTIS------VPRSVGQLPVYYNTKRKGNRSKYIEEEGTPRYPFGYGLSYTSFNYS--- 695
Query: 617 SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSE 676
D+ V + C ++V N G DG E
Sbjct: 696 ----------------DLKAEVVEAEDSCLV-------------NISVKVRNEGSRDGDE 726
Query: 677 VVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASG 735
VV +Y + + T KQ+ G++R+ + G++ ++ F ++ KSL + + G
Sbjct: 727 VVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-KSLALYMQNEEWAVEPG 785
Query: 736 AHTILVG 742
T+++G
Sbjct: 786 RFTLMLG 792
>gi|322371968|ref|ZP_08046510.1| glycoside hydrolase family 3 domain protein [Haladaptatus
paucihalophilus DX253]
gi|320548390|gb|EFW90062.1| glycoside hydrolase family 3 domain protein [Haladaptatus
paucihalophilus DX253]
Length = 776
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 228/789 (28%), Positives = 361/789 (45%), Gaps = 136/789 (17%)
Query: 24 RAKDLVERMTLPEKVQQMGD---------------------LAYGV---PRLG----LPL 55
R ++L+E MT+ EKV Q+G L G+ RLG LP
Sbjct: 17 RVEELLEEMTITEKVAQLGSVNANKLLDDDGSLDRKAVEELLENGIGHLTRLGGEGSLPP 76
Query: 56 YEWWSEALHGVSFIGRRTNS--PPGTHFDSEV----PGATSFPTVILTTASFNESLWKKI 109
E F+G T P H + P T+FP ++ ++++ L +I
Sbjct: 77 REAAKRTNELQDFLGSETRLGIPAIPHEECLSGYMGPSGTTFPQMLGVASTWSPDLVAEI 136
Query: 110 GQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGL 168
T+ + A+ G T SP +++ RD RWGRV ET GEDPY+V A YV GL
Sbjct: 137 TDTIRGQLEAI------GTTHALSPVLDIARDLRWGRVEETFGEDPYLVAAMARGYVNGL 190
Query: 169 QDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFE 228
Q D D ISA KH+A + G +R + V ++++ET + PFE
Sbjct: 191 QG--------DGDG----ISATLKHFAGHGAGEG-GKNRSSVN--VGRRELRETHLFPFE 235
Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
+ D SVM +Y+ ++GIP +D LL +RG+W F G +VSD S++ ++S
Sbjct: 236 AVIKTADAESVMNAYHDIDGIPCASDGWLLTDVLRGEWGFDGTVVSDYYSVE-FLQSEHG 294
Query: 289 LNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLG 346
+ +K+ A ++AGLD++ D Y + + AV+ G +AEA ++T++R + G
Sbjct: 295 VAASKQAAGVMAVEAGLDVELPYTDCYGDHLVNAVEDGDVAEATVNTAVRRVLRAKAEKG 354
Query: 347 YFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
D + +L AAR+ + LLKN++ LP + ++T+A+VGP A
Sbjct: 355 LLDDPTVDVDAAAAPFNTENARDLTTRAARESMTLLKNEDDFLPFDGEELETVAVVGPKA 414
Query: 407 NATKAMIGNYEGTPCRY---------TSPMDGFYAYSKV----INYAPGCADIVCQNNSM 453
+ + ++G+Y P Y T+P+D A + + Y GC
Sbjct: 415 DNAQELMGDY-AYPAHYPTEEVDLDATTPLDAIEARGEHAGFDVRYEQGCTTTGSSTEDF 473
Query: 454 IPAAIDAAKNADATVIV---AGLDLS-------------VEAEGKDRVDLLLPGFQTELI 497
AA A A V + +D S EG D VDL LPG Q EL+
Sbjct: 474 DSAAEAAEAADVAVTFVGARSAVDFSDIDEKQADLPSVPTSGEGCDVVDLDLPGVQQELV 533
Query: 498 NKVADAAKGPVTLVIMSAGAVDINF-AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
+V + P+ +V++S + + A+ P ++L+ PGE GG IA+V+FG++NP
Sbjct: 534 ERVHETGT-PLVVVVVSGKPHSVEWIAEEAP---ALLYAWLPGERGGEGIAEVLFGEHNP 589
Query: 557 GGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVA 615
GGRLP++ + + Y P N + + + +YPFG+GLSYT F+Y
Sbjct: 590 GGRLPVSIPRSVGQLPVYYNRKP-----NTANEEHVYTESTPLYPFGHGLSYTDFEYG-- 642
Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGS 675
D+ L D P V + ++ V N G DG
Sbjct: 643 ------DLSLSTDSIA------------PSGRV------------SAEVTVSNTGDRDGH 672
Query: 676 EVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA 733
EVV +Y +K P A +++++G+ER+F+AAG+S ++ F ++A + L D N +
Sbjct: 673 EVVQLYASAKSPSQA-RPVQELVGFERIFLAAGESKRIIFEIDASQ-LAFHDRDMNLAVE 730
Query: 734 SGAHTILVG 742
G + + VG
Sbjct: 731 RGPYELRVG 739
>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 858
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 161/421 (38%), Positives = 234/421 (55%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + K P ER DL+ R+T+ EK+ + + G+ RL +P Y +EALHGV GR
Sbjct: 29 YKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-- 86
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L K++ +S EARA +N + G
Sbjct: 87 --------------FTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQ 132
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDPY+ G +V+GLQ +DSR
Sbjct: 133 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQG---------NDSR 183
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E ++ FE CV EG +S+M +Y
Sbjct: 184 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAY 239
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A +KA
Sbjct: 240 NALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKA 298
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + A +Q + +ADID++ + M+LG FD Y +
Sbjct: 299 GLDLECGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGENNPYTKISPK 358
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H ++A +AAR+ IVLLKN N LPL+ IK++A+VG NA ++ G+Y G P
Sbjct: 359 VIGSKEHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INAGRSEFGDYSGLP 416
Query: 421 C 421
Sbjct: 417 V 417
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/284 (33%), Positives = 145/284 (51%), Gaps = 49/284 (17%)
Query: 462 KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDI 520
+ + V V G++ ++E EG+DR D+ LP Q E + ++ P +V++ AG+ + I
Sbjct: 601 RECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKV--NPNIVVVLVAGSSLSI 658
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
N+ + I +I+ YPGE GG+A+A+V+FG YNPGGRLP+T+Y + ++P P
Sbjct: 659 NWMDEH--IPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-LDELP----PFD 711
Query: 581 PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
+ GRTY++F G V+YPFGYGLSYT FKY D Q D N V
Sbjct: 712 DYDITKGRTYQYFKGNVLYPFGYGLSYTSFKY--------------SDLQVTDGNQEV-- 755
Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYE 699
N C ++N+GK G EV +Y K P IK++ G+E
Sbjct: 756 NVSFC--------------------LKNVGKYAGDEVAQIYVKLPERDKIMPIKELKGFE 795
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
R+ + G+S KV + L+ D + SG +TI++G
Sbjct: 796 RISLKRGESRKVTIRLKK-DLLRYWDEEKECFVHPSGDYTIMIG 838
>gi|346226406|ref|ZP_08847548.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 775
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 197/659 (29%), Positives = 324/659 (49%), Gaps = 84/659 (12%)
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
T+FP + S++ L ++ + + EA A +G+ + ++P I++ RDPRWGRV+
Sbjct: 129 TTFPIPLAEACSWDLELMEQSARIAAEEATA------SGIAWNFAPMIDIARDPRWGRVM 182
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GEDPY+ A VRG Q G+E ++D S+ + A KH+ Y G D
Sbjct: 183 EGAGEDPYLGSLVARARVRGFQ---GIETYKDF-SKINTMMATSKHFVGYGAVQ-AGRDY 237
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D V + + ET++ PF+ V+EG V++ M ++N +NG+P + L + +R W
Sbjct: 238 HSVDMSV--RTLHETYLPPFKAAVDEG-VTAFMTAFNDLNGVPCTGNKYLFKEILRDRWG 294
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
F G +V+D +IQ +V +H F D K A + AG+D+D + + + V++GK+
Sbjct: 295 FGGMVVTDYTAIQEMV-AHGFARDLKH-ATELAIDAGIDMDMISEGFVTYLKELVEEGKV 352
Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIVLLKN 384
+E ID ++ + + LG FD +Y N + + NP+H++ A E A++ IVLL+N
Sbjct: 353 SEKQIDVAVSRILEMKFLLGLFDDPFKYCNAERQKEVVMNPEHLKAAREVAQRSIVLLEN 412
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF---YAYSKV-IN 438
N LPL K +AL+GP +++ G + +G P + + M+G Y S+V +
Sbjct: 413 KNNVLPLKKNEPKRVALIGPFVKERESLTGEWAIKGDPDKSVTLMEGLEEKYKDSQVKFS 472
Query: 439 YAPGCA----DIVCQ--------NNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
YA G + D Q + S AI+ A+ +D ++ G E R D
Sbjct: 473 YAKGTSLPVIDRTTQKVSTTRVPDRSGFSEAINLARTSDVILVAMGEKFHWSGEAASRTD 532
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
+ LPG Q EL+ ++ K P+ LV+ + +D+++ N + +I+ YPG G A+
Sbjct: 533 ITLPGNQRELLKELKKTGK-PIILVLFNGRPLDLSWEAEN--VDAIVEAWYPGIMAGHAV 589
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPY------TSMPLRPVNNFPGRTYKFFDGP--VV 598
ADV+ G YNP +L +T + N +IP T P N R+ + D P +
Sbjct: 590 ADVLSGDYNPSAKLVMT-FPRNVGQIPIFYNVKNTGRPFDEDNPADYRS-SYIDCPNSPL 647
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
YPFGYGLSYT F+Y N + + K +L
Sbjct: 648 YPFGYGLSYTSFEYD---------------------NAKISSKKLERGGIL--------- 677
Query: 659 KFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
T ++V N G MDG EVV +Y G +K++ G++++ + G++ V FT++
Sbjct: 678 --TVSVDVTNTGTMDGEEVVQLYIHDKVGSVVRPVKELKGFKKIHLKKGETKTVEFTID 734
>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 858
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 161/421 (38%), Positives = 234/421 (55%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + K P ER DL+ R+T+ EK+ + + G+ RL +P Y +EALHGV GR
Sbjct: 29 YKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-- 86
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L K++ +S EARA +N + G
Sbjct: 87 --------------FTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQ 132
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDPY+ G +V+GLQ +DSR
Sbjct: 133 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQG---------NDSR 183
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E ++ FE CV EG +S+M +Y
Sbjct: 184 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAY 239
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A +KA
Sbjct: 240 NALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKA 298
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + A +Q + +ADID++ + M+LG FD Y +
Sbjct: 299 GLDLECGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGENNPYTKISPK 358
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H ++A +AAR+ IVLLKN N LPL+ IK++A+VG NA ++ G+Y G P
Sbjct: 359 VIGSKEHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INAGRSEFGDYSGLP 416
Query: 421 C 421
Sbjct: 417 V 417
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 147/284 (51%), Gaps = 49/284 (17%)
Query: 462 KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VDI 520
+ + V V G++ ++E EG+DR D+ LP Q E + ++ P +V++ AG+ + I
Sbjct: 601 RECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKV--NPNIVVVLVAGSSLSI 658
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
N+ + I +I+ YPGE GG+A+A+V+FG YNPGGRLP+T+Y + ++P P
Sbjct: 659 NWMDEH--IPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-LDELP----PFD 711
Query: 581 PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
+ GRTY++F G V+YPFGYGLSYT FKY D+++ + Q ++++
Sbjct: 712 DYDITKGRTYQYFKGNVLYPFGYGLSYTSFKYS--------DLQVTEGNQEVNVSFC--- 760
Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYE 699
++N+GK G EV +Y K P IK++ G+E
Sbjct: 761 -------------------------LKNVGKYAGDEVAQIYVKLPERDKIMPIKELKGFE 795
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
R+ + G S KV + L+ D + SG +TI+VG
Sbjct: 796 RISLKRGGSRKVTIRLKK-DLLRYWDEEKGCFVHPSGDYTIMVG 838
>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
Length = 881
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 162/423 (38%), Positives = 235/423 (55%), Gaps = 50/423 (11%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ + +L R DL+ER+T+ EK+ Q+ + + RLG+P Y WW+E+LHGV+ G
Sbjct: 27 YPFQNPELDDSARVADLLERLTVEEKIDQLLYTSPAIERLGIPEYNWWNESLHGVARAGY 86
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN-- 125
AT FP I A+++ L K++ +S EARA ++ G
Sbjct: 87 ----------------ATVFPQSITIAAAWDSDLLKEVADAISDEARAKHHEYIRRGQRG 130
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLTFWSPNIN+ RDPRWGR ET GEDPY+ G+ I YV+GLQ +D
Sbjct: 131 IYQGLTFWSPNINIFRDPRWGRGHETYGEDPYLTGQLGIAYVKGLQ---------GNDPN 181
Query: 184 PLKISACCKHYAAYDLDNWEGND--RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
LK+ A KH+A + G + R FD +++D+ ET++ F V +GDV SVM
Sbjct: 182 YLKLVATAKHFAVH-----SGPEPLRHEFDVSPSKRDLWETYLPAFRYLVKQGDVKSVMT 236
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+YNRV G A L +R W+F GY+VSDC +I I + HK D E + V+
Sbjct: 237 AYNRVYGEAASASDTLFT-ILRDYWDFDGYVVSDCFAISDIWKYHKIAKDAAEASAMAVI 295
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNL 357
+ G DL+CGD Y A QQG + E DID +L L ++LG FD P+ Y +
Sbjct: 296 E-GCDLNCGDSYEKLNQ-AYQQGMVTEKDIDIALSRLMEARIKLGMFD--PEQLVPYAQI 351
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
N + +H +LA +AA++ IVLLKN LPL + ++K++A++GP+A+ +++ GNY
Sbjct: 352 PFNVNTSEKHNQLALKAAKESIVLLKNQGDLLPL-SKDLKSVAVIGPNADNIQSLWGNYN 410
Query: 418 GTP 420
G P
Sbjct: 411 GNP 413
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 139/272 (51%), Gaps = 45/272 (16%)
Query: 473 LDLSVEA-EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
+D+ VE G DR L LP Q L+ +VA K P+ LV+++ A+ IN+A N I +
Sbjct: 620 MDVVVEGFAGGDRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSALSINWAAEN--IPA 676
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYK 591
I+ GY G++GG A+A+V+FG YNP RLP+T+Y++ +P N GRTY+
Sbjct: 677 IMTAGYAGQQGGNAVAEVLFGDYNPAARLPVTYYKS------VEDLPDFEDYNMDGRTYR 730
Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
+F+ +YPFGYGLSYT F Y P +D+
Sbjct: 731 YFEKEPLYPFGYGLSYTTFDYSKFQLPSKIDM---------------------------- 762
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAK 710
+ +EV N G DG EVV VY + G I++++G++R+ + G+S K
Sbjct: 763 -----NESIELSVEVTNTGAYDGDEVVQVYLTDEKGSTPRPIRELVGFKRIHLKKGESQK 817
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V FT+ + L ++D+ + ++ G +I VG
Sbjct: 818 VQFTIEP-RQLSMIDDKGDLVIEPGVFSISVG 848
>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 221/723 (30%), Positives = 344/723 (47%), Gaps = 114/723 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG AT FPT I A+++ L K++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPELVKEV 170
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ ++ E R+ G + P +++ RDPRW RV ET GEDP + G + V GL
Sbjct: 171 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 225
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ S+ A KH+ AY + EG ++ S V +D+ + F+ PF
Sbjct: 226 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 274
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++GIP ++ LL Q +R +W F G++VSD SI+ I ESH F+
Sbjct: 275 AIDAGALS-VMTSYNSIDGIPCTSNHNLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 332
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
TKE+A + + AG+D+D G D YTN AVQ G++ +A IDT++ + + +G F
Sbjct: 333 APTKENAAIQSVTAGVDVDLGGDAYTNLCH-AVQSGQMDKAVIDTAVCRVLRMKFEMGLF 391
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + + +HIELA + A+ I LLKN+N LPL+ I +A++GP+A+
Sbjct: 392 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-TINKVAVIGPNADN 450
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
M+G+Y + +DG + Y GCA I + I AI+AA+ +
Sbjct: 451 RYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCA-IRDTTVNEIEQAIEAARRS 509
Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
+ A V G +E EG DR L L G Q EL+ +
Sbjct: 510 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 569
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P+ +V + ++ N+A ++L YPG+EGG AIADV+FG YNP GRLP
Sbjct: 570 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 626
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
I+ + +IP P N+ Y +Y FGYG+SYT F+Y
Sbjct: 627 IS-VPRSVGQIPVYYNQKAPRNH----DYVEVSSSPLYSFGYGMSYTTFEYS-------- 673
Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
D++ + K +C ++++ +V+N GK DG EV +
Sbjct: 674 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 705
Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
Y + + +KQ+ +ER + G+ KV F + + +V+ ++ SG +
Sbjct: 706 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHL 764
Query: 740 LVG 742
++G
Sbjct: 765 MIG 767
>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
Length = 805
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 233/813 (28%), Positives = 355/813 (43%), Gaps = 153/813 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P R + L+ +MTL EKV QM + LG P+Y+
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYKRVGEDIRLTPQLEKEI 93
Query: 58 ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
W LH S R +N H +P
Sbjct: 94 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q E D S + A KH+A+Y
Sbjct: 209 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G + SVM SYN ++G P LL
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD ++ + E ND +A + + AG+D D G + Y + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A A ID ++R + + ++G FD + + + +H LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 434
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN + LPL +I+TLA++GP+A+ M+G+Y +GT + +
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI+ A+NADA V+V G D S E
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 552
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +++ K PV LV++ + + A +
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
W YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663
Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+ P YPFGYGLSYT F Y D+K + T G++
Sbjct: 664 IEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
D + + ++N G DG EV +Y + + T KQ+ + R+ + A +S
Sbjct: 698 ------DCRVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAAESR 751
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+V FT++ KSL + ++ G TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|293373755|ref|ZP_06620101.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292631245|gb|EFF49877.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 800
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 230/800 (28%), Positives = 361/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T +SP +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + GLQ+ EG I A KH+A Y + +
Sbjct: 229 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H ++ +AA + IV
Sbjct: 389 NEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+N LPL+ N +A++GP+ K + Y + G Y + +
Sbjct: 449 LLKNENQMLPLSK-NFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
Y GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YVKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G+ DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ K +G + T
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ V FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNRFTVEPGSFSVMVG 785
>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
17393]
gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 879
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 169/456 (37%), Positives = 238/456 (52%), Gaps = 49/456 (10%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY + L ERA DLV R+TL EK M + + +PRLG+ Y+WW+EALHGV G
Sbjct: 41 PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 99
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------ 126
AT FP I ASFN L + +S EARA +
Sbjct: 100 ---------------ATVFPQAIGMGASFNNELLYDVFTAISDEARAKNTEFSKEGGLKR 144
Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLT W+PNIN+ RDPRWGR ET GEDPY+ + + VRGLQ EG +Y
Sbjct: 145 YQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEGEKYD------- 197
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ AC KHYA + W +R F++ + +D+ ET++ F+ V + V VMC+Y
Sbjct: 198 -KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAY 253
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLND-TKEDAVARVLK 302
NR G P C +LL +R +W + +VSDC +I D K+ A A+ +
Sbjct: 254 NRFEGEPCCGSNRLLMHILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKAVL 313
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
+G D++CGD Y + AV++G I E ID SL+ L LG D Q + + +
Sbjct: 314 SGTDIECGDSYGSLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIPYS 372
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H ELA AR+ +VLL+N+ LPLN N+K +A+VGP+AN + GNY G P
Sbjct: 373 VVDSKEHRELALRMARESLVLLQNNQSLLPLNK-NLK-VAVVGPNANDSVMQWGNYNGFP 430
Query: 421 CRYTSPMDGFYAY---SKVINYAPGC---ADIVCQN 450
+ ++G Y S++I Y PGC +D+ Q+
Sbjct: 431 SHTITLLEGIREYLPESQII-YEPGCDLTSDVTLQS 465
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 135/300 (45%), Gaps = 53/300 (17%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
+ ++ K AD + G+ +VE E G DR + LP Q+ L+ ++ A
Sbjct: 606 LKQTVNKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 665
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K +V ++ I + +IL YPG+ GG AIA+V+FG YNP GRLP+T
Sbjct: 666 GK---KIVFVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 722
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y++ + +P + RTY++ ++PFG+GLSYT F+Y AS
Sbjct: 723 FYKST------SQLPGFEDYSMKERTYRYMTEAPLFPFGHGLSYTTFRYGDASL------ 770
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
Q+ +D T+ T I V N+G+ DG EVV VY +
Sbjct: 771 ---NTQEVKDGEQTILT-----------------------IPVSNVGEYDGEEVVQVYLR 804
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
PG + ++R IA G ++ V +++ + + D N++ G + IL G
Sbjct: 805 RPGDKEGPSHALRAFKRANIAKGATSNVTVSLSK-EDFEWFDTETNTMRPIEGDYEILYG 863
>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
Length = 1552
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 223/763 (29%), Positives = 338/763 (44%), Gaps = 133/763 (17%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG-----------------VPRLGLP 54
PY +A LP R DL++RMTL EK+ QM + + +
Sbjct: 719 LPYQNAALPSAIRVHDLLQRMTLDEKLAQMRHIHFKHYNTDGHVDLTKLRNNYTHSMSFG 778
Query: 55 LYEWW----SEALHGVSFIGRRTNSPPGTHFDSEV------------PGATSFPTVILTT 98
+E + ++ VS I + N+ T F V G T FP I
Sbjct: 779 CFEAFPYSSTQYRQAVSTI--QQNAADSTRFGIPVIPVIEGIHGIVQDGCTIFPQAIAQG 836
Query: 99 ASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVG 158
A+FN L ++ Q + TE RA+ +P++++ R+ RWGRV ET GEDPY++
Sbjct: 837 ATFNPQLVFRMAQHIGTEMRAI-----GARQVLAPDLDIAREQRWGRVEETFGEDPYLIS 891
Query: 159 RYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAY-------DLDNWEGNDRFHFD 211
R NYV+G+Q G+ KH+ A+ +L + +G R FD
Sbjct: 892 RMGYNYVKGIQSRGGI--------------PTLKHFVAHGTPQGGLNLASVKGGQRELFD 937
Query: 212 SRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGY 271
++ PFE + SVM Y+ + + P L +R +F GY
Sbjct: 938 ----------VYVKPFEYVIRHTKAGSVMNCYSAYDNEAITSSPFFLRTLLRDSLHFKGY 987
Query: 272 IVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADI 331
I SD SI + H D++ +A + + AG+DL+ G Y + QG + +A I
Sbjct: 988 IYSDWGSIPMLRYFHH-TADSETEAAQQAINAGVDLEAGSDYYRTAPTLIAQGLLDKARI 1046
Query: 332 DTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
D++ + G FD + I P+ + +A + A + +VLL+N N LPL
Sbjct: 1047 DSAAAHVLYTKFEAGLFDELASDTLHWRQQIHTPEAVAVAKQLADESLVLLENRNHFLPL 1106
Query: 392 NTGNIKTLALVGPHANATKAMIGNYEGTP-CRY-TSPMDGFYAYSKV---INYAPGCADI 446
+ + ++A+VGP NA + G+Y T R+ +P+ G + + + Y GC D
Sbjct: 1107 DLNRLHSIAVVGP--NAAQVQFGDYSWTADNRHGITPLAGIQQVAGMRTKVRYVKGC-DY 1163
Query: 447 VCQNNSMIPAAIDAAKNADATVIVAGLDL---------SVEAEGKDRVDLLLPGFQTELI 497
QN I A+ AK +D TV+V G S EG D DL+LPG Q +LI
Sbjct: 1164 YSQNTDSIDEAVALAKQSDVTVVVVGTQSMLLARPSQPSTSGEGYDLSDLILPGVQQQLI 1223
Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
++ AA G +V+M G + A N K ++L Y GE+ G ++A +FG+ NP
Sbjct: 1224 ERI--AATGKPFIVVMVTGRPLLTEAFKN-KADALLVQWYGGEQAGLSLAQALFGQLNPS 1280
Query: 558 GRLPITWYEAN-YVKIPYTSMPL-------RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQ 609
GRLPI++ +A + + Y +P + + PGR Y F D YPFGYGLSYT
Sbjct: 1281 GRLPISFPKATGQLPVYYNHLPTDKGYYNKKGTPDKPGRDYVFADPYPAYPFGYGLSYTT 1340
Query: 610 FKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENM 669
FKY + L K Q TN+ AV TF+ V+N
Sbjct: 1341 FKYS--------QLALSKKQ----------TNENDTIAV------------TFR--VQNT 1368
Query: 670 GKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKV 711
GK G EV +Y + T IKQ+ G+E+ + G++ +
Sbjct: 1369 GKRAGKEVAQLYIRDMKSSVATPIKQLFGFEKCALQPGETKTI 1411
>gi|423215778|ref|ZP_17202304.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
CL03T12C04]
gi|392691421|gb|EIY84666.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
CL03T12C04]
Length = 1049
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 219/767 (28%), Positives = 357/767 (46%), Gaps = 100/767 (13%)
Query: 16 DAKLPYPERA----KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG- 70
++KLP+ A KDL+ RMT+ EK+ Q+ G L P E+ S++L +G
Sbjct: 328 NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386
Query: 71 ----------RRTNSPPGTHFDSEVP----------GATSFPTVILTTASFNESLWKKIG 110
R H ++P T FPT + + S++ + ++
Sbjct: 387 VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446
Query: 111 QTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
+ + E+ A AGL + ++P +++ RD RWGRV+E GED Y+ A V G Q
Sbjct: 447 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
++ ++ L AC KH+ AY L G D D ++E+ + +T++ PF+
Sbjct: 501 ------WNLWENNSVL---ACAKHWVAYGLPQ-AGRDYAPVD--MSERTLFDTYLPPFKA 548
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
C++ G V + M ++N +NGIP A P LL +RG WNF+G++VSD ++++ +V
Sbjct: 549 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVAQGVAE 607
Query: 290 NDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
+D +DA +G+D+D D Y + ++ GKI+ D+D S+ + + LG F
Sbjct: 608 DD--KDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665
Query: 349 DGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
++ N I + ++ A + A + VLLKNDN LPL N++++A+VGP A
Sbjct: 666 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724
Query: 407 NATKAMIGNY--EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
+ ++G++ G T+ + G + YA GC D ++ S A+
Sbjct: 725 DNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAVKL 783
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A +D + V G + E + R L LPG Q ELI ++ K PV +V+M+ + I
Sbjct: 784 ASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPLSI 842
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYT-SMP 578
+ N + +IL + G G AIAD++FG YNP GRL I++ V + Y
Sbjct: 843 EWVDKN--VSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYKKS 900
Query: 579 LRPVNNFPGRTYKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
RP + T + D P +YPFGYGLSYT F Y V S + Y
Sbjct: 901 GRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSYSVPQSTQK--------------EY 946
Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQV 695
T + + + V N G DG E V +Y + +K++
Sbjct: 947 T------------------RQETISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVKEL 988
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++++F+ AG+S V F ++ +L D A N ++ G I+ G
Sbjct: 989 KAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034
>gi|410096880|ref|ZP_11291865.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225497|gb|EKN18416.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
CL02T12C30]
Length = 799
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 241/814 (29%), Positives = 364/814 (44%), Gaps = 155/814 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
Y D + P R +DL+ +MTL EK QM L YG R+ LP W W
Sbjct: 41 YEDPEAPIEARVQDLLNQMTLEEKSCQMATL-YGFGRVLKDSLPTEGWKNEIWKDGIANI 99
Query: 61 -EALHGVSFIGRRTNS---PPGTH----------------------FDSE-VPG-----A 88
E L+GV RRT P H F +E + G A
Sbjct: 100 DEQLNGVGSARRRTPDLIYPFSNHAEAINKTQRWFIEETRLGIPVDFSNEGIHGLNHTKA 159
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAM-YNLGNAGLTFWSPNINVVRDPRWGRVL 147
T P I +++N L + G EA+A+ YN ++P ++V RDPRWGRVL
Sbjct: 160 TPLPAPINIGSTWNRDLVHQAGDIAGKEAKALGYN------NVYAPILDVARDPRWGRVL 213
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
ET GEDPY+VG I V+G+Q GV ++ KH+A Y + +
Sbjct: 214 ETYGEDPYLVGELGIQMVKGIQQ-NGV-------------ASTLKHFAVYSIPKGGRDAA 259
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D V +++ E + PF+ V + VM SYN +G+P A L Q +R ++
Sbjct: 260 VRTDPHVAPRELHEIHLYPFKRVVQKAHPKGVMSSYNDWDGVPVTASYYFLTQLLRQEYG 319
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------M 318
F GYIVSD ++++ V++ + D+ E+AV +V++AGL++ TNFT
Sbjct: 320 FKGYIVSDSEAVE-FVQTKHHVADSYEEAVRQVVEAGLNV-----RTNFTHPKDYILPVR 373
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKN--LGKNNICNPQHIELAAEAAR 376
V++GK++ +D + + V LG FD SP K+ + +H + + +
Sbjct: 374 KLVKEGKLSMKSVDRMVADVLRVKFELGLFD-SPYVKDPKAADKIVGADKHRDFVLDMQK 432
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--- 433
Q +VLLKN+N LPL+ K + + GP A T MI Y + DG Y
Sbjct: 433 QSLVLLKNENNLLPLDKNQTKKVLIAGPLAKETNYMISRYGPQGLDNITVYDGIKDYLGN 492
Query: 434 SKVINYAPGC--ADIVCQNNSMIPAAI-DAAK-----------NADATVIVAGLDLSVEA 479
+ YA GC D ++ ++P + D K + D + V G D S
Sbjct: 493 QTEVVYAKGCEVKDANWPDSEIVPTPLTDEEKKGIAEAATAAADCDVIIAVLGEDESCTG 552
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E K R L LPG Q +L+ + K PV LV+++ + IN+A N I SIL +PG
Sbjct: 553 ESKSRTGLDLPGRQQQLLEALHATGK-PVVLVLINGQPLTINWADRN--IPSILEAWFPG 609
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP--- 596
+ GG AIA +FG YNPGGRL +T + + +I + + P +P + + ++F+GP
Sbjct: 610 QLGGEAIAQTLFGDYNPGGRLSVT-FPRSIGQIEF-NFPFKPGS----QDGQYFEGPNGS 663
Query: 597 -------VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
+YPFGYGLSYT F Y N +V P +
Sbjct: 664 GRTRVNGALYPFGYGLSYTTFAYS---------------------NLSVKQETPYSQS-- 700
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQS 708
T ++V N GK G EVV +Y + + + V+ G+ER+ + G++
Sbjct: 701 ---------PVTVTVDVTNTGKRAGDEVVQLYIRDKVSSVIAYESVLRGFERISLQPGET 751
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V F + + L+I+D + G + +G
Sbjct: 752 KTVSFVL-LPEDLQILDRHMEWTVEPGEFEVRIG 784
>gi|383115340|ref|ZP_09936096.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
gi|313695250|gb|EFS32085.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
Length = 735
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 212/760 (27%), Positives = 355/760 (46%), Gaps = 97/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
Y D K P +R DL+ RMTL EKV Q+ G VP +G +Y
Sbjct: 30 YKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYFE 89
Query: 59 WSEALHGV----SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
+ AL + R P +D+ T +P + S+N L ++ +
Sbjct: 90 TNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G + V+G
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +++AC KHY Y G D + + +++Q + +T++LP+EM V G
Sbjct: 198 -YQGDDLSAENRMAACLKHYVGYGASE-AGRDYVY--TEISKQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P A+ ++ + ++ W G+IVSD +I+ + ++ L TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANSYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
+A AGL++D + Y V++G+++ A +D ++R + ++ RLG F+
Sbjct: 311 EAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K PQ +++AA A + +VLLKN+N LPL + K +A++GP A ++
Sbjct: 371 PATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLT--DKKKIAVIGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G +G +A + YA GCA N A++AA+ +D V
Sbjct: 429 GSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEALEAARWSDVVV 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G ++ E R + LP Q EL ++ A K P+ LV+++ +++N + P
Sbjct: 488 LCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELN--RLEPI 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
+IL + PG G +A ++ G+ NP G+L +T+ PY++ +P+
Sbjct: 545 SDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596
Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
GR ++ F + +YPFG+GLSYT+FKY GT
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
P V D + + ++ V N+G DG+E V + P + T +K++ +E+
Sbjct: 631 PSVTKVKRGD------RLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
I AG++ F ++ + V+ L +G + ILV
Sbjct: 685 LIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 837
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 163/431 (37%), Positives = 233/431 (54%), Gaps = 46/431 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER +DL+ ++T+ EKV + + G+ R+G+ Y +EALHG+ G+
Sbjct: 14 YKNMNAPIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGIIRPGK-- 71
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I + +N L I +S EARA +N G
Sbjct: 72 --------------FTVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQ 117
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDPY+ G +V+GLQ R
Sbjct: 118 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQG---------DHPR 168
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK A KH+AA N E ++RF+ D+ +TE D++E + FE C+ EG S+M +Y
Sbjct: 169 YLKAVATPKHFAA----NNEEHNRFYCDAAITETDLREYYFPAFEKCIREGKAESIMTAY 224
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +NG+P A+ LLN+ ++ DW F+GYIVSDC + ++ H+++ T E A +KA
Sbjct: 225 NAINGVPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVK-TPEAAAMIAIKA 283
Query: 304 GLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLD++CGDY + N + A +Q ++ A+ID++ + MRLG FD + Y +L
Sbjct: 284 GLDVECGDYVFANPLLNAYKQYMVSAAEIDSAAYRVLRARMRLGMFDDPEKNPYNHLSPE 343
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ +H +LA EAARQ IVLLKN LPLN IK++A+VG NA G+Y GTP
Sbjct: 344 IVGCKKHHDLALEAARQSIVLLKNQQNTLPLNAQKIKSIAVVG--INAANCEFGDYSGTP 401
Query: 421 CRY-TSPMDGF 430
S +DG
Sbjct: 402 VNAPVSVLDGI 412
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 143/286 (50%), Gaps = 51/286 (17%)
Query: 462 KNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDIN 521
+ +D + V G++ S+E EG+DR + LP Q I + A P T+V++ AG+ +
Sbjct: 586 RESDVVIAVMGINQSIEREGQDRNSIELPKDQQIFIREAYKA--NPNTIVVLVAGS-SMA 642
Query: 522 FAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP-LR 580
+ I +I+ YPGE+GG AIA+V+FG YNP GRLP+T+Y + +P
Sbjct: 643 IGWMDQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS------IEDLPAFD 696
Query: 581 PVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
N RTY +F+G +Y FGYGLSYT+F Y+ ++ + +D Q +N++
Sbjct: 697 DYNVKNNRTYMYFEGKPLYAFGYGLSYTKFDYR--------NLNIKQDTQNVTLNFS--- 745
Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP--GIAGTHIKQVIGY 698
++N GK +G EV VY K P GI T +KQ+ G+
Sbjct: 746 -------------------------IKNSGKYNGDEVAQVYVKFPDQGIK-TPLKQLKGF 779
Query: 699 ERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGE 743
+RV I G + ++ + + L++ D+ SG + +VG+
Sbjct: 780 KRVHIKKGATEQISIEI-PKEELRLWDDQKKQFYTPSGTYHFMVGK 824
>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
Length = 875
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 158/434 (36%), Positives = 232/434 (53%), Gaps = 44/434 (10%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ + LP +R ++L+ +T+ EK+ M D + VPRL +P Y WW+EALHGV+ G
Sbjct: 23 YPFRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWWNEALHGVARAGT 82
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG------- 124
AT FP I A+++ K + +S EARA YN
Sbjct: 83 ----------------ATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSFDEASKT 126
Query: 125 --NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
GLTFW+PNIN+ RDPRWGR ET GEDPY+ + V+GLQ +D
Sbjct: 127 GRYEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQG---------NDP 177
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
+ K AC KH+A + W +R +++ V+++D+ ET++ F+ V EG+V VMC+
Sbjct: 178 KYFKTHACAKHFAVHSGPEW---NRHSYNAEVSKRDLYETYLPAFKSLVLEGNVREVMCA 234
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARV 300
YN +G P CA LLN+ +RG W + G +VSDC ++ + H D K A A
Sbjct: 235 YNAFDGQPCCASNTLLNEILRGKWKYDGMVVSDCWALADFYQEKYHGTHPDEKSTA-ADA 293
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLG 358
LK DL+CGD Y N ++ G I E DID S+R + LG D S + +
Sbjct: 294 LKHSTDLECGDTYNNLNK-SLAGGLITEKDIDISMRRILKGWFELGMLDPKSSVLWNQIP 352
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ + + +H + A + A++ IVL+KN+N LP N NIK +A+VGP+A+ +GNY G
Sbjct: 353 YSVVDSDEHKKQALKMAQKSIVLMKNENNILPFNK-NIKKIAVVGPNADDEMMQLGNYNG 411
Query: 419 TPCRYTSPMDGFYA 432
TP + ++G A
Sbjct: 412 TPSSIVTILEGIKA 425
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 136/300 (45%), Gaps = 53/300 (17%)
Query: 459 DAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPV 508
+ K+AD V GL S+E E G D+ + LP Q EL+ ++ K PV
Sbjct: 597 EKVKDADVIVFAGGLSPSLEGEEMLVNAEGFKGGDKTSIELPKVQRELLAELRKTGK-PV 655
Query: 509 TLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN 568
V+ + ++ + + N + W G G+ GG A+ADV+ G YNP GRLP+T+Y+ N
Sbjct: 656 VFVLCTGSSLGLEQDEKNYDVLLNAWYG--GQSGGTAVADVLAGDYNPSGRLPVTFYK-N 712
Query: 569 YVKIPYTSMPLRPVNNFP-----GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
++ F GRTY++ +Y FG+GLSY++F Y A
Sbjct: 713 LEQLDNALSKTSKHQGFENYDMQGRTYRYMTENPLYAFGHGLSYSKFNYGNA-------- 764
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KL K+ P ++I + V N+ DG EVV VY K
Sbjct: 765 KLSKNSIS------------PNEDIII------------TVPVTNISDRDGEEVVQVYVK 800
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
+K + +ERV I + ++ + T++ +S K D A+ L++ SG +TIL G
Sbjct: 801 RNNDVLAPVKTLRAFERVLIRSKETKNIQLTISK-ESFKFYDEKADDLISKSGDYTILYG 859
>gi|373956830|ref|ZP_09616790.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373893430|gb|EHQ29327.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 823
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 225/799 (28%), Positives = 365/799 (45%), Gaps = 133/799 (16%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
Y D+ P R DL+ +MTL EK Q+ L YG R+ +P EW W
Sbjct: 73 YEDSTQPIEARLNDLIGQMTLEEKTCQLATL-YGYKRILKDSVPTPEWKNEIWKDGIANI 131
Query: 61 -EALHGVSFIGRRTNSPPGTHFDSEVPG-------------------------------- 87
E L+G G+ ++ P T V
Sbjct: 132 DEHLNGFITWGKTSDLPLVTDVKKHVWAMNQTQRFFIEQTRLGIPVDFTNEGIRGVEAYQ 191
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
AT+FPT + ++++ L ++G EARA+ G T ++P ++V RD RWGR+
Sbjct: 192 ATAFPTQLNMGMTWDKPLVNQMGNITGMEARAL------GYTNVYAPILDVARDQRWGRL 245
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
E GEDPY+V R + +G+Q +I+A KH+A Y +
Sbjct: 246 EEVYGEDPYLVARLGVEMAKGMQQNN-------------QIAATAKHFAVYSANKGGREG 292
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
D +V ++++ + PF+ + E + VM SYN +GIP L Q +R ++
Sbjct: 293 LARTDPQVAPREVENILLYPFKKVIKEAGLMGVMSSYNDYDGIPISGSSYWLIQRLRQEF 352
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAVQ 322
F GY+VSD D+++ + H D K DAV + AG+++ D + V+
Sbjct: 353 GFKGYVVSDSDALEYLYNKHHVAADLK-DAVYQAFMAGMNVRTTFRTPDSIIIYARQLVK 411
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICN-PQHIELAAEAARQGIVL 381
+GK+ I++ +R + V +LG FD + N + +A +A+++ IVL
Sbjct: 412 EGKLPIDTINSRVRDVLRVKFKLGLFDHPYVQDAEASAKLVNCAANQAVALQASKESIVL 471
Query: 382 LKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVIN 438
LKN LPL+ +TLA++GP+A +Y + + ++G A KV+
Sbjct: 472 LKNKGAILPLSKQ--QTLAVIGPNALNDDYAHTHYGPLASKSINILEGIQAKVGAGKVL- 528
Query: 439 YAPGC---------ADIVCQN-----NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
YA GC ++I+ Q+ + I +A+ A++AD V+V G + E K R
Sbjct: 529 YALGCNLVDKHWPESEILPQDPDQAEQAKIDSAVTIARHADVAVVVLGGNTQTAGENKSR 588
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
L LPG+Q L+ V K PV +V++ + + IN+ + I I++ GYPG +GG
Sbjct: 589 TSLDLPGYQLRLVKAVKATGK-PVVVVLIGSQPMTINWIDQH--IDGIIYAGYPGTQGGT 645
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYG 604
A+ADV+FG YNPGG+L +T + + ++P+ + P +P + G ++YPFG+G
Sbjct: 646 AVADVLFGDYNPGGKLTLT-FPKSVGQLPF-NFPTKPNSETDEGELAKIKG-LLYPFGFG 702
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSYT F Y D+K+ Q N TV CK
Sbjct: 703 LSYTTFAYS--------DLKISPAIQSDQGNVTVS---------------CK-------- 731
Query: 665 EVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
V N GK+ G EVV +Y + T+ K + G++R+ + G++ +V FT+ LK+
Sbjct: 732 -VTNTGKVAGDEVVQLYLRDVLSTVTTYEKVLRGFDRLSLKPGETKEVMFTI-VPDDLKL 789
Query: 724 VDNAANSLLASGAHTILVG 742
+ ++ G ++VG
Sbjct: 790 YNRQMKYVVEPGEFKVMVG 808
>gi|315606832|ref|ZP_07881841.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315251497|gb|EFU31477.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 858
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 168/478 (35%), Positives = 244/478 (51%), Gaps = 42/478 (8%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S+ PYC+ L ERA+DL+ R+TL EK + M D + +PRLG+ + WWSEAL
Sbjct: 14 SLSATAQLLPYCNPALSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEAL 73
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
HG + +G G T FP + ASFN+ L +++ S E RA YN
Sbjct: 74 HGAANMG----------------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNR 117
Query: 123 -LGNAG-------LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
+ N G L+ W+PN+N+ RDPRWGR ET GEDPY+ VRGLQ E
Sbjct: 118 RMLNGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETA 177
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
+Y K+ AC KHYA + + + D V+ +D+ ET++ F+ V E
Sbjct: 178 KYR--------KLWACAKHYAVHSGPEYTRHTANVAD--VSPRDLWETYLPAFKTLVTEA 227
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
V VMC+Y R++ P C++ +LL Q +R +W F+ +VSDC ++ I +HK +D
Sbjct: 228 KVREVMCAYQRLDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVH 287
Query: 295 DAVARVLKAGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP- 352
A AG D++CG Y T+ AV++G I EA++D + L LG D
Sbjct: 288 AAAK-AAVAGTDVECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMDDPKL 346
Query: 353 -QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
++ + + + + H +LA + ARQ +VLL+N G LPL G + ++GP+A+
Sbjct: 347 VEWSKIPASVMDSKAHRQLALDMARQSLVLLQNKGGVLPLKAGG-DPITVIGPNADDGPM 405
Query: 412 MIGNYEGTPCRYTSPMDGFYAYSKVINYAPGC--ADIVCQNNSMIPAAIDAAKNADAT 467
M GNY GTP R + +DG A + Y GC D N+ + AID K T
Sbjct: 406 MWGNYNGTPNRTVTILDGIKARHTRVTYLKGCDLTDTKTVNSLLPQCAIDGRKGLRGT 463
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 121/286 (42%), Gaps = 57/286 (19%)
Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
A I + V V G+ ++E E G DR ++ LP Q + + + +A K
Sbjct: 591 AIIRKLQGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK 650
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
T+V ++ I +IL Y G+EGG A++DV+FG NP G+LP+T+Y
Sbjct: 651 ---TVVFVNCSGSAIALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFY 707
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ Y +R GRTY++F P ++ FGYGLSYT F++ A +
Sbjct: 708 KRTDQLPDYEDYSMR------GRTYRYFSDP-LFAFGYGLSYTTFRFGRARA-------- 752
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+ + + + + N G G EVV VY +
Sbjct: 753 ----------------------------EAAEGGYRLSVPLTNTGTRPGEEVVQVYIRRV 784
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
+K + + RV + AG+S V ++ KS + D + N++
Sbjct: 785 ADTNGPLKSLRAFRRVALKAGESTTVEIPLSR-KSFECFDESTNTM 829
>gi|299149090|ref|ZP_07042152.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298513851|gb|EFI37738.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 1049
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 218/767 (28%), Positives = 358/767 (46%), Gaps = 100/767 (13%)
Query: 16 DAKLPYPERA----KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG- 70
++KLP+ A KDL+ RMT+ EK+ Q+ G L P E+ S++L +G
Sbjct: 328 NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386
Query: 71 ----------RRTNSPPGTHFDSEVP----------GATSFPTVILTTASFNESLWKKIG 110
R H ++P T FPT + + S++ + ++
Sbjct: 387 VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446
Query: 111 QTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
+ + E+ A AGL + ++P +++ RD RWGRV+E GED Y+ A V G Q
Sbjct: 447 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ +S + AC KH+ AY L G D D ++E+ + +T++ PF+
Sbjct: 501 ---WNLWENNS------VLACAKHWVAYGLPQ-AGRDYAPVD--MSERTLFDTYLPPFKA 548
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
C++ G V + M ++N +NGIP A P LL +RG WNF+G++VSD ++++ +V
Sbjct: 549 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVAQGVAE 607
Query: 290 NDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
+D +DA +G+D+D D Y + ++ GKI+ D+D S+ + + LG F
Sbjct: 608 DD--KDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665
Query: 349 DGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
++ N I + ++ A + A + VLLKNDN LPL N++++A+VGP A
Sbjct: 666 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724
Query: 407 NATKAMIGNY--EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
+ ++G++ G T+ + G + YA GC D ++ S A+
Sbjct: 725 DNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAVKL 783
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A +D + V G + E + R L LPG Q ELI ++ K PV +V+M+ + I
Sbjct: 784 ASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPLSI 842
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYT-SMP 578
+ N + +IL + G G AIAD++FG YNP GRL I++ V + Y
Sbjct: 843 EWVDKN--VSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYKKS 900
Query: 579 LRPVNNFPGRTYKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
RP + T + D P +YPFGYGLSYT F Y S+P+S + + +
Sbjct: 901 GRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SAPQSTQKEYTRQET------ 951
Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQV 695
+ + V N G DG E V +Y + +K++
Sbjct: 952 -----------------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVKEL 988
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++++F+ AG+S V F ++ +L D A N ++ G I+ G
Sbjct: 989 KAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034
>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
Length = 896
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 157/431 (36%), Positives = 230/431 (53%), Gaps = 44/431 (10%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ + LP ER ++L+ +T EK+ M D + VPRL +P Y WW+EALHGV+ G
Sbjct: 44 YPFRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAGI 103
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG------- 124
AT FP I A+++ K + +S EARA YN
Sbjct: 104 ----------------ATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKT 147
Query: 125 --NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
GLTFW+PNIN+ RDPRWGR ET GEDPY+ + V+GLQ +D
Sbjct: 148 GRYEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQG---------NDP 198
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
+ K AC KH+A + W +R +++ ++++D+ ET++ F+ V EG+V VMC+
Sbjct: 199 KFFKTHACAKHFAVHSGPEW---NRHSYNAEISKRDLYETYLPAFKALVQEGNVREVMCA 255
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARV 300
YN +G P CA+ LL + +RG W + G +VSDC ++ + H D K A A
Sbjct: 256 YNAFDGQPCCANNTLLTEILRGKWKYDGMVVSDCWALADFFQKKYHGTHPDEKTTA-ADA 314
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLG 358
LK DL+CGD Y N ++ G I E DID S+R + LG D S + +
Sbjct: 315 LKHSTDLECGDTYNNLNK-SLASGLITEKDIDESMRRILKGWFELGMLDPKSSVHWNTIP 373
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ + + +H + A + A++ IVL+KN+ LPLN NIK +A+VGP+A+ +GNY G
Sbjct: 374 YSVVDSEEHKKQALKMAQKSIVLMKNEKNILPLNR-NIKKIAVVGPNADDGLMQLGNYNG 432
Query: 419 TPCRYTSPMDG 429
TP + +DG
Sbjct: 433 TPSSIVTILDG 443
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 134/300 (44%), Gaps = 53/300 (17%)
Query: 459 DAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPV 508
+ KNAD V GL S+E E G D+ + LP Q +L+ ++ K PV
Sbjct: 618 EKVKNADVIVFAGGLSPSLEGEEMMVNAEGFKGGDKTSIALPKVQRDLLAELRKTGK-PV 676
Query: 509 TLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN 568
V+ + A+ + + N ++L Y G+ GG A+ADV+ G YNP G+LPIT+Y+ N
Sbjct: 677 VFVLCTGSALGLEQDEKN--YDALLNAWYGGQSGGTAVADVLAGDYNPSGKLPITFYK-N 733
Query: 569 YVKIPYTSMPLRPVNNFP-----GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
++ F GRTY++ +YPFG+GLSY++F Y D
Sbjct: 734 LEQLDNALSKTSKHEGFENYDMQGRTYRYMTEKPLYPFGHGLSYSKFVYG--------DS 785
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KL K+ + N T+ I V N+ + +G EVV VY K
Sbjct: 786 KLSKNSISVNENVTI------------------------TIPVTNISEREGEEVVQVYIK 821
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS-GAHTILVG 742
A +K + +ER I + ++ + ++ S D A+ L++ G +TI G
Sbjct: 822 RNNDAQAPVKTLRAFERTPIKSKETKNIQLILSK-DSFAFYDEKADDLVSKPGDYTIFYG 880
>gi|299144988|ref|ZP_07038056.1| xylosidase [Bacteroides sp. 3_1_23]
gi|298515479|gb|EFI39360.1| xylosidase [Bacteroides sp. 3_1_23]
Length = 800
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 230/800 (28%), Positives = 361/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNI 114
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T +SP +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYSPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + GLQ+ EG I A KH+A Y + +
Sbjct: 229 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H ++ +AA + IV
Sbjct: 389 NEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPEAVVHNDAHKAVSMKAALESIV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+N LPL+ N +A++GP+ K + Y + G Y + +
Sbjct: 449 LLKNENQMLPLSK-NFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
Y GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YVKGC-DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G+ DG V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKGKVR--VDG-VLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ K +G + T
Sbjct: 679 GLSYTTFGYS--------DLKISKP--------VIGPQE----------------NITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ V FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNRFTVEPGSFSVMVG 785
>gi|313204584|ref|YP_004043241.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443900|gb|ADQ80256.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 727
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 220/741 (29%), Positives = 349/741 (47%), Gaps = 105/741 (14%)
Query: 7 VKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGV 66
V + FP+ + LP ER +L+ MTL EKV + GVPRLG+ SE LHG+
Sbjct: 20 VSQTTFPFQNTGLPDNERLDNLLSLMTLDEKVNALST-NLGVPRLGI-RNTGHSEGLHGM 77
Query: 67 SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTA-----SFNESLWKKIGQTVSTEAR--- 118
+ G PG SE A ++PT I A +++ L +K+ +TE R
Sbjct: 78 ALGG------PGNWGGSERGVAKTYPTTIFPQAYGLGETWDTELIQKVADIEATEIRFYA 131
Query: 119 AMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHR 178
NL G+ +PN ++ RDPRWGR E+ GED ++ R + +V+GLQ
Sbjct: 132 QNANLQKGGMVMRAPNADLARDPRWGRTEESYGEDAFLGSRLTVAFVKGLQ--------- 182
Query: 179 DSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSS 238
+D + K ++ KH+ A ++ + +FD R+ +E + PF + EG +
Sbjct: 183 GNDPKYWKSASLMKHFLANSNEDGRDSTSSNFDERL----FREYYSFPFYKGITEGGSRA 238
Query: 239 VMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVA 298
M SYN NG+P +P +L + R +W +G I +D ++ +V +H E A A
Sbjct: 239 FMASYNAWNGVPMTVNP-ILKKIARDEWGNNGIICTDGGALSLLVNAHHAFPTLTEGAAA 297
Query: 299 RVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ---YK 355
V+KA + D + ++ A+++G + E +ID +R + V ++LG D Y
Sbjct: 298 -VVKASVG-QFLDNFRSYIYEALKKGLLTEKNIDNVIRGNFYVALKLGLLDADQSKVPYT 355
Query: 356 NLGKNNICNPQHIE----LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
+G + +P + + + + +VLLKN G LPLN IK++A++GP AN +
Sbjct: 356 GIGVTDTVSPWNKQDTKAFVRKVTAKSVVLLKNTAGLLPLNKSKIKSIAVIGPRAN--EV 413
Query: 412 MIGNYEGTPCRYTSPMDGFY-AYSKVIN--YAPGCADIVCQNNSMIPAAIDAAKNADATV 468
++ Y GTP S + G A K I YAP ++ M A + AA+ AD +
Sbjct: 414 LLDWYSGTPPYAVSILQGIKNAVGKDIEVFYAP--------SDEMDKATL-AARKADVAI 464
Query: 469 IVAG---------LDLS-VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAV 518
+ G +S V ++G++ VD + E + K+ A +V++S
Sbjct: 465 VCVGNHPYGTDARWKISPVPSDGREAVDRKSITLEQEDLVKLVMQANPKTVMVLVSNFPF 524
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
IN+++ N + +IL V +E G +ADVIFG +P GR TW ++ +P P
Sbjct: 525 AINWSQEN--VPAILHVTNNSQELGNGLADVIFGDVSPAGRTTQTWVKS-ITDLP----P 577
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
+ + GRTY++F +YPFG+GLSYT F+Y +
Sbjct: 578 MMDYDIRHGRTYQYFKSKPLYPFGFGLSYTSFEYS-----------------------GL 614
Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIG 697
T+ P + VK K N+GK DG EV+ +Y S P +KQ+ G
Sbjct: 615 ETSNPTLTDSIFVSVKVK-----------NIGKRDGDEVIQLYVSYPDSKVERPMKQLKG 663
Query: 698 YERVFIAAGQSAKVGFTMNAC 718
++RVFI AG+S V + A
Sbjct: 664 FKRVFIPAGKSKTVEIPLKAS 684
>gi|423301682|ref|ZP_17279705.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
CL09T03C10]
gi|408471675|gb|EKJ90206.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
CL09T03C10]
Length = 1365
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 234/807 (28%), Positives = 355/807 (43%), Gaps = 162/807 (20%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDL--------------------------- 44
PY A LP ER KDL++RMT EK+ Q+ +
Sbjct: 534 LPYQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERKLEEKAQGMSWGF 593
Query: 45 AYGVP---------------------RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
G P RLG+P++ +E+LHGV
Sbjct: 594 VEGFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGVVH--------------- 637
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
GAT FP I ++F+ L + ++ E A+ SP I+VVRD RW
Sbjct: 638 --EGATVFPQNIALGSTFDTDLAYRKTSMIADELHAV-----GMRQVLSPCIDVVRDLRW 690
Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
GRV E+ GEDPY+ GR+ I V+G D IS KHY +
Sbjct: 691 GRVEESFGEDPYLCGRFGIAEVKGYMDN--------------GISPMLKHYGPH------ 730
Query: 204 GNDRFHFDSRVTE---QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
GN + E +D+ E ++ PFEM + + +VM +YN N IP A LL
Sbjct: 731 GNPLSGLNLASVETSIRDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLLTD 790
Query: 261 TIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA 320
+R +W F GY+ SD +I+ + H F E+A + L AGLD++ G
Sbjct: 791 VLRKEWGFKGYVYSDWGAIEMLKNFH-FTARNSEEAALQALTAGLDVEASSDCYPAIPGL 849
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
+++G++ +D ++R + R+G FD P + K I + + I L+ + A + V
Sbjct: 850 IERGELNREIVDEAVRRVLYAKFRIGLFD-DPYGEKFAKGAIHSGKAIALSKKIADESTV 908
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-PCRY-TSPMDGFYAYSKV-- 436
LLKND LPL+ G +K++A++GP NA + G+Y T R+ +P+ G ++
Sbjct: 909 LLKNDRQLLPLSIGKLKSIAVIGP--NADQIQFGDYTWTRDNRFGVTPLQGIRKWAGTNV 966
Query: 437 -INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---------LDLSVEAEGKDRVD 486
+NY GC+ +V + S I A++AA+ +D V+ G S EG D D
Sbjct: 967 KVNYVKGCS-LVSMDESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGFDLND 1025
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
L L G Q LI V K PV LV+++ I + K N I +IL Y GE+ G +I
Sbjct: 1026 LTLTGAQPALIKAVQATGK-PVILVLVTGKPFAIPWEKKN--IPAILVQWYAGEQSGNSI 1082
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF---------PGRTYKFFDGPV 597
AD++FGK +P GRL ++ E+ +P LR F PGR Y F PV
Sbjct: 1083 ADILFGKVSPSGRLTFSFPEST-GHLPVFYNHLRSDRGFYKSPGSYDSPGRDY-VFSAPV 1140
Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
++ FG+GL+YT F+Y L D+ +N TV
Sbjct: 1141 PLWSFGHGLTYTTFEYS----------NLQTDRTSYLLNDTVHV---------------- 1174
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
+I+++N GK +G EVV +Y S + Q+ + +V + AG++ V ++
Sbjct: 1175 ------RIDLKNTGKREGKEVVQLYVSDVYSSVAMPVHQLRDFRKVALQAGETQTVRLSI 1228
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
L I++ +++ G I VG
Sbjct: 1229 -PVSELTILNEKNEAIVEPGEFEIQVG 1254
>gi|237718444|ref|ZP_04548925.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
gi|229452377|gb|EEO58168.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
Length = 746
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 219/767 (28%), Positives = 355/767 (46%), Gaps = 100/767 (13%)
Query: 16 DAKLPYPERA----KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
++KLP+ A KDL+ RMT+ EK+ Q+ G L P E+ S++L +G
Sbjct: 25 NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 83
Query: 72 ---------------------RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIG 110
R P D T FPT + + S++ + ++
Sbjct: 84 VLNISGAKTLRDLQEKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 143
Query: 111 QTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
+ + E+ A AGL + ++P +++ RD RWGRV+E GED Y+ A V G Q
Sbjct: 144 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 197
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
++ ++ L AC KH+ AY L G D D ++E+ + +T++ PF+
Sbjct: 198 ------WNLWENNSVL---ACAKHWVAYGLPQ-AGRDYAPVD--MSERTLFDTYLPPFKA 245
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
C++ G V + M ++N +NGIP A P LL +RG WNF+G++VSD ++++ +V
Sbjct: 246 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVAQGVAE 304
Query: 290 NDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
+D +DA +G+D+D D Y + ++ GKI+ D+D S+ + + LG F
Sbjct: 305 DD--KDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 362
Query: 349 DGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
++ N I + ++ A + A + VLLKNDN LPL N++++A+VGP A
Sbjct: 363 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 421
Query: 407 NATKAMIGNY--EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
+ ++G++ G T+ + G + YA GC D ++ S A+
Sbjct: 422 DNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAVKL 480
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A +D + V G + E + R L LPG Q ELI ++ K PV +V+M+ + I
Sbjct: 481 ASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPLSI 539
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYT-SMP 578
+ N + +IL + G G AIAD++FG YNP GRL I++ V + Y
Sbjct: 540 EWVDKN--VSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYKKS 597
Query: 579 LRPVNNFPGRTYKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
RP + T + D P +YPFGYGLSYT F Y V S + Y
Sbjct: 598 GRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSYSVPQSTQK--------------EY 643
Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQV 695
T + + + V N G DG E V +Y + +K++
Sbjct: 644 T------------------RQETISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVKEL 685
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++++F+ AG+S V F ++ +L D A N ++ G I+ G
Sbjct: 686 KAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 731
>gi|218132023|ref|ZP_03460827.1| hypothetical protein BACEGG_03648 [Bacteroides eggerthii DSM 20697]
gi|217985783|gb|EEC52123.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 762
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 233/781 (29%), Positives = 356/781 (45%), Gaps = 125/781 (16%)
Query: 20 PYPERAKDLVERMTLPEKVQQMGDLAYGVPRL-GLPLYEWWSEALHGVSF---IGRR--- 72
P R DL++RMTL EK+ QM DL + + G L G+S+ G R
Sbjct: 34 PVEVRVADLLKRMTLEEKIAQMQDLKFKDFSVDGKVDTVKMDSVLKGMSYASVFGSRLSV 93
Query: 73 ---------TNSPPGTHFDSEVP--------------GATSFPTVILTTASFNESLWKKI 109
N H +P GAT FP I +++FN + ++
Sbjct: 94 EQMQESMFAINKYMAEHNRLGIPVLGEAESLHGLIHDGATIFPQSIALSSTFNPDITHRV 153
Query: 110 GQTVSTEARAMYNLGNAGL-TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGL 168
++ EA+A G+ SP +++ R+ RWGRV ET GEDPY+VGR + YV
Sbjct: 154 ATVIAQEAKA------TGVDQVLSPVLDLARELRWGRVEETYGEDPYLVGRMGVAYVSAF 207
Query: 169 QDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVT--EQDMQETFILP 226
EGV KH+ A+ N + VT E+D++ ++ P
Sbjct: 208 NK-EGV-------------MTTLKHFLAHGSPTGGLNL-----ASVTGCERDLRSLYLKP 248
Query: 227 FEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESH 286
F+ + E SVM SYN +P A +L+ +RG+ F GYI SD S++ + H
Sbjct: 249 FQDVMREAMPYSVMNSYNSYESVPVAASHWILDDILRGEMGFKGYISSDWGSVEMLRSLH 308
Query: 287 KFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRL 345
D K DA + + AG+D++ GD Y V+ G + E +ID + + +
Sbjct: 309 HTAKD-KADAACQAVIAGVDVEVDGDCYETLD-SLVRSGVLPEKEIDKCVSRVLTAKFAM 366
Query: 346 GYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPH 405
G FD + + P+ +ELA AAR+ +L+KN+N LPL+ ++++A++GP
Sbjct: 367 GLFDKDYTKRANLSQTVHTPEAVELALVAARESAILVKNENSLLPLDANKLRSVAVIGP- 425
Query: 406 ANATKAMIGNYEGTPCRY--TSPMDGFYAYSK---VINYAPGCADIVCQNNSMIPAAIDA 460
NA + G+Y T +P+ G A ++ INYA GC +I Q+ S A+ A
Sbjct: 426 -NAAQVQFGDYMWTNSNEYGITPLQGIEAVTQGKVKINYAKGC-EIHTQDRSGFSQAVTA 483
Query: 461 AKNADATVIVAGLDL---------SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
A+N+D ++ G SV E D D+ LPG Q LI V A G T+V
Sbjct: 484 ARNSDVALLFVGAMSGSPGRPWPNSVSGESFDLSDISLPGCQEALIRAV--KATGKPTIV 541
Query: 512 IMSAGA-VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-Y 569
++ AG I + K+N + + W Y GE+ GRAIA+++FG+ NP GRL +++ ++ +
Sbjct: 542 VLVAGKPFAIPWVKDNCEAVIVQW--YGGEQEGRAIAEILFGEVNPSGRLNVSFPQSTGH 599
Query: 570 VKIPYTSMPL-------RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
+ + Y P PGR Y F V+ FG+GLSYT FKY
Sbjct: 600 LPVFYNYYPSDKGFYHDHGTLEKPGRDYVFSSPDPVWAFGHGLSYTTFKY---------- 649
Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY- 681
K Q + +T DD C+ +EV N GK DG EVV +Y
Sbjct: 650 ----KSMQISNKEFT-------------DDDTCE-----ITVEVANTGKRDGKEVVQLYV 687
Query: 682 SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
+ T +K++ +E+VFI AG++ V F + K L + + ++ G + V
Sbjct: 688 NDIVSSVVTPVKELRRFEKVFIPAGETRTVKFNL-PIKELALWNTDMKEVVEPGDFELQV 746
Query: 742 G 742
G
Sbjct: 747 G 747
>gi|383117091|ref|ZP_09937838.1| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
gi|382973702|gb|EES87886.2| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
Length = 805
Score = 268 bits (685), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 234/813 (28%), Positives = 354/813 (43%), Gaps = 153/813 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P R + L+ +MTL EKV QM + LG P+YE
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 93
Query: 58 ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
W LH S R +N H +P
Sbjct: 94 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 153
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 154 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 208
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q E D S + A KH+A+Y
Sbjct: 209 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 257
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G + SVM SYN ++G P LL
Sbjct: 258 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 316
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD ++ + E ND +A + + AG+D D G + Y + A
Sbjct: 317 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 374
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A A ID ++R + + ++G FD + + + +H LA E ARQ IV
Sbjct: 375 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 434
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN + LPL +I+TLA++GP+A+ M+G+Y +GT + +
Sbjct: 435 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 493
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI+ A+NAD V+V G D S E
Sbjct: 494 RVLYAKGCA-VRDSSRTGFKDAIETARNADTVVMVMGGSSARDFSSEYEETGAAKVTINQ 552
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +++ K PV LV++ + + A +
Sbjct: 553 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVD 611
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
W YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 612 AW--YPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGNRSRY 663
Query: 593 FDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+ P YPFGYGLSYT F Y D+K + T G++
Sbjct: 664 VEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD--------- 697
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSA 709
D + ++N G DG EV +Y + + T KQ+ + R+ + AG+S
Sbjct: 698 ------DCWVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESR 751
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+V FT++ KSL + ++ G TI+VG
Sbjct: 752 EVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
Length = 863
Score = 268 bits (685), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 171/456 (37%), Positives = 238/456 (52%), Gaps = 49/456 (10%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY + L ERA DLV R+TL EK M + + +PRLG+ Y+WW+EALHGV G
Sbjct: 25 PYKNPALTPEERAADLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRAGL- 83
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN------- 125
AT FP I ASFN L + VS EARA +
Sbjct: 84 ---------------ATVFPQAIGMGASFNNDLLYDVFTAVSDEARAKTAEFSKEGGLKR 128
Query: 126 -AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLT W+PN+N+ RDPRWGR ET GEDPY+ G+ + VRGLQ EG +Y
Sbjct: 129 YQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGGKYD------- 181
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ AC KH+A + W +R FD+ V +D+ ET++ F+ V + V VMC+Y
Sbjct: 182 -KLHACAKHFAVHSGPEW---NRHSFDAENVDPRDLWETYLPAFKDLVQKAHVKEVMCAY 237
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARVL 301
NR G P C +LL Q +R +W + G IVSDC +I +H+ D KE A A+ +
Sbjct: 238 NRFEGEPCCGSNRLLVQILRDEWAYDGIIVSDCWAINDFFNKGAHETEPD-KEHASAKAV 296
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
G D++CG+ Y + AV+ G I E ID SL+ L LG D + +
Sbjct: 297 LTGTDVECGESYASLPQ-AVKAGLIDEKKIDISLKRLMKARFELGEMDNPELVSWAQIPY 355
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ + + +H ELA AR+ +VLL+N+ LPLN ++K +A+VGP+AN + GNY G
Sbjct: 356 SVVDSKEHRELALRMARESLVLLQNNQNVLPLNK-SLK-VAVVGPNANDSVMQWGNYNGF 413
Query: 420 PCRYTSPMDGFYAY--SKVINYAPGC---ADIVCQN 450
P + ++G Y + Y PGC +D+ Q+
Sbjct: 414 PGHTVTLLEGIRQYLPEAQLIYEPGCDLTSDVTLQS 449
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 136/299 (45%), Gaps = 59/299 (19%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
I K+AD V G+ +VE E G DR + LP Q+ L+ ++ A K
Sbjct: 594 IQRVKDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLLAELKKAGK-K 652
Query: 508 VTLVIMSAGAVDINFAKNNPKIKS---ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
V V S A+ + P+ K+ IL YPG+ GG AIA+V+FG YNP GRLP+T+
Sbjct: 653 VVFVNFSGSAIALT-----PETKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTF 707
Query: 565 YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIK 624
Y++ + +P + GRTY++ ++PFG+GLSYT F+Y AS
Sbjct: 708 YKST------SQLPDFEDYSMKGRTYRYMAEAPLFPFGHGLSYTTFRYGDASL------- 754
Query: 625 LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP 684
Q+ ++ + T I V N G+ DG EVV VY +
Sbjct: 755 --STQEVKEGEQAILT-----------------------IPVSNTGERDGEEVVQVYLRR 789
Query: 685 PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
PG + ++RV IA G + V +++ + + D N++ G + IL G
Sbjct: 790 PGDKEGPSHALRAFKRVNIAKGTTGNVTISLSK-EDFEWFDTETNTMRPIEGDYEILYG 847
>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
17393]
gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 814
Score = 268 bits (685), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 219/727 (30%), Positives = 335/727 (46%), Gaps = 117/727 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ E HG IG T FPT I +++N L +++
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------------TTVFPTSIGQASTWNPELIRRM 190
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G+ ++TEA A + P +++ RDPRW RV ET GED Y+ G V+G Q
Sbjct: 191 GRAIATEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQ 245
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
E+ R K+ A KH+AAY W + V ++M+E PF
Sbjct: 246 G----EFPRTKG----KVIATLKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFRE 294
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V G +S VM SYN ++GIP A+ LL ++ W F G++VSD +I + E +
Sbjct: 295 AVAAGALS-VMSSYNEIDGIPCTANSNLLTGLLKERWQFKGFVVSDLYAIGGLREHG--V 351
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
DT +A + + AG+D D G + Y + AV++G + E I+ ++ + + +G F
Sbjct: 352 ADTDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLF 411
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
D + + + + +H+ELA E ARQ I+LLKN N LPLN KT+A++GP+A+
Sbjct: 412 DHPFVDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KTKTIAVIGPNADN 470
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAYSKVIN-----YAPGCADIVCQNNSMIPAAIDAA 461
M+G+Y + + +DG KV N YA GCA + + S AI+AA
Sbjct: 471 IYNMLGDYTAPQSESSVVTVLDGIR--QKVSNDTHIIYAKGCA-VRDSSKSGFQEAIEAA 527
Query: 462 KNADATVIVAG----LDLSVE-------------------AEGKDRVDLLLPGFQTELIN 498
+ +D V+V G D S + EG DR L L G Q ELI
Sbjct: 528 RQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIR 587
Query: 499 KVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGG 558
+V K P+ LV++ + + + ++ +I+ YPG +GG A+ADV+FG YNP G
Sbjct: 588 EVGKLNK-PIVLVLIKGRPLLLEGIE--AEVDAIVDAWYPGMQGGNAVADVLFGDYNPAG 644
Query: 559 RLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFF--DGPVVYPFGYGLSYTQFKYKVAS 616
RL I+ V +P+ G K+ +G YPFGYGLSYT F Y
Sbjct: 645 RLTIS------VPRSVGQLPVYYNTKRKGNRSKYIEEEGTPRYPFGYGLSYTSFNYS--- 695
Query: 617 SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSE 676
D+ V + C ++V N G DG E
Sbjct: 696 ----------------DLKAEVVEAEDSCLV-------------NISVKVRNEGSRDGDE 726
Query: 677 VVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASG 735
VV +Y + + T KQ+ G++R+ + G++ ++ F ++ KSL + + G
Sbjct: 727 VVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-KSLALYMQNEEWAVEPG 785
Query: 736 AHTILVG 742
T+++G
Sbjct: 786 RFTLMLG 792
>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
Length = 782
Score = 268 bits (685), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 220/723 (30%), Positives = 343/723 (47%), Gaps = 114/723 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG AT FPT I A+++ L K++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPELVKEV 170
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ ++ E R+ G + P +++ RDPRW RV ET GEDP + G + V GL
Sbjct: 171 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 225
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ S+ A KH+ AY + EG ++ S V +D+ + F+ PF
Sbjct: 226 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 274
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++GIP ++ LL Q +R +W F G++VSD SI+ I ESH F+
Sbjct: 275 AIDAGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 332
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
TKE+A + + AG+D+D G D YTN AVQ G++ + IDT++ + + +G F
Sbjct: 333 APTKENAAIQSVTAGVDVDLGGDAYTNLCH-AVQSGQMDKTVIDTAVCRVLRMKFEMGLF 391
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + + +HIELA + A+ I LLKN+N LPL+ I +A++GP+A+
Sbjct: 392 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-TINKVAVIGPNADN 450
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAIDAAKNA 464
M+G+Y + +DG + Y GCA I + I AI+AA+ +
Sbjct: 451 RYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCA-IRDTTVNEIEQAIEAARRS 509
Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
+ A V G +E EG DR L L G Q EL+ +
Sbjct: 510 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 569
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P+ +V + ++ N+A ++L YPG+EGG AIADV+FG YNP GRLP
Sbjct: 570 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 626
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
I+ + +IP P N+ Y +Y FGYG+SYT F+Y
Sbjct: 627 IS-VPRSVGQIPVYYNKKAPRNH----DYVEVSSSPLYSFGYGMSYTTFEYS-------- 673
Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
D++ + K +C ++++ +V+N GK DG EV +
Sbjct: 674 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 705
Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
Y + + +KQ+ +ER + G+ KV F + + +V+ ++ SG +
Sbjct: 706 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGTFQV 764
Query: 740 LVG 742
++G
Sbjct: 765 MIG 767
>gi|398386387|ref|ZP_10544389.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
gi|397718418|gb|EJK79007.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
Length = 791
Score = 268 bits (685), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 217/721 (30%), Positives = 342/721 (47%), Gaps = 109/721 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P+ + E LHG + +G ATSFP I +S++ ++ +++
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------------ATSFPQSIAMASSWDPTMLRQV 179
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
Q + E RA SP +++ RDPRWGR+ ET GEDPY+VG + V GLQ
Sbjct: 180 NQVIGREIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQ 234
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
EG R RP + A KH + N + V+E++++E F PFE
Sbjct: 235 G-EG----RSRLLRPGHVFATLKHLTGHGQPESGTN---VGPAPVSERELRENFFPPFEQ 286
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V + +VM SYN ++G+P+ A+ LL+ +R +W F G +VSD ++ ++ H
Sbjct: 287 VVKRTGIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIHHIA 346
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGA-VQQGKIAEADIDTSLRFLYIVLMRLGYF 348
+ E+A R L AG+D D + + T+G V++GK++EA +D ++R + + R G F
Sbjct: 347 ANL-EEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLF 405
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ N N + LA AA++ I LLKND G LPL T+A++GP +A
Sbjct: 406 ENPYADANAAAAITNNDEARALARTAAQRSITLLKND-GMLPLKPEG--TIAVIGP--SA 460
Query: 409 TKAMIGNYEGTPCRYTSPMDGFYAYSKV---INYAPGCA---------DIV-----CQNN 451
A +G Y G P S ++G A I +A G D V +N
Sbjct: 461 AVARLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITENDDWWEDKVVKSDPAENR 520
Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAAK 505
+I A++AA+N D ++ G EG DR L L G Q EL + + K
Sbjct: 521 KLIAQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDALKALGK 580
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
P+T+V+++ + K + + +IL Y GE+GG A+AD++FG NPGG+LP+T
Sbjct: 581 -PITVVLINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVT-- 635
Query: 566 EANYVKIPYTSMPLRPVNNF---PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
+P + L N R Y F +YPFG+GLSYT F S+P+
Sbjct: 636 ------VPRSVGQLPMFYNMKPSARRGYLFDTTDPLYPFGFGLSYTNFSL---SAPRLSA 686
Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
K +GT K + ++V N G +G EVV +Y
Sbjct: 687 TK-------------IGTGG----------------KTSVSVDVRNTGAREGDEVVQLYI 717
Query: 683 KPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
+ + T +K++ G++RV + G+S V FT+ ++L++ ++ ++ G I+
Sbjct: 718 RDKVSSVTRPVKELKGFQRVTLKPGESRTVTFTV-GPEALQMWNDQMRRVVEPGDFEIMT 776
Query: 742 G 742
G
Sbjct: 777 G 777
>gi|409730324|ref|ZP_11271901.1| beta-glucosidase [Halococcus hamelinensis 100A6]
gi|448724096|ref|ZP_21706609.1| beta-glucosidase [Halococcus hamelinensis 100A6]
gi|445786548|gb|EMA37314.1| beta-glucosidase [Halococcus hamelinensis 100A6]
Length = 747
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 208/706 (29%), Positives = 339/706 (48%), Gaps = 99/706 (14%)
Query: 86 PGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWG 144
P T+FP I +S++ L +++ + +E A+ G T SP ++V RD RWG
Sbjct: 88 PEGTTFPQSIGMASSWDPDLMRQVMERTRSEMAAI------GTTHALSPVLDVARDLRWG 141
Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
RV ET GEDPY+V A YV GLQ S ISA KH+AA+ G
Sbjct: 142 RVEETFGEDPYLVAAMASAYVAGLQ----------GPSIEDGISATLKHFAAHSASEG-G 190
Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
+R + V ++++ET + P+E + SVM +Y+ ++GIP+ ++ LL +RG
Sbjct: 191 KNRASVN--VGPRELRETHLFPYEAAITTAGAESVMNAYHDIDGIPSASNEWLLTDLLRG 248
Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQ 322
+ F G +VSD S+ + E H + +E AV L+AG+D++ D Y + A++
Sbjct: 249 ELGFDGTVVSDYYSVDFLREEHGVSDSDRESAVM-ALEAGIDVELPATDCYEHLPE-AIE 306
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
G+++EA +D ++R + + R G D S ++ + EL AAR+ IVLL
Sbjct: 307 NGELSEATLDEAVRRVLRMKFRKGLVDDSTVDASVAADAFNTEAATELTERAARESIVLL 366
Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY---------TSPMDGF--Y 431
KN+N LPL+ + +LA+VGP A+ + M+G+Y P Y T+P+D +
Sbjct: 367 KNENELLPLD--DTDSLAVVGPKADDGQEMMGDY-AYPAHYPEAEVSLDATTPLDAIRVH 423
Query: 432 AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV---AGLDLS------------ 476
A I Y GC + A AA V + +D S
Sbjct: 424 ADGTEIAYEEGCTTSGPSTDGFDAAVEAAAGADVTLAFVGARSAVDFSDPDAEDVTNPAL 483
Query: 477 -VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
EG D DL LPG QTEL+ +V + P+ +V++S I + ++ +++
Sbjct: 484 PTSGEGSDVTDLGLPGVQTELLERVHETGT-PLVVVVVSGKPHSIEWVAE--EVPAVVQA 540
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYKFFD 594
PGEEGG IADV+FG YNPGG LP++ + + + Y P N + + + +
Sbjct: 541 WLPGEEGGTGIADVLFGDYNPGGHLPVSLARSVGQLPVHYDRRP-----NSANKDHVYTE 595
Query: 595 GPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+Y FG+GLSYT+F+Y D ++ D T+G +
Sbjct: 596 SEPLYSFGHGLSYTEFEYD--------DFEVSTD--------TLGASG------------ 627
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
T + N+G GS+VV +Y ++ P A +++++G+ERV + AG+S ++
Sbjct: 628 ----SVTASVTATNVGGRGGSDVVQLYAHAESPDQA-RPVQELVGFERVSLDAGESTRIS 682
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
F ++A + L D N + G++ + VG ++ +N+N+
Sbjct: 683 FEIDATQ-LAYHDRDMNLRVHDGSYELRVGHSASDIAATGSVNINN 727
>gi|375149998|ref|YP_005012439.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064044|gb|AEW03036.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 875
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 160/455 (35%), Positives = 233/455 (51%), Gaps = 49/455 (10%)
Query: 5 IKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
++ + S FP+ + +L + +R DLV R+TL EKV QM + A G+PRL +P Y+WW+E LH
Sbjct: 21 LQAQNSKFPFQNYRLSFEDRVNDLVSRLTLEEKVAQMLNAAPGIPRLDIPAYDWWNETLH 80
Query: 65 GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
GV+ RT T FP I A+++ + ++ + E R ++N
Sbjct: 81 GVA----RTPY-----------NVTVFPQAIAMAATWDTAALYRMADCSALEGRVIHNKA 125
Query: 125 NA---------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
A GLT+W+PNIN+ RDPRWGR ET GEDPY+ A +VRGLQ
Sbjct: 126 IAAGKEKDRYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAALADAFVRGLQ------ 179
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
+D + LK +AC KHYA + + R FD VT D+ +T++ F+ V +
Sbjct: 180 ---GNDPKYLKAAACAKHYAVH---SGPEPSRHVFDVDVTPYDLWDTYLPSFKKLVTVSN 233
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
V+ VMC+YN P CA L+ +R W+F GY+ SDC +I +HK D
Sbjct: 234 VAGVMCAYNAFRKQPCCASDVLMTDILRNQWSFKGYVTSDCGAIDDFYRNHKTHPDAAAA 293
Query: 296 AVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQ 353
+ V G D+DCG+ + AV++ KI E ID S++ L+++ RLG FD +
Sbjct: 294 SADAVFH-GTDIDCGNEAYRALVQAVKENKITEKQIDISVKRLFMIRFRLGMFDPPSMVK 352
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
Y + + H + A A + IVLLKN N LPL G +K + ++GP+A A +
Sbjct: 353 YAQTPATELESAAHAKHALLMAHESIVLLKNANNTLPLKKG-LKKIVVLGPNATNVIAPL 411
Query: 414 GNYEGTPCRYTSPMDGF---------YAYSKVINY 439
GNY GTP + + G Y K +NY
Sbjct: 412 GNYSGTPSKLITLFQGIKEKAGAATQVVYEKAVNY 446
Score = 115 bits (289), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 94/327 (28%), Positives = 141/327 (43%), Gaps = 60/327 (18%)
Query: 433 YSKVINYAPGCADIVCQNNSMIPAAID------AAKNADATVIVAGLDLSVEAE------ 480
Y+ V+ Y G + ++ PA D +ADA + G+ +E E
Sbjct: 570 YNLVLEYWQGEGKATIKMHTGHPAVTDFNALVKKYSDADAFIFAGGISPQLEGEEMKVSD 629
Query: 481 ----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
G DR +LLP QTEL+ K A+ PV V+M+ A+ + N I +I+
Sbjct: 630 PGFKGGDRTTILLPAIQTELM-KALQASGKPVVFVMMTGSALATPWESEN--IPAIVNAW 686
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
Y G+ G A+ADV+FG YNP GRLP+T+Y ++ +P + RTY++F G
Sbjct: 687 YGGQAAGTALADVLFGDYNPSGRLPVTFYGSD------NDLPSFEDYSMKNRTYRYFTGK 740
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+Y FGYGLSYT F+Y + P + Q + + TV
Sbjct: 741 PLYGFGYGLSYTTFRYDQLTMPVTA-------QNGKPVKVTV------------------ 775
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAAGQSAKVGFTM 715
V N GK G EV +Y + T +K + G++R+ + +S V F +
Sbjct: 776 --------RVTNTGKTTGDEVAQIYVVNENTSIQTALKTLKGFQRISLRPAESKMVSFVL 827
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
+ L VD +G I VG
Sbjct: 828 QS-DDLTYVDADGQRKPLTGKIQICVG 853
>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 864
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 165/448 (36%), Positives = 240/448 (53%), Gaps = 47/448 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D+ L ERA+DL++++TL EKV M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---MYNLGNA---- 126
AT FP I ASF I VS EARA Y+ +
Sbjct: 82 --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERY 127
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ +N V+GLQ + D++ +
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCM-------DANQKYD 180
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
KI AC KH+A + W +R F++ + +D+ ET+++PFE V E V VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYN 237
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R+ G P C +LL Q +R DW + G ++SDC +I + HK D + + A VL
Sbjct: 238 RLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
+G DL+CG Y A ++G I+E DID S++ L LG D ++ + +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPDKVEWTKIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+C+ +H L+ + AR+ + LL N N LPL G +T+A++GP+AN + GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414
Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
+ ++G + K+I Y GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 136/300 (45%), Gaps = 54/300 (18%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
I + K+AD + G+ S+E E DR D+ LP Q ELI + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K ++ ++ I ++IL YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y +P N GRTY++F G ++PFGYGLSYT F Y +I
Sbjct: 709 FYRNT------AQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYG--------NI 754
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KL++ T K A +I + V N G DG EVV VY K
Sbjct: 755 KLEQ------------TIKVGETAKII-------------VPVTNTGNRDGEEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
A +K + ++RV I AG++ V + K L+ D N++ +G I+VG
Sbjct: 790 KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
Length = 782
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 220/722 (30%), Positives = 341/722 (47%), Gaps = 112/722 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG AT FPT I A+++ L K++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPELVKEV 170
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ ++ E R+ G + P +++ RDPRW RV ET GEDP + G + V GL
Sbjct: 171 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 225
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ S+ A KH+ AY + EG ++ S V +D+ + F+ PF
Sbjct: 226 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 274
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++GIP ++ LL Q +R +W F G++VSD SI+ I ESH F+
Sbjct: 275 AIDSGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 332
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
TKE+A + + AG+D+D G D YTN AVQ G++ +A IDT++ + + +G F
Sbjct: 333 ALTKENAAIQSVTAGVDVDLGGDAYTNLCH-AVQSGQMDKAVIDTAVCRVLRMKFEMGLF 391
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + + +HIELA + A+ I LLKN+N LPL+ I +A++GP+A+
Sbjct: 392 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-TINKVAVIGPNADN 450
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
M+G+Y + +DG + Y GCA I + I AI+AA+ +
Sbjct: 451 RYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCA-IRDTTVNEIEQAIEAARRS 509
Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
+ A V G +E EG DR L L G Q EL+ +
Sbjct: 510 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 569
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P+ +V + ++ N+A ++L YPG+EGG AIADV+FG YNP GRLP
Sbjct: 570 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 626
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
I+ + +IP P N+ Y +Y FGYG+SYT F+Y
Sbjct: 627 IS-VPRSVGQIPVYYNKKAPRNH----DYVEVSSSPLYSFGYGMSYTTFEYSALQV---- 677
Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
+ K +C ++++ +V+N GK DG EV +Y
Sbjct: 678 ---VQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQLY 706
Query: 682 SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
+ + +KQ+ +ER + G+ KV F + + +V+ ++ SG ++
Sbjct: 707 MRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHLM 765
Query: 741 VG 742
+G
Sbjct: 766 IG 767
>gi|293372493|ref|ZP_06618877.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|299144770|ref|ZP_07037838.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|292632676|gb|EFF51270.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|298515261|gb|EFI39142.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 735
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 212/760 (27%), Positives = 355/760 (46%), Gaps = 97/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
Y D K P +R DL+ RMTL EKV Q+ G VP +G +Y
Sbjct: 30 YKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYFE 89
Query: 59 WSEALHGV----SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
+ AL + R P +D+ T +P + S+N L ++ +
Sbjct: 90 TNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G + V+G
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +++AC KHY Y G D + + +++Q + +T++LP+EM V G
Sbjct: 198 -YQGDDLSAENRMAACLKHYVGYGASE-AGRDYVY--TEISKQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P A+P ++ + ++ W G+IVSD +I+ + ++ L TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
+A AGL++D + Y V++G+++ A +D ++R + ++ RLG F+
Sbjct: 311 EAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K PQ +++AA A + +VLLKN+N LPL + K +A++GP A ++
Sbjct: 371 PATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLT--DKKKIAVIGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G +G +A + YA GCA N A++AA+ +D V
Sbjct: 429 GSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEALEAARWSDVVV 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G ++ E R + LP Q EL ++ A K P+ LV+++ +++N +
Sbjct: 488 LCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELN--RLELI 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
+IL + PG G +A ++ G+ NP G+L +T+ PY++ +P+
Sbjct: 545 SDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596
Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
GR ++ F + +YPFG+GLSYT+FKY GT
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
P V D + + ++ V N+G DG+E V + P + T +K++ +E+
Sbjct: 631 PSVTKVKRGD------RLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
I AG++ F ++ + V+ L +G + ILV
Sbjct: 685 LIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
Length = 864
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 164/448 (36%), Positives = 239/448 (53%), Gaps = 47/448 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y ++ L ERA+DL++++TL EKV M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I ASF I VS EARA +A
Sbjct: 82 --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ +N V+GLQ D++ +
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
KI AC KH+A + W +R F++ + +D+ ET+++PFE V EG V VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R+ G P C +LL Q +R +W + G ++SDC +I + HK D + + A VL
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
+G DL+CG Y A ++G I+E DID S++ L LG D ++ + +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+C+ +H L+ + AR+ + LL N N LPL G +T+A++GP+AN + GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414
Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
+ ++G + K+I Y GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 133/300 (44%), Gaps = 54/300 (18%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
I + K+AD + G+ S+E E DR D+ LP Q ELI + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K ++ ++ I ++IL YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y +P N GRTY++F G ++PFGYGLSYT F Y +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NI 754
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KLD+ + + V I V N G DG EVV VY K
Sbjct: 755 KLDQTIKVGETAKMV-------------------------IPVTNAGNRDGEEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
A K + ++RV I AG++ V + K L+ D N++ +G I+VG
Sbjct: 790 KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|227536644|ref|ZP_03966693.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33300]
gi|227243445|gb|EEI93460.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33300]
Length = 777
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 212/716 (29%), Positives = 334/716 (46%), Gaps = 114/716 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG T FPT I +++N +L +K+
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------------TTVFPTGIGQASTWNPALLQKM 167
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
TV+ E R + P +++ RDPRW RV E+ GEDP + G A V GL
Sbjct: 168 SATVAKEVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVTGLG 222
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ S P KH+ AY + N + + E++++E F+ PF+
Sbjct: 223 S--------GNLSDPFATIPTLKHFVAYGIPEGGHNGSA---ASIGERELREYFLPPFQS 271
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V G SVM +YN V+GIP ++ LL +R +WNF+G+ VSD SI+ I SH+
Sbjct: 272 AVAAG-AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWNFNGFTVSDLGSIEGIKGSHRVA 330
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
D K+ A+ ++AGLD D G + AV+QG++ E ID ++ + + +G F+
Sbjct: 331 KDHKQAAIL-AIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRVLALKFEMGLFE 389
Query: 350 GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
K + +I L+ + AR+ IVLL+N N LPL ++K +A++GP+A+
Sbjct: 390 KPFVDAKTAKKEVKTEANIALSRQVARESIVLLENKNNILPLRK-DVK-IAIIGPNADNI 447
Query: 410 KAMIGNY-----EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
M+G+Y +G + ++V +Y GC+ I NS IPAA+ AA+ +
Sbjct: 448 YNMLGDYTAPQPDGAVTTVRQAISARLPKAQV-SYVKGCS-IRDTTNSDIPAAVTAAQQS 505
Query: 465 DATVIVAG----LDLSVE-------------------AEGKDRVDLLLPGFQTELINKVA 501
D V V G D E EG DR L L G Q EL+ +
Sbjct: 506 DIIVAVVGGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLDLLGRQMELLKALK 565
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P+ ++ + +++N+A + W YPG+EGG AIADV+FG YNP G++P
Sbjct: 566 QTGK-PLVVIYIQGRPLNMNWAATHADALLCAW--YPGQEGGHAIADVLFGDYNPAGKMP 622
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
++ + +IP P+++ Y +Y FGYG SY+ F+YK
Sbjct: 623 LS-VPRSVGQIPVHYNRKSPLDH----RYVEEAATPLYAFGYGKSYSDFEYK-------- 669
Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
D+K+ KD KDY+ +F + N GK DG EV +Y
Sbjct: 670 DLKIQKDN--------------------------KDYRVSFTLT--NTGKYDGDEVAQLY 701
Query: 682 SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGA 736
+ + + ++Q+ +ER+ + G+S V F + A L +++ +L G+
Sbjct: 702 IRNQYASVSQPVQQLKHFERIHLKTGESKTVSFVLTAG-DLSVINTQMKKVLEPGS 756
>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
Length = 864
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 164/448 (36%), Positives = 239/448 (53%), Gaps = 47/448 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y ++ L ERA+DL++++TL EKV M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I ASF I VS EARA +A
Sbjct: 82 --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ +N V+GLQ D++ +
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
KI AC KH+A + W +R F++ + +D+ ET+++PFE V EG V VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R+ G P C +LL Q +R +W + G ++SDC +I + HK D + + A VL
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
+G DL+CG Y A ++G I+E DID S++ L LG D ++ + +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+C+ +H L+ + AR+ + LL N N LPL G +T+A++GP+AN + GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414
Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
+ ++G + K+I Y GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 132/300 (44%), Gaps = 54/300 (18%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
I + K+AD + G+ S+E E DR D+ LP Q ELI + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K ++ ++ I ++IL YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y +P N GRTY++F G ++PFGYGLSYT F Y +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NI 754
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KL++ + + V I V N G DG EVV VY K
Sbjct: 755 KLEQTIKVGETAKMV-------------------------IPVTNTGNRDGEEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
K + ++RV I AG++ V + K L+ D N++ +G I+VG
Sbjct: 790 KQEDTEGPTKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
Length = 850
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 162/448 (36%), Positives = 236/448 (52%), Gaps = 47/448 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
+ PY + L +RA DL++R+T+ EK+ M + + G+PRLG+ YEWW+EALHGV+
Sbjct: 12 AQLPYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEALHGVARA 71
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
G AT FP I ASFN+SL +K+ VS EARA N
Sbjct: 72 GL----------------ATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQ 115
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLT W+PN+N+ RDPRWGR ET GEDPY+ R + V+GLQ + Y
Sbjct: 116 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYD---- 171
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
K+ AC KH+A + W +R F++ + +D+ ET++ F+ V E DV VM
Sbjct: 172 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKTLVQEADVKEVM 224
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
C+YNR G P C +LL Q +R +W F+G +VSDC +I + K ++T DA
Sbjct: 225 CAYNRFEGDPCCGSNRLLTQILRDEWGFNGIVVSDCGAISDFWGAKK--HNTHPDAAHAS 282
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
A + +G DL+CG Y T AV+ G I+E ID S++ L LG + S + L
Sbjct: 283 ADAVLSGTDLECGSNYRKLT-DAVKAGIISEEQIDISVKRLLKARFELGEMEESHPWA-L 340
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ + P+H LA + A + + LL+N LPL+ +A++GP+AN + GNY
Sbjct: 341 PYSIVDCPEHRHLALQIAHETMTLLQNKENILPLDKH--AKVAVIGPNANDSVMQWGNYN 398
Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGC 443
GTP ++ + + + + Y P C
Sbjct: 399 GTPSHTSTLLSALRSKLPAAQLIYEPVC 426
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/299 (29%), Positives = 135/299 (45%), Gaps = 55/299 (18%)
Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
A ++ K+ + + G+ +E E G DR D+ LP Q ++ + A K
Sbjct: 579 ATLEKLKDTEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELPAVQRNVLAALKKAGK 638
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
V V S A+ + N +IL YPG+EGG A+ADV+FG YNP GRLP+T+Y
Sbjct: 639 -KVIFVNFSGSAMALTPETEN--CDAILQAWYPGQEGGTAVADVLFGDYNPAGRLPVTFY 695
Query: 566 EANYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIK 624
+ N ++P + ++ GRTY++ ++PFGYGLSYT F Y A + K
Sbjct: 696 K-NMEQLPDFEDYSMQ------GRTYRYMKEAPLFPFGYGLSYTTFTYGKARADKK---- 744
Query: 625 LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP 684
+ T + K T I V N+G DG EVV VY +
Sbjct: 745 ------------RISTGE----------------KMTLTIPVSNIGSRDGEEVVQVYLRR 776
Query: 685 PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
K + ++RV I G+S V + + + DN+ +++ + G + +L G
Sbjct: 777 EDDPEGPTKTLRAFKRVEITKGKSLNVKIEL-PYTAFEWFDNSTHTMHSMKGEYEVLYG 834
>gi|390957160|ref|YP_006420917.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390412078|gb|AFL87582.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 908
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 168/444 (37%), Positives = 233/444 (52%), Gaps = 45/444 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + L +RA DLV RMTL EK QM + A +PRL +P Y++W+E LHGV+ G
Sbjct: 24 YLNPALTPQQRAADLVGRMTLEEKSLQMVNGAAAIPRLNVPAYDYWNEGLHGVARSGY-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+++ L K+IG ++TEARA N
Sbjct: 82 --------------ATMFPQAIGMAATWDAPLLKQIGDVIATEARAKNNEALRRNNHDIY 127
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDP++ + +N++ GLQ +D +
Sbjct: 128 FGLTFWSPNINIFRDPRWGRGQETYGEDPHLTTQLGVNFIEGLQ---------GTDPKFY 178
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K+ A KH+A + EG R FD T D+ +T++ F + + S+MC+YNR
Sbjct: 179 KVIATPKHFAVHSGPE-EG--RHKFDVEPTPHDLWDTYLPQFRAAIVDAKADSIMCAYNR 235
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDTKEDAVARVLKA 303
++G P C LL +R DW F G++ SDC +I +H+ D E A L A
Sbjct: 236 IDGQPACGSKLLLVDILRNDWKFQGFVTSDCGAIDDFFRPNTHQTEPDA-EHADKAALLA 294
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKNN 361
G D +CG Y AV+ G I E+DID SLR L+ +RLG FD GS Y + +
Sbjct: 295 GTDTNCGSTYRKLG-DAVKSGLIKESDIDVSLRRLFEARVRLGLFDPAGSVPYAQIPFSQ 353
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ +P + +A AA + +VLLKND G LPL G KT+A++GP+ + ++ GNY G
Sbjct: 354 VNSPANAAVAKRAAEESMVLLKND-GILPLKAGKYKTIAVIGPNGASLSSLEGNYNGMAH 412
Query: 422 RYTSPMDGFYAYSKVIN--YAPGC 443
P+D + N YAPG
Sbjct: 413 DPRMPVDALRSALSGTNVVYAPGA 436
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 96/304 (31%), Positives = 145/304 (47%), Gaps = 55/304 (18%)
Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVA 501
+++P A++AA +D V + GL +E E G DR D+ LP Q L+ +
Sbjct: 619 TLLPEALEAANKSDLVVAMLGLSPDLEGEEMPVKLPGFVGGDRTDISLPASQQALLQGLI 678
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P +V+++ A+ IN A + K +IL YPGE G A+AD + G+ NP GRLP
Sbjct: 679 ATGK-PTIVVLLNGSALAINLA--DEKANAILESWYPGEAGSTALADTLVGRNNPSGRLP 735
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
IT+Y++ + +P + RTY++F G +Y FG+GLSYT+F Y S K
Sbjct: 736 ITFYKSE------SDLPGFEDYSMQNRTYRYFKGAPLYGFGFGLSYTKFAY---SGLKLA 786
Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
KL+ T ++ V+N GK+ G EV +Y
Sbjct: 787 KAKLNAGD-----------------------------TLTAEVTVKNTGKVAGEEVAELY 817
Query: 682 SKPP--GIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHT 738
PP G AG KQ + G++RV + G+S K+ FT+ + L VD + G +
Sbjct: 818 LLPPAEGNAGLSPKQQLEGFQRVMLKPGESRKLTFTLTP-RQLSEVDAKGTRAIQPGTYA 876
Query: 739 ILVG 742
I +G
Sbjct: 877 IAIG 880
>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
Length = 864
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 164/448 (36%), Positives = 239/448 (53%), Gaps = 47/448 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y ++ L ERA+DL++++TL EKV M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I ASF I VS EARA +A
Sbjct: 82 --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ +N V+GLQ D++ +
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
KI AC KH+A + W +R F++ + +D+ ET+++PFE V EG V VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R+ G P C +LL Q +R +W + G ++SDC +I + HK D + + A VL
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
+G DL+CG Y A ++G I+E DID S++ L LG D ++ + +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+C+ +H L+ + AR+ + LL N N LPL G +T+A++GP+AN + GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414
Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
+ ++G + K+I Y GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 133/300 (44%), Gaps = 54/300 (18%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
I + K+AD + G+ S+E E DR D+ LP Q ELI + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K ++ ++ I ++IL YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y +P N GRTY++F G ++PFGYGLSYT F Y +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NI 754
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KLD+ + + V I V N G DG EVV VY K
Sbjct: 755 KLDQTIKVGETAKMV-------------------------IPVTNAGNRDGEEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
A K + ++RV I AG++ V + K L+ D N++ +G I+VG
Sbjct: 790 KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 864
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 165/448 (36%), Positives = 240/448 (53%), Gaps = 47/448 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D+ L ERA+DL++++TL EKV M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---MYNLGNA---- 126
AT FP I ASF I VS EARA Y+ +
Sbjct: 82 --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERY 127
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ +N V+GLQ + D++ +
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCM-------DANQKYD 180
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
KI AC KH+A + W +R F++ + +D+ ET+++PFE V E V VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYN 237
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R+ G P C +LL Q +R DW + G ++SDC +I + HK D + + A VL
Sbjct: 238 RLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
+G DL+CG Y A ++G I+E DID S++ L LG D ++ + +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPDKVEWTKIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+C+ +H L+ + AR+ + LL N N LPL G +T+A++GP+AN + GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414
Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
+ ++G + K+I Y GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 137/300 (45%), Gaps = 54/300 (18%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
I + K+AD + G+ S+E E DR D+ LP Q ELI + DA
Sbjct: 592 IKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K ++ ++ I ++IL YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y T +P N GRTY++F G ++PFGYGLSYT F Y +I
Sbjct: 709 FYRN------ITQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYG--------NI 754
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KL++ T K A +I + V N G DG EVV VY K
Sbjct: 755 KLEQ------------TIKVGETAKII-------------VPVTNTGNRDGEEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
A +K + ++RV I AG++ V + K L+ D N++ +G I+VG
Sbjct: 790 KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 221/723 (30%), Positives = 344/723 (47%), Gaps = 114/723 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG AT FPT I A+++ L K++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPELVKEV 170
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ ++ E R+ G + P +++ RDPRW RV ET GEDP + G + V GL
Sbjct: 171 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGASMVDGLG 225
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ S+ A KH+ AY + EG ++ S V +D+ + F+ PF
Sbjct: 226 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 274
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++GIP ++ LL Q +R +W F G++VSD SI+ I ESH F+
Sbjct: 275 AIDAGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 332
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
TKE+A + + AG+D+D G D YTN AVQ G++ +A IDT++ + + +G F
Sbjct: 333 APTKENAAIQSVMAGVDVDLGGDAYTNLCH-AVQSGQMDKAVIDTAVCRVLRMKFEMGLF 391
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + + +HIELA + A+ I LLKN+N LPL+ I +A++GP+A+
Sbjct: 392 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-MINKVAVIGPNADN 450
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
M+G+Y + +DG + Y GCA I + I AI+AA+ +
Sbjct: 451 RYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCA-IRDTTVNEIEQAIEAARRS 509
Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
+ A V G +E EG DR L L G Q EL+ +
Sbjct: 510 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 569
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P+ +V + ++ N+A ++L YPG+EGG AIADV+FG YNP GRLP
Sbjct: 570 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 626
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
I+ + +IP P N+ Y +Y FGYG+SYT F+Y
Sbjct: 627 IS-VPRSVGQIPVYYNQKAPRNH----DYVEVSSSPLYSFGYGMSYTTFEYS-------- 673
Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
D++ + K +C ++++ +V+N GK DG EV +
Sbjct: 674 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 705
Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
Y + + +KQ+ +ER + G+ KV F + + +V+ ++ SG +
Sbjct: 706 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHL 764
Query: 740 LVG 742
++G
Sbjct: 765 MIG 767
>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
dorei DSM 17855]
Length = 864
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 164/448 (36%), Positives = 239/448 (53%), Gaps = 47/448 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y ++ L ERA+DL++++TL EKV M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I ASF I VS EARA +A
Sbjct: 82 --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ +N V+GLQ D++ +
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
KI AC KH+A + W +R F++ + +D+ ET+++PFE V EG V VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R+ G P C +LL Q +R +W + G ++SDC +I + HK D + + A VL
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
+G DL+CG Y A ++G I+E DID S++ L LG D ++ + +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+C+ +H L+ + AR+ + LL N N LPL G +T+A++GP+AN + GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414
Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
+ ++G + K+I Y GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 133/300 (44%), Gaps = 54/300 (18%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
I + K+AD + G+ S+E E DR D+ LP Q ELI + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K ++ ++ I ++IL YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y +P N GRTY++F G ++PFGYGLSYT F Y +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NI 754
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KLD+ + + V I V N G DG EVV VY K
Sbjct: 755 KLDQTIKVGETAKMV-------------------------IPVTNAGNRDGEEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
A K + ++RV I AG++ V + K L+ D N++ +G I+VG
Sbjct: 790 KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
Length = 864
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 165/448 (36%), Positives = 240/448 (53%), Gaps = 47/448 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D+ L ERA+DL++++TL EKV M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---MYNLGNA---- 126
AT FP I ASF I VS EARA Y+ +
Sbjct: 82 --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERY 127
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ +N V+GLQ + D++ +
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCM-------DANQKYD 180
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
KI AC KH+A + W +R F++ + +D+ ET+++PFE V E V VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYN 237
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R+ G P C +LL Q +R DW + G ++SDC +I + HK D + + A VL
Sbjct: 238 RLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
+G DL+CG Y A ++G I+E DID S++ L LG D ++ + +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPDKVEWTKIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+C+ +H L+ + AR+ + LL N N LPL G +T+A++GP+AN + GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414
Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
+ ++G + K+I Y GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441
Score = 129 bits (323), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 138/300 (46%), Gaps = 54/300 (18%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
I + K+AD + G+ S+E E DR D+ LP Q ELI + DA
Sbjct: 592 IKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K ++ ++ I ++IL YPG+ GG+A+A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAVAEVLFGDYNPAGRLPVT 708
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y T +P N GRTY++F G ++PFGYGLSYT F Y +I
Sbjct: 709 FYRN------ITQLPNFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYG--------NI 754
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KL++ T K A +I + V N G DG EVV VY K
Sbjct: 755 KLEQ------------TIKVGETAKII-------------VPVTNTGNRDGEEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
A +K + ++RV I AG++ V + K L+ D N++ +G I+VG
Sbjct: 790 KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDTQTNTMRTLAGNFDIMVG 848
>gi|167765093|ref|ZP_02437206.1| hypothetical protein BACSTE_03479 [Bacteroides stercoris ATCC
43183]
gi|167696721|gb|EDS13300.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
stercoris ATCC 43183]
Length = 944
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 228/811 (28%), Positives = 366/811 (45%), Gaps = 146/811 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D R ++L+++MTL EK QM L YG R+ LP EW W + G+
Sbjct: 53 YEDPAATLDARIENLLQQMTLEEKTCQMVTL-YGYKRVLKDALPTPEWKQMLWKD---GI 108
Query: 67 SFIGRRTNS---------------PPGTH----------------------FDSE-VPG- 87
I N P H F +E + G
Sbjct: 109 GAIDEHLNGFQQWGLPPSDNENVWPASRHAWALNEIQRFFVEDTRLGIPVDFTNEGIRGV 168
Query: 88 ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
AT+FPT + ++N L +++G EAR + G T ++P ++V RD R
Sbjct: 169 ESYKATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 222
Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
WGR E GE PY+V I VRGLQ H +++A KH+AAY +
Sbjct: 223 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATAKHFAAYSNNKG 269
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
D ++ ++++ I PF+ + E + VM SYN +GIP L +
Sbjct: 270 AREGMARVDPQMPPREVENIHIYPFKRVIREAGLLGVMSSYNDYDGIPIQGSYYWLTTRL 329
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
R + F GY+VSD D+++ + H D KE AV + ++AGL++ C D +
Sbjct: 330 RKEMGFRGYVVSDSDAVEYLYTKHNTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 388
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQ 377
V++G ++E I+ +R + V +G FD Q G ++ + E +A +A+R+
Sbjct: 389 ELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADDEVEKEANEAVALQASRE 448
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK-- 435
IVLLKN + LPLN IK +A+ GP+A+ + +Y T+ ++G ++
Sbjct: 449 SIVLLKNTDNTLPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIREKAQGK 508
Query: 436 -VINYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEA 479
+ Y GC D+V + + I A+ A+ AD V+V G
Sbjct: 509 AEVLYTKGC-DLVDAHWPESEIMEYPLTPDEQAEIDRAVANARQADVAVVVLGGGQRTCG 567
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E K R L LPG Q +L+ V K PV L++++ + +N+A + + +IL YPG
Sbjct: 568 ENKSRTSLELPGHQLKLLQAVQATGK-PVILILINGRPLSVNWA--DKFVPAILEAWYPG 624
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV-- 597
+GG +AD++FG YNPGG+L +T + +IP+ + P +P + G DG +
Sbjct: 625 SKGGTVVADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPYKPASQIDGGKNPGPDGNMSR 682
Query: 598 ----VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
+YPFGYGLSYT F+Y + +PK +
Sbjct: 683 INGALYPFGYGLSYTTFEYSDLEITPKVI------------------------------- 711
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKV 711
+ K T +++V N GK G EVV +Y++ T+ K + G+ER+ + G+S ++
Sbjct: 712 --TPNQKATIRLKVTNTGKRAGDEVVQLYTRDILSSVTTYEKNLAGFERIHLKPGESKEI 769
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
FT++ K L++++ + G I+ G
Sbjct: 770 VFTLDR-KHLELLNADMKWTVEPGEFAIMAG 799
>gi|53712134|ref|YP_098126.1| beta-glucosidase [Bacteroides fragilis YCH46]
gi|52214999|dbj|BAD47592.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
Length = 812
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 235/817 (28%), Positives = 358/817 (43%), Gaps = 163/817 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P R + L+ +MTL EKV QM + LG P+YE
Sbjct: 49 YENPSAPVEYRVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEDIRLTPQLEKEI 102
Query: 58 ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
W LH S R +N H +P
Sbjct: 103 GEYHIGSLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPH 162
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 163 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 217
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q E D S + A KH+A+Y
Sbjct: 218 RWSRVEETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---G 266
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G + SVM SYN ++G P LL
Sbjct: 267 WTEGGHNGGTAHIGERELEEAIFPPFREAVGAGAL-SVMSSYNEIDGNPCTGSRYLLTDI 325
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD ++ + E ND +A + + AG+D D G + Y + A
Sbjct: 326 LKDRWQFKGFVVSDLYAVGGLREHGVAGNDY--EAAIKAVNAGVDSDLGTNVYAEQLVAA 383
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A A ID ++R + + ++G FD + + + +H LA E ARQ IV
Sbjct: 384 VKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIV 443
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN + LPL +I+TLA++GP+A+ M+G+Y +GT + +
Sbjct: 444 LLKNKDKLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKET 502
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI+ A+NADA V+V G D S E
Sbjct: 503 RVLYAKGCA-VRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQ 561
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLV----IMSAGAVDINFAKNNPK 528
EG DR L L G Q EL+ +++ K PV L+ ++ GA+ +
Sbjct: 562 ISDMESGEGYDRATLHLMGRQLELLEEISRLGK-PVVLIKGRPLLMEGAIQ--------E 612
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR 588
++I+ YPG +GG A+ADV+FG YNP GRL ++ V +P+ G
Sbjct: 613 AEAIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTRRKGN 666
Query: 589 TYKFFDGPVV--YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
++ + P YPFGYGLSYT F Y D+K + T G++
Sbjct: 667 RSRYVEEPGTPRYPFGYGLSYTTFSY--------TDMK---------VQVTEGSD----- 704
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQVIGYERVFIAA 705
D + ++N G DG EV +Y + + T KQ+ + R+ + A
Sbjct: 705 ----------DCWVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKA 754
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
G+S +V FT++ KSL + ++ G TI+VG
Sbjct: 755 GESREVTFTLDK-KSLALYMQEGEWVVEPGRFTIMVG 790
>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
Length = 864
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 164/448 (36%), Positives = 239/448 (53%), Gaps = 47/448 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y ++ L ERA+DL++++TL EKV M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I ASF I VS EARA +A
Sbjct: 82 --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ +N V+GLQ D++ +
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
KI AC KH+A + W +R F++ + +D+ ET+++PFE V EG V VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R+ G P C +LL Q +R +W + G ++SDC +I + HK D + + A VL
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVL- 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
+G DL+CG Y A ++G I+E DID S++ L LG D ++ + +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+C+ +H L+ + AR+ + LL N N LPL G +T+A++GP+AN + GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414
Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
+ ++G + K+I Y GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 132/300 (44%), Gaps = 54/300 (18%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
I + K+AD + G+ S+E E DR D+ LP Q ELI + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K ++ ++ I ++IL YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETQYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y +P N GRTY++F G ++PFGYGLSYT F Y +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYG--------NI 754
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KL++ + + V I V N G DG EVV VY K
Sbjct: 755 KLEQTIKVGETAKMV-------------------------IPVTNTGNRDGEEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
K + ++RV I AG++ V + K L+ D N++ +G I+VG
Sbjct: 790 KQEDTEGPAKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
Length = 850
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 161/430 (37%), Positives = 234/430 (54%), Gaps = 46/430 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D P ER DL+ R+T+ EK+ + + G+PRL + Y +EALHG+
Sbjct: 27 YKDMDAPQHERIMDLLSRLTIEEKISLLRATSPGIPRLEIEKYYHGNEALHGIV------ 80
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
PG T FP I + +N +I +S EARA +N N G
Sbjct: 81 --RPGNF--------TVFPQAIGLASMWNPDFLYEISTVISDEARARWNELNRGKDQKRL 130
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G+ + +V+GLQ +D R
Sbjct: 131 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQG---------NDPR 181
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK+ + KH+AA N E ++RF + +++E+D++E ++ FE C+ +G S+M +Y
Sbjct: 182 YLKVVSTPKHFAA----NNEEHNRFECNPQISERDLREYYLPAFERCIIDGKAQSIMTAY 237
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F+GY+VSDC + +V HK++ T E A LKA
Sbjct: 238 NAINDVPCTLNTWLLKKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVK-TPEAAATLALKA 296
Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CGD Y M A +Q ++EA+IDT+ + M LG FD + Y L +
Sbjct: 297 GLDLECGDNVYIEPLMNAYKQYMVSEAEIDTAAYRILRARMMLGLFDDPAKNPYNALSPS 356
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ +H +A EAARQ +VLLKN+N LP+N IK++A+VG NA G+Y G P
Sbjct: 357 IVGCEKHKNMALEAARQSLVLLKNENNFLPINPKKIKSIAVVG--INAGNCEFGDYSGKP 414
Query: 421 CRY-TSPMDG 429
S +DG
Sbjct: 415 VNVPVSVLDG 424
Score = 128 bits (322), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 93/288 (32%), Positives = 143/288 (49%), Gaps = 47/288 (16%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
A A + D T+ V G++ S+E EG+DR + LP Q EL + A + +V+++
Sbjct: 594 AKKAIQECDMTIAVMGINKSIEREGRDRDHIELPKDQ-ELFIEEAYKLNPKMAVVLVAGS 652
Query: 517 AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS 576
++ +N+ + + +IL YPGE+GG A+A+ +FG YNP GRLP+T+Y + P+
Sbjct: 653 SLAVNWMDEH--VPAILNAWYPGEQGGTAVAEALFGDYNPAGRLPLTYYRSLDDLPPFDD 710
Query: 577 MPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
++ RTY +F G +Y FGYGLSYT+F Y+ KL DQ ++
Sbjct: 711 YAVQ-----KNRTYMYFTGKPLYAFGYGLSYTKFDYR----------KLSVDQDAENV-- 753
Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA-GTHIKQV 695
+ +F I +N GK +G EV VY + P I IKQ+
Sbjct: 754 ----------------------RLSFTI--KNSGKYNGDEVAQVYVQFPEIGVKVPIKQL 789
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
G+ERV IA G++ V T+ K L+I + SG + +VG
Sbjct: 790 KGFERVHIAKGKTLPVTITV-PKKELRIWNERKGEFFTPSGNYVFMVG 836
>gi|255690486|ref|ZP_05414161.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260623937|gb|EEX46808.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 1365
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 233/807 (28%), Positives = 360/807 (44%), Gaps = 162/807 (20%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDL--------------------------- 44
PY A LP ER KDL++RMT EK+ Q+ +
Sbjct: 534 LPYQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERKLEEKAQGMSWGF 593
Query: 45 AYGVP---------------------RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
G P RLG+P++ +E+LHGV
Sbjct: 594 VEGFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGVVH--------------- 637
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRW 143
GAT FP I ++F+ L + ++ E A+ SP I+VVRD RW
Sbjct: 638 --EGATVFPQNIALGSTFDTDLAYRKTSMIADELHAV-----GMRQVLSPCIDVVRDLRW 690
Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
GRV E+ GEDPY+ GR+ I V+G D IS KHY +
Sbjct: 691 GRVEESFGEDPYLCGRFGIAEVKGYMDN--------------GISPMLKHYGPH------ 730
Query: 204 GNDRFHFDSRVTE---QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
GN + E +D+ E ++ PFEM + + +VM +YN N IP A LL
Sbjct: 731 GNPLSGLNLASVETSIRDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLLTD 790
Query: 261 TIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA 320
+R +W F GY+ SD +I+ + H F E+A + L AGLD++ G
Sbjct: 791 VLRKEWGFKGYVYSDWGAIEMLKNFH-FTARNSEEAALQALTAGLDVEASSDCYPAIPGL 849
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
+++G++ +D ++R + R+G FD P + K I + + I L+ + A + V
Sbjct: 850 IERGELNREIVDEAVRRVLYAKFRIGLFD-DPYGEKFAKGAIHSGKAIALSKKIADESTV 908
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT-PCRY-TSPMDGFYAYSKV-- 436
LLKN+ LPL+ G +K++A++GP NA + G+Y T R+ +P+ G ++
Sbjct: 909 LLKNERQLLPLSIGKLKSIAVIGP--NADQIQFGDYTWTRDNRFGVTPLQGIRKWAGTNV 966
Query: 437 -INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---------LDLSVEAEGKDRVD 486
+NYA GC+ +V + S I A++AA+ +D V+ G S EG D D
Sbjct: 967 KVNYAKGCS-LVSMDESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGFDLND 1025
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
L L G Q LI V K PV LV+++ I + K N I +IL Y GE+ G +I
Sbjct: 1026 LTLTGAQPALIKAVQATGK-PVILVLVTGKPFAIPWEKKN--IPAILVQWYAGEQSGNSI 1082
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF---------PGRTYKFFDGPV 597
AD++FGK +P GRL ++ E+ +P LR F PGR Y F PV
Sbjct: 1083 ADILFGKVSPSGRLTFSFPEST-GHLPVYYNHLRSDRGFYKSPGSYDSPGRDY-VFSAPV 1140
Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
++ FG+GL+YT F+Y +++ D+ A+ L++D
Sbjct: 1141 PLWSFGHGLTYTTFEYS--------NLQTDR------------------ASYLLNDT--- 1171
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
+I ++N GK +G EVV +Y S ++Q+ + +V + AG++ V ++
Sbjct: 1172 ---VHVRIGLKNTGKCEGKEVVQLYVSDVCSSVAMPVRQLRDFRKVALQAGETQIVRLSI 1228
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
L I++ +++ G I VG
Sbjct: 1229 -PVSELTILNEKNEAIVEPGEFEIQVG 1254
>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
Length = 854
Score = 267 bits (683), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 160/421 (38%), Positives = 232/421 (55%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D P ER DL+ ++T+ EK+ + + G+PRL + Y +EALHGV
Sbjct: 28 YLDMNAPRHERILDLLSKLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGVV------ 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
PG T FP I A +N L +I +S EARA +N G
Sbjct: 82 --RPGNF--------TVFPQAIGLAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQ 131
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G+ +++V+GLQ D R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQG---------DDPR 182
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + ++E+D++E ++ FE C+ EG +S+M +Y
Sbjct: 183 YLKIVSTPKHFAA----NNEEHNRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAY 238
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V HK++ T E A A ++A
Sbjct: 239 NAINDVPCTLNNWLLKKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVK-TLEAAAALSIQA 297
Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
GLDL+CGD Y + A +Q ++EA+ID++ + MRLG FD Y + +
Sbjct: 298 GLDLECGDEVYMEPLLNAYKQYMVSEAEIDSAAYHVLRARMRLGLFDDPALNPYNKISPS 357
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ +H +LA EAARQ IVLLKN+ LPL++ IK++A+VG NA + G+Y GTP
Sbjct: 358 IVGCEKHSKLALEAARQSIVLLKNEKKFLPLDSKKIKSIAVVG--INAGNSEFGDYSGTP 415
Query: 421 C 421
Sbjct: 416 V 416
Score = 122 bits (305), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 84/263 (31%), Positives = 129/263 (49%), Gaps = 49/263 (18%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
A D + D TV V G++ S+E EG+DR + LP Q I + P T+V++ AG
Sbjct: 595 AGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKI--NPNTVVVLVAG 652
Query: 517 A-VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYT 575
+ + IN+ + I +I+ YPGE GG A+A+V+FG YNPGG+LP+T+Y + +
Sbjct: 653 SSLAINWMDEH--IPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRSLDELPAFD 710
Query: 576 SMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDIN 635
+R GRTY+FF+G +Y FG+GLSYT F YK S + D+
Sbjct: 711 DYDIR-----KGRTYQFFEGDPLYAFGHGLSYTTFSYKKLSIDAAGDV------------ 753
Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI---AGTHI 692
+ ++N GK +G EV +Y K G +
Sbjct: 754 ------------------------VSVSFTLKNTGKYEGDEVAQLYVKYQGSDSQVKLPL 789
Query: 693 KQVIGYERVFIAAGQSAKVGFTM 715
KQ+ G+ER+ + G+S ++ T+
Sbjct: 790 KQLKGFERIHLKKGESKQINLTV 812
>gi|315500297|ref|YP_004089100.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
excentricus CB 48]
gi|315418309|gb|ADU14949.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
excentricus CB 48]
Length = 882
Score = 267 bits (683), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 166/478 (34%), Positives = 244/478 (51%), Gaps = 49/478 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y DA P RA DLV RMTL EK Q+ + A +PRL + Y WW+E LHGV+ G
Sbjct: 35 YQDASKPPEARAADLVSRMTLEEKTAQLINDAPAIPRLNVREYNWWNEGLHGVAAAGY-- 92
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY-----NLGNA-- 126
AT FP + A+++E L ++ +T+S E RA Y G +
Sbjct: 93 --------------ATVFPQAVGLAATWDEPLIHRVAETISVEFRAKYLKERHRFGGSDW 138
Query: 127 --GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLT WSPNIN+ RDPRWGR ET GEDPY+ R + +VRGLQ + V Y
Sbjct: 139 FGGLTVWSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDDPVYY-------- 190
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
+ A KHYA + R + + D+ +T++ F + EG S+MC+YN
Sbjct: 191 -RTVATPKHYAVHSGPE---AGRHRDNVNPSPYDLADTYLPAFRATITEGQAGSIMCAYN 246
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
+NG P CA+ LL + +R DW F GY+VSDCD++ I SH + T E+ V +
Sbjct: 247 AINGQPACANEDLLVKYLRKDWGFKGYVVSDCDAVGDIYYKTSHAY-RPTPEEGVTAAYQ 305
Query: 303 AGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKN 360
G DL CG+ + AV+QG + E +DT+L L+ +LG FD + + +
Sbjct: 306 VGTDLICGNANEADHLTRAVRQGLLPEKTLDTALIRLFTARFKLGQFDPPAKVFPKITAE 365
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ P + + + + A +VLLKN+N LPL G + +A++GP+A++ +++GNY G P
Sbjct: 366 DYDTPANRDFSQKVAESAMVLLKNENNLLPLK-GEPRQIAVIGPNADSMDSLVGNYNGDP 424
Query: 421 CRYTSPMDGFYAY--SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
+ + G A + YAPG I + ++ A D+A D G+ +S
Sbjct: 425 SHPVTVLSGIRARFPKATVTYAPGSGLI----DPVMTAVPDSAFCRDEACTQTGVTVS 478
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/299 (32%), Positives = 145/299 (48%), Gaps = 52/299 (17%)
Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
+A+ AAK AD V VAGL VE E G DR L LP Q +++ +V+ A K
Sbjct: 598 SAVAAAKEADLVVFVAGLSQRVEGEEMRVETEGFSGGDRTTLNLPPAQQKVLEQVSAAGK 657
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
PV LV+++ A+ IN+A N + +I+ YPG +GG A+A +I G Y+P GRLP+T+Y
Sbjct: 658 -PVVLVLINGSALGINWADKN--VPAIIEAWYPGGQGGAAVARLIAGDYSPAGRLPVTFY 714
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ +P N GRTY++F G +YPFGYGLS+T F+Y + L
Sbjct: 715 RSA------DQLPAFNDYNMKGRTYRYFKGEALYPFGYGLSFTTFRY--------APLTL 760
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
Q D +V +V N G D EVV +Y P
Sbjct: 761 SARQVAGDGQVSVSA------------------------DVTNSGSRDSDEVVQLYVSYP 796
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
G I+ + +ER+ + AG++ V FT++ ++L V+ + + G + +G G
Sbjct: 797 GQKLAPIRALARFERIHLKAGETKTVRFTLDP-QALSTVNADGSRSVKPGKVELWLGGG 854
>gi|293371439|ref|ZP_06617870.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292633636|gb|EFF52194.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 1049
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 219/767 (28%), Positives = 354/767 (46%), Gaps = 100/767 (13%)
Query: 16 DAKLPYPERA----KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
++KLP+ A KDL+ RMT+ EK+ Q+ G L P E+ S++L +G
Sbjct: 328 NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386
Query: 72 ---------------------RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIG 110
R P D T FPT + + S++ + ++
Sbjct: 387 VLNISGAKTLRDLQEKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446
Query: 111 QTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
+ + E+ A AGL + ++P +++ RD RWGRV+E GED Y+ A V G Q
Sbjct: 447 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ +S + AC KH+ AY L G D D ++E+ + +T++ PF+
Sbjct: 501 ---WNLWENNS------VLACAKHWVAYGLPQ-AGRDYAPVD--MSERTLFDTYLPPFKA 548
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
C++ G V + M ++N +NGIP A P LL +RG WNF+G++VSD ++++ +V
Sbjct: 549 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVAQGVAE 607
Query: 290 NDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
+D +DA +G+D+D D Y + ++ GKI+ D+D S+ + + LG F
Sbjct: 608 DD--KDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665
Query: 349 DGSPQYKN--LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA 406
++ N I + ++ A + A + VLLKNDN LPL N++++A+VGP A
Sbjct: 666 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724
Query: 407 NATKAMIGNY--EGTPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDA 460
+ ++G++ G T+ + G + YA GC D ++ S A+
Sbjct: 725 DNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAVKL 783
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A +D + V G + E + R L LPG Q ELI ++ K PV +V+M+ + I
Sbjct: 784 ASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPLSI 842
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN-YVKIPYT-SMP 578
+ N + +IL + G G AIAD++FG YNP GRL I++ V + Y
Sbjct: 843 EWVDKN--VSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYKKS 900
Query: 579 LRPVNNFPGRTYKFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINY 636
RP + T + D P +YPFGYGLSYT F Y V S + Y
Sbjct: 901 GRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSYSVPQSTQK--------------EY 946
Query: 637 TVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQV 695
T + + + V N G DG E V +Y + +K++
Sbjct: 947 T------------------RQETISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVKEL 988
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++++F+ AG+S V F ++ +L D A N ++ G I+ G
Sbjct: 989 KAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034
>gi|395492941|ref|ZP_10424520.1| glycoside hydrolase family protein [Sphingomonas sp. PAMC 26617]
Length = 865
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 168/431 (38%), Positives = 229/431 (53%), Gaps = 44/431 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D P R DL+ RMTL EK QM ++A +PRLG+P Y++W+EALHGV+ G
Sbjct: 14 YFDPGQPIEARVDDLMRRMTLEEKAAQMQNVAPAIPRLGIPPYDYWNEALHGVARAGE-- 71
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I A+++ + GQTV+TE RA YN A
Sbjct: 72 --------------ATVFPQAIGMAATWDRDMMLAEGQTVATEGRAKYNQAQAQKNYDRY 117
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDPY+ G A+ +V G+Q +D+ L
Sbjct: 118 YGLTFWSPNINIFRDPRWGRGQETLGEDPYLTGTMAVPFVHGVQ---------GTDANYL 168
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
K A KH+A + R F+ + +D+ ET++ F + +G S+MC+YN
Sbjct: 169 KAIATPKHFAVHSGPE---QLRHQFNVDPSPRDLSETYLPAFRRAIVDGRAESLMCAYNA 225
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
V+ CA+ LL T+RG W F G++ SDC +I I H + T + A +KAG
Sbjct: 226 VDTKAACANTMLLKDTLRGAWGFKGFVTSDCGAIDDITTGHHN-SPTNPEGAALAVKAGT 284
Query: 306 DLDCGDYYTNF--TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
D C D+ AV+ G + E D+D +LR L+ M+LG FD + + + +
Sbjct: 285 DTGC-DFKDEMLDLPRAVKAGYLTEGDMDVALRRLFTARMKLGMFDPAARVPFSTISIAE 343
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+P H LA AAR+ IVLLKND G LPL G + +A+VGP A + A+ GNY GTP
Sbjct: 344 NHSPAHRALALRAARESIVLLKND-GVLPLAAG-ARRIAVVGPTAASLIALEGNYNGTPV 401
Query: 422 RYTSPMDGFYA 432
P+DG A
Sbjct: 402 GAVLPVDGMTA 412
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 83/271 (30%), Positives = 126/271 (46%), Gaps = 42/271 (15%)
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
G DR + LP Q++L++ + K P+ +V+ S A I K +++L YPGE
Sbjct: 621 GGDRTAIALPAAQSQLLDALFATGK-PLVIVLQSGSA--IALGAQEAKARAVLEAWYPGE 677
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYP 600
GG+AIA+V+ G NP GRLP+T+Y + +P RTY++F G V YP
Sbjct: 678 AGGQAIAEVLSGTVNPSGRLPVTFYAST------DQLPAFDDYRMANRTYRYFAGRVEYP 731
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FG+GLSYT+F Y A P + + + GT
Sbjct: 732 FGHGLSYTRFAYS-ALRPATSSVAAGQ-----------GT-------------------- 759
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKS 720
+ + V N G + G EV +Y PG G I+ + GY+RV +AAG++ + F + +
Sbjct: 760 SVSVAVRNTGVLAGDEVAQLYLSVPGREGAPIRSLKGYQRVHLAAGETKTLTFALEP-RD 818
Query: 721 LKIVDNAANSLLASGAHTILVGEGVGGVSFP 751
L + + A + + I VG G G P
Sbjct: 819 LALANAAGAMAVTKATYQIWVGGGQPGTGAP 849
>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 1286
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 158/442 (35%), Positives = 234/442 (52%), Gaps = 35/442 (7%)
Query: 2 FESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSE 61
F KV Y + + ERA DL+ R+TL EK +G+ +PRLG+ WSE
Sbjct: 21 FMPAKVSTKKPIYLNTSYSFEERAADLISRLTLEEKESLLGNSMAAIPRLGIKSMNVWSE 80
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
ALHG+ +G G + + G TSFP + ++++ +L ++ ++ EARA+
Sbjct: 81 ALHGI--LG-------GANQSVGISGPTSFPNSVALGSAWDPALMQREAMAIADEARAIN 131
Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
G GLT+WSP + +RDPRWGR E+ GEDP++ A +VRG+ +D
Sbjct: 132 QTGTKGLTYWSPVVEPIRDPRWGRTGESYGEDPFLAAEIAGGFVRGMV---------GND 182
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
LK C KHY A N DR S + +DM+E ++ P++ + + ++ S+M
Sbjct: 183 PTYLKSVPCAKHYFA----NNSEFDRHVSSSNMDSRDMREFYLAPYKKLIEQDNLPSIMS 238
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
SYN VNG+PT A L+ R + GYI DC +I+ I H ++ T E+A A+ L
Sbjct: 239 SYNAVNGVPTSASQLYLDTIARRTYGLKGYITGDCAAIEDIYTGHYYVK-TAEEATAKGL 297
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGK 359
KAG+D DCG Y + + A+++G I ADID +L ++IV MR G FD + Y
Sbjct: 298 KAGVDSDCGSIYQRYAIAALKKGLITMADIDRALLNIFIVRMRTGEFDPPAKVLYAQFQP 357
Query: 360 NNICNPQHIELAAEAARQGIVLLKN------DNGALPLNTGNIKTLALVGPHANATKAMI 413
N + +P + LA E A + VLLKN + ALPLN ++K +AL+GPHA+ K +
Sbjct: 358 NIVNSPANKALAKEIATKTPVLLKNNISLKTNRKALPLNPADLKKIALIGPHAD--KVEL 415
Query: 414 GNYEGTPCR--YTSPMDGFYAY 433
G Y G P + +P G Y
Sbjct: 416 GPYSGRPAQENMITPFAGIKKY 437
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 134/263 (50%), Gaps = 39/263 (14%)
Query: 472 GLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIM-SAGAVDINFAKNNPKIK 530
G D E DR+ LLLPG Q ELI VA A P T+V+M + G V++ KN I
Sbjct: 619 GTDEKTATEEADRLTLLLPGNQVELIKAVA--AVNPNTIVVMQTLGCVEVEEFKNLQNIP 676
Query: 531 SILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTY 590
I+WVGY G+ G AIA V+FG+ NPGG+L TWY++ T LR N GRT+
Sbjct: 677 GIIWVGYNGQAQGDAIASVLFGEVNPGGKLNGTWYKSVKDLPEITDYTLRGGNGKNGRTF 736
Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+FD V Y FG+G+SYT F+Y N+ + N +++
Sbjct: 737 WYFDKDVSYEFGFGMSYTTFEYS---------------------NFRISKN-----SIIP 770
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH---IKQVIGYERVFIAAGQ 707
D K T ++V+N GK++G EV+ VY K P + IK++ G++RV + AGQ
Sbjct: 771 HD------KITVSVDVKNTGKVEGDEVIQVYMKTPDSPASLQRPIKRLKGFKRVTLPAGQ 824
Query: 708 SAKVGFTMNACKSLKIVDNAANS 730
+ V +N C L D N+
Sbjct: 825 TKTVNIDIN-CADLWFWDMDKNT 846
>gi|237721771|ref|ZP_04552252.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
gi|229448640|gb|EEO54431.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
Length = 735
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 211/760 (27%), Positives = 355/760 (46%), Gaps = 97/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
Y D K P +R DL+ RMTL EK+ Q+ G VP +G +Y
Sbjct: 30 YKDPKAPIEKRVNDLLSRMTLEEKMMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYFE 89
Query: 59 WSEALHGV----SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
+ AL + R P +D+ T +P + S+N L ++ +
Sbjct: 90 TNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVSA 149
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
EAR + TF SP I+V RDPRWGRV E GEDPY G + V+G
Sbjct: 150 QEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKG------- 197
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
Y D S +++AC KHY Y G D + + +++Q + +T++LP+EM V G
Sbjct: 198 -YQGDDLSAENRMAACLKHYVGYGASE-AGRDYVY--TEISKQTLWDTYLLPYEMGVKAG 253
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P A+P ++ + ++ W G+IVSD +I+ + ++ L TK+
Sbjct: 254 -AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAATKK 310
Query: 295 DAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
+A AGL++D + Y V++G+++ A +D ++R + ++ RLG F+
Sbjct: 311 EAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERPYT 370
Query: 354 YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMI 413
K PQ +++AA A + +VLLKN+N LPL + K +A++GP A ++
Sbjct: 371 PATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLT--DKKKIAVIGPMAKNGWDLL 428
Query: 414 GNY--EGTPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
G++ G +G +A + YA GCA N A++AA+ +D V
Sbjct: 429 GSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGCA-TKGDNKEGFAEALEAARWSDVVV 487
Query: 469 IVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPK 528
+ G ++ E R + LP Q EL ++ A K P+ LV+++ +++N +
Sbjct: 488 LCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELN--RLELI 544
Query: 529 IKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS--MPLRPVNNFP 586
+IL + PG G +A ++ G+ NP G+L +T+ PY++ +P+
Sbjct: 545 SDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PYSTGQIPIYYNRRKS 596
Query: 587 GRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
GR ++ F + +YPFG+GLSYT+FKY GT
Sbjct: 597 GRGHQGFYKDITSDPLYPFGHGLSYTEFKY--------------------------GTVT 630
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERV 701
P V D + + ++ V N+G DG+E V + P + T +K++ +E+
Sbjct: 631 PSVTKVKRGD------RLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELKHFEKQ 684
Query: 702 FIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
I AG++ F ++ + V+ L +G + ILV
Sbjct: 685 LIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|435848436|ref|YP_007310686.1| beta-glucosidase-like glycosyl hydrolase [Natronococcus occultus
SP4]
gi|433674704|gb|AGB38896.1| beta-glucosidase-like glycosyl hydrolase [Natronococcus occultus
SP4]
Length = 771
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 206/704 (29%), Positives = 334/704 (47%), Gaps = 98/704 (13%)
Query: 86 PGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWG 144
P AT+FP +I ++++ L +++ +T+ E A+ G T SP ++V RD RWG
Sbjct: 113 PEATTFPQMIGMASTWDPELLEEVTETIRGELEAL------GTTHALSPVLDVARDLRWG 166
Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
RV ET GEDP +V A YV GLQ D R +SA KH+ + + G
Sbjct: 167 RVEETFGEDPLLVAAMACGYVSGLQ----------GDGRADGVSATLKHFVGHGATDG-G 215
Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
+R + V ++++E + P+E + D SVM +Y+ ++GIP + LL +RG
Sbjct: 216 KNRSSLN--VGPRELREVHLFPYEAAIRTADAESVMNAYHDIDGIPCASSEWLLTDLLRG 273
Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQ 322
++ F G +VSD S++ +V H N TK +A L+AGLD++ DYY + AV+
Sbjct: 274 EFGFDGTVVSDYYSVRHLVTEHGTAN-TKPEAATAALEAGLDVELPYTDYYGEHLITAVE 332
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
G+++E +D S+R + R G D + + L AAR+ + LL
Sbjct: 333 NGELSEKTLDESVRRVLREKARKGLLDDPSVDAEAAADAFRTDEAAALNRRAARRSMTLL 392
Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--------EGTPCRYTSPMDGFYAYS 434
KN+N LPL ++ A++GP A+A K ++G+Y E T+P+ +
Sbjct: 393 KNENELLPLTADSV---AVIGPKADAKKELLGDYAYAAHYPEEEYASDATTPLAALESRD 449
Query: 435 KV-INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE--------------- 478
+ ++Y GC + PAA A++AD + G +V+
Sbjct: 450 GLEVSYEQGCTVSGPSTDGFEPAA-QVAEDADVALAFVGARSAVDFSDGDASKEEKPSVP 508
Query: 479 --AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
EG D DL LPG Q ELI+++ + P+ +VI+S I + + ++L+
Sbjct: 509 TSGEGCDVTDLGLPGVQEELIDRLQETGT-PLAVVIVSGRPHSIE--RITADVPAVLYAW 565
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
PG+EGG AI DV+FG++NP GRLP++ ++ + + Y N ++Y + DG
Sbjct: 566 LPGDEGGSAIVDVLFGEHNPSGRLPVSLPKSVGQLPVYYNRKA-----NTANKSYVYTDG 620
Query: 596 PVVYPFGYGLSYTQFKYKVAS-SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
VYPFG+GLSYT+F+Y S S K V P V+
Sbjct: 621 EPVYPFGHGLSYTEFEYGTLSLSEKRVS---------------------PLETVVA---- 655
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGF 713
+ V N G G+EVV +Y+ + ++++IG+ERV + AG++ +V F
Sbjct: 656 --------SVPVTNEGDRSGAEVVQLYAHAANPSQARPVQELIGFERVPLEAGETKRVSF 707
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
++ + L D + + G + I VG + L +N
Sbjct: 708 ELSPTQ-LAFHDESMTLTVEEGPYEIRVGRSASDIVATDDLEVN 750
>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 163/460 (35%), Positives = 248/460 (53%), Gaps = 43/460 (9%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
++ V + PY + +L ERA DL++RMTL EKV QM + + + RLG+P Y+WW+EAL
Sbjct: 14 TLNVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEAL 73
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
HGV+ G+ AT FP I A+F+ + VS EARA Y+
Sbjct: 74 HGVARAGK----------------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHD 117
Query: 123 -------LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
G GLTFW+PNIN+ RDPRWGR +ET GEDPY+ + V+GLQ +
Sbjct: 118 FQRKGERDGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGGGTGK 177
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEG 234
Y K AC KHYA + W +R FD++ ++++D+ ET++ F+ V EG
Sbjct: 178 YD--------KAHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLSAFKTLVKEG 226
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTK 293
V VMC+YNR G P C++ +LL + +R DW + +VSDC +I +H + T
Sbjct: 227 KVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTA 286
Query: 294 EDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP- 352
A A + +G DL+CG Y++ AV++G I+E I+ S+ L +LG FD
Sbjct: 287 AAASADAVVSGTDLECGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDDAL 345
Query: 353 -QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
+ + + + + +H+ A E AR+ +VLL N N LPL+ +I+ +A++GP+AN +
Sbjct: 346 VSWSEIPYSVVESKEHVTKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNANDSVM 404
Query: 412 MIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGCADIVCQ 449
+ NY G P + + ++G + + Y GC + Q
Sbjct: 405 LWANYNGFPTKSVTILEGIKSKLPEGTVYYEKGCDYVNTQ 444
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/295 (30%), Positives = 139/295 (47%), Gaps = 53/295 (17%)
Query: 459 DAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPV 508
D A ADA + V GL ++E E DR ++ LP Q E++ + K PV
Sbjct: 595 DKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALKKTGK-PV 653
Query: 509 TLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN 568
V+ S + + + N + +IL YPG++GG A+ADV+FG YNP GRLP+T+Y ++
Sbjct: 654 IFVLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASS 711
Query: 569 YVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
+P + RTY++F G ++PFG+GLSYT F Y A K+DK
Sbjct: 712 ------NDLPDFEDYDMSNRTYRYFKGKALFPFGHGLSYTIFDYGKA--------KVDKQ 757
Query: 629 QQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA 688
N G T I ++N GK+DG EV+ VY + P
Sbjct: 758 ------NVRAGEG------------------MTLTIPLKNTGKLDGDEVIQVYLRNPADK 793
Query: 689 GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
IK + + RV + AGQ+ + + A + + + + N + + G + +L G
Sbjct: 794 EGPIKTLRAFRRVSLPAGQTENIRIELPAS-TFECFNPSTNRMEILPGKYELLYG 847
>gi|381200965|ref|ZP_09908097.1| beta-glucosidase [Sphingobium yanoikuyae XLDN2-5]
Length = 774
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 217/720 (30%), Positives = 344/720 (47%), Gaps = 107/720 (14%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P+ + E LHG + +G ATSFP I +S++ ++ +++
Sbjct: 121 RLGIPIL-FHEEGLHGYAAVG-----------------ATSFPQSIAMASSWDPAMLRQV 162
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
Q ++ E RA SP +++ RDPRWGR+ ET GEDPY+VG + V GLQ
Sbjct: 163 NQVIAREIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQ 217
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
GV R S + A KH + N + V+E++++E F PFE
Sbjct: 218 ---GVGRSRTLQSN--HVFATLKHLTGHGQPESGTN---IGPAPVSERELRENFFPPFEQ 269
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V + +VM SYN ++G+P+ A+ LL +R +W F G +VSD ++ ++ H
Sbjct: 270 VVKRTGIEAVMASYNEIDGVPSHANRWLLENILREEWGFRGAVVSDYSAVDQLMSIHHIA 329
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGA-VQQGKIAEADIDTSLRFLYIVLMRLGYF 348
+ E+A R L AG+D D + + T+G V++GK++EA +D ++R + + R G F
Sbjct: 330 ANL-EEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLF 388
Query: 349 DGSPQYKNLGKNNICNPQHIE-LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
+ +P I N + LA AA++ I LLKND G LPL T+A++GP +
Sbjct: 389 E-NPYADANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPEG--TIAVIGP--S 442
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKV---INYAPGC---------ADIV-----CQN 450
A A +G Y G P S ++G A I +A G AD V +N
Sbjct: 443 AAVARLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTKSDPAEN 502
Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAA 504
+I A++AA+N D ++ G EG DR L L Q EL + +
Sbjct: 503 RKLIAQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVSEQQELFDALKALG 562
Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
K P+T+V+++ + K + + +IL Y GE+GG A+AD++FG NPGG+LP+T
Sbjct: 563 K-PITVVLINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVTV 619
Query: 565 -YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
A + + Y P R Y F +YPFG+GLSYT F S+P+
Sbjct: 620 PRSAGQLPLFYNMKP------SARRGYLFDTTDPLYPFGFGLSYTSFSL---SAPRLSAT 670
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
K +GT K + ++V N G +G EVV +Y +
Sbjct: 671 K-------------IGTGG----------------KTSVSVDVRNTGAREGDEVVQLYIR 701
Query: 684 PPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ T +K++ G++RV + G+S V FT+ ++L++ ++ + ++ G I+ G
Sbjct: 702 DKVSSVTRPVKELKGFQRVTLKPGESRTVTFTV-GPEALQMWNDQMHRVVEPGDFEIMTG 760
>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
17393]
gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 864
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 158/421 (37%), Positives = 229/421 (54%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D K P ER DL+ R+T+ EK+ + + G+PRL +P Y +EALHGV GR
Sbjct: 28 YKDEKAPMHERIMDLLSRLTVEEKISLLRATSPGIPRLDIPKYYHGNEALHGVVRPGR-- 85
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L ++ +S EARA +N + G
Sbjct: 86 --------------FTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQ 131
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDPY+ G +V+GLQ D R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVMGTAFVKGLQG---------DDDR 182
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E ++ FE CV +G +S+M +Y
Sbjct: 183 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAY 238
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A +KA
Sbjct: 239 NALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIKA 297
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D + + A +Q + ADID++ + M+LG FD + Y +
Sbjct: 298 GLDLECGDDVFDEPLLSAYRQYMVTNADIDSAAYRVLRARMQLGLFDSGEKNPYTKISPA 357
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H E+A AAR+ IVLLKN LPLN +K++A+VG NA G+Y G+P
Sbjct: 358 VVGSAKHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGNCEFGDYSGSP 415
Query: 421 C 421
Sbjct: 416 V 416
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 150/289 (51%), Gaps = 55/289 (19%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + + V V G++ S+E EG+DR D+ LP Q E + ++ P +V++ AG+ +
Sbjct: 598 AVRECETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKV--NPNIVVVLVAGSSL 655
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
+N+ + + +I+ YPGE GG+A+A+V+FG YNPGGRLP+T+Y + ++P P
Sbjct: 656 AVNWMDEH--VPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-LDELP----P 708
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCRDIN 635
+ GRTYK+F G V+YPFGYGLSYT FKY +VA + +++
Sbjct: 709 FDDYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYSNLQVADGEEEINV------------ 756
Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQ 694
+FQ+ +N GK G EV VY K P +K+
Sbjct: 757 -------------------------SFQL--KNAGKYAGDEVAQVYVKLPERDEVMPVKE 789
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ G+ERV + +G++ K+ + L+ D A + SG +TI+VG
Sbjct: 790 LKGFERVALKSGENKKMTLKLRK-DLLRYWDEAKGKFVYPSGDYTIMVG 837
>gi|325104789|ref|YP_004274443.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
gi|324973637|gb|ADY52621.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
12145]
Length = 802
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 233/824 (28%), Positives = 362/824 (43%), Gaps = 150/824 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
+ D P +R +DL+ +MT+ EK Q L YG R+ +P EW W + G+
Sbjct: 48 FEDQSQPIEKRVEDLLSQMTVAEKTNQTATL-YGYGRVLKDEMPTSEWKKSIWKD---GI 103
Query: 67 SFIGRRTNSPP---------------------------------GTHFDSEVPG------ 87
+ + NS P G D G
Sbjct: 104 ANMDEALNSLPNNKKAQTEYSFPYSKHATAINTLQKWFIEETRLGIPVDFTNEGIHGLCH 163
Query: 88 --ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWG 144
AT F I +S+N++L +K G+ E +A+ G T ++P +++ RDPRWG
Sbjct: 164 DRATPFCAPIGIGSSWNKNLVRKAGEIAGREGKAL------GYTNVYAPILDLARDPRWG 217
Query: 145 RVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEG 204
RV+E GEDP++VG N V GLQ I+A KHYA Y +
Sbjct: 218 RVVECYGEDPFLVGELGKNMVSGLQSN--------------GIAATLKHYAVYSVPKGGR 263
Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
+ D VT +++ + + PF+ V E VM SYN +GIP L + +R
Sbjct: 264 DGHARTDPHVTPRELHQIHLYPFKKVVQEAKPLGVMSSYNDWDGIPVTGSYYFLTELLRK 323
Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGA 320
+ F+GY+VSD ++++ I H+ D KE +V LKAGL++ D Y N +
Sbjct: 324 QYGFNGYVVSDSEAVEFIASKHRVAKDFKEASVI-ALKAGLNVWTNFRQPDNYINNLRAS 382
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQG 378
V G + ++ +R + V RLG FD P +N ++ + P+ + A + ++
Sbjct: 383 VADGSLDMETLNQRVREVLSVKFRLGLFD-RPFTENPAASDKKVQTPEDKKFAEQMNKES 441
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK--- 435
IVLLKN N LPL+ + + + GP A I Y + TS +DG Y+
Sbjct: 442 IVLLKNGNDFLPLDKNKNQKILVTGPLAAEVGYTISRYGPSNNPSTSILDGLKQYNNGKL 501
Query: 436 VINYAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
I+YA GC + + +MI A+ AKN D + V G + + E
Sbjct: 502 NIDYAKGCEIVNEGWPGTEIIDEPVTEKEKAMIADAVAKAKNVDVIIAVVGENEKIVGES 561
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
R L LPG Q EL+ K A PV +V+++ + IN+ N + +IL + G
Sbjct: 562 LSRTSLNLPGRQLELL-KALHATGKPVVMVLVNGRPLTINW--ENHYLTAILETWFLGPS 618
Query: 542 GGRAIADVIFGKYNPGGRLPITW------YEANYVKIP--YTSMPLRPVNNFPGRTYKFF 593
G+ +A+ +FG YNPGG+L +T+ E N+ P + + P N F G++
Sbjct: 619 AGKVVAETLFGDYNPGGKLSVTFPKSIGQIEMNFPFKPGSHANQPSSGDNGF-GKSR--V 675
Query: 594 DGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
+G V+YPFGYGLSYT+F Y D+KLD +KP +
Sbjct: 676 NG-VLYPFGYGLSYTKFSYS--------DLKLD-------------FSKPDSISA----- 708
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVG 712
++N+GK DG EVV +Y + T+ Q+ +ER+ + AG++ ++
Sbjct: 709 ---------SFVLKNIGKRDGDEVVQLYFRDLISSVITYDTQLRAFERIHLKAGETKQLN 759
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNL 756
A K L I+D N + G +L+G + + L
Sbjct: 760 LKF-ARKDLAILDKDMNWAVEPGDFEVLIGSSSEDIRLKEKFTL 802
>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
Length = 1050
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 159/421 (37%), Positives = 231/421 (54%), Gaps = 45/421 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + P ER DL+ R+T+ EK+ + + G+ RL +P Y +EALHGV GR
Sbjct: 29 YKNENAPTHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-- 86
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
T FP I A++N L +++ +S EARA +N + G
Sbjct: 87 --------------FTVFPQAIGLAATWNPVLQEQVATVISDEARARWNELDQGREQKSQ 132
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDPY+ G +V+GLQ +DSR
Sbjct: 133 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQG---------NDSR 183
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + +++E+ ++E ++ FE CV +G +S+M +Y
Sbjct: 184 YLKIVSTPKHFAA----NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAY 239
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC +V +HK++ TKE A +KA
Sbjct: 240 NALNDVPCTLNAWLLTKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIKA 298
Query: 304 GLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG D Y + A +Q + +ADID++ + M+LG FD + Y +
Sbjct: 299 GLDLECGDDVYDEPLLSAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEKNPYTKISPA 358
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I + +H E+A AAR+ IVLLKN LPLN IK++A+VG NA + G+Y G P
Sbjct: 359 VIGSKEHQEVALNAARECIVLLKNQKKMLPLNAKKIKSIAVVG--INAGSSEFGDYSGLP 416
Query: 421 C 421
Sbjct: 417 V 417
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 148/289 (51%), Gaps = 55/289 (19%)
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-V 518
A + + V V G++ S+E EG+DR D+ LP Q E + ++ P +V++ AG+ +
Sbjct: 599 AVRECETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIVVVLVAGSSL 656
Query: 519 DINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP 578
+N+ + + +I+ YPGE GG+A+A+V+FG YNPGGRLP+T+Y + ++P P
Sbjct: 657 AVNWMDEH--VPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-LDELP----P 709
Query: 579 LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCRDIN 635
+ GRTYK+F G V+YPFGYGLSYT FKY +VA + V +
Sbjct: 710 FDDYDITKGRTYKYFKGDVLYPFGYGLSYTSFKYSNLQVADGEEEVSV------------ 757
Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQ 694
+FQ+ +N G+ G EV VY K P +K+
Sbjct: 758 -------------------------SFQL--KNTGRYAGDEVAQVYVKLPEREEVMPVKE 790
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
+ G+ERV + +G+S KV + L+ D A + SG + I+VG
Sbjct: 791 LKGFERVSLKSGESKKVTIKLRK-DLLRYWDEAKGKFIYPSGNYNIMVG 838
>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 682
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 199/616 (32%), Positives = 296/616 (48%), Gaps = 64/616 (10%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMG---------DLAYGVPRLGLPLYEWWSEA 62
PYCD L ER +DL+ +TL EKV +G V R+GLP Y W E
Sbjct: 70 LPYCDMSLSIDERLEDLLSHLTLDEKVDMIGADPTQDVCMTHTMNVSRIGLPDYYWLVE- 128
Query: 63 LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
TN+ G+ +E AT F + ASFN S W G TE RA+ N
Sbjct: 129 ----------TNTAVGSACIAENKCATEFSGPLSIAASFNRSSWFLKGSVFGTEQRALMN 178
Query: 123 LG----------NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE 172
+ + GLT + PNIN RDPR+GR E PGEDP++ G+YA + V+G+Q+
Sbjct: 179 VHGERFHTHSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMVQGMQE-- 236
Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
RD++ P K+ A KH+ AY + GND D ++ D+ +T++ +EM +
Sbjct: 237 -----RDANGYP-KVLAYLKHFTAYSREEGRGND----DYNISMYDLFDTYLPQYEMGMV 286
Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFH-GYIVSDCDSIQTIVESHKFLND 291
+G + VMCSYN VNGIP CA+ LLN+ +R WN ++ +DC ++ +
Sbjct: 287 QGGATGVMCSYNAVNGIPACANDYLLNKILRQRWNRSDAHVTTDCGAVNNL-RGKPIQAA 345
Query: 292 TKEDAVARVLKAGLDLDCGD--YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
+ A A L G D++ G + N T A+ G E ++ ++R Y G FD
Sbjct: 346 DEAQAAAMALMNGADIEMGSTLFVHNLTT-AITLGYATEEAVNQAIRRSYRPHFIAGRFD 404
Query: 350 GS--PQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
++ +LG ++I + +H E+ EAA QG+VLLK+++ LP+ G LA++GP
Sbjct: 405 DPTLSEWFSLGLDDIQSKKHQEIQLEAALQGLVLLKHEDSILPIAAGT--KLAVLGPLGM 462
Query: 408 ATKAMIGNYE--------GTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAID 459
++ +YE G C T + K A D+ +N S + +
Sbjct: 463 TRSGLMSDYESDQSCFGGGHDCIPTLAESIGFINGKEFTVAAAGVDVDSRNTSDVERILQ 522
Query: 460 AAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
A + D V+ G + E EG DR D LPG Q L V K PV LV+++ G +
Sbjct: 523 LAADRDLIVLCLGNTKTQEQEGFDRKDTALPGQQYALFEAVLTLRK-PVVLVLVNGGQIA 581
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
++ P +I+ P GG A+A +FG+ N G+LP T Y Y + M
Sbjct: 582 LDGMTGYP--SAIIEAFNPNGIGGTALAASLFGQENRWGKLPYTIYP--YSVMQSFDMKD 637
Query: 580 RPVNNFPGRTYKFFDG 595
++ PGRTY++F G
Sbjct: 638 HSMSAPPGRTYRYFTG 653
>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
[Teredinibacter turnerae T7901]
gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
turnerae T7901]
Length = 888
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 167/450 (37%), Positives = 239/450 (53%), Gaps = 53/450 (11%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D L R DLV RM L EK+ QM + + + LG+ Y+WW+EALHGV+ G+
Sbjct: 47 YMDTTLDIDTRVDDLVSRMDLAEKISQMYNESPAIEHLGIAEYDWWNEALHGVARAGK-- 104
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LGN 125
AT FP I A ++ I + VS EARA ++
Sbjct: 105 --------------ATVFPQAIGMAAMWDRETMFDIAEAVSDEARAKHHYFVENGVHFRY 150
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLTFWSPNIN+ RDPRWGR ET GEDPY+ G A+ Y+ GLQ + + L
Sbjct: 151 TGLTFWSPNINIFRDPRWGRGQETYGEDPYLTGELALPYISGLQG---------ENPKYL 201
Query: 186 KISACCKHYAAYDLDNWEGNDRF-HFDSRV-TEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K +A KH+A + G ++ H D+ + + +D+ ET++ FE V EGDV SVMC+Y
Sbjct: 202 KTAAMAKHFAVH-----SGPEKSRHSDNYIASPKDLNETYLPAFEKAVVEGDVESVMCAY 256
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVL 301
NRVN P C + LL +T+RG W F G++VSDC +I E+H + A V
Sbjct: 257 NRVNDEPACGNDMLLKETLRGKWGFKGHVVSDCGAIADFYAPEAHHVVMAPAAAAAWAV- 315
Query: 302 KAGLDLDCG-DYYTNFT--MGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKN 356
++G DL+CG D + F A+Q+ I + +ID S++ L +LG FD Q Y
Sbjct: 316 RSGTDLNCGTDRLSTFANLHFALQREMITQDEIDQSVKRLMKTRFKLGMFDPDDQVPYSK 375
Query: 357 LGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
+ + + + H+ L +AA + VLLKN +G LPL + +A++GP+A ++GNY
Sbjct: 376 IPMDVVGSQAHLALTQKAAEKSFVLLKN-SGILPLKKSS--KVAIIGPNATNPTVLVGNY 432
Query: 417 EGTPCRYTSPMDGFYAY--SKVINYAPGCA 444
G P + +P+DG Y + + YAPG A
Sbjct: 433 FGDPIKPVTPLDGIQQYLGEENVFYAPGSA 462
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 94/288 (32%), Positives = 140/288 (48%), Gaps = 65/288 (22%)
Query: 470 VAGLDLSVEAEG---KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
+ G ++SVE EG DR D+ LP Q +L+ + K P+ LV S A+ +N+A NN
Sbjct: 634 LEGEEMSVEIEGFDHGDRTDIRLPEPQRKLLATLKKLNK-PIVLVNFSGSAIALNWANNN 692
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
+ +IL YPGE G A+A +++G+ +P GRLPIT+Y R +++ P
Sbjct: 693 --VDAILQGFYPGEATGTALARILWGEVSPSGRLPITFY--------------RSLDDLP 736
Query: 587 G--------RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTV 638
G RTYK++ G V+YPFGYGLSYTQF Y S+P T+
Sbjct: 737 GFKDYAMTNRTYKYYQGDVLYPFGYGLSYTQFAYSELSAPA-----------------TM 779
Query: 639 GTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVI 696
+ +P +V N GK+ EVV VY K PG++ +++
Sbjct: 780 ASGEP----------------LAITAQVSNSGKVASDEVVQVYVSMKVPGLSLPQ-RELK 822
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
++R+++ G S V F++ A K L VD+ G T+ VG G
Sbjct: 823 EFKRIYLEPGASQTVEFSI-AGKDLSYVDDQGVRHPYHGPLTLSVGGG 869
>gi|384146876|ref|YP_005529692.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|340525030|gb|AEK40235.1| beta-glucosidase [Amycolatopsis mediterranei S699]
Length = 671
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 232/773 (30%), Positives = 346/773 (44%), Gaps = 146/773 (18%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQM--------GDLAYGVPRLGLPLYEWWSEALH 64
P+ DA+ RA +LV MTL EK+ Q+ +PRLG+P +
Sbjct: 13 PWRDARQSPDRRAAELVAAMTLDEKISQLHLQPDAEHQRFVPPIPRLGVPGF-------- 64
Query: 65 GVSFIGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
R N P G + P AT+ P + ++F+ L ++ G+ + E RA+ +
Sbjct: 65 ------RIANGPAGMGPADDKPQKPATALPATMALASTFDTGLARRYGRLIGDETRALAH 118
Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
+ G P+IN+ R PR GR E GEDP + G A +RG+Q+ +
Sbjct: 119 NVSEG-----PDINMARVPRNGRTFEGMGEDPVLAGALAAADIRGIQENGTI-------- 165
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
A KHYAA N + DR D + E+ + E ++ FE V EG SVMC+
Sbjct: 166 ------AEVKHYAA----NNQETDRHGIDEHIDERTLNEIYLPHFEQAVTEGHAGSVMCA 215
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
Y ++NG+ TC +P LL +R DW F G++ SD + + V S
Sbjct: 216 YPKINGVFTCENPALLQDKLRDDWGFKGFVQSDWGAAHSTVGS---------------AN 260
Query: 303 AGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
AG++L+ G +Y AV G+++E + L + + G FD P L
Sbjct: 261 AGMNLEMIDGTWYGEKMKQAVLAGQVSEQRVGELLLPRFRTMFAFGQFDHPPVASPL--- 317
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
QH A E A +G+VLL+N++ LPL+ G +K++AL+GP AT+A G +
Sbjct: 318 --PTAQHDAAAKEFAERGMVLLRNEHAQLPLDPG-VKSIALIGPF--ATRAKTGGGGSSA 372
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
TS +D + + PG + + S A A+ A+ +V++ G + EAE
Sbjct: 373 VIPTSTVDPLAGLQQRV---PGAV-VTLDDGSDPARAAALARTAEVSVVMVGDN---EAE 425
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIM-SAGAVDINFAKNNPKIKSILWVGYPG 539
GKDR L L G Q L+ VA+A P T+V++ S G V + + ++ +IL YPG
Sbjct: 426 GKDRPSLALDGNQDALVTAVAEA--NPHTVVVVKSGGPVLMPWVS---RVPAILQAWYPG 480
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR----------- 588
++ G A+A V+FG NP G+LPIT+ A+ P FPG
Sbjct: 481 QQDGAAVAGVLFGDVNPSGKLPITFPAAD------ADTPANTPAQFPGVGGVATYSEGLQ 534
Query: 589 -TYKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
Y++FD ++PFG+GLSYT F Y + S D
Sbjct: 535 IGYRWFDAQGRAPLFPFGHGLSYTTFAYSGLAVHNSGD---------------------- 572
Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
T V N G G+EV VY P AG +Q+ G+ERV +A
Sbjct: 573 --------------GATATFTVRNTGSRAGAEVAQVYLGFPVAAGEPPRQLKGFERVSLA 618
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVGEGVGGVSFPLQLNL 756
GQ+ +V ++ + + D AA++ A GA T+ VG S PLQ L
Sbjct: 619 PGQARRVTIRLDK-RDFSVWDTAAHAWQPARGAFTVSVGG--SSRSLPLQAPL 668
>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 163/460 (35%), Positives = 248/460 (53%), Gaps = 43/460 (9%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
++ V + PY + +L ERA DL++RMTL EKV QM + + + RLG+P Y+WW+EAL
Sbjct: 14 TLNVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEAL 73
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN- 122
HGV+ G+ AT FP I A+F+ + VS EARA Y+
Sbjct: 74 HGVARAGK----------------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHD 117
Query: 123 -------LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
G GLTFW+PNIN+ RDPRWGR +ET GEDPY+ + V+GLQ +
Sbjct: 118 FQRKGERDGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGGGTGK 177
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEG 234
Y K AC KHYA + W +R FD++ ++++D+ ET++ F+ V EG
Sbjct: 178 YD--------KAHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVKEG 226
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTK 293
V VMC+YNR G P C++ +LL + +R DW + +VSDC +I +H + T
Sbjct: 227 KVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTA 286
Query: 294 EDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP- 352
A A + +G DL+CG Y++ AV++G I+E I+ S+ L +LG FD
Sbjct: 287 AAASADAVVSGTDLECGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDDAL 345
Query: 353 -QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKA 411
+ + + + + +H+ A E AR+ +VLL N N LPL+ +I+ +A++GP+AN +
Sbjct: 346 VSWSEIPYSVVESKEHVAKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNANDSVM 404
Query: 412 MIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGCADIVCQ 449
+ NY G P + + ++G + + Y GC + Q
Sbjct: 405 LWANYNGFPTKSVTILEGIKSKLPEGTVYYEKGCDYVNTQ 444
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/295 (30%), Positives = 139/295 (47%), Gaps = 53/295 (17%)
Query: 459 DAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPV 508
D A ADA + V GL ++E E DR ++ LP Q E++ + K PV
Sbjct: 595 DKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALKKTGK-PV 653
Query: 509 TLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN 568
V+ S + + + N + +IL YPG++GG A+ADV+FG YNP GRLP+T+Y ++
Sbjct: 654 IFVLCSGSTLALPWEAEN--LDAILEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASS 711
Query: 569 YVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
+P + RTY++F G ++PFG+GLSYT F Y A K+DK
Sbjct: 712 ------DDLPDFEDYDMSNRTYRYFKGKALFPFGHGLSYTIFDYGKA--------KVDKQ 757
Query: 629 QQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA 688
N G T I ++N GK+DG EV+ VY + P
Sbjct: 758 ------NVRAGEG------------------MTLTIPLKNTGKLDGDEVIQVYLRNPADK 793
Query: 689 GTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
IK + + RV + AGQ+ + + A + + + + N + + G + +L G
Sbjct: 794 EGPIKTLRAFRRVSLPAGQTENIRIELPAS-TFECFNPSTNRMEILPGKYELLYG 847
>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 763
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 213/691 (30%), Positives = 334/691 (48%), Gaps = 100/691 (14%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
GAT FP + +++N L++ I + V+ E RA G +SP ++VVRDPRWGR
Sbjct: 123 GATVFPVPLTIGSTWNTELFRSISRAVAAETRA-----QGGAATYSPVLDVVRDPRWGRT 177
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
ET GEDP++V +A+ V+GLQ E ++ H + A KH+A Y N
Sbjct: 178 EETFGEDPHLVAEFAVAAVQGLQG-ERLDSH-------TSLLATLKHFAGYGASEGGRNG 229
Query: 207 R-FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
H R ++ E +LPF V G + S+M +YN ++G+P + LL +R
Sbjct: 230 APVHMGLR----ELHEVDLLPFRKAVESGAL-SIMTAYNEIDGVPCTSSRYLLQNVLREA 284
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQG 324
W F G++++DC +I + H E A + LKAG+D++ G + A++QG
Sbjct: 285 WGFDGFVITDCGAIHMLACGHNTAGSGVE-AATQSLKAGVDMEMSGTMFRAHLQQALEQG 343
Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
I E D++ + + + RLG FD + I +HI LA +AA +GIVLLKN
Sbjct: 344 LITEDDLNRAAGRVLELKFRLGLFDRPYVDPAWAEQVIGCKEHIALAYQAAAEGIVLLKN 403
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGF---YAYSKVINY 439
+ LPL++ + T+A++GP+A+ +G+Y P + + +DG S+V+ Y
Sbjct: 404 EGNLLPLDSSS-GTIAVIGPNAHTPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVL-Y 461
Query: 440 APGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA--------- 479
APGC I + P A+ A+ AD V+V G +DL A
Sbjct: 462 APGC-RIQGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGDAKS 520
Query: 480 -----EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
EG DR L L G Q EL+ ++ K PV +V ++ + + + I +I+
Sbjct: 521 DMECGEGIDRSTLTLMGVQLELLQELQKLGK-PVIVVYINGRPITEPWI--DEFIPAIIE 577
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFF 593
YPG+EGG AIAD++FG NP GRLP++ E + I Y + R G+ Y
Sbjct: 578 AWYPGQEGGGAIADMLFGDINPSGRLPLSIPKEVGQLPISYNARRTR------GKRYLET 631
Query: 594 DGPVVYPFGYGLSYTQFKY-KVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
D YPFG+GLSYT+F+Y ++ P V I +
Sbjct: 632 DLAPRYPFGFGLSYTEFRYGRLTVEPAVVPIGGEA------------------------- 666
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKV 711
T +I+V N G DG+EVV +Y + T ++ + G+ +VF+ AG++ +V
Sbjct: 667 --------TVRIDVTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEV 718
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
FT+ + + L+++ ++ G I VG
Sbjct: 719 TFTIGS-EQLELIGLDLKPVVEPGEFRIQVG 748
>gi|300783640|ref|YP_003763931.1| beta-glucosidase [Amycolatopsis mediterranei U32]
gi|399535524|ref|YP_006548186.1| beta-glucosidase [Amycolatopsis mediterranei S699]
gi|299793154|gb|ADJ43529.1| beta-glucosidase [Amycolatopsis mediterranei U32]
gi|398316294|gb|AFO75241.1| beta-glucosidase [Amycolatopsis mediterranei S699]
Length = 684
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 232/773 (30%), Positives = 346/773 (44%), Gaps = 146/773 (18%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQM--------GDLAYGVPRLGLPLYEWWSEALH 64
P+ DA+ RA +LV MTL EK+ Q+ +PRLG+P +
Sbjct: 26 PWRDARQSPDRRAAELVAAMTLDEKISQLHLQPDAEHQRFVPPIPRLGVPGF-------- 77
Query: 65 GVSFIGRRTNSPPGTHFDSEVPG--ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
R N P G + P AT+ P + ++F+ L ++ G+ + E RA+ +
Sbjct: 78 ------RIANGPAGMGPADDKPQKPATALPATMALASTFDTGLARRYGRLIGDETRALAH 131
Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
+ G P+IN+ R PR GR E GEDP + G A +RG+Q+ +
Sbjct: 132 NVSEG-----PDINMARVPRNGRTFEGMGEDPVLAGALAAADIRGIQENGTI-------- 178
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
A KHYAA N + DR D + E+ + E ++ FE V EG SVMC+
Sbjct: 179 ------AEVKHYAA----NNQETDRHGIDEHIDERTLNEIYLPHFEQAVTEGHAGSVMCA 228
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
Y ++NG+ TC +P LL +R DW F G++ SD + + V S
Sbjct: 229 YPKINGVFTCENPALLQDKLRDDWGFKGFVQSDWGAAHSTVGS---------------AN 273
Query: 303 AGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
AG++L+ G +Y AV G+++E + L + + G FD P L
Sbjct: 274 AGMNLEMIDGTWYGEKMKQAVLAGQVSEQRVGELLLPRFRTMFAFGQFDHPPVASPL--- 330
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
QH A E A +G+VLL+N++ LPL+ G +K++AL+GP AT+A G +
Sbjct: 331 --PTAQHDAAAKEFAERGMVLLRNEHAQLPLDPG-VKSIALIGPF--ATRAKTGGGGSSA 385
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
TS +D + + PG + + S A A+ A+ +V++ G + EAE
Sbjct: 386 VIPTSTVDPLAGLQQRV---PGAV-VTLDDGSDPARAAALARTAEVSVVMVGDN---EAE 438
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIM-SAGAVDINFAKNNPKIKSILWVGYPG 539
GKDR L L G Q L+ VA+A P T+V++ S G V + + ++ +IL YPG
Sbjct: 439 GKDRPSLALDGNQDALVTAVAEA--NPHTVVVVKSGGPVLMPWVS---RVPAILQAWYPG 493
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGR----------- 588
++ G A+A V+FG NP G+LPIT+ A+ P FPG
Sbjct: 494 QQDGAAVAGVLFGDVNPSGKLPITFPAAD------ADTPANTPAQFPGVGGVATYSEGLQ 547
Query: 589 -TYKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
Y++FD ++PFG+GLSYT F Y + S D
Sbjct: 548 IGYRWFDAQGRAPLFPFGHGLSYTTFAYSGLAVHNSGD---------------------- 585
Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIA 704
T V N G G+EV VY P AG +Q+ G+ERV +A
Sbjct: 586 --------------GATATFTVRNTGSRAGAEVAQVYLGFPVAAGEPPRQLKGFERVSLA 631
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVGEGVGGVSFPLQLNL 756
GQ+ +V ++ + + D AA++ A GA T+ VG S PLQ L
Sbjct: 632 PGQARRVTIRLDK-RDFSVWDTAAHAWQPARGAFTVSVGG--SSRSLPLQAPL 681
>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
Length = 942
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 229/811 (28%), Positives = 370/811 (45%), Gaps = 146/811 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D P R ++L+++MTL EK QM L YG R+ LP EW W + G+
Sbjct: 53 YEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKD---GI 108
Query: 67 SFIGRRTNS---------------PPGTH----------------------FDSE-VPG- 87
I N P H F +E + G
Sbjct: 109 GAIDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGV 168
Query: 88 ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
AT+FPT + ++N L +++G EAR + G T ++P ++V RD R
Sbjct: 169 ESYRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 222
Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
WGR E GE PY+V I VRGLQ H +++A KH+AAY +
Sbjct: 223 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATGKHFAAYSNNKG 269
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
D +++ ++++ I PF+ + E + VM SYN +GIP L +
Sbjct: 270 AREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRL 329
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
RG+ F GY+VSD D+++ + H D KE AV + ++AGL++ C D +
Sbjct: 330 RGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 388
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQ 377
V++G ++E I+ +R + V +G FD Q G + + E +A +A+ +
Sbjct: 389 ELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKEENEAIALQASHE 448
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV- 436
+VLLKN + LPL+ + K +A+ GP+AN + +Y T+ ++G +K
Sbjct: 449 SVVLLKNADELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSK 508
Query: 437 --INYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEA 479
+ Y GC D+V + + I A++ A+ AD V+V G
Sbjct: 509 AEVLYTKGC-DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCG 567
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E K R L LPG Q +L+ + K PV L++++ + IN+A + + +IL YPG
Sbjct: 568 ENKSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPG 624
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF-----PGRTYKF-- 592
+GG A+AD++FG YNPGG+L +T + +IP+ + P +P + PG T
Sbjct: 625 SKGGTALADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSR 682
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
+G +YPFGYGLSYT F+Y D++ T P +A
Sbjct: 683 ING-ALYPFGYGLSYTTFEYS-------------------DLDITPRVITPNESA----- 717
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
T +++V N GK G EVV +Y + T+ K + G++R+ + G++ ++
Sbjct: 718 --------TVRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQEL 769
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
FT++ K L+++D ++ G ++ G
Sbjct: 770 SFTIDR-KHLELLDADMKWVVEPGDFVLMAG 799
>gi|423214254|ref|ZP_17200782.1| hypothetical protein HMPREF1074_02314 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693199|gb|EIY86434.1| hypothetical protein HMPREF1074_02314 [Bacteroides xylanisolvens
CL03T12C04]
Length = 735
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 223/774 (28%), Positives = 356/774 (45%), Gaps = 105/774 (13%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP 49
S K K S Y DAK P +R DL+ RMTL EK+ Q+ G VP
Sbjct: 20 SAKDKKSIPLYKDAKAPIEKRVDDLLSRMTLEEKILQLNQYTMGRNNNVNNIGEEVKKVP 79
Query: 50 -RLGLPLYEWWSEALHG----VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNES 104
+G +Y + AL + R P +D+ T +P + S+N
Sbjct: 80 AEIGSLIYYDTNPALRNNVQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLGQACSWNPE 139
Query: 105 LWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINY 164
L +K + EAR + TF SP I+V RDPRWGRV E GEDPY G +A
Sbjct: 140 LVEKACAVTAQEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYANGVFAAAS 194
Query: 165 VRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFI 224
VRG Y D S +I+AC KHY Y G D + + ++ Q + +T++
Sbjct: 195 VRG--------YQGDDMSAEDRIAACLKHYIGYGASE-AGRDYVY--TEISAQTLWDTYL 243
Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
LP+EM V G +++M S+N ++G+P A+ + + ++ W G+IVSD +I+ +
Sbjct: 244 LPYEMGVKAG-AATLMSSFNDISGVPGSANHYTMTEILKERWGHDGFIVSDWGAIEQL-- 300
Query: 285 SHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
++ L K++A AGL++D + Y + V++GKI A +D S+R + V
Sbjct: 301 KNQGLAANKKEAAVYAFNAGLEMDMMSHAYDRYMKELVEEGKITMAQVDESVRRVLRVKF 360
Query: 344 RLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
RLG F+ K PQ +++AA+ A + +VLLKN+NG LPL + K +A+VG
Sbjct: 361 RLGLFERPYTPVTNEKERFFRPQSMDIAAQLAAESMVLLKNENGILPLT--DKKKIAVVG 418
Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV---------INYAPGCADIVCQNNSMI 454
P A ++G++ C + D Y+ + + YA GC+ N
Sbjct: 419 PMAKNGWDLLGSW----CGHGKDTDVAMLYNGLATEFVGKAELRYALGCS-TQGDNRKGF 473
Query: 455 PAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
A++AA+ +D V+ G ++ E R + LP Q EL ++ A K P+ LV+++
Sbjct: 474 EEALEAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAKELKKAGK-PIVLVLVN 532
Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
+++N + P +IL + PG G +A ++ G+ NP G+L +T+ PY
Sbjct: 533 GRPLELN--RLEPISDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PY 582
Query: 575 TS--MPLRPVNNFPGRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
++ +P+ GR ++ F + +Y FG+GLSYT+FKY
Sbjct: 583 STGQIPIYYNRRKSGRGHQGFYKDITSEPLYSFGHGLSYTEFKY---------------- 626
Query: 629 QQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA 688
GT P V + K + ++ V N GK DG E V + P +
Sbjct: 627 ----------GTVTPSVTTV------KRGGKLSVEVSVSNTGKRDGLETVHWFISDPYCS 670
Query: 689 GTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
T +K++ +E+ I AG++ F ++ + V+ L G + I V
Sbjct: 671 ITRPVKELKHFEKQLIKAGETKVFRFDVDLERDFGFVNGNGKRFLEIGEYYIQV 724
>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 855
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 161/451 (35%), Positives = 237/451 (52%), Gaps = 46/451 (10%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY + L ERA+DLV R+TL EK M D++ +PRLG+ + WWSEALHG +
Sbjct: 13 PYQNPNLSPEERAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHGFA----- 67
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNA---- 126
T FP + ASF++ L ++ S E RA Y+ L N
Sbjct: 68 -----------NNDDVTVFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENK 116
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
L+ W+PN+N+ RDPRWGR ET GEDPY+ R + V+GLQ E +Y
Sbjct: 117 RFLSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAKYK------ 170
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFD-SRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KHYA + W R + + V+++D+ ET++ F++ V + +V VMC+
Sbjct: 171 --KLLACAKHYAVHSGPEW---SRHELNLNNVSQRDLWETYLPAFKVLVQDANVRQVMCA 225
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
Y R++ P C +LL Q +R W F +VSDC +IQ SH +D A A+ +
Sbjct: 226 YQRLDDEPCCGSDRLLQQILREKWGFEHLVVSDCGAIQDFYTSHNVSSDAVH-AAAKAVL 284
Query: 303 AGLDLDCGDYYTNFTM--GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
AG D++C N+ + AV++G + E DID S++ + I LG D Y +
Sbjct: 285 AGTDVECQWDKHNYKLLPEAVEKGLVKEEDIDRSVKRVLIGRFELGEMDPDEIVPYAQIP 344
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ I N +H +LA + AR+ + LL+N N LPL+ G + +A++GP+A+ + GNY G
Sbjct: 345 ASVINNEEHRQLALKMARESMTLLQNKNNILPLSKGQDR-IAVIGPNADDEPMLWGNYNG 403
Query: 419 TPCRYTSPMDGFYAY--SKVINYAPGCADIV 447
TP R S +DG + K I Y C D+V
Sbjct: 404 TPVRTISILDGITSKIGEKSIVYDKAC-DLV 433
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 126/292 (43%), Gaps = 53/292 (18%)
Query: 462 KNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLV 511
K + + V GL +E E G DR D+ LP Q + + DA K ++
Sbjct: 590 KGIETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQRNCLKTLKDAGK---KVI 646
Query: 512 IMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVK 571
++ I +IL Y GE GG+A+ADV+FG YNP G+LP+T+Y+
Sbjct: 647 FVNNSGSAIGLVPETTSCDAILQAWYGGESGGQAVADVLFGDYNPSGKLPVTFYKDT--- 703
Query: 572 IPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQC 631
T +P + GRTY+F ++PFG+GLSYT FK A +LDK +
Sbjct: 704 ---TQLPDFEDYSMNGRTYRFMKAEPLFPFGHGLSYTNFKIGEA--------QLDKSE-- 750
Query: 632 RDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH 691
I+ + N I + N GK +G E++ VY G+
Sbjct: 751 --IDTSSSVN--------------------ITISISNEGKTEGVEIIQVYVHKQGLEEGP 788
Query: 692 IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
IK + G++RV + + V + S + D A S+ + G + I G
Sbjct: 789 IKTLKGFKRVNLKPNEMKNVTINL-PSNSFEFYDKKARSMKVMPGNYEIFYG 839
>gi|293371677|ref|ZP_06618088.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292633374|gb|EFF51944.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 783
Score = 266 bits (680), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 214/725 (29%), Positives = 324/725 (44%), Gaps = 119/725 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ EA HG IG AT FPT I A+++ L +++
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPQLIREV 172
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G+ + E R G + P +++ RDPRW RV ET GEDP + G V GL
Sbjct: 173 GKAIGKEIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAIVEGLG 227
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
SRP A KH+ AY + N F +++ E F+ PF
Sbjct: 228 G--------GDLSRPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQ 276
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++G+P A+ LL + +R +W F G +VSD SI+ I +SH F+
Sbjct: 277 AIDAGALS-VMTSYNSMDGVPCTANHSLLTELLRNEWKFSGIVVSDLYSIEGIHQSH-FV 334
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
T E+A L AG+D+D G D Y N M AV G+I + +D S+ + + +G F
Sbjct: 335 APTMEEAAVLALSAGVDVDLGGDAYMNL-MNAVNTGRIGKTALDASVARVLRLKFEMGLF 393
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ K + + + + LA A+ I LLKN++ LPLN + +AL+GP+A+
Sbjct: 394 ENPYVDPEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKN--RKVALIGPNADN 451
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCA--DIVCQNNSMIPAAIDAAK 462
M+G+Y + +DG S + Y GC+ D V + I A+ AA+
Sbjct: 452 RYNMLGDYTAPQEEANIKTVLDGIRTKLSSSQVEYVKGCSIRDTVTTD---IEQAVAAAQ 508
Query: 463 NADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINK 499
++ + V G + + EG DR L L G Q EL+
Sbjct: 509 RSEIIIAVVGGSSARDFKTSYKETGAAIANEKTISDMECGEGFDRATLSLLGKQQELLKA 568
Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
+ K P+ +V + +D N+A N ++L YPG+EGG AIADV+FG +NP GR
Sbjct: 569 LKTTGK-PLVVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIAIADVLFGDFNPAGR 625
Query: 560 LPITWYEANYVKIPYTSMPLRPVNNFP-GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP 618
LP + V +PL P Y +YPFGYGLSYT F Y
Sbjct: 626 LPFS------VPRSVGQIPLYYNKKAPQSHDYVEMSASPLYPFGYGLSYTSFDYS----- 674
Query: 619 KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVV 678
D++ + T + F +V N GK DG EV
Sbjct: 675 --------------DLHLSALTPR----------------SFEVSFKVRNTGKYDGEEVA 704
Query: 679 MVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAH 737
+Y + + +KQ+ + R ++ G+ +V F ++ + +VD S++ G
Sbjct: 705 QLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKSIVEPGTF 763
Query: 738 TILVG 742
I++G
Sbjct: 764 QIMIG 768
>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
Length = 864
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 163/448 (36%), Positives = 238/448 (53%), Gaps = 47/448 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y ++ L ERA+DL++++TL EKV M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I ASF I VS EARA +A
Sbjct: 82 --------------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERY 127
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ +N V+GLQ D++ +
Sbjct: 128 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCT-------DANQKYD 180
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
KI AC KH+A + W +R F++ + +D+ ET+++PFE V EG V VMC+YN
Sbjct: 181 KIHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYN 237
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R+ G P C +LL Q +R +W + G ++SDC +I + HK + E A A +
Sbjct: 238 RLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHK-THPNAESASAAAVL 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
+G DL+CG Y A ++G I+E DID S++ L LG D ++ + +
Sbjct: 297 SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+C+ +H L+ + AR+ + LL N N LPL G +T+A++GP+AN + GNY GTP
Sbjct: 356 VVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTP 414
Query: 421 CRYTSPMDGFYAY----SKVINYAPGCA 444
+ ++G + K+I Y GC+
Sbjct: 415 KHTITLLEGIRSAMGENDKLI-YEQGCS 441
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 133/300 (44%), Gaps = 54/300 (18%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
I + K+AD + G+ S+E E DR D+ LP Q ELI + DA
Sbjct: 592 IKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCDA 651
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K ++ ++ I ++IL YPG+ GG+A A+V+FG YNP GRLP+T
Sbjct: 652 GK---KVIFVNFSGSPIAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNPAGRLPVT 708
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y +P N GRTY++F G ++PFGYGLSYT F Y +I
Sbjct: 709 FYRN------IAQLPDFEDYNMTGRTYRYFKGDPLFPFGYGLSYTTFNYD--------NI 754
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
KLD+ + + V I V N G DG EVV VY K
Sbjct: 755 KLDQTIKVGETAKMV-------------------------IPVTNAGNRDGEEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
A K + ++RV I AG++ V + K L+ D N++ +G I+VG
Sbjct: 790 KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP-KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|315499711|ref|YP_004088514.1| beta-glucosidase [Asticcacaulis excentricus CB 48]
gi|315417723|gb|ADU14363.1| Beta-glucosidase [Asticcacaulis excentricus CB 48]
Length = 869
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 169/428 (39%), Positives = 230/428 (53%), Gaps = 49/428 (11%)
Query: 29 VERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGA 88
+ RMT+ +K QM + A +P GL YEWW+E LHGV+ G A
Sbjct: 40 IARMTVEQKAAQMQNRAPDLPSAGLTAYEWWNEGLHGVARAGE----------------A 83
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--------GLTFWSPNINVVRD 140
T FP I A++N +L K++G VSTEARA +N + GLT WSPNIN+ RD
Sbjct: 84 TVFPQAIGLAATWNPALLKQVGDVVSTEARAKFNSTDPAGDHQRYYGLTLWSPNINIFRD 143
Query: 141 PRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLD 200
PRWGR ET GEDP++ R A +V GLQ D + K+ A KH A +
Sbjct: 144 PRWGRGQETYGEDPFLTSRLAEGFVTGLQG---------PDPQHPKVVASVKHLAVHSGP 194
Query: 201 NWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
R F + V+ D++ T++ F V SVMC+YN V G+P CA LL
Sbjct: 195 E---AGRHGFAASVSPYDLEMTYLPAFRYSVMTTKAQSVMCAYNAVGGVPACASDLLLKT 251
Query: 261 TIRGDWNFHGYIVSDCDSIQTIVESHKF-LNDTKEDAVARVLKAGLDLDCGDYYTNFTMG 319
+R W F GY+V+DCD+I + H + LND + A + LKAG+DL+CG+ Y
Sbjct: 252 YVREAWGFKGYVVTDCDAIYDMTRFHFYRLNDAESSAES--LKAGVDLNCGNAYAALPE- 308
Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ-YKNLGKNNICNPQHIELAAEAARQG 378
AVQ+G I E+ +D SL L V RLG DG+P + + I PQ LA +AA Q
Sbjct: 309 AVQKGLIPESLMDQSLNRLLDVRKRLG-IDGAPSPWARISPEAINTPQAQGLALQAAEQS 367
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SK 435
+VLLKN NG LPL G +T+A++GP+A+ + + GNY G + +P+ G A +K
Sbjct: 368 LVLLKN-NGVLPLKPG--QTVAVIGPNADTEETLRGNYNGIARQPVTPLTGLRAQLGAAK 424
Query: 436 VINYAPGC 443
V+ YA G
Sbjct: 425 VL-YAQGA 431
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 134/290 (46%), Gaps = 49/290 (16%)
Query: 468 VIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNP 527
++V G D DR DL LP Q +L+ V K P+ +V++S AV +N+A +
Sbjct: 620 ILVPGFDRG------DRTDLGLPRTQEDLLKAVKATGK-PLVVVLLSGSAVALNWADAHA 672
Query: 528 KIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG 587
W YPGE GG AIA + G+ NP GRLP+T+Y + P+ + G
Sbjct: 673 DAVVAAW--YPGEAGGTAIARTLTGEANPSGRLPVTFYRSVQDLPPFIDYRME------G 724
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
RTY++F G +YPFG+GLSYTQF Y D+KLD T+ +P
Sbjct: 725 RTYRYFKGKPLYPFGHGLSYTQFSYS--------DLKLDTS--------TLTAGQP---- 764
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQ 707
+ V N G+ G EVV +Y K P G + + + RV + AG+
Sbjct: 765 ------------LRVSVRVRNNGQRAGDEVVQLYVKRPDTFGLN-ASLAAFARVSLKAGE 811
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
S V T++ + L V + +GA+ + VG G G + L + +
Sbjct: 812 SRTVVMTIDP-RDLSTVTLEGERAIRAGAYGLSVGGGQPGFAPTLNADFS 860
>gi|94497563|ref|ZP_01304132.1| xylosidase/arabinosidase [Sphingomonas sp. SKA58]
gi|94422980|gb|EAT08012.1| xylosidase/arabinosidase [Sphingomonas sp. SKA58]
Length = 774
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 227/752 (30%), Positives = 348/752 (46%), Gaps = 114/752 (15%)
Query: 20 PYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGT 79
P R +D + + L +Q+ A RLG+P+ + E LHG + +G
Sbjct: 93 PRVARGRDPRQTVALVNALQKW---AMTQTRLGIPIL-FHEEGLHGYAAVG--------- 139
Query: 80 HFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVR 139
ATSFP I +S++ L +++ ++ E R SP +++ R
Sbjct: 140 --------ATSFPQSIALASSWDPHLVQQVNSVIAREIRV-----RGVPMVLSPVVDIAR 186
Query: 140 DPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDL 199
DPRWGR+ ET GEDPY+VG + V GLQ EG R D RP K+ A KH +
Sbjct: 187 DPRWGRIEETYGEDPYLVGEMGVAAVEGLQG-EG----RSHDLRPGKVFATLKHLTGHGQ 241
Query: 200 DNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLN 259
N + ++E++++E F PFE V +++VM SYN ++G+P+ + LL+
Sbjct: 242 PESGTN---VGPAPISERELRENFFPPFEQVVKRTGINAVMASYNEIDGVPSHMNRWLLD 298
Query: 260 QTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMG 319
+RG+W F G +VSD + ++ H E A R L AG+D D + + T+G
Sbjct: 299 DVLRGEWGFRGAVVSDYSGVDQLMNIHHVAGSLDE-AARRALDAGVDADLPEGLSYATLG 357
Query: 320 -AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQ 377
V+ GK++EA +D ++R + + R G F+ P + N LA AA++
Sbjct: 358 DQVRAGKVSEAQVDKAVRRMLELKFRAGLFE-HPYADAAQAVALTNDAEARALARTAAQR 416
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SK 435
I LLKND G LPL ++A++GP +A A +G Y G P S +DG A +
Sbjct: 417 SITLLKND-GMLPLKVEG--SIAVIGP--SAAVARLGGYYGQPPHVVSILDGIKARVGDR 471
Query: 436 V-INYAPGC---------ADIV-----CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
V I +A G AD V +N +I A++AA+N D V+ G E
Sbjct: 472 VRIVFAQGVKITQDDDWWADKVDKADPAENRRLIAQAVEAARNVDRIVLTLGDTEQSSRE 531
Query: 481 G------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILW 534
G DR L L G Q EL + + K P+T+V+++ + K + + ++L
Sbjct: 532 GWAANHLGDRPSLDLVGEQQELFDALKTLGK-PITVVLINGRPA--STVKVSEEANALLE 588
Query: 535 VGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF---PGRTYK 591
Y GE+GG A+AD++FG NPGG+LP+T +P + L N GR Y
Sbjct: 589 GWYLGEQGGHAVADILFGDVNPGGKLPVT--------VPRSVGQLPAFYNVKPSAGRGYL 640
Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
F +YPFG+GLSYT F T PP L
Sbjct: 641 FDTNAPLYPFGFGLSYTNF-----------------------------TLSPPR---LAQ 668
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAK 710
+ ++V N G DG EVV +Y + T IK++ G+ERV + G+
Sbjct: 669 SSIGPGGTTSVTVDVRNDGARDGDEVVQLYIHDKVSSVTRPIKELKGFERVSLKPGEVRT 728
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V FT+ +SL++ ++ + ++ G I+ G
Sbjct: 729 VRFTIT-PESLQMWNDKMHRVVEPGEFEIMTG 759
>gi|86142030|ref|ZP_01060554.1| putative beta-glucosidase [Leeuwenhoekiella blandensis MED217]
gi|85831593|gb|EAQ50049.1| putative beta-glucosidase [Leeuwenhoekiella blandensis MED217]
Length = 803
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 229/731 (31%), Positives = 347/731 (47%), Gaps = 123/731 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ EA+HG IG T FP+ I ++FN L KK+
Sbjct: 135 RLGIPLF-LAEEAMHGHMAIG-----------------TTEFPSAIGQASTFNPQLNKKM 176
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G V+ E RA + P +++ R+PRW RV ET GEDPY++ + + G Q
Sbjct: 177 GAAVAKELRA-----QGAHIGYGPILDLAREPRWSRVEETFGEDPYLISEMGLGVIEGFQ 231
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND-RFHFDSRVTEQDMQETFILPFE 228
EG+E P + + KH+AAY + N H R QD ++ PF+
Sbjct: 232 G-EGIE-------NPESVISTLKHFAAYGVSEGGHNGGAVHIGQRELMQD----YMYPFK 279
Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
++ G V SVM +Y+ V+GIP+ ++ LL +R W F G++VSD SI+ I H
Sbjct: 280 KAIDAG-VLSVMTAYSSVDGIPSTSNKALLTGLLREQWGFEGFVVSDLASIEGIKGDHH- 337
Query: 289 LNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
T EDA A + AG+D D G + + + + A + GK++EA +D +++++ + ++G
Sbjct: 338 AAATFEDAAALAMNAGVDADLGGNGFDDELLNAFKNGKVSEARLDEAVKYVLRLKFKMGL 397
Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
F+ + K + + HI +A E A +G+ LLKN+NG LPL+ +K +A++GP+A+
Sbjct: 398 FENPYVEEKAPKKVVRSAAHIAIAKEMALEGVTLLKNENGLLPLSK-ELKKIAVIGPNAD 456
Query: 408 ATKAMIGNYEG--TPCRYTSPMDGFYAY--SKVINYAPGCADIVCQNNSMIPAAIDAAKN 463
+G+Y P +P++G A I Y G A I + IPAA+ AAK+
Sbjct: 457 MMYNQLGDYTAPQEPEFIVTPLEGIRAKMPKAEITYVKGTA-IRDTTQTDIPAAVAAAKS 515
Query: 464 ADATVIVAG----LDLSVE----------------------AEGKDRVDLLLPGFQTELI 497
A+ ++V G D E EG DR L L G Q EL+
Sbjct: 516 AEVAIVVLGGSSARDFKTEYLETGAATVSSKEDQVLSDMESGEGYDRSTLDLMGKQLELL 575
Query: 498 NKVADAAKGPVTLVIMSAGAVDINF-AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
V +A P LV+++ + IN+ AK+ P I W YPG +GG A+ADV+FG YNP
Sbjct: 576 QAV-EATGTPTILVLITGRPLLINWPAKHIPAIIDT-W--YPGSQGGHALADVLFGDYNP 631
Query: 557 GGRLPITWYEANYVKIPYTSMPLRPV--NNF--PGRTYKFFDGPVVYPFGYGLSYTQFKY 612
GRLP++ IP S+ PV N++ R Y +Y FG+GLSYT F Y
Sbjct: 632 AGRLPVS--------IP-KSVGQSPVYYNHWWPKRRDYVEETSAPLYAFGHGLSYTTFDY 682
Query: 613 KVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKM 672
D+K+ + + V +EV N G
Sbjct: 683 S--------DLKISQSGNATNTTIEV------------------------SVEVTNTGDR 710
Query: 673 DGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
DG EVV +Y S T +KQ+ G+ER+ + G+S V F + + L + D N +
Sbjct: 711 DGDEVVQLYLSDVVSSVVTPVKQLRGFERIHLDKGESKTVTFILTPAE-LALFDAEMNHV 769
Query: 732 LASGAHTILVG 742
+G + +G
Sbjct: 770 AEAGEFEVQLG 780
>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
Length = 929
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 149/422 (35%), Positives = 238/422 (56%), Gaps = 38/422 (9%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
P+ D L + ERAK+LV +TL EK+ Q+G +PRL + Y +W+EA+HGV+ G
Sbjct: 41 PFQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARSGL- 99
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
ATSFP +++++ L S EAR N + GL +W
Sbjct: 100 ---------------ATSFPVSKAMSSTWDLPLIFDCAVATSDEARVYSNTKDKGLIYWC 144
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
P IN+ RDPRWGR E GEDP++ G+ A+ Y++G+Q D + K A K
Sbjct: 145 PTINMSRDPRWGRDEENYGEDPFLTGKIAVEYIKGMQ---------GDDPKYYKTIATAK 195
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+AA +N+E R S + ++++E ++ FEM V EG+V SVM +YN +NGIP
Sbjct: 196 HFAA---NNYE-KGRHSTSSDMDARNLREYYLPAFEMAVKEGNVRSVMSAYNALNGIPCG 251
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARVLKAGLDLDCG 310
A+ +LL +R +W F+G++ SDC ++ + +S H F+N T +A A + G DL+CG
Sbjct: 252 ANHELLIDILRTEWGFNGFVTSDCGAVDDVYQSNRHHFVN-TAAEASAVSIVNGEDLNCG 310
Query: 311 DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHI 368
+ + ++ A+++G + EAD+DT+L ++ +G FD + ++++ + + +H
Sbjct: 311 NTFQDYCKEAIEKGYMQEADLDTALVRVFEARFSVGEFDNASNVPWRSISDDVLDCEEHR 370
Query: 369 ELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMD 428
+LA +AA++ IVLLKNDN LPL+ K++A++GP N +G Y G+P T+P
Sbjct: 371 QLAYKAAQEAIVLLKNDNNILPLD--KTKSVAVIGPFGNTI--TLGGYSGSPTALTTPFG 426
Query: 429 GF 430
G
Sbjct: 427 GI 428
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 133/274 (48%), Gaps = 47/274 (17%)
Query: 442 GCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVA 501
GCA + + + A + A AD + AG DL+V E DR +L LPG Q +L+ V
Sbjct: 592 GCA-VTGTAETNLERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAVY 650
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
+A V L++ + +V IN+AK + + +I+ Y G+ G+AIADV++G YNP G+L
Sbjct: 651 -SANPNVILLLQTCSSVTINWAKEH--VPAIIEAWYGGQAQGKAIADVLYGDYNPSGKLT 707
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGR----TYKFFDGPVVYPFGYGLSYTQFKYKVASS 617
TWY A + +P + N+ R TY + D +YPFGYG+SYT F+Y+ +
Sbjct: 708 STWYNA------LSDLP-NGMLNYDIRDAKYTYMYHDKTPLYPFGYGMSYTTFEYQKLNI 760
Query: 618 PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV 677
KS +L ++ ++ N GK G+E+
Sbjct: 761 SKS---RLAAGEE-----------------------------LIVSADITNTGKYAGAEI 788
Query: 678 VMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKV 711
V +Y+ +KQ++G+ RV + G++ V
Sbjct: 789 VQLYAHVNSSIERPLKQLVGFARVELEPGETKTV 822
>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 942
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 229/811 (28%), Positives = 371/811 (45%), Gaps = 146/811 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D P R ++L+++MTL EK Q+ L YG R+ LP EW W + G+
Sbjct: 53 YEDPSAPLEARIENLLQQMTLDEKTCQVVTL-YGYKRVLKDDLPTPEWKELLWKD---GI 108
Query: 67 SFIGRRTNS------PPGTH-------------------------------FDSE-VPG- 87
I N PP + F +E + G
Sbjct: 109 GAIDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGV 168
Query: 88 ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
AT+FPT + ++N L +++G EAR + G T ++P ++V RD R
Sbjct: 169 ESYRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 222
Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
WGR E GE PY+V I VRGLQ H +++A KH+AAY +
Sbjct: 223 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATGKHFAAYSNNKG 269
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
D +++ ++++ I PF+ + E + VM SYN +GIP L +
Sbjct: 270 AREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRL 329
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
RG+ F GY+VSD D+++ + H D KE AV + ++AGL++ C D +
Sbjct: 330 RGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 388
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQ 377
V++G ++E I+ +R + V +G FD Q G + + E +A +A+R+
Sbjct: 389 ELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKEENEAIALQASRE 448
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK-- 435
IVLLKN LPL+ + K +A+ GP+AN + +Y T+ ++G +K
Sbjct: 449 SIVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGK 508
Query: 436 -VINYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEA 479
+ Y GC D+V + + I A++ A+ AD ++V G
Sbjct: 509 AEVLYTKGC-DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCG 567
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E K R L LPG Q +L+ + K PV L++++ + IN+A + + +IL YPG
Sbjct: 568 ENKSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPG 624
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF-----PGRTYKF-- 592
+GG A+AD++FG YNPGG+L +T + +IP+ + P +P + PG T
Sbjct: 625 SKGGTALADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSR 682
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
+G +YPFGYGLSYT F+Y D++ T P +A
Sbjct: 683 ING-ALYPFGYGLSYTTFEYS-------------------DLDITPRVITPNESA----- 717
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKV 711
T +++V N GK G EVV +Y + T+ K + G++R+ + G++ ++
Sbjct: 718 --------TVRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQEL 769
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
FT++ K L+++D ++ G ++ G
Sbjct: 770 SFTIDR-KHLELLDADMKWVVEPGDFVLMAG 799
>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 871
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 160/429 (37%), Positives = 231/429 (53%), Gaps = 42/429 (9%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIG 70
+F + + L +R DLV RM++ EK+ Q+ D + + RLG+P Y WW+E+LHGV+ G
Sbjct: 23 NFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPEYNWWNESLHGVARAG 82
Query: 71 RRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN----LGN- 125
AT FP I +S++ L + +S EARA ++ G
Sbjct: 83 Y----------------ATVFPQSISIASSWDRQLIFDVANVISDEARAKHHEYLRRGQH 126
Query: 126 ---AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
GLTFWSPN+N+ RDPRWGR ET GEDP++ G+ + YV GLQ ++
Sbjct: 127 GMYQGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQ---------GTNE 177
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
+ LK+ A KHYA + R F++ ++ D+ ET++ F V EG V SVM +
Sbjct: 178 KYLKVIATAKHYAVHSGPE---PSRHLFNAETSDIDLYETYLPAFRTLVKEGHVYSVMGA 234
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
YNR G A P L N +R W F GYIVSDC ++ I + HK D A A LK
Sbjct: 235 YNRFRGESCSASPFLFN-ILRNVWGFDGYIVSDCGAVTDIWKYHKITGDAAT-ASALALK 292
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
GLDL+CG + + A+ + I+EADID +++ L+ +LG FD Y + +
Sbjct: 293 DGLDLECGSSFKSLKE-AIDRKLISEADIDIAVKRLFTARFKLGMFDPEEIVSYAQIPYS 351
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
N H LA A+++ IVLLKN N LPL+ +IKT+A++GP+AN +++ GNY G P
Sbjct: 352 VNNNSAHDWLARVASQKSIVLLKNQNNTLPLSR-DIKTVAVIGPNANDVQSLWGNYSGVP 410
Query: 421 CRYTSPMDG 429
+ + G
Sbjct: 411 SNPITVLKG 419
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 160/327 (48%), Gaps = 60/327 (18%)
Query: 433 YSKVINYAPGCADIVCQ------NNSMIPAAIDAAKNADATVIVAGL-------DLSVEA 479
Y + Y D + Q +++ A+ A ADA V+V GL ++ VEA
Sbjct: 562 YKITVKYQNFYGDAIAQLLWAEPQENVLQEAVQVAGQADAIVLVLGLNERLEGEEMKVEA 621
Query: 480 ---EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
EG DR L LP Q EL+ K A PV LV+++ A+ IN+A N + +IL G
Sbjct: 622 DGFEGGDRTSLDLPSNQEELM-KAMTATGKPVILVLINGSALSINWA--NDHVPAILTAG 678
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP 596
YPG++GG AIADV+FG YNP GRLP+T+Y++ +P + GRTY++F
Sbjct: 679 YPGQQGGNAIADVLFGDYNPAGRLPVTYYKST------EQLPAFENYDMKGRTYRYFQKK 732
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFG+GLSYT+FKY P +V + D
Sbjct: 733 PLYPFGFGLSYTKFKYSNLKLPTNVTPEKD------------------------------ 762
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTM 715
F ++V N+G+ DG EV+ +Y K + I Q+ G+ERV + G++ V FT+
Sbjct: 763 ---FEILVDVTNIGERDGDEVIELYLKDEKASTPRPILQLEGFERVNLKKGETKTVRFTI 819
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
+ L +++ ++ G TI VG
Sbjct: 820 TP-RQLSLINKKGQRVIEPGWFTISVG 845
>gi|255689951|ref|ZP_05413626.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624557|gb|EEX47428.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 735
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 223/774 (28%), Positives = 358/774 (46%), Gaps = 105/774 (13%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP 49
S K K S Y DAK+P +R DL+ RMTL EK+ Q+ G VP
Sbjct: 20 SAKDKKSIPLYKDAKVPIEKRVDDLLSRMTLEEKILQLNQYTMGRNNNVNNIGEEVKKVP 79
Query: 50 -RLGLPLYEWWSEALHG----VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNES 104
+G +Y + L + R P +D+ T +P + S+N
Sbjct: 80 AEIGSLIYYDTNPTLRNNVQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLGQACSWNPE 139
Query: 105 LWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINY 164
L +K + EAR + TF SP I+V RDPRWGRV E GEDPY G +A
Sbjct: 140 LVEKACAVTAQEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGEDPYTNGVFAAAS 194
Query: 165 VRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFI 224
VRG Y D S +I+AC KHY Y G D + + ++ Q + +T++
Sbjct: 195 VRG--------YQGDDMSAEDRIAACLKHYIGYGASE-AGRDYVY--TEISRQTLWDTYL 243
Query: 225 LPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE 284
LP+EM V G +++M S+N ++GIP A+ + + ++ W G+IVSD +I+ +
Sbjct: 244 LPYEMGVKAG-AATLMSSFNDISGIPGSANHYTMTEILKERWGHDGFIVSDWGAIEQL-- 300
Query: 285 SHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
++ L K++A AGL++D + Y + V++GKI A +D S+R + V
Sbjct: 301 KNQGLAANKKEAAVYAFNAGLEMDMMSHAYDRYMKELVEEGKITMAQVDESVRRVLRVKF 360
Query: 344 RLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG 403
RLG F+ K PQ +++AA+ A + +VLLKN+N LPL + K +A+VG
Sbjct: 361 RLGLFERPYTPVTSEKERFFRPQSMDIAAQLAAESMVLLKNENQILPLT--DKKKIAVVG 418
Query: 404 PHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV---------INYAPGCADIVCQNNSMI 454
P A ++G++ C + D Y+ + + YA GC N
Sbjct: 419 PMAKNGWDLLGSW----CGHGKDTDVVMLYNGLATEFVGKAELRYALGCR-TQGDNRKGF 473
Query: 455 PAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMS 514
A++AA+ +D V+ G ++ E R + LP Q EL ++ K P+ LV+++
Sbjct: 474 EEALEAARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAKELKKVGK-PIVLVLVN 532
Query: 515 AGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
+++N + P +IL + PG G +A ++ G+ NP G+L +T+ PY
Sbjct: 533 GRPLELN--RLEPISDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF--------PY 582
Query: 575 TS--MPLRPVNNFPGRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKSVDIKLDKD 628
++ +P+ GR ++ F + +YPFG+GLSYT+FKY V + S K+ +
Sbjct: 583 SNGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKYGVVTLSAS---KVKRG 639
Query: 629 QQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIA 688
+ K + ++ V N GK DG E V + P +
Sbjct: 640 E-----------------------------KLSAEVTVTNTGKRDGLETVHWFISDPYCS 670
Query: 689 GTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
T +K++ +E+ I AG++ F ++ + L VD L +G + I V
Sbjct: 671 ITRPVKELKYFEKQSIKAGETKIFRFDIDLERDLGFVDGNGKRFLEAGEYYIQV 724
>gi|160884764|ref|ZP_02065767.1| hypothetical protein BACOVA_02753 [Bacteroides ovatus ATCC 8483]
gi|156109799|gb|EDO11544.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 746
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 194/629 (30%), Positives = 320/629 (50%), Gaps = 73/629 (11%)
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
++P I+V RDPRWGRVLE GED ++ R A VRG Q ++ S+ L AC
Sbjct: 158 FAPMIDVSRDPRWGRVLEGAGEDTWLTSRVAEAKVRGYQ------WNLGSNESVL---AC 208
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
KH+AAY L G D D ++E+ ++E ++ PF+ V G V++ M ++N + G+P
Sbjct: 209 AKHFAAYGLPQ-AGKDYGTVD--ISERTLEEIYLPPFKAAVEAG-VATFMPAFNDIAGVP 264
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
A+ LL + +R W F G +VSD +I +V H + +K+ AV + AG+D+D
Sbjct: 265 CTANKWLLTEVLRNRWKFKGVVVSDWGAIWQLV-PHGMAHGSKQ-AVELSINAGVDMDMA 322
Query: 311 D-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQH 367
D Y + + +GK+ ID +R + + +LG FD ++ ++ + I N
Sbjct: 323 DGEYNRHALALINEGKVTVGQIDEMVRRILRMKFKLGLFDDPFRFCDVKREKRVIRNCDF 382
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY---EGTPCRYT 424
I A +AA++ IVLLKN+N LPL +IK++A+VGP A+ K + +Y +G Y
Sbjct: 383 IAEARKAAQKSIVLLKNENHLLPL-AKDIKSIAVVGPLAD-NKQYLRDYWAGKGEVNDYV 440
Query: 425 SPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
+ ++G ++ K INYA GC D+ + S A++AA ++ + G S+ E
Sbjct: 441 TLLEGLKNNLPSHIK-INYAKGC-DVTGTDCSFFSEAVEAANQSELVIAAIGERASMSGE 498
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
R D+ +PG Q EL+ + D K PV +V+M+ + I +K ++ +I+ + G
Sbjct: 499 DASRADISIPGVQEELVQALLDTGK-PVVVVLMNGRPLTI--SKLTEQVPAIVEGWFLGT 555
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIP----YTSMPLRPVNNFPGRTYKFFDGP 596
E G AIADV+ GKYNP G+L ++ + N +IP Y + T +F D P
Sbjct: 556 ETGNAIADVLLGKYNPSGKLTMS-FPRNVGQIPVFYNYRQSGRPGTDKLTKWTNRFIDSP 614
Query: 597 V--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
V +YPFGYGLSYT F Y S+P+ + ++ +
Sbjct: 615 VSPLYPFGYGLSYTTFSY---SAPRVSQKEFSTNEILK---------------------- 649
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGF 713
++V N G+ DG E + +Y + + T +K++ G++++F+ G++ VGF
Sbjct: 650 -------VSVDVTNTGQYDGEETIQLYIRDVIASVTRPVKELKGFKKIFLRKGETRTVGF 702
Query: 714 TMNACKSLKIVDNAANSLLASGAHTILVG 742
+ A + L + ++ SG ++ G
Sbjct: 703 ELRA-EDLSFLSQDMEPVIESGEFILMTG 730
>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 391
Score = 265 bits (677), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 154/384 (40%), Positives = 209/384 (54%), Gaps = 41/384 (10%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFD 82
+RA LV +M+ EKV Q + A +PRL +P YEWWSE LHG++ G
Sbjct: 34 QRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNGY----------- 82
Query: 83 SEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---------AGLTFWSP 133
AT FP I AS+N +L +++G VSTEARA +N AGLT WSP
Sbjct: 83 -----ATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSP 137
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
NIN+ RDPRWGR +ET GEDP++ G+ A+ ++RGLQ D + P I A KH
Sbjct: 138 NINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQG--------DDLNHPRTI-ATPKH 188
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
A + R FD V+ +DM+ T+ F + +G SVMC+YN ++G P CA
Sbjct: 189 IAVHSGPE---PGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACA 245
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
LLN +RGDW F G++VSDCD++ + + H F D + A LKAG DL+CG Y
Sbjct: 246 ADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAY 304
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELA 371
A+++G++ EA +D SL L+ RLG + + Y LG ++ N H LA
Sbjct: 305 RELGT-AIERGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALA 363
Query: 372 AEAARQGIVLLKNDNGALPLNTGN 395
+AA + IVLLKN LPL G
Sbjct: 364 LQAAAESIVLLKNTATTLPLKAGT 387
>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 990
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 229/812 (28%), Positives = 372/812 (45%), Gaps = 148/812 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D P R ++L+++MTL EK QM L YG R+ LP EW W + G+
Sbjct: 101 YEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKD---GI 156
Query: 67 SFIGRRTNS---------------PPGTH----------------------FDSE-VPG- 87
I N P H F +E + G
Sbjct: 157 GAIDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGV 216
Query: 88 ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
AT+FPT + ++N L +++G EAR + G T ++P ++V RD R
Sbjct: 217 ESYRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 270
Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
WGR E GE PY+V I VRGLQ H +++A KH+AAY +
Sbjct: 271 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATGKHFAAYSNNKG 317
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
D +++ ++++ I PF+ + E + VM SYN +GIP L +
Sbjct: 318 AREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRL 377
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
RG+ F GY+VSD D+++ + H D KE AV + ++AGL++ C D +
Sbjct: 378 RGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 436
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL--GKNNICNPQHIELAAEAAR 376
V++G ++E I+ +R + V +G FD +P +L + ++ +A +A+R
Sbjct: 437 ELVKEGGLSEEVINDRVRDILRVKFLIGLFD-APYQTDLADADREVEKEENEAIALQASR 495
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK- 435
+ IVLLKN LPL+ + K +A+ GP+AN + +Y T+ ++G +K
Sbjct: 496 ESIVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKG 555
Query: 436 --VINYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVE 478
+ Y GC D+V + + I A++ A+ AD ++V G
Sbjct: 556 KAEVLYTKGC-DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTC 614
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
E K R L LPG Q +L+ + K PV L++++ + IN+A + + +IL YP
Sbjct: 615 GENKSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYP 671
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF-----PGRTYKF- 592
G +GG A+AD++FG YNPGG+L +T + +IP+ + P +P + PG T
Sbjct: 672 GSKGGTALADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPTGNMS 729
Query: 593 -FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
+G +YPFGYGLSYT F+Y D++ T P +A
Sbjct: 730 RING-ALYPFGYGLSYTTFEYS-------------------DLDITPRVITPNESA---- 765
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAK 710
T +++V N GK G EVV +Y + T+ K + G++R+ + G++ +
Sbjct: 766 ---------TVRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQE 816
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ FT++ K L+++D ++ G ++ G
Sbjct: 817 LSFTIDR-KHLELLDADMKWVVEPGDFVLMAG 847
>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 942
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 229/812 (28%), Positives = 373/812 (45%), Gaps = 148/812 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D P R ++L+++MTL EK QM L YG R+ LP EW W + G+
Sbjct: 53 YEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKD---GI 108
Query: 67 SFIGRRTNS------PPGTH-------------------------------FDSE-VPG- 87
I N PP + F +E + G
Sbjct: 109 GAIDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGV 168
Query: 88 ----ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPR 142
AT+FPT + ++N L +++G EAR + G T ++P ++V RD R
Sbjct: 169 ESYRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQR 222
Query: 143 WGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
WGR E GE PY+V I VRGLQ H +++A KH+AAY +
Sbjct: 223 WGRYEEVYGESPYLVAELGIEMVRGLQ-------HNH------QVAATGKHFAAYSNNKG 269
Query: 203 EGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTI 262
D +++ ++++ I PF+ + E + VM SYN +GIP L +
Sbjct: 270 AREGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRL 329
Query: 263 RGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTM 318
RG+ F GY+VSD D+++ + H D KE AV + ++AGL++ C D +
Sbjct: 330 RGEMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLR 388
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL--GKNNICNPQHIELAAEAAR 376
V++G ++E I+ +R + V +G FD +P +L + ++ +A +A+R
Sbjct: 389 ELVKEGGLSEEVINDRVRDILRVKFLIGLFD-APYQTDLADADREVEKEENEAIALQASR 447
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK- 435
+ IVLLKN LPL+ + K +A+ GP+AN + +Y T+ ++G +K
Sbjct: 448 ESIVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKG 507
Query: 436 --VINYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVE 478
+ Y GC D+V + + I A++ A+ AD ++V G
Sbjct: 508 KAEVLYTKGC-DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTC 566
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
E K R L LPG Q +L+ + K PV L++++ + IN+A + + +IL YP
Sbjct: 567 GENKSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYP 623
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF-----PGRTYKF- 592
G +GG A+AD++FG YNPGG+L +T + +IP+ + P +P + PG T
Sbjct: 624 GSKGGTALADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPTGNMS 681
Query: 593 -FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
+G +YPFGYGLSYT F+Y D++ T P +A
Sbjct: 682 RING-ALYPFGYGLSYTTFEYS-------------------DLDITPRVITPNESA---- 717
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAK 710
T +++V N GK G EVV +Y + T+ K + G++R+ + G++ +
Sbjct: 718 ---------TVRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQE 768
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ FT++ K L+++D ++ G ++ G
Sbjct: 769 LSFTIDR-KHLELLDADMKWVVEPGDFVLMAG 799
>gi|441498970|ref|ZP_20981160.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
gi|441437215|gb|ELR70569.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
Length = 752
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 219/736 (29%), Positives = 348/736 (47%), Gaps = 104/736 (14%)
Query: 28 LVERMTLPEKVQQM----GDLAYGVPRLGLPLYEWWSEAL-----------HGVSFIGR- 71
L+ +MTL EKV Q+ GDL P + + + + + HG ++ GR
Sbjct: 36 LIRQMTLEEKVGQLNFYVGDLFNTGPTVRTTESDKFDQLIREGKLTGLFNVHGAAYTGRL 95
Query: 72 --------RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
R P D T FP + + AS++ +K + + E+ A
Sbjct: 96 QKIAVEESRLGIPLLFGADVIHGFKTVFPIPLASAASWDLEAIEKAERVAAIESTA---- 151
Query: 124 GNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
AG+ F ++P +++ RDPRWGR+ E GEDP++ A VRG Q+ S +
Sbjct: 152 --AGINFNFAPMVDISRDPRWGRIAEGAGEDPFLGSEVAKARVRGFQE--------QSLT 201
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
P ++AC KH+AAY + G D D ++E+ ++E ++ P++ ++ G +++M S
Sbjct: 202 DPQTMAACVKHFAAYGAPDG-GRDYNTVD--MSERLLREMYLPPYKAGIDAG-AATIMTS 257
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
+N +NGI LL +R +W F G +VSD S+ +V +H +A LK
Sbjct: 258 FNELNGIAASGSQFLLRDILRKEWGFKGMVVSDWQSVNEMV-AHG-NAANNAEAAMMALK 315
Query: 303 AGLDLD-CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL--GK 359
AG+D+D GD Y V +GK+ +D ++R + + LG FD +Y + K
Sbjct: 316 AGVDMDMMGDVYLEEVPRLVNEGKLDIKFVDEAVRNVLKLKYDLGLFDDPYRYSDTIREK 375
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHA------NATKAMI 413
NNI +H+E A + A++ IVLLKN LPL +I T+A++GP A N T +
Sbjct: 376 NNIRAVEHLEAARDVAKKSIVLLKNKEKLLPLKK-SIGTIAVIGPLADNQADMNGTWSFF 434
Query: 414 GNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL 473
G + D S+V+ YA GC ++ ++ A++ AK AD ++ G
Sbjct: 435 GEAQHPITFLQGIKDAVSGQSRVL-YAEGC-NLYDRSKDKFAEAVNIAKKADVVILAVGE 492
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
+ E R D+ LPG Q EL+ ++A K PV ++MS +D+++ N I +IL
Sbjct: 493 SAVMNGEAGSRSDIRLPGIQPELVMEIAKTGK-PVVALVMSGRPLDLSWLDEN--IPAIL 549
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPIT----------WYEANYVKIPYTSMPLRPVN 583
V G E G A ADV+FG YNP G+LP+T +Y PY P++
Sbjct: 550 EVWTLGSEAGNAAADVLFGDYNPSGKLPVTFPRNVGQVPIYYNHKNTGRPYEGDYSEPLS 609
Query: 584 NFPGRT-YKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
R+ Y+ +YPFGYGLSY+ F+Y DI L D T+ +
Sbjct: 610 ERIYRSKYRDVQNSPLYPFGYGLSYSTFEYS--------DITLSAD--------TLNAGE 653
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERV 701
A+V + N G DG EVV +Y + G +K++ G++++
Sbjct: 654 SITASV----------------SITNEGPYDGEEVVQLYIRDLVGSVTRPVKELKGFKKL 697
Query: 702 FIAAGQSAKVGFTMNA 717
I G++ KV FT+++
Sbjct: 698 MIKNGETVKVDFTLSS 713
>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
Length = 854
Score = 265 bits (676), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 162/425 (38%), Positives = 230/425 (54%), Gaps = 53/425 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y D P ER DL+ ++T+ EK+ + + G+PRL + Y +EALHGV
Sbjct: 28 YLDMNAPQHERILDLLSKLTIEEKISLLRATSPGIPRLQIDKYYHGNEALHGVV------ 81
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
PG T FP I A +N L +I +S EARA +N G
Sbjct: 82 --RPGNF--------TVFPQAIGLAAMWNPQLLNEISTAISDEARARWNELEQGKKQLGQ 131
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ G+ +++V+GLQ D R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQG---------DDPR 182
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LKI + KH+AA N E ++RF + ++E+D++E ++ FE C+ EG +S+M +Y
Sbjct: 183 YLKIVSTPKHFAA----NNEEHNRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAY 238
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +N +P + LL + +R DW F GY+VSDC + +V HK++ T E A ++A
Sbjct: 239 NAINDVPCTLNNWLLKKVLRHDWGFDGYVVSDCGAPDFLVTHHKYVK-TLEAAATLSIQA 297
Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
GLDL+CGD Y + A +Q + EA+ID++ + MRLG FD NL N
Sbjct: 298 GLDLECGDNVYMEPLLNAYKQYMVTEAEIDSAAYHILRARMRLGLFDDP----NLNPYNK 353
Query: 363 CNP------QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
+P +H +LA EAARQ IVLLKN+ LPL+ IK++A+VG NA G+Y
Sbjct: 354 ISPSVVGCEKHSQLALEAARQSIVLLKNEKKFLPLDLKKIKSIAVVG--INAGNCEFGDY 411
Query: 417 EGTPC 421
GTP
Sbjct: 412 SGTPV 416
Score = 129 bits (323), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 97/300 (32%), Positives = 145/300 (48%), Gaps = 51/300 (17%)
Query: 456 AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSA 515
AA DA + D T+ V G++ S+E EG+DR + LP Q I + P T+V++ A
Sbjct: 594 AAGDAMRKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKI--NPNTVVVLVA 651
Query: 516 GA-VDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPY 574
G+ + IN+ + I +I+ YPGE GG A+A+V+FG YNPGG+LP+T+Y + +
Sbjct: 652 GSSLAINWMDEH--IPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRSLDELPAF 709
Query: 575 TSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDI 634
+R GRTY+FF+G +Y FG+GLSYT F YK KL+ D
Sbjct: 710 DDYDIR-----KGRTYQFFEGNPLYAFGHGLSYTTFSYK----------KLNIDSTG--- 751
Query: 635 NYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG---IAGTH 691
D VK ++N GK DG EV +Y K G +
Sbjct: 752 ----------------DAVKV-------SFALKNTGKYDGDEVAQLYVKYQGNDSLVKLP 788
Query: 692 IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSF 750
+KQ+ G+ERV + G+S +V T+ + L+ D +G + +VG +
Sbjct: 789 LKQLKGFERVHLKKGESKRVTLTVPKSE-LRFWDEEKGEFYTPAGDYLFMVGTASDAIQL 847
>gi|329963878|ref|ZP_08301220.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328527131|gb|EGF54137.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 766
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 235/802 (29%), Positives = 378/802 (47%), Gaps = 149/802 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---------------------- 51
Y D + P ER +DL+ RMTL EKV QM G+ +
Sbjct: 25 YKDPEAPVKERVEDLLGRMTLEEKVGQMNQFV-GLEHIKANSAVMTEEELKNNTANAFYP 83
Query: 52 GLPLYE--WWSEA------LHGVSF----------IGRRTNSP-----PGTHFDSEVPGA 88
G+ E W+E LH ++ + R P H ++ PG
Sbjct: 84 GITDKEVAAWTEQGLIGSFLHVLTIEEANYLQSLAMKSRLQIPIIFGIDAIHGNANAPGN 143
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
T +PT I SF+ + +I + + E RAM N TF +PN+ V RD RWGRV E
Sbjct: 144 TVYPTNINLACSFDTLMAYRIARETAKEMRAM----NMHWTF-NPNVEVARDARWGRVGE 198
Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHY--AAYDLDNWEGND 206
T GEDPY+V R + V+G Y DS+ + AC KH+ + ++ G+
Sbjct: 199 TFGEDPYLVTRMGVQSVKG--------YQGSLDSKE-DVLACIKHFVGGSEPINGTNGS- 248
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
+ ++E+ ++E F PFE V G + S+M ++N +NG+P ++ L+ +RG+W
Sbjct: 249 ----PADLSERTLREVFFPPFEAGVKAGAM-SLMTAHNELNGVPCHSNEWLMADVLRGEW 303
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGK 325
NF G++VSD I+ + H + KE A + + +G+D+ G ++ + V++G+
Sbjct: 304 NFPGFVVSDWMDIEHTHDLHATAENLKE-AFYQSIMSGMDMHMHGIHWNEMVVELVKEGR 362
Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
I E+ ID S+R + + RLG F+ + K +C +H A EAAR GIVLLKN
Sbjct: 363 IPESRIDESVRRILDIKFRLGLFEQPYADVEETMKIRLCG-EHRATALEAARNGIVLLKN 421
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR--YTSPMDGFYAYSKVINYAPG 442
+ G LPL+ K + + G +A+ + ++G++ T+ ++G + +
Sbjct: 422 E-GVLPLDPSKYKKIMVTGINAD-DQNILGDWSAPEKEENVTTILEGLRMIAPDTQF--- 476
Query: 443 CADIVCQN---NSMIPAAIDA----AKNADATVIVAGLDL-------SVEAEGKDRVDLL 488
D V Q +M P +D AKNAD ++VAG + + E DR DL
Sbjct: 477 --DFVDQGWDPRNMDPKKVDEAAAHAKNADLNIVVAGEYMMRFRWNDRTDGEDTDRSDLD 534
Query: 489 LPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIAD 548
L G Q ELI KVA + K P LV+++ + + +A N + +I+ PG +GG+A+A+
Sbjct: 535 LVGLQEELIEKVAASGK-PTVLVLVNGRPLSVRWAAEN--LPAIVEAWAPGMQGGQAVAE 591
Query: 549 VIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV-------VYPF 601
+++GK NP +L IT IP++ L+ + N + ++F V +YPF
Sbjct: 592 ILYGKVNPSAKLAIT--------IPHSVGQLQMIYNH--KPSQYFHPYVAGKPSTPLYPF 641
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
GYGLSYT +KY+ D+ LD+ + +D +VG +
Sbjct: 642 GYGLSYTTYKYE--------DLNLDRKEIEKD--GSVGVS-------------------- 671
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKS 720
++V N G DG E+V +Y + T +K++ + RV + AG+S V F + K
Sbjct: 672 --VKVTNTGSRDGVEIVQLYIRDKFSCVTRPVKELKDFARVPLKAGESRVVNFKITPDK- 728
Query: 721 LKIVDNAANSLLASGAHTILVG 742
L D ++ G ++VG
Sbjct: 729 LAFYDIKMKKVVEPGEFIVMVG 750
>gi|157363220|ref|YP_001469987.1| glycoside hydrolase family protein [Thermotoga lettingae TMO]
gi|157313824|gb|ABV32923.1| glycoside hydrolase family 3 domain protein [Thermotoga lettingae
TMO]
Length = 779
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 233/812 (28%), Positives = 374/812 (46%), Gaps = 156/812 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDL-AYGVPRLGLPLYEWWSEAL--HGVSFIG 70
Y +A LP R KDL+ RMTL EKV Q+G + +Y + +EAL +G+ I
Sbjct: 7 YKNASLPVDIRVKDLLSRMTLDEKVAQLGSVWSYELLDDQGNFSNEKAEALLKNGIGQIT 66
Query: 71 R---------------------------RTNSPPGTHFDSEVP----GATSFPTVILTTA 99
R R P H + GAT+FP I +
Sbjct: 67 RPGGATNLSAKEVARLINQIQKYLIEQTRLGIPAIMHEECLTGYMGLGATNFPQAIAMAS 126
Query: 100 SFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGR 159
+++ L +K+ T+ + R M + GL +P ++VVRDPRWGR E+ GE Y+V +
Sbjct: 127 TWDPELIEKMTSTIREDMRQMGI--HQGL---APVLDVVRDPRWGRTEESFGESAYLVAK 181
Query: 160 YAINYVRGLQDVEGVEYHRDSDSRPLK--ISACCKHYAAYDLDNWEGNDRFHFDSRVTEQ 217
++Y+ GLQ + +K + A KH+ Y EG + + + E+
Sbjct: 182 MGVSYIIGLQ------------GKDIKNGVIATAKHFVGYGAS--EGGKNWA-PTNIPER 226
Query: 218 DMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCD 277
+++E F+ PFE V E V SVM SY+ ++GIP + +L +R +W F G +VSD
Sbjct: 227 ELREIFMFPFEAAVKEASVMSVMNSYSEIDGIPCASSKELFTGVLRKNWGFSGIVVSDYF 286
Query: 278 SIQTIVESHKFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSL 335
+I + E H+ D KE A L+AG+D++ D YT V+QG I+E+ ++ +
Sbjct: 287 AIDMLREYHRLAKDKKE-AAKYALQAGIDVELPKADCYTTIRE-LVEQGLISESTVNQAT 344
Query: 336 RFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGN 395
+ + LG FD Y ++ K + +H +A E AR+ IVLLKND G LPL
Sbjct: 345 SRVLQIKFMLGLFD--KPYVDVEKIEL--KKHYSIATEIARKSIVLLKND-GILPLKKD- 398
Query: 396 IKTLALVGPHANATKAMIGNY-----------EGTPCRYTSPMDGFYAYSKVIN------ 438
+ALVGP+A+ + ++G+Y + +P KVIN
Sbjct: 399 -AKIALVGPNASEVRNLLGDYAYLAHIKVLLDSVNQTTFNAPKFNLKNVEKVINESIEKI 457
Query: 439 ---------------YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----LDLSVE 478
+A GC DI+ + A+ A KNAD V+V G +
Sbjct: 458 PSILDSMKAEGVIFTHAIGC-DILNSSTEGFSEALHAVKNADIAVVVVGDRSGLTEDCTS 516
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN-NPKIKSILWVGY 537
E +D +L LPG Q EL+ ++A K P+ LV+++ + KN ++ +I+ +
Sbjct: 517 GESRDSANLKLPGVQEELVLEIAKCGK-PIVLVLVTGRPYSL---KNIVSRVNAIIEMWL 572
Query: 538 PGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTY---KFF 593
PGE GG A+ DV+FGK NPGG+LPI++ A + + + P GR++ +
Sbjct: 573 PGEVGGMALVDVLFGKVNPGGKLPISFPRSAGQIPVYHDVKP------SGGRSHWHKDYV 626
Query: 594 DGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
D V ++ FG+GLSYT+F++ N + K P
Sbjct: 627 DELVEPLFSFGHGLSYTKFEFS---------------------NLVIEPQKIPS------ 659
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAK 710
D + T +++V+N G+++G EVV +Y + T IK++ G++R+ + G+S
Sbjct: 660 -----DGQVTIKVDVKNSGEVEGDEVVQLYLTREHASVTRPIKELKGFKRITLKPGESRT 714
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
F ++ L D ++ G ++G
Sbjct: 715 TVFKIH-TDVLAYYDRGMELVVEPGVFKAMIG 745
>gi|319901343|ref|YP_004161071.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
gi|319416374|gb|ADV43485.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
P 36-108]
Length = 781
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 243/823 (29%), Positives = 373/823 (45%), Gaps = 175/823 (21%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQM------------GDLAYGVPRL------GLPL 55
Y A P R KDL+ RMT+ EKV Q+ G V L P+
Sbjct: 26 YKQAGAPIEYRVKDLIGRMTVEEKVAQLCCPLGWEMYTKTGKNTVEVSALYKEKMKDAPV 85
Query: 56 YEWWS-------------------------EALHGVSFIGRRTNSPPGTHFDSEVP---- 86
+W+ AL + R P F E P
Sbjct: 86 GSFWAVLRADPWTQKTLETGLNPELAAKALNALQKYAVEETRLGIP--VLFAEECPHGHM 143
Query: 87 --GATSFPTVILTTASFNESLWKKIGQTVSTEARAM-YNLGNAGLTFWSPNINVVRDPRW 143
GAT FPT + ++++ESL +++G+ ++ EAR N+G + P ++V R+PRW
Sbjct: 144 AIGATVFPTALSAASTWDESLMQQMGEAIALEARLQGANIG------YGPVLDVAREPRW 197
Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQ-DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNW 202
R+ ET GEDP + + ++G+Q DV+ H + + KH+AAY +
Sbjct: 198 SRMEETFGEDPVLTSVMGVALMKGMQGDVQNDGKH---------LYSTLKHFAAYGVP-- 246
Query: 203 EGNDRFHFDSRVTE--QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQ 260
+ H SR + + ++ PF+ V G ++M SYN ++G+P ++ LL +
Sbjct: 247 ---ESGHNGSRANSGMRQLFSEYLPPFKKAVEAG-AGTIMTSYNSIDGVPCTSNKFLLTE 302
Query: 261 TIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMG 319
+R W F G++ SD SI+ IV + D KE A A+ L+AGLD+D GD +
Sbjct: 303 VLRNQWGFKGFVYSDLISIEGIV-GMRAAKDNKE-AAAKALRAGLDMDLGGDAFGRNLKQ 360
Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGI 379
A ++G I D+D ++ + + ++G F+ +I + +H ELA AR+G+
Sbjct: 361 AYEEGLITMDDLDRAVSNVLRLKFQMGLFENPYVSPEQAGKHIRSREHKELARRVAREGV 420
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR--YTSPMDGFYA---YS 434
VLLKND G LPL+ ++K +A++GP+A+ +G+Y R + +DG A +
Sbjct: 421 VLLKND-GVLPLDK-HLKRIAVIGPNADMMYNQLGDYTAPQDRKEIVTVLDGVRAAVSKT 478
Query: 435 KVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG--------------------LD 474
+ Y GCA + S IPAA+ AA+ ADA ++V G D
Sbjct: 479 TQVVYVKGCA-VRDTTESDIPAAVAAAQRADAVILVVGGSSARDFKTKYISTGAATVSED 537
Query: 475 LSVE-----AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
+ V EG DR L L G Q +LIN VA K P+ ++ ++ A+++N A + K
Sbjct: 538 IKVLPDMDCGEGFDRSSLRLLGDQEKLINAVAATGK-PLVVIYIAGRAMNMNLAAD--KA 594
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG-- 587
+++L YPGE+GG IAD++FG YNP GRLP++ IP + L PV G
Sbjct: 595 RALLAAWYPGEQGGAGIADILFGDYNPAGRLPVS--------IPRSEGQL-PVFYSQGTQ 645
Query: 588 RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
R Y G +Y FGYGLSYT+F Y K D++ + C
Sbjct: 646 RDYVEEKGTPLYAFGYGLSYTKFVYSALEMRKGTDVETLQTVSC---------------- 689
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--------SKPPGIAGTHIKQVIGYE 699
V N G DG EVV +Y S+PP + + +
Sbjct: 690 -----------------TVTNTGDRDGEEVVQLYICDEVASVSQPPIL-------LKAFR 725
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
R+F+ G+S KV F + L I D+ N ++ G ++VG
Sbjct: 726 RIFLKKGESRKVTFLLKK-DDLAIYDDEMNYVVEPGDFKVMVG 767
>gi|255693561|ref|ZP_05417236.1| periplasmic beta-glucosidase(Cellobiase) [Bacteroides finegoldii
DSM 17565]
gi|260620626|gb|EEX43497.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 800
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 228/800 (28%), Positives = 359/800 (44%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTDGWSTEIWKDGIGNI 114
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTVQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T ++P +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYAPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + GLQ EG I A KH+A Y + +
Sbjct: 229 GEDPYLVGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H +++ AA + IV
Sbjct: 389 SEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPEVVVHNAAHQDVSMRAALESIV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+ LPL+ + +A++GP+A K + Y + G Y + +
Sbjct: 449 LLKNEKEMLPLSK-SFSKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G K V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKG---KVRVAGVLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ +KP A T
Sbjct: 679 GLSYTTFGYS--------DLKI---------------SKPVIGA---------QENITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ + FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTISFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785
>gi|260062042|ref|YP_003195122.1| beta-glucosidase [Robiginitalea biformata HTCC2501]
gi|88783604|gb|EAR14775.1| beta-glucosidase [Robiginitalea biformata HTCC2501]
Length = 763
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 193/614 (31%), Positives = 313/614 (50%), Gaps = 79/614 (12%)
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
++P +++ RDPRWGRV+E GEDPY+ R + VRG Q D S PL I+AC
Sbjct: 163 FAPMVDISRDPRWGRVMEGAGEDPYLGSRVGVARVRGFQG--------DDLSDPLTIAAC 214
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
KH+A Y EG R + + + + PF+ V+ G ++VM S+N +NGIP
Sbjct: 215 LKHFAGYGFA--EGG-RDYNTADFGLSTLYNVVLPPFQAGVDAG-AATVMNSFNVLNGIP 270
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV-ARVLKAGLDLDC 309
AD L ++ W+F G++VSD SI ++ H + D E A+ A V + +D++
Sbjct: 271 ATADAFLQRDILKAAWDFQGFVVSDWGSIGEMI-PHGYARDRNEAALRAAVAGSDMDMES 329
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQH 367
G Y T V+ GK+ E+ +D ++ + + LG F +Y + + + NP
Sbjct: 330 GMYLTELPE-LVRDGKVPESLVDEAVLRILGLKYDLGLFADPYRYADAEREKRILSNPAR 388
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT--PCRYTS 425
+E + AR+ IVLLKN+ G LPL+ N ++AL+GP A+ + +G++ T P S
Sbjct: 389 LETVRDMARKSIVLLKNEGGVLPLSK-NGGSIALIGPLASDKDSPLGSWRLTAEPNSAVS 447
Query: 426 PMDGFYAYS-KVINYAPGC------------ADIVCQNNSMIPAAIDAAKNADATVIVAG 472
++G AYS + Y G I + S IPAA++ A++++ V+V G
Sbjct: 448 VLEGMQAYSGNTLAYERGVPLAEGETAFVFETKINTTDRSGIPAAVELARSSETVVMVLG 507
Query: 473 LDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG+ R L LPG Q EL+ V A + LV+M+ + IN+A + + +I
Sbjct: 508 EHGFQSGEGRSRAALGLPGLQQELLEAV-HAVNPNIVLVLMNGRPLTINWAAEH--VPAI 564
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPL-RPVNNFPGRTY 590
L + G E G AIA+V++G YNP G+LP+T+ ++ + + Y+ + RP +PG
Sbjct: 565 LEAWHLGTESGHAIAEVLYGDYNPSGKLPMTFPKSVGQIPVYYSHLATGRP--EYPGNDL 622
Query: 591 KFFDGPV------VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPP 644
F+ + +YPFG+GLSY+ F+Y D+KL Q +I P
Sbjct: 623 VFWSHYIDQVNEPLYPFGHGLSYSDFRY--------ADLKL----QTTEIR--------P 662
Query: 645 CAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFI 703
++ + + +EN G+E+V +Y + G ++++ G+E+VF+
Sbjct: 663 GGSLEV------------SVRLENASDTPGTEIVQLYVRDHFGSRARPVRELKGFEKVFL 710
Query: 704 AAGQSAKVGFTMNA 717
AG SA+V FT++A
Sbjct: 711 EAGGSAEVSFTLSA 724
>gi|423287910|ref|ZP_17266761.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
CL02T12C04]
gi|392671925|gb|EIY65396.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
CL02T12C04]
Length = 782
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 221/723 (30%), Positives = 343/723 (47%), Gaps = 114/723 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG AT FPT I A+++ L K++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSLELVKEV 170
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ ++ E R+ G + P +++ RDPRW RV ET GEDP + G + V GL
Sbjct: 171 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 225
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ S+ A KH+ AY + EG ++ S V +D+ + F+ PF
Sbjct: 226 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 274
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++GIP ++ LL Q +R +W F G++VSD SI+ I ESH F+
Sbjct: 275 AIDSGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFCGFVVSDLYSIEGIHESH-FV 332
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
TKE+A + + AG+D+D G D YTN AVQ G++ +A IDT++ + + +G F
Sbjct: 333 ALTKENAAIQSVTAGVDVDLGGDAYTNLCH-AVQSGQMDKAVIDTAVCRVLRMKFEMGLF 391
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + + +HIELA + A+ I LLKN+N LPL+ I +A++GP+A+
Sbjct: 392 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-TINKVAVIGPNADN 450
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAIDAAKNA 464
M+G+Y + +DG + Y GCA I + I AI AA+ +
Sbjct: 451 RYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCA-IRDTTVNEIEQAIKAARRS 509
Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
+ A V G +E EG DR L L G Q EL+ +
Sbjct: 510 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 569
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P+ +V + ++ N+A ++L YPG+EGG AIADV+FG YNP GRLP
Sbjct: 570 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 626
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
I+ + +IP P N+ Y +Y FGYG+SYT F+Y
Sbjct: 627 IS-VPRSVGQIPVYYNKKAPRNH----DYVEMSSFPLYSFGYGMSYTTFEYS-------- 673
Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
D++ + K +C ++++ +V+N GK DG EV +
Sbjct: 674 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 705
Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
Y + + +KQ+ +ER + G+ KV F + + +V+ ++ SG +
Sbjct: 706 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHL 764
Query: 740 LVG 742
++G
Sbjct: 765 MIG 767
>gi|313204470|ref|YP_004043127.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443786|gb|ADQ80142.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 746
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 214/677 (31%), Positives = 331/677 (48%), Gaps = 81/677 (11%)
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
T+FP + TAS++ +L +K + +TEA A TF +P +++ RDPRWGRV+E
Sbjct: 113 TTFPIPLGETASWDLALIEKSARIAATEASAY----GVQWTF-APMVDIARDPRWGRVME 167
Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRF 208
GED Y+ A V G Q + I AC KH+AAY G D
Sbjct: 168 GAGEDTYLGSLVAKARVHGFQG--------NGLGNVDAIMACAKHFAAYGA-AIGGRDYN 218
Query: 209 HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF 268
D ++ + + ET++ PF+ V E +V++ M S+N +NGIP A+ + ++G WNF
Sbjct: 219 SVD--MSLRQLNETYLPPFKAAV-EANVATFMNSFNDINGIPATANKYIQRDILKGQWNF 275
Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIA 327
G++VSD SI ++ +H + D+ DA + + AG D+D Y N VQ GK+
Sbjct: 276 KGFVVSDWGSIGEMI-AHGYAKDSY-DAAMKAINAGSDMDMESRCYRNNLKQLVQDGKVD 333
Query: 328 EADIDTSLRFLYIVLMRLGYFDGSPQYKNLG--KNNICNPQHIELAAEAARQGIVLLKND 385
+ ID +++ + + LG FD ++ N K NP++ A E ++ IVLLKN+
Sbjct: 334 ISVIDEAVKRILVKKFELGLFDDPYRFCNAAREKKQTNNPENRAFAREIGKKSIVLLKNE 393
Query: 386 ---NGA--LPLNTGNIKTLALVGPHANATKAMIG----NYEGTPCRYTSPMDGF---YAY 433
NG LPL + KT+AL+GP ATKA G + R S G
Sbjct: 394 PLSNGKTLLPL-SKQTKTVALIGPLFKATKANHGFWSIAFPDDSTRIISQYQGIKNQLDK 452
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQ 493
S I YA GC +I + + AI+AAK+AD ++ G + E K + +L LPG Q
Sbjct: 453 SSSIVYAKGC-NINDNDKTGFAEAINAAKSADVVIMSLGEAADMSGEAKSKSNLQLPGVQ 511
Query: 494 TELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGK 553
EL+ ++ K PV L++ + + N+A +N I SIL+ + G E G AIADV+FG
Sbjct: 512 EELLKEIYKTGK-PVVLLLNAGRPLIFNWASDN--IPSILYTWWLGTEAGNAIADVLFGD 568
Query: 554 YNPGGRLPITW-YEANYVKIPY----TSMPLRPVN--NFPGRTYKFFDGPVVYPFGYGLS 606
YNP G+LPI++ + I Y T P + N N+ + P YPFGYGLS
Sbjct: 569 YNPAGKLPISFPRTEGQIPIYYNHFNTGRPAKDENDKNYVSAYIDLQNSP-KYPFGYGLS 627
Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEV 666
YT+F ++KL D+ K T +++
Sbjct: 628 YTKFDIS--------NLKLSSDKL------------------------SSGNKLTVTVDI 655
Query: 667 ENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
N G DG EVV +Y + G +K++ G++++ + G++ ++ FT+ + LK +
Sbjct: 656 ANTGNYDGEEVVQLYVRDLVGSVVRPVKELKGFQKLMLKKGETKQLTFTLTP-EDLKFFN 714
Query: 726 NAANSLLASGAHTILVG 742
N + +G + + VG
Sbjct: 715 NEIQYINEAGDYELFVG 731
>gi|299140913|ref|ZP_07034051.1| periplasmic beta-glucosidase [Prevotella oris C735]
gi|298577879|gb|EFI49747.1| periplasmic beta-glucosidase [Prevotella oris C735]
Length = 767
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 205/699 (29%), Positives = 324/699 (46%), Gaps = 98/699 (14%)
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
ATSFP +++ +L ++I + EA A+ G T ++P ++V RDPRWGRV
Sbjct: 119 ATSFPAQCGQGVTWDRALIRQIANVTAQEASAL------GYTNVYAPILDVSRDPRWGRV 172
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
+E E PY+ G V GLQ+ +I + KH+A Y L ++
Sbjct: 173 VECYSESPYLAGELGKQMVLGLQEN--------------RIVSTPKHFAVYSLPVGGRDE 218
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
D V ++M+ + PF + EG VM SYN +G P P L + +R W
Sbjct: 219 GTRTDPHVAPKEMKTLLLEPFRKAIQEGGALGVMSSYNDYDGEPITGSPYFLTELLRHQW 278
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM-------- 318
FHGY+VSD ++++ + H + +E+ A + AGLD+ TNF+M
Sbjct: 279 GFHGYVVSDSEAVEFLSSKHHVAAN-REEGAAMAINAGLDVR-----TNFSMPETFILPL 332
Query: 319 -GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAA 375
A+ G ++ +D ++ + V LG FD +P N+ + + + + H +L+ AA
Sbjct: 333 RQALTDGLVSMQILDARVKDVLYVKFWLGLFD-NPYRGNVNEVDQVVHSKAHQQLSLRAA 391
Query: 376 RQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY-- 433
+ IVLLKN+N LPL+ ++K +A++GP+A+AT A + Y S + G
Sbjct: 392 LESIVLLKNENNLLPLSK-SLKRIAVIGPNADATTAHVCRYGPANAPIKSVLSGIRESMP 450
Query: 434 SKVINYAPGCA--------------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
+ YA GC+ + MI A+ A+ +D V+V G
Sbjct: 451 GAEVRYAKGCSIVDKHFPESELYEVALDTTEQRMIDEAVGVARQSDVAVVVLGGSEETVR 510
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E R DL L G Q +L+ V K PV LV++ A IN+A N + +I+ +PG
Sbjct: 511 EEYSRTDLNLMGRQEQLLRAVYATGK-PVVLVLLDGRAATINWA--NQYVPAIVHGWFPG 567
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
E G A+A V+FG YNPGG+L +T + + +IPY + P +P + G DG +Y
Sbjct: 568 EFTGTAVAKVLFGDYNPGGKLAVT-FPKSVGQIPY-AFPFKPGADSKGPVR--VDG-ALY 622
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFGYGLSYT F Y D + +KP +V CK
Sbjct: 623 PFGYGLSYTTFAYS--------DFHI---------------SKPVIGIQGETEVSCK--- 656
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
V N G+ +G E+V +Y + T+ K + G+ER+ + AG+ V F +
Sbjct: 657 ------VRNTGQREGDEIVQLYIRDDISSVTTYQKSLRGFERIHLKAGEETTVRFMLTP- 709
Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
+ L + + ++ G TI++G + +L +N
Sbjct: 710 RDLSLWNKHEEFVVEPGTFTIMIGRSSEDICLHGKLTVN 748
>gi|448415866|ref|ZP_21578437.1| beta-glucosidase [Halosarcina pallida JCM 14848]
gi|445680029|gb|ELZ32480.1| beta-glucosidase [Halosarcina pallida JCM 14848]
Length = 765
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 218/792 (27%), Positives = 357/792 (45%), Gaps = 142/792 (17%)
Query: 24 RAKDLVERMTLPEKVQQMGD-----LAYGVPRLGL-PLYEWWSEALHGVSFIGRRTNSPP 77
R ++L++RM L EK Q+G L G L + E S+ + ++ IG + PP
Sbjct: 6 RVEELLDRMALTEKAAQLGSVNADKLLDGDGNLDENAVEEHLSDGIGHLTRIGGEGSLPP 65
Query: 78 G----------THFDSEV------------------PGATSFPTVILTTASFNESLWKKI 109
T+ E P T+FP I ++++ SL ++I
Sbjct: 66 TEAARVTNELQTYLREETRLGIPAIPHEECLSGYMGPEGTTFPQSIGLASTWDPSLVEEI 125
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
T+ T+ A +G A SP ++V RD RWGRV ET GEDPY+V A YV GLQ
Sbjct: 126 TGTIRTQLEA---IGTA--HALSPVLDVARDLRWGRVEETFGEDPYLVASMACGYVDGLQ 180
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
D D ISA KH+A + + G +R + + ++++ET + PFE
Sbjct: 181 G--------DGDG----ISATLKHFAGHSVGEG-GKNRSSVN--LGRRELRETHLFPFEA 225
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V D SVM +Y+ ++GIP +D LL +RG+W F G +VSD S++ + H
Sbjct: 226 AVRTSDAESVMNAYHDIDGIPCASDEWLLTDVLRGEWGFDGTVVSDYYSVEFLRSEHGVA 285
Query: 290 NDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
D +E+A A L+AG+D++ D Y + + V+ G ++E +D ++R + +R G
Sbjct: 286 AD-EEEAGAMALEAGIDVELPYTDCYGDSLVKGVESGHLSEETVDHAVRRVLRAKVRKGL 344
Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
FD + EL AAR+ + LLKN+ LPL ++A++GP A+
Sbjct: 345 FDDPTVDPDAASEPFGTDAADELTTRAARESMTLLKNEGDLLPLAGSETDSVAVIGPKAD 404
Query: 408 ATKAMIGNY--------EGTPCRYTSPMDGFYA----YSKVINYAPGCA----------- 444
+ ++G+Y E T+P+D + + +++ GC
Sbjct: 405 DGQELMGDYAYAAHYPEEEVELDATTPLDAIRSRGDEFGFEVSHEQGCTMTGPGTGGFDA 464
Query: 445 ------------DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGF 492
V +++ + +D + +TV +G EG D VDL LPG
Sbjct: 465 AASAAAEADVAVAFVGARSAVDLSDMDKEQENRSTVPTSG-------EGCDVVDLDLPGV 517
Query: 493 QTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFG 552
Q EL+ +V D P+ +V++S I + +++ PGE GG IA +FG
Sbjct: 518 QQELVERV-DQTGTPLVVVVVSGKPHSIEAISE--AVPAVVQAWLPGERGGEGIAATLFG 574
Query: 553 KYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFK 611
++NPGG LP++ + + Y+ P N + + D +YPFG+GLSYT F+
Sbjct: 575 EHNPGGHLPVSIPRTVGQIPVHYSRKP-----NSANEDHVYVDSDPLYPFGHGLSYTDFE 629
Query: 612 YKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGK 671
Y D+ L D+ PP + T + VEN G+
Sbjct: 630 YG--------DLALSDDE------------IPPAGTI------------TAAVTVENAGE 657
Query: 672 MDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANS 730
G +VV +Y + + +++++G+ERV + AG + +V F ++A + L D +
Sbjct: 658 RAGHDVVQLYVRAENPSQARPVQELVGFERVSLDAGDARRVSFEIDASQ-LAYHDRNFDL 716
Query: 731 LLASGAHTILVG 742
+ G + + VG
Sbjct: 717 TVEEGPYQLRVG 728
>gi|293371041|ref|ZP_06617583.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292633971|gb|EFF52518.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 791
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 213/726 (29%), Positives = 337/726 (46%), Gaps = 120/726 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG T FPT I A+++ L K++
Sbjct: 138 RLGIPMF-LAEEAPHGHMAIG-----------------ITVFPTGIGMAATWSPELVKEV 179
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ ++ E R+ G + P +++ RDPRW RV ET GEDP + G V GL
Sbjct: 180 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGAAMVDGLI 234
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ + SR A KH+ AY + N + V +++ E F+ PF+
Sbjct: 235 N--------GNISRKNSTIATLKHFLAYAVPEGGQNGN---QALVGMRELHENFLPPFKK 283
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++GIP A+ LLNQ +R +W F G++VSD SI+ I ESH +
Sbjct: 284 AIDAGALS-VMTSYNSIDGIPCTANSYLLNQLLRNEWKFRGFVVSDLYSIEGIYESH-YT 341
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
+ EDA + + AG+D+D G + YTN AV++ +++EA ID + + + +G F
Sbjct: 342 ASSIEDAAIQAVSAGVDVDLGGEAYTNIYR-AVKEKRLSEAIIDEVVCRVLRLKFEMGLF 400
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + + N HI A A+ + LLKN + LPL + NI+ +A++GP+A+
Sbjct: 401 ENPYVDPQIAIERVRNANHIANARRMAQASVTLLKNRHDILPL-SKNIRKVAVIGPNADN 459
Query: 409 TKAMIGNYEGTPCR---YTSPMDGFYAYSKV--INYAPGCADIVCQNNSMIPAAIDAAKN 463
M+G+Y P + + +DG + + + Y GCA I N+ I A++AA
Sbjct: 460 CYNMLGDYTA-PQKDENIKTVLDGIISKLSLSRVEYVRGCA-IRDTTNNEIAKAVEAANR 517
Query: 464 ADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINKV 500
AD + V G + + EG DR L L G Q EL+ +
Sbjct: 518 ADVVIAVVGGSSARDFKTTYKETGAAIADKSQISDMECGEGFDRATLSLLGKQLELLESL 577
Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
K P+ +V + ++ N+A + ++L YPG+EGG AIADV+FG YNP GRL
Sbjct: 578 KSTRK-PLIVVYIEGRPLNKNWAAEHAD--ALLTAYYPGQEGGDAIADVLFGDYNPAGRL 634
Query: 561 PITWYEANYVKIPYT--SMPLRPVNNFPG-RTYKFFDGPVVYPFGYGLSYTQFKYKVASS 617
P++ +P + +P+ P Y +Y FGYGLSY+ F+Y
Sbjct: 635 PVS--------VPRSEGQIPVYYNKKTPKCHDYVEMSASPLYSFGYGLSYSTFEYS---- 682
Query: 618 PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV 677
N V P F +VEN GK DG EV
Sbjct: 683 -----------------NLKVTQQAP--------------LHFEISFDVENTGKYDGEEV 711
Query: 678 VMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGA 736
+Y + + ++Q+ ++R F+ G+ + FT+ + L I++ ++ G+
Sbjct: 712 AQLYIRDEYASVVRALRQLKHFKRFFLKQGEKKTIVFTL-VEEDLSIINQKMERIVEPGS 770
Query: 737 HTILVG 742
+++G
Sbjct: 771 FQLMIG 776
>gi|393787054|ref|ZP_10375186.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
CL02T12C05]
gi|392658289|gb|EIY51919.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
CL02T12C05]
Length = 958
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 227/808 (28%), Positives = 366/808 (45%), Gaps = 140/808 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D P R +DL+ +M L EK QM L YG R+ LP EW W + + +
Sbjct: 65 YEDPTAPIDARIEDLLSQMNLNEKTCQMVTL-YGYKRVLKDALPTPEWKQMLWKDGMGAI 123
Query: 67 S-----------------------------------FIGRRTNSPPGTHFDSEVPG---- 87
FI P + + G
Sbjct: 124 DEHLNGFQQWGLPPSDNENVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVESY 183
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L ++G EAR + G T ++P ++V RD RWGR
Sbjct: 184 KATNFPTQLGLGHTWNRKLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 237
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I V+G+Q +++A KH+ AY +
Sbjct: 238 YEEVYGESPYLVAELGIEMVKGMQ-------------HNYQVAATGKHFIAYSNNKGARE 284
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + PF+ + E + VM SYN +G+P + L +RG
Sbjct: 285 GMARVDPQMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGLPVQSSYYWLMTRLRGQ 344
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 345 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 403
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA-EAARQGIV 380
Q+G ++E I+ +R + V +G FD Q G + + E+ A +A+R+ IV
Sbjct: 404 QEGGLSEEIINDRVRDILRVKFLVGLFDTPYQTDLKGADEEVEKEENEIVALQASRESIV 463
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVI 437
LLKND ALPL+ +I+ +A+ GP+A+ T + +Y T+ + G +
Sbjct: 464 LLKNDKNALPLDVASIRKIAVCGPNADETAYALTHYGPLAVDVTTVLSGIRQKVDGKAEV 523
Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
Y GC ++V N + I A+ AK AD V+V G E K
Sbjct: 524 LYTKGC-ELVDANWPESEIIDYPLTNDEQNKIDKAVAQAKEADVAVVVLGGGQRTCGENK 582
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ V K PV LV+++ + +N+A + + +I+ YPG +G
Sbjct: 583 SRSSLDLPGRQLDLLKAVQATGK-PVVLVLINGRPLSVNWA--DKFVPAIIEAWYPGSKG 639
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G A+ADV+FG YNPGG+L +T + + +IP+ + P +P + G G +
Sbjct: 640 GTAVADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPCKPSSQIDGGKNPGPKGNMSRVNG 697
Query: 598 -VYPFGYGLSYTQFKYK-VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+YPFG+GLSYT F+Y ++ SPK + P V V+C
Sbjct: 698 ALYPFGHGLSYTTFEYSDISISPKVIT---------------------PNQKV---QVRC 733
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
K + N GK G EVV +Y + T+ K + G+ER+ + G++ +V FT
Sbjct: 734 K---------ITNTGKRAGDEVVQLYVRDILSSVTTYEKNLEGFERIHLQPGETKEVSFT 784
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
++ K+L++++ + ++ G +I++G
Sbjct: 785 LDR-KALELLNAKNDWVVEPGDFSIMLG 811
>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 802
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 238/833 (28%), Positives = 362/833 (43%), Gaps = 161/833 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + +P ER + L+ +MTL EKV QM + LG P+YE
Sbjct: 37 YENPSVPVEERVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEEIRLTARLEKEI 90
Query: 58 ----------------WWSEALH---GVSFIGRRTNSPPG---THFDSEVP--------- 86
W LH S R +N H +P
Sbjct: 91 SEYHIGALWGFMRADPWTQRTLHTGLNPSLAARASNRLQAFVMEHSRLGIPLFLAEECPH 150
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++TEA A + P +++ RDP
Sbjct: 151 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIATEASA-----QGAHIGYGPVLDLARDP 205
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q D+ + A KH+A+Y
Sbjct: 206 RWSRVEETYGEDPYLNGVMGAALVRGFQG--------DTLRGRKSVIATLKHFASY---G 254
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G +S VM SYN ++G P LL
Sbjct: 255 WTEGGHNGGTAHLGERELEEAIFPPFREAVGAGALS-VMSSYNEIDGNPCTGSRYLLTDI 313
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
++ W F G++VSD +I + E H E AV + + AG+D D G + Y + A
Sbjct: 314 LKDRWQFKGFVVSDLYAIGGLRE-HGVAGSDYEAAV-KAVNAGVDSDLGTNVYAEQLVAA 371
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A +D ++R + + +G FD + +P+HI LA E ARQ IV
Sbjct: 372 VRKGDVAMETVDKAVRRILFLKFHMGLFDAPFVDDKRPAQLVASPEHIGLAREVARQSIV 431
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN++ LPL +I+TLA++GP+A+ M+G+Y +G+ + +
Sbjct: 432 LLKNEDKLLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDT 490
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI+AA++AD V+V G D S E
Sbjct: 491 RVLYAKGCA-VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANR 549
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +V K P+ LV++ + + + +I
Sbjct: 550 VSDMESGEGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQ--EADAI 606
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
L YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 607 LDAWYPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTKRKGNRSRY 660
Query: 593 FD--GPVVYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCR-DINYTVGTNKPPCA 646
+ G YPFGYGLSYT F Y KV S +S CR D++ T
Sbjct: 661 IEEAGTPRYPFGYGLSYTTFSYTGMKVRVSEES--------NHCRVDVSVT--------- 703
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAA 705
V N G +DG EVV +Y + G T +Q+ + RV + A
Sbjct: 704 -------------------VRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKA 744
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
G++ ++ FT++ KSL + + G T++ G ++ + +N
Sbjct: 745 GETREITFTLDK-KSLALYMRDGEWAVEPGRFTVMAGGSSEDIACQQEFEINR 796
>gi|29347190|ref|NP_810693.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339089|gb|AAO76887.1| periplasmic beta-glucosidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 950
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 226/762 (29%), Positives = 359/762 (47%), Gaps = 119/762 (15%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEAL 63
K++D Y DA LP ER + L+ MT PE ++ +G+P G+P LY EA+
Sbjct: 160 KVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAV 216
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HG S+ G+ GAT FP + A++N L +++ + E A N
Sbjct: 217 HGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NT 259
Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
A WSP ++V +D RWGR ET GEDP +V + +++G Q SR
Sbjct: 260 KQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SR 303
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
L + KH+ + R D ++E++M+E ++PF + D S+M +Y
Sbjct: 304 GLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREIHLVPFRHAIRNYDCQSLMMAY 358
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
+ G+P +LL Q +R +W F+G+IVSDC +I + + K +A + L A
Sbjct: 359 SDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAA 418
Query: 304 GLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
G+ +CGD Y N + A + G+I D+D R + + R F+ +P K L I
Sbjct: 419 GIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKI 477
Query: 363 C----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-- 416
+ H E+A +AAR+ IV+L+N + LPL+ ++T+A++GP A+ + G+Y
Sbjct: 478 YPGWNSDSHKEMARQAARESIVMLENKDNLLPLSK-TLRTIAVLGPGADDLQP--GDYTP 534
Query: 417 EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
+ P + S + G +KV+ Y GC D + + IP A+ AA +D ++V G
Sbjct: 535 KLLPGQLKSVLTGIKGAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLG 592
Query: 473 LDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
+ EA E D L+LPG Q EL+ V K PV L++ + DI
Sbjct: 593 DCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI--L 649
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVN 583
K + K+IL PG+EGG A+ADV+FG YNP GRLP+T+ +PL
Sbjct: 650 KASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMTFPRH------VGQLPLYYNF 703
Query: 584 NFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
GR Y++ D +Y FG+GLSYT F+Y ++K+ Q+ + N V
Sbjct: 704 KTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------NLKI---QEKANGNVEV--- 749
Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYER 700
Q V+N+G G EV +Y + T + ++ + R
Sbjct: 750 ---------------------QATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFAR 788
Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ + G+S V F M + ++++ + ++ G I+VG
Sbjct: 789 IHLQPGESKTVSFEMTPY-DISLLNDRMDRVVEKGEFKIMVG 829
>gi|383125188|ref|ZP_09945842.1| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
gi|382983435|gb|EES66611.2| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
Length = 954
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 226/762 (29%), Positives = 359/762 (47%), Gaps = 119/762 (15%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEAL 63
K++D Y DA LP ER + L+ MT PE ++ +G+P G+P LY EA+
Sbjct: 164 KVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAV 220
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HG S+ G+ GAT FP + A++N L +++ + E A N
Sbjct: 221 HGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NT 263
Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
A WSP ++V +D RWGR ET GEDP +V + +++G Q SR
Sbjct: 264 KQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SR 307
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
L + KH+ + R D ++E++M+E ++PF + D S+M +Y
Sbjct: 308 GLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREIHLVPFRHAIRNYDCQSLMMAY 362
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
+ G+P +LL Q +R +W F+G+IVSDC +I + + K +A + L A
Sbjct: 363 SDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAA 422
Query: 304 GLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
G+ +CGD Y N + A + G+I D+D R + + R F+ +P K L I
Sbjct: 423 GIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKI 481
Query: 363 C----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-- 416
+ H E+A +AAR+ IV+L+N + LPL+ ++T+A++GP A+ + G+Y
Sbjct: 482 YPGWNSDSHKEMARQAARESIVMLENKDNLLPLSK-TLRTIAVLGPGADDLQP--GDYTP 538
Query: 417 EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
+ P + S + G +KV+ Y GC D + + IP A+ AA +D ++V G
Sbjct: 539 KLLPGQLKSVLTGIKGAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLG 596
Query: 473 LDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
+ EA E D L+LPG Q EL+ V K PV L++ + DI
Sbjct: 597 DCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI--L 653
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVN 583
K + K+IL PG+EGG A+ADV+FG YNP GRLP+T+ +PL
Sbjct: 654 KASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMTFPRH------VGQLPLYYNF 707
Query: 584 NFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
GR Y++ D +Y FG+GLSYT F+Y ++K+ Q+ + N V
Sbjct: 708 KTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------NLKI---QEKANGNVEV--- 753
Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYER 700
Q V+N+G G EV +Y + T + ++ + R
Sbjct: 754 ---------------------QATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFAR 792
Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ + G+S V F M + ++++ + ++ G I+VG
Sbjct: 793 IHLQPGESKTVSFEMTPY-DISLLNDRMDRVVEKGEFKIMVG 833
>gi|154493932|ref|ZP_02033252.1| hypothetical protein PARMER_03276 [Parabacteroides merdae ATCC
43184]
gi|154086192|gb|EDN85237.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
Length = 955
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 226/808 (27%), Positives = 362/808 (44%), Gaps = 140/808 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
Y D +P R +DL+ +M + EK QM L YG R+ LP +W
Sbjct: 61 YEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGAI 119
Query: 59 --------------------WSEALHGVS------FIGRRTNSPPGTHFDSE-VPG---- 87
W + H + F T T F +E + G
Sbjct: 120 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 179
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N +L K+G E R + G T ++P ++V RD RWGR
Sbjct: 180 IATNFPTQLGLGHTWNRNLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWGR 233
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I +G+Q +D +++A KHY AY +
Sbjct: 234 YEEVYGESPYLVAELGIEMAKGMQ----------TDH---QVAATSKHYIAYSNNKGGRE 280
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + P++ + E + VM SYN +G P + L +RG+
Sbjct: 281 GMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
+ F GY+VSD D+++ + H D KE + VL AGL++ C D Y +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKESVLQSVL-AGLNIRCTFRSPDSYVLPLRELI 399
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGI 379
+G I + ID +R + V +G FD Q K K C + +A +A+++ +
Sbjct: 400 AEGAIPMSTIDDRVRDILRVKFLVGLFDHPYQIDLKETDKEVNCAENQL-VALQASKESL 458
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV--- 436
VLLKN + LPL+ I +A+ GP+A+ + +Y T+ ++G K
Sbjct: 459 VLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTD 518
Query: 437 INYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ + GC D+V + S I A++ AK +D TV+V G E
Sbjct: 519 VLFTKGC-DLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGEN 577
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
K R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +
Sbjct: 578 KSRSSLDLPGRQLDLLQAVVATGK-PVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQ 634
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV---- 597
GG AIAD +FG YNPGG+L +T + +IP+ + P +P G K DG +
Sbjct: 635 GGTAIADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRVN 692
Query: 598 --VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+YPFGYGLSYT F+Y S I + T P V+C
Sbjct: 693 GPLYPFGYGLSYTTFEYSDIS-----------------IQPAIVTQVQPVT------VRC 729
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
K V N GK G EVV +Y + T+ K ++G++R+ + G++ ++ FT
Sbjct: 730 K---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFT 780
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
+ + L+++++ + ++ G ++VG
Sbjct: 781 IEP-RDLQLLNSDNHWVVEPGDFKVMVG 807
>gi|423346097|ref|ZP_17323785.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
CL03T12C32]
gi|409220895|gb|EKN13848.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
CL03T12C32]
Length = 955
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 225/808 (27%), Positives = 364/808 (45%), Gaps = 140/808 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
Y D +P R +DL+ +M + EK QM L YG R+ LP +W
Sbjct: 61 YEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGAI 119
Query: 59 --------------------WSEALHGVS------FIGRRTNSPPGTHFDSE-VPG---- 87
W + H + F T T F +E + G
Sbjct: 120 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 179
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N +L K+G E R + G T ++P ++V RD RWGR
Sbjct: 180 IATNFPTQLGLGHTWNRNLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWGR 233
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I +G+Q +D +++A KHY AY +
Sbjct: 234 YEEVYGESPYLVAELGIEMAKGMQ----------TDH---QVAATSKHYIAYSNNKGGRE 280
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + P++ + E + VM SYN +G P + L +RG+
Sbjct: 281 GMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
+ F GY+VSD D+++ + H D KE + VL AGL++ C D Y +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKESVLQSVL-AGLNIRCTFRSPDSYVLPLRELI 399
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGI 379
+G I + ID +R + V +G FD Q K K C ++ ++A +A+++ +
Sbjct: 400 AEGAIPMSTIDDRVRDILRVKFLVGLFDHPYQIDLKETDKEVNC-AENQQVALQASKESL 458
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV--- 436
VLLKN + LPL+ I +A+ GP+A+ + +Y T+ ++G K
Sbjct: 459 VLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTN 518
Query: 437 INYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ + GC D+V + S I A++ AK +D TV+V G E
Sbjct: 519 VLFTKGC-DLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSDRTCGEN 577
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
K R L LPG Q +L+ V K PV L++++ + IN+A + + +IL YPG +
Sbjct: 578 KSRSSLDLPGRQLDLLQAVVATGK-PVVLILINGRPLSINWA--DKYVPAILEAWYPGSQ 634
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV---- 597
GG AIAD +FG YNPGG+L +T + +IP+ + P +P G K DG +
Sbjct: 635 GGTAIADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRVN 692
Query: 598 --VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+YPFGYGLSYT F+Y S I + T P V+C
Sbjct: 693 GPLYPFGYGLSYTTFEYSDIS-----------------IQPAIVTQVQPVT------VRC 729
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
K V N GK G EVV +Y + T+ K ++G++R+ + G++ ++ FT
Sbjct: 730 K---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFT 780
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
+ + L+++++ + ++ G ++VG
Sbjct: 781 IEP-RDLQLLNSDNHWVVEPGDFKVMVG 807
>gi|255689965|ref|ZP_05413640.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624572|gb|EEX47443.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 688
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 203/724 (28%), Positives = 340/724 (46%), Gaps = 100/724 (13%)
Query: 33 TLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFP 92
T PE M A RLG+P+ + +A+HG T +P
Sbjct: 43 TNPELRNNMQKKAMEESRLGIPII-FGYDAIHGFR---------------------TVYP 80
Query: 93 TVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGE 152
+ S+N L ++ + EAR + TF SP I+V RDPRWGRV E GE
Sbjct: 81 ISLAQACSWNPDLVEQACAVSAQEAR----MSGVDWTF-SPMIDVARDPRWGRVAEGYGE 135
Query: 153 DPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDS 212
DPY G + VRG Y D+ S +++AC KHY Y G D + +
Sbjct: 136 DPYANGVFGAASVRG--------YQGDNMSAENRVAACLKHYVGYGASE-AGRDYVY--T 184
Query: 213 RVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYI 272
+++Q + +T++LP+EM V G +++M S+N ++G+P A+P + + ++ W G+I
Sbjct: 185 EISQQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYTMTEILKNRWRHDGFI 243
Query: 273 VSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADI 331
VSD +I+ + ++ L TK++A AGL++D + Y V++GK++ A +
Sbjct: 244 VSDWGAIEQL--KNQGLAATKKEAARYAFTAGLEMDMMSHAYDRHLQELVEEGKVSMAQV 301
Query: 332 DTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
D ++R + ++ RLG F+ K P+ +++AA A + +VLLKN+N LPL
Sbjct: 302 DEAVRRVLLLKFRLGLFERPYTPATTEKERFFRPKSMDIAARLAAESMVLLKNENNVLPL 361
Query: 392 NTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPM--DGF---YAYSKVINYAPGCADI 446
+ K +A++GP A ++G++ G M DG +A + YA GC +
Sbjct: 362 T--DKKKIAVIGPMAKNGWDLLGSWRGHGKDTDVAMLYDGLAAEFAGKAELRYALGC-NT 418
Query: 447 VCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG 506
N A++AA+ +D V+ G ++ E R + LP Q EL ++ A K
Sbjct: 419 QGDNREGFAEALEAARWSDVVVLCLGEMMTWSGENASRSSIALPQMQEELAKELKKAGK- 477
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
PV LV+++ +++N + P +IL + PG G +A ++ G+ NP G+L +T+
Sbjct: 478 PVVLVLVNGRPLELN--RLEPVSDAILEIWQPGVNGALPMAGILSGRINPSGKLAMTF-- 533
Query: 567 ANYVKIPYTS--MPLRPVNNFPGRTYKFFDGPV----VYPFGYGLSYTQFKYKVASSPKS 620
PY++ +P+ GR ++ F + +YPFG+GLSYT+FKY
Sbjct: 534 ------PYSTGQIPIYYNRRKSGRGHQGFYKDITSDPLYPFGHGLSYTEFKY-------- 579
Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
GT P V + K + ++ V N+G DG+E V
Sbjct: 580 ------------------GTVTPSATKV------KRGEKLSAEVTVTNIGARDGAETVHW 615
Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
+ P + T +K++ +E+ I AG++ F ++ + V+ L +G + I
Sbjct: 616 FISDPYCSITRPVKELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLETGEYNI 675
Query: 740 LVGE 743
V E
Sbjct: 676 HVLE 679
>gi|423300893|ref|ZP_17278917.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
CL09T03C10]
gi|408472228|gb|EKJ90756.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
CL09T03C10]
Length = 798
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 231/800 (28%), Positives = 363/800 (45%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 54 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTDGWSTEIWKDGIGNI 112
Query: 64 ----HGVSFIGRR-----TNSPPGTH-----------------FDSE-VPG-----ATSF 91
+G+ G NS H F +E + G AT F
Sbjct: 113 DEQANGLGKFGSEISYPYANSAKNRHTVQRWFVEKTRLGIPVDFTNEGIRGLCHDRATMF 172
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T ++P +++ +DPRWGRV+E+
Sbjct: 173 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYAPILDIAQDPRWGRVVESY 226
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + GLQ+ EG I A KH+A Y + +
Sbjct: 227 GEDPYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 272
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 273 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 332
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 333 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 386
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H +++ AA + IV
Sbjct: 387 SEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPEVVVHNAAHQDVSMRAALESIV 446
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+ LPL+ + +A++GP+A K + Y + G Y + +
Sbjct: 447 LLKNEKEMLPLSK-SFNKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVR 505
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 506 YAKGC-DIIDKYFPESELYNVPLDTQEKAMINEAVELAKASDVAILVLGGNEKTVREEFS 564
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 565 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 621
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G K V+YPFGY
Sbjct: 622 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKG---KVRVAGVLYPFGY 676
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y D+K+ +KP A T
Sbjct: 677 GLSYTTFGYS--------DLKV---------------SKPVIGA---------QENITLS 704
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ + FT+ + L
Sbjct: 705 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEERTISFTLTP-QDLG 763
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + + G+ +++VG
Sbjct: 764 LWDKNNHFTVEPGSFSVMVG 783
>gi|313204103|ref|YP_004042760.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443419|gb|ADQ79775.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 1278
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 161/432 (37%), Positives = 237/432 (54%), Gaps = 41/432 (9%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y + + ERA DLV RMTL EK Q+G+ +PRLG+ Y+ W EALHGV +GR
Sbjct: 39 YLNTAYSFKERAADLVSRMTLEEKQSQLGNTMPPIPRLGVNKYDVWGEALHGV--VGRNN 96
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSP 133
NS G ATSFP + ++++ +L K+ V+ EAR + LT+WSP
Sbjct: 97 NS--GMI-------ATSFPNSVAVGSTWDPALIKRETSVVADEARGFNHDLIFTLTYWSP 147
Query: 134 NINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKH 193
I RDPRWGR ET GEDP++V + +V+GL D LK C KH
Sbjct: 148 VIEPARDPRWGRTAETFGEDPFLVSQIGSGFVQGLMG---------DDPTYLKTVPCGKH 198
Query: 194 YAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
Y A +N E N R + + + ++DM+E ++ P+ + + + S+M +Y+ VNG+P A
Sbjct: 199 YFA---NNSEFN-RHNGSANMDDRDMREFYLTPYRTLIQKDKLPSIMTAYSAVNGVPMSA 254
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYY 313
L++ + + GY+ DCD++ +V SH++ +K +A A LK G+D DCG Y
Sbjct: 255 SKFLVDTIAKRTYGLDGYVTGDCDAVADVVNSHRYAK-SKAEAAAMGLKTGVDSDCGGIY 313
Query: 314 TNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICNPQHIE 369
+ A++QG I+EAD+D +L +Y + MRLG FD PQ Y + + I +P H +
Sbjct: 314 QTSALEALKQGLISEADMDKALVNIYTIRMRLGEFD--PQNIVPYAGIKPSIINDPSHND 371
Query: 370 LAAEAARQGIVLLKND------NGALPLNTGNIKTLALVGPHANATKAMIGNYEGT--PC 421
LA E A + VLLKN+ ALPLN G IK +A++GP A+ K +G+Y G P
Sbjct: 372 LALEIATKSPVLLKNNLVGKSGKKALPLNAGTIKKIAVLGPQAD--KVELGDYSGEADPK 429
Query: 422 RYTSPMDGFYAY 433
+P++G Y
Sbjct: 430 YKITPLEGIKNY 441
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/284 (33%), Positives = 139/284 (48%), Gaps = 43/284 (15%)
Query: 446 IVCQNNSMIPAA----IDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVA 501
++ S +PA +D A +AD V+ G D + E DR + LPG Q ELI +A
Sbjct: 593 VLVYRESEVPATDKETLDMAASADVAVVFVGTDQTTGREESDRFAITLPGNQNELIKSIA 652
Query: 502 DAAKGPVTLVIMSA-GAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
A P T+V++ G V++ KNNP + I++ GY G+ G A+A V+FG NPGG+
Sbjct: 653 --AVNPNTIVVIQGMGMVEVEQFKNNPNVAGIIFTGYNGQAQGTAMAKVLFGDVNPGGKT 710
Query: 561 PITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
+TWY++ T LR GRTY +F+ V Y FGYGLSYT F Y
Sbjct: 711 SLTWYKSINDLPALTDYTLRGGAGKNGRTYMYFNKDVSYEFGYGLSYTTFAYS------- 763
Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
N+ + ++ +D K T ++V+N G +DG EVV +
Sbjct: 764 --------------NFNISK-----TSITPND------KVTVTVDVKNTGTVDGDEVVQI 798
Query: 681 YSKPPGIAGTH---IKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
Y K P + IK++ G++RV I AGQ+ V ++ C L
Sbjct: 799 YVKTPDSPASLERPIKRLKGFKRVAIPAGQTKTVSIEVD-CADL 841
>gi|423722678|ref|ZP_17696831.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
CL09T00C40]
gi|409241951|gb|EKN34716.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
CL09T00C40]
Length = 955
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 225/808 (27%), Positives = 362/808 (44%), Gaps = 140/808 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW------------ 58
Y D +P R +DL+ +M + EK QM L YG R+ LP +W
Sbjct: 61 YEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGAI 119
Query: 59 --------------------WSEALHGVS------FIGRRTNSPPGTHFDSE-VPG---- 87
W + H + F T T F +E + G
Sbjct: 120 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 179
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N +L K+G E R + G T ++P ++V RD RWGR
Sbjct: 180 IATNFPTQLGLGHTWNRNLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWGR 233
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V + +G+Q +Y +++A KHY AY +
Sbjct: 234 YEEVYGESPYLVAELGVEMAKGMQ----TDY---------QVAATSKHYIAYSNNKGGRE 280
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + P++ + E + VM SYN +G P + L +RG+
Sbjct: 281 GMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
+ F GY+VSD D+++ + H D KE + VL AGL++ C D Y +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKESVLQSVL-AGLNIRCTFRSPDSYVLPLRELI 399
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAEAARQGI 379
+G I + ID +R + V +G FD Q K K C + +A +A+++ +
Sbjct: 400 AEGAIPMSTIDDRVRDILRVKFLVGLFDHPYQIDLKETDKEVNCAENQL-VALQASKESL 458
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV--- 436
VLLKN + LPL+ I +A+ GP+A+ + +Y T+ ++G K
Sbjct: 459 VLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTD 518
Query: 437 INYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ + GC D+V + S I A++ AK +D TV+V G E
Sbjct: 519 VLFTKGC-DLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGEN 577
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
K R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +
Sbjct: 578 KSRSSLDLPGRQLDLLQAVVATGK-PVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQ 634
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV---- 597
GG AIAD +FG YNPGG+L +T + +IP+ + P +P G K DG +
Sbjct: 635 GGTAIADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRVN 692
Query: 598 --VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+YPFGYGLSYT F+Y S I + T P V+C
Sbjct: 693 GPLYPFGYGLSYTTFEYSDIS-----------------IQPAIVTQVQPVT------VRC 729
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
K V N GK G EVV +Y + T+ K ++G++R+ + G++ ++ FT
Sbjct: 730 K---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFT 780
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
+ + L+++++ + ++ G ++VG
Sbjct: 781 IEP-RDLQLLNSDNHWVVEPGDFKVMVG 807
>gi|224536377|ref|ZP_03676916.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522015|gb|EEF91120.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
DSM 14838]
Length = 954
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 227/760 (29%), Positives = 357/760 (46%), Gaps = 119/760 (15%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEALHG 65
+ Y D LP ER + L+ MT PE ++ +G+P G+P LY EA+HG
Sbjct: 166 TSLRYMDPTLPVEERVESLLSVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAVHG 222
Query: 66 VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
S+ G+ GAT FP + A++N+ L + + V E L
Sbjct: 223 FSY---------GS-------GATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAA 261
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+ WSP ++V +D RWGR ET GEDP +V + +++G Q S+ L
Sbjct: 262 GTMQAWSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SKGL 309
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ KH+ + R D ++E++M+E ++PF + D SVM +Y+
Sbjct: 310 FTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSVMMAYSD 364
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G+P +LL+ +R +W F G+IVSDC +I + + K +A + L AG+
Sbjct: 365 YLGVPVAKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGI 424
Query: 306 DLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC- 363
+CGD Y + + A + G+I ++D R + ++ R F+ +P K L N I
Sbjct: 425 ATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPN-KPLDWNKIYP 483
Query: 364 ---NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--EG 418
+ H E+A +AAR+ IV+L+N + LPL +++T+A+VGP A+ + G+Y +
Sbjct: 484 GWNSDSHKEMARQAARESIVMLENKDNILPL-AKDMRTIAVVGPGADDLQP--GDYTPKL 540
Query: 419 TPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
P + S + G +KV+ Y GC D N + IP A+ AA +D V+V G
Sbjct: 541 LPGQLKSVLTGIKQAVGKQTKVV-YEQGC-DFTSSNGTNIPKAVKAASQSDVVVLVLGDC 598
Query: 475 LSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
+ E+ E D L+LPG Q EL+ V A G ++I+ AG N +K
Sbjct: 599 STSESTTDVYKTSGENHDYATLILPGKQQELLEAV--CATGKPVILILQAGR-PYNLSKA 655
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+ K+IL PG+EGG A ADV+FG YNP GRLP+T+ +V +PL
Sbjct: 656 SELCKAILVNWLPGQEGGPATADVLFGDYNPAGRLPMTF--PRHV----GQLPLYYNFKT 709
Query: 586 PGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
GR Y++ D +Y FGYGLSYT F+Y K Q+ + N +
Sbjct: 710 SGRRYEYSDMEFYPLYYFGYGLSYTSFEYSGL-----------KIQEKDNGNVAI----- 753
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF 702
Q V+N+G+ G EVV +Y + T I ++ + RV
Sbjct: 754 -------------------QATVKNVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVH 794
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ G+S V F + + L ++++ + ++ G ILVG
Sbjct: 795 LQPGESKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILVG 833
>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
Length = 865
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 160/451 (35%), Positives = 242/451 (53%), Gaps = 43/451 (9%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRR 72
PY + L ERA DL++RMTL EK+ QM + + + RLG+P Y WW+EALHGV+ G+
Sbjct: 24 PYKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYNWWNEALHGVARAGK- 82
Query: 73 TNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------LG 124
AT FP I A+F+ + VS EARA Y+ G
Sbjct: 83 ---------------ATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYHDFQRKGERDG 127
Query: 125 NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRP 184
GLTFW+PNIN+ RDPRWGR +ET GEDPY+ + V+GLQ D +
Sbjct: 128 YKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG--------DGTGKY 179
Query: 185 LKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K AC KHYA + W +R FD++ ++++D+ ET++ F+ V EG V VMC+Y
Sbjct: 180 DKTHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVTEGKVKEVMCAY 236
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI-VESHKFLNDTKEDAVARVLK 302
NR G P C++ +LL + +R DW + +VSDC +I +H + T A A +
Sbjct: 237 NRYEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVV 296
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKN 360
+G DL+CG Y++ AV++G I+E I+ S+ L +LG FD + + + +
Sbjct: 297 SGTDLECGGSYSSLNE-AVRKGLISEDKINESVFRLLRARFQLGMFDDNTLVSWSEIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ + +H+ A E AR+ +VLL N N LPL+ +++ +A++GP+AN + + NY G P
Sbjct: 356 VVESKEHVAKALEMARKSMVLLTNKNNILPLSK-SVRKVAVLGPNANDSVMLWANYNGFP 414
Query: 421 CRYTSPMDGFYAY--SKVINYAPGCADIVCQ 449
+ + ++G + Y GC + Q
Sbjct: 415 TKSVTILEGIRNKLPEGAVYYEKGCDFVNTQ 445
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 97/328 (29%), Positives = 149/328 (45%), Gaps = 59/328 (17%)
Query: 432 AYSKVINY--APGCA----DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----- 480
AY V+ Y A G A DI + D A AD + V GL S+E E
Sbjct: 563 AYKVVLEYFQAGGEASLKFDIGIKKEINYKEMADKAAEADVIIFVGGLSSSLEGEEMPVD 622
Query: 481 -----GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
DR ++ LP Q E++ + K PV V+ S + + + N + +I+
Sbjct: 623 LPGFRKGDRTNIDLPQVQEEMLKALKKTGK-PVVFVLCSGSTLALPWEAEN--LDAIIEA 679
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
YPG++GG A+ADV+FG YNP GRLP+T+Y ++ + +P + RTY++F G
Sbjct: 680 WYPGQQGGTAVADVLFGDYNPAGRLPLTFYASS------SDLPDFEDYDMSNRTYRYFKG 733
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
++PFG+GLSYT F Y A + K + G
Sbjct: 734 RPLFPFGHGLSYTTFDYGKAKADKKI--------------LRAGEG-------------- 765
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
T I ++N+GK+ G EVV VY + PG IK + + R+ + AGQ+ V F +
Sbjct: 766 ----LTLTIPLKNIGKLSGDEVVQVYLRNPGDKEGPIKTLRAFRRISLEAGQAEDVLFEL 821
Query: 716 NACKSLKIVDNAANSL-LASGAHTILVG 742
+ + + A N + + G + +L G
Sbjct: 822 -PVSTFEWFNPATNRMEVLPGKYELLYG 848
>gi|423217451|ref|ZP_17203947.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
CL03T12C61]
gi|392628610|gb|EIY22636.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
CL03T12C61]
Length = 946
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 232/807 (28%), Positives = 363/807 (44%), Gaps = 138/807 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y D P R +DL+ +MTL EK QM L YG R+ LP EW ++ G+ I
Sbjct: 53 YEDPTAPIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTSEWKNQLWKDGIGAI 111
Query: 70 GRRTNS------PPG-------------------------------THFDSE-VPG---- 87
N PP T F +E + G
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L ++G EAR + G T ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRQLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRG+Q H +++A KH+ AY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ-------HNH------QVAATGKHFIAYSNNKGARE 272
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ PF+ + E + VM SYN +G P + L +RG+
Sbjct: 273 GMARVDPQMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA-EAARQGIV 380
++G ++E I+ +R + V +G FD Q G + + E A +A+R+ IV
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLKGADEEVEKKENEEVALQASRESIV 451
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
LLKN+ LPL+ I+ +A+ GP+A+ + +Y TS + G K +
Sbjct: 452 LLKNEKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADV 511
Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
Y GC D+V N I A+ AK AD ++V G E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENK 570
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +G
Sbjct: 571 SRSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKG 627
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G A+AD++FG YNPGG+L +T + +IP+ + P +P + G DG +
Sbjct: 628 GIAVADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRANG 685
Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFGYGLSYT F+Y D+ + P A V CK
Sbjct: 686 ALYPFGYGLSYTTFEYS-------------------DLKISPAIITPNQKAY----VTCK 722
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
V N GK G EV+ +Y + T+ K ++G+ERV + G++ ++ F +
Sbjct: 723 ---------VTNTGKRSGDEVIQLYVRDVLSSVTTYEKNLVGFERVHLKPGETKEITFPI 773
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
+ K+L++++ + ++ G T+++G
Sbjct: 774 DR-KALELLNADMHWVVEPGDFTLMLG 799
>gi|325105296|ref|YP_004274950.1| beta-glucosidase [Pedobacter saltans DSM 12145]
gi|324974144|gb|ADY53128.1| Beta-glucosidase [Pedobacter saltans DSM 12145]
Length = 884
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 165/457 (36%), Positives = 241/457 (52%), Gaps = 55/457 (12%)
Query: 7 VKLSDFPYC--DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
+K + PY + LP ER ++L+ +TL EKV M + + V RLG+P Y+WW+EALH
Sbjct: 23 LKSQEIPYKFRNPDLPVNERIENLLGLLTLEEKVGLMMNSSKPVGRLGIPAYDWWNEALH 82
Query: 65 GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN-- 122
GV+ G+ AT FP I A++NES K+ +S EARA YN
Sbjct: 83 GVARSGK----------------ATVFPQAIGMAATWNESGHKQTFDLISDEARAKYNEA 126
Query: 123 LGNA------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEY 176
+ N GL+FW+PNIN+ RDPRWGR ET GEDPY+ R + VRGLQ
Sbjct: 127 IRNGERGRYYGLSFWTPNINIFRDPRWGRGQETYGEDPYLTARLGVAAVRGLQ------- 179
Query: 177 HRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDV 236
D + K AC KH+A + W +R +D+ + +D+ ET++ F+ V E +V
Sbjct: 180 --GDDPKYFKTHACAKHFAVHSGPEW---NRHSYDATASGRDLWETYLPAFKALVKEANV 234
Query: 237 SSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-----ESHKFLND 291
VMC+YN G P C +LL +R W + G +VSDC +I E+HK
Sbjct: 235 QEVMCAYNAYEGQPCCGSDRLLTDILRNRWEYKGIVVSDCWAIDDFFRKGHHETHKDAAA 294
Query: 292 TKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGS 351
DAV DL+CG YTN + AV+QG I++ ID SLR + LG D +
Sbjct: 295 AAADAVIH----STDLECGSAYTNL-LEAVRQGLISQQQIDISLRRVLRGWFELGMLDPA 349
Query: 352 PQ--YKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
+ + L + + +H++ A + AR+ + LLKN+ LPL + +IK +A++GP+A +
Sbjct: 350 ERLPWSQLPYQIVASKEHVQQALKVARESMTLLKNNGSILPL-SKSIKKIAVIGPNAADS 408
Query: 410 KAMIGNYEGTPCRYTSPMDGF---YAYSKVINYAPGC 443
+ GNY GTP + + G ++++I Y GC
Sbjct: 409 VMLWGNYNGTPNSTVTILQGIKNKLPHAEII-YDKGC 444
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 88/302 (29%), Positives = 136/302 (45%), Gaps = 53/302 (17%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
I K DA V GL +E E G D++ + LP Q EL++ + K P
Sbjct: 602 IKRLKEVDAIVYAGGLSPQLEGEEMPVNADGFRGGDKISIDLPKIQRELLSSLKSTGK-P 660
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
V V+ + ++ + + N ++L Y G+E G A+ADV+FG YNP GRLPIT+Y++
Sbjct: 661 VVFVLCTGSSLALEQDEKN--YNALLCAWYGGQEAGTAVADVLFGDYNPAGRLPITFYKS 718
Query: 568 ------NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
+K TS + GRTY++ +Y FG+GLSY++F Y
Sbjct: 719 LSQLDNALLKTSDTSRQDFENYSMQGRTYRYMTEKPLYAFGHGLSYSKFNY--------- 769
Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
G K V I + I + N+ G EVV VY
Sbjct: 770 -----------------GEAKLTSGTVKIGNT------LNISIPLTNISNNKGEEVVQVY 806
Query: 682 SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTIL 740
K G +K + G++RV IAAG++ + F + A ++ + D + + L +G +TI+
Sbjct: 807 VKRNGDPDAPVKSLKGFKRVAIAAGETKHLDFQLTA-EAFEFYDPSKDELGPKAGNYTIM 865
Query: 741 VG 742
G
Sbjct: 866 YG 867
>gi|189464325|ref|ZP_03013110.1| hypothetical protein BACINT_00666 [Bacteroides intestinalis DSM
17393]
gi|189438115|gb|EDV07100.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 935
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 224/760 (29%), Positives = 356/760 (46%), Gaps = 119/760 (15%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEALHG 65
+ Y D LP ER + L+ MT PE ++ +G+P G+P LY EA+HG
Sbjct: 147 TSLRYMDPTLPVEERVESLLSVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAVHG 203
Query: 66 VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
S+ G+ GAT FP + A++N+ L +++ V E L
Sbjct: 204 FSY---------GS-------GATIFPQALAMGATWNKKLTEEVAMAVGDE-----TLSA 242
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+ WSP ++V +D RWGR ET GEDP +V + +++G Q +
Sbjct: 243 GTMQAWSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------M 288
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ KH+ + R D ++E++M+E ++PF + D S+M +Y+
Sbjct: 289 GLYTTPKHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSD 345
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G+P +LL+ +R +W F G+IVSDC +I + + K +A + L AG+
Sbjct: 346 FLGVPVAKSRELLHNILREEWGFSGFIVSDCGAIGNLTARKHYTAKNKIEAANQALAAGI 405
Query: 306 DLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC- 363
+CGD Y + + A + G+I ++D R + ++ R F+ +P K L N I
Sbjct: 406 ATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKAPN-KPLDWNKIYP 464
Query: 364 ---NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--EG 418
+ H E+A +AAR+ IVLL+N + LPL+ +++T+A++GP AN + G+Y +
Sbjct: 465 GWNSDSHKEMARQAARESIVLLENKDNILPLSK-DMRTIAVLGPGANDLQP--GDYTPKL 521
Query: 419 TPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
P + S + G +KVI Y GC D + I A+ A +D ++V G
Sbjct: 522 QPGQLKSVLTGIKQAVGKQTKVI-YEQGC-DFTSLGENNIAKAVKVASQSDVVLLVLGDC 579
Query: 475 LSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
+ EA E D L+LPG Q EL+ V A G ++I+ AG N +K
Sbjct: 580 STSEATTDVYKTSGENHDYATLILPGKQQELLEAV--CATGKPVILILQAGR-PYNLSKA 636
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+ K+IL PG+EGG A ADV+FG YNP GRLP+T+ +V +PL
Sbjct: 637 SELCKAILVNWLPGQEGGPATADVLFGDYNPAGRLPMTF--PRHV----GQLPLYYNFKT 690
Query: 586 PGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
GR Y++ D +Y FGYGLSYT F+Y K Q+ + N TV
Sbjct: 691 SGRRYEYSDMEYYPLYYFGYGLSYTSFEYSGL-----------KIQEKENGNITV----- 734
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF 702
Q V+N+G+ G EVV +Y + T I ++ + R+
Sbjct: 735 -------------------QATVKNIGQRAGDEVVQLYVTDMYASVKTRITELKDFTRIH 775
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ G++ V F + + L ++++ + ++ GA ILVG
Sbjct: 776 LKPGEAKTVSFELTPYE-LSLLNDHMDRVVEKGAFKILVG 814
>gi|167765233|ref|ZP_02437346.1| hypothetical protein BACSTE_03621 [Bacteroides stercoris ATCC
43183]
gi|167696861|gb|EDS13440.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
stercoris ATCC 43183]
Length = 818
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 224/812 (27%), Positives = 369/812 (45%), Gaps = 151/812 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D P +R DL+ +M++ EK Q+ L YG R+ LP+ W W + + +
Sbjct: 59 YEDPSQPVEKRVADLLSQMSVEEKTCQLATL-YGYGRVLKDSLPVAGWKNEIWKDGIANI 117
Query: 67 ----SFIGRRTNSPPG----------------------------THFDSE-VPG-----A 88
+ +G+++ PG F +E + G A
Sbjct: 118 DEMLNGVGKKSAQVPGLLYPFSNHAEAVNTVQRWFVEETRLGIPVDFTNEGIHGLNHTKA 177
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
T P I +++N+ L ++ G EA+A+ G T ++P +++VRDPRWGR L
Sbjct: 178 TPLPAPIAIGSTWNKELVRRAGVIAGQEAKAL------GYTNVYAPILDIVRDPRWGRTL 231
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GE+PY++ V G+Q +GV +A KHYA Y + +
Sbjct: 232 ECYGEEPYLIAALGTEMVNGIQS-QGV-------------AATLKHYAVYSVPKGGRDGN 277
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D V +++ E F+ PF+ + VM SYN +G+P A L + +R ++
Sbjct: 278 CRTDPHVAPRELHELFLYPFKKVIQNSHPMGVMSSYNDWDGVPVSASYYFLTELLREEYG 337
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------M 318
F GY+VSD ++++ VES + DT ++AV +VL+AGL++ T+FT
Sbjct: 338 FDGYVVSDSEAVE-FVESKHHVADTYDEAVRQVLEAGLNVR-----THFTPPSDFILPIR 391
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP-QHIELAAEAARQ 377
+++ KI+ A ID + + V RLG FD + + ++++ + +Q
Sbjct: 392 RLLEEKKISMAVIDKRVSEVLRVKFRLGLFDQPYVADTKAADRVGGADRNMDFVKQMQQQ 451
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
+VLLKN+N LPL+ IK + + GP A+ M Y + + G Y K I
Sbjct: 452 ALVLLKNENNILPLDKRQIKKVLVTGPLADEDNFMTSRYGPNGLETVTVLAGLRNYLKGI 511
Query: 438 ---NYAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
+YA GC A + Q I A+ A +D + V G D E
Sbjct: 512 AEVDYAKGCDIVDAGWPATEILPAPMSEQEKQGIAEAVAKAGESDVIIAVLGEDEYRTGE 571
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
+ R L LPG Q +L+ + K PV LV+++ + +N+A N I +IL +PG
Sbjct: 572 SRSRTSLDLPGRQQQLLEALHATGK-PVILVLINGQPLTVNWA--NAYIPAILESWFPGC 628
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVN---------NFPGRTYK 591
+GG IA+ +FG++NPGG+L +T+ ++ V + P +P + N G T
Sbjct: 629 QGGTVIAETLFGEHNPGGKLTVTFPKS--VGQIELNFPFKPGSHGAQPHSGPNGSGATRI 686
Query: 592 FFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLID 651
+ +YPFG+GLSYT F Y D+++ QQ +T G
Sbjct: 687 IGE---LYPFGFGLSYTTFAYS--------DLEVSPLQQ-----HTQG------------ 718
Query: 652 DVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAK 710
++T ++ V N GK G EVV +Y + T+ Q+ G+ERV + G++ +
Sbjct: 719 -------EYTIKVNVTNTGKRAGDEVVQLYVRDKVSSVITYDSQLRGFERVSLQPGETRQ 771
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V F++ + L+I+D N + G +++G
Sbjct: 772 VTFSLKP-EDLQILDRNMNWTVEPGEFEVMIG 802
>gi|160887545|ref|ZP_02068548.1| hypothetical protein BACOVA_05565 [Bacteroides ovatus ATCC 8483]
gi|156107956|gb|EDO09701.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 736
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 219/723 (30%), Positives = 341/723 (47%), Gaps = 114/723 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG AT FPT I A+++ L K++
Sbjct: 83 RLGIPMF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPELVKEV 124
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ ++ E R+ G + P +++ RDPRW RV ET GEDP + G + V GL
Sbjct: 125 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 179
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ S+ A KH+ AY + EG ++ S V +D+ + F+ PF
Sbjct: 180 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 228
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++G P ++ LL Q +R +W F G++VSD SI+ I ESH F+
Sbjct: 229 AIDAGALS-VMTSYNSIDGTPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FV 286
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
TKE+A + + AG+D+D G D YTN AVQ G++ + IDT++ + + +G F
Sbjct: 287 APTKENAAIQSVMAGVDVDLGGDAYTNLCH-AVQSGQMDKTVIDTAVCRVLRMKFEMGLF 345
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + + +HIELA + A+ I LLKN+N LPL+ I +A++GP+A+
Sbjct: 346 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-TINKVAVIGPNADN 404
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAIDAAKNA 464
M+G+Y + +DG + Y GCA I + I AI AA+ +
Sbjct: 405 RYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCA-IRDTTVNEIEQAIKAARRS 463
Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
+ A V G +E EG DR L L G Q EL+ +
Sbjct: 464 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 523
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P+ +V + ++ N+A ++L YPG+EGG AIADV+FG YNP GRLP
Sbjct: 524 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 580
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
I+ + +IP P N+ Y +Y FGYG+SYT F+Y
Sbjct: 581 IS-VPRSVGQIPVYYNKKAPRNH----DYVEMSSFPLYSFGYGMSYTTFEYS-------- 627
Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
D++ + K +C ++++ +V+N GK DG EV +
Sbjct: 628 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 659
Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
Y + + +KQ+ +ER + G+ KV F + + +V+ ++ SG +
Sbjct: 660 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHL 718
Query: 740 LVG 742
++G
Sbjct: 719 MIG 721
>gi|300773468|ref|ZP_07083337.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33861]
gi|300759639|gb|EFK56466.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33861]
Length = 777
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 209/699 (29%), Positives = 327/699 (46%), Gaps = 117/699 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG T FPT I +++N +L +K+
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------------TTVFPTGIGQASTWNPALLQKM 167
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
TV+ E R + P +++ RDPRW RV E+ GEDP + G A VRGL
Sbjct: 168 SATVAKEVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVRGLG 222
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ S P KH+ AY + N + V E++++E F+ PF+
Sbjct: 223 S--------GNLSDPFATIPTLKHFVAYGIPEGGHNGS---AASVGERELREYFLPPFQS 271
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V G SVM +YN V+GIP ++ LL +R +W+F+G+ VSD SI+ I SH+
Sbjct: 272 AVAAG-AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWSFNGFTVSDLGSIEGIKGSHRVA 330
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
D K+ A+ ++AGLD D G + AV+QG++ E ID ++ + + +G F+
Sbjct: 331 KDHKQAAIL-AIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRILALKFEMGLFE 389
Query: 350 GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
K + +I L+ + AR+ IVLL+N N LPL ++K +A+VGP+A+
Sbjct: 390 KPFVDVKTAKKEVKTESNIALSRQVARESIVLLENKNNILPLRK-DVK-IAIVGPNADNV 447
Query: 410 KAMIGNY-----EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
M+G+Y +G + ++V +Y GCA I NS IPAA+ AA+ +
Sbjct: 448 YNMLGDYTAPQPDGAVTTVRQAISARLPKAQV-SYVKGCA-IRDTTNSDIPAAVTAARQS 505
Query: 465 DATVIVAG----LDLSVE-------------------AEGKDRVDLLLPGFQTELINKVA 501
D V V G D E EG DR L L G Q EL+ +
Sbjct: 506 DIIVAVVGGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLDLLGRQMELLKALK 565
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P+ ++ + +++N+A + ++L YPG+EGG AIADV+FG YNP G++P
Sbjct: 566 QTGK-PLVVIYIQGRPLNMNWAAT--QADALLCAWYPGQEGGHAIADVLFGDYNPAGKMP 622
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPK 619
++ V +P+ N +++ + +Y FGYG SY+ F+YK
Sbjct: 623 LS------VPRSVGQIPVH-YNRKSSLDHRYVEEAATPLYAFGYGKSYSDFEYK------ 669
Query: 620 SVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVM 679
D+K+ K+ DY +F + N GK DG EV
Sbjct: 670 --DLKIQKEN--------------------------TDYHVSFTLT--NTGKYDGDEVPQ 699
Query: 680 VYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNA 717
+Y + + + ++Q+ +ER+ + G+S V F + A
Sbjct: 700 LYIRNQYASVSQPVQQLKHFERIHLKTGESKTVSFVLTA 738
>gi|262405113|ref|ZP_06081663.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
gi|262355988|gb|EEZ05078.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
Length = 769
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 214/725 (29%), Positives = 326/725 (44%), Gaps = 119/725 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ EA HG IG AT FPT I A+++ L +++
Sbjct: 117 RLGIPLF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPQLIREV 158
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G+ + E R G + P +++ RDPRW RV ET GEDP + G V GL
Sbjct: 159 GKAIGKEIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGLG 213
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
S P A KH+ AY + N F +++ E F+ PF
Sbjct: 214 G--------GDLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQ 262
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++G+P A+ LL + +R +W F G +VSD SI+ I +SH F+
Sbjct: 263 AIDAGALS-VMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSH-FV 320
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
T E+A L AG+D+D G D Y N M AV G+I++ +D S+ + + +G F
Sbjct: 321 APTMEEAAILALSAGVDVDLGGDAYMNL-MNAVNTGRISKTALDASVARVLRLKFEMGLF 379
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ K + + + + LA A+ I LLKN++ LPLN + +AL+GP+A+
Sbjct: 380 ENPYVDPEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKN--RKVALIGPNADN 437
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCA--DIVCQNNSMIPAAIDAAK 462
M+G+Y + +DG A S + Y GC+ D V + I A+ AA+
Sbjct: 438 RYNMLGDYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTD---IEQAVAAAQ 494
Query: 463 NADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINK 499
++ + V G + + EG DR L L G Q EL+ K
Sbjct: 495 RSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-K 553
Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
A P+ +V + +D N+A N ++L YPG+EGG AIADV+FG +NP GR
Sbjct: 554 ALKATGKPLIVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIAIADVLFGDFNPAGR 611
Query: 560 LPITWYEANYVKIPYTSMPLRPVNNFP-GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP 618
LP + V +PL P Y +YPFGYGLSYT F Y
Sbjct: 612 LPFS------VPRSVGQIPLYYNKKAPQSHDYVEMSASPLYPFGYGLSYTSFDYS----- 660
Query: 619 KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVV 678
D+ L + + ++ +F+ V N GK DG EV
Sbjct: 661 ---DLHLSA-------------------------LMPRSFEISFK--VRNTGKYDGEEVA 690
Query: 679 MVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAH 737
+Y + + +KQ+ + R ++ G+ +V F ++ + +VD ++ G
Sbjct: 691 QLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKKIVEPGTF 749
Query: 738 TILVG 742
I++G
Sbjct: 750 QIMIG 754
>gi|383110724|ref|ZP_09931543.1| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
gi|382949470|gb|EFS31133.2| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
Length = 783
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 214/724 (29%), Positives = 325/724 (44%), Gaps = 117/724 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ EA HG IG AT FPT I A+++ L +++
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPQLIREV 172
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G+ + E R G + P +++ RDPRW RV ET GEDP + G V GL
Sbjct: 173 GKAIGKEIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGLG 227
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
S P A KH+ AY + N F +++ E F+ PF
Sbjct: 228 S--------GDLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQ 276
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++GIP A+ LL + +R +W F G +VSD SI+ I +SH F+
Sbjct: 277 AIDAGALS-VMTSYNSMDGIPCTANHSLLTELLRNEWKFSGIVVSDLYSIEGIHQSH-FV 334
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
T E A L AG+D+D G D Y N M AV G+I++ +D S+ + + +G F
Sbjct: 335 APTMEAAAILALSAGVDVDLGGDAYMNL-MNAVNTGRISKTALDASVARVLRLKFEMGLF 393
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ K + + + + LA A+ I LLKN++ LPLN + +AL+GP+A+
Sbjct: 394 ENPYVDPEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKN--RKVALIGPNADN 451
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCA--DIVCQNNSMIPAAIDAAK 462
M+G+Y + +DG S + Y GC+ D V + I A+ AA+
Sbjct: 452 RYNMLGDYTAPQEEENIKTVLDGIRTKLSSSQVEYVKGCSIRDTVTTD---IEQAVAAAQ 508
Query: 463 NADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINK 499
++ + V G + + EG DR L L G Q EL+ K
Sbjct: 509 RSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-K 567
Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
A P+ +V + +D +A N ++L YPG+EGG AIADV+FG YNP GR
Sbjct: 568 ALKATGKPLIVVYIEGRPLDKTWASENAD--AVLTAYYPGQEGGNAIADVLFGDYNPAGR 625
Query: 560 LPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPK 619
LP+T + +IP P N+ Y +Y FGYGLSYT F+Y
Sbjct: 626 LPLT-VPRSVGQIPIYYNKKAPQNH----DYVELSASPLYAFGYGLSYTTFEYS------ 674
Query: 620 SVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVM 679
D+++ + F +V+N G+ DG EV
Sbjct: 675 --DLRVS---------------------------AISPHSFEVSFKVKNTGRYDGEEVSQ 705
Query: 680 VYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHT 738
+Y + + +KQ+ +ER + G+ +V F ++ I+D +++ SG
Sbjct: 706 LYLRDEYASVVQPLKQLKHFERFCLKRGEVKEVKFVLSES-DFTIIDRNLKTVVESGTFQ 764
Query: 739 ILVG 742
++VG
Sbjct: 765 VMVG 768
>gi|294647557|ref|ZP_06725134.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294807095|ref|ZP_06765914.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345508184|ref|ZP_08787819.1| periplasmic beta-glucosidase [Bacteroides sp. D1]
gi|292637099|gb|EFF55540.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294445794|gb|EFG14442.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345455214|gb|EEO50370.2| periplasmic beta-glucosidase [Bacteroides sp. D1]
Length = 783
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 214/725 (29%), Positives = 326/725 (44%), Gaps = 119/725 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ EA HG IG AT FPT I A+++ L +++
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------------ATVFPTGIGMAATWSPQLIREV 172
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G+ + E R G + P +++ RDPRW RV ET GEDP + G V GL
Sbjct: 173 GKAIGKEIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGLG 227
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
S P A KH+ AY + N F +++ E F+ PF
Sbjct: 228 G--------GDLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQ 276
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++G+P A+ LL + +R +W F G +VSD SI+ I +SH F+
Sbjct: 277 AIDAGALS-VMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSH-FV 334
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
T E+A L AG+D+D G D Y N M AV G+I++ +D S+ + + +G F
Sbjct: 335 APTMEEAAILALSAGVDVDLGGDAYMNL-MNAVNTGRISKTALDASVARVLRLKFEMGLF 393
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ K + + + + LA A+ I LLKN++ LPLN + +AL+GP+A+
Sbjct: 394 ENPYVDPEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKN--RKVALIGPNADN 451
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCA--DIVCQNNSMIPAAIDAAK 462
M+G+Y + +DG A S + Y GC+ D V + I A+ AA+
Sbjct: 452 RYNMLGDYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTD---IEQAVAAAQ 508
Query: 463 NADATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINK 499
++ + V G + + EG DR L L G Q EL+ K
Sbjct: 509 RSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-K 567
Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
A P+ +V + +D N+A N ++L YPG+EGG AIADV+FG +NP GR
Sbjct: 568 ALKATGKPLIVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIAIADVLFGDFNPAGR 625
Query: 560 LPITWYEANYVKIPYTSMPLRPVNNFP-GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSP 618
LP + V +PL P Y +YPFGYGLSYT F Y
Sbjct: 626 LPFS------VPRSVGQIPLYYNKKAPQSHDYVEMSASPLYPFGYGLSYTSFDYS----- 674
Query: 619 KSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVV 678
D+ L + + ++ +F+ V N GK DG EV
Sbjct: 675 ---DLHLSA-------------------------LMPRSFEISFK--VRNTGKYDGEEVA 704
Query: 679 MVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAH 737
+Y + + +KQ+ + R ++ G+ +V F ++ + +VD ++ G
Sbjct: 705 QLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKKIVEPGTF 763
Query: 738 TILVG 742
I++G
Sbjct: 764 QIMIG 768
>gi|383115541|ref|ZP_09936297.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
gi|313695054|gb|EFS31889.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
Length = 800
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 227/800 (28%), Positives = 359/800 (44%), Gaps = 141/800 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D P R DL+ +MTL EK QM L YG R+ P W W + +
Sbjct: 56 YEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTDGWSTEIWKDGIGNI 114
Query: 64 ----HGVSFIGRRTNSP-----------------------PGTHFDSEVPG-----ATSF 91
+G+ G + P P + + G AT F
Sbjct: 115 DEQANGLGKFGSEISYPYANSVKNRHTVQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMF 174
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVLETP 150
P A++N+ L ++I + + EA+A+ G T ++P +++ +DPRWGRV+E+
Sbjct: 175 PAQCGQGATWNKKLIREIAKVTADEAKAL------GYTNIYAPILDIAQDPRWGRVVESY 228
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+VG + GLQ EG I A KH+A Y + +
Sbjct: 229 GEDPYLVGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRT 274
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
D V ++M+ ++ PF + E VM SYN +G P L + +R W F G
Sbjct: 275 DPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFT---------MGAV 321
Y+VSD ++++ + H+ + T+E+ A+V+ AGL++ TNFT A+
Sbjct: 335 YVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAI 388
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIV 380
+GK++ +D + + V +G FD P + + N H +++ AA + IV
Sbjct: 389 SEGKVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPEVVVHNAAHQDVSMRAALESIV 448
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVIN 438
LLKN+ LPL+ + +A++GP+A K + Y + G Y + +
Sbjct: 449 LLKNEKEMLPLSK-SFSKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVR 507
Query: 439 YAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKD 483
YA GC DI+ Q +MI A++ AK +D ++V G + E
Sbjct: 508 YAKGC-DIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFS 566
Query: 484 RVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGG 543
R +L L G Q +L+ V K PV LV++ A IN+A N + +I+ +PGE G
Sbjct: 567 RTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMG 623
Query: 544 RAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGY 603
AIA V+FG YNPGGRL +T + + +IP+ + P +P ++ G K V+YPFGY
Sbjct: 624 DAIAKVLFGDYNPGGRLAVT-FPKSVGQIPF-AFPFKPGSDSKG---KVRVAGVLYPFGY 678
Query: 604 GLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQ 663
GLSYT F Y ++K+ +KP A T
Sbjct: 679 GLSYTTFNYS--------NLKI---------------SKPVIGA---------QENITLS 706
Query: 664 IEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNACKSLK 722
V+N GK G EVV +Y + + T +V+ G+ER+ + G+ + FT+ + L
Sbjct: 707 CTVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTISFTLTP-QDLG 765
Query: 723 IVDNAANSLLASGAHTILVG 742
+ D + G+ +++VG
Sbjct: 766 LWDKNNQFTVEPGSFSVMVG 785
>gi|237721201|ref|ZP_04551682.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229448997|gb|EEO54788.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 863
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 168/448 (37%), Positives = 239/448 (53%), Gaps = 46/448 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S +PY D KL +RA DL++R+TL EKV M + + +PRLG+ YEWW+EALHGV+
Sbjct: 24 SKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARA 83
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN---- 125
G AT FP I ASFN+ L ++ VS EARA N
Sbjct: 84 GL----------------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQ 127
Query: 126 ----AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLT W+PN+N+ RDPRWGR ET GEDPY+ GR + VRGLQ E EY
Sbjct: 128 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD---- 183
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
K+ AC KH+A + W +R F++ + +D+ ET++ F+ V + V VM
Sbjct: 184 ----KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVM 236
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
C+YNR G P C +LL Q +R DW F G +V+DC +I + K ++T DA
Sbjct: 237 CAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKK--HETHPDAAHAS 294
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
A + +G DL+CG + + T AV++G I+E I+TS++ L LG + + + N+
Sbjct: 295 ADAVLSGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNSTHPWSNI 353
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ I P+H ELA + A + +VLL+N+N L +A++GP+AN + GNY
Sbjct: 354 PFSVIDCPKHKELALKMAHESLVLLQNNNNL--LPLNRQMKVAVIGPNANDSVMQWGNYN 411
Query: 418 GTPCRYTSPMDGFYAY--SKVINYAPGC 443
G P + ++G A I Y P C
Sbjct: 412 GFPSHTVTLLEGIRAKLPDAQIIYEPVC 439
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/296 (32%), Positives = 138/296 (46%), Gaps = 53/296 (17%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
++ ++AD + G+ +E E G DR ++ LP Q E++ + K
Sbjct: 594 LNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKKT 653
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
V V S A+ I N +IL YPG+ GG A+ADV+FG YNP GRLPIT+Y++
Sbjct: 654 V-FVNFSGSAMAIVPETQN--CDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS 710
Query: 568 NYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
Y ++ GRTY+F +YPFGYGLSYT+F Y A+ +S KL K
Sbjct: 711 MQQLPDYEDYSMK------GRTYRFMTKTPLYPFGYGLSYTRFSYGKATLNQS---KLTK 761
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGI 687
++ A+L I V N+G+ DG EVV VY P
Sbjct: 762 GEK----------------AILT-------------IPVSNVGQRDGEEVVQVYICRPDD 792
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVG 742
K + G++RV IA G++ V + S + D A N++ +G + IL G
Sbjct: 793 KEGPQKTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYG 847
>gi|299146513|ref|ZP_07039581.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298517004|gb|EFI40885.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 736
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 218/723 (30%), Positives = 342/723 (47%), Gaps = 114/723 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ EA HG IG T FPT I A+++ L K++
Sbjct: 83 RLGIPMF-LAEEAPHGHMAIG-----------------TTVFPTGIGMAATWSPELVKEV 124
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
GQ ++ E R+ G + P +++ RDPRW RV ET GEDP + G + V GL
Sbjct: 125 GQVIAKEIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLG 179
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ S+ A KH+ AY + EG ++ S V +D+ + F+ PF
Sbjct: 180 G--------GNLSQKYATIATLKHFLAYAVP--EGGQNGNYAS-VGIRDLHQNFLPPFRK 228
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
++ G +S VM SYN ++GIP ++ LL + +R +W F G++VSD SI+ I ESH F+
Sbjct: 229 AIDAGALS-VMTSYNSIDGIPCTSNHYLLTKLLRNEWKFRGFVVSDLYSIEGIHESH-FV 286
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
TKE+A + + AG+D+D G D YTN AVQ G++ + IDT++ + + +G F
Sbjct: 287 APTKENAAIQSVMAGVDVDLGGDAYTNLCH-AVQSGQMDKTVIDTAVCRVLRMKFEMGLF 345
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + + +HIELA + A+ I LLKN+N LPL+ I +A++GP+A+
Sbjct: 346 EHPYVDPKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSK-MINKVAVIGPNADN 404
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
M+G+Y + +DG + Y GCA I + I AI+AA+ +
Sbjct: 405 RYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCA-IRDTTVNEIEQAIEAARRS 463
Query: 465 D----------------------ATVIVAGLDLSVE-AEGKDRVDLLLPGFQTELINKVA 501
+ A V G +E EG DR L L G Q EL+ +
Sbjct: 464 EVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQ 523
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P+ +V + ++ N+A ++L YPG+EGG AIADV+FG YNP GRLP
Sbjct: 524 KTGK-PLIVVYIEGRPLEKNWASEYA--DALLTAYYPGQEGGNAIADVLFGDYNPSGRLP 580
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
I+ + +IP P N+ Y +Y FGYG+SYT F+Y
Sbjct: 581 IS-VPRSVGQIPVYYNQKAPRNH----DYVEVSSSPLYSFGYGMSYTTFEYS-------- 627
Query: 622 DIK-LDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
D++ + K +C ++++ +V+N GK DG EV +
Sbjct: 628 DLQVVQKSARCFEVSF----------------------------KVKNTGKYDGEEVSQL 659
Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
Y + + +KQ+ +ER + G+ KV F + + +V+ ++ SG +
Sbjct: 660 YMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGNFHL 718
Query: 740 LVG 742
++G
Sbjct: 719 MIG 721
>gi|294146775|ref|YP_003559441.1| beta-glucosidase [Sphingobium japonicum UT26S]
gi|292677192|dbj|BAI98709.1| beta-glucosidase [Sphingobium japonicum UT26S]
Length = 791
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 226/770 (29%), Positives = 353/770 (45%), Gaps = 123/770 (15%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEK--------VQQMGDLAYGVPRLGLPLYEWWSEAL 63
FP+ + P AK +P + V + A RLG+P+ + E L
Sbjct: 91 FPHGMGQFTRPSDAKGAFSPREVPGRNPRQTVALVNALQRWATTQTRLGIPIL-FHEEGL 149
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HG + +G ATSFP I +S++ L +++ ++ E R+
Sbjct: 150 HGYAAVG-----------------ATSFPQSIAMASSWDPDLLREVNAVIAREIRSR--- 189
Query: 124 GNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
G++ SP +++ RDPRWGR+ ET GEDPY+VG + V GLQ R
Sbjct: 190 ---GVSLVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQG-----KGRSRLL 241
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
P K+ A KH + N + V+E++++E F PFE V + +VM S
Sbjct: 242 PPGKVFATLKHLTGHGQPESGTN---VGPAPVSERELRENFFPPFEQVVKRTGIEAVMAS 298
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
YN ++G+P+ A+ LL +RG+W F G +VSD ++ ++ H D E A R L
Sbjct: 299 YNEIDGVPSHANRWLLRDVLRGEWGFRGAVVSDYSAVDQLMSIHHVAADL-EQAAGRALD 357
Query: 303 AGLDLDCGDYYTNFTMG-AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN 361
AG+D D D + T+G V++GKI EA +D ++R + + R G F+ +P
Sbjct: 358 AGVDADLPDGLSYATLGRQVREGKIGEALVDRAVRHMLELKFRAGLFE-NPYADAAASEK 416
Query: 362 ICNPQHIELAAEAARQ-GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I N A A Q I+LLKND G LPL ++A++GP +A A +G Y G P
Sbjct: 417 ITNDARARALALKAAQRSIILLKND-GMLPLKPEG--SIAVIGP--SAAVARLGGYYGQP 471
Query: 421 CRYTSPMDGFYA----YSKVINYAPGC---------ADIV-----CQNNSMIPAAIDAAK 462
S ++G A +K++ +A G AD V +N +I A++AA+
Sbjct: 472 PHSVSILEGIRAKVGNRAKIV-FAQGVRITENDDWWADKVTRSDPAENRRLIAQAVEAAR 530
Query: 463 NADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
+ D V+ G EG DR L L G Q EL + + K P+ +V+++
Sbjct: 531 HVDRIVLTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDALKALGK-PIAVVLINGR 589
Query: 517 AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS 576
+ K + + +IL Y GE+GG A+ADV+FG NPGG+LP+T IP ++
Sbjct: 590 PA--STVKVSEQADAILEGWYLGEQGGHAVADVLFGDVNPGGKLPVT--------IPRSA 639
Query: 577 MPLRPVNNF---PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
L N R Y F +YPFG+GLSYT F S+P+ K+ R
Sbjct: 640 GQLPMFYNVKPSARRGYLFDTTDPLYPFGFGLSYTSFDL---SAPRLSAAKISVGGMTR- 695
Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHI 692
++V N G+ +G EVV +Y + G I
Sbjct: 696 ----------------------------VSVDVRNSGRREGDEVVQLYVRDKVGSVTRPI 727
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
K++ G++RV + G+ V FT+ ++L++ ++ + ++ G I+ G
Sbjct: 728 KELKGFQRVTLKPGEVRTVTFTI-GPEALQMWNDHMDRVVEPGDFEIMTG 776
>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 159/444 (35%), Positives = 232/444 (52%), Gaps = 43/444 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FPY D L ERA DL++R+TL EK M + + +PRL + Y WW+EALHG++ G
Sbjct: 27 FPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLARTGL 86
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA----MYNLGN-- 125
AT FP I ASF++SL ++ VS EARA + + GN
Sbjct: 87 ----------------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLT 130
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LT W+PN+N+ RDPRWGR ET GEDPY+ R + V GLQ + Y+
Sbjct: 131 RYQALTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYN------ 184
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KHYA + W +R F++ ++ +D+ ET++ F+ V E V VMC+
Sbjct: 185 --KLHACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCA 239
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF-LNDTKEDAVARVL 301
YNR G P C +LL Q +R +W F G +VSDC ++ + K + A A +
Sbjct: 240 YNRFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAV 299
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN 361
G D++CG+ Y + AV+ G I E ID S++ L LG D + + + +
Sbjct: 300 LNGTDVECGNSYKSLP-DAVKAGLITENQIDISVKRLLKARFELGEMDENV-WTGISSDV 357
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ +P+H +LA + AR+ + LL+N+N LPL+ +AL+GP+AN + GNY G P
Sbjct: 358 VDSPKHRQLALQMARETMTLLQNNNNILPLSKQ--AKIALIGPNANDSVMQWGNYNGLPS 415
Query: 422 RYTSPMDGFYAYSKVIN--YAPGC 443
+ ++G Y N Y P C
Sbjct: 416 HTITLLEGMQRYLPTSNLIYEPVC 439
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 142/300 (47%), Gaps = 53/300 (17%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
+ ++ K+ D + G+ ++E E G DR ++ LP Q ++ + A
Sbjct: 591 VKGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTA 650
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K + V S A+ + N ++IL YPG+ GG+A+A+V+FG YNP G+LP+T
Sbjct: 651 GK-RIVFVNFSGAAIALEPESQN--CEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLT 707
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y N +IP N GRTY++ ++PFG+GLSYT FKY +
Sbjct: 708 FYR-NLAQIPDFE-----DYNMTGRTYRYMKETPLFPFGHGLSYTTFKYG--------KL 753
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
K++ D+ G N I V N G DG EVV VY K
Sbjct: 754 KMNDDK------IAAGQN------------------LNLAIPVTNTGSRDGDEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
+K + ++RV I AG++ +V F+++ + L+ D +N++ + G +T+++G
Sbjct: 790 KMDDTEGPVKTLRAFKRVRIPAGKTVEVKFSLDDTQ-LEWWDEQSNTMRVCPGNYTVMIG 848
>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 159/444 (35%), Positives = 232/444 (52%), Gaps = 43/444 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FPY D L ERA DL++R+TL EK M + + +PRL + Y WW+EALHG++ G
Sbjct: 27 FPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLARTGL 86
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA----MYNLGN-- 125
AT FP I ASF++SL ++ VS EARA + + GN
Sbjct: 87 ----------------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLT 130
Query: 126 --AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LT W+PN+N+ RDPRWGR ET GEDPY+ R + V GLQ + Y+
Sbjct: 131 RYQALTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYN------ 184
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KHYA + W +R F++ ++ +D+ ET++ F+ V E V VMC+
Sbjct: 185 --KLHACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCA 239
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF-LNDTKEDAVARVL 301
YNR G P C +LL Q +R +W F G +VSDC ++ + K + A A +
Sbjct: 240 YNRFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAV 299
Query: 302 KAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN 361
G D++CG+ Y + AV+ G I E ID S++ L LG D + + + +
Sbjct: 300 LNGTDVECGNSYKSLP-DAVKAGLITENQIDISVKRLLKARFELGEMDENV-WTGISSDV 357
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ +P+H +LA + AR+ + LL+N+N LPL+ +AL+GP+AN + GNY G P
Sbjct: 358 VDSPKHRQLALQMARETMTLLQNNNNILPLSKQ--AKIALIGPNANDSVMQWGNYNGLPS 415
Query: 422 RYTSPMDGFYAYSKVIN--YAPGC 443
+ ++G Y N Y P C
Sbjct: 416 HTITLLEGMQRYLPTSNLIYEPVC 439
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 141/300 (47%), Gaps = 53/300 (17%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
+ ++ K+ D + G+ ++E E G DR ++ LP Q ++ + A
Sbjct: 591 VKGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTA 650
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K +V ++ I + ++IL YPG+ GG+A+A+V+FG YNP G+LP+T
Sbjct: 651 GK---RIVFVNFSGAAIALEPESLNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLT 707
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y N +IP N GRTY++ ++PFG+GLSYT FKY +
Sbjct: 708 FYR-NLAQIPDFE-----DYNMTGRTYRYMKETPLFPFGHGLSYTTFKYG--------KL 753
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
K++ D+ G N I V N G DG EVV VY K
Sbjct: 754 KMNDDK------IAAGQN------------------LNLVIPVTNTGSRDGDEVVQVYLK 789
Query: 684 PPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
+K + ++RV I AG++ +V F+++ + L+ D +N++ + G +T+++G
Sbjct: 790 KMDDTEGPVKTLRAFKRVRIPAGKTVEVKFSLDDTQ-LEWWDEQSNTMRVCPGNYTVMIG 848
>gi|404406439|ref|ZP_10998023.1| glycoside hydrolase 3 [Alistipes sp. JC136]
Length = 925
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 195/688 (28%), Positives = 332/688 (48%), Gaps = 94/688 (13%)
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
AT+FP+ + ++N L +K G+ V EAR + G T ++P ++V RD RWGR
Sbjct: 180 ATNFPSQLGMGHTWNRELLRKTGRIVGREARLL------GYTNIYAPVLDVGRDQRWGRY 233
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
E GE PY+V + G+Q +Y ++++ KH+AAY +
Sbjct: 234 EEVFGESPYLVAELGVAMASGMQ----TDY---------QVASTAKHFAAYSNNKGAREG 280
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
D ++ ++++ ++PF + + VM SYN +G+P L + +RG+
Sbjct: 281 MSRVDPQMPPREVENIHLMPFREVIRRAGILGVMSSYNDYDGVPIQGSRYWLTERLRGEM 340
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQ--- 323
F GY+VSD S++ + H + + DAV + ++AGL++ C ++ + ++Q
Sbjct: 341 GFRGYVVSDSGSVEYLHNKHHTAVN-QLDAVRQSIEAGLNVRCNFWHPETYVMPLRQLLR 399
Query: 324 -GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQHIELAAEAARQGIV 380
G I E +D+ +R + V +G FD P +L + P+H E+A +A+R+ IV
Sbjct: 400 EGLITEELLDSRVRDVLRVKFLVGLFD-RPYQTDLAAADREVDGPEHNEVALQASRESIV 458
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFY----AYSKV 436
LLKN+N LPL+ I+ +A++GP+A+A +G+Y TS +DG A ++
Sbjct: 459 LLKNENSTLPLDARKIRRIAVLGPNADARGFALGHYGPLAVEVTSVLDGLKRNLGARCEI 518
Query: 437 INYAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
+ Y GC ++ + + I A +AA +D V+V G E
Sbjct: 519 V-YEKGCELVDAAWPLSEIFREEMTPEEKAGIRRAAEAASESDVAVVVLGGGSRTCGENC 577
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q EL+ V +A P LV+++ IN+A + + +I+ YPG G
Sbjct: 578 SRSSLDLPGRQEELLRAV-EATGKPTVLVMINGRPNSINWA--DAHVDAIVEAWYPGAHG 634
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF-------PGRTYKFFDG 595
G+A+ +V+FG+YNPGG+L +T + + +IP+ + P +P N PG +G
Sbjct: 635 GQAVYEVLFGEYNPGGKLTVT-FPRHVGQIPF-NFPYKPAANTDGGLTPGPGGNQTRING 692
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
+Y FGYGLSYT F+Y D++++ Q R
Sbjct: 693 -ALYDFGYGLSYTTFEY--------ADLRIEP-QTIR----------------------- 719
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
+D F +V N G+ DG EVV +Y T+ K + G++RV + AG++ +V
Sbjct: 720 QDEPFRVSFDVTNTGQRDGDEVVQLYIHDVLSSVTTYEKNLRGFDRVHLKAGETRRVTMQ 779
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
+ + L +++ ++ G +L+G
Sbjct: 780 VRP-QDLSLLNERMERVVEPGDFDVLIG 806
>gi|305663349|ref|YP_003859637.1| glycoside hydrolase family protein [Ignisphaera aggregans DSM
17230]
gi|304377918|gb|ADM27757.1| glycoside hydrolase family 3 domain protein [Ignisphaera aggregans
DSM 17230]
Length = 757
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 224/788 (28%), Positives = 362/788 (45%), Gaps = 139/788 (17%)
Query: 23 ERAKDLVERMTLPEKVQQMGD--------------------LAYGV-------------- 48
ER ++L+ RM++ EK+ Q+ L YGV
Sbjct: 5 ERVRELIGRMSIEEKIAQLISIPLESVLDGKKFSVEKAREVLKYGVGEILRIGGSSARLS 64
Query: 49 PRLGLPLYEWWSEALHGVSFIGRRTNS--PPGTHFDS----EVPGATSFPTVILTTASFN 102
PR + +Y F+ R T P H +S P AT FP + ++++
Sbjct: 65 PREAVEIYNAIQR------FLTRETRLGIPAIVHEESIAGLLAPTATVFPIPLALASTWD 118
Query: 103 ESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAI 162
L ++ + + A+ +P +++ R+PRWGR ET GED Y+ I
Sbjct: 119 PDLVYRVAVAIRRQIMAI-----GSRHTLAPVLDLCREPRWGRCEETYGEDSYLAASMGI 173
Query: 163 NYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQET 222
YV+G+Q D + A KH+ + + EG R V +++ E
Sbjct: 174 AYVKGIQ----------GDDIRYGVIATGKHFVGHGVP--EGG-RNIASIHVGLRELLEI 220
Query: 223 FILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTI 282
++ PFE V E ++ S+M +Y+ ++ +P A+ LL +RG W F G VSD + ++ +
Sbjct: 221 YMYPFEATVKEANLLSIMPAYHDIDNVPCHANKWLLTDILRGSWGFKGIAVSDYEGVKQL 280
Query: 283 VESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYI 340
H+ D E AV + +KAG+D++ G+ + + AV++G I E I+ ++ +
Sbjct: 281 HTIHRVARDCMEAAV-KAIKAGVDIEYPSGECFKQL-VEAVRKGLIDEDTINRAVERVLK 338
Query: 341 VLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLA 400
+ LG F+ + + N ELA E AR+ IVLLKND G LPL +IKT+A
Sbjct: 339 LKFMLGLFENPFIDETKVPTTLDNEADRELAREVARKAIVLLKND-GILPLKR-DIKTIA 396
Query: 401 LVGPHANATKAMIGNY---------EGT------PCRYTSPMDGFYAY---SKVINYAPG 442
++GP+AN AM+G+Y +GT R + ++ + S + YA G
Sbjct: 397 VIGPNANDPWAMLGDYHYDAHIGSFDGTYGKISPSVRIVTVLEAIKSRVSPSTEVLYAKG 456
Query: 443 CADIVCQNNSMIPAAIDAAKNADATVIVAG-------LDLSVEAEGKDRVDLLLPGFQTE 495
C D + + S AI+ AK AD + V G L + EG DR L LPG Q E
Sbjct: 457 C-DTIGDDRSGFGEAIEIAKRADIIIAVMGDRSGLFNLKMFTSGEGVDRASLKLPGVQEE 515
Query: 496 LINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYN 555
L+ ++A K P+ LV+++ + + P + +I+ PGEEGG AIAD++FG Y+
Sbjct: 516 LLKELASLGK-PIILVLINGRP--LALSSILPYVNAIVEAWRPGEEGGNAIADILFGDYS 572
Query: 556 PGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
PGGRLP++ Y+ + I Y+ P N F R Y + ++PFGYGLSYTQF Y+
Sbjct: 573 PGGRLPVSLPYDVGQLPIYYSRKP----NCF--RDYVEYPAKPLFPFGYGLSYTQFAYE- 625
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
N V + +V+ D ++V+N+G M G
Sbjct: 626 --------------------NLVVEST----------EVRDPDTVIRVSVDVKNVGSMAG 655
Query: 675 SEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA 733
EVV +Y S+ + ++ G++R+ + G+ V F + + L D N ++
Sbjct: 656 DEVVQLYISRDYASVTRPVAELKGFKRITLEPGEKKTVVFEI-PLELLAYYDMDMNYVVE 714
Query: 734 SGAHTILV 741
G +T ++
Sbjct: 715 PGEYTFMI 722
>gi|374374543|ref|ZP_09632202.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373233985|gb|EHP53779.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 799
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 231/820 (28%), Positives = 366/820 (44%), Gaps = 135/820 (16%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
Y D P R KDL+ +MT+ EK Q L YG R+ LP W W
Sbjct: 40 YEDPVAPVANRVKDLLSQMTVEEKTCQTATL-YGFGRVLKDELPTPGWKQEIWKDGIANI 98
Query: 61 -EALHGVS--------------------------FIGRRTNSPPGTHFDSEVPG-----A 88
E L+G++ FI P + + G A
Sbjct: 99 DEELNGLARNKKAQTKYSYPFSNHAEAINKIQKWFIEETRLGIPVDFTNEGIHGLNQDHA 158
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
T+FP I +++N+ L ++GQ + EA+A+ G T ++P ++V RD RWGRV+
Sbjct: 159 TAFPAPIGIGSTWNKELVHQMGQIIGREAKAL------GYTNVYAPILDVARDQRWGRVV 212
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
ET GEDP++V G+Q+ GV ++ KH+A Y + +
Sbjct: 213 ETYGEDPFLVAGLGTALAGGIQE-NGV-------------ASTLKHFAVYSVPKGGRDGN 258
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D V ++MQ+ F+ PF + VM SYN +G+P A L Q +R +
Sbjct: 259 ARTDPHVAPREMQQLFLYPFRKVIQNVHPLGVMSSYNDWDGMPVTASNYFLTQLLRQQFG 318
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTM---GAVQQ 323
F GY+VSD +++ + E H D KE AV V++AGL++ + +NF + +++
Sbjct: 319 FDGYVVSDSRAVEFVYEKHHVAKDYKE-AVKMVMEAGLNVRTEFNAPSNFILPLRQLIKE 377
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE-LAAEAARQGIVLL 382
G ++ ++ + + V RLG FD + I + E +A + R+ +VLL
Sbjct: 378 GGLSMETLNQRVGEVLSVKFRLGLFDAPYVKDPKAADKIVATEASEAVALQMNRESLVLL 437
Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---FYAYSKVINY 439
KND LPL+ G + + + GP A+ + I Y + + S ++G F A INY
Sbjct: 438 KNDKNILPLSLGQYRNILVTGPLADEKEHAISRYGPSNKKVISVLEGIRHFAAKKATINY 497
Query: 440 APGC--ADIVCQNNSMIPA------------AIDAAKNADATVIVAGLDLSVEAEGKDRV 485
GC AD + +I A++AAK D + V G + E R
Sbjct: 498 IKGCEAADATWPESEIIDTPPTPQEIAEMNKAVEAAKQNDIIIAVMGENDKQVGESLSRT 557
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
L LPG Q L+ ++ K P+ L++++ + IN+ N + +IL +PG GG A
Sbjct: 558 GLNLPGRQLRLLEELKKTGK-PMVLILINGQPLTINW--ENRYLDAILETWFPGPAGGTA 614
Query: 546 IADVIFGKYNPGGRLPITW------YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
+A+ IFG YNPGG+L T+ E N+ P S +P + G GP +Y
Sbjct: 615 VAEAIFGAYNPGGKLTTTFPKTTGQIEMNFPFKP-ASHAGQPGDGPNGYGKTAVVGP-LY 672
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFGYGLSYT F+Y ++K+D ++ + +V
Sbjct: 673 PFGYGLSYTTFEY--------ANLKVDPEKARTQADISVA-------------------- 704
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNAC 718
++V+N GK+ G EVV +Y K + T + ++ G+ERV ++ G++ V F +
Sbjct: 705 ----VDVKNTGKVKGDEVVQLYVKQLVSSVTTYESILRGFERVSLSPGETKTVHFKLTP- 759
Query: 719 KSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
L I+D N ++ GA I+VG + Q+ L
Sbjct: 760 DDLSILDKNMNFVVEPGAFDIMVGSSSVDIRLKKQIILEQ 799
>gi|427384392|ref|ZP_18880897.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
12058]
gi|425727653|gb|EKU90512.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
12058]
Length = 954
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 226/756 (29%), Positives = 354/756 (46%), Gaps = 119/756 (15%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEALHGVSFI 69
Y D LP ER + L+ MT PE ++ +G+P G+P LY EA+HG S+
Sbjct: 170 YMDPTLPVEERVESLLSVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAVHGFSY- 225
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT 129
G+ GAT FP + A++N+ L ++I V E L +
Sbjct: 226 --------GS-------GATIFPQALAMGATWNKKLTEEIAMAVGDE-----TLAAGTMQ 265
Query: 130 FWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISA 189
WSP ++V +D RWGR ET GEDP +V + +++G Q S+ L +
Sbjct: 266 AWSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SKGLFTTP 313
Query: 190 CCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGI 249
KH+ + R D ++E++M+E ++PF + D S+M +Y+ G+
Sbjct: 314 --KHFGGH---GAPLGGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGV 368
Query: 250 PTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC 309
P +LL+ +R +W F G+IVSDC +I + + K +A + L AG+ +C
Sbjct: 369 PVAKSKELLHNILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNC 428
Query: 310 GDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC----N 364
GD Y + + A + G++ ++D R + ++ R F+ +P K L N I +
Sbjct: 429 GDTYNDKEVIQAAKDGRLNMENLDNVCRTMLRMMFRNELFEKAPN-KPLDWNKIYPGWNS 487
Query: 365 PQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCR 422
H E+A +AAR+ IV+L+N LPL+ G I+++A++GP A+ + G+Y + P +
Sbjct: 488 DNHKEMARQAARESIVMLENKENILPLDKG-IRSIAVLGPGADDLQP--GDYTPKLLPGQ 544
Query: 423 YTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
S + G +KVI Y GC D + + IP A+ AA +D V+V G + E
Sbjct: 545 LKSVLTGIKQAVGKQTKVI-YEQGC-DFTNLSETNIPKAVKAASQSDVVVMVLGDCSTSE 602
Query: 479 A---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKI 529
A E D L+LPG Q EL+ V K PV LV+ + N K +
Sbjct: 603 ATTDVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILVLQAGRP--YNLTKASKLC 659
Query: 530 KSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRT 589
K+I+ PG+EGG A ADV+FG YNP GRLP+T+ + +PL GR
Sbjct: 660 KAIIVNWLPGQEGGPATADVLFGDYNPAGRLPMTFPQH------VGQLPLYYNFKTSGRR 713
Query: 590 YKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
Y++ D +Y FGYGLSYT F+Y K Q+ + N TV
Sbjct: 714 YEYSDLEYYPLYYFGYGLSYTSFEYSGL-----------KVQEKDNGNITV--------- 753
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAG 706
Q V+N+G+ G EVV +Y + T I ++ + R+ + G
Sbjct: 754 ---------------QATVKNVGQRAGDEVVQLYVTDMYASVKTRITELKDFTRINLKPG 798
Query: 707 QSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+S V F + L ++++ + ++ G ILVG
Sbjct: 799 ESKTVSFELTPY-DLSLLNDHMDRVVEKGEFKILVG 833
>gi|153809292|ref|ZP_01961960.1| hypothetical protein BACCAC_03604 [Bacteroides caccae ATCC 43185]
gi|149128062|gb|EDM19283.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
caccae ATCC 43185]
Length = 946
Score = 261 bits (667), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 231/807 (28%), Positives = 361/807 (44%), Gaps = 138/807 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEWWSEALH-GVSFI 69
Y D P R +DL+ +MTL EK QM L YG R+ LP EW ++ G+ I
Sbjct: 53 YEDPTAPIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTSEWKNQLWKDGIGAI 111
Query: 70 GRRTNS------PPG-------------------------------THFDSE-VPG---- 87
N PP T F +E + G
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L ++G EAR + G T ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRQLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRG+Q +++A KH+ AY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQHNH-------------QVAATGKHFIAYSNNKGARE 272
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ PF+ + E + VM SYN +G P + L +RG+
Sbjct: 273 GMARVDPQMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE AV + ++AGL++ C D Y V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA-EAARQGIV 380
++G ++E I+ +R + V +G FD Q G + + E A +A+R+ IV
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFDTPYQTDLKGADEEVEKKENEEVALQASRESIV 451
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
LLKN+ LPL+ I+ +A+ GP+A+ + +Y TS + G K +
Sbjct: 452 LLKNEKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADV 511
Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
Y GC D+V N I A+ AK AD ++V G E K
Sbjct: 512 LYTKGC-DLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENK 570
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +G
Sbjct: 571 SRSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKG 627
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G A+AD++FG YNPGG+L +T + +IP+ + P +P + G DG +
Sbjct: 628 GIAVADILFGDYNPGGKLTVT-FPKTVGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRANG 685
Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFGYGLSYT F+Y D+ + P A V CK
Sbjct: 686 ALYPFGYGLSYTTFEYS-------------------DLKISPAIITPNQKAY----VTCK 722
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
V N GK G EV+ +Y + T+ K + G+ERV + G++ ++ F +
Sbjct: 723 ---------VTNTGKRSGDEVIQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFPI 773
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
+ K+L++++ + ++ G T+++G
Sbjct: 774 DR-KALELLNADMHWVVEPGDFTLMLG 799
>gi|298387489|ref|ZP_06997041.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259696|gb|EFI02568.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 950
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 226/762 (29%), Positives = 357/762 (46%), Gaps = 119/762 (15%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEAL 63
K++D Y DA LP ER + L+ MT PE ++ +G+P G+P LY EA+
Sbjct: 160 KVTDRRYMDASLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAV 216
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HG S+ G+ GAT FP + A++N L +++ + E A N
Sbjct: 217 HGFSY---------GS-------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NT 259
Query: 124 GNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
A WSP ++V +D RWGR ET GEDP +V + +++G Q SR
Sbjct: 260 KQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SR 303
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
L + KH+ + R D ++E++M+E ++PF + D S+M +Y
Sbjct: 304 GLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREIHLVPFRHAIRNYDCQSLMMAY 358
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
+ G+P +LL Q +R +W F+G+IVSDC +I + + K +A + L A
Sbjct: 359 SDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAA 418
Query: 304 GLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI 362
G+ +CGD Y N + A + G+I D+D R + + R F+ +P K L I
Sbjct: 419 GIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKI 477
Query: 363 C----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-- 416
+ H E+A +AAR+ IV+L+N LPL+ + T+A++GP A+ + G+Y
Sbjct: 478 YPGWNSDSHKEMARQAARESIVMLENKENLLPLSK-TLCTIAVLGPGADDLQP--GDYTP 534
Query: 417 EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG 472
+ P + S + G +KV+ Y GC D + + IP A+ AA +D ++V G
Sbjct: 535 KLLPGQLKSVLTGIKGAVGKQTKVL-YEQGC-DFTNPDETNIPKAVKAASQSDVVIMVLG 592
Query: 473 LDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
+ EA E D L+LPG Q EL+ V K PV L++ + DI
Sbjct: 593 DCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI--L 649
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVN 583
K + K+IL PG+EGG A+ADV+FG YNP GRLP+T+ +PL
Sbjct: 650 KASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMTFPRH------VGQLPLYYNF 703
Query: 584 NFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
GR Y++ D +Y FG+GLSYT F+Y ++K+ Q+ + N V
Sbjct: 704 KTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------NLKI---QEKANGNVEV--- 749
Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYER 700
Q V+N+G G EV +Y + T + ++ + R
Sbjct: 750 ---------------------QATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFAR 788
Query: 701 VFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ + G+S V F M + ++++ + ++ G I+VG
Sbjct: 789 IHLQPGESKTVSFEMTPY-DISLLNDRMDRVVEKGEFKIMVG 829
>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 864
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 169/453 (37%), Positives = 240/453 (52%), Gaps = 47/453 (10%)
Query: 9 LSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSF 68
L+ Y D ERA+DLV+++TL EKV M D + V RLG+ Y WW+EALHGV+
Sbjct: 19 LAQSIYKDNSYSPAERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVAR 78
Query: 69 IGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA-- 126
G AT FP I ASF+ VS EARA +A
Sbjct: 79 SG----------------WATVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAAYSAEG 122
Query: 127 ------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
GLT W+P +N+ RDPRWGR +ET GEDPY+ ++ V+GLQ + D
Sbjct: 123 SYKRYQGLTIWTPTVNIYRDPRWGRGIETYGEDPYLASVMGVSVVKGLQCL-------DE 175
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSV 239
+ + K+ AC KH+A + W +R F++ ++ +D+ ET++ PFE V EG V V
Sbjct: 176 NEKYDKVHACAKHFAVHSGPEW---NRHSFNAENISPRDLYETYLPPFEALVKEGKVKEV 232
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAV 297
MC+YNR G P C +LLN +R +W + G +V+DC +I + HK D +
Sbjct: 233 MCAYNRFEGEPCCGSNRLLNHILRREWGYDGIVVADCSAISDFHNDKGHKTHADAASASS 292
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YK 355
A VL +G DL+CG Y + T G V++G I EADID S++ L LG D Q +
Sbjct: 293 AAVL-SGTDLECGSNYRSLTEG-VKKGFIDEADIDRSVKRLLQARFELGEMDEPDQVRWA 350
Query: 356 NLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN 415
+ + +C+ +H L+ + AR+ + LL N N ALPL G T+A++GP+AN + GN
Sbjct: 351 QIPYSVVCSDKHDSLSLDMARKSMTLLLNKNNALPLERGGT-TIAVMGPNANDSVMQWGN 409
Query: 416 YEGTPCRYTSPMDGFYAY----SKVINYAPGCA 444
Y G P R + +DG + K+I Y GC+
Sbjct: 410 YNGLPKRTITILDGIRSAMGKDDKLI-YEQGCS 441
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 134/297 (45%), Gaps = 53/297 (17%)
Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQT 494
D+ + + I ++ K+AD + G+ +E E G DR D+ LP Q
Sbjct: 583 DLGFKEEADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIELPAVQR 642
Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
E+I + DA K ++ ++ I ++IL YPG+ GG+A+A+V+FG Y
Sbjct: 643 EMIKALHDAGK---KVIFVNCSGSPIAMEPETEYCQAILQAWYPGQSGGKAVAEVLFGDY 699
Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
NP GRLP T+Y +P N G TY+FF+G ++PFGYGLSYT FKY
Sbjct: 700 NPAGRLPATFYRN------LAQLPDFEDYNMAGHTYRFFNGEPLFPFGYGLSYTTFKYG- 752
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
I+L Q D + V N G +G
Sbjct: 753 -------KIQLKSSAQT-------------------------DETVKITVPVTNTGSRNG 780
Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
EVV VY K G +K + ++RV+I AG++ KV + K L+ D+A N++
Sbjct: 781 EEVVQVYLKKQGETDGPVKTLRAFKRVYIPAGKTVKVELELTP-KQLEWWDSATNTM 836
>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
Length = 864
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 159/449 (35%), Positives = 233/449 (51%), Gaps = 47/449 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
S PY D L +RA DL++R+T+ EKV M + + G+ RLG+ YEWW+EALHGV+
Sbjct: 26 SQLPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVARA 85
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA--- 126
G AT FP I ASF+++L ++ +S EARA N
Sbjct: 86 GL----------------ATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQ 129
Query: 127 -----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
GLT W+PNIN+ RDPRWGR ET GEDPY+ R + V+GLQ + Y+
Sbjct: 130 YKRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYN---- 185
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVM 240
K+ AC KH+A + W +R F++ + +D+ ET++ F+ V E DV VM
Sbjct: 186 ----KLHACAKHFAVHSGPEW---NRHSFNAENIIPRDLWETYLPAFKTLVQEADVKEVM 238
Query: 241 CSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAV--- 297
C+YNR G P C +LL Q +R +W F G +VSDC +I + K ++T DA
Sbjct: 239 CAYNRFEGDPCCGSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKK--HNTHPDAAHAS 296
Query: 298 ARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL 357
A + G DL+CG Y T A++ G I+E I+ S++ L LG + + L
Sbjct: 297 AEAVLNGTDLECGSNYRKLTE-AIKAGIISEKQINVSVKRLLKARFELGEMENIHPW-TL 354
Query: 358 GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE 417
+ + +P+H LA + A + + LL+N LPL+ +A++GP+AN + GNY
Sbjct: 355 PYSIVDSPKHRCLALKMAHETMTLLQNKGKVLPLDKQ--ARIAIIGPNANDSVMQWGNYN 412
Query: 418 GTPCRYTSPMDGFYAYSKV--INYAPGCA 444
GTP ++ + F + + Y P C
Sbjct: 413 GTPSHTSTLLSAFRKRLPISHLIYEPVCG 441
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 135/303 (44%), Gaps = 59/303 (19%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
I ++ K+ D + G+ S+E E G DR D+ P Q +++ + +A
Sbjct: 591 ISNTLEKLKDIDIIIFAGGISPSLEGEEMNVSATGFKGGDRTDIEFPAVQRKVLAALKEA 650
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKS---ILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
K V LV S A+ + P+ KS IL YPGEEGG AI +V+FG YNP GRL
Sbjct: 651 GK-KVILVNFSGSAMALT-----PETKSCDAILQAWYPGEEGGMAIVNVLFGDYNPAGRL 704
Query: 561 PITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
PIT+Y++ +P + GRTY++ ++PFGYGLSYT F +
Sbjct: 705 PITFYKS------IDQLPDFENYSMKGRTYRYMQEEPLFPFGYGLSYTTFAFG------- 751
Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
+ NK +A K T I ++N+G DG EVV +
Sbjct: 752 ----------------KIHINKNSLSA---------GEKVTLHIPIKNIGDRDGVEVVQI 786
Query: 681 YSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTI 739
Y + +K + ++RV I G++ +V + + + D N++ G + I
Sbjct: 787 YIQRQADKEGPVKTLRAFKRVEIPKGKTQEVKIELPYV-AFEWFDPTTNTMRPIQGEYNI 845
Query: 740 LVG 742
L G
Sbjct: 846 LYG 848
>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 765
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 219/703 (31%), Positives = 338/703 (48%), Gaps = 118/703 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P+ + E LHG + IG TSFP I A+F+ L + +
Sbjct: 104 RLGIPVI-FHEECLHGHAAIG-----------------GTSFPQPIGLGATFDPELVESL 145
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
+ EARA + LT P ++V R+PRWGRV ET GEDP++V R I VRG Q
Sbjct: 146 FAMTAAEARARGT--HQALT---PVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQ 200
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
G RD ++ A KH+AA+ N V+ + ++ETF+ PF+
Sbjct: 201 ---GDATFRDKT----RVIATLKHFAAHGQPESGTN---CAPVNVSMRVLRETFLFPFKE 250
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV---ESH 286
+++G SVM SYN ++G+P+ A LL +R +W F G++VSD +I + ESH
Sbjct: 251 ALDKGCAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDYYAIYELSYRPESH 310
Query: 287 -KFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLM 343
F+ K +A A ++AG++++ D Y + + V +G + E+ +D + +
Sbjct: 311 GHFVAKDKREACALAVQAGVNIELPEPDCYLHL-VDLVHKGVLQESQLDELVEPMLRWKF 369
Query: 344 RLGYFDGSPQYKNLGKNNI--CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLAL 401
++G FD P I C+ H ELA +AAR+ I LLKND +PL+ IKT+A+
Sbjct: 370 QMGLFD-DPYVDPAEAERIAGCD-AHRELAMQAARETITLLKNDGPVVPLDLSAIKTIAV 427
Query: 402 VGPHANATKAMIGNYEGTPCRYTSPMDGFY----AYSKVINYAPGCADIV---------- 447
+GP+AN ++++G Y G P + +DG + +KV+ YA GC +
Sbjct: 428 IGPNAN--RSLLGGYSGVPKHDVTVLDGIRERVGSRAKVV-YAEGCKITIGGSWVQDEVT 484
Query: 448 ----CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELI 497
++ I A+ AK AD V+ G + E DR L L G Q EL+
Sbjct: 485 PSDPAEDRRQIAEAVKVAKRADVIVLAIGGNEQTSREAWSPKHLGDRPSLDLVGRQEELV 544
Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
+ K PV + + + IN+ + + +I Y G+E GRA+A+V+FG NPG
Sbjct: 545 RAMVATGK-PVIAFLFNGRPISINYLAQS--VPAIFECWYLGQETGRAVAEVLFGDTNPG 601
Query: 558 GRLPITWYEANYVKIPYTSMPLRPV-NNFPG--RTYKFFDGPVVYPFGYGLSYTQFKYKV 614
G+LPIT IP ++ L N+ P R Y F + +Y FGYGLSYT F ++
Sbjct: 602 GKLPIT--------IPRSAGHLPAFYNHKPSARRGYLFDEVGPLYAFGYGLSYTTFAFQ- 652
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
+++L K + R+ A VL+D V N G +G
Sbjct: 653 -------NLRLAKKKMHRE----------STARVLVD--------------VTNTGAREG 681
Query: 675 SEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMN 716
EVV +Y + + T IK++ G+ ++ + GQ+ V F +
Sbjct: 682 REVVQLYIRDLVSSVTRPIKELKGFRKITLQPGQTQTVEFEIT 724
>gi|319901412|ref|YP_004161140.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
gi|319416443|gb|ADV43554.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
P 36-108]
Length = 944
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 226/805 (28%), Positives = 360/805 (44%), Gaps = 138/805 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D R +DL+++M+L EK QM L YG R+ LP EW W + + +
Sbjct: 53 YEDPTAAIDARIEDLLKQMSLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGAI 111
Query: 67 S--FIGRRTNSPPGTHFDSEVPG------------------------------------- 87
G R P + ++ P
Sbjct: 112 DEHLNGFRQWGLPPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESY 171
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L KIG EAR + G T ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRELIHKIGFITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V I VRG+Q Y+ +++A KH+AAY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ------YNH-------QVAATGKHFAAYSNNKGARE 272
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ I PF + E + VM SYN +GIP L +RG+
Sbjct: 273 GMSRVDPQISPREVENIHIYPFRRVIREAGLLGVMSSYNDYDGIPIQGSHYWLTTRLRGE 332
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
F GY+VSD D+++ + H D KE A+ + ++AGL++ C D + V
Sbjct: 333 IGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AIRQSVEAGLNIRCTFRSPDSFVLPLRELV 391
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN-NICNPQHIELAAEAARQGIV 380
++G ++E I+ +R + V G FD Q G + + ++ +A +A+R+ IV
Sbjct: 392 KEGGLSEEIINDRVRDILRVKFLTGLFDTPYQSDLAGADREVEKEENGSIALQASRESIV 451
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGF---YAYSKVI 437
LLKN+N LPL+ +K +A+ GP+A+ + +Y + + G + +
Sbjct: 452 LLKNENNMLPLDLSTVKRIAVCGPNADEKNYALTHYGPLAVEVITVLKGIQDKVSGKAEV 511
Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
Y GC D+V N + I A + A+ +D V+V G E K
Sbjct: 512 LYTKGC-DLVDANWPESEIINHPLTADEQAEINKAAENARQSDVAVVVLGGGQRTCGENK 570
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ + K PV LV+++ + +N+A + + +IL YPG +G
Sbjct: 571 SRSSLDLPGRQLQLLQAIQATGK-PVILVLINGRPLSVNWA--DKYVPAILEAWYPGAKG 627
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G A+ADV+FG YNPGG+L +T + +IP+ + P +P + G +G +
Sbjct: 628 GIALADVLFGDYNPGGKLTVT-FPKTVGQIPF-NFPYKPASQIDGGKNPGPEGNMSRING 685
Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFGYGLSYT F+Y D+ T P A
Sbjct: 686 ALYPFGYGLSYTTFEYS-------------------DLEITPKVITPNEEA--------- 717
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
T +++V N GK G EVV +Y + T+ K + G+ERV + G++ +V FT+
Sbjct: 718 ----TVRLKVTNTGKRAGDEVVQLYIRDVVSSVITYEKNLAGFERVHLEPGETKEVVFTL 773
Query: 716 NACKSLKIVDNAANSLLASGAHTIL 740
K L+++D ++ G TI+
Sbjct: 774 -GRKHLELLDANMQWVVEPGDFTIM 797
>gi|448410571|ref|ZP_21575276.1| beta-glucosidase [Halosimplex carlsbadense 2-9-1]
gi|445671607|gb|ELZ24194.1| beta-glucosidase [Halosimplex carlsbadense 2-9-1]
Length = 760
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 207/707 (29%), Positives = 332/707 (46%), Gaps = 104/707 (14%)
Query: 86 PGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGR 145
P T+FP I ++++ L + T+ + A +G A SP ++V RD RWGR
Sbjct: 102 PEGTTFPQGIGMASTWDPDLMAAVTDTIGDQLEA---IGTA--HALSPVLDVARDLRWGR 156
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
V ET GEDPY+V A YV GLQ DS ISA KH+ + + G
Sbjct: 157 VEETYGEDPYLVAEMATAYVDGLQ----------GDSPADGISATLKHFVGHAV-GAGGK 205
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
+R D V+ + ++E + PFE + EG+ SVM +Y+ ++G+P D LL +RG+
Sbjct: 206 NRSSVD--VSRRTLREVHMFPFEAAIQEGNAESVMNAYHDIDGVPCAKDEWLLTDVLRGE 263
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL-----DCGDYYTNFTMGA 320
W F G +VSD S+ + E H +E AV+ V +AG+D+ DC +Y A
Sbjct: 264 WGFDGTVVSDYFSVDFLKEEHGVAATQQEAAVSAV-EAGVDVELPNTDCYEYLAE----A 318
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V+ G +AE +D S+R + G F+ + + + + LA EAAR +V
Sbjct: 319 VRDGDLAEESLDESVRRVLRAKFEKGLFEEYTVDVDAATDPYEDEAAVGLAREAARDSLV 378
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV---- 436
+LKN++ LPL+ + ++A+VGP A+ K M+G+Y Y F A + +
Sbjct: 379 VLKNESDLLPLDDAD--SVAVVGPKADDKKGMLGDY-AYAAHYPEEEYEFEADTPLSAIE 435
Query: 437 ------INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVE------------ 478
+NYA GC + I A++AA+NAD + G +V+
Sbjct: 436 NRVGADVNYAQGCT-ATGNSTDKIGRAVEAAENADVALAFVGARSAVDFSDADGVKAEQP 494
Query: 479 -----AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
EG D DL LPG Q EL+ +V + PV +V++S I + + +++
Sbjct: 495 MVPTSGEGCDVTDLGLPGVQNELVAQV-EETDTPVVIVLVSGKPHAI--PEIDAGADAVV 551
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYKF 592
PGEE G AI DV+F ++ GG LP++ ++ + + Y+ P N Y +
Sbjct: 552 QAWLPGEEAGNAIVDVVFEGHDSGGHLPVSMPKSVGQLPVHYSRKP-----NTYSEDYVY 606
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
D VYPFG+GLSY +F+Y GT
Sbjct: 607 DDAQPVYPFGHGLSYAEFEYSDLDLSDV-------------DVDPSGT------------ 641
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQSAK 710
F+ + VEN + DGS+VV +Y ++ P +A +++++G+ RV + AG+S +
Sbjct: 642 -------FSASVTVENTAERDGSDVVQLYVSAENPDLA-RPVQELVGFRRVELDAGESTE 693
Query: 711 VGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLN 757
+ F + A + L D AN + +G + + VG ++ +L++
Sbjct: 694 ITFDLAASQ-LAYHDRNANLAVEAGDYELRVGHSSEEIAESARLSVT 739
>gi|390167927|ref|ZP_10219905.1| beta-glucosidase, partial [Sphingobium indicum B90A]
gi|389589522|gb|EIM67539.1| beta-glucosidase, partial [Sphingobium indicum B90A]
Length = 771
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 226/770 (29%), Positives = 353/770 (45%), Gaps = 123/770 (15%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEK--------VQQMGDLAYGVPRLGLPLYEWWSEAL 63
FP+ + P AK +P + V + A RLG+P+ + E L
Sbjct: 71 FPHGMGQFTRPSDAKGAFSPREIPGRNPRQTVALVNALQRWATTQTRLGIPIL-FHEEGL 129
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HG + +G ATSFP I +S++ L +++ ++ E R+
Sbjct: 130 HGYAAVG-----------------ATSFPQSIAMASSWDPDLLREVNAVIAREIRSR--- 169
Query: 124 GNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
G++ SP +++ RDPRWGR+ ET GEDPY+VG + V GLQ R
Sbjct: 170 ---GVSLVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQG-----KGRSRLL 221
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
P K+ A KH + N + V+E++++E F PFE V + +VM S
Sbjct: 222 PPGKVFATLKHLTGHGQPESGTN---VGPAPVSERELRENFFPPFEQVVKRTGIEAVMAS 278
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
YN ++G+P+ A+ LL +RG+W F G +VSD ++ ++ H D E A R L
Sbjct: 279 YNEIDGVPSHANRWLLRDVLRGEWGFRGAVVSDYSAVDQLMNIHHVAADL-EQAAGRALD 337
Query: 303 AGLDLDCGDYYTNFTMG-AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN 361
AG+D D D + T+G V++GKI EA +D ++R + + R G F+ +P
Sbjct: 338 AGVDADLPDGLSYATLGRQVREGKIGEALVDRAVRHMLELKFRAGLFE-NPYADAAASEK 396
Query: 362 ICNPQHIELAAEAARQ-GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I N A A Q I+LLKND G LPL ++A++GP +A A +G Y G P
Sbjct: 397 ITNDGRARALALKAAQRSIILLKND-GMLPLKPEG--SIAVIGP--SAAVARLGGYYGQP 451
Query: 421 CRYTSPMDGFYA----YSKVINYAPGC---------ADIV-----CQNNSMIPAAIDAAK 462
S ++G A +K++ +A G AD V +N +I A++AA+
Sbjct: 452 PHSVSILEGIRAKVGNRAKIV-FAQGVRITENDDWWADKVTRSDPAENRRLIAQAVEAAR 510
Query: 463 NADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAG 516
+ D V+ G EG DR L L G Q EL + + K P+ +V+++
Sbjct: 511 HVDRIVLTLGDTEQSSREGWADNHLGDRPSLDLMGEQQELFDALKALGK-PIAVVLINGR 569
Query: 517 AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTS 576
+ K + + +IL Y GE+GG A+ADV+FG NPGG+LP+T IP ++
Sbjct: 570 PA--STVKVSEQADAILEGWYLGEQGGHAVADVLFGDVNPGGKLPVT--------IPRSA 619
Query: 577 MPLRPVNNF---PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRD 633
L N R Y F +YPFG+GLSYT F S+P+ K+ R
Sbjct: 620 GQLPMFYNVKPSARRGYLFDTTDPLYPFGFGLSYTSFDL---SAPRLSAAKIGVGGTTR- 675
Query: 634 INYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHI 692
++V N G+ +G EVV +Y + G I
Sbjct: 676 ----------------------------VSVDVRNSGRREGDEVVQLYVRDKVGSVTRPI 707
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
K++ G++RV + G+ V FT+ ++L++ ++ + ++ G I+ G
Sbjct: 708 KELKGFQRVTLKPGEVRTVTFTV-GPEALQMWNDHMDRVVEPGDFEIMTG 756
>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
Length = 826
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 220/828 (26%), Positives = 362/828 (43%), Gaps = 145/828 (17%)
Query: 11 DFPYCDAKLPYPERAKDLVERMTLPEKVQQMGD--------LAYGVPRLGLPLYEWWSEA 62
+ P+ + LP R DL+ R+T E +QQ+ + A G+ RL + Y+W
Sbjct: 26 EHPFRNPSLPANFRVDDLLARLTNEELIQQVSNGGAGPQHGPAPGIARLNISAYQW---- 81
Query: 63 LHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN 122
RTN PG D + T FP + A+F+ ++ + E RA +N
Sbjct: 82 ---------RTN--PG---DGRI---TPFPQPVNLGATFDVHTVYRVARATGLEMRARWN 124
Query: 123 LGNA--------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
A G+ ++P +N++R P WGR ET GEDP+++G+ A +VRGL +
Sbjct: 125 RAKAKKTYRDGNGIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGGWKNA 184
Query: 175 EYH----RDSDSRP--LKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFE 228
E ++ S+P L + A CKH+A + R F++ VT+ D+ +T++ F
Sbjct: 185 EPQSLDEQNLSSQPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTYLPAFR 244
Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
C+ G VS VMC+Y+ +NG P C + LL + +R W F G++V+DC ++Q ++ H+
Sbjct: 245 ACLEAGAVS-VMCAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVIWKHQI 303
Query: 289 LNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQ----GKIAEADIDTSLRFLYIVLMR 344
N E A+A V +AG++L+ Y + G ++ + R L++ +
Sbjct: 304 FNHYNETAMAAV-RAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLFLTRLM 362
Query: 345 LGYFDGSPQ--YKNLG-KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNT------GN 395
G F+ Y+ L + I N H +A + IVLL+N + LPL G
Sbjct: 363 QGEFNPVEMDPYRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLKNNMSDSGGP 422
Query: 396 IKTLALVGPHANATKAMIGNYEGTP-CRYTSPMD-GFYAYSKVINYAPGCAD---IVCQN 450
++ +A+VGP A + + G+Y P P+ G S+ ++ + C D N
Sbjct: 423 LRHIAIVGPFATSVTELYGHYRTAPEPEIEVPLSKGLSQLSRRMHASDICTDGGRCSSLN 482
Query: 451 NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG---- 506
+ + + + + D V+ G VE E DR ++ LPG Q EL+ + + G
Sbjct: 483 DDALHSTL-GYDDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEETLKLSSGLGNS 541
Query: 507 -------PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGK------ 553
P+ L++ SAG ++I+ A N +K+I W G+PG G A+ ++ G
Sbjct: 542 GLSKRTVPIILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHLLLGSSGELFG 601
Query: 554 ---------------------------YNPGGRLPITWYEA--NYVKIPYTSMPLRPVNN 584
+ P RLP TWYE+ I M +
Sbjct: 602 PSKPISVGFHSFQEAYRWDVTPDDGYWWIPAARLPFTWYESIDQLANITVYEMTNQTYRY 661
Query: 585 FPGRTYKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTN 641
P + + + PV+YPFGYGLSY F AS D+
Sbjct: 662 LPTQCHMSSEDCKIPVLYPFGYGLSY-NFNLSGASGFVYSDL------------------ 702
Query: 642 KPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH------IKQV 695
P +AV + + F + V+N G + EVV VY+K + Q+
Sbjct: 703 IAPSSAV------SSNQRIVFYVTVQNEGPIACEEVVQVYTKWLNRTENDNSRNGPLIQL 756
Query: 696 IGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLL-ASGAHTILVG 742
G+ERV + G+ ++ FT+ + L + + N+++ G I VG
Sbjct: 757 AGFERVRLDVGEYKQLKFTLIPSEHLAVWSLSENTMIPGRGVLQISVG 804
>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 863
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 158/466 (33%), Positives = 241/466 (51%), Gaps = 38/466 (8%)
Query: 16 DAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNS 75
D P R ++++ +MTL EKV Q+ + + +PRL LP Y +W+E LHGV+ G
Sbjct: 51 DLSQPISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAGE---- 106
Query: 76 PPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNI 135
T FP I ++++ L K+I +STEAR Y GLT+W+P I
Sbjct: 107 ------------VTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIGKGLTYWAPTI 154
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
N+ RDPRWGR ET GEDPY+ R + +V+GLQ LK A KH+
Sbjct: 155 NMARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---------GDHPNYLKTVATVKHFV 205
Query: 196 AYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADP 255
A N + NDRF S++ + + E + +E CV E +V S+M +YN NGIP
Sbjct: 206 A----NNQENDRFSSSSQIPTKQLYEYYFPAYEACVKEANVQSIMTAYNAFNGIPPSGST 261
Query: 256 KLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTN 315
LL +R +W F G++VSDC +I + H+ +N + E+A A + +G DL+CG Y
Sbjct: 262 WLLEDVLRKEWGFDGFVVSDCGAIGVMNWQHRIVN-SLEEAAALGINSGCDLECGGTYRE 320
Query: 316 FTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNNICNPQHIELAAE 373
+ AVQ+G ++E ID +L + + +LG FD Y + K + Q LA E
Sbjct: 321 NLVAAVQRGLVSEYAIDRALTRVLTMRFKLGEFDPIELVPYNHYDKKLLAGEQFRRLAYE 380
Query: 374 AARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG---F 430
AA + I+LLKN++ LP++ +++++A+VGP A+ +G Y G P S + G
Sbjct: 381 AAVKSIILLKNEDNFLPIDKKDVRSIAIVGPFADNN--YLGGYSGKPVHNISLLQGVKKM 438
Query: 431 YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
I+Y G + + ++S + A+ D N + G DL+
Sbjct: 439 VGEEVEISYIEGTSVVSPVDSSYLLAS-DGVNNGLTADYIDGHDLN 483
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 136/285 (47%), Gaps = 49/285 (17%)
Query: 464 ADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFA 523
AD ++ G D + E +D + LP Q EL+ K + L++ + + +A
Sbjct: 609 ADLVLVALGNDGKLARENRDLPSIYLPMTQ-ELLLKEIYKVNPRIALILQTGNPLTSQWA 667
Query: 524 KNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMP-LRPV 582
+ + SIL YPG+EGG A+A ++FG NP G+LP+T YE+ +P +
Sbjct: 668 AEH--VPSILQAWYPGQEGGAALAGILFGLENPSGKLPMTIYESE------QQLPNILDY 719
Query: 583 NNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
+ + GRTY++ +Y FG+GLSY+ F+Y D QC D+ + GT
Sbjct: 720 DIWKGRTYQYLSSKPLYGFGHGLSYSNFEY--------------ADLQCNDVVHVDGT-- 763
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY---SKPPGIAGTHIKQVIGYE 699
++C I+V+N+ + G EV+ VY K P + +K++I +
Sbjct: 764 ----------LQC-------SIKVKNISDVVGEEVIQVYVSREKTP-VYTFPLKKLIAFA 805
Query: 700 RVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
RV + +S V FT+ + L + + +L SG +++ VG G
Sbjct: 806 RVNLKPNESKTVTFTITP-RQLSVWQDGEWKML-SGKYSLFVGGG 848
>gi|423342899|ref|ZP_17320613.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
CL02T12C29]
gi|409217154|gb|EKN10133.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
CL02T12C29]
Length = 955
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 219/807 (27%), Positives = 360/807 (44%), Gaps = 138/807 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D P R +DL+ +M + EK QM L YG R+ LP +W W + + +
Sbjct: 61 YEDPTAPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTPDWKNQLWKDGMGAI 119
Query: 67 S----------------------------------FIGRRTNSPPGTHFDSE-VPG---- 87
F T T F +E + G
Sbjct: 120 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 179
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L K+G E R + G T ++P ++V RD RWGR
Sbjct: 180 IATNFPTQLGLGHTWNRDLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWGR 233
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V + +G+Q +Y +++A KHY AY +
Sbjct: 234 YEEVYGESPYLVAELGVEMAKGMQ----TDY---------QVAATSKHYIAYSNNKGGRE 280
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + P++ + E + VM SYN +G P + L +RG+
Sbjct: 281 GMARVDPQMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
+ F GY+VSD D+++ + H D KE + VL AGL++ C D Y +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKESVLQSVL-AGLNIRCTFRSPDSYVLPLRELI 399
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK-NLGKNNICNPQHIELAAEAARQGIV 380
+G + + ID +R + V +G FD Q + + ++ ++A +A+++ +V
Sbjct: 400 AEGALPMSTIDDRVRDILRVKFLVGLFDQPYQIDLKQADKEVNSAENQQVALQASKESLV 459
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
LLKN + LPL+ I +A+ GP+A+ + +Y T+ ++G K +
Sbjct: 460 LLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEV 519
Query: 438 NYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
+ GC D+V + S I A++ AK +D V+V G E K
Sbjct: 520 LFTKGC-DLVDANWPESELIRYPLTSEEQSEIDKAVENAKKSDVAVVVLGGSNRTCGENK 578
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +G
Sbjct: 579 SRSSLELPGRQLDLLQAVVATGK-PVVLVLINGRPISINWA--DKYVPAILEAWYPGSQG 635
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G AIAD +FG YNPGG+L +T + +IP+ + P +P G K DG +
Sbjct: 636 GTAIADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRVNG 693
Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFGYGLSYT F+Y S I + T P V+CK
Sbjct: 694 PLYPFGYGLSYTTFEYSDIS-----------------IQPAIVTQVQPVT------VRCK 730
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
V N GK G EVV +Y + T+ K ++G++R+ + G++ ++ FT+
Sbjct: 731 ---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTI 781
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
+ L+++++ + ++ G ++VG
Sbjct: 782 EP-RDLQLLNSDNHWVVEPGDFKVMVG 807
>gi|404404031|ref|ZP_10995615.1| glycoside hydrolase family protein [Alistipes sp. JC136]
Length = 740
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 193/625 (30%), Positives = 309/625 (49%), Gaps = 75/625 (12%)
Query: 111 QTVSTEAR-AMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGL 168
+T+ AR A AGL + ++P +++ RDPRWGRV+E GEDPY+ A VRG
Sbjct: 133 ETIEASARMAAVEASAAGLQWTFAPMVDIARDPRWGRVMEGAGEDPYLGSHIARARVRGF 192
Query: 169 QDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFE 228
Q D S P I AC KH+A Y G D D +++Q ++E ++ PF+
Sbjct: 193 QG--------DDLSAPNTILACAKHFAGYGASEG-GRDYNTVD--ISDQRLRELYLPPFK 241
Query: 229 MCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKF 288
+ ++ M S+N ++G+P + L+ Q +R +W + G IVSD S+ ++ H
Sbjct: 242 AAADA-GAATFMNSFNELSGVPATGNRFLVKQILRNEWGWDGVIVSDWGSVAEMI-PHGI 299
Query: 289 LNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
D K+ A+ V K D+D G+ Y + V++GK++E +ID S+R + + LG
Sbjct: 300 AEDKKQAALLAV-KNECDIDMEGNCYPSSLEELVKEGKVSEKEIDRSVRRILRLKYELGL 358
Query: 348 FDGSPQY--KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPH 405
FD +Y + K + H E A + AR+ IVLL+N LPL G +++A+VGP
Sbjct: 359 FDDPYRYCDEQREKEVTLSAAHREAARDMARKSIVLLENRKSVLPL--GKPRSIAVVGPL 416
Query: 406 ANATKAMIGNY--EGTPCRYTSPMDGFYAYSKV---INYAPGCADIVCQNNSMIPAAIDA 460
A++ M+G + +G P + + G + + +A GC D+ + S A+ A
Sbjct: 417 ADSPVDMLGEWRAKGDPKEVVTILRGIEKTAGAGTRVTHAKGC-DVTGSDRSGFAEAVRA 475
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A++AD + G + EG R +L LPG Q EL+ ++ K P+ L++ + + +
Sbjct: 476 ARSADVVIACLGESADMSGEGYCRSELGLPGVQQELLKELKKTGK-PIVLLLSNGRPLTL 534
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP------Y 574
+ K N I++I+ + G E G A+ADV+FGKYNP G+L ++ + N +IP +
Sbjct: 535 AWEKEN--IETIVETWFLGTEAGNAVADVLFGKYNPSGKLVMS-FPYNVGQIPVYYNHKH 591
Query: 575 TSMPLRPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCR 632
T P P + + D PV +YPFGYGLSYT+F+Y
Sbjct: 592 TGRPFEPNQRY---VMHYIDAPVDALYPFGYGLSYTRFEY-------------------- 628
Query: 633 DINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH- 691
+P L D T ++V N G DG EVV +Y + T
Sbjct: 629 --------GEP----TLSSDRMAAGDTITATVKVTNAGDYDGEEVVQLYIRDLKAQITRP 676
Query: 692 IKQVIGYERVFIAAGQSAKVGFTMN 716
+K++ G+ ++F+ G+SA V F +
Sbjct: 677 VKELKGFRKIFLKKGESADVTFDIT 701
>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 765
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 195/694 (28%), Positives = 324/694 (46%), Gaps = 98/694 (14%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
G T FP + +++N L++ + + V+ E R+ G +SP ++VVRDPRWGR
Sbjct: 122 GGTVFPVPLSIGSTWNVDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRT 176
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
E GEDPY++ YA+ V GLQ +S P ++A KH+ Y N
Sbjct: 177 EECFGEDPYLISEYAVASVEGLQG--------ESLDSPSSVAATLKHFVGYGSSEGGRNA 228
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
H +R ++ E +LPF+ V G +S+M +YN ++G+P + +LL+ +R +
Sbjct: 229 GPVHMGTR----ELMEVDMLPFKKAVEAG-AASIMPAYNEIDGVPCTVNTELLDGILRKE 283
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQG 324
W F G +++DC +I + H D DA + ++AG+D++ G+ + AV+
Sbjct: 284 WGFDGMVITDCGAIDMLASGHDTAEDGM-DAAVQAIRAGIDMEMSGEMFGKHLQKAVESN 342
Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
K+ + +D ++R + + +LG F+ +N I + QH+ LA + A +GIVLLKN
Sbjct: 343 KLEVSVLDEAVRRVLTLKFKLGLFENPYVDPQTAENVIGSEQHVGLARQLAAEGIVLLKN 402
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGFYAY----SKVIN 438
+ ALPL+ +A++GP+A+ +G+Y P T+ + G A ++ +
Sbjct: 403 EAKALPLSKEG-GVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVL 461
Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA-------- 479
YAPGC I + A+ A+ AD V+V G +DL A
Sbjct: 462 YAPGCR-IKDDSREGFEFALTCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDAL 520
Query: 480 ------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
EG DR+ L L G Q EL+ ++ K ++++ I + +IL
Sbjct: 521 SDMDCGEGIDRMTLQLSGVQLELVQEIHKLGK---RMIVVYINGRPIAEPWIDEHADAIL 577
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKF 592
YPG+EGG A+AD++FG NP G+L ++ + + Y R G+ Y
Sbjct: 578 EAWYPGQEGGHAVADILFGDVNPSGKLTMSIPKHVGQLPVYYNGKRSR------GKRYLE 631
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
D YPFGYGLSYT+F Y DI++ + +GT
Sbjct: 632 EDSQPRYPFGYGLSYTEFSYS--------DIQMTPE--------VIGT------------ 663
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKV 711
D + V N G +GSEVV +Y T +++ G++++F+ G+ KV
Sbjct: 664 ----DGTAVVSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKIFLQPGERRKV 719
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
FT+ + L+ + ++ G +++G V
Sbjct: 720 EFTIGP-EQLQYIGQDYRQVVEPGLFRVMLGRHV 752
>gi|380696432|ref|ZP_09861291.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 954
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 224/766 (29%), Positives = 361/766 (47%), Gaps = 123/766 (16%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
K +++D Y D LP ER + L+ MT PE ++ +G+P G+P LY E
Sbjct: 162 KGEVTDRRYMDVSLPVEERVESLLAVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVE 218
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
A+HG S+ G+ GAT FP + A++N+ L +++ + E A
Sbjct: 219 AVHGFSY---------GS-------GATIFPQALAMGATWNKKLTEEVAMVIGDETVAA- 261
Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
N A WSP ++V +D RWGR ET GEDP +V + +++G Q
Sbjct: 262 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQMGGAWIKGYQ------------ 305
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
SR L + KH+ + R D ++E++M+E ++PF + D S+M
Sbjct: 306 SRGLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREIHLVPFRHAIRNYDCQSLMM 360
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+Y+ G+P +LL Q +R +W F+G+IVSDC +I + + K +A + L
Sbjct: 361 AYSDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQAL 420
Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP----QYKN 356
AG+ +CGD Y N + A + G+I D+D R + + R F+ +P +K
Sbjct: 421 AAGIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLSTMFRNELFEKNPCKPLDWKK 480
Query: 357 L--GKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
+ G N + H E+A +AAR+ IV+L+N LPL+ ++T+A+VGP A+ + G
Sbjct: 481 IYPGWN---SDSHKEMARQAARESIVMLENKENLLPLSK-TLRTIAVVGPGADDLQP--G 534
Query: 415 NY--EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATV 468
+Y + P + S + G + +KV+ Y GC D + + IP A+ A +D +
Sbjct: 535 DYTPKLLPGQLKSVLTGIKSAVGKQTKVL-YEQGC-DFTNPDATNIPKAVKTASQSDVVI 592
Query: 469 IVAGLDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVD 519
+V G + EA E D L+LPG Q EL+ V K PV L++ + D
Sbjct: 593 MVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYD 651
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
I K + K+IL PG+EGG A+ADV+FG YNP GRLP+T+ +PL
Sbjct: 652 I--LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMTFPRH------VGQLPL 703
Query: 580 RPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYT 637
GR Y++ D +Y FG+GLSYT F+Y ++K+ Q+ + N
Sbjct: 704 YYNFKTSGRRYEYVDMEYYPLYRFGFGLSYTSFEYS--------NLKI---QEKANGNVE 752
Query: 638 VGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVI 696
V Q V+N+G G EV +Y + T + ++
Sbjct: 753 V------------------------QATVKNVGSCAGDEVAQLYVTDMYASVKTRVMELK 788
Query: 697 GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ R+ + G+S V F M + ++++ + ++ G I++G
Sbjct: 789 DFTRIHLQPGESKTVSFEMTPY-DISLLNDRMDRVVEKGEFKIMIG 833
>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
17393]
gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 152/412 (36%), Positives = 218/412 (52%), Gaps = 34/412 (8%)
Query: 20 PYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGT 79
P R ++L+ +MTL EKV Q+ + +PRL LP Y +W+E LHGV+ G
Sbjct: 55 PISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAGE-------- 106
Query: 80 HFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVR 139
T FP I ++++ L KK+ +STEAR Y GLT+WSP IN+ R
Sbjct: 107 --------VTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGKGLTYWSPTINMAR 158
Query: 140 DPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDL 199
DPRWGR ET GEDPY+ R + +V+GLQ H D LK A KH+ A
Sbjct: 159 DPRWGRNEETYGEDPYLTSRLGVAFVKGLQGD-----HPDY----LKTVATIKHFVA--- 206
Query: 200 DNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLN 259
N + NDRF S++ + + E + +E CV E D SVM +YN NG+ LL
Sbjct: 207 -NNQENDRFSSSSQIPTKQLYEYYFPAYEACVKEADAQSVMTAYNAFNGVAPSGSTWLLG 265
Query: 260 QTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMG 319
+R +W F G++VSDC +I + H+ +N + E+A A + +G DL+CG Y +
Sbjct: 266 DVLRKEWGFDGFVVSDCGAIGVMNWQHRVVN-SLEEAAALGINSGCDLECGGTYREKLVA 324
Query: 320 AVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNNICNPQHIELAAEAARQ 377
AV+ G ++E ID +L + +LG FD Y + K + + +LA EAA +
Sbjct: 325 AVKMGLVSEQAIDKALTRVLTARFKLGEFDPIELVPYNHYDKKLLAGEKFGKLAYEAAVK 384
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDG 429
IVLLKNDN LP++ I+++A+VGP A+ +G Y G P S + G
Sbjct: 385 SIVLLKNDNDFLPVDKKKIRSVAIVGPFADNN--YLGGYSGKPVHNVSLLQG 434
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 85/311 (27%), Positives = 140/311 (45%), Gaps = 49/311 (15%)
Query: 450 NNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVT 509
N+ I + AD ++ G D + E +D + LP Q L+ ++ P T
Sbjct: 595 NSDQIDKVKEFVSGADLVLVALGNDEKLARENRDLPSIYLPMTQELLLKEIYKV--NPRT 652
Query: 510 LVIMSAG-AVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEAN 568
+I+ G + +A N + +IL YPG+EGG+A+A ++FG NP G+LP+T YE+
Sbjct: 653 ALILHTGNPLTSKWAAEN--VPAILQAWYPGQEGGKALAGILFGSENPSGKLPMTIYESE 710
Query: 569 YVKIPYTSMP-LRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDK 627
+P + + + GRTY++ +Y FG+GLSY+ F+Y S
Sbjct: 711 ------EQLPDILDYDIWKGRTYQYLSSKPLYGFGHGLSYSNFEYTHLQS---------- 754
Query: 628 DQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPP 685
DDV D IE++N+ + G EVV VY +
Sbjct: 755 -----------------------DDVVRPDGTLQCSIEIKNISDVAGEEVVQVYISRENT 791
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
+ +K+++ + RV + G+S V FT+ A + L I +L G +++ VG G
Sbjct: 792 PVYTFPLKKLVAFARVDLKPGESKTVTFTI-APRQLSIWQEGIWKMLP-GKYSLFVGSGQ 849
Query: 746 GGVSFPLQLNL 756
G+S + N
Sbjct: 850 EGLSKGINRNF 860
>gi|218258058|ref|ZP_03474485.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
DSM 18315]
gi|218225777|gb|EEC98427.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
DSM 18315]
Length = 955
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 219/807 (27%), Positives = 360/807 (44%), Gaps = 138/807 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEALHGV 66
Y D P R +DL+ +M + EK QM L YG R+ LP +W W + + +
Sbjct: 61 YEDPTAPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTPDWKNQLWKDGMGAI 119
Query: 67 S----------------------------------FIGRRTNSPPGTHFDSE-VPG---- 87
F T T F +E + G
Sbjct: 120 DEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 179
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L K+G E R + G T ++P ++V RD RWGR
Sbjct: 180 IATNFPTQLGLGHTWNRDLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWGR 233
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE PY+V + +G+Q +Y +++A KHY AY +
Sbjct: 234 YEEVYGESPYLVAELGVEMAKGMQ----TDY---------QVAATSKHYIAYSNNKGGRE 280
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D +++ ++++ + P++ + E + VM SYN +G P + L +RG+
Sbjct: 281 GMARVDPQMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
+ F GY+VSD D+++ + H D KE + VL AGL++ C D Y +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKESVLQSVL-AGLNIRCTFRSPDSYVLPLRELI 399
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYK-NLGKNNICNPQHIELAAEAARQGIV 380
+G + + ID +R + V +G FD Q + + ++ ++A +A+++ +V
Sbjct: 400 AEGALPMSTIDDRVRDILRVKFLVGLFDQPYQIDLKQADKEVNSAENQQVALQASKESLV 459
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK---VI 437
LLKN + LPL+ I +A+ GP+A+ + +Y T+ ++G K +
Sbjct: 460 LLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEV 519
Query: 438 NYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
+ GC D+V + S I A++ AK +D V+V G E K
Sbjct: 520 LFTKGC-DLVDANWPESELIRYPLTSEEQSEINKAVENAKKSDVAVVVLGGSNRTCGENK 578
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R L LPG Q +L+ V K PV LV+++ + IN+A + + +IL YPG +G
Sbjct: 579 SRSSLELPGRQLDLLQAVVATGK-PVVLVLINGRPISINWA--DKYVPAILEAWYPGSQG 635
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV----- 597
G AIAD +FG YNPGG+L +T + +IP+ + P +P G K DG +
Sbjct: 636 GTAIADALFGDYNPGGKLTVT-FPKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRVNG 693
Query: 598 -VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
+YPFGYGLSYT F+Y S I + T P V+CK
Sbjct: 694 PLYPFGYGLSYTTFEYSDIS-----------------IQPAIVTQVQPVT------VRCK 730
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
V N GK G EVV +Y + T+ K ++G++R+ + G++ ++ FT+
Sbjct: 731 ---------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTI 781
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
+ L+++++ + ++ G ++VG
Sbjct: 782 EP-RDLQLLNSDNHWVVEPGDFKVMVG 807
>gi|423223721|ref|ZP_17210190.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638096|gb|EIY31949.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 954
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 226/760 (29%), Positives = 356/760 (46%), Gaps = 119/760 (15%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSEALHG 65
+ Y D LP ER + L+ MT PE ++ +G+P G+P LY EA+HG
Sbjct: 166 TSLRYMDPTLPVEERVESLLSVMT-PEDKMELIREGWGIP--GIPHLYVPPITKVEAVHG 222
Query: 66 VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGN 125
S+ G+ GAT FP + A++N+ L + + V E L
Sbjct: 223 FSY---------GS-------GATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAA 261
Query: 126 AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
+ WSP ++V +D RWGR ET GEDP +V + +++G Q S+ L
Sbjct: 262 GTMQAWSPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SKGL 309
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
+ KH+ + R D ++E++M+E ++PF + D SVM +Y+
Sbjct: 310 FTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSVMMAYSD 364
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
G+P +LL+ +R +W F G+IVSDC +I + + K +A + L AG+
Sbjct: 365 YLGVPVAKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGI 424
Query: 306 DLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC- 363
+CGD Y + + A + G+I ++D R + ++ R F+ +P K L N I
Sbjct: 425 ATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPN-KPLDWNKIYP 483
Query: 364 ---NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--EG 418
+ H E+A +AAR+ IV+L+N + LPL +++T+A+VGP A+ + G+Y +
Sbjct: 484 GWNSDSHKEMARQAARESIVMLENKDNILPL-AKDMRTIAVVGPGADDLQP--GDYTPKL 540
Query: 419 TPCRYTSPMDGFY----AYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
P + S + G +KV+ Y GC D N + IP A+ AA +D V+V G
Sbjct: 541 LPGQLKSVLTGIKQAVGKQTKVV-YEQGC-DFTSSNGTDIPKAVKAASQSDVVVLVLGDC 598
Query: 475 LSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKN 525
+ E+ E D L+LPG Q EL+ V A G ++I+ AG N +K
Sbjct: 599 STSESTTDVYKTSGENHDYATLILPGKQQELLEAV--CATGKPVILILQAGR-PYNLSKA 655
Query: 526 NPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF 585
+ K+IL PG+EGG A ADV+FG YNP GRLP+T+ +V +PL
Sbjct: 656 SELCKAILVNWLPGQEGGPATADVLFGDYNPAGRLPMTF--PRHV----GQLPLYYNFKT 709
Query: 586 PGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
GR Y++ D +Y FGYGLSYT F+Y K Q+ + N +
Sbjct: 710 SGRRYEYSDMEFYPLYYFGYGLSYTSFEYSGL-----------KIQEKDNGNVAI----- 753
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVF 702
Q V+N+G+ G EVV +Y + T I ++ + RV
Sbjct: 754 -------------------QATVKNVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVH 794
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ +S V F + + L ++++ + ++ G ILVG
Sbjct: 795 LQPDESKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILVG 833
>gi|427387416|ref|ZP_18883472.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
12058]
gi|425725577|gb|EKU88448.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
12058]
Length = 733
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 217/760 (28%), Positives = 356/760 (46%), Gaps = 92/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
Y DA P R KDL++RMTL EKV Q+ +G +P +G +Y
Sbjct: 25 YKDAGQPVETRVKDLLKRMTLHEKVLQLNQYTFGENDNPNNIGTEVKNLPAEIGSLIYLH 84
Query: 59 WSEALHGV----SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
L + R P FD T +P + SFN L + Q
Sbjct: 85 TDPKLRNQIQRKAMEESRLGIPILFGFDVIHGLRTVYPISLAQACSFNPDL---VTQACG 141
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
A+ L TF SP I+V RDPRWGR+ E GEDPY +N V G+ V+G
Sbjct: 142 MAAKESV-LSGIDWTF-SPMIDVARDPRWGRISECYGEDPY------LNTVFGVASVQGY 193
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
+ + SD P I+AC KHY Y EG + + + ++ Q + ET++ P+E CV G
Sbjct: 194 QGEKLSD--PYSIAACLKHYVGYGAS--EGGRDYRY-TDISPQALWETYLPPYEACVKAG 248
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P ++ +L + ++ W G++VSD ++I+ ++ ++ + ++
Sbjct: 249 -AATLMSSFNDISGVPATSNHYILTEILKNKWRHDGFVVSDWNAIEQLI--YQGVAKDRK 305
Query: 295 DAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
+A + AG+++D D Y + V + KI + ID ++ + V RLG FD P
Sbjct: 306 EAAYKAFHAGVEMDMRDNIYYEYLEQLVAEKKIQMSQIDDAVARILRVKFRLGLFD-EPY 364
Query: 354 YKNLG-KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
K L + + I LAA A + +VLLKN+N LPL++ +K +AL+GP A + +
Sbjct: 365 TKELTEQERYLQKEDIALAARLAEESMVLLKNENNLLPLSS-TVKRVALIGPMAKDSANL 423
Query: 413 IGNY------EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADA 466
+G + E Y M + ++Y GCA + + S AA+ A+ +D
Sbjct: 424 LGAWAFKGHAEDVETIYEG-MQKEFGDKVQLDYEQGCA-LDGNDESGFSAALKTAEASDV 481
Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V+ G E R + LP Q +L+ + A K P+ LV+ S +++ +
Sbjct: 482 VVVCLGESKQWSGENASRSTIALPDIQEKLLLHLKQANK-PIVLVLSSGRPLEL--IRLE 538
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSM--PLRPVN 583
P++++I+ + PG GG +A ++ G+ NP G+L +T + + +IP Y +M RP +
Sbjct: 539 PQVEAIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVT-FPLSTGQIPVYYNMRQSARPFD 597
Query: 584 NFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
Y+ +YPFG+GLSYT F Y S K +K+ K+Q
Sbjct: 598 AMG--DYQDIPTKPLYPFGHGLSYTTFVY---SDAKLSSLKIRKNQ-------------- 638
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVF 702
K T ++ V N GKM+G E V+ Y P + + +K++ +E+
Sbjct: 639 ---------------KITAEVTVTNAGKMEGKETVLWYVSDPFCSISRPMKELKFFEKHS 683
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ AG+S F ++ + L D L +G + VG
Sbjct: 684 LNAGESRVFRFEIDPMRDLSYTDATGKRFLEAGEFIVSVG 723
>gi|375143423|ref|YP_005005864.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361057469|gb|AEV96460.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 793
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 225/801 (28%), Positives = 360/801 (44%), Gaps = 137/801 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WSEAL--- 63
Y D K R DL+ +MTL EK QM L YG R+ LP W W + +
Sbjct: 43 YEDPKQSVNARTADLLSKMTLDEKTCQMATL-YGWHRVLKDSLPTDSWKNAIWKDGIANI 101
Query: 64 --HGVSFIGRRTNSPPGTHFDSE--------------------VPG-------------- 87
H F G +P D E +P
Sbjct: 102 DEHLNGFAGWGKTAPIDLVKDMEKHVWAMNETQRFFIEQTRLGIPADFTNEGIRGVEAYE 161
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRV 146
AT FPT + ++N+ L + G EARA+ G T ++P ++V RD RWGR+
Sbjct: 162 ATGFPTELNMGMTWNKELVHQEGIITGREARAL------GYTNVYAPIMDVARDQRWGRL 215
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
E+ GEDPY+V I +G+Q +D K+++ KH+A Y +
Sbjct: 216 EESYGEDPYLVASMGIALAKGIQ--------QDG-----KVASTAKHFAVYSANKGAREG 262
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
+ D +V ++++ + PF+ + E + VM SYN +GIP L Q +R +
Sbjct: 263 QARTDPQVAPREVENLLLYPFKKVIKEAGIMGVMSSYNDYDGIPVSGSNYWLIQRLRVEM 322
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDL----DCGDYYTNFTMGAVQ 322
F GY+VSD D+++ + H + KE AV + AG+++ D + V+
Sbjct: 323 GFTGYVVSDSDALEYLATKHHVAANLKE-AVFQAFMAGMNVRTTFKAPDSIIIYLRQLVK 381
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIV 380
+G+I I+ + + V RLG FD P ++ + + + ++A +A+R+ +V
Sbjct: 382 EGRIPMDTINHRVADVLRVKFRLGLFD-HPYVESAAETRKVVNSDASQQIALQASRESVV 440
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---SKVI 437
LLKN+N LPL ++ +A+VGP+A +Y + + G A KV+
Sbjct: 441 LLKNNNNILPL-VKSLDKIAVVGPNATDDDYAHTHYGPLGSPSVNVLQGIQAKLGAGKVL 499
Query: 438 NYAPGCADIVCQN---------------NSMIPAAIDAAKNADATVIVAGLDLSVEAEGK 482
YA G D+V +N +M+ +A++ K A ++V G + E K
Sbjct: 500 -YAKGV-DLVDKNWPESEILPEPMDAGEQAMLDSAVNITKQAQMAIVVLGGNTRTAGESK 557
Query: 483 DRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEG 542
R DL LPG Q EL+ + K PV +V++ + IN+ + I I++ GYPG +G
Sbjct: 558 SRTDLDLPGHQLELVKAIKATGK-PVVVVLLGTQPMTINWI--DKYIDGIVYAGYPGVKG 614
Query: 543 GRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFG 602
G A+ADV+FG YNPGG+L +TW ++ +IP + P +P + G ++YPFG
Sbjct: 615 GIAVADVLFGDYNPGGKLTLTWPKS-VGQIPL-NFPSKPGAQSDEGEHAKIKG-LLYPFG 671
Query: 603 YGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTF 662
+GLSYT F Y N + T K V +
Sbjct: 672 FGLSYTSFGY---------------------TNLKISTGKTAADPVAV------------ 698
Query: 663 QIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSL 721
++V N GK+ G EVV Y + T+ K + G+ERV + AG++ + FT+ + L
Sbjct: 699 TVDVTNTGKLAGDEVVQCYIRDVLSSVTTYEKLLKGFERVHLQAGETKTISFTI-PREEL 757
Query: 722 KIVDNAANSLLASGAHTILVG 742
K+ + +L G ++++G
Sbjct: 758 KLYNREMKFVLEPGEFSVMIG 778
>gi|423223874|ref|ZP_17210343.1| hypothetical protein HMPREF1062_02529 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392637823|gb|EIY31686.1| hypothetical protein HMPREF1062_02529 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 759
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 229/797 (28%), Positives = 369/797 (46%), Gaps = 139/797 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGL-------------------- 53
Y D+ P +R +DL++RMTL EKV QM GV +
Sbjct: 18 YKDSTAPVKDRVEDLLKRMTLEEKVGQMNQFV-GVEHIKANSAVMTEEELKNNTANAFYP 76
Query: 54 -----PLYEWWSEALHGVSFI----------------GRRTNSP-----PGTHFDSEVPG 87
+ +W E L G SF+ R P H ++ P
Sbjct: 77 GFTEKDIEKWTEEGLIG-SFLHVLTIEEANYLQSLAMKSRLQIPIIFGIDAIHGNANAPD 135
Query: 88 ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVL 147
T +PT I SF+ + KI + + E RAM N TF +PN+ V RD RWGRV
Sbjct: 136 NTVYPTNINLACSFDTLMAYKIARQTAKEMRAM----NMHWTF-NPNVEVARDARWGRVG 190
Query: 148 ETPGEDPYVVGRYAINYVRGLQ-DVEGVEYHRDSDSRPLKISACCKHY--AAYDLDNWEG 204
ET GEDPY+V + V+G Q D+ G E + AC KH+ + ++ G
Sbjct: 191 ETYGEDPYLVTLLGVQSVKGYQGDLNGNE----------DVLACIKHFVGGSEPINGTNG 240
Query: 205 NDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRG 264
+ + ++E+ ++E F PFE V G +S +M ++N +NGIP ++ L+ +RG
Sbjct: 241 SP-----TDLSERTLREVFFPPFEAGVKAGAMS-LMTAHNELNGIPCHSNEWLMQDILRG 294
Query: 265 DWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQ 323
+WNF G++VSD I+ I + H + KE A + + G+D+ G ++ + V++
Sbjct: 295 EWNFPGFVVSDWMDIEHIHDLHATAENLKE-AFYQSIMGGMDMHMHGIHWNEMVVELVRE 353
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGS-PQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
G+I E+ ID S+R + + RLG F+ K +C+ +H A E+AR GIVLL
Sbjct: 354 GRIPESRIDESVRRILDIKFRLGLFEQPYADEAETMKVRLCD-EHRATALESARNGIVLL 412
Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGF--YAYSKVIN 438
KND G LPL+ K + + G +A+ + ++G++ T+ ++G A +
Sbjct: 413 KND-GVLPLDASRYKKILVTGINAD-DQNILGDWSAPEKDENVTTILEGLKMIAPDTQFD 470
Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL-------SVEAEGKDRVDLLLPG 491
+ D + + A AK+AD ++VAG + + E DR DL L G
Sbjct: 471 FVDQGWDPRNMDPKKVAEAAVRAKSADLNIVVAGEYMMRFRWNDRTDGEDTDRSDLDLVG 530
Query: 492 FQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIF 551
Q ELI KVA + K P L++++ + + +A N + +I+ PG GG+A+A++++
Sbjct: 531 LQNELIEKVAASGK-PTILILVNGRPLGVQWAAEN--LPAIVEAWAPGMYGGQAVAEILY 587
Query: 552 GKYNPGGRLPITWYEANYVKIPYTSMPLRPV-NNFPGRTYKFFDG----PVVYPFGYGLS 606
GK NP +L IT IP++ L+ + N+ P + + + +YPFG+GLS
Sbjct: 588 GKVNPSAKLAIT--------IPHSVGQLQMIYNHKPSQYFHPYAAGKPSTPLYPFGHGLS 639
Query: 607 YTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEV 666
YT +KY D+KL + + +D V ++V
Sbjct: 640 YTTYKYD--------DLKLAQKEITKDGTVDVS------------------------VKV 667
Query: 667 ENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD 725
N G DG E+V +Y + + T +K++ + RV + AG+S V F + K L D
Sbjct: 668 TNTGDRDGVEIVQLYIRDKFSSVTRPVKELKDFARVSLKAGESQVVNFKITPDK-LAFYD 726
Query: 726 NAANSLLASGAHTILVG 742
++ G ++VG
Sbjct: 727 KKMKKIVEPGEFIVMVG 743
>gi|346226088|ref|ZP_08847230.1| glycoside hydrolase family 3 domain protein [Anaerophaga
thermohalophila DSM 12881]
Length = 749
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 215/726 (29%), Positives = 336/726 (46%), Gaps = 100/726 (13%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+P+ + +L R DL+ RMTL EKV + VPRLG+ E HGV+ G
Sbjct: 52 YPFQNPELDSEARIDDLLSRMTLDEKVSALSTDP-SVPRLGVKGAPH-IEGYHGVAMGGP 109
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN---LGNAGL 128
+P G D VP T+FP A++N L + G+ S EAR ++ + GL
Sbjct: 110 ANWAPKG---DEAVP-TTTFPQAYGMGATWNPELIRLAGEIESIEARYIFQNPEIAKGGL 165
Query: 129 TFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKIS 188
+PN ++ RDPRWGR E GEDP++VG A + +GLQ D + + +
Sbjct: 166 VVRAPNADLGRDPRWGRTEECFGEDPFLVGTSATAFTKGLQ---------GDDDQYWRTA 216
Query: 189 ACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNG 248
+ KH+ A +N + FD ++ E + F EG ++ M +YN +NG
Sbjct: 217 SLLKHFLANSNENGRESSSSDFDMQL----YHEYYGASFRRAFIEGGSNAYMAAYNAING 272
Query: 249 IPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD 308
+P + + W G +D Q +V HK+ +D A V+KAGL+
Sbjct: 273 VPAHVH-DMHKEITERMWGVDGIKCTDGGGYQLLVYGHKYYDDLYL-AAEGVIKAGLN-Q 329
Query: 309 CGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ----YKNLGKNNICN 364
D Y GA+ G I EADID LR +Y V+++LG D PQ Y +G++
Sbjct: 330 FLDNYREGVYGALAHGYITEADIDEVLRGVYRVMIKLGQLD--PQEKVPYSAIGRDGKPA 387
Query: 365 P----QHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
P +H + A AR+ IVLLKN+N LPLN + +A++G A+ ++ Y G P
Sbjct: 388 PWTTQKHKDAALRMARESIVLLKNNNKTLPLNADKLNKVAVIGYLADTV--LLDWYSGLP 445
Query: 421 CRYTSPMDGFYAY----SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLS 476
+P++G SKV+ YAP ++ AA++AA AD +++ G +
Sbjct: 446 PYRITPLEGIREKLGNDSKVL-YAP---------DNDYNAAVEAASEADVAIVILGNYPT 495
Query: 477 VEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
+E G++ +D E + K+ A V+ S+ IN+++ N
Sbjct: 496 CNSEIWADCPDPGMGREAIDRKTLRLTDEYLVKLVMEANPNTIFVLQSSFPYAINWSQQN 555
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFP 586
+ +IL + + G+E G A+ADV+FG YNPGG+L TW ++ +R
Sbjct: 556 --VPAILHLTHNGQETGSALADVLFGDYNPGGKLTQTWPKSEDQLPDMMEYDIR-----K 608
Query: 587 GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
G TY +F+ +YPFG+GLSYT F ++ + NKP +
Sbjct: 609 GHTYMYFEDKPLYPFGHGLSYTTFAWE-----------------------DISINKPVVS 645
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAA 705
A D + ++++N G + G EVV +Y S P K + G++RV +
Sbjct: 646 A--------DDEEVIITVKLKNTGDVKGDEVVQLYASFPESTVRRPAKALKGFKRVTLEP 697
Query: 706 GQSAKV 711
G+ K+
Sbjct: 698 GEKKKI 703
>gi|410097652|ref|ZP_11292633.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
CL02T12C30]
gi|409223742|gb|EKN16677.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
CL02T12C30]
Length = 780
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 234/814 (28%), Positives = 365/814 (44%), Gaps = 157/814 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQM------------------GDLAYGVPRLGLPL 55
Y A P +R KDL+ RMT+ EKV Q+ DL Y +P+
Sbjct: 25 YKQATAPVEDRVKDLIGRMTVEEKVGQLCCPLGWEMYTKTTNGVVASDL-YKERMKTMPI 83
Query: 56 YEWWS-------------------------EALHGVSFIGRRTNSPPGTHFDSEVP---- 86
+W+ AL + R P F E P
Sbjct: 84 GSFWAVLRADPWTQKTLETGLNPELSAKALNALQKYAVEETRLGIP--VLFAEECPHGHM 141
Query: 87 --GATSFPTVILTTASFNESLWKKIGQTVSTEARAM-YNLGNAGLTFWSPNINVVRDPRW 143
G T FPT + +++N L ++G+ ++ EAR+ N+G + P +++ R+PRW
Sbjct: 142 AIGTTVFPTSLSQASTWNAELMHRMGEAIALEARSQGANIG------YGPVLDIAREPRW 195
Query: 144 GRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWE 203
R+ ET GEDP + + +++G+Q +D + L + KH+AAY +
Sbjct: 196 SRMEETFGEDPVLTTHLGVAFMKGMQG------KSQNDGKHL--YSTLKHFAAYGIPEAG 247
Query: 204 GNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIR 263
N + V + + ++ PF+ V EG V+++M SYN ++G+P ++ LL +R
Sbjct: 248 HNGA---RANVGMRQLFSDYLPPFKKAVEEG-VATIMTSYNTIDGVPCTSNKYLLTDVLR 303
Query: 264 GDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQ 322
W F G++ SD SI+ IV + + D KE AV LKAGLD+D G + Y A++
Sbjct: 304 DQWGFKGFVYSDLTSIEGIVGA-RVAKDNKEAAVL-ALKAGLDMDLGGNAYGKNLQKALE 361
Query: 323 QGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLL 382
+G I D++ ++ + + R+G F+ K + + H ELA E AR+GIVLL
Sbjct: 362 EGAITMDDLNRAVANVLRLKFRMGLFENPYVSPEQAKQVVRSKAHKELAREVAREGIVLL 421
Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR--YTSPMDGF---YAYSKVI 437
KN+ G LPL NI +A++GP+A+ +G+Y R + +DG + S +
Sbjct: 422 KNE-GVLPLKK-NIGNIAVIGPNADMMYNQLGDYTAPQEREEIVTVLDGIRKAVSPSTKV 479
Query: 438 NYAPGCA--DIVCQNNSMIPAAI------------DAAKNADATVIVAGL-DLSVEA--- 479
NY GCA DI N + A +A++ I G D+S +
Sbjct: 480 NYVKGCAIRDITTSNITAAVEAARAADAVVLVVGGSSARDFKTKYIGTGAADVSNDGNQL 539
Query: 480 -------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q +L+ VA K P+ ++ + +++N A + K +++
Sbjct: 540 LSDMDCGEGYDRSTLRLLGDQEKLLKAVAATGK-PLVVIYIQGRTLNMNLA--SEKAQAL 596
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG--RTY 590
L YPGE+GG AIADV+FG YNP GRLP++ +P + L P+ G R Y
Sbjct: 597 LTAWYPGEQGGTAIADVLFGDYNPAGRLPVS--------VPRSEGQL-PLFYSQGKQRAY 647
Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+G +Y FGYGLSYT+F Y K
Sbjct: 648 VEEEGTPLYAFGYGLSYTKFDYSQLEMQKG------------------------------ 677
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--SKPPGIAGTHIKQVIGYERVFIAAGQS 708
KD T V N G DG EVV +Y K ++ + I + +ER+ + G+S
Sbjct: 678 ---NGKDVLQTVSCTVTNTGDCDGEEVVQLYICDKVASVSQSPI-LLKAFERISLKKGES 733
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
KV FT+ + L + + ++ G ++VG
Sbjct: 734 KKVTFTLGE-EELSLYNMEMKQVVEPGDFKVMVG 766
>gi|329957143|ref|ZP_08297710.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523411|gb|EGF50510.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 803
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 231/813 (28%), Positives = 366/813 (45%), Gaps = 153/813 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
Y D P +R DL+ +M++ EK Q+ L YG R+ LP+ W W
Sbjct: 44 YEDPSQPVEKRVADLLSQMSVEEKTCQLATL-YGYGRVLKDSLPVAGWKNEIWKDGIANI 102
Query: 61 -EALHGVS--------------------------FIGRRTNSPPGTHFDSEVPG-----A 88
E L+GV F+ P + + G A
Sbjct: 103 DEMLNGVGKKSALVPDLLYPFSNHAEAVNTVQRWFVEETRLGIPVDFTNEGIHGLNHTKA 162
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
T P I +++N+ L ++ G EA+A+ G T ++P ++VVRDPRWGR L
Sbjct: 163 TPLPAPIAIGSTWNKELVRRAGVIAGQEAKAL------GYTNVYAPILDVVRDPRWGRTL 216
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GE+P+++ V G+Q +GV +A KHYA Y + +
Sbjct: 217 ECYGEEPFLIAALGTEMVNGIQS-QGV-------------AATLKHYAVYSVPKGGRDGH 262
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D V +++ E F+ PF+ + VM SYN +G+P A L + +R ++
Sbjct: 263 CRTDPHVAPRELHELFLYPFKKVIQNSHPMGVMSSYNDWDGVPVSASYYFLTELLREEYG 322
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA------- 320
F GY+VSD +++ VES + DT ++AV +VL+AGL++ T+FT +
Sbjct: 323 FDGYVVSDSQAVE-FVESKHHVADTYDEAVRQVLEAGLNV-----RTHFTPPSDFILPIR 376
Query: 321 --VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG-KNNICNP-QHIELAAEAAR 376
+++ KI+ A ID + + V RLG FD P + G +N+ ++++ E +
Sbjct: 377 RLLEEKKISMATIDKRVSEVLRVKFRLGLFD-RPYVTDTGAADNVGGADRNMDFVKEMQQ 435
Query: 377 QGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK- 435
Q +VLLKN+N LPL+ IK + + GP A+ M Y + + G AY +
Sbjct: 436 QALVLLKNENNILPLDKQRIKKVLVTGPLADEDNFMTSRYGPNGLETVTVLAGLRAYLQG 495
Query: 436 --VINYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVE 478
++YA GC DIV + I A+ A +D + V G D
Sbjct: 496 VAEVDYAKGC-DIVDAGWPATEILPVPMNEREKRGIAEAVAKAGESDVVIAVLGEDEYRT 554
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
E + R L LPG Q +L+ + K PV LV+++ + +N+A N I +IL +P
Sbjct: 555 GESRSRTSLDLPGRQQQLLEALHATGK-PVILVLINGQPLTVNWA--NAYIPAILESWFP 611
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITW------YEANYVKIP--YTSMPLRPVNNFPGRTY 590
G +GG IA+ +FG++NPGG+L +T+ E N+ P + S P + N G T
Sbjct: 612 GCQGGTVIAETLFGEHNPGGKLTVTFPKSVGQIELNFPFKPGSHGSQP-KSGPNGSGATR 670
Query: 591 KFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
+ +YPFG+GLSYT F Y D+++ +Q YTV N
Sbjct: 671 VIGE---LYPFGFGLSYTTFAYS--------DLEVSPLRQRTQGEYTVKVN--------- 710
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSA 709
V N GK G EVV +Y + T+ Q+ G+ERV + G++
Sbjct: 711 ---------------VTNTGKRAGDEVVQLYVRDKVSSVITYDSQLRGFERVSLKPGETR 755
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+V F++ + L+I+D N + G +++G
Sbjct: 756 QVTFSLKP-EDLQILDRNMNWTVEPGEFEVMIG 787
>gi|399578325|ref|ZP_10772073.1| glycoside hydrolase family 3 domain protein [Halogranum salarium
B-1]
gi|399236488|gb|EJN57424.1| glycoside hydrolase family 3 domain protein [Halogranum salarium
B-1]
Length = 778
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 226/803 (28%), Positives = 360/803 (44%), Gaps = 134/803 (16%)
Query: 24 RAKDLVERMTLPEKVQQMGD---------------------LAYGVPRL-------GLPL 55
R DL+ERMTL EK Q+G L+ G+ L LP
Sbjct: 19 RVADLLERMTLAEKAAQLGSVNAEKLLTDDGTLDEDAVDEHLSAGIGHLTRIGGEGSLPP 78
Query: 56 YEWWSEALHGVSFIGRRTN-SPPGTHFDSEV-----PGATSFPTVILTTASFNESLWKKI 109
E +++ T P T + + P AT+FP +I ++++ L + +
Sbjct: 79 REAAERTNELQTYLREETRLGIPATPHEECLSGYMGPEATTFPQMIGMASTWSPELLETV 138
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
T+ + A +G A SP ++V RD RWGRV ET GEDPY+V A YV GLQ
Sbjct: 139 TGTIREQLEA---IGTA--HALSPVLDVARDLRWGRVEETFGEDPYLVAAMACGYVGGLQ 193
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
D D ISA KH+ + G +R + + ++++ET + PFE
Sbjct: 194 G--------DGDG----ISATLKHFVGHSAGEG-GKNRSSVN--IGRRELRETHMFPFEA 238
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
+ D SVM +Y+ V+GIP +D LL +RG+W F G +VSD S++ + H
Sbjct: 239 TIRTADAESVMNAYHDVDGIPCASDEWLLTDVLRGEWGFDGTVVSDYYSVEFLRSEHGVA 298
Query: 290 NDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
D +E VA V +AG+D++ D Y + AV+ G ++EA +D S+R + + G
Sbjct: 299 ADEQEAGVAAV-EAGIDVELPYTDCYGEHLVDAVEAGVLSEATLDESVRRVLRMKAEKGL 357
Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
D + + +L AAR+ + LLKN++ LPL + ++A+VGP A+
Sbjct: 358 LDDATVDPETAAEPFGTEEADDLTTRAARESMTLLKNEDDLLPLVGDDTDSVAVVGPKAD 417
Query: 408 ATKAMIGNYEGTPCRY---------TSPMDGFYA----YSKVINYAPGCADIVCQNNSMI 454
+ ++G+Y P Y T+P+D A Y + + GC + +
Sbjct: 418 DAQELMGDY-AYPAHYPEEEVEFDATTPLDALRARGEEYGFDVLHEQGCTTTGPETDGFD 476
Query: 455 PAAIDAAKNADATVIV---AGLDLS-------------VEAEGKDRVDLLLPGFQTELIN 498
AA A+ A V + +D S EG D VDL LPG Q EL+
Sbjct: 477 AAAHAASDADVALAFVGARSAVDFSDSDRERVNMPSVATSGEGCDVVDLGLPGVQAELVG 536
Query: 499 KVADAAKGPVTLVIMSAGAVDI-NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
++ + P+ +V++S I + A++ P + W+ PGE GG +A V+FG++NPG
Sbjct: 537 RLGE-TDTPLVVVVVSGKPHSIESIAESVPAVVQA-WL--PGERGGEGVASVLFGEHNPG 592
Query: 558 GRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVAS 616
G LP++ + + Y P N Y + + +YPFG+GLSYT+F+Y
Sbjct: 593 GHLPVSIPRSVGQLPVHYNRKP-----NTANEEYVYTESDPLYPFGHGLSYTEFEYG--- 644
Query: 617 SPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSE 676
D+ L ++ PP V T + VEN G G +
Sbjct: 645 -----DLTLSTEE------------LPPAGTV------------TATVTVENTGDRAGHD 675
Query: 677 VVMVYSKP--PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
VV +Y++ P A +++++G+ERV + AG++ +V F + A L D + +
Sbjct: 676 VVQLYARAVNPDQA-RPVQELVGFERVRLEAGETVQVEFEV-AADQLAYHDRDMDLAVEE 733
Query: 735 GAHTILVGEGVGGVSFPLQLNLN 757
G + VG ++ L +
Sbjct: 734 GPYEFRVGHSAADITSTASLAVT 756
>gi|404405497|ref|ZP_10997081.1| glycoside hydrolase family protein [Alistipes sp. JC136]
Length = 804
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 225/814 (27%), Positives = 349/814 (42%), Gaps = 154/814 (18%)
Query: 13 PYCDAKLPYPERAKDLVERMTLPEKVQQMGDL-AYG-VPRLGLPLYEW----WSEALHGV 66
PY D ER +DL+ +MTL EK Q+ L YG V R LP W W + + +
Sbjct: 46 PYEDPARSLDERVEDLLGQMTLEEKSCQLATLYGYGRVLRDSLPTERWKNEVWKDGIANI 105
Query: 67 ----SFIGRRTNSPPGTHFDSEVPG----------------------------------- 87
+ +G+ + P H S+ G
Sbjct: 106 DEMLNGVGKCLRTTP--HLVSDYTGHVEAKNTIQRWFVEQTRLGIPVEFTNEGIHGLNHS 163
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
AT P I +++N +L + G+ EAR LG + ++P ++V RDPRWGRV
Sbjct: 164 RATPLPAPIAIGSTWNRALVHRAGEIAGHEARV---LGYKNV--YAPILDVARDPRWGRV 218
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
+E GEDP+++ + VRG+Q +GV ++ KHYAAY + +
Sbjct: 219 VECYGEDPFLIAELGVEMVRGIQS-QGV-------------ASTLKHYAAYSVPKGGRDG 264
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
D + +++ + ++ PF + E VM SYN +G+P A L +R ++
Sbjct: 265 NCRTDPHIAPRELHQMYLYPFRRVIRESGPMGVMSSYNDWDGVPVTASRYFLTDLLRHEY 324
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA------ 320
F GY+VSD ++++ + H + +T EDAV +VL+AGL++ TNF+ A
Sbjct: 325 GFDGYVVSDSEAVEYVHTKHA-VAETYEDAVRQVLEAGLNV-----RTNFSPPARFILPV 378
Query: 321 ---VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQ 377
V++G+++ +D +R + V RLG FD +H + + RQ
Sbjct: 379 RKLVREGRLSMEVVDQRVREVLRVKFRLGLFDNPYNDPREAVAEAGADKHRDFVLDIQRQ 438
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY---S 434
+VLLKN++ LPL+ + + GP A+ MI Y + +DG Y
Sbjct: 439 SLVLLKNEDKTLPLDKKKTARVLVAGPLADEDNFMISRYGPNDLPTVTVLDGIRNYLGDG 498
Query: 435 KVINYAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
+ YA GC + + I A+ A D V V G D E
Sbjct: 499 AEVRYAKGCDVVDAGFPDSELTATPLTAAERAGINEAVKQAAGCDVIVAVLGEDDERVGE 558
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
R L LPG Q +L+ + A PV LV+++ + +N+A N + +IL +P
Sbjct: 559 SHSRTSLELPGRQQQLLEAL-HATGVPVVLVLINGQPLTVNWAAQN--VPAILEGWFPSV 615
Query: 541 EGGRAIADVIFGKYNPGGRLPITWYEAN--------YVKIPYTSMPLRPVNNFPGRTYKF 592
EGG AIA+ +FG YNPGG+L IT+ + Y K + + P + N G +
Sbjct: 616 EGGTAIAETLFGDYNPGGKLTITFPRSTGQIELNFPYKKGSHGAQPRKGPNG--GGVTRV 673
Query: 593 FDGPVVYPFGYGLSYTQFKYK---VASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVL 649
+YPFGYGLSYT F YK +A P + T G+ + C
Sbjct: 674 LGS--IYPFGYGLSYTTFAYKNLRIAPEP----------------SRTQGSFRVSC---- 711
Query: 650 IDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQS 708
EV N G G EVV +Y S T+ + G+ERV + G++
Sbjct: 712 ---------------EVTNTGDRRGDEVVQLYISDKFSSVVTYESVLRGFERVTLEPGET 756
Query: 709 AKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
V F + L+++D+ N + G I +G
Sbjct: 757 KTVSFEVTPSH-LELLDSNMNWTVEPGEFEIRIG 789
>gi|409198206|ref|ZP_11226869.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
Length = 775
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 195/658 (29%), Positives = 329/658 (50%), Gaps = 82/658 (12%)
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
T+FP + S++ L +K + + EA A +G+ + ++P I++ RDPRWGRV+
Sbjct: 129 TTFPIPLAEACSWDLELMEKSARIAAEEATA------SGVAWNFAPMIDIGRDPRWGRVM 182
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GED Y+ + A V G Q G+E + D S+ + A KH+ Y G D
Sbjct: 183 EGAGEDVYLATQVARARVIGFQ---GIEDYTDL-SQSNTMMATSKHFVGYGA-ALAGRDY 237
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D ++E+++ ETF+ PF+ V+EG V+S M ++N +NG+P + L + +R W
Sbjct: 238 QSVD--MSERELHETFLPPFKATVDEG-VASFMTAFNDLNGVPCTGNQYLFKEILRDRWG 294
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
F G +V+D +I +V +H F D K A + AG+D+D + + V++G +
Sbjct: 295 FGGMVVTDYTAIMEMV-AHGFAKDLKH-AAELAIDAGIDMDMISEAFVTHLKELVEEGDV 352
Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIVLLKN 384
+E ID ++ + + LG FD +Y + + + NP+H++ A EAA++ IVLLKN
Sbjct: 353 SEEQIDVAVSRILEMKFLLGLFDDPFRYFDAERQQEVVMNPEHLKTAREAAQRSIVLLKN 412
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF---YAYSKV-IN 438
+ LPL+ K +AL+GP +++ G + +G + + ++G Y S+V
Sbjct: 413 EGNVLPLDKNTSKRVALIGPFVKERESLNGEWAIKGDRNKSVTLLEGLEEKYDGSRVEFT 472
Query: 439 YAPGCA----DIVCQNNSM--------IPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
YA G D Q S+ A++ A+N+D ++ G + E R D
Sbjct: 473 YAQGTTLPLIDRSTQKVSVTEVPDRRGFAEAVNVARNSDVIMVAMGENYHWSGEAASRTD 532
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
+ LPG Q EL+ ++ K P+ LV+ + +D+++ + N + +I+ YPG G A+
Sbjct: 533 ITLPGNQRELLKELKKTGK-PIVLVLFNGRPLDLSWEEEN--VDAIVEAWYPGMMSGHAV 589
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPL--RPVNNFPGRTYK--FFDGP--VVY 599
AD++ G YNP +L +T + N +IP + +M RP + Y+ + D P ++
Sbjct: 590 ADILSGDYNPSAKLVMT-FPRNVGQIPIFYNMKNTGRPFDAEHPADYRSSYIDSPNTPLF 648
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFGYGLSYT F+Y A I DK Q +
Sbjct: 649 PFGYGLSYTTFEYANAK------ISSDKFQSGSSL------------------------- 677
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMN 716
T +EV N G +DG EVV +Y + G +K++ G+E++ + AG++ V F+++
Sbjct: 678 -TASVEVTNTGDLDGEEVVQLYLRDRVGSVVRPVKELKGFEKIHLKAGETKTVEFSID 734
>gi|242206820|ref|XP_002469265.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
gi|220731725|gb|EED85567.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
Length = 312
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 138/295 (46%), Positives = 172/295 (58%), Gaps = 21/295 (7%)
Query: 15 CDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTN 74
CD ERA L+ TL EK+ G+ A GVPRLGLP Y+WW EALHGV+
Sbjct: 34 CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALHGVA------- 86
Query: 75 SPPGTHF--DSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
PG F E ATSFP IL A+F+++L + VSTEARA N +G+ FW+
Sbjct: 87 ESPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGIDFWT 146
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
PNIN +DPRWGR ETPGEDP+ + Y N + GLQ EY R I A CK
Sbjct: 147 PNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQGGLDPEYKR--------IVATCK 198
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+AAYDL+NWEGN R+ FD+ V+ QD+ E + F C + +V S MCSYN VNG+P+C
Sbjct: 199 HFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGVPSC 258
Query: 253 ADPKLLNQTIRGDW---NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAG 304
A+ LL +R W N YI SDCD+IQ I E H + T+ + VA L AG
Sbjct: 259 ANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPH-YYTATRAETVADALNAG 312
>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
Length = 873
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 158/422 (37%), Positives = 224/422 (53%), Gaps = 48/422 (11%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FP+ + +L R DLV RMTL EK+ Q+ A + RL +P Y WW+E+LHGV+ G
Sbjct: 24 FPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYNWWNESLHGVARAGY 83
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG------- 124
AT FP I AS++ L +++ +S EARA ++
Sbjct: 84 ----------------ATVFPQSISIAASWDAQLVREVATAISDEARAKHHEYLRRDQHD 127
Query: 125 -NAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLT WSPNIN+ RDPRWGR ET GEDP++ G YV+GLQ D
Sbjct: 128 IYQGLTMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQ---------GDDPE 178
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK+ A KH+A + R +FD+ +E+D+ ET++ F M V + V SVM +Y
Sbjct: 179 YLKVVATAKHFAVHSGPE---ESRHYFDANTSERDLWETYLPAFRMLVKDAQVQSVMTAY 235
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
NR G ++ KLL +R W F GY+VSDC +I I E HK + A A L+
Sbjct: 236 NRFRGEAASSN-KLLFDILRNKWGFDGYVVSDCGAINDIWEDHK-ITADAASASALALET 293
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNI- 362
G DL+CG Y + A+ G I E I+ ++ L+ ++LG FD +NL I
Sbjct: 294 GTDLNCGATYKSLKE-AIANGLITEEKINIAIERLFRARLKLGMFDTE---ENLSYATIP 349
Query: 363 ----CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
N H LA +AA++ IVLLKN+ LPL + ++K +A++GP+A+ +++ GNY G
Sbjct: 350 FSVNTNASHTALARKAAQESIVLLKNEAHMLPL-SKDLKQIAVIGPNAHNVQSLWGNYNG 408
Query: 419 TP 420
TP
Sbjct: 409 TP 410
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 94/302 (31%), Positives = 152/302 (50%), Gaps = 54/302 (17%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAEGK----------DRVDLLLPGFQTELINKVADA 503
+ A++ A+++D T++V GL+ +E E DR L LP Q EL+ +
Sbjct: 589 LERAVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQRELMRALVAT 648
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K P+ LV+++ A+ IN+A+ + + +IL GYPG+EGG AIADV+FG YNP GRLP+T
Sbjct: 649 GK-PIVLVLLNGSALAINYAQEH--VPAILSAGYPGQEGGNAIADVLFGDYNPAGRLPVT 705
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDI 623
+Y++ +P + GRTY++F+G +YPFGYGLSYTQF Y + +
Sbjct: 706 YYKS------VDDLPDFEDYSMKGRTYRYFEGEALYPFGYGLSYTQFSYDAIKTSGRL-- 757
Query: 624 KLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSK 683
D Q+ V N G DG EVV +Y K
Sbjct: 758 -------------------------------AADKVLNVQVTVTNSGDRDGDEVVQLYLK 786
Query: 684 PPGIAGTHIK-QVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ T + Q++G++R+ + G++ V F ++A + ++++ ++ G T+ G
Sbjct: 787 DEVASTTRPQVQLVGFKRIHLQKGETQTVEFRLDA-RQFSMINDQEQLVVEPGWFTLYAG 845
Query: 743 EG 744
G
Sbjct: 846 GG 847
>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 862
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 156/429 (36%), Positives = 228/429 (53%), Gaps = 38/429 (8%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY + L RAKDLV R+TL EKV M D++ VPRLG+ + WWSEALHG +
Sbjct: 22 LPYQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGYA---- 77
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
N P T FP + ASF++ + VS EARA N
Sbjct: 78 --NQGP----------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQ 125
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
L+ W+PN+N+ RDPRWGR ET GEDPY+ R ++ V+GLQ +D++
Sbjct: 126 RFHDLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQG--------PADAK 177
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
K+ AC KHYA + W ++ D VT +D+ ET++ F+ V + DV VMC+Y
Sbjct: 178 YRKLLACAKHYAVHSGPEWSRHEMNVTD--VTPRDLWETYLPAFKSLVQDADVREVMCAY 235
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
R++ P C + +LL Q +R DW F +VSDC +I SH +D A A+ + +
Sbjct: 236 QRLDDEPCCGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSDATH-ASAKAVLS 294
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGKNN 361
G D++C Y + AV +G I E DI+TS+ L LG D + + +
Sbjct: 295 GTDVECVGYAFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKDELVPWTKIPLSV 354
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ + H +LA + AR+ + LL+N+N LPL + +I LA++GP+AN ++ + GNY GTP
Sbjct: 355 VNSEDHQKLALDMARETMTLLQNNNNILPL-SKSIGKLAVIGPNANDSQMLSGNYNGTPL 413
Query: 422 RYTSPMDGF 430
R + ++G
Sbjct: 414 RTINILEGI 422
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 134/298 (44%), Gaps = 55/298 (18%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
I+ K+AD V V G+ +E E G DR D+ LP Q I + A K
Sbjct: 593 IEKVKDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQRNCIEALRKAGK-- 650
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
+V ++ I +IL Y GE GG+A+ADV+FG YNP G LP+T+Y
Sbjct: 651 -KIVFVNCSGSAIAMVPETQNCDAILQAWYAGESGGQAVADVLFGDYNPSGHLPVTFYR- 708
Query: 568 NYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
N ++P ++ ++ GRTY++ ++PFG+GLSYT F A KL
Sbjct: 709 NVQQLPDFSDYSMK------GRTYRYLKSAPLFPFGFGLSYTTFNIGEA--------KLT 754
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
K+ N T G ++ V N GK DG+E++ VY +
Sbjct: 755 KN------NITKGE------------------AIQLRVPVANAGKTDGTELLQVYIRKVD 790
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVD-NAANSLLASGAHTILVGE 743
K + G++R+ ++AG++ V + K+ + D A ++ G + +L GE
Sbjct: 791 DPDGASKTLRGFKRIPVSAGKTEMVTLDL-PPKTFEFFDPTDAVVRVSPGNYQLLYGE 847
>gi|389696043|ref|ZP_10183685.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
gi|388584849|gb|EIM25144.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
Length = 751
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 210/746 (28%), Positives = 349/746 (46%), Gaps = 94/746 (12%)
Query: 24 RAKDLVERMTLPEKVQQMGDLAYGVP---------RLGLPLYEWWSEALHGVSFIGRRTN 74
R +L+ RMTL EKV Q+ +++G P + G L +E + + R ++
Sbjct: 39 RVNELLGRMTLEEKVGQLNLVSHGPPLRWEDISEGKAGAVLNFNSAEDVARAQALVRESH 98
Query: 75 SPPGTHFDSEVPGA--TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWS 132
F +V T FP + A+F+ + + + + EA + TF +
Sbjct: 99 LKIPLLFGLDVLHGFRTQFPLPLGEAAAFSPRVSRLASEWAAREASYV----GVNWTF-A 153
Query: 133 PNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCK 192
P ++ RD RWGR++E GEDP + V G R ++A K
Sbjct: 154 PMADLSRDSRWGRIVEGFGEDPTLGAALTAARVEGF--------------RKGGLAAAAK 199
Query: 193 HYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTC 252
H+A Y R + + + +M +T++ PF V G +S M ++N +NG P+
Sbjct: 200 HFAGYGAPQ---GGRDYDTTYIPRAEMYDTYLPPFRAAVEAG-TASFMAAFNALNGEPST 255
Query: 253 ADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GD 311
A+P LL +R W F G++ SD I +V +H D E A +L AG+D+D G
Sbjct: 256 ANPWLLTDVLRTQWGFDGFVTSDWVGIGELV-NHGIAADGAEAARKAIL-AGVDMDMMGQ 313
Query: 312 YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELA 371
Y N V+ G++ E+ ID S+R + RLG FD + + +P+ + A
Sbjct: 314 LYINHLPDEVRAGRVPESVIDESVRRVLRTKFRLGLFDRPDVDSSHLDSEFPSPESRQAA 373
Query: 372 AEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDG 429
E AR+ VLL+N + LP+ + ++++A+VGP A+A + +G + G + ++G
Sbjct: 374 REVARETFVLLQNRDDVLPIPS-KVRSIAVVGPLADAPQDQMGPHAARGHKEDSVTILEG 432
Query: 430 FYAYSK----VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
++ + +APGC D+ C+N +P A++AA+ +D + V G + E R
Sbjct: 433 IRRRAQSAGIAVRHAPGC-DLFCRNTDALPGALEAARQSDFVIAVFGEPQELSGEAASRA 491
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
++ L G Q E++ ++A K PV LVIM G +I SIL YPG E G A
Sbjct: 492 NMELNGKQIEVLEELAKTGK-PVALVIM--GGRPQVLGPVADRIPSILMAWYPGTEAGPA 548
Query: 546 IADVIFGKYNPGGRLPITWYEAN------YVKIPYTSMPLRPVNNFPGRTYKFFDGPV-- 597
+ADV+FG +P G+LP+TW A Y ++P T P N F T + D +
Sbjct: 549 VADVLFGDVSPSGKLPLTWPRATGQLPLYYNRLP-TGRPTLANNRF---TLHYIDESIAP 604
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
+YPFG+GLSYT F Y S + +LD+ Q
Sbjct: 605 LYPFGWGLSYTHFAY---SDARIASRQLDEGQ---------------------------- 633
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMN 716
++V+N G DG EVV +Y++ P + + ++++ +E++ + +G++ +V +
Sbjct: 634 -VLEVSLDVKNTGARDGQEVVQLYTRDPVASRSRPLRELKAFEKIALKSGETKRVTLRV- 691
Query: 717 ACKSLKIVDNAANSLLASGAHTILVG 742
+SL + L+ +GA + VG
Sbjct: 692 PVESLGFHLDDGTYLVEAGAIQVFVG 717
>gi|387789382|ref|YP_006254447.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
3403]
gi|379652215|gb|AFD05271.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
3403]
Length = 771
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 211/725 (29%), Positives = 339/725 (46%), Gaps = 131/725 (18%)
Query: 35 PEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTV 94
PEK+++ +LA RL +P+ + S+ +HG T+FP
Sbjct: 89 PEKIRKAQELAVNKSRLKIPMI-FGSDVIHG---------------------HKTTFPIP 126
Query: 95 ILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGED 153
+ AS+N L +K Q + EA A GL + +SP ++V RDPRWGR+ E GED
Sbjct: 127 LGLAASWNIELIEKSAQIAAKEATA------DGLNWVFSPMVDVARDPRWGRIAEGSGED 180
Query: 154 PYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR 213
PY+ A V+G Q ++ S + AC KH+A Y G D D
Sbjct: 181 PYLGSLIAKAMVKGYQG-------DNTYSSATNLMACVKHFALYGAAE-AGRDYNSVD-- 230
Query: 214 VTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIV 273
++ Q M E ++ P++ V G V SVM S+N V G+P + LL +R W F+G +V
Sbjct: 231 MSRQKMYEFYLPPYKAAVEAG-VGSVMSSFNEVEGVPATGNQWLLTDLLRKQWGFNGMVV 289
Query: 274 SDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKIAEADID 332
SD S+ ++E H N + A+A +KAGLD+D G+ Y + ++Q+GK++E DI+
Sbjct: 290 SDYTSVNEMME-HGMGNLQEVSALA--IKAGLDMDMVGEGYLSTLQKSLQEGKVSETDIN 346
Query: 333 TSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQHIELAAEAARQGIVLLKNDNGALP 390
+ R + +LG F ++ N + I Q + + EAA + VLLKN+ LP
Sbjct: 347 LACRRILEAKYKLGLFSDPYKFINEKRAATEILTTQSLSFSREAATRSFVLLKNEKQVLP 406
Query: 391 L-NTGNIKTLALVGPHANATKAMI------GNYEGTPCRYTSPMDGFYAYSKVINYAPGC 443
L TG T+AL+GP A++ + M+ GN++ + M+ ++KV+ YA G
Sbjct: 407 LKKTG---TIALIGPLADSKRNMLGTWAVSGNWKTSVSVKEGLMNAVGTHAKVL-YAKGA 462
Query: 444 ------------------ADIVCQNN-SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
DI +++ ++ A+ A+ +D ++ G + E R
Sbjct: 463 NISDDSAFARRVNTFGVEIDIDKRSSKELLDEALSIAQQSDVIIVAVGEAADMSGEAASR 522
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
D+ +P Q EL+ + K PV +V+ + + +++ N + +IL V PG + G
Sbjct: 523 TDINIPESQKELLKALVQTGK-PVVMVLFNGRPLTLSW--ENEHLNAILDVWAPGHQAGN 579
Query: 545 AIADVIFGKYNPGGRLPITWYEANYVKIPY------TSMPLRPVNNFPGRTYKFFDGPVV 598
AIADV+FG YNP G++ +T + N ++P T P N F + D +
Sbjct: 580 AIADVLFGDYNPSGKITVT-FPKNVGQVPMYYNHKNTGRPYDDRNRFTSKYLDMPDNAPM 638
Query: 599 YPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDY 658
YPFGYGLSYT F+Y D+ +D+D T KP
Sbjct: 639 YPFGYGLSYTTFQYG--------DVTIDQD-----------TIKP-------------GE 666
Query: 659 KFTFQIEVENMGKMDGSEVVMVYSK-------PPGIAGTHIKQVIGYERVFIAAGQSAKV 711
T ++ + N G DG E V +Y + PP +K + G++++ + G+S V
Sbjct: 667 TITAKVTITNTGNYDGVETVQLYIQDVIASVAPP------VKTLKGFKQISLKKGESKVV 720
Query: 712 GFTMN 716
F ++
Sbjct: 721 EFVIS 725
>gi|373460605|ref|ZP_09552356.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
gi|371955223|gb|EHO73027.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
Length = 858
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 159/444 (35%), Positives = 232/444 (52%), Gaps = 40/444 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+PY + L RA+DL+ R+TL EK M D + +PRLG+ + WWSEALHG + +G
Sbjct: 31 YPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG- 89
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNAG-- 127
T FP I ASFN++L K+ S E RA Y+ + N G
Sbjct: 90 ---------------NVTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHHRIRNGGED 134
Query: 128 -----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
L+ W+PN+N+ RDPRWGR ET GEDPY+ VRGLQ E DS
Sbjct: 135 EKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPE--------DS 186
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
+ K+ AC KHYA + + + + V+ +D+ ET++ F+ V E V VMC+
Sbjct: 187 KYRKLWACAKHYAVHSGPEYTRHTANL--NNVSPRDLWETYLPAFKTLVEEAKVREVMCA 244
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
Y ++ P C + +LL Q +R +W F +VSDC ++ I ++HK +D A A+
Sbjct: 245 YQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVSDIWQNHKTSSDAVH-ATAKAAL 303
Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
AG D++CG YT + AVQ+G I+E ++D + L LG D + +
Sbjct: 304 AGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFDLGEMDDPALVPWSKIPY 363
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ + + H +L+ + ARQ IVLL+N LPL N + +A++GP+A+ M GNY GT
Sbjct: 364 SVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN-ERIAVIGPNADNVPMMWGNYNGT 422
Query: 420 PCRYTSPMDGFYAYSKVINYAPGC 443
P R + +DG A K + Y GC
Sbjct: 423 PNRTVTILDGIRAKHKNVKYIKGC 446
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 132/298 (44%), Gaps = 60/298 (20%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
I A K + V V G+ ++E E G DR D+ LP Q + I + A K
Sbjct: 602 IRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAAGK-- 659
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
LV ++ I +I+ Y G+EGG A+ADV+FG YNP G+LP+T+Y+
Sbjct: 660 -QLVYVNCSGSAIALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVTFYK- 717
Query: 568 NYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
N ++P Y + ++ GRTY++F P ++ FG+GLSYT F A K D
Sbjct: 718 NSNQLPDYENYSMK------GRTYRYFSDP-LFAFGHGLSYTTFNMGTAEIIKKAD---- 766
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+I VEN+G DG+E V++Y K
Sbjct: 767 --------------------------------SIVVRIPVENVGSKDGTETVLLYIKNHQ 794
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVGE 743
IK + G+ RVF+ AG A V + KS + D N++ G + +L G+
Sbjct: 795 DPNGPIKSLRGFSRVFVKAGHKA-VAELLLTRKSFEFFDENTNTVHFKEGNYDLLYGD 851
>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
Length = 864
Score = 259 bits (661), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 159/422 (37%), Positives = 221/422 (52%), Gaps = 42/422 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY + KL ERA DLV R+TL EK M + + +PRLG+ Y+WW+EALHGV G
Sbjct: 25 LPYQNPKLTPEERANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRAGI 84
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
AT FP I ASF++ L ++ VS EARA Y
Sbjct: 85 ----------------ATVFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLK 128
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLTFW+PN+N+ RDPRWGR ET GEDPY+ + + VRGLQ E Y
Sbjct: 129 RYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEDAPYD------ 182
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + W +R F++ + +D+ ET++ F+ V + V VMC+
Sbjct: 183 --KLHACAKHFAVHSGPEW---NRHEFNAENIAPRDLWETYMPAFKDLVQKAHVKEVMCA 237
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVES--HKFLNDTKEDAVARV 300
YNR+ G P C + +LL +R +W + G +VSDC +I H+ D K A A
Sbjct: 238 YNRLEGEPCCGNNRLLTHILRDEWGYQGIVVSDCGAISDFWRKGDHETHPD-KAHASAGA 296
Query: 301 LKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
+ +G DL+CG Y + AV+ G IAE+ +D S++ L LG D + + +
Sbjct: 297 VLSGTDLECGSNYKSLPE-AVKAGLIAESQLDISVKRLLKARFELGEMDKDVCWDTIPYS 355
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ H +LA AR+ IVLL+N N LPL ++K +ALVGP+AN + GNY G P
Sbjct: 356 VVDCQAHKDLALRMARESIVLLQNRNNILPLRK-DMK-IALVGPNANDSIMHWGNYNGFP 413
Query: 421 CR 422
Sbjct: 414 SH 415
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 132/300 (44%), Gaps = 53/300 (17%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
+ A D K+AD + G+ ++E E G DR + LP Q +L+ ++
Sbjct: 591 LQATADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQRQLVGELKKL 650
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K P+ + S A+ + A + ++ YPG+ GG AIADV+FG YNP G+LP+T
Sbjct: 651 GK-PIVFINYSGSAMGL--APESEICDGMIQAWYPGQAGGTAIADVLFGDYNPAGKLPVT 707
Query: 564 WYEANYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
+Y N ++P + ++ GRTY++ ++ FG+GLSYT F Y A
Sbjct: 708 FYR-NTEQLPDFEDYAMK------GRTYRYMTETPLFRFGHGLSYTTFDYGKAR------ 754
Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
L + K T I V N G DG E V VY
Sbjct: 755 --------------------------LSQNTFSKGETLTLTIPVSNTGTRDGEETVQVYL 788
Query: 683 KPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ PG A + ++RV++ G + ++ FT++ L + N L SG + +L G
Sbjct: 789 RRPGDADAPSHTLRAFKRVYVPKGGTKEIKFTLSDDNFLWFDTSTNNMNLISGEYELLYG 848
>gi|336399403|ref|ZP_08580203.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
gi|336069139|gb|EGN57773.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
Length = 757
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 209/753 (27%), Positives = 346/753 (45%), Gaps = 98/753 (13%)
Query: 26 KDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR-------------- 71
+DL+++MTL EK+ Q+ G G P S++L +G
Sbjct: 47 RDLIKKMTLTEKIGQLSQYVGGSLLTG-PQSGALSDSLFVRGMVGSILNVGGVESLRKLQ 105
Query: 72 -------RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
R P FD T FPT + + S++ +G T A
Sbjct: 106 EKNMQSSRLKIPVLFAFDVIHGYKTIFPTPLAESCSWD------LGLMFETAKAAAIEAS 159
Query: 125 NAGLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
+G+ + ++P +++ RDPRWGR++E GED Y+ + A VRG Q G +
Sbjct: 160 ASGIHWTFAPMVDIARDPRWGRIVEGAGEDTYLACKIAETRVRGFQWNLG---------K 210
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
P + AC KH+ AY G D D ++ + E ++ PF+ CV+ G V + M ++
Sbjct: 211 PNSVYACAKHFVAYGAPQ-AGRDYAPVD--LSLSTLAEVYLPPFKACVDAG-VHTFMSAF 266
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N +NG+P + L+ +R W FHG++VSD +++Q + ++H + +T DA A
Sbjct: 267 NSLNGVPATGNRWLMTDILRNQWKFHGFVVSDWNAVQEL-KAHG-VAETDTDAALMAFDA 324
Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG--KN 360
G+D+D D Y AV +GK+ IDTS+ + LG FD ++ ++ +
Sbjct: 325 GVDMDMTDGLYNRCLEKAVCEGKLDMQAIDTSVERILRAKYALGLFDDPYRFLDVKRERR 384
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYE--G 418
I + +LA +AA +VLLKND+ LPL+ + K +AL+GP A+ ++G+++ G
Sbjct: 385 EIRSEAVTKLARKAAASSMVLLKNDHATLPLSK-HTKRIALIGPLADNRSEVMGSWKARG 443
Query: 419 TPCRYTSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDL 475
+ +DG + Y GC D + + PAA +AAK +D + V G
Sbjct: 444 EESDVVTVLDGIKKKLGSDVAVTYVQGC-DFLEPSTREFPAAFEAAKQSDVVIAVVGEKA 502
Query: 476 SVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWV 535
+ E + R L LPG Q L++ + A + P+ +V+M+ + + K + + ++L
Sbjct: 503 LMSGESRSRAVLRLPGQQEALLDTLQKAGR-PLVVVLMNGRPLCLQ--KVDRQADALLEA 559
Query: 536 GYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYT---SMPLRPVNNFPGRTYKF 592
+PG + G A+AD++FG P +L T + +IP RP + T +
Sbjct: 560 WFPGTQCGNAVADILFGDAVPSAKL-TTSFPLTEGQIPNNYNYKRSGRPGDMSHSSTVRH 618
Query: 593 FDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLI 650
D P +YPFGYGLSYT F Y PK +
Sbjct: 619 IDVPNRNLYPFGYGLSYTTFSYGEMQCPKQFN---------------------------- 650
Query: 651 DDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSA 709
D ++V N G DG E+V +Y + +K++ G+++VFI GQ+
Sbjct: 651 -----ADGTLQVSVDVTNTGGYDGEEIVQLYVADKVASMVRPVKELKGFQKVFIPKGQTK 705
Query: 710 KVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
++ FT+NA + L +N+ ++ G I+VG
Sbjct: 706 RIDFTLNA-RDLGFWNNSMQYIVEPGTFEIMVG 737
>gi|375254464|ref|YP_005013631.1| glycosyl hydrolase family 3, C-terminal domain-containing protein
[Tannerella forsythia ATCC 43037]
gi|363407375|gb|AEW21061.1| glycosyl hydrolase family 3, C-terminal domain protein [Tannerella
forsythia ATCC 43037]
Length = 775
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 213/729 (29%), Positives = 347/729 (47%), Gaps = 124/729 (17%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P++ + E +HG IG T FPT I +++N +L +K+
Sbjct: 121 RLGIPIF-FAEECMHGHMAIG-----------------TTVFPTSIGQASTWNRTLIEKM 162
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G ++ E R+ + P +++ R+PRW RV ET GEDP + G +VRGLQ
Sbjct: 163 GAAIAHETRS-----QGAHIAYGPVLDLAREPRWSRVEETFGEDPVLSGILGSAFVRGLQ 217
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
G ++ +D R + KH AAY + N R +++ +++ +LPFEM
Sbjct: 218 ---GKDF---ADGR--HTYSTLKHLAAYGIPVGGHNGR---QAQIGARELIAEHLLPFEM 266
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V G SVM SYN V+G+P ++ +L + +RG+W+F+G++VSD SI+ I +H+
Sbjct: 267 AVKAG-AQSVMTSYNAVDGVPCTSNTYILKKILRGEWDFNGFVVSDLGSIEGIATTHRVA 325
Query: 290 NDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
D K A A L AG+++D G YT A I+ ++ID ++ + + +G F
Sbjct: 326 PDIKH-AAAMALNAGVEMDLGGVAYTRNMEQAHTDSLISMSEIDDAVSRILRLKFEMGLF 384
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ + I + +H LA + A + IVLLKN+ LPL+ NI ++A++GP+A+
Sbjct: 385 ESPYVQPSRTTEIIRSKEHNRLARKVAEESIVLLKNNANLLPLSK-NIGSIAVIGPNADN 443
Query: 409 TKAMIGNYEG-TPCRY-TSPMDGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKN 463
+G+Y P + + ++G + + VI Y GCA + S I A+ AA
Sbjct: 444 LYNQLGDYTAPQPEEHIVTILEGIRNAVSPTTVIRYVKGCA-VRDTTQSNIDEAVRAANA 502
Query: 464 ADATVIVAG----LDLSVE----------------------AEGKDRVDLLLPGFQTELI 497
++A V+V G D + EG DR L L G Q +LI
Sbjct: 503 SNAVVLVVGGSSARDFHTKYIETGAATVSSRENELIPDMESGEGYDRKSLTLLGHQEKLI 562
Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPG 557
+A K P+ +V + +++N A + K ++L YPGEEGG A+A+VIFG NP
Sbjct: 563 ESIAATGK-PLIMVYIQGRPLNMNLA--DKKASALLTAWYPGEEGGNAVANVIFGDVNPS 619
Query: 558 GRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVA 615
GRLPI+ +P ++ L PV G++ + +G +Y FGYGLSYT F+Y
Sbjct: 620 GRLPIS--------VPRSTGQL-PVYYSLGKSNDYVEGTSTPLYAFGYGLSYTAFEYG-- 668
Query: 616 SSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGS 675
++ + ++ G N T V N G DG
Sbjct: 669 ------NLTISRE----------GGN------------------ITVSCTVTNTGNTDGD 694
Query: 676 EVVMVYSKPPGIAGTHIKQVI--GYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA 733
EVV +Y + +A + V+ + ++ + G+SA+V F + + L + ++
Sbjct: 695 EVVQLYLRDH-VASVSVPPVLLKDFAKISLKKGESARVNFVLTP-EQLAFFNTDLKRVVE 752
Query: 734 SGAHTILVG 742
G T+++G
Sbjct: 753 PGEFTVMIG 761
>gi|299141953|ref|ZP_07035087.1| beta-glucosidase [Prevotella oris C735]
gi|298576415|gb|EFI48287.1| beta-glucosidase [Prevotella oris C735]
Length = 858
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 159/444 (35%), Positives = 232/444 (52%), Gaps = 40/444 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+PY + L RA+DL+ R+TL EK M D + +PRLG+ + WWSEALHG + +G
Sbjct: 31 YPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG- 89
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNAG-- 127
T FP I ASFN++L K+ S E RA Y+ + N G
Sbjct: 90 ---------------NVTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHHRIRNGGED 134
Query: 128 -----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
L+ W+PN+N+ RDPRWGR ET GEDPY+ VRGLQ E DS
Sbjct: 135 EKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPE--------DS 186
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
+ K+ AC KHYA + + + + V+ +D+ ET++ F+ V E V VMC+
Sbjct: 187 KYRKLWACAKHYAVHSGPEYTRHTANL--NNVSPRDLWETYLPAFKTLVEEAKVREVMCA 244
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
Y ++ P C + +LL Q +R +W F +VSDC ++ I ++HK +D A A+
Sbjct: 245 YQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVSDIWQNHKTSSDAVH-ATAKAAL 303
Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
AG D++CG YT + AVQ+G I+E ++D + L LG D + +
Sbjct: 304 AGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFDLGEMDDPALVPWSKIPY 363
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ + + H +L+ + ARQ IVLL+N LPL N + +A++GP+A+ M GNY GT
Sbjct: 364 SVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN-ERIAVIGPNADNVPMMWGNYNGT 422
Query: 420 PCRYTSPMDGFYAYSKVINYAPGC 443
P R + +DG A K + Y GC
Sbjct: 423 PNRTVTILDGIRAKHKNVKYIKGC 446
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 131/298 (43%), Gaps = 60/298 (20%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGP 507
I A K + V V G+ ++E E G DR D+ LP Q + I + A K
Sbjct: 602 IRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAAGK-- 659
Query: 508 VTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA 567
LV ++ I +I+ Y G+EGG A+ADV+FG YNP G+LP+T+Y+
Sbjct: 660 -QLVYVNCSGSAIALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVTFYK- 717
Query: 568 NYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
N ++P Y + ++ GRTY++F P ++ FG+GLSYT F A K D
Sbjct: 718 NSNQLPDYENYSMK------GRTYRYFSDP-LFAFGHGLSYTTFNMGTAEIIKKAD---- 766
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+I VEN+G DG+E V++Y K
Sbjct: 767 --------------------------------SIVVRIPVENVGSKDGTETVLLYIKNHQ 794
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVGE 743
IK + G+ RVF+ AG A + KS + D N++ G + +L G+
Sbjct: 795 DPNGPIKSLRGFSRVFVKAGHQAVAELVLTR-KSFEFFDENTNTVHFKEGNYDLLYGD 851
>gi|295135338|ref|YP_003586014.1| glycoside hydrolase [Zunongwangia profunda SM-A87]
gi|294983353|gb|ADF53818.1| glycoside hydrolase family protein [Zunongwangia profunda SM-A87]
Length = 764
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 222/725 (30%), Positives = 333/725 (45%), Gaps = 130/725 (17%)
Query: 35 PEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTV 94
PEK++ D A R+G+PL S+ +HG T+FP
Sbjct: 89 PEKIRVAQDYAVNDTRMGIPLL-IGSDVIHGYK---------------------TTFPIP 126
Query: 95 ILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGED 153
+ T AS++ + KK + + EA A G+ + +SP +++ RDPRWGR+ E GED
Sbjct: 127 LGTAASWDMEMIKKTAEIAAQEATA------DGINWNFSPMVDIARDPRWGRIAEGAGED 180
Query: 154 PYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFD-S 212
PY+ + A V G Y D ++ + A KH+A Y G D D S
Sbjct: 181 PYLGSQIAKAMVEG--------YQGDDLAKENTMIATVKHFALYGASE-AGRDYNTTDMS 231
Query: 213 RVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYI 272
RV M ++ P++ ++ G SVM S+N V+G+P + LL +R W F G++
Sbjct: 232 RVK---MFNEYLPPYKAAIDAG-AESVMSSFNDVDGVPATGNKWLLTDLLRDRWGFEGFV 287
Query: 273 VSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKIAEADI 331
SD S+ ++ +H + A+A LKAGLD+D G+ Y ++ +GK+ EA+I
Sbjct: 288 TSDYTSLNEMI-AHGMGDLQAVSALA--LKAGLDMDMVGEGYLKTLKKSLDEGKVTEAEI 344
Query: 332 DTSLRFLYIVLMRLGYFDGSPQYKNLGK--NNICNPQHIELAAEAARQGIVLLKNDNGAL 389
T+ R + +LG FD +Y + + +I + ++ + + A VLLK D G
Sbjct: 345 TTAARRILEAKYKLGLFDDPYKYLDESRPEKDILSEENRTFSRKVAAHSFVLLKKDAGVF 404
Query: 390 PLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF--YAYSKVINYAPGC-- 443
PL N K +AL+GP AN M+G + G P + G A + YA G
Sbjct: 405 PLKK-NAK-IALIGPLANNKNNMLGTWAPTGNPQLSVPVLQGVKNVAPKAKVTYAQGANI 462
Query: 444 ----------------ADIVCQN-NSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVD 486
A+I + M+ A+ AK +D V V G + E R +
Sbjct: 463 TDDAQLAENINVFGPRAEISETSPEKMLEEALKVAKKSDVIVAVVGEATEMSGEAASRTN 522
Query: 487 LLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAI 546
LL+P Q +LI ++A K P+ LV+MS +N ++ + IL V +PG E G AI
Sbjct: 523 LLIPESQKKLIRELAKTGK-PMALVLMSGRP--LNISEESEMNIDILQVWHPGVEAGNAI 579
Query: 547 ADVIFGKYNPGGRLPITWYEANYVKIP-YTSMPL--RP--VNNFPGRTYKFFDGP--VVY 599
ADVIFG YNP G++ +W N ++P Y +M RP V F +F D P +Y
Sbjct: 580 ADVIFGDYNPSGKITASW-PRNVGQVPVYYAMKRTGRPGEVEGFQKFKSEFLDTPNSPLY 638
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFGYGLSYT+F+Y D+K D+ D GT
Sbjct: 639 PFGYGLSYTEFEYS--------DVKASADELKMD-----GT------------------- 666
Query: 660 FTFQIEVENMGKMDGSEVVMVYSK-------PPGIAGTHIKQVIGYERVFIAAGQSAKVG 712
T + N G DG EVV +Y PP +KQ+IG+E++ + G+S V
Sbjct: 667 LTLSAIITNTGDYDGEEVVQLYIHDKVRSITPP------MKQLIGFEKIMLKKGESKTVT 720
Query: 713 FTMNA 717
F ++A
Sbjct: 721 FEISA 725
>gi|451821117|ref|YP_007457318.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451787096|gb|AGF58064.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 750
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 218/721 (30%), Positives = 335/721 (46%), Gaps = 96/721 (13%)
Query: 36 EKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVI 95
EK ++ +A RLG+P+ + + +HG T FP +
Sbjct: 95 EKSNELQKIAVEESRLGIPIL-FGLDVIHGYR---------------------TIFPIPL 132
Query: 96 LTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGEDP 154
SF+ K+ + + EA A AGL + ++P +++ RDPRWGRV E GEDP
Sbjct: 133 AEACSFDIEKIKESARIAAKEASA------AGLHWTFAPMVDISRDPRWGRVAEGAGEDP 186
Query: 155 YVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRV 214
Y+ A V G Q +S P I AC KH+A Y + G D D +
Sbjct: 187 YLGSVIAKARVEGFQG--------ESLDNPESILACAKHFAGYGAPDG-GRDYNTVD--M 235
Query: 215 TEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVS 274
+ Q + + ++ PF+ G V + M ++N +NGIP + LL +R + F+G++VS
Sbjct: 236 SLQTLHDVYLPPFKAAAEAG-VGTFMSAFNDLNGIPCTVNKYLLTDVLREKFGFNGFVVS 294
Query: 275 DCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDT 333
D +SI +V H + D K A + L AGLD+D Y N V++G I E +D
Sbjct: 295 DANSIPEVV-VHGYAEDNKA-ASKKALNAGLDMDMSQGTYRNELPELVKEGDILEEVLDE 352
Query: 334 SLRFLYIVLMRLGYFDG--SPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPL 391
++R + V LG FD K K +C +H+E A + +R+ IVLLKN+N ALPL
Sbjct: 353 AVRRVLRVKFLLGLFDNPYRTDAKKEEKTLLCK-EHLEAARDISRRSIVLLKNENNALPL 411
Query: 392 NTGNIKTLALVGPHANATKAMIGNYE--GTPCRYTSPMDGFYAYSKV---INYAPGCADI 446
++K +A+VGP A M+G + G P + + G A I YA GC I
Sbjct: 412 KK-DLKKIAVVGPLAENAAEMLGTWSHTGNPSDVVTIISGIKAAVSTETEILYAEGC-KI 469
Query: 447 VCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKG 506
+ A+ AK +D + V G + + E R+D+ LPG Q EL+ ++ K
Sbjct: 470 TGEECIDFEGAVRVAKESDVIIAVVGENSDMSGEAASRIDINLPGKQEELLKELRKIGK- 528
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW-Y 565
P+ +V+++ + I + N + +++ G + G AIADV+FG YNP G+L T+ Y
Sbjct: 529 PLIVVLINGRPLTIPWEAEN--VDALVEAWQLGTQSGNAIADVLFGDYNPSGKLVATFPY 586
Query: 566 EANYVKIPYTS-MPLRPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVD 622
V I Y + M RP T K+ DGP +YPFG+GLSYT FKY+
Sbjct: 587 SVGQVPIYYNNPMTGRPAGKIK-FTSKYIDGPAEPLYPFGFGLSYTTFKYE--------- 636
Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
+++ NK + D V K Y V N G++ G EVV +Y
Sbjct: 637 ----------NLSILSAENK------IGDTVAVKVY-------VTNTGEVSGEEVVQLYV 673
Query: 683 KPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
+ +K++ +E+V + + + F +N K L D N ++ G + V
Sbjct: 674 SDVVASRVRPVKELKSFEKVLLQPKECKTIIFKLN-TKDLGFHDENMNYVVEPGLFKVYV 732
Query: 742 G 742
G
Sbjct: 733 G 733
>gi|103486503|ref|YP_616064.1| glycoside hydrolase [Sphingopyxis alaskensis RB2256]
gi|98976580|gb|ABF52731.1| glycoside hydrolase, family 3-like protein [Sphingopyxis alaskensis
RB2256]
Length = 772
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 217/731 (29%), Positives = 341/731 (46%), Gaps = 98/731 (13%)
Query: 27 DLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSE--------------------ALHGV 66
DL+ +MTL EK Q+ L G + + + E L +
Sbjct: 59 DLMVKMTLDEKTGQLTLLTSNWESTGPTMRDSYKEDIRAGRVGAIFNAYTAKYTRELQAL 118
Query: 67 SFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA 126
+ G R P +D T FP + AS++ +K + + EA A
Sbjct: 119 AVEGTRLKIPLLFGYDVIHGHRTIFPISLGEAASWDLQAIEKAARISAIEASA------E 172
Query: 127 GLTF-WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
G+ + +SP +++ RDPRWGR+ E GED Y+ A VRG Y SRP
Sbjct: 173 GIHWTFSPMVDIARDPRWGRISEGAGEDVYLGSLIAKARVRG--------YQGGDLSRPD 224
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNR 245
I A KH+AAY G D D ++E+ M++ ++ PF+ + ++ M ++N
Sbjct: 225 TILATAKHFAAYGAAQ-AGRDYHTVD--ISERTMRDVYLPPFKAAADA-GAATFMTAFNE 280
Query: 246 VNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGL 305
+G+P LL +R W F G++V+D SI +V H + D K+ A + ++AG+
Sbjct: 281 YDGVPASGSHYLLTDVLRKKWGFKGFVVTDYTSINEMV-PHGYAKDLKQ-AGEQAMRAGV 338
Query: 306 DLDC-GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLG--KNNI 362
D+D G + +V +GK+ A ID +++ + + RLG FD +Y + K I
Sbjct: 339 DMDMQGAVFMENLAKSVAEGKVDTARIDAAVKAILEMKYRLGLFDDPYRYADAAREKATI 398
Query: 363 CNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR 422
P +E A + AR+ IVLLKN + LPL + K++A++GP N+ + MIG++ R
Sbjct: 399 YKPAFLEAARDVARKSIVLLKNKDNVLPL-AASAKSIAVIGPLGNSKEDMIGSWSAAGDR 457
Query: 423 YTSP---MDGFYAYS---KVINYAPGCA---DIVCQNNSMIPAAIDAAKNADATVIVAGL 473
T P ++G A + I YA G + D V + + A+ A+ +D + G
Sbjct: 458 RTRPVTLLEGLQAGAPKGTTIAYAKGASYHFDDVGKTDG-FAEALALAEKSDVIIAAMGE 516
Query: 474 DLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
++ E R L LPG Q L+ + K PV LV+MS I +A N + +IL
Sbjct: 517 HWNMTGEAASRTSLDLPGNQQALLEALEKTGK-PVILVLMSGRPNSIEWADAN--VDAIL 573
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPVN-NFPGRTY 590
YPG GG AIAD+++G+YNP G+LP+T+ V I Y RP+ PG Y
Sbjct: 574 EAWYPGTMGGHAIADILYGRYNPSGKLPVTFPRTVGQVPIHYDMKNTGRPIELGAPGAKY 633
Query: 591 --KFFDGP--VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCA 646
++ + P +YPFGYGLSYT F Y SP + LD+ + + +P
Sbjct: 634 VSRYLNTPNTPLYPFGYGLSYTSFTY----SP----VTLDRSK--------IRPGEP--- 674
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAA 705
T + V N G DG EVV +Y + G +K++ G++++ +
Sbjct: 675 -------------LTASVTVTNSGPRDGEEVVQLYVRDLVGSVTRPVKELKGFQKIGLKK 721
Query: 706 GQSAKVGFTMN 716
G++ V FT+
Sbjct: 722 GETRTVRFTLT 732
>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
Length = 855
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 162/439 (36%), Positives = 237/439 (53%), Gaps = 46/439 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+ D P ER DL+ R+T+ EKV + + A +PRL + Y +EALHGV
Sbjct: 29 FRDMTAPQHERILDLLNRLTVEEKVSLLVNDAREIPRLNIDKYNHGNEALHGVV------ 82
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
PG T FP I A++N +L ++ +S EAR + + G
Sbjct: 83 --RPGEF--------TVFPQAIGLAATWNPNLIFRVSTAISDEARGRWKELDYGKKQIAG 132
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDP++ GR +V+GLQ + R
Sbjct: 133 GSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGRIGCEFVKGLQG---------DNPR 183
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK + KH+AA N E ++R ++R++E+D++E ++ FE C+ +G S+M +Y
Sbjct: 184 YLKTVSTPKHFAA----NNEEHNRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAY 239
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N VN +P + L+ + +RGDWNF+GYIVSDC + + +V HK++ + E A LKA
Sbjct: 240 NAVNDVPCTVNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNL-EAAATLALKA 298
Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CGD YT + A + ++EA+ID++ + M LG FD Q Y + +
Sbjct: 299 GLDLECGDRVYTAPLLKAYNEYMVSEAEIDSAAYHILRGRMLLGLFDDPSQNPYNKIEPS 358
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
I +H ELA E ARQ +VLLKN LPLN I+++A+VG +A G+Y G P
Sbjct: 359 VIGCKEHQELALETARQSMVLLKNQKNFLPLNRKKIRSIAVVG--ISAAHCEFGDYSGNP 416
Query: 421 CRY-TSPMDGFYAYSKVIN 438
S +DG Y++ N
Sbjct: 417 KNTPVSVLDGIKKYAENAN 435
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 101/294 (34%), Positives = 147/294 (50%), Gaps = 52/294 (17%)
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VD 519
AK D TV V G++ S+E EG+DR L LP Q E I ++ P T+V++ AG+ +
Sbjct: 600 AKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKV--NPNTVVVLVAGSSMA 657
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
IN+ N + +IL YPGE+GG A+A+V+FG YNPGGRLP+T+Y + +P
Sbjct: 658 INWMDEN--VPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS------LDELPA 709
Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
+ RTY++F+G +Y FGYGLSYT FKYK S +S D DI + +
Sbjct: 710 FDDYSVKNRTYQYFEGKPLYEFGYGLSYTNFKYKKKSIMQSND--------TVDITFNLS 761
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIG 697
N+GK DG EV VY + P GT+ +KQ+ G
Sbjct: 762 ----------------------------NVGKYDGDEVAQVYVRYPE-TGTYMPLKQLKG 792
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSF 750
+ RV + G+SA + ++ K L+ D + +G + VG +S
Sbjct: 793 FSRVHLKKGKSADITISIPK-KELRYWDEKTRQFVTPTGEYVFQVGGSSENISI 845
>gi|423301451|ref|ZP_17279475.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
CL09T03C10]
gi|408472052|gb|EKJ90581.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
CL09T03C10]
Length = 781
Score = 258 bits (659), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 211/723 (29%), Positives = 325/723 (44%), Gaps = 115/723 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ EA HG IG T FPT I A+++ L ++
Sbjct: 129 RLGIPLF-LAEEAPHGHMAIG-----------------TTVFPTGIGMAATWSPQLINEV 170
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G+ + E R G + P +++ RDPRW RV ET GEDP + G V GL
Sbjct: 171 GKAIGKEIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVAGLG 225
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
SRP A KH+ AY + N F +++ E F+ PF
Sbjct: 226 S--------GDLSRPYSTLATLKHFLAYGISESGQNGNPSFAGM---RELHENFLPPFGQ 274
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
+N G +S VM SYN ++G P A+ LL + +R DW F G +VSD SI+ I +SH F+
Sbjct: 275 AINAGALS-VMTSYNSMDGTPCTANHYLLTELLRDDWKFKGVVVSDLYSIEGIHQSH-FV 332
Query: 290 NDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYF 348
T ++A L AG+D+D G D Y N M AV + +I++ +D ++ + + +G F
Sbjct: 333 ASTMKEAAVMALSAGVDIDLGGDAYMNL-MDAVNRKEISKEILDAAVSRVLRLKFEMGLF 391
Query: 349 DGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANA 408
+ K + + +++ LA + A+ I LLKN++ LPL+ +AL+GP+A+
Sbjct: 392 ENPYVDPGKAKKEVRSKEYVALARQVAQASITLLKNEHSLLPLDRS--MKVALIGPNADN 449
Query: 409 TKAMIGNYEG--TPCRYTSPMDGFYA--YSKVINYAPGCADIVCQNNSMIPAAIDAAKNA 464
M+G+Y + +DG A S + Y GC+ I S I A+ AA+ +
Sbjct: 450 RYNMLGDYTAPQEEENVKTVLDGIRAKLSSSQVEYVKGCS-IRDTVTSDIEQAVAAARRS 508
Query: 465 DATVIVAGLDLSVE-----------------------AEGKDRVDLLLPGFQTELINKVA 501
+ + V G + + EG DR L L G Q EL+ K
Sbjct: 509 EVVIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-KAL 567
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
A P+ +V + +D N+A N ++L YPG+EGG AIADV+FG++NP GRLP
Sbjct: 568 KATGKPLIVVYIEGRPLDKNWASENAD--ALLTAYYPGQEGGNAIADVLFGEFNPAGRLP 625
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFP-GRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
+ V +P+ P Y +Y FGYGLSYT F+Y
Sbjct: 626 FS------VPRSVGQVPVYYNKKAPQSHDYVEVSASPLYSFGYGLSYTTFEYS------- 672
Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
D++ + T + F ++ N GK DG EVV +
Sbjct: 673 ------------DLHLSALT----------------PHSFEVSCKIRNTGKYDGEEVVQL 704
Query: 681 YSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTI 739
Y + + +KQ+ + R+F+ G+ KV F ++ + +VD ++ G +
Sbjct: 705 YLRDEYASVVQPLKQLKHFARLFLKCGEEQKVKFILSE-EDFALVDRNLKRVVEPGTFQV 763
Query: 740 LVG 742
++G
Sbjct: 764 MIG 766
>gi|294673871|ref|YP_003574487.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294474367|gb|ADE83756.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 782
Score = 258 bits (659), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 220/726 (30%), Positives = 331/726 (45%), Gaps = 121/726 (16%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+PL+ EA HG IG T FPT A++N +L +K
Sbjct: 130 RLGIPLF-LAEEAPHGHMAIG-----------------TTVFPTGFGMAATWNPALIEKT 171
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
G+ + E R G + P +++ R+PRW RV ET GEDP + G V+GL
Sbjct: 172 GEVIGQEIRL-----QGGHISYGPVLDLAREPRWSRVEETMGEDPVLAGELGAAMVKGLG 226
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
G+ S+P A KH+ Y N +++QE+F+ PF+
Sbjct: 227 G--GIL------SKPYSTIATLKHFIGYGTTEAGQNGGITI---AGARELQESFLPPFKK 275
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
+N G +S VM SYN ++GIP+ LL +R W F+G++VSD SI I +H+ +
Sbjct: 276 AINAGALS-VMTSYNSLDGIPSTCSKALLTDLLRTQWGFNGFVVSDLYSIDGIHGTHR-V 333
Query: 290 NDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD 349
+TK+ A LKAG+D D G AVQ+G + EA+ID +++ + + +G F+
Sbjct: 334 AETKQQAGVMALKAGVDADLGALAFGRLEDAVQKGMVTEAEIDVAVKRILKMKFEMGLFE 393
Query: 350 GSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANAT 409
K + + + +A + AR+ I LLKN N LPL+ + + + GP+A+
Sbjct: 394 HPYVDAAQAKQLVRSDNNKAVALQVAREIITLLKNQNHVLPLS--KTQKVLVCGPNADNV 451
Query: 410 KAMIGNY-----EGTPCRYTSPMDGFYAYSKVINYAPGCA--DIVCQNNSMI-------- 454
M+G+Y EG + + S+V Y GCA D N +
Sbjct: 452 YNMLGDYTAPQEEGNVKTILAGIRSKLPASQV-TYVKGCAVRDTTASNIAEAVAAAKQAD 510
Query: 455 --------PAAID---AAKNADATVI----VAGLDLSVEAEGKDRVDLLLPGFQTELINK 499
+A D + K A V ++ +D EG DR L G Q +L+ K
Sbjct: 511 VVVVAVGGSSARDFKTSYKETGAAVTDSKTISDMDC---GEGFDRATLTPLGHQMQLL-K 566
Query: 500 VADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGR 559
A P+ +V + +D ++A + ++L YPG+EGG AIADV+FG YNP GR
Sbjct: 567 ALKAIGKPLVVVYIEGRPMDKSWAAQHAD--ALLTAYYPGQEGGTAIADVLFGDYNPAGR 624
Query: 560 LPITWYEANYVKIP--YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASS 617
LP++ AN +IP Y P P Y +Y FGYGLSYT FKY
Sbjct: 625 LPVS-VPANVGQIPVYYNKKPPMP------HDYVEMSARPLYAFGYGLSYTTFKYD---- 673
Query: 618 PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV 677
D+N I++ +K TF V N G MDG EV
Sbjct: 674 ---------------DLN--------------IEETGDTQFKVTFN--VTNTGDMDGDEV 702
Query: 678 VMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGA 736
V +Y + + Q+ + R+FI G++ +V FT+ A + L+IVD N ++ +G
Sbjct: 703 VQLYLHDEFASTAQPMMQLKKFSRIFIPKGETKQVSFTLEA-EDLEIVDQEMNHVVETGD 761
Query: 737 HTILVG 742
T+++G
Sbjct: 762 FTVMIG 767
>gi|255690204|ref|ZP_05413879.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
gi|260624223|gb|EEX47094.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 954
Score = 258 bits (659), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 225/768 (29%), Positives = 354/768 (46%), Gaps = 127/768 (16%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLP-LYE---WWSE 61
K K++D PY DA LP ER + L+ MT +K++ + + +G+P G+P LY E
Sbjct: 162 KGKVTDRPYMDASLPVDERVESLLAAMTPADKMELIRE-GWGIP--GIPHLYVPPITKVE 218
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMY 121
A+HG S+ G+ GAT FP + A++N L +++ + E +
Sbjct: 219 AVHGFSY---------GS-------GATIFPQALAMGATWNRQLTEEVAMAIGDET-VIA 261
Query: 122 NLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
N A WSP ++V +D RWGR ET GEDP +V + +++G Q
Sbjct: 262 NTKQA----WSPVLDVAQDARWGRCEETFGEDPVLVSQMGGAWIKGYQ------------ 305
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
S+ L + KH+ + R D ++E++M+E ++PF + D S+M
Sbjct: 306 SKGLFTTP--KHFGGHGAPL---GGRDSHDIGLSEREMREVHLVPFRHVIRNYDCQSLMM 360
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+Y+ GIP +LL + +R +W F+G+IVSDC +I + + K +A + L
Sbjct: 361 AYSDYMGIPIAKSTELLQRILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQAL 420
Query: 302 KAGLDLDCGDYYTNF-TMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKN 360
AG+ +CGD Y N + A + G+I ++D R + + R F+ +P K L N
Sbjct: 421 AAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLATMFRNELFEKNP-CKPLDWN 479
Query: 361 NIC----NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY 416
I + H +A AA + IV+L+N + LPL+ ++T+A++GP A+ + G+Y
Sbjct: 480 KIYPGWNSDSHKAMAHRAACESIVMLENKDNLLPLSK-ELRTIAVLGPGADDLQP--GDY 536
Query: 417 --EGTPCRYTSPMDGFYA----YSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIV 470
+ P + S + G A +KV+ Y GC D + IP A+ A AD V+V
Sbjct: 537 TPKLQPGQLKSVLTGIKAAVSKQTKVL-YEKGC-DFTETGMTDIPKAVKTASQADVVVMV 594
Query: 471 AGLDLSVE----------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
G D S+ E D L+LPG Q EL+ V K PV L++ + D+
Sbjct: 595 LG-DCSISEATKDVRKTCGENNDLATLVLPGKQQELLEAVCATGK-PVILILQAGRPYDL 652
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
K + K+IL PG+EGG A ADV+FG YNPGGRLP+T+ +PL
Sbjct: 653 --LKASEMCKAILVNWLPGQEGGPATADVLFGDYNPGGRLPMTFPRH------VGQLPLY 704
Query: 581 PVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCRDIN 635
GR Y++ D +Y FGYGLSYT F+Y KV P
Sbjct: 705 YNFKTSGRRYEYVDMEYYPLYRFGYGLSYTSFEYSGLKVQEKPNG--------------- 749
Query: 636 YTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQ 694
T + V+N+G G EV +Y + T + +
Sbjct: 750 -----------------------NVTVEATVKNVGGRAGDEVAQLYVTDMYASVKTRVME 786
Query: 695 VIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ + R+ + G+S V F + L ++++ + ++ G I VG
Sbjct: 787 LKDFARIHLNPGESKTVSFELTPY-DLSLLNDHMDRVVEKGEFKICVG 833
>gi|319953334|ref|YP_004164601.1| beta-glucosidase [Cellulophaga algicola DSM 14237]
gi|319421994|gb|ADV49103.1| Beta-glucosidase [Cellulophaga algicola DSM 14237]
Length = 756
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 216/716 (30%), Positives = 338/716 (47%), Gaps = 116/716 (16%)
Query: 35 PEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTV 94
PEK++ D A RLG+PL+ + S+ +HG T+FP
Sbjct: 82 PEKIKTAQDFAVKKTRLGIPLF-FGSDIIHGYK---------------------TTFPIP 119
Query: 95 ILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETPGED 153
+ ++S++ L K+ Q + EA A G+ + +SP +++ RDPRWGR+ E GED
Sbjct: 120 LGLSSSWDMELLKRTAQVAALEATA------DGINWNFSPMVDISRDPRWGRISEGAGED 173
Query: 154 PYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR 213
PY+ + A V G Q + + + + A KH+A Y G D D
Sbjct: 174 PYLGSQIAKAMVTGYQGEDLMAKNT--------MLATVKHFALYGAAE-AGRDYNSVD-- 222
Query: 214 VTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIV 273
++ M ++ P++ ++ G V SVM S+N ++GIP + LL +R DW F+G++V
Sbjct: 223 MSRLKMYNEYLPPYKAAIDAG-VGSVMSSFNDIDGIPASGNKWLLTDLLRDDWKFNGFVV 281
Query: 274 SDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKIAEADID 332
SD S+ ++ +H L D + A LKAGLD+D G+ + ++ +GK+ +I
Sbjct: 282 SDYTSVNEMI-AHG-LGDLQA-VSALSLKAGLDMDMVGEGFLTTLKKSLDEGKVTAEEIT 338
Query: 333 TSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALP 390
T+ R + +LG FD +Y K +I ++ LA EAA++ VLLKND LP
Sbjct: 339 TACRRILEAKFKLGLFDDPYKYIDKKRPAKDILKDENRALAREAAKKSFVLLKNDTKNLP 398
Query: 391 LNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF--YAYSKVINYAPGC--- 443
+N + +AL+G AN+ M+G + G P S + GF A + I +A G
Sbjct: 399 INKSS--KIALIGDLANSKDNMLGTWAPTGDPQLSVSILQGFKNVAPNAQITHAKGANIT 456
Query: 444 --ADIVCQNN--------------SMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDL 487
A + + N M+ A++ AK +D V V G E R D+
Sbjct: 457 DDAALAKKINVFGERVTIDKRSAEEMLNEAVELAKKSDIIVAVVGEATEFTGESSSRTDI 516
Query: 488 LLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIA 547
+P Q +LI +A K P+ LV+MS + + + SIL V +PG E G AIA
Sbjct: 517 SIPQSQKKLIRALAATGK-PLVLVLMSGRPLVLE--EELALSASILQVWFPGVEAGNAIA 573
Query: 548 DVIFGKYNPGGRLPITWYEANYVKIP-YTSMPL--RP--VNNFPGRTYKFFDGP--VVYP 600
DV+FG YNP G+L TW N +IP Y S+ RP + F T + D P + P
Sbjct: 574 DVVFGDYNPSGKLTATWPR-NVGQIPIYHSIKNTGRPQLTSEFEKFTSNYLDAPNTPLLP 632
Query: 601 FGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKF 660
FGYGLSYT+F+Y ++ ++ Q + N+P
Sbjct: 633 FGYGLSYTEFEYS--------NLNVNASQ--------INQNEP----------------L 660
Query: 661 TFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTM 715
+ V N G DG EVV +Y + + T +KQ+ G+++V + G++ +V T+
Sbjct: 661 IVTVSVTNTGNFDGEEVVQLYLRDVVRSITQPVKQLKGFKKVMLKKGETKQVTLTL 716
>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 887
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 170/502 (33%), Positives = 245/502 (48%), Gaps = 63/502 (12%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
FP D L + R KDLV R+TL EKV QM + A +PRLG+P Y+WW+E LHGV+
Sbjct: 40 FPMWDTNLSFEVRVKDLVSRLTLEEKVGQMLNAAPAIPRLGIPAYDWWNEVLHGVA---- 95
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
T F T +P I A ++ + + + E RA++N A
Sbjct: 96 ------RTPFH-----VTVYPQAIGMAAGWDSTSLAMMAHYSALEGRAVFNKATALGRNN 144
Query: 127 ----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
GLT+W+PNIN+ RDPRWGR ET GEDP++ +VRGLQ D
Sbjct: 145 ERYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTSMLGRAFVRGLQ---------GDDP 195
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
+ LK +AC KH+A + R + + D+ +T++ F+ V + V VMC+
Sbjct: 196 KYLKAAACAKHFAVHSGPE---PSRHSDNFSPSNYDLWDTYLPAFKELVTKAKVEGVMCA 252
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
YN +G P C L+N +R W F GY+ SDC +I + HK D +V VL
Sbjct: 253 YNAFHGQPCCGSDVLMNDILRKQWQFKGYVTSDCWAIDDFFKFHKTHPDATSASVDAVLH 312
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFD--GSPQYKNLGKN 360
G D++CG + V++G IAEA +D SL L+ RLG FD +Y ++
Sbjct: 313 -GTDVECGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTRYRLGMFDPVSMVKYAQTPES 371
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ +H + + A+Q IVLLKN+ LPL+ NIK +A++GP+A+ ++GNY G P
Sbjct: 372 ILETAEHKAHSLKMAQQSIVLLKNEGNTLPLSK-NIKKIAVLGPNADNRIVVLGNYNGQP 430
Query: 421 CRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMI-PAAIDAAKNADATVIVAGLDLSVEA 479
S++I G + + Q +I AI+ D + A +
Sbjct: 431 -------------SEIITALQGIKNKLGQEVELIYEKAINFTN--DTLLAYANVTNQYSW 475
Query: 480 EGKDRVDLLLPGFQTELINKVA 501
EGK PGF+ E N VA
Sbjct: 476 EGK-------PGFKAEYYNNVA 490
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 89/276 (32%), Positives = 133/276 (48%), Gaps = 53/276 (19%)
Query: 452 SMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVA 501
S + A ++ K+ADA V V G+ +E E G DR +LLP QTEL+ +
Sbjct: 607 SNLSAIVNRVKDADAIVYVGGISPQLEGEEMRVDFPGFNGGDRTSILLPAVQTELLKMLK 666
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P+ V+M+ A+ + + N I +I+ Y G+ G AIADV+FG YNP GRLP
Sbjct: 667 GTGK-PLVFVVMTGSAIALPYEDQN--IPAIVNAWYGGQSAGTAIADVLFGDYNPAGRLP 723
Query: 562 ITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSV 621
+T+Y+A+ + +P + RTY++F G +YPFG+GLSYT F+Y +P +
Sbjct: 724 VTFYKAD------SDLPDFKSYDMNNRTYRYFKGDALYPFGHGLSYTSFQYSKLKTPGKI 777
Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
F + N GK DG EVV +Y
Sbjct: 778 K---------------------------------SGASFKVSATLTNTGKKDGDEVVQLY 804
Query: 682 SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMN 716
P +AG I+ + G+ R+ + AG+S V FT++
Sbjct: 805 LAYPEVAGKAPIRALKGFNRIRLKAGESKTVSFTLS 840
>gi|330996730|ref|ZP_08320605.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella
xylaniphila YIT 11841]
gi|329572575|gb|EGG54218.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 725
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 161/449 (35%), Positives = 229/449 (51%), Gaps = 46/449 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
D PY + L ERA DL++R+TL EK+ M + + GV RLG+ Y WWSEALHGV+
Sbjct: 21 QDEPYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARN 80
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---------M 120
G AT +P + + F++ L + I TVS E RA
Sbjct: 81 GL----------------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGR 124
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
Y GN GLTFW+PN+N+ RDPRWGR ET GEDPY+ R + V+G+Q +
Sbjct: 125 YGRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVQGMQG--------PA 176
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTE-QDMQETFILPFEMCVNEGDVSSV 239
D++ K AC KHYA + R FD E +D+ ET++ F+ V E DV V
Sbjct: 177 DAKYDKTHACAKHYAVHSGPE---AKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEV 233
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MC+Y R G P C +LL Q +R +W + +VSDC +I ++ ++T DA
Sbjct: 234 MCAYQRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF--YQGRHETHPDAATS 291
Query: 300 VLKA---GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
A G DL+CG Y + AV++G I E IDTSLR L LG D +
Sbjct: 292 SASAVINGTDLECGVEYAHLDE-AVERGLITEHRIDTSLRRLLEARFALGEMDDDALVPW 350
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
+ + + H ++A + R+ +VLL N NG LPL+ G++ +A++GP+A + G
Sbjct: 351 SRISIDTVDCDMHRQMALDVTRKSMVLLHN-NGILPLDKGDVGKIAVMGPNAVDSVMQWG 409
Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGC 443
NY+G P + ++G + Y GC
Sbjct: 410 NYKGVPAHTYTILEGIRMEVGNVPYEKGC 438
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 70/130 (53%), Gaps = 21/130 (16%)
Query: 458 IDAAKNADATVIVAGLDLSVEAE-----------GKDRVDLLLPGFQTELINKVADAAKG 506
++ K+A+ + V G+ ++E E G DR + LP Q +++ K AA
Sbjct: 594 VERVKDAETIIFVGGISPNLEGEDKYFVYCPGFAGGDRTSIELPQVQRDIL-KALKAAGK 652
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKS---ILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
V V S AV + P+++S IL YPG+ GG A+ADV+FG +NP G+LP+T
Sbjct: 653 KVVFVNCSGSAVALV-----PELESCDAILQAWYPGQAGGLAVADVLFGDFNPSGKLPVT 707
Query: 564 WYEANYVKIP 573
+Y+ N ++P
Sbjct: 708 FYK-NTEQLP 716
>gi|150003144|ref|YP_001297888.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|149931568|gb|ABR38266.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
Length = 785
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 245/830 (29%), Positives = 377/830 (45%), Gaps = 173/830 (20%)
Query: 6 KVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQM------------GDLAYGVPRL-- 51
+V + Y A +P R KDL+ RMT+ EKV Q+ G V L
Sbjct: 22 RVMAQQWLYKQAAVPIEYRVKDLLGRMTIEEKVGQLCCPLGWEMYTKTGKNEVTVSELYK 81
Query: 52 ----GLPLYEWWS-------------------------EALHGVSFIGRRTNSPPGTHFD 82
P+ +W+ AL + R P F
Sbjct: 82 KKMAEAPVGSFWAVLRADPWTQKTLETGLSPELSAKALNALQKYAVEETRLGIP--VLFA 139
Query: 83 SEVP------GATSFPTVILTTASFNESLWKKIGQTVSTEARAM-YNLGNAGLTFWSPNI 135
E P G T FPT + +++NE L K+G+ ++ EAR N+G + P +
Sbjct: 140 EECPHGHMAIGTTVFPTALSAASTWNEGLMLKMGEAIALEARLQGANIG------YGPVL 193
Query: 136 NVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYA 195
+V R+PRW R+ ET GEDP + + G+ ++G++ +D + L A KH+A
Sbjct: 194 DVAREPRWSRMEETFGEDP------VLTTIMGVAMMKGMQGKVQNDGKHL--YATLKHFA 245
Query: 196 AYDLDNWEGNDRFHFDSRVT--EQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCA 253
AY + + H SR + + ++ PF V EG ++M SYN ++G+P A
Sbjct: 246 AYGVP-----ESGHNGSRANCGMRQLLSEYLPPFRKAVKEG-AGTLMTSYNAIDGVPCTA 299
Query: 254 DPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDY 312
+ +LL +R W F G++ SD SI+ IV + D KE AV + LKAGLD+D G+
Sbjct: 300 NKELLTDVLRNQWGFKGFVYSDLISIEGIV-GMRAAKDNKEAAV-KALKAGLDMDLGGNA 357
Query: 313 YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAA 372
+ A ++G I AD+D ++ + + ++G F+ L K + + +H ELA
Sbjct: 358 FGKNLKKAYEEGLITMADLDRAVGNVLRLKFQMGLFENPYVSPELAKKLVHSKEHKELAR 417
Query: 373 EAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCR--YTSPMDGF 430
+ AR+G+VLLKN+ G LPL+ +I LA++GP+A+ +G+Y R + +DG
Sbjct: 418 QVAREGVVLLKNE-GVLPLSK-HIGHLAVIGPNADEMYNQLGDYTAPQVREEVATVLDGI 475
Query: 431 YAY---SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG--------------- 472
A S + Y GCA + + IPAA+ AA+ ADA V+V G
Sbjct: 476 RAAVSESTRVTYVKGCA-VRDTTATDIPAAVAAAQKADAVVLVVGGSSARDFKTKYISTG 534
Query: 473 -LDLSVEA---------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINF 522
+S +A EG DR L L G Q +LI+ VA K P+ +V + +++N
Sbjct: 535 AATVSEDAKTLPDMDCGEGFDRSSLRLLGDQEKLISAVASTGK-PLVVVYIQGRTMNMNL 593
Query: 523 AKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPV 582
A K +++L YPGE+GG IAD++FG Y+P GRLP++ +P + L PV
Sbjct: 594 AAE--KAQALLTAWYPGEQGGMGIADILFGDYSPAGRLPVS--------VPRSEGQL-PV 642
Query: 583 NNFPG--RTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGT 640
G R Y G +Y FGYGLSYT+F Y ++L K + +
Sbjct: 643 FYSQGTQRDYVESKGTPLYAFGYGLSYTRFTYS--------GLELQKGTEMETLQ----- 689
Query: 641 NKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY--------SKPPGIAGTHI 692
T V N G DG EVV +Y S+PP +
Sbjct: 690 --------------------TVACTVTNTGNRDGEEVVQLYIGDKVASVSQPPLL----- 724
Query: 693 KQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ ++R+F+ G+S +V F + L I D+ N ++ G ++VG
Sbjct: 725 --LKAFQRIFLKKGESRQVIFHLKK-DDLGIYDSEMNYVVEPGEFKVMVG 771
>gi|431797765|ref|YP_007224669.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
gi|430788530|gb|AGA78659.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
Length = 799
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 203/698 (29%), Positives = 336/698 (48%), Gaps = 111/698 (15%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
G T FPT I +++N +L +++ ++ EAR G + P +++ R+PRW RV
Sbjct: 156 GTTVFPTSIGQASTWNPALIQEMAAAIALEARL-----QGGHIGYGPVLDLAREPRWSRV 210
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGND 206
ET GEDPY+ + V G Q +S + + + KH+ AY + N
Sbjct: 211 EETYGEDPYINSQMGRAMVSGFQG--------ESIASGKNVISTLKHFTAYGVPEGGHNG 262
Query: 207 RFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDW 266
V ++++ E+++ PF+ V EG +S VM +YN ++G+P ++ LLN +R DW
Sbjct: 263 T---SVSVGQRELHESYLPPFKAAVAEGALS-VMTAYNSIDGVPCTSNGHLLNDVLRDDW 318
Query: 267 NFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDY-YTNFTMGAVQQGK 325
F+G++VSD SI + SH + +T E A + AG+D D G Y + + AVQ G
Sbjct: 319 GFNGFVVSDLGSISGLRGSHH-VTETAEGAAQLAINAGVDSDLGGYGFGKNLLAAVQAGG 377
Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKND 385
+++ +D ++R + V +G F+ + ++ + + +HI LA + AR+ +VLLKN+
Sbjct: 378 VSQEVLDEAVRRVLKVKFDMGLFENPYVDPSKAESLVRSAKHIALARKVARESVVLLKNE 437
Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC--RYTSPMDGFYAYSKV-----IN 438
N LPL + ++A++GP+A+ T +G+Y + ++G +KV +N
Sbjct: 438 NDLLPLRK-KVNSIAVIGPNADNTYNQLGDYTAPQPNENVVTVLEGI--KNKVGKDVRVN 494
Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE---------------- 478
Y GCA I S I A A +D V+V G D E
Sbjct: 495 YVKGCA-IRDTTQSEIGKAASLAARSDVAVVVLGGSSARDFDTEYEETAAAKVSEAEEGQ 553
Query: 479 -------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKS 531
EG DR+ L L G Q +L+ V A PV +V++ +++N+ + + +
Sbjct: 554 VISDMESGEGFDRMTLDLLGDQLKLVQAV-QATGTPVVVVLIKGRPLNLNWIDEH--VPA 610
Query: 532 ILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNF--PGRT 589
I+ YPG+EGG AIADV+FG YNP GRL I+ +P + L N+ P R
Sbjct: 611 IVDAWYPGQEGGNAIADVLFGDYNPSGRLTIS--------VPRSVGQLPVFYNYRNPKR- 661
Query: 590 YKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAA 647
+ + +G +Y FG+GLSY F+Y S G P
Sbjct: 662 HDYVEGSAEPLYAFGHGLSYADFEYDNLEVTAS------------------GMAGSPTVR 703
Query: 648 VLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIG---YERVFIA 704
V +V N+ +DG EVV +Y + AG+ ++ ++ +E+V +
Sbjct: 704 V--------------HFQVSNISNVDGEEVVQLYVRDE--AGSTVRPLLELKRFEKVMVP 747
Query: 705 AGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
AG+S+K+ F + A + L+++ N L+ G+ +LVG
Sbjct: 748 AGESSKITFMLTA-EDLQVLGQDMNWLVEPGSFQVLVG 784
>gi|371777646|ref|ZP_09483968.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 865
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 153/439 (34%), Positives = 233/439 (53%), Gaps = 40/439 (9%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
S+ VK P+ + L ERAKDL+ R+T+ EK + + D + +PRLG+ + WWSEAL
Sbjct: 14 SMTVKGQVLPFQNPDLSSEERAKDLISRLTVQEKARLLCDQSEAIPRLGIKKFNWWSEAL 73
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HG + + DS T FP I ASFNE L +I +S EARA Y+
Sbjct: 74 HGYA------------NNDS----VTVFPQPIGMAASFNEELVFEIFNAISDEARAKYHQ 117
Query: 124 GNA---------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
L+ W+PN+N+ RDPRWGR ET GEDPY+ R + V+GLQ E
Sbjct: 118 AQRRGEENRRFLSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDA 177
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
+Y K+ AC KHY + W ++ D V+ ++ ET++ F+ V +
Sbjct: 178 KYR--------KLLACAKHYTVHSGPEWSRHELNIND--VSPREFYETYMPAFKALVQKA 227
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
DV VMC+Y+R++ P C++ ++L + +R +W + +V+DC +I +H ++ T
Sbjct: 228 DVRQVMCAYHRLDDEPCCSNTRILQRILRDEWGYEHMVVADCGAISDFYTTHG-ISSTPV 286
Query: 295 DAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
A A L AG DL+C +Y+ A+++ I E DID SL + LG D +
Sbjct: 287 HAAATGLLAGTDLECIWDNYHYKMLPEALEKDLITEKDIDRSLMRVLKGRFDLGEMDDNS 346
Query: 353 --QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATK 410
+ + + + +H +LA + A+Q IVLL+N N LPL+ +I +A+VGP+A+
Sbjct: 347 LVPWAQIPPSVLNCEKHRQLAYKMAQQSIVLLQNKNKVLPLDKSSINKIAVVGPNADDEV 406
Query: 411 AMIGNYEGTPCRYTSPMDG 429
+ GNY GTP R + +DG
Sbjct: 407 VLWGNYNGTPIRTITVLDG 425
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 84/326 (25%), Positives = 138/326 (42%), Gaps = 59/326 (18%)
Query: 407 NATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPG-------CADIVCQNNSMIPAAID 459
N T A N+ P R+ ++ Y I YA D + + A I
Sbjct: 539 NDTLASYTNWRTIPARFPLYVEAGKTYEIEIRYAQRENWEANIQFDFGREEDIDFTALIK 598
Query: 460 AAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKGPVT 509
+ + + V GL +E E G DR ++ LP Q + + +A K T
Sbjct: 599 KLEGIETVIFVGGLSGFLEGEEMPVSYPGFKGGDRTNIELPSVQRNCLKALKEAGK---T 655
Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
++ ++ I +I+ Y GE GG+AIADV+FG YNP G+LP+T+Y +
Sbjct: 656 VIFVNCSGSAIALEPETESCDAIIQAWYGGESGGQAIADVLFGDYNPSGKLPVTFYRNSD 715
Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
+ + GRTY++ + ++PFG+GLSYT F+ A KS IK D+
Sbjct: 716 NLGDFEDYSME------GRTYRYTNNH-LFPFGFGLSYTNFEIGKARLSKST-IKADE-- 765
Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAG 689
+ +I V+N GK DG+E+V VY +
Sbjct: 766 -----------------------------TISIKIPVKNTGKRDGTEIVQVYVRKVNDID 796
Query: 690 THIKQVIGYERVFIAAGQSAKVGFTM 715
+K + G++R+ + AG++ + ++
Sbjct: 797 GPLKTLKGFQRIAVPAGKTRQANISL 822
>gi|332881173|ref|ZP_08448832.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
oral taxon 329 str. F0087]
gi|332680887|gb|EGJ53825.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
Length = 675
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 162/449 (36%), Positives = 227/449 (50%), Gaps = 46/449 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
D PY + L ERA DL++R+TL EK+ M + + GV RLG+ Y WWSEALHGV+
Sbjct: 21 QDEPYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARN 80
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---------M 120
G AT +P + + F++ L + I TVS E RA
Sbjct: 81 GL----------------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGR 124
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
Y GN GLTFW+PN+N+ RDPRWGR ET GEDPY+ R + VRG+Q +
Sbjct: 125 YGRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQG--------PA 176
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTE-QDMQETFILPFEMCVNEGDVSSV 239
D++ K AC KHYA + R FD E +D+ ET++ F+ V E DV V
Sbjct: 177 DAKYDKTHACAKHYAVHSGPE---AKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEV 233
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MC+Y R G P C +LL Q +R +W + +VSDC +I ++ ++T DA
Sbjct: 234 MCAYQRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF--YQGRHETHPDAATS 291
Query: 300 VLKA---GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
A G DL+CG Y + AV++G I E IDTSLR L LG D +
Sbjct: 292 SASAVINGTDLECGVEYAHLDE-AVERGLITEHRIDTSLRRLLEARFALGEMDDDALVPW 350
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
+ + + H +A + R+ +VLL N NG LPL+ G+ +A++GP+A + G
Sbjct: 351 SRISIDTVDCGTHRRMALDVTRKSMVLLHN-NGILPLDKGDAGKIAVMGPNAVDSVMQWG 409
Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGC 443
NY+G P + ++G + Y GC
Sbjct: 410 NYKGVPAHTYTILEGIRGAIGNVPYEKGC 438
>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 858
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 164/454 (36%), Positives = 231/454 (50%), Gaps = 47/454 (10%)
Query: 4 SIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEAL 63
++ + PY + KL RA DL+ R+TL EK M + + +PRLG+ YEWW+EAL
Sbjct: 14 TVSLVAQQLPYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKAYEWWNEAL 73
Query: 64 HGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNL 123
HGV G AT FP I ASFN L VS EARA N
Sbjct: 74 HGVGRSGV----------------ATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKSNK 117
Query: 124 GN--------AGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
+ GLT+W+PN+N+ RDPRWGR ET GEDPY+ + V+GLQ + E
Sbjct: 118 FSEQGGLKRYQGLTYWTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQGPDNAE 177
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEG 234
Y K+ AC KH+A + W +R F++ + +D+ ET++ F+ V +
Sbjct: 178 YD--------KLHACAKHFAVHSGPEW---NRHSFNAENINPRDLWETYLPAFKALVQKA 226
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVE--SHKFLNDT 292
DV VMC+YNR P C +LL Q +R DW F G +VSDC +I + +H D
Sbjct: 227 DVKEVMCAYNRFEDEPCCGSNRLLTQILRNDWKFDGLVVSDCWAISDFYKPNAHATQPDA 286
Query: 293 KEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP 352
A VL G DL+CG + N AV+ G I E ID SL+ L LG + S
Sbjct: 287 THAAANAVLN-GTDLECGSDFRNLPE-AVKAGLIEEKRIDVSLKRLLKARFELGEMN-SD 343
Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
Q + + + + +H LA A + IVLL+N+N LPL + +K +A++GP+AN +
Sbjct: 344 QVWPISYSVVNSEKHQNLALRMAEESIVLLQNNNNILPL-SKKLK-IAVMGPNANDSVMQ 401
Query: 413 IGNYEGTPCRYTSPMDGF---YAYSKVINYAPGC 443
GNY G P + ++ + +++I Y PGC
Sbjct: 402 WGNYNGFPAHTVTLLEAMRKSFPGAQLI-YEPGC 434
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 128/276 (46%), Gaps = 53/276 (19%)
Query: 454 IPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADA 503
+ A+I K+AD V G+ S+E E G DR D+ LP Q L+ + DA
Sbjct: 585 LSASIAKVKDADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQALKDA 644
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K V V S A+ + ++IL YPG+ GG A+A+V+ G YNP GRLP+T
Sbjct: 645 GK-KVVFVNFSGSAMGL--VPETQSCEAILQAWYPGQAGGTAVANVLLGNYNPSGRLPVT 701
Query: 564 WYEANYVKIP-YTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
+Y+ N ++P + ++ GRTY++ ++ FGYGLSYT+F
Sbjct: 702 FYK-NVAQLPDFEDYSMK------GRTYRYMTEKPLFSFGYGLSYTKF------------ 742
Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
+GT K +++ ++ + V N GK+ G+EV+ VY
Sbjct: 743 --------------VLGTAKLNKSSIKANET------LKITVPVTNAGKVAGTEVLQVYV 782
Query: 683 KPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNAC 718
+ K + G+++V I G+++++ + +
Sbjct: 783 RKVKDVDGPAKTLRGFKKVNIEPGKTSQISIDLTSS 818
>gi|357047866|ref|ZP_09109459.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
gi|355529205|gb|EHG98644.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
YIT 11840]
Length = 676
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 162/449 (36%), Positives = 227/449 (50%), Gaps = 46/449 (10%)
Query: 10 SDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFI 69
D PY + L ERA DL++R+TL EK+ M + + GV RLG+ Y WWSEALHGV+
Sbjct: 21 QDEPYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALHGVARN 80
Query: 70 GRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA---------M 120
G AT +P + + F++ L + I TVS E RA
Sbjct: 81 GL----------------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGR 124
Query: 121 YNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDS 180
Y GN GLTFW+PN+N+ RDPRWGR ET GEDPY+ R + VRG+Q +
Sbjct: 125 YGRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQG--------PA 176
Query: 181 DSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTE-QDMQETFILPFEMCVNEGDVSSV 239
D++ K AC KHYA + R FD E +D+ ET++ F+ V E DV V
Sbjct: 177 DAKYDKTHACAKHYAVHSGPE---AKRHSFDVENLEPRDLWETYLPAFKALVQEADVKEV 233
Query: 240 MCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVAR 299
MC+Y R G P C +LL Q +R +W + +VSDC +I ++ ++T DA
Sbjct: 234 MCAYQRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF--YQGRHETHPDAATS 291
Query: 300 VLKA---GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QY 354
A G DL+CG Y + AV++G I E IDTSLR L LG D +
Sbjct: 292 SASAVINGTDLECGVEYAHLDE-AVERGLITEHRIDTSLRRLLEARFALGEMDDDALVPW 350
Query: 355 KNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIG 414
+ + + H +A + R+ +VLL N NG LPL+ G+ +A++GP+A + G
Sbjct: 351 SRISIDTVDCGTHRRMALDVTRKSMVLLHN-NGILPLDKGDAGKIAVMGPNAVDSVMQWG 409
Query: 415 NYEGTPCRYTSPMDGFYAYSKVINYAPGC 443
NY+G P + ++G + Y GC
Sbjct: 410 NYKGVPAHTYTILEGIRGAIGNVPYEKGC 438
>gi|336255157|ref|YP_004598264.1| beta-glucosidase [Halopiger xanaduensis SH-6]
gi|335339146|gb|AEH38385.1| Beta-glucosidase [Halopiger xanaduensis SH-6]
Length = 774
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 223/821 (27%), Positives = 367/821 (44%), Gaps = 160/821 (19%)
Query: 8 KLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL--------GLPLYEWW 59
+LS Y D R +DL+ERMT+ EK Q+G + RL + EW
Sbjct: 4 ELSTAAYQDESESVENRVEDLLERMTVEEKAAQLGSV--NADRLLDEDGEIDWDAVDEWL 61
Query: 60 SEALHGVSFIGR-------------RTNSPPGTHFDSEV------------------PGA 88
+ HG+ R R + T+ E P A
Sbjct: 62 A---HGIGHFTRLGGEGSLAPSEAARVTNELQTYLREETRLGIPAIPHEECLSGYMGPEA 118
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
T+FP ++ +S+N L + + +T+ E G + SP ++V RD RWGRV E
Sbjct: 119 TTFPQMLGMASSWNPELLQTVTETIRGELE-----GIGTVHALSPVLDVARDLRWGRVEE 173
Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRF 208
T GEDPY+V A YV GLQ D R ISA KH+ + + G +R
Sbjct: 174 TFGEDPYMVAEMARAYVSGLQ----------GDGRADGISATLKHFVGHGATDG-GKNRS 222
Query: 209 HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF 268
+ V ++++ET + P+E ++E + SVM +Y+ ++G+P LL + +RG++ F
Sbjct: 223 SLN--VGPRELRETHLFPYEAVISEANAESVMNAYHDLDGVPCANSEWLLTEVLRGEFGF 280
Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKI 326
G +VSD S++ +V H+ + TK +A + L+AG+D++ +YY + AV++G +
Sbjct: 281 DGTVVSDYYSVRHLVTEHETAS-TKPEAAVQALEAGIDVELPYTEYYGEHLVEAVEEGDL 339
Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
AE ++ S+R + R G FD + + + E+ EAARQ + LLKN++
Sbjct: 340 AEETLNESVRRILREKFRKGVFDDPAVDVDAAADAFHTDEAREVTREAARQSMTLLKNED 399
Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNY--------EGTPCRYTSPMDGFYAYSKV-I 437
++ +A+VGP A+ K ++G+Y E +P++ A + +
Sbjct: 400 DL---LPLDVDDVAVVGPKADNPKELMGDYAYAAHYPEEEYEADAVTPLEALEARDGLDV 456
Query: 438 NYAPGCA-----------------------DIVCQNNSMIPAAIDAAKNADATVIVAGLD 474
Y GC V +++ + ++A K +V +G
Sbjct: 457 TYEQGCTISGPSTDGFDAAADAAADADVALAFVGARSAVDFSDVEAEKEEKPSVPTSG-- 514
Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI-NFAKNNPKIKSIL 533
EG D L LPG Q EL+ ++ + PV +V++S I + A P +IL
Sbjct: 515 -----EGCDVTHLGLPGVQEELVAELLE-TDTPVVVVLVSGKPHAIEDIAAEAP---AIL 565
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPV-----NNFPGR 588
+ PG+EGG AIA+ +FG+ NP G+LP++ +P + L PV N +
Sbjct: 566 YAWLPGDEGGTAIAETLFGENNPAGKLPVS--------LPKSVGQL-PVYYNRKENTANK 616
Query: 589 TYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAV 648
Y + D VYPFG+G SYT+F+Y D+ L D P +
Sbjct: 617 DYVYTDSEPVYPFGHGESYTEFEYG--------DVSLSTDSVT------------PLGS- 655
Query: 649 LIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQ 707
FT + V N+G G E+V Y + + +++++G+ERV + G+
Sbjct: 656 -----------FTASVTVANVGDRAGDEIVQCYGRATNASQARPVQELLGFERVSLEPGE 704
Query: 708 SAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
S +V F ++A + L D + N + G + I +G G+
Sbjct: 705 SKRVAFDLSATQ-LAFHDLSMNLAVEEGPYEIRIGRSADGI 744
>gi|285808617|gb|ADC36136.1| glycoside hydrolase family 3 protein [uncultured bacterium 253]
Length = 752
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 191/641 (29%), Positives = 319/641 (49%), Gaps = 69/641 (10%)
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
T FP + +S++ + ++ + EARA AG+ + ++P +++ RDPRWGR+
Sbjct: 117 TIFPIPLAEASSWDPTSAERSTSIAAREARA------AGVRWTFAPMLDIARDPRWGRIT 170
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GED ++ +A VRG Q G +Y S P K+ AC KH+ AY EG R
Sbjct: 171 EGAGEDQFLGAAFARARVRGFQ---GTDY-----SAPDKMLACAKHWVAYGAT--EGG-R 219
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
+ + ++E ++E + PF+ V+ G V +VM +N +NG+P A+ L + +RG+W
Sbjct: 220 DYNTTDMSENTLREIYFPPFKAAVDAG-VGTVMSGFNDLNGVPVSANHFTLTEVLRGEWK 278
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
F G++VSD S++ ++ D +DA L AG+D++ + +++GK+
Sbjct: 279 FDGFVVSDYTSVKELINHGLAFGD--QDAARLALNAGVDMEMVSRLFNQQGPQLLKEGKV 336
Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
+ A ID ++R + + RLG F + ++ ++ A A + +VLLKN+
Sbjct: 337 SPATIDEAVRRILRIKFRLGLFANPYADEARETTSLLTSENRAAARALADRSMVLLKNEG 396
Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGFYAY---SKVINYAP 441
G LPL+ G I+++A++GP A+ +A +G + +G P +P+ G A + +NYA
Sbjct: 397 GTLPLSKG-IRSIAVIGPLADDHRAPLGWWSGDGKPEDTVTPLMGIRAKVSPATKVNYAK 455
Query: 442 GCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVA 501
GC D+ + I A+ A+ ++ ++ G + E + L L G Q +L+ V
Sbjct: 456 GC-DVQGDSTGDIAEAVAVARESELAIVFVGESAEMVGEAASKSSLDLTGCQMDLVKAVQ 514
Query: 502 DAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLP 561
K P +V+++ + + + +N W+G G E G AIADV+FG NPGG+LP
Sbjct: 515 ATGK-PTIVVLINGRPLTVGWIFDNTPAVLEAWMG--GTEAGNAIADVLFGDANPGGKLP 571
Query: 562 ITW-YEANYVKIPYTSMPL-RPVNNFPGRTYKFFDGPVV--YPFGYGLSYTQFKYKVASS 617
+TW V I Y M RP T K+ D P + FGYGLSYTQFK
Sbjct: 572 VTWPRTVGQVPIYYNHMNTGRPPEANNRYTSKYLDVPWTPQFCFGYGLSYTQFKI----- 626
Query: 618 PKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEV 677
+++L + P +A K T +EVEN+GK G EV
Sbjct: 627 ---TNLQL---------------SAPRISAT---------GKLTASVEVENVGKRAGDEV 659
Query: 678 VMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNA 717
V +Y + T +K++ G++R+ + G+ +V F + +
Sbjct: 660 VQLYIHDVAASMTRPVKELKGFQRITLQPGEKKRVEFVLTS 700
>gi|298374091|ref|ZP_06984049.1| thermostable beta-glucosidase B [Bacteroides sp. 3_1_19]
gi|298268459|gb|EFI10114.1| thermostable beta-glucosidase B [Bacteroides sp. 3_1_19]
Length = 732
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 221/776 (28%), Positives = 362/776 (46%), Gaps = 142/776 (18%)
Query: 18 KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
K+ +R + L+++MTL EKV + G+ + GV RLG+P EW S+ HGV + I R
Sbjct: 28 KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
+ G DS A+ FPT A++N L + G+ + EAR
Sbjct: 86 HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
P +N++R P GR E EDPY+ A+ Y++GLQ RD ++
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+A ++N E N R D +E+ ++E ++ F+ V EG +VM +YN+ G
Sbjct: 183 KHFA---VNNQETN-RTTIDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L+ + +R +W F G V+D + + V S ++AGLDL+ G
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283
Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
YY N + AV+ GK+ + +D + + V+++ D P+ K G ++
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
+H + +AA + IVLLKN N LPL+ +IK+LA++G H+N + + Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400
Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
E TP ++ +D +A Y K+ + G + ++++++ A++
Sbjct: 401 EVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A+ +D ++V GL+ + E DR+++ +P Q ELI +V A P T+VIM AG+ +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVIMIAGS-PL 517
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
N A + +I+W + G EGG A+ DV+ GK NP G++P T + P
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571
Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ NFPGR Y++FD PVVYPFGYGLSYT F Y L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFNYS----------NL 621
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+ D++ D T+ + TF + N G +G+EV +Y P
Sbjct: 622 NTDKKTYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659
Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
+ +K++ G+++VF+ G+S ++ + SL A + + IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714
>gi|225871719|ref|YP_002753173.1| glycosyl hydrolase family, 3 [Acidobacterium capsulatum ATCC 51196]
gi|225793416|gb|ACO33506.1| glycosyl hydrolase family, 3 [Acidobacterium capsulatum ATCC 51196]
Length = 776
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 194/661 (29%), Positives = 318/661 (48%), Gaps = 93/661 (14%)
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
T FP + AS++ + + + EAR++ G+ + ++P +++ RDPRWGR++
Sbjct: 133 TIFPVPLAQAASWDPVMVSRDQSIAAMEARSV------GIDWAFAPMVDIARDPRWGRMV 186
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E G DPY+ A VRG Q P I AC KH+A Y EG R
Sbjct: 187 EGAGSDPYLGAAMAAAQVRGFQGA--------YPGAPNHILACAKHFAGYGAA--EGG-R 235
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
+ S +++ + ++ PF V G V+++M +Y +N +P + LL +R DW
Sbjct: 236 DYDASYISDSQLWNVYLPPFHAAVKAG-VATLMSAYMDLNDVPATGNQWLLQDVLRRDWK 294
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG---DYYTNFTMGAVQQG 324
F GY+VSD ++++ + ++H F D +EDA R KAG++++ Y + A+QQG
Sbjct: 295 FDGYVVSDANAVRNL-QTHGFAQD-QEDAAVRAFKAGVNMEMAIGQTAYDSELSKALQQG 352
Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIVLL 382
I +D ++R + + MRLG F+ Y ++ ++ + +P H A AA + VLL
Sbjct: 353 VITGQQLDDAVRPILEMKMRLGLFEHP--YVDVARSQRILDDPAHRTAARIAAERSAVLL 410
Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIG--NYEGTPCRYTSPMDGF---YAYSKVI 437
+N+ G LPLN +A++GP A++ + +G ++ + + G + S I
Sbjct: 411 RNEGGLLPLNKTRYHNIAVIGPLADSQRDTLGPWTFDENLSETVTVLQGLRNAFGASAKI 470
Query: 438 NYAPGCADIVCQNNSMIPA---------------------AIDAAKNADATVIVAGLDLS 476
YAPG A + + SM A AID A+ +D V+V G +
Sbjct: 471 TYAPG-AQMHRKFPSMFDALDRGKKPPVWTPAQARQQMQQAIDLARKSDLVVMVLGEHQN 529
Query: 477 VEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVG 536
+ E L LPG Q +L+ VA K P+ LV+M+ ++I +A + + +IL V
Sbjct: 530 MSGEAASSDSLKLPGDQEQLLQSVAATGK-PLVLVLMNGRPLNIKWAALH--VPAILDVW 586
Query: 537 YPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKFFDG 595
YPG +GG A+A+++ GK PGG+LP W + V IPY N R +
Sbjct: 587 YPGSQGGNAVANLLLGKSVPGGKLPFDWPRDVGQVPIPYAHNLTHEPQNQARRYWDEAST 646
Query: 596 PVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKC 655
P +YPFGYGLSYT F + +++DK + +DV
Sbjct: 647 P-LYPFGYGLSYTAFAFS--------HLQIDKSSVSKK-----------------EDVHV 680
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
++V N GK+ G EV +Y + G A ++++ G+ER+ + GQ+ + FT
Sbjct: 681 S-------VDVTNTGKLAGDEVAQLYIHQEYGNASRPVRELKGFERITLQPGQTKTLQFT 733
Query: 715 M 715
+
Sbjct: 734 L 734
>gi|262383006|ref|ZP_06076143.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
gi|262295884|gb|EEY83815.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
Length = 732
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 216/747 (28%), Positives = 354/747 (47%), Gaps = 141/747 (18%)
Query: 18 KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
K+ +R + L+++MTL EKV + G+ + GV RLG+P EW S+ HGV + I R
Sbjct: 28 KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
+ G DS A+ FPT A++N L + G+ + EAR
Sbjct: 86 HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
P +N++R P GR E EDPY+ A+ Y++GLQ RD ++
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+A ++N E N R D +E+ ++E ++ F+ V EG +VM +YN+ G
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L+ + +R +W F G V+D + + V S ++AGLDL+ G
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283
Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
YY N + AV+ GKI + +D + + V+++ D P+ K G ++
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKIPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
+H + +AA + IVLLKN N LPL+ +IK+LA++G H+N + + Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400
Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
E TP ++ +D +A Y K+ + G + ++++++ A++
Sbjct: 401 EVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A+ +D ++V GL+ + E DR+++ +P Q ELI +V A P T+V+M AG+ +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
N A + +I+W + G EGG A+ DV+ GK NP G++P T + P
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571
Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ NFPGR Y++FD PVVYPFGYGLSYT F Y L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFNYS----------NL 621
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+ D++ D T+ + TF + N G +G+EV +Y P
Sbjct: 622 NTDKKTYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659
Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKV 711
+ +K++ G+++VF+ G+S ++
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRI 686
>gi|301307693|ref|ZP_07213650.1| thermostable beta-glucosidase B [Bacteroides sp. 20_3]
gi|423337298|ref|ZP_17315042.1| hypothetical protein HMPREF1059_00967 [Parabacteroides distasonis
CL09T03C24]
gi|300834367|gb|EFK64980.1| thermostable beta-glucosidase B [Bacteroides sp. 20_3]
gi|409237758|gb|EKN30554.1| hypothetical protein HMPREF1059_00967 [Parabacteroides distasonis
CL09T03C24]
Length = 732
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 221/776 (28%), Positives = 362/776 (46%), Gaps = 142/776 (18%)
Query: 18 KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
K+ +R + L+++MTL EKV + G+ + GV RLG+P EW S+ HGV + I R
Sbjct: 28 KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
+ G DS A+ FPT A++N L + G+ + EAR
Sbjct: 86 HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
P +N++R P GR E EDPY+ A+ Y++GLQ RD ++
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+A ++N E N R D +E+ ++E ++ F+ V EG +VM +YN+ G
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L+ + +R +W F G V+D + + V S ++AGLDL+ G
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283
Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
YY N + AV+ GKI + +D + + V+++ D P+ K G ++
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKIPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
+H + +AA + IVLLKN N LPL+ +IK+LA++G H+N + + Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400
Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
E TP ++ +D +A Y K+ + G + ++++++ A++
Sbjct: 401 EVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A+ +D ++V GL+ + E DR+++ +P Q ELI +V A P T+V+M AG+ +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
N A + +I+W + G EGG A+ DV+ GK NP G++P T + P
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571
Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ NFPGR Y++FD PVVYPFGYGLSYT F Y L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFNYS----------NL 621
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+ D++ D T+ + TF + N G +G+EV +Y P
Sbjct: 622 NTDKKTYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659
Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
+ +K++ G+++VF+ G+S ++ + SL A + + IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714
>gi|51507369|emb|CAH18932.1| beta-xylosidase [Pyrus communis]
Length = 238
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 126/239 (52%), Positives = 171/239 (71%), Gaps = 4/239 (1%)
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP---QYKNLGKNNICNPQHIELAAEAARQ 377
++ G++ E DI+ +L V MRLG FDG P +Y NLG ++C P ELA EAARQ
Sbjct: 1 MRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQ 60
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKVI 437
GIVLL+N +LPL+T +T+A++GP+++ T+ MIGNY G C YT+P+ G Y++ I
Sbjct: 61 GIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGVACGYTTPLQGIARYTRTI 120
Query: 438 NYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELI 497
+ A GC D+ C N +I AA AA+ ADATV+V GLD S+EAE +DR +LLLPG Q EL+
Sbjct: 121 HQA-GCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTNLLLPGHQQELV 179
Query: 498 NKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
++VA A++GP LVIMS G +D+ FAKN+P+I +I+WVGYPG+ GG AIADV+FG NP
Sbjct: 180 SRVARASRGPTILVIMSGGPIDVMFAKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNP 238
>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
Length = 819
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 223/804 (27%), Positives = 360/804 (44%), Gaps = 135/804 (16%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
+ + K P +R +DL+ +M L EK Q+ L YG R+ LP EW W
Sbjct: 53 FENPKQPIEKRVQDLLSQMNLDEKTCQLATL-YGYKRVMSDSLPTPEWKNKIWKDGIANI 111
Query: 61 -EALHGVS--------------------------FIGRRTNSPPGTHFDSEVPG-----A 88
E L+GV FI P + + G A
Sbjct: 112 DEQLNGVGRGAKIAQDLIYPFSKHAEAINKTQKWFIEETRLGIPVDFSNETIHGLNHTKA 171
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
T P I +++N L K G EA+A+ G T ++P +++ RDPRWGRVL
Sbjct: 172 TPLPAPIGIGSTWNAPLVYKAGSIAGKEAKAL------GYTNIYAPILDLARDPRWGRVL 225
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GEDP++V V+G+Q+ +GV +A KH+A Y + +
Sbjct: 226 ECYGEDPFLVATLGTQMVKGIQE-QGV-------------AATLKHFAVYSVPKGGRDGS 271
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D V ++M + + PF+ + + VM SYN +G+P A L Q +R ++
Sbjct: 272 VRTDPHVAPREMHQMHLYPFKKVIQDAHPMGVMSSYNDWDGVPVTASYYFLTQLLRQEFG 331
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAVQQ 323
F GY+VSD D+++ + H + +T E+AV VL+AGL++ D + V++
Sbjct: 332 FDGYVVSDSDAVEYVYNKH-HVAETYEEAVRMVLEAGLNVRTTFAAPDIFILPARKLVKE 390
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP-QHIELAAEAARQGIVLL 382
G+++ ID + + V RLG FD + I ++ + + RQ +VLL
Sbjct: 391 GRLSMKVIDERVADVLRVKFRLGLFDQPFVADPKAADKIVGADKNKDFVLDIQRQSLVLL 450
Query: 383 KNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAY--SKV-INY 439
KN+N LPL+ + + + GP A M+ Y + +G Y +KV ++Y
Sbjct: 451 KNENNLLPLDKNKLSRILITGPLAKEENYMVSRYGPQELENITVYEGIKNYLGNKVAVDY 510
Query: 440 APGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRV 485
A GC + + + I A++ AK +D + V G D E K R
Sbjct: 511 ALGCKVKDAKWPESEIIHSPLTTEEQQEIQNAVEKAKLSDIVIAVLGEDEESTGESKSRS 570
Query: 486 DLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRA 545
L LPG Q +L+ + K PV LV+++ + IN+A + I +IL +PG+ GG A
Sbjct: 571 GLDLPGRQQQLLEALYATGK-PVVLVLINGQPLTINWA--DRYIPAILEAWFPGQMGGTA 627
Query: 546 IADVIFGKYNPGGRLPITW------YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
IA+ +FG YNPGG+LP+T+ E N+ P S +P G +G +Y
Sbjct: 628 IAETLFGDYNPGGKLPVTFPKTLGQIELNFPFKP-ASQSKQPEAGPNGYGKTRVNG-ALY 685
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFG+GLSYT F+Y ++K+ ++Q D +
Sbjct: 686 PFGFGLSYTTFEYS--------NLKVSPERQGPK----------------------GDIQ 715
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVI-GYERVFIAAGQSAKVGFTMNAC 718
+F ++ N GK G E+V +Y K + + ++ G+ERV + G++ + FT++
Sbjct: 716 VSF--DITNTGKRAGDEIVQLYVKDKVSSVISYESLLRGFERVSLQPGETKNIQFTLHP- 772
Query: 719 KSLKIVDNAANSLLASGAHTILVG 742
+ L+I+D N + G + +G
Sbjct: 773 EDLEILDINMNWNVEPGEFEVRIG 796
>gi|333377833|ref|ZP_08469566.1| hypothetical protein HMPREF9456_01161 [Dysgonomonas mossii DSM
22836]
gi|332883853|gb|EGK04133.1| hypothetical protein HMPREF9456_01161 [Dysgonomonas mossii DSM
22836]
Length = 780
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 203/702 (28%), Positives = 329/702 (46%), Gaps = 97/702 (13%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAM-YNLGNAGLTFWSPNINVVRDPRWGR 145
G T FPT I A++N +L +++ +S EAR+ ++G + P +++ R+ RW R
Sbjct: 144 GTTVFPTAIGQAATWNPNLIQQMSAVISKEARSQGSHIG------YGPVLDLAREARWSR 197
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
V ET GEDP ++ + +V G + S+P + + KH+ AY + + N
Sbjct: 198 VEETYGEDPVLISKMGEAFVTG--------FGSGDLSKPYSLISTLKHFVAYGIPDGGHN 249
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
+ V +D++E ++ PFE V G +S VM +YN V+GIP ++ LL + D
Sbjct: 250 GN---SNSVGMRDLKENYLPPFEKAVKAGALS-VMTAYNSVDGIPCTSNEYLLKDVLCKD 305
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGK 325
W F G+ VSD SI+ + SH ++ +E A+ L +GLD D G AV++G
Sbjct: 306 WGFKGFTVSDLGSIEGLKGSHYVVSTIQEAAILS-LTSGLDCDLGGNAFFTLSDAVKKGM 364
Query: 326 IAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKND 385
+ E ID+++ + + +G F+ +N + + ++I LA + AR+ IVLL+N
Sbjct: 365 VGETQIDSAVYKILKLKFDMGLFENPYVDENNARQVVRTQENIVLARQVARESIVLLENK 424
Query: 386 NGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGFYAYSK--VINYAP 441
N LPLN IK +A++GP+A+ +G+Y + +DG + K I Y
Sbjct: 425 NNVLPLNKSKIKKIAVIGPNADNVYNQLGDYTAPQDDSNVKTVLDGIRSKLKQSQIEYVK 484
Query: 442 GCADIVCQNNSMIPAAIDAAKNADATV---------------IVAGLDLSVE-------- 478
GCA I N+ I A+ AA +D V I G ++ E
Sbjct: 485 GCA-IRDTLNTDIDKAVQAALRSDVAVVVVGGSSARDFKTKYIETGAAVADEHSISDMES 543
Query: 479 AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYP 538
EG DRV L L G Q EL+ + K PV +V + +++N+A N ++L YP
Sbjct: 544 GEGFDRVSLDLMGKQLELLKAIKATGK-PVVVVYIQGRPLNMNWASENA--DALLSAWYP 600
Query: 539 GEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPG-RTYKFFDGPV 597
G+EGG AIADV+FG+YNP GRLP++ V +P+ + P Y
Sbjct: 601 GQEGGNAIADVLFGEYNPAGRLPMS------VAKSVGQLPVYYNHRNPASHDYVEMTSKP 654
Query: 598 VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKD 657
+Y FGYGLS+T F+Y ++K++K ++
Sbjct: 655 LYSFGYGLSFTSFEYS--------NLKINKSNSGVEVT---------------------- 684
Query: 658 YKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMN 716
+E+ N G DG EVV +Y + + I Q+ +ERV + G++ + +
Sbjct: 685 ------VELRNSGNFDGDEVVQLYLRNNRASVVQPIMQLKAFERVNLKKGETKTIKLLLT 738
Query: 717 ACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSFPLQLNLN 757
I+D N ++ +G T +VG + ++ LN
Sbjct: 739 K-DDFSIIDKKMNRVVEPNGDFTFMVGSASDNIKLREKMMLN 779
>gi|409197254|ref|ZP_11225917.1| glycoside hydrolase 3 [Marinilabilia salmonicolor JCM 21150]
Length = 734
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 209/734 (28%), Positives = 344/734 (46%), Gaps = 102/734 (13%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPR--------LGLPLYEWWSEALHGVSFIG---R 71
ER + L+ MTL EK+ QM ++ G +G L E E ++ + I
Sbjct: 22 ERVEQLLGEMTLDEKIGQMCQVSGGQGNEESIRQGMIGSILNEVDPENINRLQKIAVEES 81
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF- 130
R P D T FP + A++N L +K + ++EA + G+ +
Sbjct: 82 RLGIPIIVARDVIHGFKTVFPIPLGQAATWNPELVQKGSRIAASEA------ASTGVRWT 135
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
++P I++ RD RWGR+ E+ GEDPY+ V G Q DS + I+AC
Sbjct: 136 FAPMIDISRDARWGRIAESLGEDPYLTSVLGAAMVTGFQG--------DSLNGETSIAAC 187
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
KH+A Y EG ++ S + +++++ ++ PF+ V+ G V + M +N V+G+P
Sbjct: 188 AKHFAGYGAA--EGGRDYNTTS-IPPRELRDIYLPPFKAAVDAG-VRTFMSGFNEVDGVP 243
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG 310
A+ LL +R +W F G++VSD S ++ +H F D KE A R +K G+D++
Sbjct: 244 ATANKYLLTDVLRNEWQFDGFVVSDWASTWEMI-NHGFAADEKE-AAHRAIKVGVDMEMA 301
Query: 311 DY-YTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIE 369
Y + +++G + DI+ ++R + V LG FD +P +N P+++E
Sbjct: 302 TTTYRDNIAALLKEGALNIEDINQAVRNILRVKFELGLFD-NPYIAEEKQNQFARPEYLE 360
Query: 370 LAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGN--YEGTPCRYTSPM 427
A AA Q +VLLKN+ LP+N+ + +AL+GP A+ +G ++G +P+
Sbjct: 361 AANLAATQSMVLLKNEQKTLPINSSS--KIALIGPMADQPYEQLGTWIFDGDTTLTVTPL 418
Query: 428 DGF---YAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDR 484
F + V+ +A G ++ AI+ AKN+D V G + + E R
Sbjct: 419 QAFNKTFGQENVL-FAEGMPISRTRHQKGFRKAIEQAKNSDVIVFCGGEESILSGEAHSR 477
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
++ LPG Q ELI ++ K P+ LV+M+ + I + + ++++ +PG GG
Sbjct: 478 ANIDLPGVQNELIKELKKTGK-PLVLVVMAGRPLTI--GEISEHADAVVYAWHPGTMGGA 534
Query: 545 AIADVIFGKYNPGGRLPITWYE-ANYVKIPYTSMPL-RPVNNFPGRTYKFFDGPV----- 597
A+AD++ GK NP G+LP+T+ + + I Y RP N P + +D PV
Sbjct: 535 ALADIVSGKANPSGKLPVTFPKVVGQIPIYYNHKNTGRPAN--PDSWTQMYDIPVKAPQT 592
Query: 598 ---------------VYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNK 642
+YPFGYGLSYT F+Y D+ LDK+ RD
Sbjct: 593 SLGNESHYIDAGFIPLYPFGYGLSYTSFEYS--------DLSLDKEVYARD--------- 635
Query: 643 PPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERV 701
+ + +FT N G+ G EV VY + G +K++ +ER+
Sbjct: 636 -----------ETIEVRFTLS----NTGEFAGEEVAQVYVRDLVGNVTRPVKELKAFERI 680
Query: 702 FIAAGQSAKVGFTM 715
+ G+S V T+
Sbjct: 681 DLQKGESKTVTLTI 694
>gi|224537403|ref|ZP_03677942.1| hypothetical protein BACCELL_02281 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520981|gb|EEF90086.1| hypothetical protein BACCELL_02281 [Bacteroides cellulosilyticus
DSM 14838]
Length = 750
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 205/742 (27%), Positives = 347/742 (46%), Gaps = 109/742 (14%)
Query: 29 VERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGA 88
V +T P ++ +A RLG+PL + +HG I
Sbjct: 75 VMSITDPNIFNEVQRIAVEDSRLGIPLINA-RDVIHGFKTI------------------- 114
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
FP + ASFN + + + +TEA A AG+ + ++P I++ DPRWGR+
Sbjct: 115 --FPIPLGQAASFNPEIAETGARIAATEASA------AGIRWTFAPMIDITHDPRWGRIA 166
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GEDP +V + + ++G Q S + P I+AC KH+A Y EG R
Sbjct: 167 EGFGEDPLLVSQMGVAAIKGFQG--------SSLNHPTSIAACAKHFAGYGAS--EGG-R 215
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
+ + +TE+ + ++ PFE VN G +++M ++N +GIP+ A+P LL +R +WN
Sbjct: 216 DYNSTYITERQFRNLYLRPFEAAVNAG-AATLMTAFNDNDGIPSSANPFLLKDVLRNEWN 274
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
+ G +VSD S+ ++ H F D KE A+ + AG D++ + Y + +++GK+
Sbjct: 275 YRGTVVSDWASVSEMIR-HGFCEDEKEAAL-KATNAGTDIEMVSETYIKYLPQLIKEGKV 332
Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
+ ID ++R + + RLG F+ P + K P +E A AA Q VLLKN+
Sbjct: 333 SMETIDNAVRNILRLKFRLGLFE-HPYIADQRKETFYRPDFLEAAQTAAEQSAVLLKNER 391
Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGN--YEGTPCRYTSPMDGFYAYS----KVINYA 440
G LP+ + NIKT+ + GP A+A +G ++G +P+ S KV+ YA
Sbjct: 392 GTLPIQS-NIKTILVTGPLADAPHEQLGTWVFDGDASYSQTPLQALRRTSGDSIKVL-YA 449
Query: 441 PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKV 500
PG S ++ A+ AD + G + + E +L L G Q+ L++++
Sbjct: 450 PGLNYSRDTATSQFNKVVELAREADLILAFVGEEAILSGEAHCLANLNLQGAQSRLLHRL 509
Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
++ K P+ V+M+ + I N ++L+ +PG GG A+A+++FGK P G+L
Sbjct: 510 SETGK-PLVTVVMAGRPLTIGREVNIS--DALLYAFHPGTMGGPALANLLFGKVVPSGKL 566
Query: 561 PITW-YEANYVKIPYT----------------SMPLRPVNNFPGRTYKFFDGPV--VYPF 601
P+T+ E + I Y ++P+ G T + D ++PF
Sbjct: 567 PVTFPKETGQIPIYYNHTSTGRPASGSEKNIFTIPVGAEQTSLGNTSFYLDAGKDPLFPF 626
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
GYGLSYT F Y +++L Q R+ V+I T
Sbjct: 627 GYGLSYTTFAYS--------NLQLSSTQYTRN-------------EVII---------IT 656
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKS 720
F ++ N GK DG+E+ +Y + + T +K++ +ER+ + AG++ + + K
Sbjct: 657 F--DLTNTGKTDGTEIAQLYFRDLAASVTRPVKELAAFERIHLKAGETRHIRMEL-PVKQ 713
Query: 721 LKIVDNAANSLLASGAHTILVG 742
L + A + + G + +G
Sbjct: 714 LSFWNYAMDYCVEPGKFDLWIG 735
>gi|393784338|ref|ZP_10372503.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
CL02T12C01]
gi|392666114|gb|EIY59631.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
CL02T12C01]
Length = 857
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 218/807 (27%), Positives = 360/807 (44%), Gaps = 162/807 (20%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQM-----------------------GDLAYGV 48
Y + LP ER DL+ RMTL EK+ Q+ G +++G
Sbjct: 26 LSYRQSSLPISERVDDLLGRMTLEEKIAQIRHIHSWNVFNGQDLDMEKLGKFTGGVSWGF 85
Query: 49 -------------------------PRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDS 83
RLG+P++ +E+LHG S
Sbjct: 86 VEGFPLTGVNCKKNMQLIQKFMVENTRLGIPVFTV-AESLHG-----------------S 127
Query: 84 EVPGATSFPTVILTTASFNESLWKKIGQTVSTE--ARAMYNLGNAGLTFWSPNINVVRDP 141
G+T +P I ++F L + ++ + A+ M+ + +P I+VVRD
Sbjct: 128 VHEGSTIYPQNIAMGSTFRPELAYRKAAMITKDLHAQGMHQV-------LAPCIDVVRDL 180
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RWGRV E+ GEDP + G + I V+G D IS KHY +
Sbjct: 181 RWGRVEESFGEDPVLCGLFGIAEVKGYMDN--------------GISPMLKHYGPH---- 222
Query: 202 WEGNDRFHFDSRVTE---QDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLL 258
GN + E +D+ E ++ PFEM + V +VM +YN N +P A LL
Sbjct: 223 --GNPLSGLNLASVECGLRDLHEVYLKPFEMVIRNTPVLAVMSTYNSWNHVPNSASHYLL 280
Query: 259 NQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTM 318
+ +RG + F GY+ SD +I+ + H+ +++ E+A + AGLD++
Sbjct: 281 TEVLRGQFGFKGYVYSDWGAIEMLKTLHRVAHNS-EEAAMQAFTAGLDVEASSNCYPLLA 339
Query: 319 GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQG 378
G +Q+GK+ E ++ S+R + ++G F+ P + + + + I L+ E A +
Sbjct: 340 GLIQKGKLDEEVLNESVRRVLYAKFKMGLFE-DPYGEQYSHSEMHGAESIRLSKEIADES 398
Query: 379 IVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRY--TSPMDG---FYAY 433
+VLLKN+NG LPLN +K++A++GP NA + G+Y + +P++G
Sbjct: 399 VVLLKNENGLLPLNADKLKSVAVIGP--NADQVQFGDYTWSRNNKDGVTPLEGIRRLLGG 456
Query: 434 SKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG---------LDLSVEAEGKDR 484
+ YA GC D+V N I A++AA+ ++ ++ G S EG D
Sbjct: 457 KATVRYAKGC-DLVSLNAGGIKEAVEAARKSEVAILFCGSASAALARDYKSSTCGEGFDL 515
Query: 485 VDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGR 544
DL L G Q +LI +V + PV LV+++ I++ K + I +IL Y GE+ G
Sbjct: 516 NDLNLTGVQGQLIKEVYETGT-PVVLVLVTGKPFAISWEKKH--IPAILTQWYAGEQAGN 572
Query: 545 AIADVIFGKYNPGGRLPITWYEAN-YVKIPYTSMPLRP-------VNNFPGRTYKFFDGP 596
+IAD++FG +P GRL ++ + ++ + Y +P PGR Y F
Sbjct: 573 SIADILFGSISPSGRLTFSYPQTTGHLPVYYNYLPSDKGFYKNPGSYESPGRDYVFSSPD 632
Query: 597 VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCK 656
++ FG+GL+YT F YK L D++ +N T + ID
Sbjct: 633 ALWAFGHGLTYTSFVYK----------NLRTDKEHYGLNDT----------IYID----- 667
Query: 657 DYKFTFQIEVENMGKMDGSEVVMVY-SKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTM 715
++++N GK +G EVV +Y + T +KQ+ +++V + AG++ V +
Sbjct: 668 -------VDIKNTGKREGKEVVQLYVNDKVSTVVTPVKQLRDFKKVDVEAGKTETVKLKV 720
Query: 716 NACKSLKIVDNAANSLLASGAHTILVG 742
A L IV+ ++ G + VG
Sbjct: 721 -AVNDLYIVNAGNKRVVEPGEFELQVG 746
>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
Length = 868
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 164/460 (35%), Positives = 238/460 (51%), Gaps = 51/460 (11%)
Query: 2 FESIKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSE 61
F + + + PY + +L ERA DL+ RMTL EK QM + G+ RLG+ Y+WW+E
Sbjct: 14 FSAFSFRAENPPYKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWWNE 73
Query: 62 ALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARA-- 119
ALHG++ G+ AT FP I A+F+++ ++ VS E RA
Sbjct: 74 ALHGIARAGK----------------ATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKY 117
Query: 120 -------MYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVE 172
MYN G GLTFW+PNIN+ RDPRWGR +ET GEDP++ + + V+GLQ
Sbjct: 118 HDFQRKGMYN-GYKGLTFWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQG-- 174
Query: 173 GVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVN 232
D + K AC KHYA + W N + ++ +D++ET++ F+ V
Sbjct: 175 ------DGTQKYDKAHACAKHYAVHSGPEW--NRHSYNAENISIRDLRETYLPAFKALVT 226
Query: 233 EGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV-----ESHK 287
EG V VMC+YNR G P C++ LL ++ +W F IVSDC +I E+H
Sbjct: 227 EGKVKEVMCAYNRFEGEPCCSNKTLLINILKDEWGFDDVIVSDCGAIADFYTKGRHETHA 286
Query: 288 FLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
D DAV +G DL+CG Y A+++G I E I+ S+ L LG
Sbjct: 287 SAADASADAVI----SGTDLECGGSYWALDE-ALEKGLITETKINESVFRLLRARFELGM 341
Query: 348 FDGSP--QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPH 405
FD + ++ + +C +H A E AR+ +VLL N N LPL+ +IK +A++GP+
Sbjct: 342 FDDDSLVSWSSIPYSVVCCDKHKAKALEMARKSMVLLSNKNNTLPLSK-SIKKVAVMGPN 400
Query: 406 ANATKAMIGNYEGTPCRYTSPMDGFYAY--SKVINYAPGC 443
AN + + NY GTP R + ++G A + Y GC
Sbjct: 401 ANDSVMLWANYNGTPDRSVTILEGIKAKLPEGSVIYEKGC 440
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 94/303 (31%), Positives = 141/303 (46%), Gaps = 60/303 (19%)
Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
A + K+ADA + V G+ S+E E DR ++ LP Q ++ + + K
Sbjct: 594 AVAEKVKDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQKNMMKALKETGK 653
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
PV V+ S + +++ N + +IL YPG+EGG A+ADV+FG YNP GRLP+T+Y
Sbjct: 654 -PVIFVLCSGSTMALSWEDKN--MDAILQAWYPGQEGGTAVADVLFGDYNPAGRLPLTFY 710
Query: 566 EANYVKIPYTSMPLRPVNNF-----PGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKS 620
+S L N+ GRTY++F G +YPFG+GLSYT F Y A
Sbjct: 711 A--------SSDDLPDFENYNMSEGQGRTYRYFKGKPLYPFGHGLSYTGFSYSKA----- 757
Query: 621 VDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMV 680
KL+K + +N +V + ++N G DG EVV V
Sbjct: 758 ---KLNK--KSMSVNDSV----------------------FLSLNLKNTGLRDGDEVVQV 790
Query: 681 YSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTI 739
Y + K + GY+RV + AGQ+ V + A S + + + + G + I
Sbjct: 791 YIRNLQDPEGPSKSLRGYKRVSVKAGQTVPVKIDLPAS-SFEFFNPVTEKMEVRPGKYEI 849
Query: 740 LVG 742
L G
Sbjct: 850 LYG 852
>gi|423279982|ref|ZP_17258895.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
610]
gi|404584318|gb|EKA88983.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
610]
Length = 812
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 237/833 (28%), Positives = 361/833 (43%), Gaps = 161/833 (19%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE---------------- 57
Y + P ER + L+ +MTL EKV QM + LG P+YE
Sbjct: 47 YENPSAPVEERVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEEIRLTARLEKEI 100
Query: 58 ----------------WWSEALH-GV--SFIGRRTNSPPG---THFDSEVP--------- 86
W LH G+ S R +N H +P
Sbjct: 101 SEYHIGALWGFMRADPWTQRTLHTGLNPSLAARASNRLQAFVMEHSRLGIPLFLAEECPH 160
Query: 87 -----GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDP 141
G T FPT I +++N L +++G+ ++ EA A + P +++ RDP
Sbjct: 161 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDP 215
Query: 142 RWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDN 201
RW RV ET GEDPY+ G VRG Q D+ + A KH+A+Y
Sbjct: 216 RWSRVEETYGEDPYLNGVMGAALVRGFQG--------DTLRGRKSVIATLKHFASY---G 264
Query: 202 WEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQT 261
W + + E++++E PF V G +S VM SYN ++G P LL
Sbjct: 265 WTEGGHNGGTAHLGERELEEAIFPPFREAVGAGALS-VMSSYNEIDGNPCTGSRYLLTDI 323
Query: 262 IRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGA 320
+ W F G++VSD +I + E H E AV + + AG+D D G + Y + A
Sbjct: 324 LEDRWLFKGFVVSDLYAIGGLRE-HGVAGSDYEAAV-KAVNAGVDSDLGTNVYAEQLVAA 381
Query: 321 VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIV 380
V++G +A +D ++R + + +G FD + +P+HI LA E ARQ IV
Sbjct: 382 VRKGDVAMETVDKAVRRILSLKFHMGLFDAPFVDDKRPAQLVASPEHIGLAREVARQSIV 441
Query: 381 LLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSK 435
LLKN++ LPL +I+TLA++GP+A+ M+G+Y +G+ + +
Sbjct: 442 LLKNEDKLLPLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDT 500
Query: 436 VINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE------------- 478
+ YA GCA + + + AI++A++AD V+V G D S E
Sbjct: 501 RVFYAKGCA-VRDSSRTGFADAIESARSADVVVMVVGGSSARDFSSEYEETGAAKVSANR 559
Query: 479 ------AEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSI 532
EG DR L L G Q EL+ +V K P+ LV++ + + + +I
Sbjct: 560 VSDMESGEGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQ--EADAI 616
Query: 533 LWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKF 592
L YPG +GG A+ADV+FG YNP GRL ++ V +P+ G ++
Sbjct: 617 LDAWYPGMQGGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTKRKGNRSRY 670
Query: 593 FD--GPVVYPFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCR-DINYTVGTNKPPCA 646
+ G YPFGYGLSYT F Y KV S +S CR D++ T
Sbjct: 671 IEEAGTPRYPFGYGLSYTTFSYTGMKVRVSEES--------NHCRVDVSVT--------- 713
Query: 647 AVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAA 705
V N G +DG EVV +Y + G T +Q+ + RV + A
Sbjct: 714 -------------------VRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFRRVRLKA 754
Query: 706 GQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGVSFPLQLNLNH 758
G++ ++ FT++ KSL + + G T++ G ++ + +N
Sbjct: 755 GETWEITFTLDK-KSLALYMRDGEWAVEPGRFTVMAGGSSEDIACQQEFEINR 806
>gi|294675359|ref|YP_003575975.1| 1,4-beta-xylosidase [Prevotella ruminicola 23]
gi|225016052|gb|ACN78955.1| xylosidase/arabinofuranosidase [Prevotella ruminicola]
gi|294472720|gb|ADE82109.1| putative 1,4-beta-xylosidase [Prevotella ruminicola 23]
Length = 861
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 157/445 (35%), Positives = 225/445 (50%), Gaps = 41/445 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY + L ERA DL R+TL EK M D + +PRLG+ + WWSEALHG + +G
Sbjct: 22 LPYQNPNLSAKERAVDLCSRLTLEEKAMLMLDESPAIPRLGIKKFFWWSEALHGAANMGN 81
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------- 122
TN FP + ASFN L K+ STE RA YN
Sbjct: 82 VTN----------------FPEPVGMAASFNPHLLFKVFDIASTEFRAQYNHRMYDLNGE 125
Query: 123 -LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSD 181
+ L+ W+PN+N+ RDPRWGR ET GEDPY+ + V+GLQ E D
Sbjct: 126 DMKMRSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVQVVKGLQGPE--------D 177
Query: 182 SRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMC 241
+R K+ AC KHYA + + + D V+ +D ET++ F+ V + V VMC
Sbjct: 178 ARYRKLWACAKHYAVHSGPEYTRHTANLTD--VSARDFWETYMPAFKTLVKDAKVREVMC 235
Query: 242 SYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVL 301
+Y R++ P C +LL Q +R +W F +VSDC ++ E+HK +D VL
Sbjct: 236 AYQRLDDDPCCGSTRLLQQILRDEWGFEYLVVSDCGAVSDFYENHKSSSDAVHGTSKAVL 295
Query: 302 KAGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLG 358
AG D++CG Y ++ AV++G ++E ++D + L LG D ++ +
Sbjct: 296 -AGTDVECGFNYAYKSLPEAVRKGLLSEKEVDKHVIRLLEGRFDLGEMDDPSLVEWSKIP 354
Query: 359 KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
+ + +A + ARQ IVLL+N N LPL N + +A++GP+A+ M GNY G
Sbjct: 355 YSAMSTKASANVALDMARQTIVLLQNKNNILPLKK-NAEKIAIIGPNAHNEPMMWGNYNG 413
Query: 419 TPCRYTSPMDGFYAYSKVINYAPGC 443
TP + +DG A K + Y PGC
Sbjct: 414 TPNHTVTILDGVKAKQKKLVYIPGC 438
Score = 109 bits (272), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 132/309 (42%), Gaps = 56/309 (18%)
Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQT 494
D+ + N I K + + G+ S+E E G DR + LP Q
Sbjct: 583 DVARELNIDYQETIAQLKGINKVIFCGGIAPSLEGEEMPVNIEGFKGGDRTSIELPKVQR 642
Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
E + + A K ++ ++ I +I+ YPG+EGG A+ADV+FG Y
Sbjct: 643 EFLKALKAAGK---QVIYVNCSGSAIALQPETESCDAIVQAWYPGQEGGTAVADVLFGDY 699
Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
NPGG+L +T+Y+ + Y ++ GRTY++FD ++PFGYGLSYT F+
Sbjct: 700 NPGGKLSVTFYKNDQQLPDYEDYSMK------GRTYRYFDD-ALFPFGYGLSYTTFEVGE 752
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
A + D L + QI V N G +G
Sbjct: 753 AKVEAATDGAL----------------------------------YNVQIPVTNTGTKNG 778
Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLAS 734
SE + +Y + +K + G+ER+ I AG++A + +SL+ D N++
Sbjct: 779 SETIQLYIRNLQDPDGPLKSLRGFERLDIKAGKTATANLKLTK-ESLEFWDAETNTMRTK 837
Query: 735 -GAHTILVG 742
G + IL G
Sbjct: 838 PGKYEILYG 846
>gi|150009689|ref|YP_001304432.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
8503]
gi|149938113|gb|ABR44810.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
Length = 732
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 220/776 (28%), Positives = 362/776 (46%), Gaps = 142/776 (18%)
Query: 18 KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
K+ +R + L+++MTL EKV + G+ + GV RLG+P EW S+ HGV + I R
Sbjct: 28 KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
+ G DS A+ FPT A++N L + G+ + EAR
Sbjct: 86 HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
P +N++R P GR E EDPY+ A+ Y++GLQ RD ++
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+A ++N E N R D +E+ ++E ++ F+ V EG +VM +YN+ G
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L+ + +R +W F G V+D + + V S ++AGLDL+ G
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283
Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
YY N + AV+ GK+ + +D + + V+++ D P+ K G ++
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
+H + +AA + IVLLKN N LPL+ +IK+LA++G H+N + + Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400
Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
E TP ++ +D +A Y K+ + G + ++++++ A++
Sbjct: 401 EVTPLGALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A+ +D ++V GL+ + E DR+++ +P Q ELI +V A P T+V+M AG+ +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
N A + +I+W + G EGG A+ DV+ GK NP G++P T + P
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571
Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ NFPGR Y++FD PVVYPFGYGLSYT F Y L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFDYS----------NL 621
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+ D++ D T+ + TF + N G +G+EV +Y P
Sbjct: 622 NTDKETYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659
Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
+ +K++ G+++VF+ G+S ++ + SL A + + IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714
>gi|347536214|ref|YP_004843639.1| glycoside hydrolase family protein [Flavobacterium branchiophilum
FL-15]
gi|345529372|emb|CCB69402.1| Glycoside hydrolase precursor, family 3 [Flavobacterium
branchiophilum FL-15]
Length = 740
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 200/673 (29%), Positives = 327/673 (48%), Gaps = 78/673 (11%)
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
T+FP + AS++ +K + +TEA ++G+ + ++P +++ RDPRWGRV+
Sbjct: 111 TTFPIPLAEAASWDVEAIEKSARVAATEA------ASSGIHWTFAPMVDISRDPRWGRVM 164
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GED Y+ + A V+G Q G + H + AC KH+AAY G D
Sbjct: 165 EGAGEDTYLGSKIAFARVKGFQANLG-DVH--------SVMACVKHFAAYGA-AVGGRDY 214
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D ++E+ + ET++ PF+ ++ G ++ M ++N +NGIP A+ + ++G W
Sbjct: 215 NSVD--ISERMLWETYLPPFKAALDAG-AATFMNAFNDINGIPATANKHIQRDILKGKWQ 271
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKI 326
F G++VSD SI +V +H + D K+ A + L AG D+D Y V++ K+
Sbjct: 272 FQGFVVSDWGSIGEMV-AHGYAKDYKQ-AAEKALLAGSDMDMESSAYIGHLATLVKENKV 329
Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNN--ICNPQHIELAAEAARQGIVLLKN 384
A ID ++R + M LG F+ ++ N + N + NP+H ++A E A + IVLLKN
Sbjct: 330 PIALIDDAVRRILRKKMELGLFEDPFKFCNPERQNKALNNPEHTKIAREVAAKSIVLLKN 389
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIG----NYEGTPCRY-TSPMDGF---YAYSKV 436
D LPL+ ++KT+A +GP + + G + + Y S +G +
Sbjct: 390 DKQVLPLSK-DLKTIAFIGPMVQSKRDNHGFWAVDLKDVDSTYIVSQWEGLQRKVGKNTK 448
Query: 437 INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTEL 496
+ YA GC D++ N S AI A AD V+ G ++ E K R L LPG Q +L
Sbjct: 449 LLYAKGC-DVLSTNKSGFEEAIAVAHQADVVVVSVGEKHNMSGEAKSRSSLQLPGVQEDL 507
Query: 497 INKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNP 556
I ++ K P+ ++I + + N+ +N + +IL+ + G E G AIADV+FG YNP
Sbjct: 508 IMELQKTGK-PIVVLINAGRPLIFNWTADN--MPTILYTWWLGSEAGNAIADVLFGDYNP 564
Query: 557 GGRLPITWYEAN-YVKIPYTSMPL-RPVNNFPGRTYKF----FDGPVVYPFGYGLSYTQF 610
+LPIT+ + V I Y RP + + YK +PFGYGLSYT F
Sbjct: 565 SAKLPITFPRSEGQVPIYYNHFSTGRPAKSDDDKIYKSAYIDLQNSPKFPFGYGLSYTTF 624
Query: 611 KYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMG 670
+Y D+KL + + TN + Q ++N G
Sbjct: 625 EYS--------DLKLSTQK--------ITTND----------------RIMVQATIKNTG 652
Query: 671 KMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAAN 729
K G+E+V +Y K G + ++ ++++ + AG S + F ++ K L +
Sbjct: 653 KYAGTEIVQLYIKDQFGSVVRPVLELKDFQKITLEAGASKTISFVIDKEK-LSFYNADLQ 711
Query: 730 SLLASGAHTILVG 742
+ G I++G
Sbjct: 712 YVAEPGTFEIMIG 724
>gi|256838635|ref|ZP_05544145.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
gi|256739554|gb|EEU52878.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
Length = 732
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 220/776 (28%), Positives = 362/776 (46%), Gaps = 142/776 (18%)
Query: 18 KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
K+ +R + L+++MTL EKV + G+ + GV RLG+P EW S+ HGV + I R
Sbjct: 28 KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
+ G DS A+ FPT A++N L + G+ + EAR
Sbjct: 86 HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
P +N++R P GR E EDPY+ A+ Y++GLQ RD ++
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+A ++N E N R D +E+ ++E ++ F+ V EG +VM +YN+ G
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L+ + +R +W F G V+D + + V S ++AGLDL+ G
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283
Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
YY N + AV+ GK+ + +D + + V+++ D P+ K G ++
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
+H + +AA + IVLLKN N LPL+ +IK+LA++G H+N + + Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400
Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
E TP ++ +D +A Y K+ + G + ++++++ A++
Sbjct: 401 EVTPLGALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A+ +D ++V GL+ + E DR+++ +P Q ELI +V A P T+V+M AG+ +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
N A + +I+W + G EGG A+ DV+ GK NP G++P T + P
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571
Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ NFPGR Y++FD PVVYPFGYGLSYT F Y L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFNYS----------NL 621
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+ D++ D T+ + TF + N G +G+EV +Y P
Sbjct: 622 NTDKKTYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659
Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
+ +K++ G+++VF+ G+S ++ + SL A + + IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714
>gi|298374050|ref|ZP_06984008.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_19]
gi|298268418|gb|EFI10073.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_19]
Length = 758
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 185/603 (30%), Positives = 299/603 (49%), Gaps = 63/603 (10%)
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
++P +++ RD RWGRV+E GEDPY+ A V G Q G ++ +D + AC
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQG--GNDWRSLADVN--TVLAC 217
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+AAY G D +++ Q+ + +P + E V++ M S+N +NG+P
Sbjct: 218 CKHFAAYGAAE-AGRD---YNTSELSQNTLMNYYMPPYLAAKEAGVATFMASFNEINGVP 273
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-C 309
+ + L+ +R DW F+G++V+D I +V +H + + KE A AG+D+D
Sbjct: 274 STGNKWLMTDLLRKDWGFNGFVVTDYTGINEMV-AHSIVRNDKE-AGELAANAGIDMDMT 331
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQH 367
G Y+ + + +V++GK++E +ID ++ + + LG FD +Y KN I P+
Sbjct: 332 GGIYSQYLVQSVKEGKVSEENIDRAVASILEMKFLLGLFDDPYRYLDNEREKNTIMKPEF 391
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP------HANATKAMIGNYEGTPC 421
++ A E + + IVLLKNDN P++ T+AL+GP + N A G E +
Sbjct: 392 LQEARETSARSIVLLKNDNNFFPISKDKNITVALIGPMVKDKINQNGEWAGRGEREESIS 451
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ + + + YA GC D++ ++S AI A+ AD + G D + E
Sbjct: 452 LFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEA 510
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
R DL LPG Q L+ ++ K P+ L++++ +D+++ + + IL Y G
Sbjct: 511 ACRTDLKLPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTM 567
Query: 542 GGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPV-NNFPGRTYK--FFDGP 596
G +ADVI G YNP RL +++ + + Y P RPV P YK + D P
Sbjct: 568 AGHGMADVISGDYNPSARLTMSFPRTVGQLPLYYNQKPTGRPVPPEAPDTDYKSRYMDVP 627
Query: 597 --VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+YPFGYGLSYT F +KLD++ ++T G
Sbjct: 628 NTPLYPFGYGLSYTTFAVN--------SMKLDQN------SFTKGG-------------- 659
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
K T EVEN GK+DG VV +Y + G +K++ G+E+V + AG+ +V F
Sbjct: 660 ----KITVMAEVENTGKVDGETVVQMYIRDLAGSVTRPVKELKGFEKVTLKAGEKKQVSF 715
Query: 714 TMN 716
T++
Sbjct: 716 TID 718
>gi|427386425|ref|ZP_18882622.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
12058]
gi|425726465|gb|EKU89330.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
12058]
Length = 864
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 151/428 (35%), Positives = 222/428 (51%), Gaps = 38/428 (8%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
+ + + LP ER DLV +TL EK+ QM + A + RLG+P Y WW+E LHGV+
Sbjct: 24 YKFQNPDLPVEERVNDLVGHLTLEEKISQMMNNAPAIERLGIPAYNWWNECLHGVA---- 79
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA----- 126
R+ P TSFP I A+++ ++ + S E RA+Y+
Sbjct: 80 RSPYP-----------VTSFPQAIAMAATWDTKSVYQMAEYASDEGRAIYHDAARKGTPG 128
Query: 127 ---GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
GLT+WSPNIN+ RDPRWGR ET GEDPY+ + +V+GLQ D
Sbjct: 129 IFRGLTYWSPNINIFRDPRWGRGQETYGEDPYLTAAIGVAFVKGLQ---------GDDPV 179
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK SAC KHYA + W +R +++ V+ D+ +T++ F V + V+ VMC+Y
Sbjct: 180 YLKSSACAKHYAVHSGPEW---NRHTYNAEVSNHDLWDTYLPAFRELVVDAKVTGVMCAY 236
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N P C + L+ +R W F GY+ SDC +I+ +H D E + VL
Sbjct: 237 NSFFEQPCCGNDLLMMDILRNQWKFDGYVTSDCGAIEDFYNTHNTHEDAAEASADAVLH- 295
Query: 304 GLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKNN 361
G D +CG+ A+ +G I E +D SL+ L+ + RLG FD + Y ++ +
Sbjct: 296 GTDCECGNGAYRALADAIVRGLITEEQVDVSLKKLFEIRFRLGMFDPDDRVPYSDIPISV 355
Query: 362 ICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPC 421
+ H A + ARQ IVLLKN+ LPL+ IK +A+VGP+A+ ++ NY G P
Sbjct: 356 LECDAHKAHALKMARQSIVLLKNEKQLLPLDMNKIKKIAVVGPNADDKSVLLANYYGYPS 415
Query: 422 RYTSPMDG 429
T+ ++G
Sbjct: 416 CVTTVLEG 423
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 138/297 (46%), Gaps = 58/297 (19%)
Query: 460 AAKNADATVIVAGLDLSVEAEGK----------DRVDLLLPGFQTELINKVADAAKGPVT 509
+ K+AD V V GL VE E DR + +P Q L+ ++ K PV
Sbjct: 595 SVKDADVVVFVGGLSAKVEGEEMKVEIDGFKRGDRTSISIPVVQQNLLKELYATGK-PVI 653
Query: 510 LVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANY 569
++M+ AV + + + + +IL Y G+ GG+AIADV+FG YNP GRLP+T+Y+
Sbjct: 654 FILMTGSAVGLEWESEH--LPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKN-- 709
Query: 570 VKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQ 629
+P + RTY++F G VYPFGYGLSYT F+Y S+D
Sbjct: 710 ----VNDLPDFEDYSMKNRTYRYFTGIPVYPFGYGLSYTDFQYNTIKVQPSLD------- 758
Query: 630 QCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI--EVENMGKMDGSEVVMVYSKPPGI 687
K + ++ EV N+GK +G EVV +Y P
Sbjct: 759 -----------------------------KLSVKVTAEVSNVGKYEGEEVVQLYVSNPRD 789
Query: 688 AGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVGEG 744
T I+ + G+ R+ + G+S V F + + K L +VD A N + G I +G G
Sbjct: 790 FVTPIRALKGFRRINLKPGESQMVEFVLTS-KELSVVDVAGNFVPMKGEVQISLGGG 845
>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 765
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 196/694 (28%), Positives = 322/694 (46%), Gaps = 98/694 (14%)
Query: 87 GATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRV 146
G T FP + +++N L++ + + V+ E R+ G +SP ++VVRDPRWGR
Sbjct: 122 GGTVFPVPLSIGSTWNLDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRT 176
Query: 147 LETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN- 205
E GEDPY++ YA+ V GLQ +S P ++A KH+ Y N
Sbjct: 177 EECFGEDPYLISEYAVASVEGLQG--------ESLDSPSSVAATLKHFVGYGSSEGGRNA 228
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
H +R ++ E +LPF+ V G +S+M +YN ++G+P + +LL+ +R +
Sbjct: 229 GPVHMGTR----ELMEVDMLPFKKAVEAG-AASIMPAYNEIDGVPCTVNTELLDGILRKE 283
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQG 324
W F G +++DC +I + H D DA + ++AG+DL+ G+ + AV+
Sbjct: 284 WGFDGMVITDCGAIDMLASGHDTAEDGM-DAAVQAIRAGIDLEMSGEMFGKHLQKAVESN 342
Query: 325 KIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKN 384
K+ + +D ++R + + +LG F+ +N I + QHI LA + A +GIVLLKN
Sbjct: 343 KLEVSVLDEAVRRVLTLKFKLGLFENPYVDPQTAENVIGSGQHIGLARQLAAEGIVLLKN 402
Query: 385 DNGALPLNTGNIKTLALVGPHANATKAMIGNYEG--TPCRYTSPMDGFYAY----SKVIN 438
+ ALPL+ +A++GP+A+ +G+Y P T+ + G A ++ +
Sbjct: 403 EAKALPLSKEG-GVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVL 461
Query: 439 YAPGCADIVCQNNSMIPAAIDAAKNADATVIVAG-----------LDLSVEA-------- 479
YAPGC I + A+ A+ AD V+V G +DL A
Sbjct: 462 YAPGCR-IKDDSREGFEFALSCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDAL 520
Query: 480 ------EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSIL 533
EG DR+ L L G Q +L ++ K ++++ I + +IL
Sbjct: 521 SDMDCGEGIDRMTLQLSGVQLDLAQEIHKLGK---RMIVVYINGRPIAEPWIDEHADAIL 577
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPLRPVNNFPGRTYKF 592
YPG+EGG AIAD++FG NP G+L ++ + + Y R G+ Y
Sbjct: 578 EAWYPGQEGGHAIADILFGDVNPSGKLTMSIPKHVGQLPVYYNGKRSR------GKRYLE 631
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
D YPFGYGLSYT+F Y DI++ + +GT
Sbjct: 632 EDSQPRYPFGYGLSYTEFSYS--------DIQMTPE--------VIGT------------ 663
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKV 711
D + V N G +GSEVV +Y T +++ G++++ + G+ KV
Sbjct: 664 ----DGTAVVSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKISLQPGERRKV 719
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGEGV 745
FT+ + L+ + ++ G +++G V
Sbjct: 720 EFTIGP-EQLQYIGQDYRQVVEPGLFRVMLGRHV 752
>gi|423222970|ref|ZP_17209439.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640546|gb|EIY34345.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 862
Score = 256 bits (653), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 166/463 (35%), Positives = 243/463 (52%), Gaps = 47/463 (10%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY + +L ERAKDLV+R+TL EK M D + +PRLG+ + WWSEALHGV+ G
Sbjct: 21 LPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG- 79
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------- 122
T FP + ASFN+ L +I VS E RA +N
Sbjct: 80 ---------------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLED 124
Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
+ L+ W+PN+N+ RDPRWGR ET GEDPY+ + I V+GLQ E +Y
Sbjct: 125 VRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEKYR----- 179
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KHYA + W + + V+ +D+ ET++ F+ V + DV VMC+
Sbjct: 180 ---KLLACAKHYAVHSGPEWSRHTANL--NNVSPRDLWETYLPAFKALVQKADVREVMCA 234
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
Y R++ P C + +LL Q +R +W F +VSDC +I SHK +D AV +
Sbjct: 235 YQRLDDDPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTM- 293
Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK-- 359
AG D++CG Y + AV +G I E ++D + L LG D P N K
Sbjct: 294 AGTDVECGYGYAYQKLPEAVSRGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIP 352
Query: 360 NNICNPQ-HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
++ N + H +L+ +RQ + LL+N N LPL + +I+ +A++GP+A+ + GNY G
Sbjct: 353 MSVVNCKAHKDLSLNMSRQTMTLLQNKNNVLPL-SKSIRKIAVIGPNADDKPMLWGNYNG 411
Query: 419 TPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAID 459
TP + + +DGF + K I Y GC D+V N+ + + +D
Sbjct: 412 TPNQTITILDGFKSKLKKNQIVYMKGC-DLV--NDQTLESYLD 451
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/264 (29%), Positives = 119/264 (45%), Gaps = 44/264 (16%)
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
+G DR D+ LP Q + + DA K +V ++ + +IL Y G
Sbjct: 626 KGGDRTDIELPAVQRNFLKALKDAGK---QVVFVNCSGSSMALLPETESCDAILQAWYGG 682
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
E GG A+ADV+FG YNP G+LP+T+Y++ Y ++ GRTY++ P ++
Sbjct: 683 ELGGYAVADVLFGDYNPSGKLPVTFYKSTKQLPDYEDYSMK------GRTYRYMSDP-LF 735
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFG+GLSYT F AS K+ Q R D
Sbjct: 736 PFGFGLSYTDFAVGTASCNKT---------QLR-----------------------TDES 763
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACK 719
T + V N GK G+EVV VY + A +K + Y RV +AAG V + + +
Sbjct: 764 LTLTVPVSNTGKRSGTEVVQVYIRKTDDADGPLKSLKAYARVELAAGAKQDVKIELPS-E 822
Query: 720 SLKIVDNAANSL-LASGAHTILVG 742
S + D + N++ +A G + + G
Sbjct: 823 SFECFDPSTNTMRVAPGKYELFYG 846
>gi|224535195|ref|ZP_03675734.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523186|gb|EEF92291.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
DSM 14838]
Length = 733
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 214/760 (28%), Positives = 352/760 (46%), Gaps = 92/760 (12%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYG--------------VP-RLGLPLYEW 58
Y DA P R KDL+ RMTL EKV Q+ +G +P +G +Y
Sbjct: 25 YKDAGQPVETRVKDLLNRMTLHEKVLQLNQYTFGENDNPNNIGTEVKNLPAEIGSLIYLH 84
Query: 59 WSEALHG----VSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVS 114
L + R P FD T +P + SFN L + Q
Sbjct: 85 TDPKLRNRIQRKAMEESRLGIPILFGFDVIHGLRTVYPISLAQACSFNPDL---VTQACG 141
Query: 115 TEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGV 174
A+ L TF SP I+V RDPRWGR+ E GEDPY +N V G+ V+G
Sbjct: 142 MAAKESV-LSGIDWTF-SPMIDVARDPRWGRISECYGEDPY------LNTVFGVASVKGY 193
Query: 175 EYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEG 234
+ + SD P I+AC KHY Y + EG + + + ++ Q + ET++ P+E CV G
Sbjct: 194 QGEKLSD--PYSIAACLKHYVGYGVS--EGGRDYRY-TDISPQALWETYLPPYEACVKAG 248
Query: 235 DVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKE 294
+++M S+N ++G+P ++ +L + ++ W G++VSD ++I+ ++ ++ + ++
Sbjct: 249 -AATLMSSFNDISGVPATSNHYILTEILKNKWRHDGFVVSDWNAIEQLI--YQGVAKNRK 305
Query: 295 DAVARVLKAGLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ 353
+A + AG+++D D Y + V + KI + ID ++ + V RLG FD P
Sbjct: 306 EAAYKAFHAGVEMDMRDNVYYEYLEQLVAEKKIEISQIDDAVARILRVKFRLGLFD-EPY 364
Query: 354 YKNLG-KNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
K L + + I LAA A + +VLLKN+ LPL++ +K +AL+GP +
Sbjct: 365 TKELTEQERYLQKEDIALAARLAEESMVLLKNEKNLLPLSS-TVKRVALIGPMVKDRSDL 423
Query: 413 IGNY------EGTPCRYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADA 466
+G + E Y M + ++Y GCA + + S AA+ A+ +D
Sbjct: 424 LGAWAFKGQAEDVETIYEG-MQKEFGDKVRLDYEQGCA-LDGNDESGFSAALKTAEASDV 481
Query: 467 TVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNN 526
V+ G E R + LP Q +L+ + A K P+ LV+ S +++ +
Sbjct: 482 VVVCLGESKQWSGENASRSTIALPDIQEKLLLHLKQANK-PIVLVLSSGRPLEL--IRLE 538
Query: 527 PKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIP-YTSM--PLRPVN 583
P++++I+ + PG GG +A ++ G+ NP G+L +T + + +IP Y +M RP +
Sbjct: 539 PQVEAIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVT-FPLSTGQIPVYYNMRQSARPFD 597
Query: 584 NFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKP 643
Y+ +YPFGYGLSYT F Y S K +K+ K+Q
Sbjct: 598 AMG--DYQDIPTEPLYPFGYGLSYTTFTY---SDAKLSSLKIKKNQ-------------- 638
Query: 644 PCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVF 702
K T ++ V N GK++G E V+ Y P + + +K++ +E+
Sbjct: 639 ---------------KITAEVTVTNAGKVEGKETVLWYVSDPFCSISRPMKELKFFEKQS 683
Query: 703 IAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILVG 742
+ G+S F ++ + L D L +G + VG
Sbjct: 684 LKVGESRVFRFEIDPMRDLSYTDATGKRFLEAGEFIVSVG 723
>gi|423226659|ref|ZP_17213124.1| hypothetical protein HMPREF1062_05310 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392628186|gb|EIY22220.1| hypothetical protein HMPREF1062_05310 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 750
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 205/742 (27%), Positives = 346/742 (46%), Gaps = 109/742 (14%)
Query: 29 VERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGA 88
V +T P ++ +A RLG+PL + +HG I
Sbjct: 75 VMSITDPNIFNEVQRIAVEDSRLGIPLINA-RDVIHGFKTI------------------- 114
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
FP + ASFN + + + +TEA A AG+ + ++P I++ DPRWGR+
Sbjct: 115 --FPIPLGQAASFNPEIAETGARIAATEASA------AGIRWTFAPMIDITHDPRWGRIA 166
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GEDP +V + + ++G Q S + P I+AC KH+A Y EG R
Sbjct: 167 EGFGEDPLLVSQMGVAAIKGFQG--------SSLNHPTSIAACAKHFAGYGAS--EGG-R 215
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
+ + +TE+ + ++ PFE VN G +++M ++N +GIP+ A+P LL +R +WN
Sbjct: 216 DYNSTYITERQFRNLYLRPFEAAVNAG-AATLMTAFNDNDGIPSSANPFLLKDVLRNEWN 274
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
+ G +VSD S+ ++ H F D KE A+ + AG D++ + Y +++GK+
Sbjct: 275 YRGTVVSDWASVSEMIR-HGFCEDEKEAAL-KATNAGTDIEMVSETYIKHLPQLIKEGKV 332
Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
+ ID ++R + + RLG F+ P + K P +E A AA Q VLLKN+
Sbjct: 333 SMETIDNAVRNILRLKFRLGLFE-HPYIADQRKETFYRPDFLEAAQTAAEQSAVLLKNER 391
Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGN--YEGTPCRYTSPMDGFYAYS----KVINYA 440
G LP+ + NIKT+ + GP A+A +G ++G +P+ S KV+ YA
Sbjct: 392 GTLPIQS-NIKTILVTGPLADAPHEQLGTWVFDGDASYSQTPLQALRRISGDSIKVL-YA 449
Query: 441 PGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKV 500
PG S ++ A+ AD + G + + E +L L G Q+ L++++
Sbjct: 450 PGLNYSRDTATSQFNKVVELAREADLILAFVGEEAILSGEAHCLANLNLQGAQSRLLHRL 509
Query: 501 ADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRL 560
++ K P+ V+M+ + I N ++L+ +PG GG A+A+++FGK P G+L
Sbjct: 510 SETGK-PLVTVVMAGRPLTIGREVNIS--DALLYAFHPGTMGGPALANLLFGKVVPSGKL 566
Query: 561 PITW-YEANYVKIPYT----------------SMPLRPVNNFPGRTYKFFDGPV--VYPF 601
P+T+ E + I Y ++P+ G T + D ++PF
Sbjct: 567 PVTFPKETGQIPIYYNHTSTGRPASGSEKNIFTIPVGAEQTSLGNTSFYLDAGKDPLFPF 626
Query: 602 GYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFT 661
GYGLSYT F Y +++L Q R+ V+I T
Sbjct: 627 GYGLSYTTFAYS--------NLQLSSTQYTRN-------------EVII---------IT 656
Query: 662 FQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKS 720
F ++ N GK DG+E+ +Y + + T +K++ +ER+ + AG++ + + K
Sbjct: 657 F--DLTNTGKTDGTEIAQLYFRDLAASVTRPVKELAAFERIHLKAGETRHIRMEL-PVKQ 713
Query: 721 LKIVDNAANSLLASGAHTILVG 742
L + A + + G + +G
Sbjct: 714 LSFWNYAMDYCVEPGKFDLWIG 735
>gi|256838673|ref|ZP_05544183.1| glycoside hydrolase, family 3 [Parabacteroides sp. D13]
gi|256739592|gb|EEU52916.1| glycoside hydrolase, family 3 [Parabacteroides sp. D13]
Length = 758
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 185/603 (30%), Positives = 298/603 (49%), Gaps = 63/603 (10%)
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
++P +++ RD RWGRV+E GEDPY+ A V G Q G ++ +D + AC
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQG--GNDWRSLADVN--TVLAC 217
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+AAY G D +++ Q+ + +P + E V++ M S+N +NG+P
Sbjct: 218 CKHFAAYGAAE-AGRD---YNTSELSQNTLMNYYMPPYLAAKEAGVATFMASFNEINGVP 273
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-C 309
+ + L+ +R DW F G++V+D I +V +H + + KE A AG+D+D
Sbjct: 274 STGNKWLMTDLLRKDWGFKGFVVTDYTGINEMV-AHSIVRNDKE-AGELAANAGIDMDMT 331
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQH 367
G Y+ + + +V++GK++E +ID ++ + + LG FD +Y KN I P+
Sbjct: 332 GGIYSQYLVQSVKEGKVSEENIDRAVASILEMKFLLGLFDDPYRYLDNEREKNTIMKPEF 391
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP------HANATKAMIGNYEGTPC 421
++ A E + + IVLLKNDN P++ T+AL+GP + N A G E +
Sbjct: 392 LQEARETSARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESIS 451
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ + + + YA GC D++ ++S AI A+ AD + G D + E
Sbjct: 452 LFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEA 510
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
R DL LPG Q L+ ++ K P+ L++++ +D+++ + + IL Y G
Sbjct: 511 ACRTDLKLPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTM 567
Query: 542 GGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPV-NNFPGRTYK--FFDGP 596
G +ADVI G YNP RL +++ + + Y P RPV P YK + D P
Sbjct: 568 AGHGMADVISGDYNPSARLTMSFPRTVGQLPLYYNQKPTGRPVPPEAPDTDYKSRYMDVP 627
Query: 597 --VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+YPFGYGLSYT F +KLD++ ++T G
Sbjct: 628 NTPLYPFGYGLSYTTFAVN--------SMKLDQN------SFTKGG-------------- 659
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
K T EVEN GK+DG VV +Y + G +K++ G+E+V + AG+ +V F
Sbjct: 660 ----KITVTAEVENTGKVDGETVVQMYIRDLAGSVTRPVKELKGFEKVALKAGEKKQVSF 715
Query: 714 TMN 716
T++
Sbjct: 716 TID 718
>gi|423333878|ref|ZP_17311659.1| hypothetical protein HMPREF1075_03310 [Parabacteroides distasonis
CL03T12C09]
gi|409226713|gb|EKN19619.1| hypothetical protein HMPREF1075_03310 [Parabacteroides distasonis
CL03T12C09]
Length = 732
Score = 255 bits (652), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 220/776 (28%), Positives = 362/776 (46%), Gaps = 142/776 (18%)
Query: 18 KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
K+ +R + L+++MTL EKV + G+ + GV RLG+P EW S+ HGV + I R
Sbjct: 28 KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
+ G DS A+ FPT A++N L + G+ + EAR
Sbjct: 86 HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
P +N++R P GR E EDPY+ A+ Y++GLQ RD ++
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+A ++N E N R D +E+ ++E ++ F+ V EG +VM +YN+ G
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L+ + +R +W F G V+D + + + S ++AGLDL+ G
Sbjct: 239 AENNYLVCKILRNEWGFDGVYVTDWGAAHSTIPS---------------MEAGLDLEMGT 283
Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
YY N + AV+ GKI + +D + + V+++ D P+ K G ++
Sbjct: 284 LIDKYEDWYYANPLIEAVKSGKIPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
+H + +AA + IVLLKN N LPL+ +IK+LA++G H+N + + Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400
Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
E TP ++ +D +A Y K+ + G + ++++++ A++
Sbjct: 401 EVTPLEALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A+ +D ++V GL+ + E DR+++ +P Q ELI +V A P T+V+M AG+ +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
N A + +I+W + G EGG A+ DV+ GK NP G++P T + P
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNALVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571
Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ NFPGR Y++FD PVVYPFGYGLSYT F Y L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFNYS----------NL 621
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+ D++ D T+ + TF + N G +G+EV +Y P
Sbjct: 622 NTDKKTYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659
Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
+ +K++ G+++VF+ G+S ++ + SL A + + IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714
>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 255 bits (651), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 158/430 (36%), Positives = 224/430 (52%), Gaps = 45/430 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y DA L RA+ LV+ +TL EK M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I ASFN + ++ VS EARA +
Sbjct: 81 --------------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERY 126
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ R + V+GLQ +D +
Sbjct: 127 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG--------PADGKYD 178
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ AC KH+A + W +R F++ + +D+ ET++ PFE V EG V VMC+YN
Sbjct: 179 KLHACAKHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYN 235
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R G P C +LL Q +RG+W F G +VSDC +I H D + + A V+
Sbjct: 236 RFEGDPCCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVI- 294
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--- 359
+G DL+CG Y + +V++G I+E +DTS++ L LG D P+ + K
Sbjct: 295 SGTDLECGSSYKAL-IESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPF 352
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ + + H LA AR+ + LL N + LPL G + T+A++GP+AN + GNY G
Sbjct: 353 SVVASAAHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGM 411
Query: 420 PCRYTSPMDG 429
P + +DG
Sbjct: 412 PAHTVTILDG 421
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 95/309 (30%), Positives = 152/309 (49%), Gaps = 53/309 (17%)
Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQT 494
D+ + + I +++ K+AD + +G+ S+E E DR D+ LP Q
Sbjct: 581 DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 640
Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
ELI+ + A K +++++ I K ++IL YPG++GG+A+A+V+FG Y
Sbjct: 641 ELIDALHRAGK---KIILVNCSGSPIGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDY 697
Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
NP G+LP+T+Y + +P N GRTY++ ++PFGYGLSYT F Y
Sbjct: 698 NPAGKLPVTFYRN------VSQLPDFEDYNMTGRTYRYMQDVPLFPFGYGLSYTTFGYG- 750
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
K+V LDK++ T G + + V N GK +G
Sbjct: 751 ----KTV---LDKNE------LTAGQS------------------LKLTVPVTNTGKRNG 779
Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LA 733
EVV VY + G A IK + ++RV I AG++ V F + K L+ D+ +N++ +
Sbjct: 780 EEVVQVYLRKQGDAEGPIKTLRAFKRVSIPAGKTVNVEFDLKD-KELEWWDDQSNTVRVC 838
Query: 734 SGAHTILVG 742
G + I+VG
Sbjct: 839 PGNYDIMVG 847
>gi|410096731|ref|ZP_11291716.1| hypothetical protein HMPREF1076_00894 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225348|gb|EKN18267.1| hypothetical protein HMPREF1076_00894 [Parabacteroides goldsteinii
CL02T12C30]
Length = 746
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 202/712 (28%), Positives = 330/712 (46%), Gaps = 109/712 (15%)
Query: 32 MTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSF 91
+T PE V + +A RLG+PL + +HG I F
Sbjct: 76 LTDPELVNKAQRIAVEESRLGIPLL-MSRDVIHGYKTI---------------------F 113
Query: 92 PTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVLETP 150
P + A+FN L + + + EA A G+ + ++P I++ RDPRWGR+ E+
Sbjct: 114 PIPLGQAATFNPQLVEDGARVAAVEASA------DGIRWTFAPMIDISRDPRWGRIAESC 167
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+ + V+G Q DS + P ++AC KH+ Y EG R +
Sbjct: 168 GEDPYLSSVMGVAMVKGFQG--------DSLNNPTAVAACAKHFVGYGAS--EGG-RDYN 216
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
+ + E+ ++ + PFE G ++ M S+N +GIP+ + +L +RG+WN+ G
Sbjct: 217 STFIPERQLRNVYFPPFEAAAKAG-CATFMTSFNDNDGIPSTGNSFILKDVLRGEWNYDG 275
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQGKIAE 328
+V+D S ++ SH F D KE A+ V AG++++ G + N V++ K++E
Sbjct: 276 LVVTDWASSAEMI-SHGFCKDEKEAAMKSV-NAGINMEMVSGTFIRNLEE-LVKEKKVSE 332
Query: 329 ADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGA 388
A ID ++R + + RLG FD Y + + P H+ A EAA Q ++LLKND
Sbjct: 333 AAIDEAVRNILRLKFRLGLFDNP--YTDTDQQVKYAPTHLAKAKEAAEQSVILLKNDRET 390
Query: 389 LPLNTGNIKTLALVGPHANATKAMIGN--YEGTPCRYTSPMDGF---YAYSKVINYAPGC 443
LP T I+TLA++GP A+A +G ++G + + Y I Y PG
Sbjct: 391 LPF-TDKIRTLAVIGPLADAAHDQMGTWVFDGEKAHTQTVLTALKEMYGDKVRIIYEPGL 449
Query: 444 ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADA 503
++ + I A++AA +ADA ++ AG + + E DL L G Q+ELI +A
Sbjct: 450 GYSRDKHTAGIAKAVNAAMHADAVLVCAGEESILSGEAHSLADLHLQGAQSELIAALAKT 509
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K P+ V+M+ + I + + ++L+ +PG GG A+AD++FGK P G+ P+T
Sbjct: 510 GK-PLVTVVMAGRPLTI--GQEVEQSDAVLYAFHPGTMGGPALADLLFGKAVPSGKTPVT 566
Query: 564 ----------WYEANYVKIPYT-------SMPLRPVNNFPGRTYKFFDGPV--VYPFGYG 604
+Y N P + +P G T + D ++PFGYG
Sbjct: 567 FPKMVGQIPVYYAHNNTGRPASRQETLIDDIPQEAGQTSLGCTSFYMDAGFDPLFPFGYG 626
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
LSYT F Y N + TN+ D
Sbjct: 627 LSYTTFGYD---------------------NLQLATNQLAV-----------DGTLEISF 654
Query: 665 EVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTM 715
++ N GK +G+E+V +Y + + T +K++ G+ R+ + G++ V F++
Sbjct: 655 DLTNTGKYEGTEIVQLYIQDKAGSITRPVKELKGFRRIPLKQGETKTVSFSL 706
>gi|363583088|ref|ZP_09315898.1| b-glucosidase [Flavobacteriaceae bacterium HQM9]
Length = 779
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 191/679 (28%), Positives = 317/679 (46%), Gaps = 84/679 (12%)
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTF-WSPNINVVRDPRWGRVL 147
T FP + AS++ K + + EA + G+ + ++P +++ +D RWGR+
Sbjct: 146 TIFPIPLGLAASWDAETAKAAARVSAIEASSY------GIRWTFAPMLDITQDSRWGRIA 199
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E+PGEDPY+ A YV G QD + S+ ++AC KH+ Y R
Sbjct: 200 ESPGEDPYLASVLAKAYVEGFQD--------NDLSKSTSLAACAKHFIGYGA---AIGGR 248
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
+ + + E ++ T++ PFE ++ G ++VM S+N +NG+P + LLN+ +R +
Sbjct: 249 DYNTAIIHEPLLRNTYLKPFEAAIDAG-AATVMTSFNELNGVPASGNKWLLNEVLRKELG 307
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-CGDYYTNFTMGAVQQGKI 326
FHG++VSD +SI ++ +H + + K A A + AGLD++ Y N+ +++ KI
Sbjct: 308 FHGFVVSDWNSITEMI-AHSYAENEKH-AAALGINAGLDMEMTSKSYENYIKQLLKEKKI 365
Query: 327 AEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDN 386
E +D + + V RL F+ + K N + +H++LA AA + VLLKN+
Sbjct: 366 TETQLDFLVSNILRVKFRLNLFEKPYRLKK-HTGNFYSQEHMDLAKNAAIRSSVLLKNNQ 424
Query: 387 GALPLNTGNIKTLALVGPHANATKAMIG--NYEGTPCRYTSPMDGFYAYSKVINYAPGCA 444
G LPLN + +A++GP ANA +G ++G +P+ F N+A
Sbjct: 425 GLLPLN--KLTKVAVIGPLANAPHEQLGTWTFDGDQAYSVTPLQAFKNNKVNFNFAETLT 482
Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAA 504
Q+ A+ A+++D + G + + E R + LPG Q LI +A
Sbjct: 483 YSRDQSTKAFDKALRTAQSSDVILFFGGEEAILSGEAHSRAHINLPGQQEALIKALAKTG 542
Query: 505 KGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITW 564
K P+ VIM+ I K ++ +IL +PG GG AI ++++GK PGGRLPITW
Sbjct: 543 K-PIVFVIMAGRP--ITLTKVIDQVDAILMTWHPGTMGGEAIYEMLWGKNEPGGRLPITW 599
Query: 565 YEAN----------------YVK--IPYTSMPLRPVNNFPGRTYKFFDGPVV--YPFGYG 604
+ + +K + S+P+ + G T + D +PFGYG
Sbjct: 600 PKTSGQLPLFYNHKNTGRPPSIKSFVQMDSIPVGAWQSSLGNTSHYLDVGFTPQFPFGYG 659
Query: 605 LSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQI 664
L YT FKY D+K+ ++ + V +
Sbjct: 660 LGYTTFKYS--------DVKISTTSITKNESLEVS------------------------V 687
Query: 665 EVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKI 723
+ N G G+E+V +Y + G +K++ G++ + + G S V FT+NA L
Sbjct: 688 TLTNTGDRAGAELVQLYVQDVVGSLTRPVKELKGFKHIHLDKGASTIVKFTLNA-NDLMF 746
Query: 724 VDNAANSLLASGAHTILVG 742
V+N +L G I VG
Sbjct: 747 VNNTLKPVLEKGEFNIFVG 765
>gi|255013061|ref|ZP_05285187.1| beta-glucosidase [Bacteroides sp. 2_1_7]
gi|410102523|ref|ZP_11297449.1| hypothetical protein HMPREF0999_01221 [Parabacteroides sp. D25]
gi|409238595|gb|EKN31386.1| hypothetical protein HMPREF0999_01221 [Parabacteroides sp. D25]
Length = 758
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 184/603 (30%), Positives = 299/603 (49%), Gaps = 63/603 (10%)
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
++P +++ RD RWGRV+E GEDPY+ A V G Q G ++ +D + AC
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQG--GNDWRSLADVN--TVLAC 217
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+AAY G D +++ Q+ + +P + E V++ M S+N +NG+P
Sbjct: 218 CKHFAAYGAAE-AGRD---YNTSELSQNTLMNYYMPPYLAAKEAGVATFMASFNEINGVP 273
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-C 309
+ + L+ +R DW F+G++V+D I +V +H + + KE A AG+D+D
Sbjct: 274 STGNKWLMTDLLREDWGFNGFVVTDYTGINEMV-AHSIVRNDKE-AGELAANAGIDMDMT 331
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQH 367
G Y+ + + +V++GK++E +ID ++ + + LG FD +Y KN I P+
Sbjct: 332 GGIYSQYLVQSVKEGKVSEENIDRAVASILEMKFLLGLFDDPYRYLDNEREKNTIMKPEF 391
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP------HANATKAMIGNYEGTPC 421
++ A E + + IVLLKNDN P++ T+AL+GP + N A G E +
Sbjct: 392 LQEARETSARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESIS 451
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ + + + YA GC D++ ++S AI A+ AD + G D + E
Sbjct: 452 LFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEA 510
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
R DL LPG Q L+ ++ K P+ L++++ +D+++ + + IL Y G
Sbjct: 511 ACRTDLKLPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTM 567
Query: 542 GGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPV-NNFPGRTYK--FFDGP 596
G +ADVI G YNP RL +++ + + Y P RPV P YK + D P
Sbjct: 568 AGHGMADVISGDYNPSARLTMSFPRTVGQLPLYYNQKPTGRPVPPEAPDTDYKSRYMDVP 627
Query: 597 --VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+YPFGYGLSYT F +KLD++ ++T G
Sbjct: 628 NTPLYPFGYGLSYTTFAVN--------SMKLDQN------SFTKGG-------------- 659
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
K T EVEN GK+DG V+ +Y + G +K++ G+E+V + AG+ +V F
Sbjct: 660 ----KITVTAEVENTGKVDGETVIQMYIRDLAGSVTRPVKELKGFEKVTLKAGEKKQVSF 715
Query: 714 TMN 716
T++
Sbjct: 716 TID 718
>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 158/430 (36%), Positives = 224/430 (52%), Gaps = 45/430 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
Y DA L RA+ LV+ +TL EK M D + V RLG+ Y WW+EALHGV+ G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNA------- 126
AT FP I ASFN + ++ VS EARA +
Sbjct: 81 --------------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERY 126
Query: 127 -GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPL 185
GLT W+P +N+ RDPRWGR +ET GEDPY+ R + V+GLQ +D +
Sbjct: 127 QGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG--------PADGKYD 178
Query: 186 KISACCKHYAAYDLDNWEGNDRFHFDSR-VTEQDMQETFILPFEMCVNEGDVSSVMCSYN 244
K+ AC KH+A + W +R F++ + +D+ ET++ PFE V EG V VMC+YN
Sbjct: 179 KLHACAKHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYN 235
Query: 245 RVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIV--ESHKFLNDTKEDAVARVLK 302
R G P C +LL Q +RG+W F G +VSDC +I H D + + A V+
Sbjct: 236 RFEGDPCCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVI- 294
Query: 303 AGLDLDCGDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK--- 359
+G DL+CG Y + +V++G I+E +DTS++ L LG D P+ + K
Sbjct: 295 SGTDLECGSSYKAL-IESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPF 352
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ + + H LA AR+ + LL N + LPL G + T+A++GP+AN + GNY G
Sbjct: 353 SVVASAAHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGM 411
Query: 420 PCRYTSPMDG 429
P + +DG
Sbjct: 412 PAHTVTILDG 421
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 95/309 (30%), Positives = 152/309 (49%), Gaps = 53/309 (17%)
Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQT 494
D+ + + I +++ K+AD + +G+ S+E E DR D+ LP Q
Sbjct: 581 DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 640
Query: 495 ELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKY 554
ELI+ + A K +++++ I K ++IL YPG++GG+A+A+V+FG Y
Sbjct: 641 ELIDALHRAGK---KIILVNCSGSPIGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDY 697
Query: 555 NPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKV 614
NP G+LP+T+Y + +P N GRTY++ ++PFGYGLSYT F Y
Sbjct: 698 NPAGKLPVTFYRN------VSQLPDFEDYNMTGRTYRYMQDVPLFPFGYGLSYTTFGYG- 750
Query: 615 ASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDG 674
K+V LDK++ T G + + V N GK +G
Sbjct: 751 ----KTV---LDKNE------LTAGQS------------------LKLTVPVTNTGKRNG 779
Query: 675 SEVVMVYSKPPGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LA 733
EVV VY + G A IK + ++RV I AG++ V F + K L+ D+ +N++ +
Sbjct: 780 EEVVQVYLRKQGDAEGPIKTLRAFKRVSIPAGKTVNVEFDLKD-KELEWWDDQSNTVRVC 838
Query: 734 SGAHTILVG 742
G + I+VG
Sbjct: 839 PGNYDIMVG 847
>gi|427384989|ref|ZP_18881494.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
12058]
gi|425728250|gb|EKU91109.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
12058]
Length = 862
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 163/462 (35%), Positives = 239/462 (51%), Gaps = 45/462 (9%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY + +L ERAKDLV R+TL EK M D + +PRLG+ + WWSEALHGV+ G
Sbjct: 21 LPYQNPELSPAERAKDLVSRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG- 79
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNA--- 126
T FP + ASFNE L +I S E RA +N + N
Sbjct: 80 ---------------NVTVFPEPVGMAASFNEKLVFEIFNATSDEMRAKHNERVRNGLED 124
Query: 127 ----GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
L+ W+PN+N+ RDPRWGR ET GEDPY+ + I V+GLQ E +Y
Sbjct: 125 TRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPEDEKYR----- 179
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KHYA + W + + ++ +D+ ET++ F+ V + DV VMC+
Sbjct: 180 ---KLLACAKHYAVHSGPEWSRHSANL--NNISPRDLWETYLPAFKALVQKADVREVMCA 234
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
Y R++ P C +LL Q +R +W F +VSDC +I SHK +D AV +
Sbjct: 235 YQRLDDDPCCGSTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTM- 293
Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
AG D++CG Y + AV +G I E +ID + L LG D ++ +
Sbjct: 294 AGTDVECGYGYAYQKLPEAVSRGLITEEEIDKHVLRLLEGRFELGEMDDPSLVKWSQIPM 353
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ + + H +L+ +RQ + LL+N N LPL + +I+ +A++GP+A+ + GNY GT
Sbjct: 354 SVVNSKAHKDLSLNMSRQTMTLLQNKNNVLPL-SKSIRKIAVIGPNADDKPMLWGNYNGT 412
Query: 420 PCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAID 459
P + + +DGF K I Y GC D+V N+ + + +D
Sbjct: 413 PNQTITILDGFKTKLKKNQIIYMKGC-DLV--NDKTLESYLD 451
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 133/297 (44%), Gaps = 54/297 (18%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
+I KN D V V G+ +E E DR D+ LP Q + + +A+K
Sbjct: 593 SISKLKNVDMVVFVGGISPQLEGEEMPLNLPGFKNGDRTDIELPAVQRNFLKALKEASK- 651
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ + +IL Y GE GG+A+ADV+FG YNP G+LP+T+Y+
Sbjct: 652 --QVVFVNCSGSSMALLPETESCDAILQAWYGGELGGQAVADVLFGDYNPSGKLPVTFYK 709
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ Y ++ GRTY++ P+ +PFG+GLSYT F
Sbjct: 710 STKQLPDYEDYSMK------GRTYRYMSDPL-FPFGFGLSYTDF---------------- 746
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
TVGT + C+ + + T + + N GK G+EV+ VY +
Sbjct: 747 ----------TVGTAQ--CSKTQLR----TEEALTLTVPISNTGKRSGTEVIQVYIRKTD 790
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
G +K + Y R +AAG + + + A +S + D + N++ +A G + + G
Sbjct: 791 DTGGPLKSLKAYARAELAAGATQDIEIQLPA-ESFECFDPSTNTMRVAPGEYELFYG 846
>gi|224537265|ref|ZP_03677804.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521119|gb|EEF90224.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
DSM 14838]
Length = 885
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 166/463 (35%), Positives = 242/463 (52%), Gaps = 47/463 (10%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY + +L ERAKDLV+R+TL EK M D + +PRLG+ + WWSEALHGV+ G
Sbjct: 21 LPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQGN 80
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--------- 122
T FP + ASFN+ L I VS E RA +N
Sbjct: 81 ----------------VTVFPEPVGMAASFNDKLVFDIFNAVSDEMRAKHNERVRNGLED 124
Query: 123 LGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
+ L+ W+PN+N+ RDPRWGR ET GEDPY+ + I V+GLQ E +Y
Sbjct: 125 VRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEKYR----- 179
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KHYA + W + + V+ +D+ ET++ F+ V + DV VMC+
Sbjct: 180 ---KLLACAKHYAVHSGPEWSRHTANL--NNVSPRDLWETYLPAFKALVQKADVREVMCA 234
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
Y R++ P C + +LL Q +R +W F +VSDC +I SHK +D AV +
Sbjct: 235 YQRLDDDPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKSSSDAVHAAVKGTM- 293
Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK-- 359
AG D++CG Y + AV +G I E ++D + L LG D P N K
Sbjct: 294 AGTDVECGYGYAYQKLPEAVSKGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIP 352
Query: 360 NNICNPQ-HIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEG 418
++ N + H +L+ +RQ + LL+N N LPL + +I+ +A++GP+A+ + GNY G
Sbjct: 353 MSVVNCKAHKDLSLNMSRQTMTLLQNKNNVLPL-SKSIRKIAVIGPNADDKPMLWGNYNG 411
Query: 419 TPCRYTSPMDGFYAYSK--VINYAPGCADIVCQNNSMIPAAID 459
TP + + +DGF + K I Y GC D+V N+ + + +D
Sbjct: 412 TPNQTITILDGFKSKLKKNQIVYMKGC-DLV--NDQTLESYLD 451
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 130/297 (43%), Gaps = 54/297 (18%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
+I K D V V G+ +E E G DR D+ LP Q + + DA K
Sbjct: 593 SISKLKGIDVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDAGK- 651
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
+V ++ + +IL Y GE GG A+ADV+FG YNP G+LP+T+Y+
Sbjct: 652 --QVVFVNCSGSSMALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVTFYK 709
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ Y ++ GRTY++ P ++PFG+GLSYT F AS K+ +L
Sbjct: 710 STKQLPDYEDYSMK------GRTYRYMSDP-LFPFGFGLSYTDFAVGTASCNKT---QLH 759
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
D+ T + V N GK G+EVV VY +
Sbjct: 760 TDES-----------------------------LTLTVPVSNTGKRSGTEVVQVYIRKTD 790
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
A +K + Y RV +AAG V + + +S + D + N++ +A G + + G
Sbjct: 791 DADGPLKSLKAYARVELAAGAKQDVKIELPS-ESFECFDPSTNTMRVAPGEYELFYG 846
>gi|374596264|ref|ZP_09669268.1| glycoside hydrolase family 3 domain protein [Gillisia limnaea DSM
15749]
gi|373870903|gb|EHQ02901.1| glycoside hydrolase family 3 domain protein [Gillisia limnaea DSM
15749]
Length = 758
Score = 255 bits (651), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 196/658 (29%), Positives = 318/658 (48%), Gaps = 91/658 (13%)
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
T FP + TAS++ ++ + + E+ A TF SP I++ RD RWGR++E
Sbjct: 122 TIFPVPLGETASWDLEAMEESARIAALESAAH----GVNWTF-SPMIDISRDARWGRIME 176
Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRF 208
GEDPY+ + A+ ++G Y + + I+A KH+A Y R
Sbjct: 177 GSGEDPYLTSKVAVAKIKG--------YQGNDLADANTIAATAKHFAGYGFGE---AGRD 225
Query: 209 HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF 268
+ + E ++ T + PF+ G V++ M ++N ++G P L ++GDWN+
Sbjct: 226 YNTVHIGENELHNTILPPFKAAAEAG-VATFMNAFNDIDGTPATGHKILQRDILKGDWNW 284
Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIA 327
+G+IVSD SI ++ H F D K+ A +KAG D+D G Y N V+ G+I
Sbjct: 285 NGFIVSDWASIPEMI-YHGFARD-KKHAAEIAVKAGSDMDMEGGAYENHLEDLVKSGEID 342
Query: 328 EADIDTSLRFLYIVLMRLGYFDGSPQYKNLGK-NNICNPQHIELAAEAARQGIVLLKNDN 386
E +D S+R + V +LG FD +Y N NI +H++ A + A + IVLLKN+
Sbjct: 343 EELLDDSVRRILRVKFKLGLFDDPYKYSNPEMLKNISFEEHLKTARDIASKSIVLLKNEG 402
Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGF---------YAYSK 435
LPL ++K +A++GP A+ + IGN+ +G S ++G Y+K
Sbjct: 403 ELLPLKP-SVKNIAVIGPLADDKNSPIGNWRAQGEENSAVSVLEGIKNAVGNNVRVTYAK 461
Query: 436 VINYAPGCADIVC------QNNSMIPAAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLL 489
++ G + + + S AI+ AKNA+ ++V G D EG+ +V++ L
Sbjct: 462 GADHGTGVKNFLLPLEINETDKSGFAEAIEVAKNAEVVLMVLGEDAFQTGEGRSQVEIGL 521
Query: 490 PGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADV 549
G Q EL+ +V K + LV+++ ++I++A N I +I+ + G E G AIADV
Sbjct: 522 MGVQQELLEEVYKVNKN-IVLVLINGRPLEISWAAEN--IPAIVEAWHLGSESGNAIADV 578
Query: 550 IFGKYNPGGRLPIT----------WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVY 599
+FGKYNP G+LP++ +Y PY++ + + G Y + +Y
Sbjct: 579 LFGKYNPSGKLPVSFPRNVGQEPLYYNQKNTGRPYSAEHV----TYSG--YTDVEKDALY 632
Query: 600 PFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYK 659
PFGYGLSYT FKY V P+ KL ++
Sbjct: 633 PFGYGLSYTTFKYGV---PQLTSKKL-----------------------------TQEGS 660
Query: 660 FTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMN 716
T + V N GK+ G EVV +Y + + T +K++ +E V +A G++ V F ++
Sbjct: 661 ITVTVPVTNTGKLKGKEVVQLYIRDLVASTTRPVKELKAFEMVELAPGETRDVQFEID 718
>gi|167645796|ref|YP_001683459.1| glycoside hydrolase family 3 [Caulobacter sp. K31]
gi|167348226|gb|ABZ70961.1| glycoside hydrolase family 3 domain protein [Caulobacter sp. K31]
Length = 808
Score = 254 bits (650), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 209/721 (28%), Positives = 326/721 (45%), Gaps = 107/721 (14%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P+ EALHG ++ R ATSFP I ++F+ + +K+
Sbjct: 152 RLGVPML-MHDEALHG--YVAR---------------DATSFPQAIALASTFDTEMTEKV 193
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
+ E RA + N L +P ++V RDPRWGR+ ET GEDP++ + +RG Q
Sbjct: 194 FAVAAREMRARGS--NIAL---APVVDVARDPRWGRIEETYGEDPHLCAEIGLAAIRGFQ 248
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
+ P K+ KH + N +++ E+ ++E F PFE
Sbjct: 249 G-------KTLPLAPDKVFVTLKHMTGHGQPE---NGTNVGPAQIAERTLRENFFPPFER 298
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V E V SVM SYN ++G+P+ A+ LL +R +W + G + SD +I+ ++ HK
Sbjct: 299 AVKELPVRSVMPSYNEIDGVPSHANRWLLTDILRKEWGYKGSVQSDYFAIKELMGRHKLT 358
Query: 290 NDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
+D E AV + AG+D++ G+ Y V+ G+I +A +D ++ + + G
Sbjct: 359 DDLGETAVM-AMNAGVDVELPDGEAYALLPQ-LVKVGRIPQAAVDQAVERVLTMKFEGGL 416
Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
F+ + P I LA EAAR+ +VLLKND G LPLN K LAL+G HA
Sbjct: 417 FENPYADEKTADAKTATPDAIALAREAARKAVVLLKNDKGVLPLNPSKFKRLALLGTHAK 476
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSK----VINYAPGC----ADIVCQ---------- 449
T IG Y TP S +G A +K ++YA A I Q
Sbjct: 477 DTP--IGGYSDTPRHVVSIYEGLQAEAKKSGFTLDYAEAVRITEARIWAQDEVKLVDPAV 534
Query: 450 NNSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADA 503
N +I A++ AK AD V+V G + E DR L L G Q +L + D
Sbjct: 535 NAKLIAEAVEVAKQADVIVMVLGDNEQTSREAWADNHLGDRDSLDLIGQQNDLARAIFDL 594
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K P + +++ + IN + +++ Y G+E G A AD++FG+ NPGG+LP++
Sbjct: 595 GK-PTVVFLLNGRPLSINLLAQ--RADAVIEGWYLGQETGNAAADILFGRANPGGKLPVS 651
Query: 564 -WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVD 622
+ + I Y P R Y D +YPFG+GLSYT F S+P+
Sbjct: 652 IARDVGQLPIYYNRKPT------ARRGYLLGDTSPLYPFGFGLSYTTFDI---SAPRPAK 702
Query: 623 IKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYS 682
++ ++ + +I+V N GK+ G EVV +Y
Sbjct: 703 AEIGANESVK-----------------------------VEIDVINTGKVAGDEVVQLYI 733
Query: 683 KPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTILV 741
+ T + ++ ++RV +A G V F ++ L + + ++ G T+L
Sbjct: 734 HDEAASVTRPVLELKHFKRVTLAPGAKQTVTFEVSPL-DLSLWNLEMKRVVEPGKFTLLS 792
Query: 742 G 742
G
Sbjct: 793 G 793
>gi|424661946|ref|ZP_18098983.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
616]
gi|404578257|gb|EKA82992.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
616]
Length = 814
Score = 254 bits (650), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 235/808 (29%), Positives = 354/808 (43%), Gaps = 161/808 (19%)
Query: 23 ERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYE------------------------- 57
ER + L+ +MTL EKV QM + LG P+YE
Sbjct: 58 ERVEYLLSQMTLEEKVGQM------LTSLGWPMYERVGEEIRLTARLEKEISEYHIGALW 111
Query: 58 -------WWSEALH-GV--SFIGRRTNSPPG---THFDSEVP--------------GATS 90
W LH G+ S R +N H +P G T
Sbjct: 112 GFMRADPWTQRTLHTGLNPSLAARASNRLQAFVMEHSRLGIPLFLAEECPHGHMAIGTTV 171
Query: 91 FPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETP 150
FPT I +++N L +++G+ ++ EA A + P +++ RDPRW RV ET
Sbjct: 172 FPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVEETY 226
Query: 151 GEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHF 210
GEDPY+ G VRG Q D+ + A KH+A+Y W
Sbjct: 227 GEDPYLNGVMGAALVRGFQG--------DTLRGRKSVIATLKHFASY---GWTEGGHNGG 275
Query: 211 DSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHG 270
+ + E++++E PF V G +S VM SYN ++G P LL ++ W F G
Sbjct: 276 TAHLGERELEEAIFPPFREAVGAGALS-VMSSYNEIDGNPCTGSRYLLTDILKDRWLFKG 334
Query: 271 YIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG-DYYTNFTMGAVQQGKIAEA 329
++VSD +I + E H E AV + + AG+D D G + Y + AV++G +A
Sbjct: 335 FVVSDLYAIGGLRE-HGVAGSDYEAAV-KAVNAGVDSDLGTNVYAEQLVAAVRKGDVAME 392
Query: 330 DIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGAL 389
+D ++R + + +G FD + +P+HI LA E ARQ IVLLKN++ L
Sbjct: 393 TVDKAVRRILSLKFHMGLFDAPFVDDKRPAQLVASPEHIGLAREVARQSIVLLKNEDKLL 452
Query: 390 PLNTGNIKTLALVGPHANATKAMIGNY-----EGTPCRYTSPMDGFYAYSKVINYAPGCA 444
PL +I+TLA++GP+A+ M+G+Y +G+ + + + YA GCA
Sbjct: 453 PLKK-DIRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKGCA 511
Query: 445 DIVCQNNSMIPAAIDAAKNADATVIVAG----LDLSVE-------------------AEG 481
+ + + AI+AA++AD V+V G D S E EG
Sbjct: 512 -VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESGEG 570
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
DR L L G Q EL+ +V K P+ LV++ + + + +IL YPG +
Sbjct: 571 YDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQ--EADAILDAWYPGMQ 627
Query: 542 GGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFD--GPVVY 599
GG A+ADV+FG YNP GRL ++ V +P+ G ++ + G Y
Sbjct: 628 GGNAVADVLFGDYNPAGRLTLS------VPRSVGQLPVYYNTKRKGNRSRYIEEAGTPRY 681
Query: 600 PFGYGLSYTQFKY---KVASSPKSVDIKLDKDQQCR-DINYTVGTNKPPCAAVLIDDVKC 655
PFGYGLSYT F Y KV S +S CR D++ T
Sbjct: 682 PFGYGLSYTMFSYTGMKVRVSEES--------NHCRVDVSVT------------------ 715
Query: 656 KDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVGFT 714
V N G +DG EVV +Y + G T +Q+ + RV + AG++ ++ FT
Sbjct: 716 ----------VRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGETREITFT 765
Query: 715 MNACKSLKIVDNAANSLLASGAHTILVG 742
++ KSL + + G T++ G
Sbjct: 766 LDK-KSLALYMRDGEWAVEPGRFTVMAG 792
>gi|300726322|ref|ZP_07059774.1| beta-xylosidase B [Prevotella bryantii B14]
gi|291292284|gb|ADD92014.1| Xyl3A [Prevotella bryantii B14]
gi|299776347|gb|EFI72905.1| beta-xylosidase B [Prevotella bryantii B14]
Length = 885
Score = 254 bits (650), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 156/444 (35%), Positives = 226/444 (50%), Gaps = 39/444 (8%)
Query: 12 FPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGR 71
PY + L ERA DL R+TL EK M D + +PRLG+ + WWSEALHG + +G
Sbjct: 48 LPYQNPNLSAYERAIDLCHRLTLEEKALLMQDESPAIPRLGIKKFFWWSEALHGAANMGN 107
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYN--LGNAG-- 127
TN FP I +SFN +L K + S E RA Y+ + N G
Sbjct: 108 VTN----------------FPEPIAMASSFNPTLLKSVFSAASDEMRAQYHHRMDNGGED 151
Query: 128 -----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDS 182
L+ W+PN+N+ RDPRWGR ET GEDPY+ V GLQ E +Y
Sbjct: 152 EKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGCAVVEGLQGPESSKYR----- 206
Query: 183 RPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCS 242
K+ AC KH+A + E + ++ +D+ ET++ F+ V +G V VMC+
Sbjct: 207 ---KLWACAKHFAVHS--GPESTRHTANLNNISPRDLYETYLPAFQSTVQDGHVREVMCA 261
Query: 243 YNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLK 302
Y R++ P C++ +LL Q +R +W F +VSDC ++ I +SHK +D + L
Sbjct: 262 YQRLDDEPCCSNNRLLQQILREEWGFKYLVVSDCGAVSDIWQSHKTSSDAVHASRQATL- 320
Query: 303 AGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP--QYKNLGK 359
AG D++CG YT + AV++G + E +ID + L LG D S ++ +
Sbjct: 321 AGTDVECGYGYTYAKIPEAVKRGLLTEEEIDKHVIRLLEGRFDLGEMDDSKLVEWSKIPY 380
Query: 360 NNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGT 419
+ + H +LA + ARQ IVLL+N LPL + +A++GP+A+ M GNY GT
Sbjct: 381 SIMSCKAHAQLALDMARQSIVLLQNKGNILPLQLKKNERIAVIGPNADNKPMMWGNYNGT 440
Query: 420 PCRYTSPMDGFYAYSKVINYAPGC 443
P S ++G K + Y P C
Sbjct: 441 PNHTVSILEGIRKQYKNVVYLPAC 464
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 82/298 (27%), Positives = 135/298 (45%), Gaps = 57/298 (19%)
Query: 456 AAIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAK 505
A I K D + V G+ S+E E G DR D+ +P Q + I +A+A K
Sbjct: 618 ANIAQLKGIDKVIFVGGIAPSLEGEEMPVNIPGFKGGDRTDIEMPQVQRDFIKALAEAGK 677
Query: 506 GPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWY 565
+++++ I + ++I+ YPG+EGG A+AD++ GK NP G+LP+T+Y
Sbjct: 678 ---QIILVNCSGSAIALTPEAQRCQAIIQAWYPGQEGGTAVADILMGKVNPMGKLPVTFY 734
Query: 566 EANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
++ +P + RTY++F+ +YPFGYGLSYT F+
Sbjct: 735 KST------QQLPDFEDYSMKNRTYRYFED-ALYPFGYGLSYTSFE-------------- 773
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+GT K L ++ T QI V N GK +G+E+V VY +
Sbjct: 774 ------------IGTAK---LQTLTNN------SITLQIPVTNTGKREGTELVQVYLRRD 812
Query: 686 GIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL-LASGAHTILVG 742
K + + + + AG++ K +N + D + N++ + G +TI G
Sbjct: 813 DDVEGPSKTLRSFAHITLKAGETKKAILKLNR-NQFECWDASTNTMRVIPGKYTIFYG 869
>gi|393786524|ref|ZP_10374660.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
CL02T12C05]
gi|392660153|gb|EIY53770.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
CL02T12C05]
Length = 841
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 227/811 (27%), Positives = 351/811 (43%), Gaps = 149/811 (18%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---GLPLYEW----WS------ 60
+ D P +R KDL+ +MT+ EK Q+ L YG R+ LP W W
Sbjct: 82 FEDPSQPVEKRVKDLLSQMTIEEKSCQLATL-YGFGRVLKDSLPTPAWKEAIWKDGIANI 140
Query: 61 -EALHGVSFIGRRTNS---PPGTH----------------------FDSE-VPG-----A 88
E L+GV +R P H F +E + G A
Sbjct: 141 DEQLNGVGRGAKRVPHLIVPFSNHVKAINETQRWFIEETRLGIPVDFSNEGIHGLNHTKA 200
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGRVL 147
T P I +++N L ++ G+ V EAR + G T ++P ++VVRDPRWGR L
Sbjct: 201 TPLPAPIAIGSTWNTELVREAGEIVGKEARVL------GYTNVYAPILDVVRDPRWGRTL 254
Query: 148 ETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDR 207
E GEDPY++G + V G+Q +GV +A KH+A Y +
Sbjct: 255 ECYGEDPYLIGELGVQMVDGIQS-QGV-------------AATLKHFAVYSSPKGGRDGN 300
Query: 208 FHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWN 267
D VT +++ E ++ PF+ + + VM SYN NG P + L + +R ++
Sbjct: 301 CRTDPHVTPRELHEIYLYPFKHVIQQSHPMGVMSSYNDWNGEPVTSSYYFLTKLLREEYG 360
Query: 268 FHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGDYYTNFTMGA------- 320
F GY+VSD +++ + H+ D E AV +VL+AGL++ T+FT A
Sbjct: 361 FDGYVVSDSQAVEFVHTKHQVAEDYDE-AVRQVLEAGLNVR-----THFTPPADFILPIR 414
Query: 321 --VQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNP-QHIELAAEAARQ 377
+ + KI+ A ID + + V RLG FD + + + +H E E RQ
Sbjct: 415 RLLAENKISMATIDKRVSEVLAVKFRLGLFDAPYRDNPKEADEVAGADKHSEFVKEMQRQ 474
Query: 378 GIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSK-- 435
+VLLKND LPLN IK + + GP A+ MI Y + + G Y K
Sbjct: 475 SLVLLKNDGQLLPLNKKEIKKVLVTGPLADEDNFMISRYGPNGLPTITVLQGIKDYLKGD 534
Query: 436 -VINYAPGC--------------ADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAE 480
+ Y+ GC A + + + + A+ A++AD + V G D E
Sbjct: 535 VEVVYSKGCNIIDKEWPASEVLPAVLTAEEVADMDKAVSEAQSADVIIAVMGEDEYRVGE 594
Query: 481 GKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGE 540
+ R L LPG Q EL+ + K PV LV+++ + IN+ N + +IL +P
Sbjct: 595 SRSRTSLELPGRQRELLQALHATGK-PVVLVLINGQPLTINWEDQN--LPAILEAWFPSF 651
Query: 541 EGGRAIADVIFGKYNPGGRLPITW------YEANYVKIPYT--SMPLRPVNNFPGRTYKF 592
+GG+ IA+ +FG YNPGG+L +T+ E N+ P+ S +P + G
Sbjct: 652 QGGKIIAETLFGDYNPGGKLTVTFPKSVGQIELNF---PFKKGSHGTQPSSGPNGSGSTR 708
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
G +YPFGYGLSYT F Y + P
Sbjct: 709 VLG-ALYPFGYGLSYTTFAYS-----------------------NLEVTAP--------- 735
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKV 711
K + ++ N GK G EV +Y + T+ ++ G++RV + ++ ++
Sbjct: 736 AKGTQGEVQISFDITNTGKYAGEEVAQLYVRDLVSSVVTYDSRLRGFQRVLLQPNETKRM 795
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVG 742
FT+ L+++D + SG + VG
Sbjct: 796 HFTLKPA-DLELLDRNMEWTVESGTFEVRVG 825
>gi|255013016|ref|ZP_05285142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410102476|ref|ZP_11297402.1| hypothetical protein HMPREF0999_01174 [Parabacteroides sp. D25]
gi|409238548|gb|EKN31339.1| hypothetical protein HMPREF0999_01174 [Parabacteroides sp. D25]
Length = 732
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 219/776 (28%), Positives = 361/776 (46%), Gaps = 142/776 (18%)
Query: 18 KLPYPERAKDLVERMTLPEKVQQM-GDLAY---GVPRLGLPLYEW-WSEALHGV-SFIGR 71
K+ +R + L+++MTL EKV + G+ + GV RLG+P EW S+ HGV + I R
Sbjct: 28 KVQMEKRIEKLIKKMTLEEKVGLLHGNSKFYVAGVERLGIP--EWSLSDGPHGVRAEINR 85
Query: 72 RTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFW 131
+ G DS A+ FPT A++N L + G+ + EAR
Sbjct: 86 HDWAYAGWTNDS----ASYFPTGTAFAAAWNPELAYRRGEVLGEEAR-----WRKKDVLL 136
Query: 132 SPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACC 191
P +N++R P GR E EDPY+ A+ Y++GLQ RD ++
Sbjct: 137 GPGVNIIRSPLCGRNFEYMSEDPYMNSVLAVAYIKGLQS-------RD-------VACSV 182
Query: 192 KHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPT 251
KH+A ++N E N R D +E+ ++E ++ F+ V EG +VM +YN+ G
Sbjct: 183 KHFA---VNNQETN-RTTVDVECSERALREIYLPAFKAAVQEGGALTVMAAYNKFRGEFC 238
Query: 252 CADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCGD 311
+ L+ + +R +W F G V+D + + V S ++AGLDL+ G
Sbjct: 239 AENNYLVRKILRNEWGFDGVYVTDWGAAHSTVPS---------------MEAGLDLEMGT 283
Query: 312 --------YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNIC 363
YY N + AV+ GK+ + +D + + V+++ D P+ K G ++
Sbjct: 284 LIDKYEDWYYANPLIDAVKSGKVPMSLVDEKVGDVLRVMIKTNVLD--PK-KRFGPGSMN 340
Query: 364 NPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVG-----PHANA--TKAMIGNY 416
+H + +AA + IVLLKN N LPL+ +IK+LA++G H+N + + Y
Sbjct: 341 TKEHQQATYDAAAEAIVLLKNQNNLLPLDFSSIKSLAVIGDNATRKHSNGGLSSEIKAVY 400
Query: 417 EGTP-----CRYTSPMDGFYA--YSKVINYAPGC---------ADIVCQNNSMIPAAIDA 460
E TP ++ +D +A Y K+ + G + ++++++ A++
Sbjct: 401 EVTPLGALRAKWGDKVDIRFAQGYEKLSTFVEGSNNGQSSGTFSSKTQESDALLKEAVEV 460
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI 520
A+ +D ++V GL+ + E DR+++ +P Q ELI +V A P T+V+M AG+ +
Sbjct: 461 ARTSDVALLVCGLNHDYDTESFDRLNMDIPYGQVELIQEVVKA--NPRTIVVMIAGS-PL 517
Query: 521 NFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLR 580
N A + +I+W + G EGG + DV+ GK NP G++P T + P
Sbjct: 518 NMAAVDICSPAIVWAWFNGMEGGNVLVDVLSGKVNPSGKMPFT------TPVSLDQSPAH 571
Query: 581 PVNNFPGRT------------YKFFDG---PVVYPFGYGLSYTQFKYKVASSPKSVDIKL 625
+ NFPGR Y++FD PVVYPFGYGLSYT F Y L
Sbjct: 572 ALGNFPGRDLKVNYEEDILVGYRWFDTKGLPVVYPFGYGLSYTTFDYS----------NL 621
Query: 626 DKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP 685
+ D++ D T+ + TF + N G +G+EV +Y P
Sbjct: 622 NTDKETYDQADTI--------------------QATFTL--TNTGDREGAEVAQLYVSDP 659
Query: 686 GIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
+ +K++ G+++VF+ G+S ++ + SL A + + IL
Sbjct: 660 VCSVMRPVKELKGFKKVFLKPGESRRITLDI-PVSSLAFYSEAQSQFVVEPGEFIL 714
>gi|120437787|ref|YP_863473.1| beta-glucosidase [Gramella forsetii KT0803]
gi|117579937|emb|CAL68406.1| beta-glucosidase [Gramella forsetii KT0803]
Length = 757
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 202/689 (29%), Positives = 327/689 (47%), Gaps = 84/689 (12%)
Query: 89 TSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLE 148
T FP + TAS++ ++ + + E+ A TF SP I++ RD RWGR++E
Sbjct: 122 TIFPVPLAETASWDMEAAEESARIAALESVAE----GVNWTF-SPMIDISRDARWGRIME 176
Query: 149 TPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRF 208
GEDPY+ + A+ V+G Y + S P I+A KH+A Y EG +
Sbjct: 177 GSGEDPYLTSKVAVAKVKG--------YQGEDLSNPKTIAATAKHFAGYGFA--EGGKDY 226
Query: 209 HFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNF 268
+ + E ++ + PF+ + G V++ M S+N ++GIP L + ++GDW++
Sbjct: 227 N-TVNIGENELHNVILPPFKAAADAG-VATFMNSFNTIDGIPATGSESLQREILKGDWDW 284
Query: 269 HGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDC-GDYYTNFTMGAVQQGKIA 327
G++VSD SI ++ H F D K A +KAG D+D G Y V GK+
Sbjct: 285 TGFMVSDWGSIAEMI-PHGFAKD-KIHAAEIAVKAGSDMDMEGGAYEAGLEKLVAAGKVE 342
Query: 328 EADIDTSLRFLYIVLMRLGYFDGSPQYKNL-GKNNICNPQHIELAAEAARQGIVLLKNDN 386
EA ID +++ + V ++G FD +Y N K N+ +H+ A + A++ IVLLKN+N
Sbjct: 343 EALIDDAVKRILRVKFKMGLFDDPYRYINSETKKNVPYKEHMSTARDIAKKSIVLLKNEN 402
Query: 387 GALPLNTGNIKTLALVGPHANATKAMIGNY--EGTPCRYTSPMDGFY--AYSKVINYAPG 442
LP+ T ++K +A++GP A+ IGN+ +G S ++G I YA G
Sbjct: 403 DLLPIKT-SVKKIAVIGPLADDKDTPIGNWRAQGEENSAVSVLEGLKNANLDAQITYAQG 461
Query: 443 CADIVCQNNSMIP------------AAIDAAKNADATVIVAGLDLSVEAEGKDRVDLLLP 490
+ + + ++P A+ AKNA+ V+V G D EG+ + + L
Sbjct: 462 IKLGMGERSFLMPLKINKTDTTGMGEAVRNAKNAELVVMVLGEDAFQSGEGRSQAKIGLA 521
Query: 491 GFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVI 550
G Q EL+ V K + LV+++ ++++++ N I +I+ G E G AIADV+
Sbjct: 522 GLQMELLKAVHKVNKN-IVLVLINGRPLELSWSSEN--IPTIVEAWQLGSESGNAIADVL 578
Query: 551 FGKYNPGGRLPITW-----YEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGL 605
GKYNP G+LP+++ E Y T P + +GP +YPFGYGL
Sbjct: 579 LGKYNPSGKLPVSFPRAVGQEPLYYNHKNTGRPFSAEHVTYAHYTDIENGP-LYPFGYGL 637
Query: 606 SYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIE 665
SYTQF Y +P + ++ +K ++ K T +
Sbjct: 638 SYTQFDYA----------------------------RPELS---VESIKSRE-KATLSVA 665
Query: 666 VENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIV 724
V N G G EVV +Y + +K++ G+E + + G++ V F +N + LK
Sbjct: 666 VTNSGDRKGKEVVQLYLRDLVATTARPVKELKGFEMIELEPGETKTVEFIINE-EMLKFY 724
Query: 725 DNAANSLLASGAHTILVG---EGVGGVSF 750
+ + G ++VG E V V F
Sbjct: 725 NASEKWEAEEGEFQLMVGGNSEDVQSVKF 753
>gi|295690896|ref|YP_003594589.1| glycosyl hydrolase family protein [Caulobacter segnis ATCC 21756]
gi|295432799|gb|ADG11971.1| glycoside hydrolase family 3 domain protein [Caulobacter segnis
ATCC 21756]
Length = 806
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 212/722 (29%), Positives = 324/722 (44%), Gaps = 109/722 (15%)
Query: 50 RLGLPLYEWWSEALHGVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKI 109
RLG+P+ EALHG ++ R ATSFP I ++F+ L +KI
Sbjct: 151 RLGIPML-MHDEALHG--YVAR---------------DATSFPQAIALASTFDTELTEKI 192
Query: 110 GQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQ 169
+ E RA + N L +P ++V RDPRWGR+ ET GEDP+V + +RG Q
Sbjct: 193 FAVAAREMRARGS--NLAL---APVVDVARDPRWGRIEETYGEDPHVCAEIGLAAIRGFQ 247
Query: 170 DVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEM 229
G D K+ KH + N ++++E+ ++E F PFE
Sbjct: 248 ---GTTLPLAKD----KVFVTLKHMTGHGQPE---NGTNVGPAQISERVLRENFFPPFER 297
Query: 230 CVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFL 289
V E V +VM SYN ++G+P+ LL + +R +W + G + SD +I+ ++ HK
Sbjct: 298 AVTELPVRAVMPSYNEIDGVPSHGSRWLLTKILREEWGYKGSVQSDYFAIKEMISRHKLT 357
Query: 290 NDTKEDAVARVLKAGLDLDC--GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGY 347
D E AV R + AG+D++ G+ Y V+ G+I + +ID ++ + + G
Sbjct: 358 TDLGETAV-RAMHAGVDVELPDGEAYA-LIPELVKAGRIPQFEIDAAVARVLTMKFEGGL 415
Query: 348 FDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHAN 407
F+ + P + LA EAAR+ +VLLKND G LPL+ IK LAL+G HA
Sbjct: 416 FENPYCDEKTADAKTATPDAVALAREAARKAVVLLKNDKGVLPLDGKKIKRLALLGTHAK 475
Query: 408 ATKAMIGNYEGTPCRYTSPMDGFYAYSKVINYAPGCADIV------------------CQ 449
T IG Y P S +G A +K +A A+ V
Sbjct: 476 DTP--IGGYSDVPRHVVSIYEGLTAEAKAQGFALDYAEAVRITEQRIWAQDQVNFTDPAV 533
Query: 450 NNSMIPAAIDAAKNADATVIVAGLDLSVEAEG------KDRVDLLLPGFQTELINKVADA 503
N +I A++ AK AD V+V G + E DR L L G Q +L + D
Sbjct: 534 NAKLIAEAVEVAKKADVVVMVLGDNEQTSREAWADNHLGDRESLDLIGQQNDLAKAIFDL 593
Query: 504 AKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPIT 563
K P + +++ + IN + +I+ Y G+E G A ADV+FG+ NPGG+LP++
Sbjct: 594 GK-PTVVFLLNGRPLSINLLAE--RADAIIEGWYLGQETGNAAADVLFGRANPGGKLPVS 650
Query: 564 WYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGPV--VYPFGYGLSYTQFKYKVASSPKSV 621
N ++P N P + G V +YPFG+GLSYT F S+P+
Sbjct: 651 -IARNVGQLPIY------YNRKPTARRGYLGGDVTPLYPFGFGLSYTSFDI---SAPRLA 700
Query: 622 DIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVY 681
K+ + + + +++V N GK+ G EVV +Y
Sbjct: 701 KAKIGQGETVK-----------------------------VEVDVANTGKVAGDEVVQLY 731
Query: 682 SKPPGIAGTH-IKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLASGAHTIL 740
T + ++ ++RV +A G V F + L + D ++ G +IL
Sbjct: 732 IHDETATVTRPVLELKHFKRVTLAPGAKTTVTFEIKPS-DLWMWDLDMKRVVEPGDFSIL 790
Query: 741 VG 742
VG
Sbjct: 791 VG 792
>gi|261880507|ref|ZP_06006934.1| xylosidase [Prevotella bergensis DSM 17361]
gi|270332847|gb|EFA43633.1| xylosidase [Prevotella bergensis DSM 17361]
Length = 948
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 221/810 (27%), Positives = 365/810 (45%), Gaps = 144/810 (17%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRL---------------------- 51
Y DA P +R DL+E+MT+ EK QM L YG R+
Sbjct: 61 YEDASAPLNDRINDLLEQMTIEEKTNQMVTL-YGYKRVLEDDLPNAGWKQKLWKDGIGAI 119
Query: 52 ----------GLPLYE--W-WSEALHGVS-------FIGRRTNSPPGTHFDSEVPG---- 87
GLP + W W + H + F+ P + + G
Sbjct: 120 DEHLNGFVQWGLPPSDNPWVWPASKHAWAINEVQRFFVEETRLGIPVDFTNEGIRGIESY 179
Query: 88 -ATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLT-FWSPNINVVRDPRWGR 145
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWGR
Sbjct: 180 KATNFPTQLGLGTTWNRQLIRQVGYITGREARLL------GYTNVYAPILDVGRDQRWGR 233
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
E GE P++V I RGLQ ++++ KH+AAY +
Sbjct: 234 YEEIYGESPFLVAELGIQMTRGLQT-------------DFQVASTAKHFAAYSNNKGGRE 280
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
D ++ ++++ + P+E V E + M SYN +GIP L + +R
Sbjct: 281 GMSRVDPQMPPREVENIHLYPWERVVQEAGLLGAMSSYNDYDGIPIQGSYHWLTEVLRHR 340
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLDCG----DYYTNFTMGAV 321
+ F GYIVSD D+++ + H D KE AV + + AGL++ C D + +
Sbjct: 341 FGFRGYIVSDSDALEYLFSKHHTAADMKE-AVYQAVMAGLNVRCTFRSPDSFVLPLRELI 399
Query: 322 QQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNL--GKNNICNPQHIELAAEAARQGI 379
++G+I + ID + + V G FD +P NL + + ++ +A +A+RQ I
Sbjct: 400 REGRIPMSVIDRLVGDILRVKFITGIFD-NPYQMNLKAADQEVNSERNQAVALQASRQSI 458
Query: 380 VLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTPCRYTSPMDGFYAYSKV--- 436
VLLKN + LPL+ ++ + + GP+A+ + +Y T+ ++G KV
Sbjct: 459 VLLKNQDRLLPLDRSKLRRILVCGPNADDASYALTHYGPLAVDVTTVLEGI--RDKVENN 516
Query: 437 --INYAPGCADIV---------------CQNNSMIPAAIDAAKNADATVIVAGLDLSVEA 479
++YA GC D+V Q I A+ AK +D ++V G +
Sbjct: 517 IEVSYAKGC-DVVDPHWPESEIIGYPMTSQEQQDIDHAVALAKESDVAIVVLGGNSRTCG 575
Query: 480 EGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPG 539
E K R L LPG Q +L+ V K PV LV+++ + +N+A + I +I+ YPG
Sbjct: 576 ENKSRSSLDLPGRQLDLLKAVQATGK-PVVLVLINGRPLSVNWA--DRFIPAIVEAWYPG 632
Query: 540 EEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPLRPVNNFPGRTYKFFDGP--- 596
+GG A+ADV+FG YNPGG+L +T + + +IP+ + P +P + G G
Sbjct: 633 SQGGTAVADVLFGDYNPGGKLTVT-FPKSVGQIPF-NFPSKPASQVDGGNKLGLQGNASR 690
Query: 597 ---VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDV 653
+Y FG+GLSYT FKY +++L K+ +N ++ ++
Sbjct: 691 INGALYSFGHGLSYTTFKYS--------NLRLSKETMT--LNDSI-------------NI 727
Query: 654 KCKDYKFTFQIEVENMGKMDGSEVVMVYSKPP-GIAGTHIKQVIGYERVFIAAGQSAKVG 712
C +V N G +G EVV +Y + T+ K + G++R+ + G++ +
Sbjct: 728 SC---------DVSNTGDREGDEVVQLYIRDVISSVTTYEKNLRGFDRIHLKPGETKTLT 778
Query: 713 FTMNACKSLKIVDNAANSLLASGAHTILVG 742
FT+ + LK+V+ ++ G I++G
Sbjct: 779 FTIKP-EHLKLVNKDFEKVVEPGEFKIMIG 807
>gi|257051950|ref|YP_003129783.1| glycoside hydrolase family 3 domain protein [Halorhabdus utahensis
DSM 12940]
gi|256690713|gb|ACV11050.1| glycoside hydrolase family 3 domain protein [Halorhabdus utahensis
DSM 12940]
Length = 783
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 199/697 (28%), Positives = 326/697 (46%), Gaps = 99/697 (14%)
Query: 86 PGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAGLTFWSPNINVVRDPRWGR 145
PG T FP I ++++ +L + I ++ A+ + SP ++V RD RWGR
Sbjct: 121 PGGTIFPQSIGLASTWSPALVESITDSIRKRLAAV-----GAVQALSPVLDVSRDMRWGR 175
Query: 146 VLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISACCKHYAAYDLDNWEGN 205
V ET GEDP +VG YV GLQ+ D D I A KH+AA+ + EG
Sbjct: 176 VEETYGEDPQLVGALGAAYVSGLQN--------DGDG----IDATLKHFAAHG--SGEGG 221
Query: 206 DRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGD 265
+ ++ E++++E + PFE+ + E D +VM +Y+ ++G+P + LL +RG+
Sbjct: 222 -KNRSSVQIGERELREVHLYPFEVAIREADARAVMNAYHDIDGVPCASSEWLLTDVLRGE 280
Query: 266 WNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD--CGDYYTNFTMGAVQQ 323
W F G++V+D S+ + H + DT+ +A L+AGLD++ D Y + AV+
Sbjct: 281 WGFDGHVVADYFSVDLLKTEHG-IADTQREAGVAALEAGLDIELPATDCYGENLLKAVED 339
Query: 324 GKIAEADIDTSLRFLYIVLMRLGYFDGSPQYKNLGKNNICNPQHIELAAEAARQGIVLLK 383
G+++EA +DT++R + + G FD + ELAA AAR+ + LL+
Sbjct: 340 GELSEATVDTAVRRVLRAKIESGVFDDPYVDPEAASEPFDTDEQTELAARAARESMTLLE 399
Query: 384 NDNGALPLNTGNIKTLALVGPHANATKAMIGNY---------EGTPCRYTSPMDGFYAYS 434
ND+ LPL ++ ++ALVGP A+ +A +G+Y E +P D A
Sbjct: 400 NDD-LLPLAGEDLDSVALVGPQADDGRAQVGDYTHAARFDTEEDGDFECVTPRDALEAKG 458
Query: 435 KV----INYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGL----------------D 474
+ + Y G A + + AA + +AD V G D
Sbjct: 459 ETAGFDVEYVEG-ATMTGPSTEEFDAAEETVADADVAVACVGARSDIDFADRENPSELPD 517
Query: 475 LSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDI-NFAKNNPKIKSIL 533
+ E D DL LPG Q ELI+++A+ P+ +V +S I A+ P ++L
Sbjct: 518 VPTSGENCDVTDLELPGVQAELIDRLAE-TDTPLVVVQVSGKPHAIPEIAETVP---ALL 573
Query: 534 WVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEA-NYVKIPYTSMPLRPVNNFPGRTYKF 592
PG+ GG AIADV+FG+YNP G LP++ ++ + Y+ P N + +
Sbjct: 574 HAWLPGQAGGTAIADVLFGEYNPSGHLPVSIPKSVGQQPVYYSRKP-----NSANEEHVY 628
Query: 593 FDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDD 652
DG +Y FG+GLSYT F+Y + T +P +
Sbjct: 629 MDGEPLYSFGHGLSYTDFEYGELELEEG-------------------TVEPMGS------ 663
Query: 653 VKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH-IKQVIGYERVFIAAGQSAKV 711
+ + V N G+ G +VV +Y + +++++G+ERV + G+S +V
Sbjct: 664 -------LSASVTVTNAGERAGDDVVQLYQHAENPSQARPVQELLGFERVHLEPGESKRV 716
Query: 712 GFTMNACKSLKIVDNAANSLLASGAHTILVGEGVGGV 748
FT +A + L D + + G + + VGE +
Sbjct: 717 TFTFDATQ-LAYYDLNMHLAVEEGPYELRVGESAAEI 752
>gi|262383061|ref|ZP_06076198.1| glycoside hydrolase family 3 [Bacteroides sp. 2_1_33B]
gi|262295939|gb|EEY83870.1| glycoside hydrolase family 3 [Bacteroides sp. 2_1_33B]
Length = 758
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 184/603 (30%), Positives = 299/603 (49%), Gaps = 63/603 (10%)
Query: 131 WSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSRPLKISAC 190
++P +++ RD RWGRV+E GEDPY+ A V G Q G ++ +D + AC
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQG--GNDWRSLADVN--TVLAC 217
Query: 191 CKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSYNRVNGIP 250
CKH+AAY G D +++ Q+ + +P + E V++ M S+N +NG+P
Sbjct: 218 CKHFAAYGAAE-AGRD---YNTSELSQNTLMNYYMPPYLAAKEAGVATFMASFNEINGVP 273
Query: 251 TCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKAGLDLD-C 309
+ + L+ +R DW F+G++V+D I +V +H + + KE A AG+D+D
Sbjct: 274 STGNKWLMTDLLRKDWGFNGFVVTDYTGINEMV-AHSIVRNDKE-AGELAANAGIDMDMT 331
Query: 310 GDYYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQY--KNLGKNNICNPQH 367
G Y+ + + +V++GK++E +I+ ++ + + LG FD +Y KN I P+
Sbjct: 332 GGIYSQYLVQSVKEGKVSEENINRAVASILEMKFLLGLFDDPYRYLDNEREKNTIMKPEF 391
Query: 368 IELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGP------HANATKAMIGNYEGTPC 421
++ A E + + IVLLKNDN P++ T+AL+GP + N A G E +
Sbjct: 392 LQEARETSARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESIS 451
Query: 422 RYTSPMDGFYAYSKVINYAPGCADIVCQNNSMIPAAIDAAKNADATVIVAGLDLSVEAEG 481
+ + + + YA GC D++ ++S AI A+ AD + G D + E
Sbjct: 452 LFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEA 510
Query: 482 KDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEE 541
R +L LPG Q L+ ++ K P+ L++++ +D+++ N + IL Y G
Sbjct: 511 ACRTNLKLPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--ENQHVDGILEAWYLGTM 567
Query: 542 GGRAIADVIFGKYNPGGRLPITW-YEANYVKIPYTSMPL-RPV-NNFPGRTYK--FFDGP 596
G +ADVI G YNP RL +++ + + Y P RPV P YK + D P
Sbjct: 568 AGHGMADVISGDYNPSARLTMSFPRTVGQLPLYYNQKPTGRPVPPEAPDTDYKSRYMDVP 627
Query: 597 --VVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVGTNKPPCAAVLIDDVK 654
+YPFGYGLSYT F +KLD++ ++T G
Sbjct: 628 NTPLYPFGYGLSYTTFAVN--------SMKLDQN------SFTKGG-------------- 659
Query: 655 CKDYKFTFQIEVENMGKMDGSEVVMVYSKP-PGIAGTHIKQVIGYERVFIAAGQSAKVGF 713
K T EVEN GK+DG VV +Y + G +K++ G+E+V + AG+ +V F
Sbjct: 660 ----KITVTAEVENTGKVDGETVVQMYIRDLAGSVTRPVKELKGFEKVTLKAGEKKQVSF 715
Query: 714 TMN 716
T++
Sbjct: 716 TID 718
>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
Length = 862
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 158/459 (34%), Positives = 232/459 (50%), Gaps = 43/459 (9%)
Query: 5 IKVKLSDFPYCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALH 64
+ V PY D L + RAKDL R+TL EK M D++ +PRLG+ + WWSEALH
Sbjct: 16 VGVNAQQSPYQDPGLSFEARAKDLCSRLTLEEKASLMCDVSPAIPRLGIKPFNWWSEALH 75
Query: 65 GVSFIGRRTNSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLG 124
G + G T FP I ASFN ++ ++ S EAR YN
Sbjct: 76 GYANNG----------------DVTVFPEPIGMAASFNPTMVYQVFTATSDEARGKYNQS 119
Query: 125 NA---------GLTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVE 175
A L+ W+PN+N+ RDPRWGR ET GEDPY+ + V+GLQ E +
Sbjct: 120 MAEGKEDTRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVEVVKGLQGPESTK 179
Query: 176 YHRDSDSRPLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGD 235
Y K+ AC KH+A + + + D ++ +D+ ET++ F+ V +
Sbjct: 180 YR--------KLYACAKHFAVHSGPEYTRHTANLAD--ISPRDLWETYLPAFKATVQQAG 229
Query: 236 VSSVMCSYNRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKED 295
V VMC+Y R++ P C + +LL Q +R +W F +VSDC +I +H +D
Sbjct: 230 VREVMCAYQRLDDEPCCGNSRLLQQILRDEWGFRHMVVSDCGAIADFYTNHHVSSDAVH- 288
Query: 296 AVARVLKAGLDLDCGDYYTNFTM-GAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSP-- 352
A A+ AG D++CG Y + AV++G ++EA++D + L LG D
Sbjct: 289 AAAKGTLAGTDVECGFGYAYMKLPEAVRRGLVSEAEVDKHVIRLLKGRFELGVMDDPKLV 348
Query: 353 QYKNLGKNNICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAM 412
+ + + + H +LA ARQ + LL+N N LPL G + +A+VGP+A +
Sbjct: 349 SWTKISPKVVDSDAHRQLALNMARQTMTLLQNRNNVLPLAKG--EKIAVVGPNAADGPML 406
Query: 413 IGNYEGTPCRYTSPMDGFYAYS-KVINYAPGCADIVCQN 450
GNY GTP R T+ ++G A + K I Y GC D+V +N
Sbjct: 407 WGNYNGTPSRTTTILEGIRAKAGKDIPYLQGC-DLVNKN 444
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 129/285 (45%), Gaps = 53/285 (18%)
Query: 457 AIDAAKNADATVIVAGLDLSVEAE----------GKDRVDLLLPGFQTELINKVADAAKG 506
AI + V V G+ +E E G DR + LP Q + + + A K
Sbjct: 593 AIRQLRGVRTVVFVGGISSKLEGEEMPVHVEGFKGGDRTSIELPAVQRDFLKALKAAGK- 651
Query: 507 PVTLVIMSAGAVDINFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYE 566
T+V ++ I +IL Y GEEGGRA+ADV++G YNPGG+LP+T+Y
Sbjct: 652 --TVVFVNCSGSAIALTPEVESCDAILQAWYAGEEGGRAVADVLYGDYNPGGKLPVTFYR 709
Query: 567 ANYVKIPYTSMPLRPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLD 626
+ T +P + GRTY++F ++PFGYGLSYT+F
Sbjct: 710 ST------TQLPAFDDYSMKGRTYRYFSD-ALFPFGYGLSYTRF---------------- 746
Query: 627 KDQQCRDINYTVGTNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPG 686
+G A+ D K T + V N+GK G EVV VY +
Sbjct: 747 ----------AIGKGSLSAPAMKADG------KVTLTVPVSNVGKRTGDEVVQVYVRDVN 790
Query: 687 IAGTHIKQVIGYERVFIAAGQSAKVGFTMNACKSLKIVDNAANSL 731
A +K + + RV + AG+S KV + A ++ + D+A+N++
Sbjct: 791 DADGPLKSLKAFRRVSLKAGESRKVTIPLTA-ETFSLFDSASNTV 834
>gi|330996729|ref|ZP_08320604.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
gi|329572574|gb|EGG54217.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 852
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 161/456 (35%), Positives = 238/456 (52%), Gaps = 46/456 (10%)
Query: 14 YCDAKLPYPERAKDLVERMTLPEKVQQMGDLAYGVPRLGLPLYEWWSEALHGVSFIGRRT 73
+ D K P ER DL+ R+T+ EK+ + + A + RLG+ Y +EALHGV
Sbjct: 29 FRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGVV------ 82
Query: 74 NSPPGTHFDSEVPGATSFPTVILTTASFNESLWKKIGQTVSTEARAMYNLGNAG------ 127
PG T FP I A +N L +I +S EAR + G
Sbjct: 83 --RPGDF--------TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAG 132
Query: 128 ----LTFWSPNINVVRDPRWGRVLETPGEDPYVVGRYAINYVRGLQDVEGVEYHRDSDSR 183
LTFWSP +N+ RDPRWGR ET GEDPY+ G + +V+GLQ + R
Sbjct: 133 ASDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQG---------NHPR 183
Query: 184 PLKISACCKHYAAYDLDNWEGNDRFHFDSRVTEQDMQETFILPFEMCVNEGDVSSVMCSY 243
LK + KH+A N E ++R +++V+E+D++E ++ FE C+ EG S+M +Y
Sbjct: 184 YLKTVSTPKHFAV----NNEEHNRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAY 239
Query: 244 NRVNGIPTCADPKLLNQTIRGDWNFHGYIVSDCDSIQTIVESHKFLNDTKEDAVARVLKA 303
N VN +P + L+ +RGDW F+GYIVSDC + + ++ H ++ T+E A +KA
Sbjct: 240 NAVNDVPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYVK-TREAAATLAVKA 298
Query: 304 GLDLDCGD-YYTNFTMGAVQQGKIAEADIDTSLRFLYIVLMRLGYFDGSPQ--YKNLGKN 360
GLDL+CG+ Y + A +Q ++EADID++ + M LG FD Q Y + +
Sbjct: 299 GLDLECGNQVYGEGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFDDPSQNPYNQIEPS 358
Query: 361 NICNPQHIELAAEAARQGIVLLKNDNGALPLNTGNIKTLALVGPHANATKAMIGNYEGTP 420
+ H +LA EAARQ +VLLKN + LPLN +K++A+VG +A G+Y GTP
Sbjct: 359 VVGCKAHQDLALEAARQSMVLLKNKDNFLPLNPQKVKSIAVVG--ISAGHCEFGDYSGTP 416
Query: 421 CRY-TSPMDGFYAYSKVINYAPGCADIVCQNNSMIP 455
+ +DG Y++ + A V + P
Sbjct: 417 KNEPVTILDGIKQYAEEYGFKVAYAPWVSASEDFEP 452
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 106/294 (36%), Positives = 152/294 (51%), Gaps = 52/294 (17%)
Query: 461 AKNADATVIVAGLDLSVEAEGKDRVDLLLPGFQTELINKVADAAKGPVTLVIMSAGA-VD 519
A D TV V G++ S+E EG+DR L LP Q E I ++ P T+V++ AG+ +
Sbjct: 601 AAECDVTVAVLGINKSIEREGQDRFTLELPIDQQEFIKELYKV--NPNTVVVLVAGSSLA 658
Query: 520 INFAKNNPKIKSILWVGYPGEEGGRAIADVIFGKYNPGGRLPITWYEANYVKIPYTSMPL 579
+N+ N + +IL YPGE+GG A+A+V+FG YNPGGRLP+T+Y + +P
Sbjct: 659 VNWMDEN--VPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS------LDEIPA 710
Query: 580 RPVNNFPGRTYKFFDGPVVYPFGYGLSYTQFKYKVASSPKSVDIKLDKDQQCRDINYTVG 639
+ GRTY++F+G +Y FGYGLSYT+F+YK K V + D TV
Sbjct: 711 FDNYSVKGRTYQYFEGQPLYEFGYGLSYTKFRYK----SKGVSVARD----------TV- 755
Query: 640 TNKPPCAAVLIDDVKCKDYKFTFQIEVENMGKMDGSEVVMVYSKPPGIAGTH--IKQVIG 697
K +F EV N GK DG EV VY K P GT+ +KQ+ G
Sbjct: 756 -------------------KVSF--EVSNTGKYDGDEVAQVYVKYPE-TGTYMPLKQLHG 793
Query: 698 YERVFIAAGQSAKVGFTMNACKSLKIVDNAANSLLA-SGAHTILVGEGVGGVSF 750
++RV I G+++KV + K L+ D + G +T +VG + F
Sbjct: 794 FKRVHIKKGKTSKVTVGVPK-KDLRYWDEQERKFVTPKGEYTFMVGASSEDIKF 846
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.137 0.419
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,620,819,410
Number of Sequences: 23463169
Number of extensions: 566271712
Number of successful extensions: 1242051
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6145
Number of HSP's successfully gapped in prelim test: 1531
Number of HSP's that attempted gapping in prelim test: 1188429
Number of HSP's gapped (non-prelim): 17569
length of query: 758
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 607
effective length of database: 8,816,256,848
effective search space: 5351467906736
effective search space used: 5351467906736
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)