BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038581
(792 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
Length = 802
Score = 1169 bits (3024), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 551/772 (71%), Positives = 644/772 (83%), Gaps = 4/772 (0%)
Query: 21 STNAVDAN-GSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQ 79
+ N DAN SS +VCD R+ LGL M++F FCDSSL Y +R KDLV++MTL EKVQ
Sbjct: 34 TLNHDDANPRGSSFTYVCDSSRYDNLGLDMTTFGFCDSSLSYEVRAKDLVNQMTLKEKVQ 93
Query: 80 QLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESL 139
QLGD A+GVPRLG+P+YEWWSEALHGVS+VGPGT FDD++PGATSFPT ILTTASFNESL
Sbjct: 94 QLGDLAYGVPRLGIPKYEWWSEALHGVSDVGPGTFFDDLVPGATSFPTTILTTASFNESL 153
Query: 140 WKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYV 199
WK IGQA S +ARAMYNLGRAGLTYWSPN+NV RDPRWGR ETPGEDP+VVGRYAVNYV
Sbjct: 154 WKNIGQA-SAKARAMYNLGRAGLTYWSPNVNVVRDPRWGRTVETPGEDPYVVGRYAVNYV 212
Query: 200 RGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLR 259
RGLQDVEG EN TDLN+RPLKVSSCCKHYAAYDV+ W+GV+R FDARVTEQDM ETFLR
Sbjct: 213 RGLQDVEGTENYTDLNTRPLKVSSCCKHYAAYDVEKWQGVERLTFDARVTEQDMVETFLR 272
Query: 260 PFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN 319
PFEMCVKEGD SSVMCS+NRVNGIP+CADPKLLNQT+RG+WDLHGYIV+DCDSI+VMVDN
Sbjct: 273 PFEMCVKEGDVSSVMCSFNRVNGIPTCADPKLLNQTIRGDWDLHGYIVSDCDSIEVMVDN 332
Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
HKFL D+ EDAVAQ LKAGLDLDCG YYTNFT +V+QGK +E ID+SLKYLY VLMRL
Sbjct: 333 HKFLGDTNEDAVAQVLKAGLDLDCGGYYTNFTETSVKQGKAREEYIDRSLKYLYVVLMRL 392
Query: 380 GFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
GFFDG+PQY LGK+DIC+ EN+ELA +AAREGIVLLKN+ +TLPL+ KVK +AVVGPH
Sbjct: 393 GFFDGTPQYQKLGKKDICTKENVELAKQAAREGIVLLKNN-DTLPLSMDKVKNLAVVGPH 451
Query: 440 ANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
ANAT MIGNYAG+PCRY+SPI GFS Y+NVTY+ GC DV CK+ + +F A AAK ADA
Sbjct: 452 ANATRVMIGNYAGVPCRYVSPIDGFSIYSNVTYEIGC-DVPCKNESLVFPAVHAAKNADA 510
Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
TII+AGLDL++EAE LDR DL LPGYQTQLINQVA A GPVILVIM+AGGVDI+FA N
Sbjct: 511 TIIVAGLDLTIEAEGLDRNDLLLPGYQTQLINQVAGAANGPVILVIMAAGGVDISFARDN 570
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
IKAILW GYPG+EGG AIADVVFGK+NPGGRLPITWY D+V+ +P+T M LRP + L
Sbjct: 571 EKIKAILWVGYPGQEGGHAIADVVFGKYNPGGRLPITWYEADFVEQVPMTYMQLRPDEEL 630
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
GYPG+TYKFY+G T+YPFGYGLSYT F YN+ S ++ + LNK QHCR+L Y ++ K
Sbjct: 631 GYPGKTYKFYDGSTVYPFGYGLSYTTFSYNITSAKRSKHIALNKFQHCRDLRYGNETFKP 690
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
CP VL + L C+D FE +V+ +N GS DGS+VV+VYSK P I +YIKQVIGF+RVFV
Sbjct: 691 SCPAVLTDHLPCNDDFELEVEVENTGSRDGSEVVMVYSKTPEGIVGSYIKQVIGFKRVFV 750
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
+AG +++ F FN CKS I+DY A ++LP+G HTI VG+ VS P+++N++
Sbjct: 751 QAGSVEKVNFRFNVCKSFRIIDYNAYSILPSGGHTIMVGDDIVSIPLYINYS 802
>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 812
Score = 1136 bits (2939), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 538/800 (67%), Positives = 635/800 (79%), Gaps = 10/800 (1%)
Query: 2 AKVVSS-LLCFSLSIALLVFSTNAV---------DANGSSSPVFVCDPGRFSKLGLQMSS 51
AK+ SS ++ S+ +F+ NA D ++ FVCDP R+ KLGL SS
Sbjct: 13 AKMASSPIMMISVLSLFFIFTANARVFPRRSLLDDPPAVNNFTFVCDPSRYDKLGLDFSS 72
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
F FCDSSL + R KDL+ RMTL EK QLG A GV RLGLP Y WWSEALHGVSNVGP
Sbjct: 73 FGFCDSSLSFPERAKDLIDRMTLSEKAAQLGHVASGVDRLGLPPYNWWSEALHGVSNVGP 132
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GT FD V+PGATSFP VI T +SFNE LWK IGQAVSTEARAMYNLGRAGLTYWSP INV
Sbjct: 133 GTQFDKVVPGATSFPNVITTASSFNEDLWKTIGQAVSTEARAMYNLGRAGLTYWSPTINV 192
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
RDPRWGR ETPGEDPFVVG+YA NYVRGLQDVEG EN TDLNSRPLKVSSCCKHYAAY
Sbjct: 193 IRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQDVEGSENVTDLNSRPLKVSSCCKHYAAY 252
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
DVDNW GV+RY FDARVTEQDM ETF +PFEMCVKEGD SSVMCSYNRVNGIP+CADP L
Sbjct: 253 DVDNWLGVERYSFDARVTEQDMLETFNKPFEMCVKEGDVSSVMCSYNRVNGIPTCADPVL 312
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
L T+RG W LHGYIV+DCDS++VMV++ +L D+ EDAVAQTLKAGLDLDCGQ Y N+T
Sbjct: 313 LKDTIRGNWGLHGYIVSDCDSVKVMVEDAHYLQDTNEDAVAQTLKAGLDLDCGQIYPNYT 372
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAARE 411
+ V+QGKV +ID +L LY VLMRLG+FDG+ + SLGK DICSDE+IELA EAAR+
Sbjct: 373 ESTVRQGKVGMRNIDNALNNLYVVLMRLGYFDGNTGFESLGKPDICSDEHIELATEAARQ 432
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT 471
G VLLKND +TLP + + KT+AVVGPHANAT AM+GNYAG+PCR SP+ G S YA V
Sbjct: 433 GTVLLKNDNDTLPFDPSNYKTLAVVGPHANATSAMLGNYAGVPCRMNSPMDGLSEYAKVK 492
Query: 472 YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLIN 531
Y+ GCD VACK++ IF A EAA+T+DAT+I G+DLS+EAESLDR DL LPGYQTQL+
Sbjct: 493 YQMGCDSVACKNDTFIFGAMEAARTSDATVIFVGIDLSIEAESLDRVDLLLPGYQTQLVQ 552
Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
QVA V+KGPV+LVI+SAGG+D++FA+ N+NIKAI+WAGYPGEEGGRAIADV+FGKFNPGG
Sbjct: 553 QVATVSKGPVVLVILSAGGIDVSFAKNNSNIKAIIWAGYPGEEGGRAIADVIFGKFNPGG 612
Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
RLP+TWY DYV LP+TSMPLRPV SLGYPGRTYKFY+GP +YPFG+GLSYT F +NL
Sbjct: 613 RLPLTWYENDYVYQLPMTSMPLRPVKSLGYPGRTYKFYDGPVVYPFGHGLSYTFFLHNLT 672
Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
S ++I ++L+ CR++ YT+ K CP VLV+DL C + EF+++ +N G DGS
Sbjct: 673 SAKRSIAIDLSNRTQCRDIAYTNGTFKPECPAVLVDDLTCTEEIEFQMEVENTGERDGSQ 732
Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAG 771
V++VYS PP I++T+IKQV+GFQRVF++AG ++ + F NACKSL +VD+ LLPAG
Sbjct: 733 VLLVYSVPPGGISSTHIKQVVGFQRVFLKAGDSETVTFKLNACKSLGLVDFTGYNLLPAG 792
Query: 772 EHTIFVGNGGVSFPIHLNFN 791
HTI VG+G VSFP+ L+FN
Sbjct: 793 GHTIVVGDGEVSFPVELSFN 812
>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 809
Score = 1083 bits (2802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 523/810 (64%), Positives = 611/810 (75%), Gaps = 23/810 (2%)
Query: 1 MAKVVSSLLCFSLSIALLVF--STNAVDANGSSSP---------------VFVCDPGRFS 43
M K++ SL FSLSI + F +A+ + P +VCD RF+
Sbjct: 1 MGKLLRSLF-FSLSIVWIAFFAVCSAIKSPLKDGPAAAPMAARGPIDGNYTYVCDESRFA 59
Query: 44 KLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEAL 103
LGL M F +CDSS PY +R KDLV RMTL EKV Q GD A GV R+GLP+Y WWSEAL
Sbjct: 60 ALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEAL 119
Query: 104 HGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
HGVSN G FD+V+PGATSFPTVIL+ ASFN+SLWK +GQAVSTEARAMYN G AGLT
Sbjct: 120 HGVSNFGRCVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLT 179
Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
+WSPNINV RDPRWGRI ETPGEDP +VG YAVNYVRGLQDV G EN TDLNSRPLKVSS
Sbjct: 180 FWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNYVRGLQDVVGAENTTDLNSRPLKVSS 239
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
CCKHYAAYD+DNWKG DR HFDARV+ QDM ETF+ PFEMCVKEGD SSVMCSYN++NGI
Sbjct: 240 CCKHYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCSYNKINGI 299
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
PSCAD +LL QT+RGEWDLHGYIV+DCDS++VM + K+L S D+ AQ L AG++LDC
Sbjct: 300 PSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLDC 359
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
G + AV QGK + D+D SL+YLY +LMR+GFFDG P + SLGK DICS E+IE
Sbjct: 360 GTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPAFASLGKDDICSAEHIE 419
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
LA EAAR+GIVLLKND TLPL S VK +A+VGPHANAT AMIGNYAGIPC Y+SP+
Sbjct: 420 LAREAARQGIVLLKNDNATLPLKS--VKNIALVGPHANATDAMIGNYAGIPCYYVSPLDA 477
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
FS V Y+ GC DV C + IF A EAAK ADATII AG DLS+EAE+LDR DL LP
Sbjct: 478 FSSMGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAEALDRVDLLLP 537
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
GYQTQLINQVA+++ GPV+LVIMS GGVDI+FA N I AILWAGYPGE+GG AIADV+
Sbjct: 538 GYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGEQGGNAIADVI 597
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GK+NPGGRLPITWY DYV MLP+TSM LRPVDSLGYPGRTYKF+NG T+YPFGYG+SY
Sbjct: 598 LGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGSTVYPFGYGMSY 657
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T F Y+L + + +NL KLQ CR++ Y +D CP VLV+DL C + EF+V +N
Sbjct: 658 TNFSYSLSTSQRWTNINLRKLQRCRSMVYINDTFVPDCPAVLVDDLSCKESIEFEVAVKN 717
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
VG DGS+VV+VYS PP IA T+IK+V+GF+RVFV+ G +++KF N CKSL IVD
Sbjct: 718 VGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMNVCKSLGIVDST 777
Query: 764 ANTLLPAGEHTIFVG---NGGVSFPIHLNF 790
LLP+G HTI VG V+FP H+N+
Sbjct: 778 GYALLPSGSHTIKVGGDNTTSVAFPFHVNY 807
>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
Length = 694
Score = 1075 bits (2779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 510/726 (70%), Positives = 598/726 (82%), Gaps = 36/726 (4%)
Query: 67 DLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFP 126
DLV++MTL+EKV QLG+ A+GVPRLGL +Y+WWSEALHGVSNVGPGT FDD+IPG+TSFP
Sbjct: 2 DLVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVGPGTFFDDLIPGSTSFP 61
Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGE 186
TVI T A+FNESLWK IGQAVSTEARAMYNLGRAGLTYWSPNINV RDPRWGR ETPGE
Sbjct: 62 TVITTAAAFNESLWKVIGQAVSTEARAMYNLGRAGLTYWSPNINVVRDPRWGRAIETPGE 121
Query: 187 DPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA 246
DP++VGRYAVNYVRGLQDVEG EN TD NSRPLKVSSCCKHYAAYDVDNWKGV+RY FDA
Sbjct: 122 DPYLVGRYAVNYVRGLQDVEGSENYTDPNSRPLKVSSCCKHYAAYDVDNWKGVERYTFDA 181
Query: 247 RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYI 306
RV+EQDM ETFLRPFEMCVK+GD SSVMCSYNRVNGIP+CADPKLLNQT+RG+WDLHGYI
Sbjct: 182 RVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWDLHGYI 241
Query: 307 VADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDID 366
V+DCDS+QVMV+NHK+L GLDLDCG YYT AV+QGKV+E DID
Sbjct: 242 VSDCDSLQVMVENHKWL--------------GLDLDCGAYYTENVEAAVRQGKVREADID 287
Query: 367 KSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN 426
KSL +LY VLMRLGFFDG PQY S GK D+CS ENIELA EAAREG VLLKN+ ++LPL+
Sbjct: 288 KSLNFLYVVLMRLGFFDGIPQYNSFGKNDVCSKENIELATEAAREGAVLLKNENDSLPLS 347
Query: 427 SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS 486
KVKT+AV+GPH+NAT AMIGNYAGIPC+ ++PI G S YA V Y+ GC D+ACK +
Sbjct: 348 IEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKYAKVDYQMGCSDIACKDESF 407
Query: 487 IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
IF A E+AK ADATIILAG+DLS+EAESLDR+DL LPGYQTQLINQVA V+ GPV+LV+M
Sbjct: 408 IFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQTQLINQVASVSNGPVVLVLM 467
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
SAGGVDI+FA++N +IK+ILW GYPGEEGG AIADV+FGK+NPGGRLP+TW+ DYV ML
Sbjct: 468 SAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGKYNPGGRLPLTWHEADYVDML 527
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
P+TSMPLRP+DSLGYPGRTYKF+NG T+YPFG+GLSYTQF Y L S +++ + L+K Q+
Sbjct: 528 PMTSMPLRPIDSLGYPGRTYKFFNGSTVYPFGHGLSYTQFTYKLTSTIRSLDIKLDKYQY 587
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGSDVVIVYSKPPAEIA 724
C +L Y +D+ FK F+ N G+ DGS+VVIVY+KPP I
Sbjct: 588 CHDLGYKNDS--------------------FKPSFEVLNAGAKDGSEVVIVYAKPPEGID 627
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSF 784
ATYIKQVIGF+RVFV AG ++++KF FNA KSL +VD+ A ++LP+G HTI +G+ +SF
Sbjct: 628 ATYIKQVIGFKRVFVPAGGSEKVKFEFNASKSLQVVDFNAYSVLPSGGHTIMLGDDIISF 687
Query: 785 PIHLNF 790
+ + F
Sbjct: 688 SVQIRF 693
>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 805
Score = 1062 bits (2747), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 508/809 (62%), Positives = 625/809 (77%), Gaps = 23/809 (2%)
Query: 1 MAKVVSSLLCFSLSI---ALLVFST-------------NAVDANGSSSPVFVCDPGRFSK 44
MAK + L FSLSI A L ST A D G+ + +VCD RF+
Sbjct: 1 MAKSFTRLF-FSLSILAIAFLAVSTARYTPRPNSRFLSQAFDVPGNYT--YVCDASRFAA 57
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
LGL M F++CDSSLPY +RVKDLV R+TL+EK + + D A GVPR+GLP Y+WWSEALH
Sbjct: 58 LGLDMKDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALH 117
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
GV+NVG T FD+V+PGATSFP VIL+ ASFN+SLWK +GQ VSTEARAMYNLG AGLT+
Sbjct: 118 GVANVGSATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTF 177
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPNINVARDPRWGRI ETPGEDP VG Y VNYVRGLQD+EG EN TDLNSRPLK++S
Sbjct: 178 WSPNINVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASS 237
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
CKH+AAYD+D W VDR HFDA+V+EQDM ETFLRPFEMCVKEGD SSVMCS+N +NGIP
Sbjct: 238 CKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIP 297
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
CADP+ L +R +W+LHGYIV+DC +I +V + KFL + E+ VA ++KAGLDL+CG
Sbjct: 298 PCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECG 357
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIEL 404
YY + AV++G+V E D+DKSL YLY VLMR+GFFDG P SLGK+DIC+DE+IEL
Sbjct: 358 HYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKKDICNDEHIEL 417
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
A EAAR+GIVLLKND TLPL VK +A+VGPHANATVAMIGNYAGIPC Y+SP+ F
Sbjct: 418 AREAARQGIVLLKNDNATLPLK--PVKKLALVGPHANATVAMIGNYAGIPCHYVSPLDAF 475
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
S +VTY+ GC DV C ++ ++ A+EAAK ADATIIL G DLS+EAE DREDL LPG
Sbjct: 476 SELGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPG 535
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
YQT+++NQV +++ GPVILV+M G +DI+FA+ N I AILWAG+PGE+GG AIAD+VF
Sbjct: 536 YQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVF 595
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
GK+NPGGR PITWY YV MLP+TSM LRP++SLGYPGRTYKF+NG T+YPFGYGLSYT
Sbjct: 596 GKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGSTVYPFGYGLSYT 655
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F Y+L + T+++ ++L +LQ CR++ Y+SD+ + C VLV+DL CD+ FEF+V +NV
Sbjct: 656 NFSYSLTAPTRSVHISLTRLQQCRSMAYSSDSFQPECSAVLVDDLSCDESFEFQVAVKNV 715
Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
GS DGS+VV+VYS PP+ I T+IKQVIGF+RVFV+ G +++KF N CKSL +VD +
Sbjct: 716 GSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMNVCKSLGLVDSSG 775
Query: 765 NTLLPAGEHTIFVGNG--GVSFPIHLNFN 791
LLP+G HTI G+ VSFP +N++
Sbjct: 776 YILLPSGSHTIMAGDNSTSVSFPFQVNYH 804
>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 805
Score = 1040 bits (2690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/760 (63%), Positives = 589/760 (77%), Gaps = 4/760 (0%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
+VCD R++ LGL M SF FCD SL Y R KDLVSRMTL EKV Q A GV RLGLP
Sbjct: 47 YVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLP 106
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
+Y WWSEALHG+SN+GPG FD+ IPGATS PTVIL+TA+FN++LWK +G+ VSTE RAM
Sbjct: 107 EYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YNLG AGLT+WSPNINV RD RWGR ET GEDPF+VG +AVNYVRGLQDVEG EN TDL
Sbjct: 167 YNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTDL 226
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
NSRPLKVSSCCKHYAAYD+D+W VDR+ FDARV+EQDM+ETF+ PFE CV+EGD SSVM
Sbjct: 227 NSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVM 286
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CS+N++NGIP C+DP+LL +R EWDLHGYIV+DC ++V+VDN +L DSK DAVA+T
Sbjct: 287 CSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKT 346
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ 394
L+AGLDL+CG YYT+ +V GKV + ++D++LK +Y +LMR+G+FDG P Y SLG +
Sbjct: 347 LQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGLK 406
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
DIC+ ++IELA EAAR+GIVLLKND LPL K +A+VGPHANAT MIGNYAG+P
Sbjct: 407 DICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGNYAGLP 464
Query: 455 CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
C+Y+SP+ FS NVTY TGC D +C ++ A EAAK+A+ TII G DLS+EAE
Sbjct: 465 CKYVSPLEAFSAIGNVTYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSIEAEF 524
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
+DR D LPG QT+LI QVAEV+ GPVILV++S +DI FA+ N I AILW G+PGE+
Sbjct: 525 VDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQ 584
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
GG AIADVVFGK+NPGGRLP+TWY DYV MLP++SM LRPVD LGYPGRTYKF++G T+
Sbjct: 585 GGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGSTV 644
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFGYG+SYT+F Y+L + +I ++LNK Q CR + YT D CP VL++D+ CDD
Sbjct: 645 YPFGYGMSYTKFSYSLATSKISIDIDLNKFQKCRTVAYTEDQKVPSCPAVLLDDMSCDDT 704
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
EF+V NVG DGS+V++VYS PP+ I T+IKQVIGFQ+VFV AG +R+KF NAC
Sbjct: 705 IEFEVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMNAC 764
Query: 755 KSLNIVDYAANTLLPAGEHTIFVGN--GGVSFPIHLNFNY 792
KSL IVD +LLP+G HTI VG+ S+ + +N++Y
Sbjct: 765 KSLRIVDSTGYSLLPSGSHTIRVGDYSNSASYSLQVNYHY 804
>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 1011 bits (2613), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 493/809 (60%), Positives = 602/809 (74%), Gaps = 54/809 (6%)
Query: 1 MAKVVSSLLCFSLSI---ALLVFST-------------NAVDANGSSSPVFVCDPGRFSK 44
MAK + L FSLSI A L ST A D G+ + +VCD RF+
Sbjct: 1 MAKSFTRLF-FSLSILAIAFLAVSTARYTPRPNSRFLSQAFDVPGNYT--YVCDASRFAA 57
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
LGL M F++CDSSLPY +RVKDLV R+TL+EK + + D A GVPR+GLP Y+WWSEALH
Sbjct: 58 LGLDMKDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALH 117
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
GV+NVG T FD+V+PGATSFP VIL+ ASFN+SLWK +GQ VSTEARAMYNLG AGLT+
Sbjct: 118 GVANVGSATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTF 177
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPNINVARDPRWGRI ETPGEDP VG Y VNYVRGLQD+EG EN TDLNSRPLK++S
Sbjct: 178 WSPNINVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASS 237
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
CKH+AAYD+D W VDR HFDA+V+EQDM ETFLRPFEMCVKEGD SSVMCS+N +NGIP
Sbjct: 238 CKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIP 297
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
CADP+ L +R +W+LHGYIV+DC +I +V + KFL + E+ VA ++KAGLDL+CG
Sbjct: 298 PCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECG 357
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIEL 404
YY + AV++G+V E D+DKSL YLY VLMR+GFFDG P SLGK+DIC+DE+IEL
Sbjct: 358 HYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKKDICNDEHIEL 417
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
A EAAR+GIVLLKND TLPL VK +A+VGPHANATVAMIGNYAGIPC Y+SP+ F
Sbjct: 418 AREAARQGIVLLKNDNATLPLK--PVKKLALVGPHANATVAMIGNYAGIPCHYVSPLDAF 475
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
S +VTY+ GC DV C ++ ++ A+EAAK ADATIIL G DLS+EAE DREDL LPG
Sbjct: 476 SELGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPG 535
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
YQT+++NQV +++ GPVILV+M G +DI+FA+ N I AILWAG+PGE+GG AIAD+VF
Sbjct: 536 YQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVF 595
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
GK+NPGGR PITWY YV MLP+TSM LRP++SLGYPGRTYKF+NG T+YPFGYGLSYT
Sbjct: 596 GKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGSTVYPFGYGLSYT 655
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F Y+L + T+++ ++L FEF+V +NV
Sbjct: 656 NFSYSLTAPTRSVHISLTS-------------------------------FEFQVAVKNV 684
Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
GS DGS+VV+VYS PP+ I T+IKQVIGF+RVFV+ G +++KF N CKSL +VD +
Sbjct: 685 GSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMNVCKSLGLVDSSG 744
Query: 765 NTLLPAGEHTIFVGNG--GVSFPIHLNFN 791
LLP+G HTI G+ VSFP +N++
Sbjct: 745 YILLPSGSHTIMAGDNSTSVSFPFQVNYH 773
>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
Length = 746
Score = 967 bits (2499), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/810 (59%), Positives = 565/810 (69%), Gaps = 86/810 (10%)
Query: 1 MAKVVSSLLCFSLSIALLVFST--NAVDANGSSSP---------------VFVCDPGRFS 43
M K++ SL FSLSI + F +A+ + P +VCD RF+
Sbjct: 1 MGKLLRSLF-FSLSIVWIAFFAVCSAIKSPLKDGPAAAPMAARGPIDGNYTYVCDESRFA 59
Query: 44 KLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEAL 103
LGL M F +CDSS PY +R KDLV RMTL EKV Q GD A GV R+GLP+Y WWSEAL
Sbjct: 60 ALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEAL 119
Query: 104 HGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
HGVSN G FD+V+PGATSFPTVIL+ ASFN+SLWK +GQAVSTEARAMYN G AGLT
Sbjct: 120 HGVSNFGRCVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLT 179
Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
+WSPNINV RDPRWGRI ETPGEDP +VG YAVNY
Sbjct: 180 FWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNY------------------------- 214
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
HYAAYD+DNWKG DR HFDARV+ QDM ETF+ PFEMCVKEGD SSVMCSYN++NGI
Sbjct: 215 ---HYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCSYNKINGI 271
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
PSCAD +LL QT+RGEWDLHGYIV+DCDS++VM + K+L S D+ AQ L AG++LDC
Sbjct: 272 PSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLDC 331
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
G + AV QGK + D+D SL+YLY +LMR+GFFDG P + SLGK DICS E+IE
Sbjct: 332 GTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPAFASLGKDDICSAEHIE 391
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
LA EAAR+GIVLLKND TLPL S VK +A+VGPHANAT AMIGNYAGIPC Y+SP+
Sbjct: 392 LAREAARQGIVLLKNDNATLPLKS--VKNIALVGPHANATDAMIGNYAGIPCYYVSPLDA 449
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
FS V Y+ GC DV C + IF A EAAK ADATII AG DLS+EAE+LDR DL LP
Sbjct: 450 FSSMGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAEALDRVDLLLP 509
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
GYQTQLINQVA+++ GPV+LVIMS GGVDI+FA N I AILWAGYPGE+GG AIADV+
Sbjct: 510 GYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGEQGGNAIADVI 569
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GK+NPGGRLPITWY DYV MLP+TSM LRPVDSLGYPGRTYKF+NG T+YPFGYG+SY
Sbjct: 570 LGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGSTVYPFGYGMSY 629
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T F Y+L T Q C + EF+V +N
Sbjct: 630 TNFSYSL----STSQ-------------------------------SCKESIEFEVAVKN 654
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
VG DGS+VV+VYS PP IA T+IK+V+GF+RVFV+ G +++KF N CKSL IVD
Sbjct: 655 VGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMNVCKSLGIVDST 714
Query: 764 ANTLLPAGEHTIFVG---NGGVSFPIHLNF 790
LLP+G HTI VG V+FP H+N+
Sbjct: 715 GYALLPSGSHTIKVGGDNTTSVAFPFHVNY 744
>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
vinifera]
Length = 781
Score = 940 bits (2430), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/784 (60%), Positives = 580/784 (73%), Gaps = 16/784 (2%)
Query: 19 VFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLP-YSIRVKDLVSRMTLDEK 77
+F + D G+ S VCDP RF+ LG M F++C+SSLP Y +RVKDLV RMTL+EK
Sbjct: 1 MFLSEGFDVPGNYS--HVCDPARFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEK 58
Query: 78 VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV---GPGTHFDDVIPGATSFPTVILTTAS 134
+ A GV R+GLP Y+WWSEALHGVS+V GP T FD+ +PGATSFP VIL+ AS
Sbjct: 59 ATNVIYKAAGVERIGLPPYQWWSEALHGVSSVSINGP-TFFDETVPGATSFPNVILSAAS 117
Query: 135 FNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRY 194
FN+SLWK I Q VS EARA YNLG AGLT+W PN+NVARDPRWGR ET GEDPF V Y
Sbjct: 118 FNQSLWKTIRQVVSKEARATYNLGHAGLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVY 177
Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
AV+YVRGLQDVEG EN TDLNSRPLKVSS KH+AAYD+DNW VDR HF+ARV+EQDM
Sbjct: 178 AVSYVRGLQDVEGTENTTDLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDMA 237
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
ETFLRPFE CV+EGD S VMCS+N +NGIP CADP+L T+R EW+LHGYIV+DC SI+
Sbjct: 238 ETFLRPFEACVREGDVSGVMCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSIE 297
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
+V++ KFL + E+AVA LKAGLDL+CG YY + +AV G+V + D+D+SL LY
Sbjct: 298 TIVEDQKFLDVTGEEAVALNLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLYV 357
Query: 375 VLMRLGFFDGSPQYVSLGKQDIC-SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
VLMRLGFFDG P SLGK DIC S E+IELA EAAR+GIVLLKND TLPL S VK +
Sbjct: 358 VLMRLGFFDGIPALASLGKDDICLSAEHIELAREAARQGIVLLKNDNATLPLKS--VKNL 415
Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEA 493
A+VGP+A+A AM+GNYAG PCR +SP FS NVTY+ GC DV C ++ ++ A EA
Sbjct: 416 ALVGPNADAYGAMMGNYAGPPCRSVSPRDAFSAIGNVTYEMGCGDVLCHNDTYVYKAVEA 475
Query: 494 AKTADATIILAGL-DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS--AGG 550
AK AD TII+ G+ D+S+ E DR DL LPGYQT L+NQ+A+ P+ILV+ G
Sbjct: 476 AKHADTTIIVVGITDVSIGTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCGHCGGP 535
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
+DI+FA N I+ ILWAG+PGEEGG AIADVV+GK+NPGGRLP+TWY YV MLP+TS
Sbjct: 536 IDISFARDNPGIEPILWAGFPGEEGGNAIADVVYGKYNPGGRLPVTWYENGYVGMLPMTS 595
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
M LR V+SLGYPGR YKF++G T+YPFG GLSYT F Y+L + T++I +L KLQ CR++
Sbjct: 596 MALRSVESLGYPGRKYKFFSGSTVYPFGCGLSYTNFSYSLTAPTRSIHTHLKKLQPCRSM 655
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
Y+ + +CP VLV+DL C++ FEF+V + VGS DGS+VVIVYS PP+ I T+IKQ
Sbjct: 656 AYSICSVIPQCPAVLVDDLSCNETFEFEVAVKTVGSMDGSEVVIVYSSPPSGIVGTHIKQ 715
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG---GVSFPIH 787
VIGF+RVFV+ G +++KF N CKSL IV + +TLLP+G I G VSFP
Sbjct: 716 VIGFERVFVKVGXVEKVKFSMNVCKSLGIVHSSGHTLLPSGSDIIKAGGDNTISVSFPFQ 775
Query: 788 LNFN 791
++
Sbjct: 776 AAYH 779
>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
Length = 745
Score = 939 bits (2427), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/760 (59%), Positives = 550/760 (72%), Gaps = 64/760 (8%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
+VCD R++ LGL M SF FCD SL Y R KDLVSRMTL EKV Q A GV RLGLP
Sbjct: 47 YVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLP 106
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
+Y WWSEALHG+SN+GPG FD+ IPGATS PTVIL+TA+FN++LWK +G+ VSTE RAM
Sbjct: 107 EYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YNLG AGLT+WSPNINV RD RWGR ET GEDPF+VG +AVNYVRGLQDVEG EN
Sbjct: 167 YNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTEN---- 222
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
VSSCCKHYAAYD+D+W VDR+ FDARV+EQDM+ETF+ PFE CV+EGD SSVM
Sbjct: 223 ------VSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVM 276
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CS+N++NGIP C+DP+LL +R EWDLHGYIV+DC ++V+VDN +L DSK DAVA+T
Sbjct: 277 CSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKT 336
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ 394
L+AGLDL+CG YYT+ +V GKV + ++D++LK +Y +LMR+G+FDG P Y SLG +
Sbjct: 337 LQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGLK 396
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
DIC+ ++IELA EAAR+GIVLLKND LPL K +A+VGPHANAT MIGNYAG+P
Sbjct: 397 DICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGNYAGLP 454
Query: 455 CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
C+Y+SP+ FS NVTY TG TII G DLS+EAE
Sbjct: 455 CKYVSPLEAFSAIGNVTYATGF-----------------------TIIFVGTDLSIEAEF 491
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
+DR D LPG QT+LI QVAEV+ GPVILV++S +DI FA+ N I AILW G+PGE+
Sbjct: 492 VDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQ 551
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
GG AIADVVFGK+NPGGRLP+TWY DYV MLP++SM LRPVD LGYPGRTYKF++G T+
Sbjct: 552 GGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGSTV 611
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFGYG+SYT+F Y+L + +I ++LNK Q CR
Sbjct: 612 YPFGYGMSYTKFSYSLATSKISIDIDLNKFQKCRT------------------------- 646
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
F+V NVG DGS+V++VYS PP+ I T+IKQVIGFQ+VFV AG +R+KF NAC
Sbjct: 647 --FEVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMNAC 704
Query: 755 KSLNIVDYAANTLLPAGEHTIFVGN--GGVSFPIHLNFNY 792
KSL IVD +LLP+G HTI VG+ S+ + +N++Y
Sbjct: 705 KSLRIVDSTGYSLLPSGSHTIRVGDYSNSASYSLQVNYHY 744
>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
gi|219887469|gb|ACL54109.1| unknown [Zea mays]
gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 835
Score = 902 bits (2331), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/772 (56%), Positives = 552/772 (71%), Gaps = 17/772 (2%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
VCDP RF LGL MS F +CD+SLPY+ RV+DLV R+ L+EKV+ LGD A G PR+GLP
Sbjct: 62 VCDPARFVALGLDMSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPP 121
Query: 96 YEWWSEALHGVSNVGPG-THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
Y+WW EALHGVS+VGPG T F DV+PGATSFP VI + A+FNESLW+ IG VSTE RAM
Sbjct: 122 YKWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAM 181
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG--HENAT 212
YNLG A LTYWSPNINV RDPRWGR +ETPGEDPFVVGRYAVN+VRG+QDV+ + A
Sbjct: 182 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVDDRPYAAAA 241
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
D SRP+KVSSCCKH+AAYDVD W DR FDA+V E+DM ETF RPFEMC+++GDAS
Sbjct: 242 DPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASC 301
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
VMCSYNR+NGIP+CAD +LL++TVR +W LHGYIV+DCDS++VMV + K+L + +A A
Sbjct: 302 VMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATA 361
Query: 333 QTLKAGLDLDCGQY-------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
+KAGLDLDCG + +T + +AV+QGK+KE D+D +L +YT LMRLGFFDG
Sbjct: 362 AAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNALSNVYTTLMRLGFFDGM 421
Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG--PHANAT 443
P++ SLG ++C+D + ELAA+AAR+G+VLLKND LPL+ K+ +V++VG H NAT
Sbjct: 422 PEFESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPLDPNKINSVSLVGLLEHINAT 481
Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
M+G+Y G PCR ++P N TY CD AC + + AS AK ADATI++
Sbjct: 482 DVMLGDYRGKPCRIVTPYNAIRNMVNATYVHACDSGACNTAEGMGRASSTAKIADATIVI 541
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
AGL++SVE ES DREDL LP Q+ IN VA + P++LVIMSAGGVD++FA NT I
Sbjct: 542 AGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIVLVIMSAGGVDVSFAHNNTKIG 601
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AI+WAGYPGEEGG AIADV+FGK+NPGGRLP+TW+ +YV +P+TSM LRP +LGYPG
Sbjct: 602 AIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAALGYPG 661
Query: 624 RTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR-- 680
RTYKFY GP LYPFG+GLSYT F Y + T+ +++ +HC+ L Y A
Sbjct: 662 RTYKFYGGPAVLYPFGHGLSYTNFSYASGTTGATVTIHIGAWEHCKMLTYKMGAPSPSPA 721
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
CP + V C + F + N G G VV VY+ PP E+ +KQ++ F+RVFV
Sbjct: 722 CPALNVASHMCSEVVSFSLRVANTGGVGGDHVVPVYTAPPPEVGDAPLKQLVAFRRVFVP 781
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG--VSFPIHLNF 790
AG + F N CK+ IV+ A T++P+G T+ VG+ +SFP+ +N
Sbjct: 782 AGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVVVGDDALVLSFPVTINL 833
>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
Length = 825
Score = 900 bits (2325), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/773 (56%), Positives = 553/773 (71%), Gaps = 17/773 (2%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
VCDP RF+ LGL MS F +CD+SLPY+ RV+DLV R++L+EKV+ LGD A G PR+GLP
Sbjct: 50 VCDPVRFAALGLDMSRFRYCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPP 109
Query: 96 YEWWSEALHGVSNVGPG-THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
Y+WW EALHGVS+VGPG T F DV+PGATSFP VI + A+FNESLW+ IG VSTE RAM
Sbjct: 110 YKWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAM 169
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV---EGHENA 211
YNLG A LTYWSPNINV RDPRWGR +ETPGEDPFVVGRYAVN+VRG+QDV G
Sbjct: 170 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVVIAAGAAAT 229
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
D SRP+KVSSCCKH+AAYDVD W DR FDA+V E+DM ETF RPFEMC+++GDAS
Sbjct: 230 ADPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDAS 289
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
VMCSYNR+NGIP+CAD +LL++TVR +W LHGYIV+DCDS++VMV + K+L + +A
Sbjct: 290 CVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEAT 349
Query: 332 AQTLKAGLDLDCGQY-------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
A +KAGLDLDCG + +T + +AV+QGK+KE D+D +L +YT LMRLGFFDG
Sbjct: 350 AAAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEADVDNALGNVYTTLMRLGFFDG 409
Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG--PHANA 442
P++ SLG D+C+ ++ ELAA+AAR+G+VLLKND LPL+ +K+ +V++VG H NA
Sbjct: 410 MPEFESLGADDVCTRDHKELAADAARQGMVLLKNDARRLPLDPSKINSVSLVGLLEHINA 469
Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
T M+G+Y G PCR ++P N TY CD AC + + AS AK ADATI+
Sbjct: 470 TDVMLGDYRGKPCRIVTPYDAIRQVVNATYVHACDSGACSTAEGMGRASRTAKIADATIV 529
Query: 503 LAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
+AGL++SVE ES DREDL LP Q+ IN VAE + P++LVIMSAGGVD++FA+ NT I
Sbjct: 530 IAGLNMSVERESNDREDLLLPWNQSSWINAVAEASTTPIVLVIMSAGGVDVSFAQNNTKI 589
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
AI+WAGYPGEEGG AIADV+FGK+NPGGRLP+TW+ +YV +P+TSM LRP + GYP
Sbjct: 590 GAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAAHGYP 649
Query: 623 GRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT-- 679
GRTYKFY GP LYPFG+GLSYT F Y + T+ + + +HC+ L Y S + +
Sbjct: 650 GRTYKFYGGPAVLYPFGHGLSYTSFTYASGTTGATVTIPIGAWEHCKMLTYKSGKAPSPS 709
Query: 680 -RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
CP + V RCD+ F + N G G VV VY+ PP E+ KQ++ F+RVF
Sbjct: 710 PACPALNVASHRCDEVVSFSLRVANTGGVGGDHVVPVYTAPPPEVGDAPRKQLVEFRRVF 769
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
V AG + F N CK+ IV+ A T++P+G T+ VG+ ++ + N
Sbjct: 770 VPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVIVGDDALALSFAVTIN 822
>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 810
Score = 878 bits (2269), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/763 (55%), Positives = 548/763 (71%), Gaps = 21/763 (2%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
VCDP RF+ LGL+M+ F +CD+SLPY+ RV+DLV R+TL+EKV+ LGD A G R+GLP
Sbjct: 45 VCDPARFAALGLEMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPP 104
Query: 96 YEWWSEALHGVSNVGPG-THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
Y WW EALHGVS+ GPG T F DV+PGATSFP VI + A+FNE+LW IG AVSTE RAM
Sbjct: 105 YLWWGEALHGVSDTGPGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAM 164
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE--NAT 212
YNLG A LTYWSPNINV RDPRWGR +ETPGEDPFVVGRYAV++VR +QD++G
Sbjct: 165 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQDIDGAGPGAGA 224
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
D +RP+KVSSCCKHYAAYDVD W DR FDA+V E+DM ETF RPFEMCV++GDAS
Sbjct: 225 DPFARPIKVSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIETFERPFEMCVRDGDASC 284
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
VMCSYNR+NG+P+CA+ +LL++TVRGEW LHGYIV+DCDS++VMV + K+L + +A A
Sbjct: 285 VMCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVMVRDAKWLGYNGVEATA 344
Query: 333 QTLKAGLDLDCGQY-------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
+KAGLDLDCG + +T F +AV+QGK++E+++D +L+ LY LMRLGFFDG
Sbjct: 345 AAMKAGLDLDCGMFWEGAQDFFTAFGLDAVRQGKLRESEVDNALRNLYLTLMRLGFFDGI 404
Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG--PHANAT 443
P+ SLG D+C++E+ ELAA+AAR+G+VL+KND LPL+++KV ++++VG H NAT
Sbjct: 405 PELESLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLDTSKVNSLSLVGLLQHINAT 464
Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
M+G+Y G PCR ++P + T CD AC + + KT DATI++
Sbjct: 465 DVMLGDYRGKPCRVVTPYDAIRKVVSATSMQVCDHGACST-------AANGKTVDATIVI 517
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
AGL++SVE E DREDL LP QT IN VAE + P+ILVI+SAGGVD++FA+ N I
Sbjct: 518 AGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIILVIISAGGVDVSFAQNNPKIG 577
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AI+WAGYPGEEGG AIADV+FGK+NPGGRLP+TWY +Y+ +P+TSM LRPV GYPG
Sbjct: 578 AIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKSEYISKIPMTSMALRPVADKGYPG 637
Query: 624 RTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT-SDASKTRC 681
RTYKFY GP LYPFG+GLSY+ F Y + ++ V + + C+ L + C
Sbjct: 638 RTYKFYGGPEVLYPFGHGLSYSNFSYASDTTGASVTVRVGAWESCKQLTRKPGTTAPLAC 697
Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRA 741
P V V C + F + N GS DG+ VV+VY+ PPAE+ +KQ++ F+RVFV A
Sbjct: 698 PAVNVAGHGCKEEVSFSLTVANRGSRDGAHVVMVYTVPPAEVDDAPLKQLVAFRRVFVPA 757
Query: 742 GRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSF 784
G ++ F N CK+ IV+ A T++P+G T+ VG+ +SF
Sbjct: 758 GAAVQVPFTLNVCKAFAIVEETAYTVVPSGVSTVLVGDDALSF 800
>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
Length = 821
Score = 865 bits (2234), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/777 (54%), Positives = 547/777 (70%), Gaps = 24/777 (3%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP-RLGLP 94
VCDP RF+ LGL M+ F +CD+SLPY+ RV+DLV R+TL+EKV LGD A G R+GLP
Sbjct: 45 VCDPARFASLGLDMAGFRYCDASLPYAERVRDLVGRLTLEEKVANLGDQAKGAEQRVGLP 104
Query: 95 QYEWWSEALHGVSNVGPG-THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
+Y WW EALHGVS+ PG T F DV+PGATSFP V+ + A+FNE+LW+ IG A STE RA
Sbjct: 105 RYMWWGEALHGVSDTNPGGTRFGDVVPGATSFPLVLNSAAAFNETLWRAIGGATSTEIRA 164
Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
MYNLG A LTYWSPNINV RDPRWGR +ETPGEDPF+VGR+AV++VR +QD++ NA
Sbjct: 165 MYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFLVGRFAVSFVRAMQDIDDGANAGA 224
Query: 214 LNSRP----LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ P LKVSSCCKHYAAYDVD W G DR FDA V E+DM ETF RPFEMCV++GD
Sbjct: 225 GAADPFARRLKVSSCCKHYAAYDVDKWFGADRLSFDANVQERDMVETFERPFEMCVRDGD 284
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
AS VMCSYNR+NG+P+CA+ +LL TVR +W LHGYIV+DCDS++VMV + K+L
Sbjct: 285 ASCVMCSYNRINGVPACANGRLLTGTVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYDGVQ 344
Query: 330 AVAQTLKAGLDLDCGQY-------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
A A +KAGLDLDCG + +T + AV+QGK+KE ++D++L +LY LMRLGFF
Sbjct: 345 ATAAAMKAGLDLDCGMFWEGAKDFFTAYGLQAVRQGKLKEAEVDEALGHLYLTLMRLGFF 404
Query: 383 DGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG--PHA 440
DGSP++ SLG D+C++E+ E+AAEAAR+G+VLLKND + LPL++ KV ++A+VG H
Sbjct: 405 DGSPEFQSLGASDVCTEEHKEMAAEAARQGMVLLKNDHDRLPLDANKVNSLALVGLLQHI 464
Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
NAT M+G+Y G PCR ++P + T CD AC + A+ AAKT DAT
Sbjct: 465 NATDVMLGDYRGKPCRVVTPYEAIRKVVSGTSMQACDKGAC--GTTALGAAIAAKTVDAT 522
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
I++ GL++SVE E DREDL LP QTQ IN VAE ++ P+ LVI+SAGGVDI+FA+ N
Sbjct: 523 IVITGLNMSVEREGNDREDLLLPWDQTQWINAVAEASRDPITLVIISAGGVDISFAQNNP 582
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
I AILWAGYPGEEGG IADV+FGK+NPGGRLP+TWY +Y+ LP+TSM LRPV G
Sbjct: 583 KIGAILWAGYPGEEGGTGIADVLFGKYNPGGRLPLTWYKNEYIGKLPMTSMALRPVADKG 642
Query: 621 YPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL--QHCRNLNYT--SD 675
YPGRTYKFY+GP LYPFG+GLSYT F Y+ + ++ V + C+NL Y +
Sbjct: 643 YPGRTYKFYSGPDVLYPFGHGLSYTNFTYDSYTTGASVTVKIGTAWEDSCKNLTYKPGTT 702
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
AS CP + V C + F + N G GS VV VY+ PPAE+ +KQ++ F+
Sbjct: 703 ASTAPCPAINVAGHGCQEEVSFTLKVSNTGGIGGSHVVPVYTAPPAEVDDAPLKQLVAFR 762
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGV--SFPIHLNF 790
R+FV AG + F + CK+ IV+ A T++PAG + VG+ + SFP+ ++
Sbjct: 763 RMFVPAGDAVEVPFTLSVCKAFAIVEGTAYTVVPAGVSRVLVGDESLSFSFPVKIDL 819
>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
ARA-I [Oryza sativa Japonica Group]
Length = 818
Score = 860 bits (2222), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/776 (55%), Positives = 545/776 (70%), Gaps = 26/776 (3%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
VCDP RF+ GL M+ F +CD+SLPY+ RV+DLV RMTL+EKV LGD A G PR+GLP+
Sbjct: 46 VCDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPR 105
Query: 96 YEWWSEALHGVSNVGPG-THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
Y WW EALHGVS+VGPG T F D +PGATSFP VI + ASFNE+LW+ IG VSTE RAM
Sbjct: 106 YLWWGEALHGVSDVGPGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAM 165
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YNLG A LTYWSPNINV RDPRWGR +ETPGEDPFVVGRYAVN+VRG+QD++G A
Sbjct: 166 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASA 225
Query: 215 N------SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
SRP+KVSSCCKHYAAYDVD W G DR FDARV E+DM ETF RPFEMC+++G
Sbjct: 226 AAATDAFSRPIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETFERPFEMCIRDG 285
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
DAS VMCSYNR+NG+P+CAD +LL +TVR +W LHGYIV+DCDS++VMV + K+L +
Sbjct: 286 DASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGV 345
Query: 329 DAVAQTLKAGLDLDCGQ-------YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
+A A +KAGLDLDCG ++T + +AV+QGK+KE+ +D +L LY LMRLGF
Sbjct: 346 EATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGF 405
Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG--PH 439
FDG P+ SLG D+C++E+ ELAA+AAR+G+VLLKND LPL+ KV +VA+ G H
Sbjct: 406 FDGIPELESLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQH 465
Query: 440 ANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
NAT M+G+Y G PCR ++P G + T CD +C + A+ AAKT DA
Sbjct: 466 INATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSCDT------AAAAAKTVDA 519
Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
TI++AGL++SVE ES DREDL LP Q IN VAE + P++LVIMSAGGVD++FA+ N
Sbjct: 520 TIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDN 579
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
I A++WAGYPGEEGG AIADV+FGK+NPGGRLP+TWY +YV +P+TSM LRP
Sbjct: 580 PKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEH 639
Query: 620 GYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD-AS 677
GYPGRTYKFY G LYPFG+GLSYT F Y + + V + ++C+ L Y + +S
Sbjct: 640 GYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVSS 699
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
CP V V C + F V N G DG+ VV +Y+ PPAE+ KQ++ F+RV
Sbjct: 700 PPACPAVNVASHACQEEVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRV 759
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG--VSFPIHLNFN 791
V AG + F N CK+ IV+ A T++P+G + VG+ +SFP+ ++
Sbjct: 760 RVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPVQIDLQ 815
>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 807
Score = 853 bits (2205), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/814 (51%), Positives = 552/814 (67%), Gaps = 60/814 (7%)
Query: 13 LSIALLVFSTNAVDANGSSSPVF---VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
LS A V ++ D GS+ VCD RF+ GL MS + +CD+ LPY RV+DL+
Sbjct: 18 LSTARAVLPSSNDDDGGSAKTAAYTKVCDASRFAAAGLDMSRYRYCDAKLPYGDRVRDLI 77
Query: 70 SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV----------- 118
MT++EKV LGD+A G PR+GLP Y+WWSEALHG+S+ GP T FDD+
Sbjct: 78 GWMTVEEKVSNLGDWAAGAPRVGLPPYKWWSEALHGLSSTGPTTKFDDLKKPRLHSGRAA 137
Query: 119 IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWG 178
+ T F VI + ASFNESLW+ IGQA+STEARAMYNLG+ GLTYWSPNINV RDPRWG
Sbjct: 138 VFNGTVFANVINSAASFNESLWRSIGQAISTEARAMYNLGKGGLTYWSPNINVVRDPRWG 197
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN----SRPLKVSSCCKHYAAYDVD 234
R ETPGEDPFVVGRYAVN+VRG+QDV+ + A N SRPLK S+CCKHYAAYDVD
Sbjct: 198 RALETPGEDPFVVGRYAVNFVRGMQDVD--DAAAGFNGDPLSRPLKTSACCKHYAAYDVD 255
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+W G R+ FDARVTE+DM ETF RPFEMCV++GDAS+VMCSYNRVNGIP+CAD +LL
Sbjct: 256 DWYGHTRFKFDARVTERDMVETFQRPFEMCVRDGDASAVMCSYNRVNGIPACADARLLAG 315
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ--------- 345
T+R +W LHGYIV+DCD+++VM DN +L + +A A +LKAGLDLDCG+
Sbjct: 316 TLRRDWGLHGYIVSDCDAVRVMTDNATWLGYTPAEASAASLKAGLDLDCGESWIVQKGKP 375
Query: 346 ---YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
+ + + AV+QGK++E+DID +L LYT LMRLG+FDG P+Y SL ++DICS+ +
Sbjct: 376 VMDFLSTYGMAAVRQGKMRESDIDNALVNLYTTLMRLGYFDGMPRYESLDEKDICSEAHR 435
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA-TVAMIGNYAGIPCRYMSPI 461
LA + AR+ +VLLKN LPL+++K+ +VAV GPHA A M G+Y G PCRY++P
Sbjct: 436 SLALDGARQSMVLLKNLDGLLPLDASKLASVAVRGPHAEAPEKVMDGDYTGPPCRYITPR 495
Query: 462 AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
G S N++ + G D TI + G+++ +E E DREDL
Sbjct: 496 EGISKDVNISQQGG----------------------DVTIYMGGINMHIEREGNDREDLL 533
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
LP QT+ I +VA + P++LVI+S GG+D++FA+++ I AILWAGYPG EGG AIAD
Sbjct: 534 LPKNQTEEILRVAAASPSPIVLVILSGGGIDVSFAQSHPKIGAILWAGYPGGEGGHAIAD 593
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP-TLYPFGYG 640
V+FG++NPGGRLP+TW+ Y+ LP+TSM LRP GYPGRTYKFY+GP LYPFGYG
Sbjct: 594 VIFGRYNPGGRLPLTWFKNKYIHQLPMTSMALRPRPEHGYPGRTYKFYDGPDVLYPFGYG 653
Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
LSYT+F+Y LL+ + + + +HCR L+Y + + CP V V C + F V
Sbjct: 654 LSYTKFRYELLNKETAVTLAPGR-RHCRQLSYKTGSVGPDCPAVDVASHACAETVSFNVS 712
Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
N G DG++ V+VY+ PPAE+A IKQV F+RV V+AG + + F N CK+ IV
Sbjct: 713 VVNAGKADGANAVLVYTAPPAELAGAPIKQVAAFRRVAVKAGAAETVVFTLNVCKAFGIV 772
Query: 761 DYAANTLLPAGEHTIFVGNG---GVSFPIHLNFN 791
+ A T++P+G T+ V NG VSFP+ ++F+
Sbjct: 773 EKTAYTVVPSGVSTVIVENGDSSAVSFPVQISFS 806
>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
Length = 809
Score = 838 bits (2165), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/789 (52%), Positives = 533/789 (67%), Gaps = 58/789 (7%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
VCD RF+++GL MS+F +CD+SLPY+ RV+DL+ MT++EKV LGD +HG PR+GLP
Sbjct: 45 VCDADRFAEMGLNMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDVSHGAPRVGLPP 104
Query: 96 YEWWSEALHGVSNVGPGTHFDDV--IPG----------ATSFPTVILTTASFNESLWKKI 143
Y+WWSEALHGVS+ GP FDD+ PG AT F VI + ASFNE+LWK I
Sbjct: 105 YKWWSEALHGVSSTGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWKSI 164
Query: 144 GQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
GQAVSTEARAMYNLG+ GLTYWSPNINV RDPRWGR ETPGEDPFV GRYAVN+VRG+Q
Sbjct: 165 GQAVSTEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPFVAGRYAVNFVRGMQ 224
Query: 204 DVEGHENA-TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
D+ GH+ D ++RP+K S+CCKHYAAYDVD+W R+ FDARV+E+DM ETFLRPFE
Sbjct: 225 DIPGHDGGGDDPSTRPIKTSACCKHYAAYDVDDWHNHTRFTFDARVSERDMAETFLRPFE 284
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
MCV++GDAS VMCSYNRVNGIP+CAD +LL+ T+RG+W LHGYIV+DCD+++VM DN +
Sbjct: 285 MCVRDGDASGVMCSYNRVNGIPACADARLLSGTIRGDWQLHGYIVSDCDAVRVMTDNATW 344
Query: 323 LADSKEDAVAQTLKAGLDLDCGQYYTNFTGN------------AVQQGKVKETDIDKSLK 370
L + ++ A +++AGLDLDC + + G AV QGK++E+DID +L+
Sbjct: 345 LHFTGAESSAASIRAGLDLDCAESWIEEKGRPLRDFLSEYGKAAVAQGKMRESDIDSALR 404
Query: 371 YLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKV 430
Y LMRLG+FD P+Y SL + DIC+DE+ LA + AR+G+VLLKND LPL+ K+
Sbjct: 405 NQYMTLMRLGYFDNIPRYASLNETDICTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKI 464
Query: 431 KTVAVVGPHANATVAMI-GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
VAV GPHA A ++ G+Y G PCRY++P G S ++++
Sbjct: 465 LAVAVHGPHARAPEKIMDGDYTGPPCRYVTPRQGISKDVKISHR---------------- 508
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A+ TI L G++L +E E DREDL LP QT+ I A+ + P+ILVI+S G
Sbjct: 509 -------ANTTIYLGGINLHIEREGNDREDLLLPKNQTEEILHFAKASPNPIILVILSGG 561
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
G+DI+FA + I AILWAGYPG EGG AIADV+FG++NPGGRLP+TW+ Y+Q +P+T
Sbjct: 562 GIDISFAHKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIQQIPMT 621
Query: 610 SMPLRPVDSLGYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL-QHC 667
SM RPV GYPGRTYKFY+GP LYPFGYGLSYT+F Y + T V L HC
Sbjct: 622 SMEFRPVPEKGYPGRTYKFYDGPEVLYPFGYGLSYTKFLYE--TSTNGTAVTLPATGGHC 679
Query: 668 RNLNYT-SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
+ L+Y S A+ C V V C + F + N G G+ VV+VY+ PP E+A
Sbjct: 680 KGLSYKPSVATTPACQAVDVAGHACTETVSFNISVTNAGGRGGAHVVLVYTAPPPEVAQA 739
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG----GV 782
IKQV F+RVFV A + F N CK+ IV+ A T++P+G + V NG V
Sbjct: 740 PIKQVAAFRRVFVPARSTATVPFTLNVCKAFGIVERTAYTVVPSGVSKVLVQNGDSSSSV 799
Query: 783 SFPIHLNFN 791
SFP+ ++F+
Sbjct: 800 SFPVKIDFS 808
>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 814
Score = 836 bits (2160), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/788 (52%), Positives = 535/788 (67%), Gaps = 58/788 (7%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
VCD RF+++GL MS+F +CD+SLPY+ RV+DL+ MT++EKV LGD +HG PR+GLP
Sbjct: 52 VCDAERFAEMGLNMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPP 111
Query: 96 YEWWSEALHGVSNVGPGTHFDDV--IPG----------ATSFPTVILTTASFNESLWKKI 143
Y+WWSEALHGVS+ GP FDD+ PG AT F VI + ASFNE+LW I
Sbjct: 112 YKWWSEALHGVSSTGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSI 171
Query: 144 GQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
GQAVSTEARAMYNLG+ GLTYWSPNINV RDPRWGR ETPGEDP+V GRYAVN+VRG+Q
Sbjct: 172 GQAVSTEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQ 231
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
D+ GH + D ++RP+K S+CCKH+AAYDVDNW R+ +DARV+E+DM ETFLRPFEM
Sbjct: 232 DIPGHYSG-DPSARPIKTSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETFLRPFEM 290
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
CV+EGD SSVMCSYNRVNG+P+CAD +LL+ TVRGEW L+GYIV+DCD+++VM DN +L
Sbjct: 291 CVREGDVSSVMCSYNRVNGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMTDNATWL 350
Query: 324 ADSKEDAVAQTLKAGLDLDCGQ------------YYTNFTGNAVQQGKVKETDIDKSLKY 371
+ ++ A +L+AG+DLDC + Y + + AV QGK++E+DID +L
Sbjct: 351 NFTAAESSAVSLRAGMDLDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDIDNALTN 410
Query: 372 LYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
LY LMRLG+FD P+Y SL + D+C+DE+ LA + AR+GIVLLKND LPL+ K
Sbjct: 411 LYMTLMRLGYFDNIPRYASLNETDVCTDEHKSLALDGARQGIVLLKNDHGLLPLDPKKTL 470
Query: 432 TVAVVGPHANATVAMI-GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAA 490
VAV GPHA A ++ G+Y G PCRY++P G S +++K
Sbjct: 471 AVAVHGPHARAPEKIMDGDYTGPPCRYVTPRQGISRDVKISHK----------------- 513
Query: 491 SEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
A TI L G++L +E E DREDL LP QT+ I A+ + P+ILVI+S GG
Sbjct: 514 ------AKMTIYLGGINLYIEREGNDREDLLLPKNQTEEILHFAQASPTPIILVILSGGG 567
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
+DI+FA+ + I AILWAGYPG EGG AIADV+FG++NPGGRLP+TW+ Y++ +P+TS
Sbjct: 568 IDISFAQKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIEQIPMTS 627
Query: 611 MPLRPVDSLGYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL-QHCR 668
M RPV GYPGRTYKFY+GP LYPFGYGLSYT+F+Y + T + V+L HC+
Sbjct: 628 MEFRPVPEKGYPGRTYKFYDGPEVLYPFGYGLSYTKFQYE--TSTDGVSVSLPAPGGHCK 685
Query: 669 NLNYT-SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY 727
L+Y S A+ C V V D C + F V N G G+ VV+VY+ PP E+A
Sbjct: 686 GLSYKPSVATVPACQAVNVADHACTETVSFNVSVTNAGGRGGAHVVLVYTAPPPEVAEAP 745
Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG----GVS 783
IKQV F+RVFV A + F N CK+ IV+ A T++P+G + V NG VS
Sbjct: 746 IKQVAAFRRVFVAARSTATVPFALNVCKAFGIVERTAYTVVPSGVSKVLVENGDSSSSVS 805
Query: 784 FPIHLNFN 791
FP+ ++ +
Sbjct: 806 FPVKIDLS 813
>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
Length = 816
Score = 832 bits (2150), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/789 (53%), Positives = 532/789 (67%), Gaps = 57/789 (7%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
VCD RF+ LGL M+ F +CD+SLPY+ RV+DL+ RMT++EKV LGD+ G R+GLP
Sbjct: 51 VCDATRFAGLGLNMTEFRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPA 110
Query: 96 YEWWSEALHGVSNVGPGTHFDDV-----------IPGATSFPTVILTTASFNESLWKKIG 144
Y WWSEALHG+S+ GP T FDD+ + AT F VI + ASFNE+LWK IG
Sbjct: 111 YRWWSEALHGLSSTGPTTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIG 170
Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
QAVSTEARAMYN+G+ GLTYWSPNINV RDPRWGR ETPGEDP+VVGRYAVN+VRG+QD
Sbjct: 171 QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 230
Query: 205 VEGHENAT---DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
+ GHE D N+RPLK S+CCKHYAAYD+D+W R+ FDARV E+DM ETF RPF
Sbjct: 231 IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 290
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
EMCV++GD SSVMCSYNRVNGIP+CAD +LL+QT+R +W LHGYIV+DCD+++VM DN
Sbjct: 291 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 350
Query: 322 FLADSKEDAVAQTLKAGLDLDCGQ-------------YYTNFTGNAVQQGKVKETDIDKS 368
+L + +A A LKAGLDLDCG+ + T + AV +GK++E+DID +
Sbjct: 351 WLGYTGAEASAAALKAGLDLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDNA 410
Query: 369 LKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA 428
L Y LMRLG+FD QY SLG+QDIC+D++ LA + AR+GIVLLKND LPL++
Sbjct: 411 LTNQYMTLMRLGYFDDITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDAN 470
Query: 429 KVKTVAVVGPHANATVAMI-GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
KV V V GPH A ++ G+Y G PCRY++P G S Y +++
Sbjct: 471 KVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYVRFSHR-------------- 516
Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
A+ TI GL+L++E E DRED+ LP QT+ I +VA+ + P+ILVI+S
Sbjct: 517 ---------ANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILS 567
Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
GG+D++FA+ N I AILWAGYPG EGG AIADV+FGK NP GRLP+TW+ Y+ LP
Sbjct: 568 GGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQLP 627
Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
+TSM LRPV GYPGRTYKFYNGP LYPFGYGLSYT+F Y + + + V + H
Sbjct: 628 MTSMDLRPVAKHGYPGRTYKFYNGPDVLYPFGYGLSYTKFLYEMGTNGTALTVPVAG-GH 686
Query: 667 CRNLNYTSDASKT--RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
C+ L+Y S S CP + VN C + F V N G T GS VIV+SKPPAE+
Sbjct: 687 CKKLSYKSGVSSAAPACPAINVNGHACTETVSFNVSVTNGGDTGGSHPVIVFSKPPAEVD 746
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN--GGV 782
IKQV+ F+ VFV A + F N CK+ IV+ A T++P+G T+ V N V
Sbjct: 747 DAPIKQVVAFRSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTVLVENVDSSV 806
Query: 783 SFPIHLNFN 791
SFP+ ++F+
Sbjct: 807 SFPVKISFS 815
>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
Length = 816
Score = 830 bits (2145), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/787 (53%), Positives = 531/787 (67%), Gaps = 56/787 (7%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
VCD RF+ LGL M+ F +CD+SLPY+ RV+DL+ RMT++EKV LGD+ G R+GLP
Sbjct: 52 VCDATRFAGLGLNMTEFRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPA 111
Query: 96 YEWWSEALHGVSNVGPGTHFDDV-----------IPGATSFPTVILTTASFNESLWKKIG 144
Y WWSEALHG+S+ GP T FDD+ + AT F VI + ASFNE+LWK IG
Sbjct: 112 YRWWSEALHGLSSTGPTTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIG 171
Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
QAVSTEARAMYN+G+ GLTYWSPNINV RDPRWGR ETPGEDP+VVGRYAVN+VRG+QD
Sbjct: 172 QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 231
Query: 205 VEGHENAT---DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
+ GHE D N+RPLK S+CCKHYAAYD+D+W R+ FDARV E+DM ETF RPF
Sbjct: 232 IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 291
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
EMCV++GD SSVMCSYNRVNGIP+CAD +LL+QT+R +W LHGYIV+DCD+++VM DN
Sbjct: 292 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 351
Query: 322 FLADSKEDAVAQTLKAGLDLDCGQYYTNFTG-------------NAVQQGKVKETDIDKS 368
+L + +A A LKAGLDLDCG+ + N T AV +GK++E+DID +
Sbjct: 352 WLGYTGAEASAAALKAGLDLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDNA 411
Query: 369 LKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA 428
L Y LMRLG+FD QY SLG+QDIC+D++ LA + AR+GIVLLKND LPL++
Sbjct: 412 LTNQYMTLMRLGYFDDIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDAN 471
Query: 429 KVKTVAVVGPHANATVAMI-GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
KV V V GPH A ++ G+Y G PCRY++P G S Y +++
Sbjct: 472 KVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYVRFSHR-------------- 517
Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
A+ TI GL+L++E E DRED+ LP QT+ I +VA+ + P+ILVI+S
Sbjct: 518 ---------ANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILS 568
Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
GG+D++FA+ N I AILWAGYPG EGG AIADV+FGK NP GRLP+TW+ Y+ LP
Sbjct: 569 GGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQLP 628
Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
+TSM LRPV GYPGRTYKFY+GP LYPFGYGLSYT+F Y + + + V + H
Sbjct: 629 MTSMDLRPVAKHGYPGRTYKFYDGPDVLYPFGYGLSYTKFLYEMGTNGTALIVPVAG-GH 687
Query: 667 CRNLNYTSDASKT-RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAA 725
C+ L+Y S S CP + VN C + F V N G T GS VIV+SKPPAE+
Sbjct: 688 CKKLSYKSGVSTAPACPAINVNGHVCTETVSFNVSVTNGGDTGGSHPVIVFSKPPAEVDD 747
Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN--GGVS 783
+KQV+ F+ VFV A + F N CK+ IV+ A T++P+G TI V N VS
Sbjct: 748 APMKQVVAFKSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTILVENVDSSVS 807
Query: 784 FPIHLNF 790
FP+ ++F
Sbjct: 808 FPVKIDF 814
>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
lyrata]
Length = 763
Score = 800 bits (2065), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/775 (51%), Positives = 528/775 (68%), Gaps = 33/775 (4%)
Query: 9 LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
+ F +I + S+++V S F CD + L+ FC S+P + RVKDL
Sbjct: 1 MAFLAAILFFLISSSSVCVQ--SRETFACDIKDAATATLR-----FCQLSVPITERVKDL 53
Query: 69 VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
+ R+TL EKV LG+ A +PRLG+ YEWWSEALHGVSNVGPGT F V P ATSFP V
Sbjct: 54 IGRLTLVEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQV 113
Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
I T ASFN SLW+ IG+ VS EARAMYN G GLTYWSPN+N+ RDPRWGR ETPGEDP
Sbjct: 114 ITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDP 173
Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
V G+YA +YVRGLQ G++ + LKV++CCKH+ AYD+DNW GVDR+HF+A+V
Sbjct: 174 VVAGKYAASYVRGLQ---GNDRSR------LKVAACCKHFTAYDLDNWNGVDRFHFNAKV 224
Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVA 308
++QD+E+TF PF MCVKEG+ +S+MCSYN VNG+P+CADP LL +T+R EW L+GYIV+
Sbjct: 225 SKQDIEDTFDVPFRMCVKEGNVASIMCSYNEVNGVPTCADPNLLKKTIRNEWGLNGYIVS 284
Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
DCDS+ V+ D + + E+A A ++KAGLDLDCG + T +AV++ ++E+D+D +
Sbjct: 285 DCDSVGVLYDTQHYTG-TPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNA 343
Query: 369 LKYLYTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
L TV MRLG FDG + Y LG +C+ + LA EAA++GIVLLKN ++LPL
Sbjct: 344 LINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPL 403
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
+S + +TVAV+GP+++ATVAMIGNYAGI C Y SP+ G +GYA ++ GC DV C +
Sbjct: 404 SSQRHRTVAVIGPNSDATVAMIGNYAGIACGYTSPVQGITGYARTVHQKGCVDVHCMDDR 463
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
AA EAA+ ADAT+++ GLD S+EAE DR L LPG Q +LI++VA+ AKGPVILV+
Sbjct: 464 LFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELISRVAKAAKGPVILVL 523
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
MS G +DI+FAE + I AI+WAGYPG+EGG AIAD++FG NPGG+LP+TWY DY+
Sbjct: 524 MSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTN 583
Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
LP+T M +RP+ S PGRTY+FY+GP +YPFG+GLSYT+F +++ K I + +
Sbjct: 584 LPMTEMSMRPIHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHSIADAPKVIPIAV---- 639
Query: 666 HCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
R N T R V RC+ VD NVGS DG+ ++V+S PP
Sbjct: 640 --RGRNGTVSGKSIR-----VTHARCNRLSLGVHVDVTNVGSRDGTHTMLVFSAPPGGEW 692
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
A KQ++ F+RV V G KR++ + CK L++VD A N +P G+H I +G+
Sbjct: 693 APK-KQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHIGD 746
>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
Precursor
gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
Length = 768
Score = 798 bits (2061), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/775 (50%), Positives = 527/775 (68%), Gaps = 33/775 (4%)
Query: 9 LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
+ F I + S+++V + S F CD + L+ FC S+P RV+DL
Sbjct: 6 MAFLAVILFFLISSSSVCVH--SRETFACDTKDAATATLR-----FCQLSVPIPERVRDL 58
Query: 69 VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
+ R+TL EKV LG+ A +PRLG+ YEWWSEALHGVSNVGPGT F V P ATSFP V
Sbjct: 59 IGRLTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQV 118
Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
I T ASFN SLW+ IG+ VS EARAMYN G GLTYWSPN+N+ RDPRWGR ETPGEDP
Sbjct: 119 ITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDP 178
Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
V G+YA +YVRGLQ G++ + LKV++CCKH+ AYD+DNW GVDR+HF+A+V
Sbjct: 179 VVAGKYAASYVRGLQ---GNDRSR------LKVAACCKHFTAYDLDNWNGVDRFHFNAKV 229
Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVA 308
++QD+E+TF PF MCVKEG+ +S+MCSYN+VNG+P+CADP LL +T+R +W L+GYIV+
Sbjct: 230 SKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVS 289
Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
DCDS+ V+ D + + E+A A ++KAGLDLDCG + T +AV++ ++E+D+D +
Sbjct: 290 DCDSVGVLYDTQHYTG-TPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNA 348
Query: 369 LKYLYTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
L TV MRLG FDG + Y LG +C+ + LA EAA++GIVLLKN ++LPL
Sbjct: 349 LINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPL 408
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
+S + +TVAV+GP+++ATV MIGNYAG+ C Y SP+ G +GYA ++ GC DV C +
Sbjct: 409 SSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGITGYARTIHQKGCVDVHCMDDR 468
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
AA EAA+ ADAT+++ GLD S+EAE DR L LPG Q +L+++VA+ AKGPVILV+
Sbjct: 469 LFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVL 528
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
MS G +DI+FAE + I AI+WAGYPG+EGG AIAD++FG NPGG+LP+TWY DY+
Sbjct: 529 MSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTN 588
Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
LP+T M +RPV S PGRTY+FY+GP +YPFG+GLSYT+F +N+ K I + +
Sbjct: 589 LPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVIPIAV---- 644
Query: 666 HCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
R N T R V RCD V+ NVGS DG+ ++V+S PP
Sbjct: 645 --RGRNGTVSGKSIR-----VTHARCDRLSLGVHVEVTNVGSRDGTHTMLVFSAPPGGEW 697
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
A KQ++ F+RV V G KR++ + CK L++VD A N +P G+H I +G+
Sbjct: 698 APK-KQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHIGD 751
>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 775
Score = 798 bits (2060), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/784 (50%), Positives = 517/784 (65%), Gaps = 31/784 (3%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLP 60
M+ S LL LL + +A F CDP + + FC +SL
Sbjct: 1 MSSTFSPLLNLIAVFLLLFLVRHTCEARDP----FACDPKNGA-----TENMPFCKASLA 51
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
RVKDLV R+TL EKV+ L + A VPRLG+ YEWWSEALHGVSNVGPG F+ P
Sbjct: 52 IPERVKDLVGRLTLQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVGPGVKFNAQFP 111
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
GATSFP VI T ASFN SLW+ IGQ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR
Sbjct: 112 GATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 171
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
ETPGEDP + G YA +YVRGLQ +G+ LKV++CCKH+ AYD+DNW G+D
Sbjct: 172 QETPGEDPVLAGTYAASYVRGLQGTDGNR---------LKVAACCKHFTAYDLDNWNGMD 222
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
R+HF+A+V++QD+EETF PF MCV EG +SVMCSYN+VNG+P+CADP LL +TVRG W
Sbjct: 223 RFHFNAQVSKQDIEETFDVPFRMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGLW 282
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
L GYIV+DCDS+ V DN + + E+A A +KAGLDLDCG + T NAV++G +
Sbjct: 283 QLDGYIVSDCDSVGVFYDNQHY-TPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVEKGLL 341
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLK 417
E D++ +L TV MRLG FDG P Y LG +D+C + ELA EAAR+GIVLLK
Sbjct: 342 SEADVNGALVNTLTVQMRLGMFDGEPSAHAYGKLGPKDVCKPAHQELALEAARQGIVLLK 401
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCD 477
N LPL+ + TVAV+GP++ ATV MIGNYAG+ C Y +P+ G YA ++ GC+
Sbjct: 402 NTGPVLPLSPQRHHTVAVIGPNSKATVTMIGNYAGVACGYTNPLQGIGRYAKTIHQLGCE 461
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVA 537
+VACK++ +A AA+ ADAT+++ GLD S+EAE++DR L LPG Q L+++VA +
Sbjct: 462 NVACKNDKLFGSAINAARQADATVLVMGLDQSIEAETVDRTGLLLPGRQQDLVSKVAAAS 521
Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
KGP ILVIMS G VDI FA+ N I ILWAGYPG+ GG AIAD++FG NPGG+LP+TW
Sbjct: 522 KGPTILVIMSGGSVDITFAKNNPRIVGILWAGYPGQAGGAAIADILFGTTNPGGKLPVTW 581
Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
Y +Y+ LP+T+M +R S GYPGRTY+FYNGP +YPFG+GL+YT F + L S +
Sbjct: 582 YPQEYLTKLPMTNMAMRGSKSAGYPGRTYRFYNGPVVYPFGHGLTYTHFVHTLASAPTVV 641
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVY 716
V LN + N ++ A + V RCD +VD +NVGS DG+ ++V+
Sbjct: 642 SVPLNGHRRANVTNISNRA-------IRVTHARCDKLSISLEVDIKNVGSRDGTHTLLVF 694
Query: 717 SKPPAEIAATYI-KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTI 775
S PPA + KQ++ F+++ V A +R+ + CK L++VD + +P GEH+
Sbjct: 695 SAPPAGFGHWALEKQLVAFEKIHVPAKGLQRVGVNIHVCKLLSVVDKSGIRRIPLGEHSF 754
Query: 776 FVGN 779
+G+
Sbjct: 755 NIGD 758
>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
Length = 775
Score = 797 bits (2059), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/781 (50%), Positives = 533/781 (68%), Gaps = 28/781 (3%)
Query: 3 KVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYS 62
KV S LCFS+ ++ + N V G +S VF CD + + + SS+ FCD SL
Sbjct: 10 KVSSVFLCFSIFYVAVLLNCNHV--YGQTSTVFACDVAKNTNV----SSYGFCDKSLSVE 63
Query: 63 IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
RV DLV R+TL EK+ LG+ A V RLG+P+YEWWSEALHGVSN+GPGTHF ++PGA
Sbjct: 64 DRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVPGA 123
Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
TSFP ILT ASFN SL++ IG VS EARAMYN+G AGLTYWSPNIN+ RDPRWGR E
Sbjct: 124 TSFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQE 183
Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
TPGEDP + +YA YV+GLQ + D +S LKV++CCKHY AYDVDNWKGV RY
Sbjct: 184 TPGEDPLLSSKYAAGYVKGLQQTD------DGDSDKLKVAACCKHYTAYDVDNWKGVQRY 237
Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
FDA V++QD+++TF PF+ CV +G+ +SVMCSYN+VNG P+CADP LL +RG+W L
Sbjct: 238 TFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKL 297
Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
+GYIV+DCDS++V+ + + + E+A A+T+ +GLDLDCG Y +TG AV+QG V E
Sbjct: 298 NGYIVSDCDSVEVLFKDQHY-TKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDE 356
Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKND 419
I+ ++ + LMRLGFFDG P Y +LG +D+C+ EN ELA EAAR+GIVLLKN
Sbjct: 357 ASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNS 416
Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
+LPL+S +K++AV+GP+ANAT MIGNY GIPC+Y SP+ G + + +Y GC DV
Sbjct: 417 PGSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVPTSYAPGCPDV 476
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
C +N I A++ A +ADATII+ G +L++EAESLDR ++ LPG Q QL+N+VA V+KG
Sbjct: 477 QC-ANAQIDDAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKG 535
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PVILVIMS GG+D++FA+TN I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY
Sbjct: 536 PVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYP 595
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
YV+ +P+T+M +R + GYPGRTY+FY G T++ FG G+S+ ++ ++ + + V
Sbjct: 596 QSYVEKIPMTNMNMRSDPATGYPGRTYRFYKGETVFSFGDGMSFGTVEHKIVKAPQLVSV 655
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSK 718
L + CR+L C + V D C + F+ + +N+G S V+++
Sbjct: 656 PLAEDHECRSL---------ECKSLDVADEHCQNLAFDIHLSVKNMGKMSSSHSVLLFFT 706
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
PP + K ++GF++V + ++F + C L++VD N +P G+H + VG
Sbjct: 707 PP-NVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVG 765
Query: 779 N 779
N
Sbjct: 766 N 766
>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
Length = 763
Score = 796 bits (2057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/775 (50%), Positives = 527/775 (68%), Gaps = 33/775 (4%)
Query: 9 LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
+ F I + S+++V + S F CD + L+ FC S+P RV+DL
Sbjct: 1 MAFLAVILFFLISSSSVCVH--SRETFACDTKDAATATLR-----FCQLSVPIPERVRDL 53
Query: 69 VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
+ R+TL EKV LG+ A +PRLG+ YEWWSEALHGVSNVGPGT F V P ATSFP V
Sbjct: 54 IGRLTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQV 113
Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
I T ASFN SLW+ IG+ VS EARAMYN G GLTYWSPN+N+ RDPRWGR ETPGEDP
Sbjct: 114 ITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDP 173
Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
V G+YA +YVRGLQ G++ + LKV++CCKH+ AYD+DNW GVDR+HF+A+V
Sbjct: 174 VVAGKYAASYVRGLQ---GNDRSR------LKVAACCKHFTAYDLDNWNGVDRFHFNAKV 224
Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVA 308
++QD+E+TF PF MCVKEG+ +S+MCSYN+VNG+P+CADP LL +T+R +W L+GYIV+
Sbjct: 225 SKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVS 284
Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
DCDS+ V+ D + + E+A A ++KAGLDLDCG + T +AV++ ++E+D+D +
Sbjct: 285 DCDSVGVLYDTQHYTG-TPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNA 343
Query: 369 LKYLYTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
L TV MRLG FDG + Y LG +C+ + LA EAA++GIVLLKN ++LPL
Sbjct: 344 LINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPL 403
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
+S + +TVAV+GP+++ATV MIGNYAG+ C Y SP+ G +GYA ++ GC DV C +
Sbjct: 404 SSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGITGYARTIHQKGCVDVHCMDDR 463
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
AA EAA+ ADAT+++ GLD S+EAE DR L LPG Q +L+++VA+ AKGPVILV+
Sbjct: 464 LFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVL 523
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
MS G +DI+FAE + I AI+WAGYPG+EGG AIAD++FG NPGG+LP+TWY DY+
Sbjct: 524 MSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTN 583
Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
LP+T M +RPV S PGRTY+FY+GP +YPFG+GLSYT+F +N+ K I + +
Sbjct: 584 LPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVIPIAV---- 639
Query: 666 HCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
R N T R V RCD V+ NVGS DG+ ++V+S PP
Sbjct: 640 --RGRNGTVSGKSIR-----VTHARCDRLSLGVHVEVTNVGSRDGTHTMLVFSAPPGGEW 692
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
A KQ++ F+RV V G KR++ + CK L++VD A N +P G+H I +G+
Sbjct: 693 APK-KQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHIGD 746
>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
Length = 876
Score = 796 bits (2055), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/753 (52%), Positives = 515/753 (68%), Gaps = 32/753 (4%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
+ + FC+ SL Y R KDLVSR++L EKVQQL + A GVPRLG+P YEWWSEALHGVS+V
Sbjct: 37 AKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDV 96
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
GPG HF+ +PGATSFP ILT ASFN SLW K+G+ VSTEARAM+N+G AGLTYWSPN+
Sbjct: 97 GPGVHFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLTYWSPNV 156
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
NV RDPRWGR ETPGEDP VV +YAVNYV+GLQDV H+ SR LKVSSCCKHY
Sbjct: 157 NVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDV--HDAG---KSRRLKVSSCCKHYT 211
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
AYD+DNWKG+DR+HFDA+VT+QD+E+T+ PF+ CV+EGD SSVMCSYNRVNGIP+CADP
Sbjct: 212 AYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGIPTCADP 271
Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
LL +RG+W L GYIV+DCDSIQV ++ + ++EDAVA LKAGL+++CG +
Sbjct: 272 NLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHY-TKTREDAVALALKAGLNMNCGDFLGK 330
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAA 406
+T NAV+ K+ +D+D++L Y Y VLMRLGFFDG P+ + +LG D+CS ++ LA
Sbjct: 331 YTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKDHQMLAL 390
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
EAA++GIVLL+N + LPL VK +AV+GP+ANAT MI NYAG+PC+Y SPI G
Sbjct: 391 EAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSPIQGLQK 449
Query: 467 YA--NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
Y + Y+ GC DV C I AA +A AD T+++ GLD +VEAE LDR +L LPG
Sbjct: 450 YVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRVNLTLPG 509
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
YQ +L+ VA AK V+LVIMSAG +DI+FA+ + I+A+LW GYPGE GG AIA V+F
Sbjct: 510 YQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDAIAQVIF 569
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G +NP GRLP TWY ++ + +T M +RP + G+PGR+Y+FY G +Y FGYGLSY+
Sbjct: 570 GDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFGYGLSYS 629
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTS--DASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
F +LS I + N + NLN T+ D S C +DL+ + +
Sbjct: 630 SFSTFVLSAPSIIHIKTNPIM---NLNKTTSVDISTVNC-----HDLK----IRIVIGVK 677
Query: 703 NVGSTDGSDVVIVYSKPPAEIAATY-----IKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
N G GS VV+V+ KPP + + Q++GF+RV V ++ F+ CK+L
Sbjct: 678 NHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFDVCKAL 737
Query: 758 NIVDYAANTLLPAGEHTIFVG-NGGVSFPIHLN 789
++VD L G H + +G N HLN
Sbjct: 738 SLVDTHGKRKLVTGHHKLVIGSNSDQQIYHHLN 770
>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
Precursor
gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
Length = 781
Score = 795 bits (2052), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/753 (52%), Positives = 515/753 (68%), Gaps = 32/753 (4%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
+ + FC+ SL Y R KDLVSR++L EKVQQL + A GVPRLG+P YEWWSEALHGVS+V
Sbjct: 37 AKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDV 96
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
GPG HF+ +PGATSFP ILT ASFN SLW K+G+ VSTEARAM+N+G AGLTYWSPN+
Sbjct: 97 GPGVHFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLTYWSPNV 156
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
NV RDPRWGR ETPGEDP VV +YAVNYV+GLQDV H+ SR LKVSSCCKHY
Sbjct: 157 NVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDV--HDAG---KSRRLKVSSCCKHYT 211
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
AYD+DNWKG+DR+HFDA+VT+QD+E+T+ PF+ CV+EGD SSVMCSYNRVNGIP+CADP
Sbjct: 212 AYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGIPTCADP 271
Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
LL +RG+W L GYIV+DCDSIQV ++ + ++EDAVA LKAGL+++CG +
Sbjct: 272 NLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHY-TKTREDAVALALKAGLNMNCGDFLGK 330
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAA 406
+T NAV+ K+ +D+D++L Y Y VLMRLGFFDG P+ + +LG D+CS ++ LA
Sbjct: 331 YTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKDHQMLAL 390
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
EAA++GIVLL+N + LPL VK +AV+GP+ANAT MI NYAG+PC+Y SPI G
Sbjct: 391 EAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSPIQGLQK 449
Query: 467 YA--NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
Y + Y+ GC DV C I AA +A AD T+++ GLD +VEAE LDR +L LPG
Sbjct: 450 YVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRVNLTLPG 509
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
YQ +L+ VA AK V+LVIMSAG +DI+FA+ + I+A+LW GYPGE GG AIA V+F
Sbjct: 510 YQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDAIAQVIF 569
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G +NP GRLP TWY ++ + +T M +RP + G+PGR+Y+FY G +Y FGYGLSY+
Sbjct: 570 GDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFGYGLSYS 629
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTS--DASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
F +LS I + N + NLN T+ D S C +DL+ + +
Sbjct: 630 SFSTFVLSAPSIIHIKTNPIM---NLNKTTSVDISTVNC-----HDLK----IRIVIGVK 677
Query: 703 NVGSTDGSDVVIVYSKPPAEIAATY-----IKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
N G GS VV+V+ KPP + + Q++GF+RV V ++ F+ CK+L
Sbjct: 678 NHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFDVCKAL 737
Query: 758 NIVDYAANTLLPAGEHTIFVG-NGGVSFPIHLN 789
++VD L G H + +G N HLN
Sbjct: 738 SLVDTHGKRKLVTGHHKLVIGSNSDQQIYHHLN 770
>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 795 bits (2052), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/790 (49%), Positives = 536/790 (67%), Gaps = 28/790 (3%)
Query: 3 KVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYS 62
KV S LCFS+ ++ + N V G +S VF CD + + +SS+ FCD+SL
Sbjct: 9 KVSSVFLCFSIFYVTVLLNCNHV--YGQTSTVFACDVAKNT----NVSSYGFCDNSLSVE 62
Query: 63 IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
RV DLV R+TL EK+ LG+ A V RLG+P+YEWWSEALHGVSN+GPGTHF ++PGA
Sbjct: 63 DRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVPGA 122
Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
T+FP ILT ASFN SL++ IG VS EARAMYN+G AGLTYWSPNIN+ RDPRWGR E
Sbjct: 123 TNFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQE 182
Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
TPGEDP + +YA YV+GLQ + D +S LKV++CCKHY AYDVDNWKGV RY
Sbjct: 183 TPGEDPLLSSKYAAGYVKGLQQTD------DGDSDKLKVAACCKHYTAYDVDNWKGVQRY 236
Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
FDA V++QD+++TF PF+ CV +G+ +SVMCSYN+VNG P+CADP LL +RG+W L
Sbjct: 237 TFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKL 296
Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
+GYIV+DCDS++V+ + + + E+A A+T+ +GLDLDCG Y +TG AV+QG V E
Sbjct: 297 NGYIVSDCDSVEVLYKDQHY-TKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDE 355
Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKND 419
I ++ + LMRLGFFDG P Y +LG +D+C+ EN ELA EAAR+GIVLLKN
Sbjct: 356 ASITNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNS 415
Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
+LPL+S +K++AV+GP+ANAT MIGNY GIPC+Y SP+ G + + +Y GC DV
Sbjct: 416 PRSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVPTSYAPGCPDV 475
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
C +N I A++ A +ADATII+ G +L++EAESLDR ++ LPG Q QL+N+VA V+KG
Sbjct: 476 QC-ANAQIDDAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKG 534
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PVILVIMS GG+D++FA+TN I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY
Sbjct: 535 PVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYP 594
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
YV+ +P+T+M +R + GYPGRTY+FY G T++ FG G+S+ ++ ++ + + V
Sbjct: 595 QSYVEKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFGDGMSFGTVEHKIVKAPQLVSV 654
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSK 718
L + CR+L C + V D C + F+ + +N+G S V+++
Sbjct: 655 PLAEDHECRSL---------ECKSLDVADKHCQNLAFDIHLSVKNMGKMSSSHSVLLFFT 705
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
PP + K ++GF++V + ++F + C L++VD N +P G+H + VG
Sbjct: 706 PP-NVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVG 764
Query: 779 NGGVSFPIHL 788
N S + +
Sbjct: 765 NLKHSLSVRI 774
>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
Length = 768
Score = 794 bits (2050), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/760 (51%), Positives = 517/760 (68%), Gaps = 28/760 (3%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CDP KLGL S FC +LP +RV+DL+ R+TL EK++ L + A VPRLG+
Sbjct: 28 FACDP----KLGL-TRSLKFCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQ 82
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPGT F PGAT+FP VI T ASFNESLW++IG+ VS EARAM
Sbjct: 83 GYEWWSEALHGVSNVGPGTKFGGAFPGATAFPQVITTAASFNESLWEEIGRVVSDEARAM 142
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN G AGLTYWSPN+NV RDPRWGR ETPGEDP V G+YA +YVRGLQ G
Sbjct: 143 YNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNNGLR----- 197
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKHY AYD+DNW GVDRYHF+ARV++QD+E+T+ PF+ CV G +SVM
Sbjct: 198 ----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKSCVVAGKVASVM 253
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG P+CADP LL T+RGEW L+GYIV+DCDS+ V+ D + A + E+A A T
Sbjct: 254 CSYNQVNGKPTCADPYLLKNTIRGEWGLNGYIVSDCDSVGVLFDTQHYTA-TPEEAAAST 312
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
++AGLDLDCG + T NAV+ G +KE D++ +L TV MRLG FDG P + +L
Sbjct: 313 IRAGLDLDCGPFLAIHTENAVKGGLLKEEDVNMALANTITVQMRLGMFDGEPSAQPFGNL 372
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ + +LA +AAR+GIVLL+N TLPL S ++TVAV+GP+++ TV MIGNYA
Sbjct: 373 GPRDVCTPAHQQLALQAARQGIVLLQNRGRTLPL-SRTLQTVAVIGPNSDVTVTMIGNYA 431
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
G+ C Y +P+ G YA + GC+DV C N AA AA+ ADATI++ GLD S+E
Sbjct: 432 GVACGYTTPLQGIRRYAKTVHHPGCNDVFCNGNQQFNAAEVAARHADATILVMGLDQSIE 491
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AE DR+ L LPGYQ +L++ VA ++GP ILV+MS G +D++FA+ + I AILW GYP
Sbjct: 492 AEFRDRKGLLLPGYQQELVSIVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWVGYP 551
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG AIADV+FG NPGG+LP+TWY +Y+ +P+T+M +R S GYPGRTY+FY G
Sbjct: 552 GQAGGAAIADVLFGTANPGGKLPMTWYPHNYLAKVPMTNMGMRADPSRGYPGRTYRFYKG 611
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
P ++PFG+G+SYT F ++L+ + + V L L RN S+A + V+ C
Sbjct: 612 PVVFPFGHGMSYTTFAHSLVQAPREVSVPLASLHVSRNTTGASNA-------IRVSHANC 664
Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ +D +N G DG+ ++V+S PP +T KQ+IGF++V + G KR+K
Sbjct: 665 EALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWSTQ-KQLIGFEKVHLVTGSQKRVKID 723
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
+ CK L++VD +P GEH +++G+ S + N
Sbjct: 724 IHVCKHLSVVDRFGIRRIPIGEHDLYIGDLKHSISLQANL 763
>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
Length = 765
Score = 793 bits (2049), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/776 (49%), Positives = 518/776 (66%), Gaps = 36/776 (4%)
Query: 9 LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
+ ++ LL+ S+ A D F CDP S ++F FC +SLP RV DL
Sbjct: 4 ILITIVFLLLLMSSEARDP-------FACDPKNTS-----TNNFPFCKASLPIPTRVNDL 51
Query: 69 VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
+ R+TL EKV L + A VPR+G+ YEWWSEALHGVSNVGPGT F P ATSFP V
Sbjct: 52 IGRLTLQEKVSMLVNNAAAVPRVGIKGYEWWSEALHGVSNVGPGTKFAGQFPAATSFPQV 111
Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
I T ASFN SLW+ IG+ S EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGEDP
Sbjct: 112 ITTVASFNASLWEAIGRVASDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP 171
Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
+ G+YA +YVRGLQ + S LKV++ CKH+ AYD+DNW GVDR+HF+A+V
Sbjct: 172 ILAGKYAASYVRGLQGTD---------SSRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 222
Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVA 308
++QDME+TF PF MCVKEG+ +SVMCSYN+VNG+P+CADP LL +T+RG+W L GYIV+
Sbjct: 223 SKQDMEDTFNVPFRMCVKEGNVASVMCSYNQVNGVPTCADPNLLKRTIRGQWHLDGYIVS 282
Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
DCDS+ V N + + + E+A A +KAGLDLDCG + T NAV++G + ETD++ +
Sbjct: 283 DCDSVGVFYTNQHYTS-TPEEAAADAIKAGLDLDCGPFLAQHTQNAVKKGLLTETDVNGA 341
Query: 369 LKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
L TV MRLG FDG P Y +LG D+C+ + ELA +AAR+GIVLLKN +LPL
Sbjct: 342 LANTLTVQMRLGMFDGEPSAQPYGNLGPTDVCTPTHQELALDAARQGIVLLKNTGPSLPL 401
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
++ +TVAV+GP++NATV MIGNYAGI C Y SP+ G YA ++ GC +VAC +
Sbjct: 402 STKNHQTVAVIGPNSNATVTMIGNYAGIACGYTSPLQGIGKYARTIHEPGCANVACNDDK 461
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
+A AA+ ADAT+++ GLD S+EAE +DR L LPG+Q L+++VA ++GP ILV+
Sbjct: 462 QFGSALNAARQADATVLVMGLDQSIEAEMVDRTGLLLPGHQQDLVSKVAAASRGPTILVL 521
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
MS G +DI FA+ + I ILWAGYPG+ GG AIAD++FG NPG +LP+TWY Y++
Sbjct: 522 MSGGPIDITFAKNDPRIMGILWAGYPGQAGGAAIADILFGTTNPGAKLPMTWYPQGYLKN 581
Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
L +T+M +RP S GYPGRTY+FYNGP +YPFGYGLSYT F + L S K + V ++ +
Sbjct: 582 LAMTNMAMRPSSSTGYPGRTYRFYNGPVVYPFGYGLSYTNFVHTLASAPKVVSVPVDGHR 641
Query: 666 HCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
+ N + + V RC +D +NVGS DG++ ++V+S PP
Sbjct: 642 RGNSSNKAA---------IRVTHARCGKLSIRLDIDVKNVGSKDGTNTLLVFSVPPTGNG 692
Query: 725 A-TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
KQ++ F++V+V A +R++ + CK L++VD + +P G H+I +G+
Sbjct: 693 HWAPQKQLVAFEKVYVPAKAQQRVRINIHVCKLLSVVDKSGTRRIPMGAHSIHIGD 748
>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 772
Score = 791 bits (2042), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/756 (50%), Positives = 510/756 (67%), Gaps = 27/756 (3%)
Query: 29 GSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
G + F CDP + L FC +SL RVKDL+ R+TL EKV L + A V
Sbjct: 22 GEARDPFACDPKNTATKNLP-----FCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAV 76
Query: 89 PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
PRLG+ YEWWSEALHGVSNVGPGT F P ATSFP VI T ASFN SLW+ IG+ S
Sbjct: 77 PRLGIKGYEWWSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVAS 136
Query: 149 TEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGEDP + G+YA +YVRGLQ +G+
Sbjct: 137 DEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDGN 196
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
LKV++ CKH+ AYD+DNW GVDR+HF+A+V++QD+E+TF PF MCVKEG
Sbjct: 197 R---------LKVAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEG 247
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
+SVMCSYN+VNG+P+CADP LL +TVRG+W L+GYIV+DCDS+ V ++ + + + E
Sbjct: 248 KVASVMCSYNQVNGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTS-TPE 306
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
+A A +KAGLDLDCG + T NAV++G + E D++ +L TV MRLG +DG P
Sbjct: 307 EAAADAIKAGLDLDCGPFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMYDGEPSS 366
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y +LG +D+C+ + ELA EAAR+GIVLLKN +LPL++ + +TVAV+GP++N T
Sbjct: 367 HPYNNLGPRDVCTQSHQELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPNSNVTFT 426
Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
MIGNYAGI C Y SP+ G Y Y+ GC +VAC + A AA+ ADAT+++ G
Sbjct: 427 MIGNYAGIACGYTSPLQGIGTYTKTIYEHGCANVACTDDKQFGRAINAAQQADATVLVMG 486
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
LD S+EAE++DR L LPG+Q L+++VA +KGP ILVIMS G VDI FA+ + I+ I
Sbjct: 487 LDQSIEAETVDRASLLLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNDPRIQGI 546
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LWAGYPG+ GG AIAD++FG NPGG+LP+TWY Y++ LP+T+M +R S GYPGRT
Sbjct: 547 LWAGYPGQAGGAAIADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRT 606
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
Y+FYNGP +YPFGYGLSYT F + L S K + + ++ +H + N + A K
Sbjct: 607 YRFYNGPVVYPFGYGLSYTHFVHTLTSAPKLVSIPVDGHRHGNSSNIANKAIK------- 659
Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAA-TYIKQVIGFQRVFVRAGR 743
V RC VD +NVGS DG ++V+S PPA KQ++ F++V + A
Sbjct: 660 VTHARCGKLSINLHVDVKNVGSKDGIHTLLVFSAPPAGNGHWAPHKQLVAFEKVHIPAKA 719
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+R++ + CK L++VD + +P G H++ +G+
Sbjct: 720 QQRVRVKIHVCKLLSVVDRSGTRRIPMGLHSLHIGD 755
>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
Length = 768
Score = 790 bits (2040), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/749 (51%), Positives = 511/749 (68%), Gaps = 29/749 (3%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CD SK G + FC LP RVKDL+ R+TL EKV L + A V RLG+
Sbjct: 28 FACD----SKDG-TTKNLPFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIK 82
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPGT F PGATSFP VI T ASFN +LW+ IG+ VS EARAM
Sbjct: 83 GYEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAM 142
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN G AGLTYWSPN+N+ RDPRWGR ETPGEDP +VG+YA +YV+GLQ +G
Sbjct: 143 YNGGAAGLTYWSPNVNILRDPRWGRGQETPGEDPLLVGKYAASYVKGLQGNDGER----- 197
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKH+ AYD+DNW GVDR+HF+A+V++QDM++TF PF MCVKEG +SVM
Sbjct: 198 ----LKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVM 253
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNGIP+CADP LL +TVR +W L+GYIV+DCDS+ V D + + + E+A A
Sbjct: 254 CSYNQVNGIPTCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHYTS-TPEEAAADA 312
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+KAGLDLDCG + T +AV++G + E D++ +L TV MRLG FDG P Y +L
Sbjct: 313 IKAGLDLDCGPFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNL 372
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ + ELA EA R+GIVLLKN +LPL+ + +TVA++GP++N TV MIGNYA
Sbjct: 373 GPKDVCTPAHQELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYA 432
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
G+ C+Y +P+ G YA ++ GC DV C ++ A +AA+ ADAT+++ GLD S+E
Sbjct: 433 GVACQYTTPLQGIGSYAKTIHQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIE 492
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AE DR L LPG Q +L+++VA +KGP ILV+MS G +D++FA+ + I AILWAGYP
Sbjct: 493 AEFRDRTGLLLPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYP 552
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG AIADV+FG NPGG+LP+TWY +Y+ LP+T M +R S GYPGRTY+FY G
Sbjct: 553 GQAGGAAIADVLFGTINPGGKLPMTWYPQEYITNLPMTEMAMRSSQSKGYPGRTYRFYQG 612
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
+YPFG+G+SYT F +N+ S + V L+ H N + + A + V +C
Sbjct: 613 KVVYPFGHGMSYTHFVHNIASAPTMVSVPLDG--HRGNTSISGKA-------IRVTHTKC 663
Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ +VD +NVGS DG+ ++VYS PPA + + KQ++ F+RV V AG +R+
Sbjct: 664 NKLSLGIQVDVKNVGSKDGTHTLLVYSAPPAGRWSPH-KQLVAFERVHVSAGTQERVGIS 722
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK L++VD + +P GEH+I +GN
Sbjct: 723 IHVCKLLSVVDRSGIRRIPIGEHSIHIGN 751
>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 775
Score = 788 bits (2036), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/750 (50%), Positives = 519/750 (69%), Gaps = 27/750 (3%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CDP + GL F FC++ +P +RV+DL++R+TL EK++ + + A VPRLG+
Sbjct: 37 FACDP----RNGL-TRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQ 91
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPGT F PGAT FP VI T ASFN+SLW++IG+ VS EARAM
Sbjct: 92 GYEWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAM 151
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN G+AGLTYWSPN+N+ RDPRWGR ETPGEDP + +YA +YV+GLQ D
Sbjct: 152 YNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQG--------DS 203
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKHY AYD+DNW GVDR+HF+A+V++QD+E+T+ PF+ CV EG +SVM
Sbjct: 204 AGNHLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVM 263
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG P+CADP LL T+RG+W L+GYIV+DCDS+ V DN + + E+A A+
Sbjct: 264 CSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQHY-TKTPEEAAAEA 322
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+KAGLDLDCG + T +A+++G + E D++ +L L +V MRLG FDG P Y +L
Sbjct: 323 IKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMFDGEPSTQPYGNL 382
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ + +LA EAARE IVLL+N N+LPL+ ++++T+ VVGP+A+ATV MIGNYA
Sbjct: 383 GPRDVCTSAHQQLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPNADATVTMIGNYA 442
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
G+ C Y +P+ G + Y ++ GC VAC+ N AA A+ ADA +++ GLD +VE
Sbjct: 443 GVACGYTTPLQGIARYVKTAHQVGCRGVACRGNELFGAAETIARQADAIVLVMGLDQTVE 502
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AE+ DR L LPG Q +L+ +VA AKGPVIL+IMS G VDI+FA+ + I AILW GYP
Sbjct: 503 AETRDRVGLLLPGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKNDPKISAILWVGYP 562
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG AIADV+FG NPGGRLP+TWY Y+ +P+T+M +RP + GYPGRTY+FY G
Sbjct: 563 GQAGGTAIADVIFGTTNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPTTGYPGRTYRFYKG 622
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
P ++PFG+GLSY++F ++L K + V + LQ N +S A K V+ C
Sbjct: 623 PVVFPFGHGLSYSRFSHSLALAPKQVSVPIMSLQALTNSTLSSKAVK-------VSHANC 675
Query: 692 DDYF--EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
DD EF VD +N GS DG+ ++++S+PP + IKQ++GF + V AG +R+K
Sbjct: 676 DDSLEMEFHVDVKNEGSMDGTHTLLIFSQPP-HGKWSQIKQLVGFHKTHVLAGSKQRVKV 734
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK L++VD +P GEH + +G+
Sbjct: 735 GVHVCKHLSVVDQFGVRRIPTGEHELHIGD 764
>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 788 bits (2036), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/756 (51%), Positives = 513/756 (67%), Gaps = 27/756 (3%)
Query: 29 GSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
G + F CDP + L FC + L RVKDL+ R+TL EKV L + A V
Sbjct: 21 GEARDPFACDPKNTATKNLP-----FCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAV 75
Query: 89 PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
PRLG+ YEWWSEALHGVSNVGPGT F P ATSFP VI T ASFN SLW+ IG+ S
Sbjct: 76 PRLGIKGYEWWSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVAS 135
Query: 149 TEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGEDP + G+YA +YVRGLQ+ +G+
Sbjct: 136 DEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQETDGN 195
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
LKV++ CKH+ AYD+DNW GVDR+HF+A+V++QD+E+TF PF MCVKEG
Sbjct: 196 R---------LKVAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEG 246
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
+SVMCSYN+VNG+P+CADP LL +TVRG+W L+GYIV+DCDS+ V ++ + + + E
Sbjct: 247 KVASVMCSYNQVNGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTS-TPE 305
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
+A A +KAGLDLDCG + T NAV++G + ETD++ +L TV MRLG +DG P
Sbjct: 306 EAAADAIKAGLDLDCGPFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMYDGEPSS 365
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y LG +D+C+ + ELA EAAR+GIVLLKN +LPL++ + TVAV+GP++N TV
Sbjct: 366 HPYGKLGPRDVCTPSHQELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPNSNVTVT 425
Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
MIGNYAGI C Y SP+ G Y ++ GC +VAC ++ A A+ ADAT+++ G
Sbjct: 426 MIGNYAGIACGYTSPLEGIGRYTKTIHELGCANVACTNDKQFGRAINVAQQADATVLVMG 485
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
LD S+EAE++DR L LPG Q L+++VA +KGP ILVIMS G VDI FA+ N I+AI
Sbjct: 486 LDQSIEAETVDRAGLLLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNNPRIQAI 545
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LWAGYPG+ GG AIAD++FG NPGG+LP+TWY Y++ LP+T+M +R S GYPGRT
Sbjct: 546 LWAGYPGQAGGAAIADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRT 605
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
Y+FYNGP +YPFGYGLSYT F + L S K + + ++ +H N +S A+K +
Sbjct: 606 YRFYNGPVVYPFGYGLSYTHFVHTLASAPKLVSIPVDGHRHG---NSSSIANKA----IK 658
Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAA-TYIKQVIGFQRVFVRAGR 743
V RC +VD +NVGS DG+ ++V+S PPA KQ++ FQ++ + +
Sbjct: 659 VTHARCGKLSISLQVDVKNVGSKDGTHTLLVFSAPPAGNGHWAPHKQLVAFQKLHIPSKA 718
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+R+ + CK L++VD + +P G H++ +G+
Sbjct: 719 QQRVNVNIHVCKLLSVVDRSGTRRVPMGLHSLHIGD 754
>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 788 bits (2036), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/769 (49%), Positives = 517/769 (67%), Gaps = 28/769 (3%)
Query: 14 SIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMT 73
S +LL+F +G + F CDP + G F FC S+ RVKDL+ R+T
Sbjct: 8 SSSLLIFLVVLAVVSGEARDPFACDPKDGANAG-----FPFCRKSIGIGERVKDLIGRLT 62
Query: 74 LDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTA 133
L+EKV+ L + A GVPRLG+ YEWWSEALHGVSNVGPGT F PGATSFP VI T A
Sbjct: 63 LEEKVRLLVNNAAGVPRLGIKGYEWWSEALHGVSNVGPGTKFSGDFPGATSFPQVITTAA 122
Query: 134 SFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGR 193
SFN SLW+ IGQ VS EARAMYN G AGLT+WSPN+N+ RDPRWGR ETPGEDP + G+
Sbjct: 123 SFNSSLWEAIGQVVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAGK 182
Query: 194 YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDM 253
YA YVRGLQ NA D LKV++CCKH+ AYD+DNW GVDR+HFDARV++Q+M
Sbjct: 183 YAARYVRGLQG-----NAGDR----LKVAACCKHFTAYDLDNWNGVDRFHFDARVSKQEM 233
Query: 254 EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI 313
E+TF PF CV EG +SVMCSYN+VNG+P+CADP LL TVR +W L+GY+V+DCDS+
Sbjct: 234 EDTFDVPFRSCVVEGKVASVMCSYNQVNGVPTCADPNLLRNTVRKQWHLNGYVVSDCDSV 293
Query: 314 QVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
V DN + ++ E+A A +KAGLDLDCG + T +A+++G V E D+D +L
Sbjct: 294 GVFYDNQHY-TNTPEEAAADAIKAGLDLDCGPFLAVHTQDAIKKGLVSEADVDSALVNTV 352
Query: 374 TVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKV 430
TV MRLG FDG P + LG +D+CS + ELA EAAR+GIVLLKN ++LPL++
Sbjct: 353 TVQMRLGMFDGEPSAQPFGDLGPKDVCSPAHQELAIEAARQGIVLLKNHGHSLPLSTRSH 412
Query: 431 KTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAA 490
+++AV+GP+++A V MIGNYAGIPC Y +P+ G Y+ ++ GC DVAC + A
Sbjct: 413 RSIAVIGPNSDANVTMIGNYAGIPCEYTTPLQGIGRYSRTIHQKGCADVACSEDQLFAGA 472
Query: 491 SEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
+AA ADAT+++ GLD S+EAE+ DR DL LPG Q +L+++VA ++GP +LV+MS G
Sbjct: 473 IDAASQADATVLVMGLDQSIEAEAKDRADLLLPGRQQELVSKVAMASRGPTVLVLMSGGP 532
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
VD++FA+ + I AI+WAGYPG+ GG AIAD++FG NPGG+LP+TWY +Y+ +P+T+
Sbjct: 533 VDVSFAKKDPRIAAIVWAGYPGQAGGAAIADILFGVANPGGKLPMTWYPQEYLSKVPMTT 592
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
M +R + S YPGRTY+FY GP +Y FG+GLSYT F + + + + L+ +
Sbjct: 593 MAMRAIPSKAYPGRTYRFYKGPVVYRFGHGLSYTNFVHTIAQAPTAVAIPLHG-----HH 647
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
N T R N L +D +NVG+ DGS ++V+SKPPA A + KQ
Sbjct: 648 NTTVSGKAIRVTHAKCNRLS----IALHLDVKNVGNKDGSHTLLVFSKPPAGHWAPH-KQ 702
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++ F++V V A +R++ + CK L++VD + +P G+H + +G+
Sbjct: 703 LVAFEKVHVAARTQQRVQINIHVCKYLSVVDRSGIRRIPMGQHGLHIGD 751
>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
Length = 774
Score = 787 bits (2033), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/778 (50%), Positives = 525/778 (67%), Gaps = 29/778 (3%)
Query: 14 SIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMT 73
S+ + +F ++ A + P F CD + +F FC ++LP RV+DL+ R+T
Sbjct: 13 SLFIFIFLFVSIQA---ARPPFACD-----QKNRAFRNFPFCQTNLPIGDRVRDLIGRLT 64
Query: 74 LDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTA 133
L EKV+ LG+ A VPRLG+ YEWWSEALHGVSNVGPGT F PGATSFP VI T A
Sbjct: 65 LQEKVKLLGNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGEFPGATSFPQVITTAA 124
Query: 134 SFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGR 193
SFN SLW++IG+ VS EARAMYN GLTYWSPN+N+ RDPRWGR ETPGEDP V
Sbjct: 125 SFNASLWEEIGRVVSDEARAMYNGEMGGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAAL 184
Query: 194 YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDM 253
YA YVRGLQ G+E+ L KV++CCKHY AYD+DNW GVDR+HF+A+VT+QD+
Sbjct: 185 YAERYVRGLQ---GNEDGDSL-----KVAACCKHYTAYDLDNWGGVDRFHFNAKVTKQDI 236
Query: 254 EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI 313
E+TF PF CVK+G +S+MCSYN+VNGIP+CADP+LL +T+RG W L+GYIV+DCDS+
Sbjct: 237 EDTFDVPFRSCVKQGKVASIMCSYNQVNGIPTCADPQLLRKTIRGGWGLNGYIVSDCDSV 296
Query: 314 QVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
V D + + + E+A A +KAGLDLDCG + + T NAV G +KE ID +L
Sbjct: 297 GVFYDTQHYTS-TPEEAAAAAIKAGLDLDCGPFLSQHTENAVHIGILKEAAIDTNLANTV 355
Query: 374 TVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKV 430
V MRLG FDG P QY LG +D+CS + ELA EAAR+GIVLLKN LPL+ +
Sbjct: 356 AVQMRLGMFDGEPSAQQYGHLGPRDVCSPAHQELAVEAARQGIVLLKNHGPALPLSPRRH 415
Query: 431 KTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAA 490
+TVAV+GP+++ TV MIGNYAG+ C Y SP+ G S YA ++ GC DVAC + A
Sbjct: 416 RTVAVIGPNSDVTVTMIGNYAGVACGYTSPLQGISKYAKTIHEKGCGDVACSDDKLFAGA 475
Query: 491 SEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
AA+ ADAT+++ GLD S+EAE DR L LPG+Q +LI++V++ ++GPV+LV+MS G
Sbjct: 476 VNAARQADATVLVMGLDQSIEAEFRDRTGLLLPGFQQELISEVSKASRGPVVLVLMSGGP 535
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
VD+ FA + I AI+WAGYPG+ GG AIADV+FG NPGG+LP+TWY +Y+ LP+T+
Sbjct: 536 VDVTFANNDPRIGAIVWAGYPGQGGGAAIADVLFGAHNPGGKLPMTWYPQEYLNNLPMTT 595
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
M +R + GYPGRTY+FY GP +YPFG+GLSYT+F + KT+ + ++ +H N
Sbjct: 596 MDMRSNLAKGYPGRTYRFYKGPLVYPFGHGLSYTKFITTIFEAPKTLAIPIDG-RHTYNS 654
Query: 671 NYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
+ S+ S + V +C + VD +NVG DGS ++V+SKPP +I + K
Sbjct: 655 STISNKS------IRVTHAKCSKISVQIHVDVKNVGPKDGSHTLLVFSKPPVDIWVPH-K 707
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIH 787
Q++ FQ+V+V A +R+ + CK L++VD A +P GEH+I +G+ S +
Sbjct: 708 QLVAFQKVYVPARSKQRVAINIHVCKYLSVVDRAGVRRIPIGEHSIHIGDAKHSLSLQ 765
>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 769
Score = 786 bits (2030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/771 (50%), Positives = 518/771 (67%), Gaps = 32/771 (4%)
Query: 13 LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
LSI +L+ +A+ S +P F CDP + + FC SL RVKDL+ R+
Sbjct: 9 LSIFILL---SAIHGRASRAP-FACDPNNSVT-----TDYPFCRRSLVVGERVKDLIGRL 59
Query: 73 TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
TL+EKV+ L A GVPRLG+ Y+WWSEALHGVSNVGPGT F P ATSFP VI T
Sbjct: 60 TLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTA 119
Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVG 192
ASFN SLW+ IG+ VS EARAMYN G GLTYWSPN+N+ RDPRWGR ETPGEDP + G
Sbjct: 120 ASFNASLWEAIGRVVSDEARAMYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAG 179
Query: 193 RYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQD 252
YAVNYVRGLQ EG+ LKV++CCKH+ AYD+DNW GVDR+HF+A+V++QD
Sbjct: 180 TYAVNYVRGLQGTEGNR---------LKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQD 230
Query: 253 MEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDS 312
+E+TF PF MCVK G SSVMCSYN+VNG+P+CADP LL T+R +W L GYIV+DCDS
Sbjct: 231 IEDTFEVPFRMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDS 290
Query: 313 IQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYL 372
+ V ++ + + + E+A A +KAGLDLDCG + T NAV++G + E+ I+ +L
Sbjct: 291 VGVFYNSQHYTS-TPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNT 349
Query: 373 YTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
+V MRLG FDG + Y LG + +CSD N +LA +AAR+GIVLL+N + +LPL++ +
Sbjct: 350 LSVQMRLGMFDGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNR 409
Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
+ VAVVGP++NAT+ MIGNYAGI C Y++P+ G S Y ++ GC VAC+SN
Sbjct: 410 HRIVAVVGPNSNATLTMIGNYAGIACEYITPLQGISKYTRTIHQEGCRGVACRSNKFFGG 469
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A EAA+ ADA +++ GLD S+EAE DR L LPG Q L+ +VA VAKGPVILV+MS G
Sbjct: 470 AIEAARVADAVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGG 529
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+D++FA+ + I I+W GYPG+ GG AIADV+FG+ NPGG+LP+TWY DYV LP+T
Sbjct: 530 PIDVSFAKDHPKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMT 589
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+M LRP S YPGRTY+FY GP +YPFG+GLSYT F + +LS T+ V + +H N
Sbjct: 590 TMSLRPGTS--YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHRHPHN 647
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
S+ V V +CD KV +N+G+ DG+ ++VYS PP +
Sbjct: 648 ------GSEFWGKAVRVTHAKCDRLSLVIKVAVRNIGARDGAHTLLVYSIPPMGVWVPQ- 700
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
KQ++ F++V + A K ++ + CK L++VD +P GEH I +G+
Sbjct: 701 KQLVAFEKVHIDAQALKEVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGD 751
>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
Length = 771
Score = 786 bits (2029), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/752 (51%), Positives = 503/752 (66%), Gaps = 31/752 (4%)
Query: 35 FVCDP--GRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
F CDP G K+ FC SL + RVKDL+ R+TL+EKV+ L + A VPRLG
Sbjct: 27 FACDPKNGGTKKMA-------FCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLG 79
Query: 93 LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
+ YEWWSEALHGVSN+GP F+ P ATSFP VI T ASFN SLW+ IGQ VS EAR
Sbjct: 80 MKGYEWWSEALHGVSNLGPAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEAR 139
Query: 153 AMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
AMYN G AGLTYWSPN+N+ RDPRWGR ETPGEDP + G YA YVRGLQ +
Sbjct: 140 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQGTHANR--- 196
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
LKV++CCKH+ AYD+DNW G+DR+HF+A+V++QD+E+TF PF+MCV EG +S
Sbjct: 197 ------LKVAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPFKMCVSEGKVAS 250
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
VMCSYN+VNG+P+CADP LL +TVRG W L GYIV+DCDS+ V DN + + E+A A
Sbjct: 251 VMCSYNQVNGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHY-TPTPEEAAA 309
Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YV 389
+KAGLDLDCG + T NAV++G + E D++ +L TV MRLG FDG P Y
Sbjct: 310 DAIKAGLDLDCGPFLAVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMFDGEPTAHPYG 369
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
LG +D+C + ELA EAAR+GIVLLKN LPL+S +TVAV+GP++ AT+ MIGN
Sbjct: 370 HLGPKDVCKPAHQELALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPNSKATITMIGN 429
Query: 450 YAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
YAG+ C Y +P+ G YA ++ GC +VACK++ A AA+ ADAT+++ GLD S
Sbjct: 430 YAGVACGYTNPLQGIGRYARTVHQLGCQNVACKNDKLFGPAINAARQADATVLVMGLDQS 489
Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
+EAE++DR L LPG Q L+++VA +KGP ILV+MS G VDI FA+ N I ILWAG
Sbjct: 490 IEAETVDRTGLLLPGRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKNNPRIVGILWAG 549
Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
YPG+ GG AIAD++FG NPGG+LP+TWY +Y+ LP+T+M +R S GYPGRTY+FY
Sbjct: 550 YPGQAGGAAIADILFGTANPGGKLPVTWYPEEYLTKLPMTNMAMRATKSAGYPGRTYRFY 609
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
NGP +YPFG+GL+YT F + L S + V LN + N ++ A + V
Sbjct: 610 NGPVVYPFGHGLTYTHFVHTLASAPTVVSVPLNGHRRANVTNISNRA-------IRVTHA 662
Query: 690 RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVFVRAGRNKRI 747
RCD +VD +NVGS DG+ ++V+S PPA + KQ++ F++V V A R+
Sbjct: 663 RCDKLSITLQVDIKNVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKVHVPAKGQHRV 722
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK L++VD + +P GEH+ +G+
Sbjct: 723 GVNIHVCKLLSVVDRSGIRRIPLGEHSFNIGD 754
>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 769
Score = 786 bits (2029), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/771 (50%), Positives = 518/771 (67%), Gaps = 32/771 (4%)
Query: 13 LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
LSI +L+ +A+ S +P F CDP + + FC SL RVKDL+ R+
Sbjct: 9 LSIFILL---SAIHGRASRAP-FACDPNNSVT-----TDYPFCRRSLVVEERVKDLIGRL 59
Query: 73 TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
TL+EKV+ L A GVPRLG+ Y+WWSEALHGVSNVGPGT F P ATSFP VI T
Sbjct: 60 TLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTA 119
Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVG 192
ASFN SLW+ IG+ VS EARAMYN G GLTYWSPN+N+ RDPRWGR ETPGEDP + G
Sbjct: 120 ASFNASLWEAIGRVVSDEARAMYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAG 179
Query: 193 RYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQD 252
YAVNYVRGLQ EG+ LKV++CCKH+ AYD+DNW GVDR+HF+A+V++QD
Sbjct: 180 TYAVNYVRGLQGTEGNR---------LKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQD 230
Query: 253 MEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDS 312
+E+TF PF MCVK G SSVMCSYN+VNG+P+CADP LL T+R +W L GYIV+DCDS
Sbjct: 231 IEDTFEVPFRMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDS 290
Query: 313 IQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYL 372
+ V ++ + + + E+A A +KAGLDLDCG + T NAV++G + E+ I+ +L
Sbjct: 291 VGVFYNSQHYTS-TPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNT 349
Query: 373 YTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
+V MRLG FDG + Y LG + +CSD N +LA +AAR+GIVLL+N + +LPL++ +
Sbjct: 350 LSVQMRLGMFDGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNR 409
Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
+ VAVVGP++NAT+ MIGNYAGI C Y++P+ G S Y ++ GC VAC+SN
Sbjct: 410 HRIVAVVGPNSNATLTMIGNYAGIACEYITPLQGISKYTRTIHQEGCRGVACRSNKFFGG 469
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A EAA+ ADA +++ GLD S+EAE DR L LPG Q L+ +VA VAKGPVILV+MS G
Sbjct: 470 AIEAARVADAVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGG 529
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+D++FA+ + I I+W GYPG+ GG AIADV+FG+ NPGG+LP+TWY DYV LP+T
Sbjct: 530 PIDVSFAKDHPKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMT 589
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+M LRP S YPGRTY+FY GP +YPFG+GLSYT F + +LS T+ V + +H N
Sbjct: 590 TMSLRPGTS--YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHRHPHN 647
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
S+ V V +CD KV +N+G+ DG+ ++VYS PP +
Sbjct: 648 ------GSEFWGKAVRVTHAKCDRLSLVIKVAVRNIGARDGAHTLLVYSIPPMGVWVPQ- 700
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
KQ++ F++V + A K ++ + CK L++VD +P GEH I +G+
Sbjct: 701 KQLVAFEKVHIDAQALKEVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGD 751
>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
Length = 781
Score = 785 bits (2028), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/755 (52%), Positives = 520/755 (68%), Gaps = 22/755 (2%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
+S CD G + S+F FC++SL Y R KDLVSR+TL EK QQL + + G+ R
Sbjct: 20 TSQKHACDKG-----SPKTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQLVNPSTGISR 74
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+P YEWWSEALHGVSNVGPGT FD +PGATSFP VIL+ ASFNE+LW +GQ VS E
Sbjct: 75 LGVPAYEWWSEALHGVSNVGPGTRFDSRVPGATSFPAVILSAASFNETLWYTMGQVVSNE 134
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
ARAMYN+ AGLT+WSPN+NV RDPRWGR ETPGEDP VV RYAVNYVRGLQ+V +
Sbjct: 135 ARAMYNVDLAGLTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEAS 194
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
A LKVSSCCKHY AYDVDNWKGVDR+HFDA+VT+QD+E+T+ PF+ CV EG
Sbjct: 195 A---KGDRLKVSSCCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFKSCVLEGHV 251
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
SSVMCSYNRVNGIP+CADP LL +RG+W L GYIV+DCDS++V ++ + + EDA
Sbjct: 252 SSVMCSYNRVNGIPTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHY-TKTPEDA 310
Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQY 388
VA LKAGL+++CG + +T NAV KV + +D++L Y Y VLMRLGFF+ S +
Sbjct: 311 VALALKAGLNMNCGDFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFENPKSLPF 370
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+LG D+C+ EN +LA EAA++GIVLL+N++ LPL+ K+K +AV+GP+ANAT MI
Sbjct: 371 ANLGPSDVCTKENQQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNANATTVMIS 430
Query: 449 NYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
NYAGIPCRY SP+ G Y ++VTY GC DV C + N AA +AA +ADA +++ GLD
Sbjct: 431 NYAGIPCRYSSPLQGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADAVVLVVGLD 490
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
S+EAE LDR +L LPG+Q +L+ VA KG +ILVIM+AG +DI+F ++ +NI ILW
Sbjct: 491 QSIEAEGLDRVNLTLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSVSNIGGILW 550
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
GYPG++GG AIA V+FG +NPGGR P TWY YV +P+T M +R S +PGRTY+
Sbjct: 551 VGYPGQDGGNAIAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANSSRNFPGRTYR 610
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN-KLQHCRNLNYTSDASKTRCPGVLV 686
FYNG +LY FGYGLSY+ F ++ S TI + N + N + D + +
Sbjct: 611 FYNGKSLYEFGYGLSYSTFSTHIASAPSTIMLQKNTSISKPLNNIFLDDQV------IDI 664
Query: 687 NDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAE--IAATYIKQVIGFQRVFVRAGR 743
+ + C + F + +N G DGS VV+V+ +PP+ ++ +KQ+IGF+R V+ G+
Sbjct: 665 STISCFNLTFSLVIGVKNNGPFDGSHVVLVFLEPPSSEAVSGVPLKQLIGFERAQVKVGK 724
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ + + CK L+ VD L G+H I VG
Sbjct: 725 TEFVTVKIDICKMLSNVDSDGKRKLVIGQHNILVG 759
>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 784 bits (2024), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/779 (48%), Positives = 524/779 (67%), Gaps = 26/779 (3%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
V LCF + + N +G +S VF CD + L + + FCD SL R
Sbjct: 11 VPVFLCFFSFMFVATVLLNCDRVSGQTSSVFACDVAKNPAL----AGYGFCDKSLSLEDR 66
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
V DLV R+TL EK+ L + A V RLG+P+YEWWSEALHGVSNVGPGTHF ++PGATS
Sbjct: 67 VADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGATS 126
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP ILT ASFN SL++ IG+ VSTEARAMYN+G AGLTYWSPNIN+ RDPRWGR ETP
Sbjct: 127 FPMPILTAASFNASLFEAIGRVVSTEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQETP 186
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP + +YA YV+GLQ + D +S LKV++CCKHY AYD+DNWKG+ RY F
Sbjct: 187 GEDPLLSSKYATGYVKGLQQTD------DGDSNKLKVAACCKHYTAYDLDNWKGIQRYTF 240
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
+A VT+QDM++TF PF+ CV +G+ +SVMCSYN+VNG P+CADP LL +RGEW L+G
Sbjct: 241 NAVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGEWKLNG 300
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
YIV+DCDS++V+ + + + E+A A+T+ AGLDL+CG Y +T AV+QG + E
Sbjct: 301 YIVSDCDSVEVLFKDQHY-TKTPEEAAAETILAGLDLNCGNYLGQYTEGAVKQGLLDEAS 359
Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
I+ ++ + LMRLGFFDG P Y +LG D+C+ EN ELA EAAR+GIVLLKN
Sbjct: 360 INNAVSNNFATLMRLGFFDGDPSKQTYGNLGPNDVCTSENRELAREAARQGIVLLKNSLG 419
Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVAC 481
+LPLN+ +K++AV+GP+ANAT MIGNY GIPC Y+SP+ + +Y GC +V C
Sbjct: 420 SLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCNYISPLQALTALVPTSYAAGCPNVQC 479
Query: 482 KSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPV 541
+N + A++ A +ADAT+I+ G L++EAESLDR ++ LPG Q L+++VA +KGPV
Sbjct: 480 -ANAELDDATQIAASADATVIVVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPV 538
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
ILVIMS GG+D++FA++N I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY
Sbjct: 539 ILVIMSGGGMDVSFAKSNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQS 598
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
YV +P+T+M +R + GYPGRTY+FY G T++ FG G+S++ ++ ++ + + V L
Sbjct: 599 YVNKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFGDGISFSNIEHKIVKAPQLVSVPL 658
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP 720
+ CR+ + C + V D C + F+ + +N+G S VV+++ PP
Sbjct: 659 AEDHECRS---------SECMSLDVADEHCQNLAFDIHLGVKNMGKMSSSHVVLLFFTPP 709
Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++ K ++GF++V + +++F + CK L++VD N +P G+H + VGN
Sbjct: 710 -DVHNAPQKHLLGFEKVHLPGKSEAQVRFKVDICKDLSVVDELGNRKVPLGQHLLHVGN 767
>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Glycine max]
Length = 776
Score = 784 bits (2024), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/781 (49%), Positives = 530/781 (67%), Gaps = 30/781 (3%)
Query: 5 VSSLLCF-SLS-IALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYS 62
V LCF S + +A ++ + N V +G +S VF CD + L + + FCD SL
Sbjct: 11 VPVFLCFFSFTFVASVLLNCNRV--SGQTSAVFACDVAKNPAL----AGYGFCDKSLSVE 64
Query: 63 IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
RV DLV R+TL EK+ L + A V RLG+P+YEWWSEALHGVSNVGPGTHF ++PGA
Sbjct: 65 DRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGA 124
Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
TSFP ILT ASFN SL++ IG+ VSTEARAMYN+G AGLTYWSPNIN+ RDPRWGR E
Sbjct: 125 TSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQE 184
Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
TPGEDP + +YA YV+GLQ + D +S LKV++CCKHY AYD+DNWKG+ RY
Sbjct: 185 TPGEDPLLSSKYATGYVKGLQQTD------DGDSNKLKVAACCKHYTAYDLDNWKGIQRY 238
Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
F+A VT+QDM++TF PF+ CV +G+ +SVMCSYN+VNG P+CADP LL +RGEW L
Sbjct: 239 TFNAVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGIIRGEWKL 298
Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
+GYIV+DCDS++V+ + + + E+A AQT+ AGLDL+CG Y +T AV+QG + E
Sbjct: 299 NGYIVSDCDSVEVLFKDQHY-TKTPEEAAAQTILAGLDLNCGNYLGQYTEGAVKQGLLDE 357
Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKND 419
I+ ++ + LMRLGFFDG P Y +LG +D+C+ EN ELA EAAR+GIVLLKN
Sbjct: 358 ASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTSENRELAREAARQGIVLLKNS 417
Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
+LPLN+ +K++AV+GP+ANAT MIGNY GIPC Y+SP+ + +Y GC +V
Sbjct: 418 PGSLPLNAKTIKSLAVIGPNANATRVMIGNYEGIPCNYISPLQTLTALVPTSYAAGCPNV 477
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
C +N + A++ A +ADAT+I+ G L++EAESLDR ++ LPG Q L+++VA +KG
Sbjct: 478 QC-ANAELDDATQIAASADATVIIVGASLAIEAESLDRINILLPGQQQLLVSEVANASKG 536
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PVILVIMS GG+D++FA++N I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY
Sbjct: 537 PVILVIMSGGGMDVSFAKSNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYP 596
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
YV +P+T+M +R + GYPGRTY+FY G T++ FG G+S++ ++ ++ + + V
Sbjct: 597 QAYVNKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFGDGISFSSIEHKIVKAPQLVSV 656
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSK 718
L + CR+ + C + + D C + F+ + +N G S VV+++
Sbjct: 657 PLAEDHECRS---------SECMSLDIADEHCQNLAFDIHLGVKNTGKMSTSHVVLLFFT 707
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
PP ++ K ++GF++V + +++F + CK L++VD N +P G+H + VG
Sbjct: 708 PP-DVHNAPQKHLLGFEKVHLPGKSEAQVRFKVDVCKDLSVVDELGNRKVPLGQHLLHVG 766
Query: 779 N 779
N
Sbjct: 767 N 767
>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
Length = 774
Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/750 (50%), Positives = 516/750 (68%), Gaps = 27/750 (3%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CDP + GL F FC++ +P +RV+DL++R+TL EK++ + + A VPRLG+
Sbjct: 36 FACDP----RNGL-TRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQ 90
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPGT F PGAT FP VI T ASFN+SLW++IG+ VS EARAM
Sbjct: 91 GYEWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAM 150
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN G+AGLTYWSPN+N+ RDPRWGR ETPGEDP + +YA +YV+GLQ D
Sbjct: 151 YNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQG--------DG 202
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKHY AYD+DNW GVDR+HF+A+V++QD+E+T+ PF+ CV EG +SVM
Sbjct: 203 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVM 262
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG P+CADP LL T+RG+W L+GYIV+DCDS+ V DN + + E+A A+
Sbjct: 263 CSYNQVNGKPTCADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQHY-TRTPEEAAAEA 321
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+KAGLDLDCG + T +A+++G + E D++ +L L TV MRLG FDG P + +L
Sbjct: 322 IKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMFDGEPSTQPFGNL 381
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ + +LA EAARE IVLL+N N+LPL+ ++++ V V+GP+ +ATV MIGNYA
Sbjct: 382 GPRDVCTPAHQQLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPNTDATVTMIGNYA 441
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
G+ C Y +P+ G + Y ++ GC VAC+ N AA A+ DAT+++ GLD ++E
Sbjct: 442 GVACGYTTPLQGIARYVKTAHQVGCRGVACRGNELFGAAEIIARQVDATVLVMGLDQTIE 501
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AE+ DR L LPG Q +L+ +VA AKGPVILVIMS G VD++FA+ N I AILW GYP
Sbjct: 502 AETRDRVGLLLPGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKNNPKISAILWVGYP 561
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG AIADV+FG NPGGRLP+TWY Y+ +P+T+M +RP + GYPGRTY+FY G
Sbjct: 562 GQAGGTAIADVIFGATNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPATGYPGRTYRFYKG 621
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
P ++PFG+GLSY++F +L K + V + LQ N +S A K V+ C
Sbjct: 622 PVVFPFGHGLSYSRFSQSLALAPKQVSVQILSLQALTNSTLSSKAVK-------VSHANC 674
Query: 692 DDYF--EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
DD EF VD +N GS DG+ ++++SKPP + IKQ++ F + V AG +R+K
Sbjct: 675 DDSLETEFHVDVKNEGSMDGTHTLLIFSKPPPG-KWSQIKQLVTFHKTHVPAGSKQRLKV 733
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++CK L++VD +P GEH + +G+
Sbjct: 734 NVHSCKHLSVVDQFGVRRIPTGEHELHIGD 763
>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 782
Score = 781 bits (2018), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/759 (50%), Positives = 520/759 (68%), Gaps = 26/759 (3%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CDP L+ FC ++LP +RV+DL+SR+TL EK++ L + A VPRLG+
Sbjct: 42 FACDPRNGVTRNLK-----FCRANLPIHVRVRDLISRLTLQEKIRLLVNNAAAVPRLGIQ 96
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPG F PGATSFP VI T ASFN+SLW++IG+ VS EARAM
Sbjct: 97 GYEWWSEALHGVSNVGPGVKFGGAFPGATSFPQVITTAASFNQSLWEQIGRVVSDEARAM 156
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN G AGLTYWSPN+NV RDPRWGR ETPGEDP + G+YA +YVRGLQ G +
Sbjct: 157 YNGGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQSSTGLK----- 211
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKHY AYD+DNW GVDRYHF+ARV++QD+E+T+ PF+ CV EG +SVM
Sbjct: 212 ----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKACVVEGKVASVM 267
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG P+CADP LL T+RG+W L+GYIV+DCDS+ V+ DN + + + E+A A T
Sbjct: 268 CSYNQVNGKPTCADPILLKNTIRGQWGLNGYIVSDCDSVGVLYDNQHYTS-TPEEAAAAT 326
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+KAGLDLDCG + T NAV++G + E D++ +L TV MRLG FDG P Y +L
Sbjct: 327 IKAGLDLDCGPFLAIHTENAVKKGLLVEEDVNLALANTITVQMRLGMFDGEPSAHPYGNL 386
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ + ELA EAAR+GIVLL+N LPL+S++ T+AV+GP+++ TV MIGNYA
Sbjct: 387 GPRDVCTPAHQELALEAARQGIVLLENRGQALPLSSSRHHTIAVIGPNSDVTVTMIGNYA 446
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
GI C+Y SP+ G S YA ++ GC DVAC SN AA AA+ ADAT+++ GLD S+E
Sbjct: 447 GIACKYTSPLQGISRYAKTLHQNGCGDVACHSNQQFGAAEAAARQADATVLVMGLDQSIE 506
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AE DR L LPG+Q +L+++VA ++GP ILV+MS G +D++FA+ + + AILWAGYP
Sbjct: 507 AEFRDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRVGAILWAGYP 566
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG AIADV+FG NPGG+LP+TWY Y+ +P+T+M +RP + GYPGRTY+FY G
Sbjct: 567 GQAGGAAIADVLFGTTNPGGKLPMTWYPQGYLAKVPMTNMGMRPDPATGYPGRTYRFYKG 626
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
++PFG+G+SYT F ++L K + + + L LN T + R V+ + C
Sbjct: 627 NVVFPFGHGMSYTSFSHSLTQAPKEVSLPITNLY---ALNTTISSKAIR-----VSHINC 678
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
++ +N G+ DG+ ++V+S PP+ + KQ+IGF++V + AG ++K
Sbjct: 679 QTSLGIDINVKNTGTMDGTHTLLVFSSPPSGEKESSNKQLIGFEKVDLVAGSQIQVKIDI 738
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
+ CK L+ VD +P G+H I++G+ S + N
Sbjct: 739 HVCKHLSAVDRFGIRRIPIGDHHIYIGDLKHSISLQANM 777
>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
AltName: Full=Xylan
1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
Full=Alpha-N-arabinofuranosidase; AltName:
Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
Flags: Precursor
gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
varia]
Length = 774
Score = 780 bits (2014), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/779 (49%), Positives = 523/779 (67%), Gaps = 28/779 (3%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
VS LCF + A L+ S V + +S VF CD + L +++ FC+ L R
Sbjct: 11 VSVFLCFFVLFATLLLSGGRVSSQ--TSAVFACDVAKNPAL----ANYGFCNKKLSVDAR 64
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
VKDLV R+TL EKV L + A V RLG+P+YEWWSEALHGVSN+GPGTHF +VIPGATS
Sbjct: 65 VKDLVRRLTLQEKVGNLVNSAVDVSRLGIPKYEWWSEALHGVSNIGPGTHFSNVIPGATS 124
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP IL ASFN SL++ IG+ VSTEARAM+N+G AGLTYWSPNIN+ RDPRWGR ETP
Sbjct: 125 FPMPILIAASFNASLFQTIGKVVSTEARAMHNVGLAGLTYWSPNINIFRDPRWGRGQETP 184
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP + +YA YV+GLQ + D +S LKV++CCKHY AYDVD+WKGV RY F
Sbjct: 185 GEDPLLASKYAAGYVKGLQQTD------DGDSNKLKVAACCKHYTAYDVDDWKGVQRYTF 238
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
+A VT+QD+++T+ PF+ CV +G+ +SVMCSYN+VNG P+CADP LL +RG+W L+G
Sbjct: 239 NAVVTQQDLDDTYQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGKWKLNG 298
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
YIV+DCDS+ V+ N + + E+A A+++ AGLDL+CG + +T AV+QG + E
Sbjct: 299 YIVSDCDSVDVLFKNQHY-TKTPEEAAAKSILAGLDLNCGSFLGRYTEGAVKQGLIGEAS 357
Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
I+ ++ + LMRLGFFDG P Y +LG +D+C+ N ELA EAAR+GIVLLKN
Sbjct: 358 INNAVYNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTSANQELAREAARQGIVLLKNCAG 417
Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVAC 481
+LPLN+ +K++AV+GP+ANAT AMIGNY GIPC+Y SP+ G + ++ GC DV C
Sbjct: 418 SLPLNAKAIKSLAVIGPNANATRAMIGNYEGIPCKYTSPLQGLTALVPTSFAAGCPDVQC 477
Query: 482 KSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPV 541
+N ++ A + A +ADAT+I+ G +L++EAES DR ++ LPG Q QL+ +VA VAKGPV
Sbjct: 478 -TNAALDDAKKIAASADATVIVVGANLAIEAESHDRINILLPGQQQQLVTEVANVAKGPV 536
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
IL IMS GG+D++FA+TN I +ILW GYPGE GG AIADV+FG NP GRLP+TWY
Sbjct: 537 ILAIMSGGGMDVSFAKTNKKITSILWVGYPGEAGGAAIADVIFGYHNPSGRLPMTWYPQS 596
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
YV +P+T+M +RP + GYPGRTY+FY G T++ FG G+SY+ F++ L+ + + V L
Sbjct: 597 YVDKVPMTNMNMRPDPATGYPGRTYRFYKGETVFSFGDGISYSTFEHKLVKAPQLVSVPL 656
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP 720
+ CR+ ++C + V C + F+ + +N G S V ++S PP
Sbjct: 657 AEDHVCRS---------SKCKSLDVVGEHCQNLAFDIHLRIKNKGKMSSSQTVFLFSTPP 707
Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
A A K ++ F++V + + F + CK L +VD N + G+H + VG+
Sbjct: 708 AVHNAPQ-KHLLAFEKVLLTGKSEALVSFKVDVCKDLGLVDELGNRKVALGKHMLHVGD 765
>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
Length = 771
Score = 778 bits (2010), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/785 (49%), Positives = 522/785 (66%), Gaps = 37/785 (4%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLP 60
M+ S +L I LL S +A D+ F CD + L FC+ L
Sbjct: 1 MSSTFSLSPLITLFILLLQSSCDARDS-------FACDAKDAATKNLP-----FCNVKLA 48
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
RVKDL+ R+T+ EKV L + A VPR+G+ YEWWSEALHGVSNVGPGT F V P
Sbjct: 49 IPERVKDLIGRLTMQEKVNLLVNNAPAVPRVGMKSYEWWSEALHGVSNVGPGTRFGGVFP 108
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR
Sbjct: 109 AATSFPQVITTAASFNASLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRG 168
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
ETPGEDP + GRYA +YV+GLQ +G++ LKV++CCKH+ AYDVDNW GVD
Sbjct: 169 QETPGEDPVLAGRYAASYVKGLQGTDGNK---------LKVAACCKHFTAYDVDNWNGVD 219
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
R+HF+A V++QD+E+TF PF MCVKEG +SVMCSYN+VNG+P+CADP LL +TVRG W
Sbjct: 220 RFHFNALVSKQDIEDTFDVPFRMCVKEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGVW 279
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
L GYIV+DCDS+ V+ ++ + + + E+A A +KAGLDLDCG + T +AV++G +
Sbjct: 280 GLDGYIVSDCDSVGVLYNSQHYTS-TPEEAAADAIKAGLDLDCGPFLGVHTQDAVKKGLL 338
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLK 417
E D++ +L V MRLG FDG P Y LG +D+C + ELA EAAR+GIVLLK
Sbjct: 339 TEADVNNALVNTLKVQMRLGMFDGEPSAQAYGRLGPKDVCKPAHQELALEAARQGIVLLK 398
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCD 477
N TLPL+ + +TVAV+GP+++ TV MIGNYAGI C Y SP+ G YA ++ GC
Sbjct: 399 NTGPTLPLSPQRHRTVAVIGPNSDVTVTMIGNYAGIACGYTSPLQGIGRYAKTIHQQGCS 458
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVA 537
+VAC+ + A +AA+ ADATI++ GLD S+EAE++DR L LPG+Q L+++VA +
Sbjct: 459 NVACRDDKQFGPALDAARHADATILVIGLDQSIEAETVDRTSLLLPGHQQDLVSKVAAAS 518
Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
KGP ILV+MS G VDI FA+ + + ILWAGYPG+ GG AIAD++FG +PGG+LP+TW
Sbjct: 519 KGPTILVLMSGGPVDITFAKNDPKVAGILWAGYPGQAGGAAIADILFGTASPGGKLPVTW 578
Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
Y +Y++ L +T+M +RP +GYPGRTY+FY GP +YPFG+GL+YT F + L S +
Sbjct: 579 YPQEYLKNLAMTNMAMRP-SKIGYPGRTYRFYKGPVVYPFGHGLTYTHFVHELSSAPTVV 637
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVY 716
V ++ +H N N ++ A + V RC VD +NVGS DG+ ++V+
Sbjct: 638 SVPVHGHRHGNNTNISNKA-------IRVTHARCGKLSIALHVDVKNVGSRDGTHTLLVF 690
Query: 717 SKPPAEIAATYI--KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
S PP ++ K ++ F++V V A +R++ + CK L++VD + +P GEH+
Sbjct: 691 SAPP-NGGNHWVPQKSLVAFEKVHVPAKTKQRVRVNIHVCKLLSVVDKSGIRRIPMGEHS 749
Query: 775 IFVGN 779
+ +G+
Sbjct: 750 LHIGD 754
>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
Length = 865
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/799 (50%), Positives = 520/799 (65%), Gaps = 56/799 (7%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
V + SL IA LV S N F CD + + + FC+ SL Y R
Sbjct: 3 VGRFVGVSLLIAALVSSLCESQKN------FACD-----RNDPATAKYGFCNVSLSYEAR 51
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
KDLVSR++L EKVQQL + A GV RLG+P YEWWSEALHGVS+VGPG F+ +PGATS
Sbjct: 52 AKDLVSRLSLKEKVQQLVNKATGVSRLGVPPYEWWSEALHGVSDVGPGVRFNGTVPGATS 111
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP ILT ASFN SLW K+G+ VSTEARAM+N+G AGLTYWSPN+N+ RDPRWGR ETP
Sbjct: 112 FPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNIFRDPRWGRGQETP 171
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP VV +YAVNYV+GLQDV+ SR LKVSSCCKHY AYD+DNWKG+DR+HF
Sbjct: 172 GEDPLVVSKYAVNYVKGLQDVQDAG-----KSRRLKVSSCCKHYTAYDLDNWKGIDRFHF 226
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
DA+VT+QD+E+T+ PF+ CV+EGD SSVMCSYNRVNGIP+CADP LL +RG+W L G
Sbjct: 227 DAKVTKQDLEDTYQPPFKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDG 286
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
YIV+DCDSIQV D+ + K L+++CG + +T NAV+ K+ ++
Sbjct: 287 YIVSDCDSIQVYFDDIHY------------TKTRLNMNCGDFLGKYTENAVKLKKLNGSE 334
Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
+D++L Y Y VLMRLGFFDG P+ + LG D+CS ++ LA EAA++GIVLL+N +
Sbjct: 335 VDEALIYNYIVLMRLGFFDGDPKSLPFGQLGPSDVCSKDHQMLALEAAKQGIVLLEN-RG 393
Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA--NVTYKTGCDDV 479
LPL+ VK +AV+GP+ANAT MI NYAG+PC+Y SP+ G Y V Y+ GC DV
Sbjct: 394 DLPLSKTAVKKIAVIGPNANATKVMISNYAGVPCKYTSPLQGLQKYVPEKVVYEPGCKDV 453
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
C I AA +A AD T+++ GLD +VEAE LDR +L LPGYQ +L+ VA AK
Sbjct: 454 NCGEQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKK 513
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
V+LVIMSAG +DI+FA+ + I A+LW GYPGE GG AIA V+FG +NP GRLP TWY+
Sbjct: 514 TVVLVIMSAGPIDISFAKNLSTISAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYS 573
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
++ + +T M +RP + G+PGR+Y+FY G +Y FGYGLSY+ F +LS I +
Sbjct: 574 QEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFGYGLSYSAFSTFVLSAPSIIHI 633
Query: 660 NLNKLQHCRNLNYTS--DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
N + NLN T+ D S C +DL+ + +N G GS VV+V+
Sbjct: 634 KTNPIL---NLNKTTSIDISTVNC-----HDLK----IRIVIGVKNRGQRSGSHVVLVFW 681
Query: 718 KPPAEIAATYI------KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAG 771
KPP + + T + Q++GF+RV V +++ F+ CK+L++VD L G
Sbjct: 682 KPP-KCSKTLVGAGVPQTQLVGFERVEVGRSMTEKVTVEFDVCKALSLVDTHGKRKLVTG 740
Query: 772 EHTIFVG-NGGVSFPIHLN 789
HT+ +G N HLN
Sbjct: 741 HHTLVIGSNSDQQIYHHLN 759
>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 765
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/749 (50%), Positives = 509/749 (67%), Gaps = 26/749 (3%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CD G+ + + + FCD SL RVKDLV R+TL EK+ L + A V RLG+P
Sbjct: 30 FACDVGKSPAV----AGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIP 85
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
+YEWWSEALHGVSNVGPGT F +VIPGATSFP ILT ASFN SL++ IG+ VSTEARAM
Sbjct: 86 KYEWWSEALHGVSNVGPGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAM 145
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN+G AGLTYWSPNIN+ RDPRWGR ETPGEDP + +YA YV+GLQ +G +
Sbjct: 146 YNVGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGD----- 200
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKHY AYDVDNWKG+ RY F+A VT+QDME+TF PF+ CV +G+ +SVM
Sbjct: 201 -PNKLKVAACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQPPFKSCVIDGNVASVM 259
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG P+CADP LL VRGEW L+GYIV+DCDS++V+ + + + E+A A +
Sbjct: 260 CSYNKVNGKPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYKDQHY-TKTPEEAAAIS 318
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+ AGLDL+CG++ +T AV+QG + E I+ ++ + LMRLGFFDG P+ Y +L
Sbjct: 319 ILAGLDLNCGRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRLGFFDGDPRKQPYGNL 378
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ EN ELA EAAR+GIVLLKN +LPLN+ +K++AV+GP+ANAT MIGNY
Sbjct: 379 GPKDVCTQENQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYE 438
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
GIPC+Y+SP+ G + +A +Y GC DV C N + A + A +ADAT+I+ G L++E
Sbjct: 439 GIPCKYISPLQGLTAFAPTSYAAGCLDVRC-PNPVLDDAKKIAASADATVIVVGASLAIE 497
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AESLDR ++ LPG Q L+++VA +KGPVILVIMS GG+D++FA+ N I +ILW GYP
Sbjct: 498 AESLDRVNILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKNNNKITSILWVGYP 557
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
GE GG AIADV+FG NP GRLP+TWY YV +P+T+M +RP + GYPGRTY+FY G
Sbjct: 558 GEAGGAAIADVIFGFHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPATGYPGRTYRFYKG 617
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
T++ FG GLSY+ + L+ + + V L + CR+ + C + V C
Sbjct: 618 ETVFAFGDGLSYSSIVHKLVKAPQLVSVQLAEDHVCRS---------SECKSIDVVGEHC 668
Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ F+ + +N G + V ++S PPA A K ++GF++V + + F
Sbjct: 669 QNLVFDIHLRIKNKGKMSSAHTVFLFSTPPAVHNAPQ-KHLLGFEKVHLIGKSEALVSFK 727
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK L+IVD N + G+H + VG+
Sbjct: 728 VDVCKDLSIVDELGNRKVALGQHLLHVGD 756
>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
Length = 774
Score = 774 bits (1998), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/782 (50%), Positives = 526/782 (67%), Gaps = 28/782 (3%)
Query: 2 AKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPY 61
A V+ LCF + + S V A SSPVF CD LG F FC++SL
Sbjct: 8 APKVTVFLCFLSCFSHFLSSPKWVLAQ--SSPVFACDVENNPTLG----QFGFCNTSLET 61
Query: 62 SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
+ RV DLV R+TL+EK+ L + A V RLG+P+YEWWSEALHGVS VGPGTHF+ V+PG
Sbjct: 62 AARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPG 121
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
ATSFP VILT ASFN SL++ IG+AVSTEARAMYN+G AGLT+WSPN+N+ RDPRWGR
Sbjct: 122 ATSFPQVILTAASFNASLFEAIGKAVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQ 181
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ETPGEDP + +YA YVRGLQ + D + LKV++CCKHY AYD+DNWKGVDR
Sbjct: 182 ETPGEDPLLSSKYASGYVRGLQQSD------DGSPDRLKVAACCKHYTAYDLDNWKGVDR 235
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
+HF+A VT+QDM++TF PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+ VRGEW
Sbjct: 236 FHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWK 295
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
L+GYIV+DCDS+ V ++ + + E+A A+ + AGLDL+CG + T AV+ G V
Sbjct: 296 LNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVD 354
Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
E+ +DK++ + LMRLGFFDG+P Y LG +D+C+ E+ ELA EAAR+GIVLLKN
Sbjct: 355 ESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKN 414
Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDD 478
+ +LPL+ +KT+AV+GP+AN T MIGNY G PC+Y +P+ G + TY GC +
Sbjct: 415 SKGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSN 474
Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
VAC + I A + A ADAT+++ G+D S+EAE DR ++ LPG Q LI +VA+ +K
Sbjct: 475 VACGTAQ-IDEAKKIAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASK 533
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
G VILV+MS GG DI+FA+ + I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY
Sbjct: 534 GNVILVVMSGGGFDISFAKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWY 593
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
YV +P+T+M +RP + GYPGRTY+FY G T+Y FG GLSYTQF ++L+ K++
Sbjct: 594 PQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVS 653
Query: 659 VNLNKLQHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
+ + + C + S DA + C ++ F+ + N G+ GS V ++S
Sbjct: 654 IPIEEGHSCHSSKCKSVDAVQESCQNLV---------FDIHLRVNNAGNISGSHTVFLFS 704
Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
PP+ + + K ++GF++VFV A ++F + CK L+IVD + G H + V
Sbjct: 705 SPPS-VHNSPQKHLLGFEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHV 763
Query: 778 GN 779
GN
Sbjct: 764 GN 765
>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
Length = 784
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/792 (49%), Positives = 522/792 (65%), Gaps = 29/792 (3%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLP 60
++ V + LCF L L +N SSPVF CD L +++ FC++ L
Sbjct: 18 VSSVFLTFLCFFLYFLDL--------SNAQSSPVFACDVAANPSL----AAYGFCNTVLK 65
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
RV DLV+R+TL EK+ L A+GV RLG+P YEWWSEALHGVS +GPGTHF +P
Sbjct: 66 IEYRVADLVARLTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVP 125
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
GATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGLTYWSPN+N+ RDPRWGR
Sbjct: 126 GATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRG 185
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
ETPGEDP + +YA YV+GLQ+ +G + S LKV++CCKHY AYDVDNWKGV+
Sbjct: 186 QETPGEDPLLASKYASGYVKGLQETDGGD------SNRLKVAACCKHYTAYDVDNWKGVE 239
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
RY F+A VT+QDM++T+ PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+ +RGEW
Sbjct: 240 RYSFNAVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEW 299
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
L+GYIV+DCDS+ V+ N + E A A ++ AGLDL+CG + T AV+ G V
Sbjct: 300 KLNGYIVSDCDSVDVLYKNQHYTKTPAE-AAAISILAGLDLNCGSFLGQHTEEAVKSGLV 358
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLK 417
E IDK++ + LMRLGFFDG+P+ Y LG D+C+ N ELAA+AAR+GIVLLK
Sbjct: 359 NEAAIDKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLK 418
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCD 477
N LPL+ +KT+AV+GP+AN T MIGNY G PC+Y +P+ G +G + TY GC
Sbjct: 419 N-TGFLPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGAVSTTYLPGCS 477
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVA 537
+VAC + + A++ A TAD T++L G D S+EAES DR DL LPG Q +L+ QVA+ A
Sbjct: 478 NVACAVAD-VAGATKLAATADVTVLLIGADQSIEAESRDRVDLNLPGQQQELVIQVAKAA 536
Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
KGPV+LVIMS GG DI FA+ + I ILW GYPGE GG AIAD++FG++NP GRLP+TW
Sbjct: 537 KGPVLLVIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGRLPMTW 596
Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
Y YV+ +P+T M +RP S GYPGRTY+FY G T+Y FG GLSYT+F ++L+ +
Sbjct: 597 YPQSYVEKVPMTIMNMRPDKSKGYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPSLV 656
Query: 658 QVNLNKLQHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
++L + CR+ S DA C + FE ++ +N G +G V ++
Sbjct: 657 SLSLEENHVCRSSECQSLDAIGPHCENAVSGG---GSAFEVQIKVRNGGDREGIHTVFLF 713
Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
+ PPA I + K ++GF+++ + ++F CK L++VD + G+H +
Sbjct: 714 TTPPA-IHGSPRKHLLGFEKIRLGKMEEAVVRFKVEVCKDLSVVDEIGKRKIGLGKHLLH 772
Query: 777 VGNGGVSFPIHL 788
VG+ S I +
Sbjct: 773 VGDLKHSLSIRI 784
>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
Length = 780
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/789 (48%), Positives = 524/789 (66%), Gaps = 37/789 (4%)
Query: 11 FSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVS 70
FSLS+ L ++ N S+PVF CD L +++ FC++++ RV DLV+
Sbjct: 18 FSLSLIFLCLLDSS---NAQSTPVFACDVAGNPSL----AAYGFCNTAIKIEYRVADLVA 70
Query: 71 RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
R+TL EK+ L HGV RLG+P YEWWSEALHGVS VGPGT F +PGATSFP VIL
Sbjct: 71 RLTLQEKIGVLTSKLHGVARLGIPTYEWWSEALHGVSYVGPGTRFSGQVPGATSFPQVIL 130
Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFV 190
T ASFN SL++ IG+ VSTEARAMYN+G AGLTYWSPN+N+ RDPRWGR ETPGEDP +
Sbjct: 131 TAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLL 190
Query: 191 VGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTE 250
+YA YV+GLQ+ + ++D N LKV++CCKHY AYDVDNWKGV+RY F+A V +
Sbjct: 191 SSKYASGYVKGLQETD----SSDANR--LKVAACCKHYTAYDVDNWKGVERYSFNAVVNQ 244
Query: 251 QDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADC 310
QD+++T+ PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+ +RGEW L+GYIV+DC
Sbjct: 245 QDLDDTYQPPFKSCVVDGNVASVMCSYNKVNGKPTCADPDLLSGVIRGEWKLNGYIVSDC 304
Query: 311 DSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLK 370
DS+ V+ N + + E+A A ++ AGLDL+CG + + T AV+ G VKE IDK++
Sbjct: 305 DSVDVLYKNQHY-TKTPEEAAAISINAGLDLNCGYFLGDHTEAAVKAGLVKEAAIDKAIT 363
Query: 371 YLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
+ LMRLGFFDG P+ Y LG +D+C+ N ELAAEAAR+GIVLLKN LPL+
Sbjct: 364 NNFLTLMRLGFFDGDPKKQIYGGLGPKDVCTPANQELAAEAARQGIVLLKN-TGALPLSP 422
Query: 428 AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
+KT+AV+GP+AN T MIGNY G PC+Y +P+ G +G + TY GC +VAC + +
Sbjct: 423 KTIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTVHTTYLPGCSNVACAVAD-V 481
Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
+++ A +DAT+++ G D S+EAES DR DL LPG Q +L+ QVA+ AKGPV LVIMS
Sbjct: 482 AGSTKLAAASDATVLVIGADQSIEAESRDRVDLNLPGQQQELVTQVAKAAKGPVFLVIMS 541
Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
GG DI FA+ + I ILW GYPGE GG A ADV+FG++NP GRLP+TWY YV+ +P
Sbjct: 542 GGGFDITFAKNDAKIAGILWVGYPGEAGGIATADVIFGRYNPSGRLPMTWYPQSYVEKVP 601
Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
+T+M +RP S GYPGRTY+FY G T+Y FG GLSYT+F ++L+ + + ++L + C
Sbjct: 602 MTNMNMRPDKSNGYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPRLVSLSLEENHVC 661
Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDD--------YFEFKVDFQNVGSTDGSDVVIVYSKP 719
R+ + C + CD+ FE + QN G +G V +++ P
Sbjct: 662 RS---------SECQSLNAIGPHCDNAVSGTGGKAFEVHIKVQNGGDREGIHTVFLFTTP 712
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
PA + + K ++GF+++ + +KF + CK L++VD + G+H + VG+
Sbjct: 713 PA-VHGSPRKHLLGFEKIRLGKMEEAVVKFKVDVCKDLSVVDEVGKRKIGLGQHLLHVGD 771
Query: 780 GGVSFPIHL 788
S I +
Sbjct: 772 VKHSLSIRI 780
>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
[Glycine max]
Length = 901
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/744 (52%), Positives = 514/744 (69%), Gaps = 17/744 (2%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ S+F FCD+SL Y R KDLVSR+TL EK QQL + + G+ RLG+P YEWWSEALHGVS
Sbjct: 30 KTSNFPFCDTSLSYEDRAKDLVSRLTLQEKTQQLVNPSAGISRLGVPAYEWWSEALHGVS 89
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
N+GPGT FD +PGATSFP VIL+ ASFN SLW+K+GQ VSTEARAMYN+ AGLT+WSP
Sbjct: 90 NLGPGTRFDKKVPGATSFPAVILSAASFNASLWQKMGQVVSTEARAMYNVDLAGLTFWSP 149
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
N+NV RDPRWGR ETPGEDP VV RYAV Y+RGLQ+VE +A + LKVSSCCKH
Sbjct: 150 NVNVFRDPRWGRGQETPGEDPLVVSRYAVMYLRGLQEVEDEASA---KADRLKVSSCCKH 206
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
Y AYD+DNWKG+DR+HFDA+VT+QD+E+++ PF+ CV EG SSVMCSYNRVNGIP+CA
Sbjct: 207 YTAYDLDNWKGIDRFHFDAKVTKQDLEDSYQPPFKSCVVEGHVSSVMCSYNRVNGIPTCA 266
Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
DP LL +RG+W L GYIV+DCDS++V + + A + EDAVA LKAGL+++CG +
Sbjct: 267 DPDLLKGIIRGQWGLDGYIVSDCDSVEVYYNAIHYTA-TPEDAVALALKAGLNMNCGDFL 325
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELA 405
+T NAV KV +D++L Y Y VLMRLGFFD S + +LG D+C+ +N +LA
Sbjct: 326 KKYTANAVNLKKVDVATVDQALVYNYIVLMRLGFFDDPKSLPFANLGPSDVCTKDNQQLA 385
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
+AA++GIVLL+N+ LPL+ +K +AV+GP+ANAT MI NYAGIPCRY SP+ G
Sbjct: 386 LDAAKQGIVLLENNNGALPLSQTNIKKLAVIGPNANATTVMISNYAGIPCRYTSPLQGLQ 445
Query: 466 GY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
Y ++V Y GC +V C + + I AA +AA +ADA +++ GLD S+EAE LDRE+L LPG
Sbjct: 446 KYISSVNYAPGCSNVKCDNQSLIAAAVKAAASADAVVLVVGLDQSIEAEGLDRENLTLPG 505
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
+Q + + VA KG VILVIM+AG +DI+ ++ +NI ILW GYPG+ GG AIA V+F
Sbjct: 506 FQEKFVKDVAGATKGKVILVIMAAGPIDISSTKSVSNIGGILWVGYPGQAGGDAIAQVIF 565
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G +NPGGR P TWY YV +P+T M +R S +PGRTY+FYNG +LY FG+GLSY+
Sbjct: 566 GDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANKSRNFPGRTYRFYNGNSLYEFGHGLSYS 625
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP------GVLVNDLRCDDY-FEF 697
F + S +I + + N+ +S+ S T+ + ++ + C D F
Sbjct: 626 TFSMYVASAPSSIMIENTSISEPHNM-LSSNNSGTQVESLSDGQAIDISTINCQDLTFLL 684
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAE--IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
+ +N G +GS VV+V+ +P + IKQ+IGF+RV V G + + + C+
Sbjct: 685 VIGVKNNGPLNGSHVVLVFWEPATSEFVIGAPIKQLIGFERVQVVVGVTEFVTVKIDICQ 744
Query: 756 SLNIVDYAANTLLPAGEHTIFVGN 779
++ VD L G+HTI VG+
Sbjct: 745 LISNVDSDGKRKLVIGQHTILVGS 768
>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/754 (51%), Positives = 508/754 (67%), Gaps = 28/754 (3%)
Query: 32 SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
SPVF CD L +SF FC++SL S RV DLV R+TL EK+ L + A V RL
Sbjct: 1 SPVFACDVVSNPSL----ASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRL 56
Query: 92 GLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEA 151
G+P+YEWWSEALHGVS VGPGTHF V+PGATSFP VILT ASFN SL+ IG+ VSTEA
Sbjct: 57 GIPKYEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEA 116
Query: 152 RAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
RAMYN+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +Y YV+GLQ +
Sbjct: 117 RAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRD----- 171
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
D N LKV++CCKHY AYD+DNWKGVDRYHF+A VT+QDM++TF PF+ CV +G+ +
Sbjct: 172 -DGNPDGLKVAACCKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQPPFKSCVVDGNVA 230
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
SVMCSYN+VNGIP+CADP LL+ +RGEW L+GYIV DCDSI V ++ + + E+A
Sbjct: 231 SVMCSYNKVNGIPTCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYNSQHY-TKTPEEAA 289
Query: 332 AQTLKAG--LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A+ + AG LDL+CG + T AV G V E+ ID+++ + LMRLGFFDG P
Sbjct: 290 AKAILAGIRLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQ 349
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
Y LG +D+C+ EN ELA EAAR+GIVLLKN +LPL+ +K +AV+GP+AN T M
Sbjct: 350 LYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTM 409
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
IGNY G PC+Y +P+ G + TY GC +VAC S + A + A ADAT+++ G
Sbjct: 410 IGNYEGTPCKYTTPLQGLAALVATTYLPGCSNVAC-STAQVDDAKKIAAAADATVLVMGA 468
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
DLS+EAES DR D+ LPG Q LI VA + GPVILVIMS GG+D++FA+TN I +IL
Sbjct: 469 DLSIEAESRDRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSIL 528
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
W GYPGE GG AIAD++FG +NP GRLP+TWY YV +P+T+M +RP S GYPGRTY
Sbjct: 529 WVGYPGEAGGAAIADIIFGSYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTY 588
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
+FY G T+Y FG GLSY++F + L + V L + C Y+S+ C V
Sbjct: 589 RFYTGETVYSFGDGLSYSEFSHELTQAPGLVSVPLEENHVC----YSSE-----CKSVAA 639
Query: 687 NDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNK 745
+ C + F+ + +N G+T GS V ++S PP+ + + K ++GF++VF+ A +
Sbjct: 640 AEQTCQNLTFDVHLRIKNTGTTSGSHTVFLFSTPPS-VHNSPQKHLVGFEKVFLHAQTDS 698
Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ F + CK L++VD + + GEH + +G+
Sbjct: 699 HVGFKVDVCKDLSVVDELGSKKVALGEHVLHIGS 732
>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
Length = 777
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/753 (50%), Positives = 507/753 (67%), Gaps = 25/753 (3%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
SSPVF CD K ++SF FC+ SL S RV DLV+R+TL EK+ L + A V R
Sbjct: 37 SSPVFACD----VKSNPSLASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVNSAGSVSR 92
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+P+YEWWSEALHGVS VGPGTHF +++PGATSFP VILT ASFN SL++ IG+ VSTE
Sbjct: 93 LGIPKYEWWSEALHGVSYVGPGTHFSNIVPGATSFPQVILTAASFNASLFEAIGKVVSTE 152
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
ARAMYN+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +Y YVRGLQ + +
Sbjct: 153 ARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQQTDNGD- 211
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
S LKV++CCKHY AYD+DNWKG DRYHF+A VT+QD+++TF PF+ CV +G+
Sbjct: 212 -----SERLKVAACCKHYTAYDLDNWKGTDRYHFNAVVTKQDLDDTFQPPFKSCVIDGNV 266
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
+SVMCSYN+VNG P+CADP LL +RGEW L+GYIV+DCDS+ V+ ++ + + E+A
Sbjct: 267 ASVMCSYNQVNGKPTCADPDLLAGIIRGEWKLNGYIVSDCDSVDVIYNSQHY-TKTPEEA 325
Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--- 387
A T+ AGLDL+CG + T AV G + + +DK++ + LMRLGFFDG P
Sbjct: 326 AAITILAGLDLNCGSFLGKHTEAAVNAGLLNVSAVDKAVSNNFATLMRLGFFDGDPSKQL 385
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y LG +D+C+ N ELA EAAR+GIVLLKN +LPL+ +KT+AV+GP+AN T MI
Sbjct: 386 YGKLGPKDVCTAVNQELAREAARQGIVLLKNSPGSLPLSPTAIKTLAVIGPNANVTKTMI 445
Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
GNY G PC+Y +P+ G + TY GC +VAC + + A + A +ADAT+++ G D
Sbjct: 446 GNYEGTPCKYTTPLQGLTASVATTYLAGCSNVACAAAQ-VDDAKKLAASADATVLVMGAD 504
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
S+EAES DR D+ LPG Q LI QVA V+KGPVILVIMS GG+D++FA+TN I +ILW
Sbjct: 505 QSIEAESRDRVDVLLPGQQQLLITQVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILW 564
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
GYPGE GG AIADV+FG +NP GRLP+TWY YV +P+T+M +RP S GYPGRTY+
Sbjct: 565 VGYPGEAGGAAIADVIFGYYNPSGRLPMTWYPQAYVDKVPMTNMNMRPDPSSGYPGRTYR 624
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
FY G T+Y FG GLSY+++K+ L+ + + + L CR S ++C V
Sbjct: 625 FYTGETVYSFGDGLSYSEYKHQLVQAPQLVSIPLEDDHVCR--------SSSKCISVDAG 676
Query: 688 DLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
+ C F + +N+G G+ V ++ PP+ + + K ++ F++V + A
Sbjct: 677 EQNCQGLAFNIDLKVRNIGKVRGTHTVFLFFTPPS-VHNSPQKHLVDFEKVSLDAKTYGM 735
Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ F + CK L++VD + + G H + VGN
Sbjct: 736 VSFKVDVCKHLSVVDEFGSRKVALGGHVLHVGN 768
>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
Length = 784
Score = 767 bits (1981), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/785 (49%), Positives = 516/785 (65%), Gaps = 29/785 (3%)
Query: 8 LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
LCF L L FS N SSPVF CD L +++ FC++ L RV D
Sbjct: 25 FLCFFLY--FLNFS------NAQSSPVFACDVAANPSL----AAYGFCNTVLKIEYRVAD 72
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
LV+R+TL EK+ L A+GV RLG+P YEWWSEALHGVS +GPGTHF +PGATSFP
Sbjct: 73 LVARLTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQ 132
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
VILT ASFN SL++ IG+ VSTEARAMYN+G AGLTYWSPN+N+ RDPRWGR ETPGED
Sbjct: 133 VILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGED 192
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P + +YA YV+GLQ+ +G + S LKV++CCKHY AYDVDNWKGV+RY F+A
Sbjct: 193 PLLASKYASGYVKGLQETDGGD------SNRLKVAACCKHYTAYDVDNWKGVERYSFNAV 246
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
VT+QDM++T+ PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+ +RGEW L+GYIV
Sbjct: 247 VTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIV 306
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
+DCDS+ V+ N + E A A ++ AGLDL+CG + T AV+ G V E IDK
Sbjct: 307 SDCDSVDVLYKNQHYTKTPAE-AAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDK 365
Query: 368 SLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
++ + LMRLGFFDG+P+ Y LG D+C+ N ELAA+AAR+GIVLLKN LP
Sbjct: 366 AISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKN-TGCLP 424
Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSN 484
L+ +KT+AV+GP+AN T MIGNY G PC+Y +P+ G +G + TY GC +VAC
Sbjct: 425 LSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACAVA 484
Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
+ + A++ A TAD ++++ G D S+EAES DR DL LPG Q +L+ QVA+ AKGPV+LV
Sbjct: 485 D-VAGATKLAATADVSVLVIGADQSIEAESRDRVDLHLPGQQQELVIQVAKAAKGPVLLV 543
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
IMS GG DI FA+ + I ILW GYPGE GG AIAD++FG++NP G+LP+TWY YV+
Sbjct: 544 IMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVE 603
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
+P+T M +RP + GYPGRTY+FY G T+Y FG GLSYT+F + L+ + + L +
Sbjct: 604 KVPMTIMNMRPDKASGYPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEEN 663
Query: 665 QHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEI 723
CR+ S DA C + FE + +N G +G V +++ PPA I
Sbjct: 664 HVCRSSECQSLDAIGPHCENAVSGG---GSAFEVHIKVRNGGDREGIHTVFLFTTPPA-I 719
Query: 724 AATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+ K ++GF+++ + ++F CK L++VD + G+H + VG+ S
Sbjct: 720 HGSPRKHLVGFEKIRLGKREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHS 779
Query: 784 FPIHL 788
I +
Sbjct: 780 LSIRI 784
>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
gi|219885199|gb|ACL52974.1| unknown [Zea mays]
gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 794
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/763 (50%), Positives = 499/763 (65%), Gaps = 27/763 (3%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F C PG + +S FC SLP R +DLVSR+T EKV+ L + A GVPRLG+
Sbjct: 27 FACAPGGPA------ASLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVA 80
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVS+ GPG F PGAT+FP VI T AS N +LW+ +G+AVS EARAM
Sbjct: 81 GYEWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAM 140
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN GRAGLT+WSPN+N+ RDPRWGR ETPGEDP V RYA YVRGLQ N
Sbjct: 141 YNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGHR 200
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
N LK+++CCKH+ AYD+D W G DR+HF+A V QD+E+TF PF CV++G A+SVM
Sbjct: 201 NR--LKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVM 258
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG+P+CAD L T+RG W L GYIV+DCDS+ V + + + EDA A T
Sbjct: 259 CSYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHY-TRTPEDAAAAT 317
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
L+AGLDLDCG + + G+AV GKV + D+D +L TV MRLG FDG P + L
Sbjct: 318 LRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRL 377
Query: 392 GKQDICSDENIELAAEAAREGIVLLKN------DQNTLPLNSAKVKTVAVVGPHANATVA 445
G D+C+ E+ +LA +AAR+G+VLLKN +++ LPL A + VAVVGPHA+ATVA
Sbjct: 378 GPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVA 437
Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
MIGNYAG PCRY +P+ G + YA V ++ GC DVAC+ N I AA EAA+ ADAT+++A
Sbjct: 438 MIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVA 497
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GLD VEAE LDR L LPG Q +LI+ VA+ +KGPVILV+MS G +DIAFA+ + I
Sbjct: 498 GLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDG 557
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILW GYPG+ GG+AIADV+FG NPG +LP+TWY+ DY+Q +P+T+M +R + GYPGR
Sbjct: 558 ILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPGR 617
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP-- 682
TY+FY GPT+YPFG+GLSYTQF + L + V L+ H + + P
Sbjct: 618 TYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPVR 677
Query: 683 GVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP-----AEIAATYIKQVIGFQR 736
V V RC+ VD NVG DG+ V+VY P A A +Q++ F++
Sbjct: 678 AVRVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFEK 737
Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
V V AG R++ C L++ D +P GEH + +G
Sbjct: 738 VHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGE 780
>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
distachyon]
Length = 779
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/760 (51%), Positives = 500/760 (65%), Gaps = 29/760 (3%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
+ P F C PG S + FC +LP R +DLV+R+T EKV+ L + A GVPR
Sbjct: 23 TRPPFACAPGGPS------TRLPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPR 76
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+ YEWWSEALHGVS+ GPG F PGAT+FP VI T ASFN SLW+ IG+AVS E
Sbjct: 77 LGVEGYEWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDE 136
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
RA+YN +AGLT+WSPN+N+ RDPRWGR ETPGEDP V GRYA YVRGLQ
Sbjct: 137 GRAIYNGRQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQQHAGR- 195
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
LK ++CCKH+ AYD+D W G DR+HF+A VT QD+E+TF PF CV EG A
Sbjct: 196 --------LKTAACCKHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFNAPFRACVVEGRA 247
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
++VMCSYN+VNG+P+CAD L T+RG+W L GYIV+DCDS+ V + ++EDA
Sbjct: 248 AAVMCSYNQVNGVPTCADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYREQHY-TRTREDA 306
Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SPQ 387
VA TL+AGLDLDCG + +T AV QGKVKE DID ++ TV MRLG FDG +
Sbjct: 307 VAATLRAGLDLDCGPFLAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRLGMFDGDVAAQP 366
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKN---DQNTLPLNSAKVK-TVAVVGPHANAT 443
+ LG Q +C+ + ELA EAA + IVLLKN + LPL+S + TVAVVGPH+ AT
Sbjct: 367 FGHLGPQHVCTPAHRELALEAACQSIVLLKNGGGNNMRLPLSSHHRRGTVAVVGPHSEAT 426
Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACK-SNNSIFAASEAAKTADATI 501
VAMIGNYAG PC Y +P+ G YA T ++ GC DVAC+ S I AA +AA+ ADAT+
Sbjct: 427 VAMIGNYAGKPCAYTTPLQGVGRYARATVHQAGCTDVACQGSGQPIDAAVDAARHADATV 486
Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
++ GLD SVEAE LDR L LPG Q +L++ VA +KGPVILV+MS G VDIAFA+ + N
Sbjct: 487 VVVGLDQSVEAEGLDRTTLLLPGRQAELVSAVARASKGPVILVLMSGGPVDIAFAQNDRN 546
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
+ AILWAGYPG+ GG+AIADV+FG NPGG+LP+TWY DY++ P+T+M +R + GY
Sbjct: 547 VAAILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPEDYLRKAPMTNMAMRADPARGY 606
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
PGRTY+FY GPT++PFG+GLSYT+F + L + + + R + + +
Sbjct: 607 PGRTYRFYAGPTIHPFGHGLSYTKFAHTLAH--APAHLTVRRAAGHRTTAAINTTTASHL 664
Query: 682 PGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFV 739
V V +C+ VD +NVGS DG+ V VY+ PP A I ++Q++ F++V V
Sbjct: 665 NDVRVAHAQCEGLSVSVHVDVKNVGSRDGAHTVFVYASPPIAAIHGAPVRQLVAFEKVHV 724
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
AG R+K + C SL+I D +P GEH + +G
Sbjct: 725 AAGAVARVKMGVDVCGSLSIADQEGVRRIPIGEHRLMIGE 764
>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
Length = 925
Score = 766 bits (1978), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/741 (51%), Positives = 506/741 (68%), Gaps = 16/741 (2%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
S F FC++SLPY R DLVSR+TL EK +QL + A G+ RLG+P YEWWSEALHGVSN
Sbjct: 37 SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNS 96
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
G G HF D IP T FP VIL+ ASFNESLW +GQ VSTE RAMYN+G+AGLTYWSPN+
Sbjct: 97 GIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWSPNV 156
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
N+ RDPRWGR ETPGEDP VV RYAVNYVRGLQ+V G E + + LKVSSCCKHY
Sbjct: 157 NIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEG--NFAADRLKVSSCCKHYT 213
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
AYDVD WKGVDR+HFDA+VT QD+E+T+ PF+ CV+EG SSVMCSYNRVNG+P+CA+P
Sbjct: 214 AYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKXCVEEGHVSSVMCSYNRVNGVPTCANP 273
Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+LL +R +W L GYIV+DCDSI V + + ++ EDAVA LKAGL+L+CG Y +
Sbjct: 274 ELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNCGSYLGD 332
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSDENIELAA 406
+T NAV GKVKE+ +B++L Y Y VLMRLGFFDG P + GK D+C+ ++ LA
Sbjct: 333 YTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLAL 392
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
+AA++GIVLL N+ LPL+ KT+AV+GP+A+AT M+ NYAG+PCRY SP+ G
Sbjct: 393 DAAKQGIVLLHNN-GALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQK 451
Query: 467 YAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
Y + V+Y+ GC +V+C I A+ A ADAT+++ GLDL +EAE LDR +L LPG+
Sbjct: 452 YVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGF 511
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +L+ + A+ A G VILV+MSAG VDI+F + + I ILW GYPG+ GG AI+ V+FG
Sbjct: 512 QEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFG 571
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
+NPGGR P TWY +YV +P+T M +RP + +PGRTY+FY G +LY FG+GLSY+
Sbjct: 572 DYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATXNFPGRTYRFYTGKSLYQFGHGLSYST 631
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNL---NY-TSDASKTRCPGVLVNDLRCDDY--FEFKV 699
F + S T+ V+L N+ NY T T + ++ + C + + +
Sbjct: 632 FYKFIKSAPXTVLVHLLPQMDMPNIFSSNYPTMPNPNTNGQAIDISAIDCRNLSNIDIVI 691
Query: 700 DFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
+N G DG+ VV+ + KPP + + +++GF+RV V+ G+ + + + C ++
Sbjct: 692 GVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLDVCGKIS 751
Query: 759 IVDYAANTLLPAGEHTIFVGN 779
VD L G HT+ VG+
Sbjct: 752 NVDEEGKRKLVMGMHTLVVGS 772
>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Vitis vinifera]
gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
Length = 774
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/782 (49%), Positives = 524/782 (67%), Gaps = 28/782 (3%)
Query: 2 AKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPY 61
A V+ LCF + + S V G SSPVF CD LG F FC++SL
Sbjct: 8 APKVTVFLCFLSCFSHFLSSPKWV--LGQSSPVFACDVENNPTLG----QFGFCNTSLET 61
Query: 62 SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
+ RV DLV R+TL+EK+ L + A V RLG+P+YEWWSEALHGVS VGPGTHF+ ++PG
Sbjct: 62 AARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSIVPG 121
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
ATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGLT+WSPN+N+ RDPRWGR
Sbjct: 122 ATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQ 181
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ETPGEDP + +YA YVRGLQ +G + + D LKV++CCKHY AYD+DNWKGVDR
Sbjct: 182 ETPGEDPLLSSKYASAYVRGLQ--QGDDGSPDR----LKVAACCKHYTAYDLDNWKGVDR 235
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
HF+A VT+QDM++TF PF+ CV +G+ +SVMCS+N+VNG P+CADP LL+ VRGEW
Sbjct: 236 LHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSFNQVNGKPTCADPDLLSGIVRGEWK 295
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
L+GYIV+DCDS+ V ++ + + E+A A+ + AGLDL+CG + T AV+ G V
Sbjct: 296 LNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVD 354
Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
E+ +DK++ + LMRLGFFDG+P Y LG +D+C+ E+ E+A EAAR+GIVLLKN
Sbjct: 355 ESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQEMAREAARQGIVLLKN 414
Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDD 478
+ +LPL+ +KT+A++GP+AN T MIGNY G PC+Y +P+ G + TY GC +
Sbjct: 415 SKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSN 474
Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
VAC + I A + A ADAT+++ G+D S+EAE DR + LPG Q LI +VA+ +K
Sbjct: 475 VACGTAQ-IDEAKKIAAAADATVLIVGIDQSIEAEGRDRVSIQLPGQQPLLITEVAKASK 533
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
G VILV+MS GG DI+FA+ + I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY
Sbjct: 534 GNVILVVMSGGGFDISFAKNDDKIASILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWY 593
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
YV +P+T+M +RP + GYPGRTY+FY G T+Y FG GLSYTQF ++L+ K++
Sbjct: 594 PQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVS 653
Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYS 717
+ + + C + ++C V C + F+ + N G+ GS V ++S
Sbjct: 654 IPIEEGHSCHS---------SKCKSVDAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFS 704
Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
PP+ + + K ++GF++VFV A ++F + CK L+IVD + G H + V
Sbjct: 705 SPPS-VHNSPQKHLLGFEKVFVTAKAEALVRFKVDVCKDLSIVDELGTQKVALGLHVLHV 763
Query: 778 GN 779
G+
Sbjct: 764 GS 765
>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
Length = 818
Score = 765 bits (1975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/741 (51%), Positives = 506/741 (68%), Gaps = 16/741 (2%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
S F FC++SLPY R DLVSR+TL EK +QL + A G+ RLG+P YEWWSEALHGVSN
Sbjct: 61 SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNS 120
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
G G HF D IP T FP VIL+ ASFNESLW +GQ VSTE RAMYN+G+AGLTYWSPN+
Sbjct: 121 GIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWSPNV 180
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
N+ RDPRWGR ETPGEDP VV RYAVNYVRGLQ+V G E + + LKVSSCCKHY
Sbjct: 181 NIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEG--NFAADRLKVSSCCKHYT 237
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
AYDVD WKGVDR+HFDA+VT QD+E+T+ PF+ CV+EG SSVMCSYNRVNG+P+CA+P
Sbjct: 238 AYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGVPTCANP 297
Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+LL +R +W L GYIV+DCDSI V + + ++ EDAVA LKAGL+L+CG Y +
Sbjct: 298 ELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNCGSYLGD 356
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSDENIELAA 406
+T NAV GKVKE+ ++++L Y Y VLMRLGFFDG P + GK D+C+ ++ LA
Sbjct: 357 YTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLAL 416
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
+AA++GIVLL N+ LPL+ KT+AV+GP+A+AT M+ NYAG+PCRY SP+ G
Sbjct: 417 DAAKQGIVLLHNN-GALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQK 475
Query: 467 YAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
Y + V+Y+ GC +V+C I A+ A ADAT+++ GLDL +EAE LDR +L LPG+
Sbjct: 476 YVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGF 535
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +L+ + A+ A G VILV+MSAG VDI+F + + I ILW GYPG+ GG AI+ V+FG
Sbjct: 536 QEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFG 595
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
+NPGGR P TWY +YV +P+T M +RP + +PGRTY+FY G +LY FG+GLSY+
Sbjct: 596 DYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGRTYRFYTGKSLYQFGHGLSYST 655
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNL---NY-TSDASKTRCPGVLVNDLRCDDY--FEFKV 699
F + S T+ V+L N+ NY T T + ++ + C + + +
Sbjct: 656 FYKFIKSAPTTVLVHLLPQMDMPNIFSSNYPTMPNPNTNGQAIDISAIDCRNLSNIDIVI 715
Query: 700 DFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
+N G DG+ VV+ + KPP + + +++GF+RV V+ G+ + + + C ++
Sbjct: 716 GVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLDVCGKIS 775
Query: 759 IVDYAANTLLPAGEHTIFVGN 779
VD L G HT+ VG+
Sbjct: 776 NVDEEGKRKLVMGMHTLVVGS 796
>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
Length = 770
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/773 (48%), Positives = 511/773 (66%), Gaps = 27/773 (3%)
Query: 11 FSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVS 70
FS+ I ++ S+ +SPVF CD LG + FCD+SL RV DLV+
Sbjct: 12 FSI-IGFILLSSLLKQVLAQNSPVFACDVTSNPALG----NLTFCDASLAVENRVNDLVN 66
Query: 71 RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
R+TL EK+ L A GV RLG+P+YEWWSEALHGV+ GPG HF ++PGATSFP VIL
Sbjct: 67 RLTLGEKIGFLVSGAGGVSRLGIPKYEWWSEALHGVAYTGPGVHFTSLVPGATSFPQVIL 126
Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFV 190
T ASFN +L++ IG+ VSTEARAMYN+G AGLTYWSPN+N+ RDPRWGR ETPGEDP +
Sbjct: 127 TAASFNVTLFQTIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPTL 186
Query: 191 VGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTE 250
+Y V YV GLQ + D ++ LKV++CCKHY AYDVDNWKG++RY F+A V +
Sbjct: 187 TSKYGVAYVEGLQQTD------DGSTNKLKVAACCKHYTAYDVDNWKGIERYSFNAVVRQ 240
Query: 251 QDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADC 310
QD+++TF PF CV EG +SVMCSYN+VNG P+C DP LL VRGEW L+GYIV DC
Sbjct: 241 QDLDDTFQPPFRSCVLEGAVASVMCSYNQVNGKPTCGDPNLLAGIVRGEWKLNGYIVTDC 300
Query: 311 DSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLK 370
DS+QV+ + + + E+A A L +G+DL+CG + + +T AV Q V E+ ID+++
Sbjct: 301 DSLQVIFKSQNY-TKTPEEAAALGLNSGVDLNCGSWLSTYTQGAVNQKLVNESVIDRAIS 359
Query: 371 YLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
+ LMRLGFFDG+P+ Y +LG +D+C+ EN ELA EAAR+GIVLLKN +LPL
Sbjct: 360 NNFATLMRLGFFDGNPKSRIYGNLGPKDVCTPENQELAREAARQGIVLLKNTAGSLPLTP 419
Query: 428 AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
+K++AV+GP+AN T MIGNY GIPC+Y +P+ G + YK GC DV+C + I
Sbjct: 420 TAIKSLAVIGPNANVTKTMIGNYEGIPCKYTTPLQGLTASVATIYKPGCADVSCNTAQ-I 478
Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
A + A TADA +++ G D S+E ESLDR + LPG Q+ L+ +VA+VAKGPVILVIMS
Sbjct: 479 DDAKQIATTADAVVLVMGSDQSIEKESLDRTSITLPGQQSILVAEVAKVAKGPVILVIMS 538
Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
GG+D+ FA N I +ILW G+PGE GG A+ADV+FG +NP GRLP+TWY Y ++P
Sbjct: 539 GGGMDVQFAVDNPKITSILWVGFPGEAGGAALADVIFGYYNPSGRLPMTWYPQSYADVVP 598
Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
+T M +RP + YPGRTY+FY GPT++ FG+GLSY+QFK++L + + + L + C
Sbjct: 599 MTDMNMRPNPATNYPGRTYRFYTGPTVFTFGHGLSYSQFKHHLDKAPQFVSLPLGEKHTC 658
Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
R ++C V C + F+ + +NVG GS ++ +++ PP+ A
Sbjct: 659 R---------LSKCKTVDAVGQSCSNMGFDIHLRVKNVGKISGSHIIFLFTSPPSVHNAP 709
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
K ++GF++V + +KF N CK L++ D N + G H + +G+
Sbjct: 710 K-KHLLGFEKVHLTPQGEGVVKFNVNVCKHLSVHDELGNRKVALGPHVLHIGD 761
>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
Length = 810
Score = 760 bits (1963), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/760 (50%), Positives = 510/760 (67%), Gaps = 21/760 (2%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
+S F CD K Q + + FC++SL Y R KDL+SR+TL EKVQQ+ + A G+PR
Sbjct: 21 ASQNFACD-----KNSPQTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPR 75
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+P YEWWSEALHGVSNVG G F+ +PGATSFP +IL+ ASFNE+LW K+GQ VSTE
Sbjct: 76 LGIPAYEWWSEALHGVSNVGFGVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTE 135
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
AR M+++G AGLTYWSPN+NV RDPRWGR ETPGEDP VV RYAVNYVRGLQ+V N
Sbjct: 136 ARTMHSVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEGN 195
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
+T + LKVSSCCKHY AYD+D WKGVDR+HFDA+VT+QD+E+T+ PF CV+E
Sbjct: 196 ST---ADKLKVSSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQPPFRSCVEEAHV 252
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
SSVMCSYNRVNGIP+CADP LL +RGEW+L GYIV+DCDSI+V D+ + A + EDA
Sbjct: 253 SSVMCSYNRVNGIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDSINYTA-TPEDA 311
Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--- 387
VA LKAGL+++CG++ +T +AV+ KV+E+ +D++L Y + VLMRLGFFDG P+
Sbjct: 312 VALALKAGLNMNCGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLGFFDGDPKSLL 371
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
+ +LG D+CSD + +LA +AAR+GIVLL N + LPL+ + +AV+GP+AN T MI
Sbjct: 372 FGNLGPSDVCSDGHQKLALDAARQGIVLLYN-KGALPLSKNNTRNLAVIGPNANVTTTMI 430
Query: 448 GNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
NYAGIPC+Y +P+ G Y + VTY GC V+C + I AA++AA ADA ++L GL
Sbjct: 431 SNYAGIPCKYTTPLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAAADAVVLLVGL 490
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
D S+E E LDRE+L LPG+Q +L+ V G V+LV+MS+ +D++FA + IK IL
Sbjct: 491 DQSIEREGLDRENLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFAVNKSKIKGIL 550
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
W GYPG+ GG A+A V+FG +NP GR P TWY +Y +P+T M +R + +PGRTY
Sbjct: 551 WVGYPGQAGGDAVAQVMFGDYNPAGRSPFTWYPQEYAHQVPMTDMNMRANSTANFPGRTY 610
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH----CRNLNYTSDASKTRCP 682
+FY G TLY FG+GLSY+ F ++S T+ + N N T +
Sbjct: 611 RFYAGNTLYKFGHGLSYSTFSNFIISGPSTLLLKTNSDLKPDIILSTHNSTEEHPFINSQ 670
Query: 683 GVLVNDLRC-DDYFEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRVFV 739
+ + L C + + +N G G VV+V+ KPP +E+ Q++GF RV V
Sbjct: 671 AMDITTLNCTNSLLSLILGVRNNGPVSGDHVVLVFWKPPNSSEVTGAANVQLVGFSRVEV 730
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
G+ + + + CK L++VD L G+H +G+
Sbjct: 731 NRGKTQNVTLEIDVCKRLSLVDSEGKRKLVTGQHIFTIGS 770
>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 782
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/755 (50%), Positives = 511/755 (67%), Gaps = 26/755 (3%)
Query: 28 NGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG 87
+ S F CD ++ +S F FCDSSL + RV+DLV R+TL EK+ L + A
Sbjct: 40 SAQSPTAFACD----AETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARN 95
Query: 88 VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAV 147
V RLG+P+YEWWSEALHGVS VGPGT F +V+PGATSFP VILT ASFN SL++ IG+ V
Sbjct: 96 VTRLGIPKYEWWSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVV 155
Query: 148 STEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
STEARAMYN+G AGLTYWSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQ +
Sbjct: 156 STEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRD- 214
Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
D + LKV++CCKHY AYD+DNWKG DRYHF+A V+ QD+E+TF PF+ CV +
Sbjct: 215 -----DGDPDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVID 269
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
G+ +SVMCSYN+VNG P+CADP LL +RG+W L+GYIV+DCDS+ V+ ++ + S
Sbjct: 270 GNVASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHY-TKSP 328
Query: 328 EDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
E+A A+T+ AGLDLDCG + T AV G V E I K++ LMRLGFFDG+P
Sbjct: 329 EEAAAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPS 388
Query: 388 ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
Y LG +D+C+ E+ ELA EAAR+GIVLLKN +LPL+S+ +K++AV+GP+AN T
Sbjct: 389 KQLYGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTK 448
Query: 445 AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
MIGNY G PC+Y +P+ G S + +++ GC +VAC S + A + A +ADAT+++
Sbjct: 449 TMIGNYEGTPCKYTTPLQGLSAVVSTSFQPGCANVACTSAQ-LDEAKKIAASADATVLVV 507
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
G D S+EAES DR DL LPG Q LI +VA+ +KGPVILVIM+ GG+DI FA+ + I +
Sbjct: 508 GSDQSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITS 567
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILW G+PGE GG AIADV+FG FNP GRLP+TWY YV+ +P+T M +RP S G+PGR
Sbjct: 568 ILWVGFPGEAGGAAIADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGR 627
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+FY G T+Y FG GLSY+ FK++L+ K + + L + C + ++C +
Sbjct: 628 TYRFYTGETIYSFGDGLSYSDFKHHLVKAPKLVSIPLEEGHICHS---------SKCHSL 678
Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
V C + F+ + +NVG GS V +YS PP+ + + K ++GF++V + G
Sbjct: 679 EVVQESCQNLGFDVHLRVKNVGQRSGSHTVFLYSTPPS-VHNSPQKHLLGFEKVSLGRGG 737
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++F + CK L++ D + + G H + VG
Sbjct: 738 ETVVRFKVDVCKDLSVADEVGSRKVALGLHILHVG 772
>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
Length = 751
Score = 758 bits (1958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/781 (49%), Positives = 517/781 (66%), Gaps = 49/781 (6%)
Query: 2 AKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPY 61
A V+ LCF + + S V A SSPVF CD LG F FC++SL
Sbjct: 8 APKVTVFLCFLSCFSHFLSSPKWVLAQ--SSPVFACDVENNPTLG----QFGFCNTSLET 61
Query: 62 SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
+ RV DLV R+TL+EK+ L + A V RLG+P+YEWWSEALHGVS VGPGTHF+ V+PG
Sbjct: 62 AARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPG 121
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
ATSFP VILT ASFN SL++ IG+AVSTEARAMYN+G AGLT+WSPN+N+ RDPRWGR
Sbjct: 122 ATSFPQVILTAASFNASLFEAIGKAVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQ 181
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ETPGEDP + +YA YVRGLQ + D + LKV++CCKHY AYD+DNWKGVDR
Sbjct: 182 ETPGEDPLLSSKYASGYVRGLQQSD------DGSPDRLKVAACCKHYTAYDLDNWKGVDR 235
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
+HF+A VT+QDM++TF PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+ VRGEW
Sbjct: 236 FHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWK 295
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
L+GYIV+DCDS+ V ++ + + E+A A+ + AGLDL+CG + T AV+ G V
Sbjct: 296 LNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVD 354
Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
E+ +DK++ + LMRLGFFDG+P Y LG +D+C+ E+ ELA EAAR+GIVLLKN
Sbjct: 355 ESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKN 414
Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDD 478
+ +LPL+ +KT+AV+GP+AN T MIGNY G PC+Y +P+ G + TY GC +
Sbjct: 415 SKGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSN 474
Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
VAC + I A + A ADAT+++ G+D S+EAE DR ++ LPG Q LI +VA+ +K
Sbjct: 475 VACGTAQ-IDEAKKIAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASK 533
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
G VILV+MS GG DI+FA+ + I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY
Sbjct: 534 GNVILVVMSGGGFDISFAKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWY 593
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
YV +P+T+M +RP + GYPGRTY+FY G T+Y FG GLSYTQF ++L
Sbjct: 594 PQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHL-------- 645
Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
+ DA + C ++ F+ + N G+ GS V ++S
Sbjct: 646 --------------SVDAVQESCQNLV---------FDIHLRVNNAGNISGSHTVFLFSS 682
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
PP+ + + K ++GF++VFV A ++F + CK L+IVD + G H + VG
Sbjct: 683 PPS-VHNSPQKHLLGFEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVG 741
Query: 779 N 779
N
Sbjct: 742 N 742
>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Cucumis sativus]
Length = 809
Score = 758 bits (1958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/756 (50%), Positives = 511/756 (67%), Gaps = 26/756 (3%)
Query: 28 NGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG 87
+ S F CD ++ +S F FCDSSL + RV+DLV R+TL EK+ L + A
Sbjct: 67 SAQSPTAFACD----AETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARN 122
Query: 88 VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAV 147
V RLG+P+YEWWSEALHGVS VGPGT F +V+PGATSFP VILT ASFN SL++ IG+ V
Sbjct: 123 VTRLGIPKYEWWSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVV 182
Query: 148 STEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
STEARAMYN+G AGLTYWSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQ +
Sbjct: 183 STEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRD- 241
Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
D + LKV++CCKHY AYD+DNWKG DRYHF+A V+ QD+E+TF PF+ CV +
Sbjct: 242 -----DGDPDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVID 296
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
G+ +SVMCSYN+VNG P+CADP LL +RG+W L+GYIV+DCDS+ V+ ++ + S
Sbjct: 297 GNVASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHY-TKSP 355
Query: 328 EDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
E+A A+T+ AGLDLDCG + T AV G V E I K++ LMRLGFFDG+P
Sbjct: 356 EEAAAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPS 415
Query: 388 ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
Y LG +D+C+ E+ ELA EAAR+GIVLLKN +LPL+S+ +K++AV+GP+AN T
Sbjct: 416 KQLYGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTK 475
Query: 445 AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
MIGNY G PC+Y +P+ G S + +++ GC +VAC S + A + A +ADAT+++
Sbjct: 476 TMIGNYEGTPCKYTTPLQGLSAVVSTSFQPGCANVACTSAQ-LDEAKKIAASADATVLVV 534
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
G D S+EAES DR DL LPG Q LI +VA+ +KGPVILVIM+ GG+DI FA+ + I +
Sbjct: 535 GSDQSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITS 594
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILW G+PGE GG AIADV+FG FNP GRLP+TWY YV+ +P+T M +RP S G+PGR
Sbjct: 595 ILWVGFPGEAGGAAIADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGR 654
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+FY G T+Y FG GLSY+ FK++L+ K + + L + C + ++C +
Sbjct: 655 TYRFYTGETIYSFGDGLSYSDFKHHLVKAPKLVSIPLEEGHICHS---------SKCHSL 705
Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
V C + F+ + +NVG GS V +YS PP+ + + K ++GF++V + G
Sbjct: 706 EVVQESCQNLGFDVHLRVKNVGQRSGSHTVFLYSTPPS-VHNSPQKHLLGFEKVSLGRGG 764
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++F + CK L++ D + + G H + VG
Sbjct: 765 ETVVRFKVDVCKDLSVADEVGSRKVALGLHILHVGT 800
>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
Length = 755
Score = 758 bits (1957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/760 (50%), Positives = 512/760 (67%), Gaps = 28/760 (3%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CD +K GL S FC ++P +RV+DL+ R+TL EK++ L + A VPRLG+
Sbjct: 20 FACD----AKNGL-TRSLKFCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQ 74
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPGT F PGATSFP VI T ASFN+SLW++IG+ VS EARAM
Sbjct: 75 GYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARAM 134
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
+N G AGLTYWSPN+NV RDPRWGR ETPGEDP V G+YA +YVRGLQ G
Sbjct: 135 FNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNSGFR----- 189
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKHY AYD+DNW GVDRYHF+ARV++QD+E+T+ PF+ CV EG +SVM
Sbjct: 190 ----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVASVM 245
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG P+CADP LL T+RGEW L+GYIV+DCDS+ V+ +N + A +E A A T
Sbjct: 246 CSYNQVNGKPTCADPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEE-AAAAT 304
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+KAGLDLDCG + T NAV+ G + E D++ +L TV MRLG FDG P + L
Sbjct: 305 IKAGLDLDCGPFLAIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGKL 364
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ + +LA AA++GIVLL+N TLPL+ + TVAV+GP A+ TV MIGNYA
Sbjct: 365 GPRDVCTPAHQQLALHAAQQGIVLLQNSGRTLPLSRPNL-TVAVIGPIADVTVTMIGNYA 423
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
G+ C Y +P+ G S YA +++GC DVAC N A AA ADAT+++ GLD S+E
Sbjct: 424 GVACGYTTPLQGISRYAKTIHQSGCIDVACNGNQQFGMAEAAASQADATVLVMGLDQSIE 483
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AE DR+DL LPGYQ +LI++VA ++GP ILV+MS G +D++FA+ + I AILWAGYP
Sbjct: 484 AEFRDRKDLLLPGYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAGYP 543
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG AIADV+FG NPGG+LP+TWY DY+ +P+T+M +R S GYPGRTY+FY G
Sbjct: 544 GQAGGAAIADVLFGTTNPGGKLPMTWYPQDYLAKVPMTNMGMRADPSRGYPGRTYRFYKG 603
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
P ++PFG+G+SYT F ++L+ + + V L +N ++ + V+ C
Sbjct: 604 PVVFPFGHGMSYTTFAHSLVQAPQEVAVPFTSLYALQNTTAARNS-------IRVSHANC 656
Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ +D +N G DG ++V+S PP E + K++IGF++V + AG KR+K
Sbjct: 657 EPLVLGVHIDVKNTGDMDGIQTLLVFSSPP-EGKWSANKKLIGFEKVHIVAGSKKRVKID 715
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
CK L++VD LP G+H + +G+ S + N
Sbjct: 716 IPVCKHLSVVDRFGIRRLPIGKHDLHIGDLKHSISLQANL 755
>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
Length = 762
Score = 758 bits (1957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/772 (48%), Positives = 511/772 (66%), Gaps = 29/772 (3%)
Query: 10 CFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
CF I ++ + V + P F CDP K GL S+ FC++ +P RV+DL+
Sbjct: 3 CFKNLITFMLLISILVTLSEGRVP-FACDP----KNGL-TRSYKFCNTRVPIHARVQDLI 56
Query: 70 SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVI 129
R+ L EK++ + + A VPRLG+ YEWWSEALHGVSNVGPGT F ATSFP VI
Sbjct: 57 GRLALPEKIRLVVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFSAATSFPQVI 116
Query: 130 LTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF 189
T ASFN+SLW +IG+ VS EARAMYN G AGLT+WSPN+N+ RDPRWGR ETPGEDP
Sbjct: 117 TTAASFNQSLWLEIGRIVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPT 176
Query: 190 VVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
V G+YA +YV+GLQ G N LKV++CCKHY AYD+DNW GVDR+HF+A+V+
Sbjct: 177 VAGKYAASYVQGLQG-NGAGNR-------LKVAACCKHYTAYDLDNWNGVDRFHFNAKVS 228
Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
+QD+ +T+ PF+ CV++G +SVMCSYN+VNG P+CADP+LL T+RGEW L+GYIV+D
Sbjct: 229 KQDLADTYDVPFKACVRDGKVASVMCSYNQVNGKPTCADPELLRNTIRGEWGLNGYIVSD 288
Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSL 369
CDS+ V+ DN + + E A A +KAGLDLDCG + T A++QG + E D++ +L
Sbjct: 289 CDSVGVLYDNQHY-TRTPEQAAAAAIKAGLDLDCGPFLALHTDGAIKQGLISENDLNLAL 347
Query: 370 KYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA 428
L TV MRLG FDG Q Y +LG +D+C + ++A EAAR+GIVLL+N N LPL+
Sbjct: 348 ANLITVQMRLGMFDGDAQPYGNLGTRDVCLPSHNDVALEAARQGIVLLQNKGNALPLSPT 407
Query: 429 KVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIF 488
+ +TV V+GP+++ TV MIGNYAGI C Y +P+ G + Y ++ GC DV C N
Sbjct: 408 RYRTVGVIGPNSDVTVTMIGNYAGIACGYTTPLQGIARYVKTIHQAGCKDVGCGGNQLFG 467
Query: 489 AASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSA 548
+ + A+ ADAT+++ GLD S+EAE DR L LPG+Q +L+++VA A+GPVILV+MS
Sbjct: 468 LSEQVARQADATVLVMGLDQSIEAEFRDRTGLLLPGHQQELVSRVARAARGPVILVLMSG 527
Query: 549 GGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL 608
G +D+ FA+ + I AILW GYPG+ GG AIADV+FG+ NP GRLP TWY DYV+ +P+
Sbjct: 528 GPIDVTFAKNDPKISAILWVGYPGQSGGTAIADVIFGRTNPSGRLPNTWYPQDYVRKVPM 587
Query: 609 TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
T+M +R + GYPGRTY+FY GP ++PFG+GLSY++F ++L K + V
Sbjct: 588 TNMDMRANPATGYPGRTYRFYKGPVVFPFGHGLSYSRFTHSLALAPKQVSVQFTTPLTQA 647
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY 727
N ++ A K V+ CD+ F VD +N GS DG+ ++VYSK P
Sbjct: 648 FTNSSNKAMK-------VSHANCDELEVGFHVDVKNEGSMDGAHTLLVYSKAP-----NG 695
Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+KQ++ F + +V AG R+K + C L+ VD +P GEH + +G+
Sbjct: 696 VKQLVNFHKTYVPAGSKTRVKVGVHVCNHLSAVDEFGVRRIPMGEHELQIGD 747
>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
Length = 774
Score = 757 bits (1955), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/753 (49%), Positives = 500/753 (66%), Gaps = 26/753 (3%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
+ P F CDP L+ FC +P +RV+DL+ R+TL EK+ L + A VPR
Sbjct: 28 ARPPFACDPRNPITRTLK-----FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPR 82
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+ YEWWSEALHGVSNVGPGT F + GATSFP VI T ASFNESLW++IG+ VS E
Sbjct: 83 LGIQGYEWWSEALHGVSNVGPGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDE 141
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
ARAMYN G AGLT+WSPN+N+ RDPRWGR ETPGEDP + +Y YV+GLQ
Sbjct: 142 ARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQG------ 195
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
D LKV++CCKHY AYD+DNW GVDR+HF+ARV++QD+E+T+ PF CV +G+
Sbjct: 196 --DGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFRACVVDGNV 253
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
+SVMCSYN+VNG P+CADP+LL T+RG+W L+GYIV+DCDS+ V DN + + E+A
Sbjct: 254 ASVMCSYNQVNGKPTCADPELLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEEA 312
Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---Q 387
A +KAGLDLDCG + T AV+ G+V E DI+ +L TV MRLG FDG P +
Sbjct: 313 AAYAIKAGLDLDCGPFLGIHTEAAVRFGQVNEIDINYALANTITVQMRLGMFDGEPSAQR 372
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y +LG D+C + ELA EAAR+GIVLL+N N+LPL++ + +TVAV+GP+++ T MI
Sbjct: 373 YGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTMRHRTVAVIGPNSDVTETMI 432
Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
GNYAGI C Y +P+ G + Y ++ GC DV C N I AA AA+ ADAT+++ GLD
Sbjct: 433 GNYAGIACGYTTPLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLD 492
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
S+EAE DR DL LPG+Q +L+++VA ++GP ILVIMS G +D+ FA+ + I AI+W
Sbjct: 493 QSIEAEFRDRTDLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIW 552
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
GYPG+ GG AIADV+FG NP G+LP+TWY +YV LP+T M +R + GYPGRTY+
Sbjct: 553 VGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYR 612
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
FY GP ++PFG GLSYT+F ++L + V L +N + + V+
Sbjct: 613 FYKGPVVFPFGLGLSYTRFSHSLAQGPTLVSVPFTSLVASKNTTMLGNHD------IRVS 666
Query: 688 DLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
CD + +D +N G+ DG+ ++V++ PP A KQ++GF +V + AG +R
Sbjct: 667 HTNCDSLSLDVHIDIKNSGTMDGTHTLLVFATPPTGKWAPN-KQLVGFHKVHIVAGSERR 725
Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++ CK L++VD +P G+H + +G+
Sbjct: 726 VRVGVQVCKHLSVVDELGIRRIPLGQHKLEIGD 758
>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
Length = 769
Score = 756 bits (1952), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/733 (50%), Positives = 498/733 (67%), Gaps = 31/733 (4%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CDP + GL S FC +S+P +RV+DL+ R+TL EK++ L + A VPRLG+
Sbjct: 28 FACDP----RNGL-TRSLRFCRTSVPIHVRVQDLIGRLTLQEKIRLLVNNAAAVPRLGIQ 82
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPGT F PGATSFP VI T A+FNESLW++IG+ VS EARAM
Sbjct: 83 GYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAAAFNESLWEEIGRVVSDEARAM 142
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN G AGLTYWSPN+N+ RDPRWGR ETPGEDP + G+YA +YVR LQ G
Sbjct: 143 YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQGNTGSR----- 197
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKHY AYD+DNW GVDRYHF+ARV++QD+E+T+ PF+ CV EG +SVM
Sbjct: 198 ----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 253
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG P+CADP +L T+RG+W L GYIV+DCDS+ V+ + + + E+A A
Sbjct: 254 CSYNQVNGKPTCADPDILKNTIRGQWRLDGYIVSDCDSVGVLYNTQHY-TRTPEEAAADA 312
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+KAGLDLDCG + T AV+ G ++E D++ + Y TV MRLG FDG P + +L
Sbjct: 313 IKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLASAYTITVQMRLGMFDGEPSAQPFGNL 372
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ + +LA +AA +GIVLLKN TLPL++ + TVAV+GP+++ TV MIGNYA
Sbjct: 373 GPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA 432
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
G+ C Y +P+ G S YA ++ GC VAC N I AA AA+ ADAT+++ GLD S+E
Sbjct: 433 GVACGYTTPLQGISRYAKTIHQAGCLGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 492
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AE +DR L LPG Q +L+++VA+ ++GPV+LV+M G VD++FA+ + I AILW GYP
Sbjct: 493 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 552
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG AIADV+FG+ NPGG+LP+TWY DYV LP+T M +R GYPGRTY+FY G
Sbjct: 553 GQAGGAAIADVLFGRANPGGKLPMTWYPQDYVARLPMTDMRMRA--GRGYPGRTYRFYKG 610
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNL-NKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
P ++PFG+G+SYT F + L V + L +N +S+A + V
Sbjct: 611 PVVFPFGHGMSYTTFAHTLSKAPNQFSVPIATSLYAFKNTTISSNA-------IRVAHTN 663
Query: 691 CDDYFE--FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
C+D VD +N G G+ ++V++KPPA + KQ+IGF++V V AG + ++
Sbjct: 664 CNDAMSLGLHVDVKNTGDMAGTHTLLVFAKPPAGNWSPN-KQLIGFKKVHVTAGALQSVR 722
Query: 749 FVFNACKSLNIVD 761
+ CK L++VD
Sbjct: 723 LDIHVCKHLSVVD 735
>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
Length = 774
Score = 756 bits (1952), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/779 (49%), Positives = 522/779 (67%), Gaps = 28/779 (3%)
Query: 7 SLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVK 66
+LL + + +LVF V ++ S P+F CDP GL + FC +++P +RV+
Sbjct: 7 ALLIGNKVVVILVFLLCLVHSSESLRPLFACDPAN----GL-TRTLRFCRANVPIHVRVQ 61
Query: 67 DLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFP 126
DL+ R+TL EK++ L + A VPRLG+ YEWWSEALHG+S+VGPG F PGATSFP
Sbjct: 62 DLLGRLTLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFP 121
Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGE 186
VI T ASFN+SLW++IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGE
Sbjct: 122 QVITTAASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGE 181
Query: 187 DPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA 246
DP V +YA +YVRGLQ T +R LKV++CCKHY AYD+DNW GVDR+HF+A
Sbjct: 182 DPIVAAKYAASYVRGLQ-------GTAAGNR-LKVAACCKHYTAYDLDNWNGVDRFHFNA 233
Query: 247 RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYI 306
+VT+QD+E+T+ PF+ CV EG +SVMCSYN+VNG P+CAD LL T+RG+W L+GYI
Sbjct: 234 KVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGQWRLNGYI 293
Query: 307 VADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDID 366
V+DCDS+ V N + + E+A A+++KAGLDLDCG + FT AV++G + E DI+
Sbjct: 294 VSDCDSVDVFF-NQQHYTSTPEEAAARSIKAGLDLDCGPFLAIFTEGAVKKGLLTENDIN 352
Query: 367 KSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
+L TV MRLG FDG+ Y +LG +D+C+ + LA EAA +GIVLLKN +LPL
Sbjct: 353 LALANTLTVQMRLGMFDGNLGPYANLGPRDVCTPAHKHLALEAAHQGIVLLKNSARSLPL 412
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
+ + +TVAV+GP+++ T MIGNYAG C Y SP+ G S YA ++ GC VACK N
Sbjct: 413 SPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTSPLQGISRYARTLHQAGCAGVACKGNQ 472
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
AA AA+ ADAT+++ GLD S+EAE+ DR L LPGYQ L+ +VA+ ++GPVILV+
Sbjct: 473 GFGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVL 532
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
MS G +D+ FA+ + + AI+WAGYPG+ GG AIA+++FG NPGG+LP+TWY DYV
Sbjct: 533 MSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAK 592
Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL-SFTKTIQVNLNKL 664
+P+T M +R S YPGRTY+FY GP ++PFG+GLSYT F ++L S + V+L+
Sbjct: 593 VPMTVMAMRA--SGNYPGRTYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQLSVSLS-- 648
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
NLN + + + V+ C+ + V+ N G DG+ V V+++PP
Sbjct: 649 ----NLNSANTILNSSSHSIKVSHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPIN 704
Query: 723 -IAATYI-KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
I + KQ+I F++V V AG + ++ +ACK L +VD +P GEH + +G+
Sbjct: 705 GIKGLGVNKQLIAFEKVHVMAGAKQTVQVDVDACKHLGVVDEYGKRRIPMGEHKLHIGD 763
>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
Length = 772
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/777 (49%), Positives = 518/777 (66%), Gaps = 34/777 (4%)
Query: 7 SLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVK 66
SL+ L ++ L+F N V A P F CDP G F FC + +P +RV+
Sbjct: 10 SLIALVLCVSALLF--NLVHAR----PPFACDPRNPLTRG-----FKFCRTRVPVHVRVQ 58
Query: 67 DLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFP 126
DL+ R+TL EK++ L + A VPRLG+ YEWWSEALHGVSNVGPGT F PGATSFP
Sbjct: 59 DLIGRLTLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFP 118
Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGE 186
VI T ASFN+SLW++IGQ VS EARAMYN G+AGLTYWSPN+N+ RDPRWGR ETPGE
Sbjct: 119 QVITTAASFNQSLWQEIGQVVSDEARAMYNGGQAGLTYWSPNVNIFRDPRWGRGQETPGE 178
Query: 187 DPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA 246
DP + +YA +YV+GLQ D LKV++CCKHY AYD+DNW GVDR+HF+A
Sbjct: 179 DPVLSAKYAASYVKGLQG--------DGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNA 230
Query: 247 RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYI 306
RV++QD+ +T+ PF CV EG +SVMCSYN+VNG P+CADP LL T+RGEW L+GYI
Sbjct: 231 RVSKQDLADTYDVPFRGCVLEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGEWKLNGYI 290
Query: 307 VADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDID 366
V+DCDS+ V D + + E+A A+ +KAGLDLDCG + T A++ G + E D+D
Sbjct: 291 VSDCDSVGVFYDQQHY-TRTPEEAAAEAIKAGLDLDCGPFLAIHTEGAIKAGLLPEIDVD 349
Query: 367 KSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTL 423
+L TV MRLG FDG P QY +LG +D+C+ + ELA EA+R+GIVLL+N+ +TL
Sbjct: 350 YALANTLTVQMRLGMFDGEPSAQQYGNLGPRDVCTPAHQELALEASRQGIVLLQNNGHTL 409
Query: 424 PLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKS 483
PL++ + +TVAVVGP+++ T MIGNYAG+ C Y +P+ G Y ++ GC +VAC +
Sbjct: 410 PLSTVRHRTVAVVGPNSDVTETMIGNYAGVACGYTTPLQGIGRYTKTIHQQGCTNVACTT 469
Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
N AA AA+ ADAT+++ GLD S+EAE DR DL +PG+Q +L+++VA ++GP +L
Sbjct: 470 NQLFGAAEAAARQADATVLVMGLDQSIEAEFRDRTDLVMPGHQQELVSRVARASRGPTVL 529
Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
V+MS G +D++FA+ + I AI+W GYPG+ GG A+ADV+FG NP G+LP+TWY DYV
Sbjct: 530 VLMSGGPIDVSFAKNDPKIGAIIWVGYPGQAGGTAMADVLFGTTNPSGKLPMTWYPQDYV 589
Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
+P+T+M +R GYPGRTY+FY GP ++PFG GLSYT F ++L ++ V L
Sbjct: 590 SKVPMTNMAMRA--GRGYPGRTYRFYKGPVVFPFGLGLSYTTFAHSLAQVPTSVSVPLTS 647
Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
L N S A V V+ C+ V +N G+ DG+ ++V+S PP+
Sbjct: 648 LSATTNSTMLSSA-------VRVSHTNCNPLSLALHVVVKNTGARDGTHTLLVFSSPPSG 700
Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
A KQ++GF +V + AG +KR+K + CK L++VD +P GEH + +G+
Sbjct: 701 KWAAN-KQLVGFHKVHIVAGSHKRVKVDVHVCKHLSVVDQFGIRRIPIGEHKLQIGD 756
>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
Length = 771
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/771 (49%), Positives = 511/771 (66%), Gaps = 24/771 (3%)
Query: 11 FSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVS 70
F L I +L F+ + G S F CDP L+ FC +SLP +RV+DL++
Sbjct: 6 FILIIFVLAFAYS-----GESRQPFACDPANAGIRNLR-----FCKTSLPIHVRVQDLIA 55
Query: 71 RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
R+TL EK++ L + A V RLG+ YEWWSEALHGVSN G G F PGATSFP VI
Sbjct: 56 RLTLQEKIRLLVNNAAPVQRLGISGYEWWSEALHGVSNTGYGVKFGGAFPGATSFPQVIT 115
Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFV 190
T ASFN SLW++IG+ VS E RAMYN G AGLT+WSPN+N+ RDPRWGR ETPGEDP +
Sbjct: 116 TAASFNASLWEEIGRVVSEEGRAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPHL 175
Query: 191 VGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTE 250
V +Y V+YV+GLQ G N LKV++CCKHY AYD+D+W G DRYHF+A+V+
Sbjct: 176 VAQYGVSYVKGLQGGGGRGNTR------LKVAACCKHYTAYDLDDWNGYDRYHFNAKVSM 229
Query: 251 QDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADC 310
QD+E+T+ PF+ CV EG+ +SVMCSYN++NG PSCADP LL T+R +W L+GYIV+DC
Sbjct: 230 QDLEDTYNAPFKACVVEGNVASVMCSYNQINGKPSCADPTLLRDTIRNQWHLNGYIVSDC 289
Query: 311 DSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLK 370
DS+ V+ + + EDA A T+KAGLDLDCG + T AV GKV + +I+ +L
Sbjct: 290 DSVGVLFEKQHY-TRYPEDAAAITIKAGLDLDCGPFLAIHTDKAVHTGKVSQVEINNALA 348
Query: 371 YLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
TV MRLG FDG + Y +LG +D+CS + +LA +AAREGIVLLKN LPL++ +
Sbjct: 349 NTITVQMRLGMFDGPNGPYANLGPKDVCSPAHQQLALQAAREGIVLLKNIGQALPLSTKR 408
Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
+TVAV+GP+++AT+AMIGNYAG+PC Y+SP+ G S YA ++ GC VAC N +
Sbjct: 409 HRTVAVIGPNSDATLAMIGNYAGVPCGYISPLQGISRYARTIHQQGCMGVACPGNQNFGL 468
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A AA+ ADAT+++ GLD S+EAE+ DR L LPG+Q LI++VA +KGPV+LV+MS G
Sbjct: 469 AEVAARHADATVLVMGLDQSIEAEAKDRVTLLLPGHQQDLISRVAMASKGPVVLVLMSGG 528
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+D+ FA+ + + +I+W GYPG+ GG AIADV+FG NPGG+LP+TWY DYV + +
Sbjct: 529 PIDVTFAKNDPRVSSIVWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQDYVAKVSMA 588
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV-NLNKLQHCR 668
+M +R S GYPGRTY+FY GPT++PFG G+SYT F +L+S T+ V L+
Sbjct: 589 NMDMRANPSKGYPGRTYRFYKGPTVFPFGAGISYTTFSQHLVSAPITVSVPTLHSHDLVS 648
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
N T +K + N D + +D +N G DG+ V+++S PP T
Sbjct: 649 NNTTTLMKAKATVRTIHTNCESLD--IDMHIDVKNTGDMDGTHAVLIFSTPPDP---TET 703
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
KQ++ F++V V AG +R+K NACK L++ D + GEH I VG+
Sbjct: 704 KQLVAFEKVHVVAGAKQRVKINMNACKHLSVADEYGVRRIYMGEHKIHVGD 754
>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/755 (49%), Positives = 506/755 (67%), Gaps = 29/755 (3%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
+ P F CDP GL+ FC ++P +RV+DL+ R+TL EK++ L + A VPR
Sbjct: 28 ARPPFACDPHNPITRGLK-----FCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPR 82
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+ YEWWSEALHGVSNVGPGT F PGATSFP VI T ASFNESLW++IG+ V E
Sbjct: 83 LGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDE 142
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
ARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGEDP + +YA YV+GLQ
Sbjct: 143 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG------ 196
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
D LKV++CCKHY AYD+DNW GV+R+HF+ARV++QD+ +T+ PF+ CV EG
Sbjct: 197 --DGAGNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHV 254
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
+SVMCSYN+VNG P+CADP LL T+RG+W L+GYIV+DCDS+ V+ + + + E+A
Sbjct: 255 ASVMCSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHY-TRTPEEA 313
Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---Q 387
A +KAGLDLDCG + T AV++G V + +I+ +L TV MRLG FDG P Q
Sbjct: 314 AADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQ 373
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y +LG +D+C+ + +LA EAAR+GIVLL+N +LPL+ + +TVAV+GP+++ TV MI
Sbjct: 374 YGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMI 433
Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
GNYAG+ C Y +P+ G Y ++ GC DV C N AA AA+ ADAT+++ GLD
Sbjct: 434 GNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLD 493
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
S+EAE +DR L LPG+Q +L+++VA ++GP ILV+MS G +D+ FA+ + I AI+W
Sbjct: 494 QSIEAEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIW 553
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
GYPG+ GG AIADV+FG NPGG+LP+TWY +YV LP+T M +R + GYPGRTY+
Sbjct: 554 VGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYR 613
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
FY GP ++PFG GLSYT F +NL ++ V L L+ N S A V V+
Sbjct: 614 FYRGPVVFPFGLGLSYTTFAHNLAHGPTSVSVPLTSLKATANSTMLSKA-------VRVS 666
Query: 688 DLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGRN 744
C+ + VD +N GS DG+ ++V++ PP + AA+ KQ++GF ++ + AG
Sbjct: 667 HADCNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWAAS--KQLVGFHKIHIAAGSE 724
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
R++ + CK L++VD +P GEH + +G+
Sbjct: 725 TRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 759
>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
Length = 775
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/755 (49%), Positives = 506/755 (67%), Gaps = 29/755 (3%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
+ P F CDP GL+ FC ++P +RV+DL+ R+TL EK++ L + A VPR
Sbjct: 28 ARPPFACDPHNPITRGLK-----FCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPR 82
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+ YEWWSEALHGVSNVGPGT F PGATSFP VI T ASFNESLW++IG+ V E
Sbjct: 83 LGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDE 142
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
ARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGEDP + +YA YV+GLQ
Sbjct: 143 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG------ 196
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
D LKV++CCKHY AYD+DNW GV+R+HF+ARV++QD+ +T+ PF+ CV EG
Sbjct: 197 --DGAGNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHV 254
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
+SVMCSYN+VNG P+CADP LL T+RG+W L+GYIV+DCDS+ V+ + + + E+A
Sbjct: 255 ASVMCSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHY-TRTPEEA 313
Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---Q 387
A +KAGLDLDCG + T AV++G V + +I+ +L TV MRLG FDG P Q
Sbjct: 314 AADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQ 373
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y +LG +D+C+ + +LA EAAR+GIVLL+N +LPL+ + +TVAV+GP+++ TV MI
Sbjct: 374 YGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMI 433
Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
GNYAG+ C Y +P+ G Y ++ GC DV C N AA AA+ ADAT+++ GLD
Sbjct: 434 GNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLD 493
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
S+EAE +DR L LPG+Q +L+++VA ++GP ILV+MS G +D+ FA+ + I AI+W
Sbjct: 494 QSIEAEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIW 553
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
GYPG+ GG AIADV+FG NPGG+LP+TWY +YV LP+T M +R + GYPGRTY+
Sbjct: 554 VGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYR 613
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
FY GP ++PFG GLSYT F +NL ++ V L L+ N S A V V+
Sbjct: 614 FYRGPVVFPFGLGLSYTTFAHNLAHGPTSVSVPLTSLKATANSTMLSKA-------VRVS 666
Query: 688 DLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGRN 744
C+ + VD +N GS DG+ ++V++ PP + AA+ KQ++GF ++ + AG
Sbjct: 667 HADCNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWAAS--KQLVGFHKIHIAAGSE 724
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
R++ + CK L++VD +P GEH + +G+
Sbjct: 725 TRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 759
>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 754 bits (1946), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/753 (49%), Positives = 503/753 (66%), Gaps = 26/753 (3%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
+ P F CDP L+ FC +P +RV+DL+ R+TL EK+ L + A VPR
Sbjct: 28 ARPPFACDPRNPITRTLK-----FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPR 82
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+ YEWWSEALHGVSNVGPGT F + GATSFP VI T ASFNESLW++IG+ VS E
Sbjct: 83 LGIQGYEWWSEALHGVSNVGPGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDE 141
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
ARAMYN G AGLT+WSPN+N+ RDPRWGR ETPGEDP + +Y YV+GLQ
Sbjct: 142 ARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG------ 195
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
D LKV++CCKHY AYD+DNW GVDR+HF+ARV++QD+E+T+ PF+ CV +G+
Sbjct: 196 --DGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNV 253
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
+SVMCSYN+VNG P+CADP LL T+RG+W L+GYIV+DCDS+ V DN + + E A
Sbjct: 254 ASVMCSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEAA 312
Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---Q 387
A +KAGLDLDCG + T A++ G+V E DI+ +L TV MRLG FDG P +
Sbjct: 313 AAYAIKAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQR 372
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y +LG D+C + ELA EAAR+GIVLL+N N+LPL++ + +TVAV+GP+++ T MI
Sbjct: 373 YGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMI 432
Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
GNYAGI C Y +P+ G + Y ++ GC DV C N I AA AA+ ADAT+++ GLD
Sbjct: 433 GNYAGIACGYTTPLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLD 492
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
S+EAE DR L LPG+Q +L+++VA ++GP ILVIMS G +D+ FA+ + I AI+W
Sbjct: 493 QSIEAEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIW 552
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
GYPG+ GG AIADV+FG NP G+LP+TWY +YV LP+T M +R + GYPGRTY+
Sbjct: 553 VGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYR 612
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
FY GP ++PFG GLSYT+F ++L + V L L +N S+ GV V+
Sbjct: 613 FYKGPVVFPFGMGLSYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSNH------GVRVS 666
Query: 688 DLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
CD +F +D +N G+ DG+ ++V++ PA A KQ++GF +V + AG +R
Sbjct: 667 HTNCDSLSLDFHIDIKNTGTMDGTHTLLVFATQPAGKWAPN-KQLVGFHKVHIVAGSERR 725
Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++ + CK L+IVD +P G+H + +G+
Sbjct: 726 VRVGVHVCKHLSIVDKLGIRRIPLGQHKLEIGD 758
>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 754 bits (1946), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/773 (50%), Positives = 513/773 (66%), Gaps = 28/773 (3%)
Query: 11 FSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVS 70
F L LVF + V A SSPVF CD L +S FC++S+ + RV DLV
Sbjct: 16 FLLFCMFLVFLSTHVSAQ--SSPVFACDVVSNPSL----ASLGFCNTSIGINDRVVDLVK 69
Query: 71 RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
R+TL EK+ L + A V RLG+P+YEWWSEALHGVS VGPGTHF D + GATSFP VIL
Sbjct: 70 RLTLQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVIL 129
Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFV 190
T ASFN SL++ IG+ VSTEARAMYN+G AGLT+WSPNIN+ RDPRWGR ETPGEDP +
Sbjct: 130 TAASFNTSLFEAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLL 189
Query: 191 VGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTE 250
+Y YV+GLQ + D + LKV++CCKHY AYD+DNWKG DRYHF+A VT+
Sbjct: 190 SSKYGSCYVKGLQQRD------DGDPDKLKVAACCKHYTAYDLDNWKGSDRYHFNAVVTK 243
Query: 251 QDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADC 310
QDM++TF PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+ +RGEW+L+GYIV DC
Sbjct: 244 QDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYIVTDC 303
Query: 311 DSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLK 370
DS+ V + + +E A A L AG+DL+CG + T AV+ G V E ID ++
Sbjct: 304 DSLDVFYKSQNYTKTPEEAAAAAIL-AGVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVS 362
Query: 371 YLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
+ LMRLGFFDG P Y LG +D+C+ EN ELA EAAR+GIVLLKN +LPL+
Sbjct: 363 NNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSP 422
Query: 428 AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
+K +AV+GP+AN T MIGNY G PC+Y +P+ G + TY GC +VAC S +
Sbjct: 423 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAASVATTYLPGCSNVAC-STAQV 481
Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
A + A ADAT+++ G DLS+EAES DR D+ LPG Q LI VA V+ GPVILVIMS
Sbjct: 482 DDAKKLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMS 541
Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
GG+D++FA TN I +ILW GYPGE GG AIAD++FG +NP GRLP+TWY YV +P
Sbjct: 542 GGGMDVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNPSGRLPMTWYPQSYVDKVP 601
Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
+T+M +RP S GYPGRTY+FY G T+Y FG GLSY+QF + L+ + + V L + C
Sbjct: 602 MTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAPQLVYVPLEESHVC 661
Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDD-YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
+ + C V+ ++ C + F+ + +N G+ GS V ++S PPA + +
Sbjct: 662 HS---------SECQSVVASEQTCQNSTFDMLLRVKNEGTISGSHTVFLFSSPPA-VHNS 711
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
K ++GF++VF+ A + ++F + CK L++VD + + GEH + VG+
Sbjct: 712 PQKHLVGFEKVFLNAQTGRHVRFKVDICKDLSVVDELGSKKVALGEHVLHVGS 764
>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
Length = 765
Score = 753 bits (1945), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/755 (49%), Positives = 510/755 (67%), Gaps = 29/755 (3%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +PVF CD + +S + FCD + + R DL+ R+TL EKV L + +P
Sbjct: 26 AQTPVFACDASNAT-----VSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALP 80
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN SL++ IG+ VST
Sbjct: 81 RLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVST 140
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +YAV YV GLQD G
Sbjct: 141 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGS 200
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+A LKV++CCKHY AYDVDNWKGV+RY FDA V++QD+++TF PF+ CV +G+
Sbjct: 201 DA-------LKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGN 253
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNG P+CAD LL+ +RG+W L+GYIV+DCDS+ V+ +N + + ED
Sbjct: 254 VASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKN-PED 312
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A A T+K+GLDL+CG + T AVQ GK+ E+D+D+++ + VLMRLGFFDG P+
Sbjct: 313 AAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKL 372
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ SLG +D+C+ N ELA EAAR+GIVLLKN LPL++ +K++AV+GP+ANA+ M
Sbjct: 373 PFGSLGPKDVCTSSNQELAREAARQGIVLLKN-TGALPLSAKSIKSMAVIGPNANASFTM 431
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
IGNY G PC+Y +P+ G Y+ GC +V C N+ + AA++AA +AD T+++ G
Sbjct: 432 IGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVG 491
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
D SVE ESLDR L LPG Q QL++ VA ++GPVILV+MS G DI+FA+++ I AI
Sbjct: 492 ADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAI 551
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LW GYPGE GG A+AD++FG NPGGRLP+TWY + + +T M +RP S GYPGRT
Sbjct: 552 LWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRT 611
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YTSDASKTRCPGV 684
Y+FY G T+Y FG GLSYT+F ++L+S + + V L + C + ++ +A+ C +
Sbjct: 612 YRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHACHTEHCFSVEAAGEHCGSL 671
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
F+ + +N G G V ++S PP+ + + K ++GF++V + G+
Sbjct: 672 ---------SFDVHLRVRNAGGMAGGHTVFLFSSPPS-VHSAPAKHLLGFEKVSLEPGQA 721
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ F + CK L++VD N + G HT+ VG+
Sbjct: 722 GVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 756
>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/792 (49%), Positives = 526/792 (66%), Gaps = 33/792 (4%)
Query: 4 VVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCD-PGRFSKLGLQMSSFLFCDSSLPYS 62
V + LLCF L I+ +N SSPVF CD G S GL+ FC++ L
Sbjct: 16 VSTLLLCFLLCIS--------EQSNAQSSPVFACDVTGNPSLAGLR-----FCNTGLNIK 62
Query: 63 IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
RV DLV R+TL+EK+ LG A GV RLG+P Y+WWSEALHGVSNVG G+ F +PGA
Sbjct: 63 SRVTDLVGRLTLEEKIGFLGSNAIGVSRLGIPAYKWWSEALHGVSNVGGGSSFSGQVPGA 122
Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
TSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGLT+WSPN+N+ RDPRWGR E
Sbjct: 123 TSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQE 182
Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
TPGEDP + +YAV YVRGLQ+ +G + LKV++CCKHY AYDVDNWK V R+
Sbjct: 183 TPGEDPELSSKYAVAYVRGLQETDGGD------PNRLKVAACCKHYTAYDVDNWKDVHRF 236
Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
F+A V +QDM +TF PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+ +RG+W L
Sbjct: 237 TFNAVVNQQDMADTFQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGQWKL 296
Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
+GYIV+DCDS+ V+ + + E+AVA+++ AGLDL+C + + AV+ G V E
Sbjct: 297 NGYIVSDCDSVDVLYTKQHY-TKTPEEAVAKSILAGLDLNCDHFTGQYAMKAVKVGLVNE 355
Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
T IDK++ + LMRLGFFDG P+ Y LG D+C+ N ELA +AAR+GIVLLKN
Sbjct: 356 TAIDKAISNNFATLMRLGFFDGDPKKQQLYGGLGPNDVCTANNQELARDAARQGIVLLKN 415
Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDD 478
+LPL+ + +KT+AV+GP+ANAT MIGNY GIPC+Y +P+ G + + TY+ GC +
Sbjct: 416 SAGSLPLSPSAIKTLAVIGPNANATETMIGNYNGIPCKYTTPLQGLAETVSSTYQLGC-N 474
Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
VAC + + +A+ A +ADA +++ G D S+E E+LDR DL+LPG Q +L+ QVA+VAK
Sbjct: 475 VAC-AEPDLGSAAALAASADAVVLVMGADQSIEQENLDRLDLYLPGKQQELVTQVAKVAK 533
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
GPV+LVIMS G DI FA+ I I+W GYPGE GG AIADV+FG+ NP G LP+TWY
Sbjct: 534 GPVVLVIMSGGAFDITFAKNEEKITGIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWY 593
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
YV+ +P+T+M +RP S GYPGRTY+FY G T+Y FG GLSYT F + +L K +
Sbjct: 594 PQSYVEKVPMTNMNMRPDKSNGYPGRTYRFYTGETVYAFGDGLSYTNFNHQILKAPKLVS 653
Query: 659 VNLNKLQHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
++L++ CR+ S DA C + L FE ++ +NVG +GS V +++
Sbjct: 654 LDLDENHACRSSECQSVDAIGPHCDNAVGGGLN----FEVQLKVRNVGDREGSHTVFLFT 709
Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
PP E+ + K ++GF+++ + I+F + CK L++VD + G + + V
Sbjct: 710 TPP-EVHGSPRKHLLGFEKIRLGEKEETVIRFNVDVCKDLSVVDEIGKRKIALGHYLLHV 768
Query: 778 GNGGVSFPIHLN 789
G+ S I ++
Sbjct: 769 GSFKHSLTISVS 780
>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
Length = 774
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/753 (49%), Positives = 503/753 (66%), Gaps = 26/753 (3%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
+ P F CDP L+ FC +P +RV+DL+ R+TL EK+ L + A VPR
Sbjct: 28 ARPPFACDPRNPITRTLK-----FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPR 82
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+ YEWWSEALHGVSNVGPGT F + GATSFP VI T ASFNESLW++IG+ VS E
Sbjct: 83 LGIQGYEWWSEALHGVSNVGPGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDE 141
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
ARAMYN G AGLT+WSPN+N+ RDPRWGR ETPGEDP + +Y YV+GLQ
Sbjct: 142 ARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG------ 195
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
D LKV++CCKHY AYD+DNW GVDR+HF+ARV++QD+E+T+ PF+ CV +G+
Sbjct: 196 --DGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNV 253
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
+SVMCSYN+VNG P+CADP LL T+RG+W L+GYIV+DCDS+ V DN + + E A
Sbjct: 254 ASVMCSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEAA 312
Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---Q 387
A +KAGLDLDCG + T A++ G+V E DI+ +L TV MRLG FDG P +
Sbjct: 313 AAYAIKAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQR 372
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y +LG D+C + ELA EAAR+GIVLL+N N+LPL++ + +TVAV+GP+++ T MI
Sbjct: 373 YGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMI 432
Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
GNYAGI C Y +P+ G + Y ++ GC DV C N I AA AA+ ADAT+++ GLD
Sbjct: 433 GNYAGIACGYTTPLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLD 492
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
S+EAE DR L LPG+Q +L+++VA ++GP ILVIMS G +D+ FA+ + I AI+W
Sbjct: 493 QSIEAEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPCIGAIIW 552
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
GYPG+ GG AIADV+FG NP G+LP+TWY +YV LP+T M +R + GYPGRTY+
Sbjct: 553 VGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYR 612
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
FY GP ++PFG GLSYT+F ++L + V L L +N S+ GV V+
Sbjct: 613 FYKGPVVFPFGMGLSYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSNH------GVRVS 666
Query: 688 DLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
CD +F +D +N G+ DG+ ++V++ PA A KQ++GF +V + AG +R
Sbjct: 667 HTNCDSLSLDFHIDIKNTGTMDGTHTLLVFATQPAGKWAPN-KQLVGFHKVHIVAGSERR 725
Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++ + CK L+IVD +P G+H + +G+
Sbjct: 726 VRVGVHVCKHLSIVDKLGIRRIPLGQHKLEIGD 758
>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
Length = 896
Score = 752 bits (1941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/735 (51%), Positives = 490/735 (66%), Gaps = 52/735 (7%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
S F FC++SLPY R DLVSR+TL EK +QL + A G+ RLG+P YEWWSEALHGVSN
Sbjct: 61 SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNS 120
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
G G HF D IP T FP VIL+ ASFNESLW +GQ VSTE RAMYN+G+AGLTYWSPN+
Sbjct: 121 GIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWSPNV 180
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
N+ RDPRWGR ETPGEDP VV RYAVNYVRGLQ+V G E + + LKVSSCCKHY
Sbjct: 181 NIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEG--NFAADRLKVSSCCKHYT 237
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
AYDVD WKGVDR+HFDA+VT QD+E+T+ PF+ CV+EG SSVMCSYNRVNG+P+CA+P
Sbjct: 238 AYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGVPTCANP 297
Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+LL +R +W L GYIV+DCDSI V + + ++ EDAVA LKAGL+L+CG Y +
Sbjct: 298 ELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNCGSYLGD 356
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSDENIELAA 406
+T NAV GKVKE+ ++++L Y Y VLMRLGFFDG P + GK D+C+ ++ LA
Sbjct: 357 YTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLAL 416
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
+AA++GIVLL N+ LPL+ KT+AV+GP+A+AT M+ NYAG+PCRY SP+ G
Sbjct: 417 DAAKQGIVLLHNN-GALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQK 475
Query: 467 YAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
Y + V+Y+ GC +V+C I A+ A ADAT+++ GLDL +EAE LDR +L LPG+
Sbjct: 476 YVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGF 535
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +L+ + A+ A G VILV+MSAG VDI+F + + I ILW GYPG+ GG AI+ V+FG
Sbjct: 536 QEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFG 595
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
+NPGGR P TWY +YV +P+T M +RP + +PGRTY+FY G +LY FG+GLSY+
Sbjct: 596 DYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGRTYRFYTGKSLYQFGHGLSYST 655
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F NL + I V +N G
Sbjct: 656 FYKNLSNIDIVIGV------------------------------------------KNAG 673
Query: 706 STDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
DG+ VV+ + KPP + + +++GF+RV V+ G+ + + + C ++ VD
Sbjct: 674 EIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLDVCGKISNVDEEG 733
Query: 765 NTLLPAGEHTIFVGN 779
L G HT+ VG+
Sbjct: 734 KRKLVMGMHTLVVGS 748
>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
Length = 774
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/777 (49%), Positives = 520/777 (66%), Gaps = 26/777 (3%)
Query: 8 LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
LL + + +LVF V ++ S P+F CDP GL + FC ++P +RV+D
Sbjct: 8 LLIGNKVVVILVFLLCLVHSSESLRPLFACDPAN----GL-TRTLRFCRVNVPIHVRVQD 62
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
L+ R+TL EK++ L + A VPRLG+ YEWWSEALHGVS+VGPG+ F PGATSFP
Sbjct: 63 LIGRLTLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGVSDVGPGSKFGGAFPGATSFPQ 122
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
VI T ASFN+SLW++IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGED
Sbjct: 123 VITTAASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGED 182
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P V +YA +YVRGLQ T +R LKV++CCKHY AYD+DNW GVDR+HF+A+
Sbjct: 183 PIVAAKYAASYVRGLQ-------GTAAGNR-LKVAACCKHYTAYDLDNWNGVDRFHFNAK 234
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
VT+QD+E+T+ PF+ CV EG +SVMCSYN+VNG P+CAD LL T+RG+W L+GYIV
Sbjct: 235 VTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGKWRLNGYIV 294
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
+DCDS+ V N + + E+A A ++KAGLDLDCG + FT AV++G + E DI+
Sbjct: 295 SDCDSVDVFF-NQQHYTSTPEEAAAASIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINL 353
Query: 368 SLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN 426
+L TV MRLG FDG+ Y +LG +D+CS + LA EAA +GIVLLKN +LPL+
Sbjct: 354 ALANTLTVQMRLGMFDGNLGPYANLGPRDVCSLAHKHLALEAAHQGIVLLKNSGRSLPLS 413
Query: 427 SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS 486
+ +TVAV+GP+++ T MIGNYAG C Y +P+ G S YA ++ GC VACK N
Sbjct: 414 PRRHRTVAVIGPNSDVTETMIGNYAGKACAYTTPLQGISRYARTLHQAGCAGVACKGNQG 473
Query: 487 IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
AA AA+ ADAT+++ GLD S+EAE+ DR L LPGYQ L+ +VA+ ++GPVILV+M
Sbjct: 474 FGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLM 533
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
S G +D+ FA+ + + AI+WAGYPG+ GG AIA+++FG NPGG+LP+TWY DYV +
Sbjct: 534 SGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKV 593
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
P+T M +R S YPGRTY+FY GP ++PFG+GLSYT F N L+ + Q++++
Sbjct: 594 PMTVMAMRA--SGNYPGRTYRFYKGPVVFPFGFGLSYTTFT-NSLAKSPLAQLSVS---- 646
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPPAE-I 723
NLN + + + V+ C+ + V+ N G DG+ V V+++PP I
Sbjct: 647 LSNLNSANAILNSTSHSIKVSHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPKNGI 706
Query: 724 AATYI-KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ KQ+I F++V V AG + ++ +ACK L +VD +P G+H + +G+
Sbjct: 707 KGLGVNKQLIAFEKVHVMAGAKQTVRVDVDACKHLGVVDEYGKRRIPMGKHKLHIGD 763
>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
Length = 767
Score = 750 bits (1937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/779 (49%), Positives = 516/779 (66%), Gaps = 35/779 (4%)
Query: 6 SSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRV 65
++ CF + LL A ++P+ CDP K GL + FC +LP RV
Sbjct: 5 NNFFCFLVLFILL-------SAEARAAPL-ACDP----KSGL-TRALRFCRVNLPIRARV 51
Query: 66 KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
+DL+ R+ L EKV+ L + A VPRLG+ YEWWSEALHGVSNVGPGT F P ATSF
Sbjct: 52 QDLIGRLNLQEKVKLLVNNAAPVPRLGISGYEWWSEALHGVSNVGPGTKFRGAFPAATSF 111
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPG 185
P VI T ASFN SLW+ IGQ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPG
Sbjct: 112 PQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPG 171
Query: 186 EDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
EDP + +YA +YVRGLQ + N LKV++CCKHY AYD+DNW VDR+HF+
Sbjct: 172 EDPTLASQYAASYVRGLQGI--------YNKNRLKVAACCKHYTAYDLDNWNAVDRFHFN 223
Query: 246 ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY 305
A+V++QD+E+T+ PF+ CV+EG +SVMCSYN+VNG P+CADP LL T+RG+W L+GY
Sbjct: 224 AKVSKQDLEDTYNVPFKGCVQEGRVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGY 283
Query: 306 IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDI 365
IV+DCDS+ V+ D+ + + E+A A T+KAGLDLDCG + T AV++G + E D+
Sbjct: 284 IVSDCDSVGVLYDDQHY-TRTPEEAAADTIKAGLDLDCGPFLAVHTEAAVKRGLLTEADV 342
Query: 366 DKSLKYLYTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT 422
+++L +TV MRLG FDG + + LG +D+CS + +LA +AAR+GIVLL+N +
Sbjct: 343 NQALTNTFTVQMRLGMFDGEAAAQPFGHLGPKDVCSPAHQDLALQAARQGIVLLQNRGRS 402
Query: 423 LPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACK 482
LPL++A+ + +AV+GP+A+ATV MIGNYAG+ C Y SP+ G + YA ++ GC VAC
Sbjct: 403 LPLSTARHRNIAVIGPNADATVTMIGNYAGVACGYTSPLQGIARYAKTVHQAGCIGVACT 462
Query: 483 SNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
SN AA+ AA ADAT+++ GLD S+EAE DR + LPG+Q +L+++VA ++GP I
Sbjct: 463 SNQQFGAATAAAAHADATVLVMGLDQSIEAEFRDRASVLLPGHQQELVSKVALASRGPTI 522
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
LV+M G VD+ FA+ + I AILW GYPG+ GG AIADV+FG NPGG+LP TWY Y
Sbjct: 523 LVLMCGGPVDVTFAKNDPKISAILWVGYPGQAGGTAIADVLFGTTNPGGKLPNTWYPQSY 582
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL- 661
V +P+T + +R S GYPGRTY+FY GP ++PFG+GLSYT+F +L + V L
Sbjct: 583 VAKVPMTDLAMRANPSNGYPGRTYRFYKGPVVFPFGFGLSYTRFTQSLAHAPTKVMVPLA 642
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP 720
N+ + ++ DA K V CD+ +D +N G DGS ++V+S PP
Sbjct: 643 NQFTNSNITSFNKDALK-------VLHTNCDNIPLSLHIDVKNKGKVDGSHTILVFSTPP 695
Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++ KQ+IGF+RV V AG +R++ + C L+ D +P GEHT+ +G+
Sbjct: 696 KGTKSSE-KQLIGFKRVHVFAGSKQRVRMNIHVCNHLSRADEFGVRRIPIGEHTLHIGD 753
>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
Full=Alpha-L-arabinofuranosidase; Flags: Precursor
gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
Length = 773
Score = 746 bits (1926), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/775 (48%), Positives = 516/775 (66%), Gaps = 25/775 (3%)
Query: 11 FSLSIALLVFSTNAVD-ANGSSSPVFVCD-PGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
FS+S L F + +N SSPVF CD G S GL+ FC++ L RV DL
Sbjct: 9 FSVSTLFLCFIVCISEQSNNQSSPVFACDVTGNPSLAGLR-----FCNAGLSIKARVTDL 63
Query: 69 VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
V R+TL+EK+ L A GV RLG+P Y+WWSEALHGVSNVG G+ F +PGATSFP V
Sbjct: 64 VGRLTLEEKIGFLTSKAIGVSRLGIPSYKWWSEALHGVSNVGGGSRFTGQVPGATSFPQV 123
Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
ILT ASFN SL++ IG+ VSTEARAMYN+G AGLT+WSPN+N+ RDPRWGR ETPGEDP
Sbjct: 124 ILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGEDP 183
Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
+ +YAV YV+GLQ+ +G + LKV++CCKHY AYD+DNW+ V+R F+A V
Sbjct: 184 TLSSKYAVAYVKGLQETDGGD------PNRLKVAACCKHYTAYDIDNWRNVNRLTFNAVV 237
Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVA 308
+QD+ +TF PF+ CV +G +SVMCSYN+VNG P+CADP LL+ +RG+W L+GYIV+
Sbjct: 238 NQQDLADTFQPPFKSCVVDGHVASVMCSYNQVNGKPTCADPDLLSGVIRGQWQLNGYIVS 297
Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
DCDS+ V+ + A + E+AVA++L AGLDL+C + AV+ G V ET IDK+
Sbjct: 298 DCDSVDVLFRKQHY-AKTPEEAVAKSLLAGLDLNCDHFNGQHAMGAVKAGLVNETAIDKA 356
Query: 369 LKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
+ + LMRLGFFDG P+ Y LG +D+C+ +N ELA + AR+GIVLLKN +LPL
Sbjct: 357 ISNNFATLMRLGFFDGDPKKQLYGGLGPKDVCTADNQELARDGARQGIVLLKNSAGSLPL 416
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
+ + +KT+AV+GP+ANAT MIGNY G+PC+Y +P+ G + + TY+ GC +VAC +
Sbjct: 417 SPSAIKTLAVIGPNANATETMIGNYHGVPCKYTTPLQGLAETVSSTYQLGC-NVAC-VDA 474
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
I +A + A +ADA +++ G D S+E E DR DL+LPG Q +L+ +VA A+GPV+LVI
Sbjct: 475 DIGSAVDLAASADAVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVI 534
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
MS GG DI FA+ + I +I+W GYPGE GG AIADV+FG+ NP G LP+TWY YV+
Sbjct: 535 MSGGGFDITFAKNDKKITSIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWYPQSYVEK 594
Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
+P+++M +RP S GYPGR+Y+FY G T+Y F L+YT+F + L+ + + ++L++
Sbjct: 595 VPMSNMNMRPDKSKGYPGRSYRFYTGETVYAFADALTYTKFDHQLIKAPRLVSLSLDENH 654
Query: 666 HCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
CR+ S DA C N + FE ++ +N G GS V +++ P ++
Sbjct: 655 PCRSSECQSLDAIGPHCE----NAVEGGSDFEVHLNVKNTGDRAGSHTVFLFTTSP-QVH 709
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ IKQ++GF+++ + ++F N CK L++VD + G H + VG+
Sbjct: 710 GSPIKQLLGFEKIRLGKSEEAVVRFNVNVCKDLSVVDETGKRKIALGHHLLHVGS 764
>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
Length = 772
Score = 746 bits (1925), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/771 (48%), Positives = 509/771 (66%), Gaps = 30/771 (3%)
Query: 15 IALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTL 74
I +L+ + G + F CDP + +S + FC +LP RVKDL+ R+TL
Sbjct: 9 IPILIILSAIFRHGGGAREPFACDPKDAA-----LSRYPFCRVALPIPERVKDLIGRLTL 63
Query: 75 DEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTAS 134
EKV+ L + A VPRLG+ YEWWSEALHGVSNVGPGT F PGATSFP VI T AS
Sbjct: 64 QEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVAS 123
Query: 135 FNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRY 194
FN SLW+ IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGEDP V G Y
Sbjct: 124 FNVSLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEY 183
Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
A Y++GLQ +G LKV++CCKH+ AYD+DNW G DR+HF+A+VT QDM
Sbjct: 184 AARYIKGLQGNDGDR---------LKVAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMV 234
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
+TF PF CVKEG +SVMCSYN+VNG+P+CADP LL T+R +W L+GYIV+DCDS+
Sbjct: 235 DTFEVPFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVG 294
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
V DN + + + E+A A +KAGLDLDCG + T +AV++G + +T I+ +L T
Sbjct: 295 VFYDNQHYTS-TAEEAAADAIKAGLDLDCGPFLAVHTEDAVKKGLLTQTHINNALANTIT 353
Query: 375 VLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
V MRLG FDG+P Y LG +++CS + +LA +AAR+GIVLLKN LPL++ +
Sbjct: 354 VQMRLGMFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSADHHR 413
Query: 432 TVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAAS 491
TVAV+GP+++ V MIGNYAG+ C Y++P+ G Y V ++ GCD+VAC ++ S A
Sbjct: 414 TVAVIGPNSDVNVTMIGNYAGVACGYVTPLEGIKRYTTVVHRKGCDNVACATDYSFTDAL 473
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
AA TADAT+++ GLD SVEAE+ DR+ L LPG Q +L+ +VA ++GP ++++MS G +
Sbjct: 474 AAASTADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPI 533
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
D++FA+ + I AILW GYPG+ GG AIADV+FG NPGG+LP+TWY Y+ LP+T+M
Sbjct: 534 DVSFADNDPRISAILWVGYPGQAGGAAIADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNM 593
Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
+R S YPGRTY+FY GP +Y FG+GLSYT F + ++ + ++L+ +
Sbjct: 594 AMRSTSS--YPGRTYRFYAGPVVYEFGHGLSYTNFIHTIVKAPTIVSISLSGHRQ----- 646
Query: 672 YTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-- 728
T AS + V +C VD +N G DG ++V+S PPA AT++
Sbjct: 647 -THSASTLSSKAIRVTHAKCQKLSLVIHVDVENKGDRDGFHTMLVFSTPPAN-GATWVPR 704
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
KQ++ F+++ + + +R++ + CK L++VD +P G+H I +GN
Sbjct: 705 KQLVAFEKLHLASREKRRLQVHVHVCKYLSVVDKLGVRRIPLGDHYIHIGN 755
>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
Length = 770
Score = 744 bits (1922), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/749 (49%), Positives = 513/749 (68%), Gaps = 26/749 (3%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CDP + G+ + FC SLP R +DLV R+TL EK++ L + A VPRLG+
Sbjct: 27 FACDP----RNGV-TRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIK 81
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPGT F PGATSFP VI T ASFN SLW++IG+ VS EARAM
Sbjct: 82 GYEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAM 141
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN G AGLTYWSPN+N+ RDPRWGR ETPGEDP V +YA YVRGLQ NA D
Sbjct: 142 YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQG-----NARDR 196
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKHY AYD+D+W G+DR+HF+ARV++QD+E+T+ PF+ CV EG+ +SVM
Sbjct: 197 ----LKVAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVM 252
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG P+CADP LL T+RGEW L+GYIV+DCDS+ V D + A + E+A A
Sbjct: 253 CSYNQVNGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTA-TPEEAAAVA 311
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+KAGLDLDCG + T A++ GK+ E D++ +L +V MRLG FDG P Y +L
Sbjct: 312 IKAGLDLDCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNL 371
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ + +LA EAAR+GIVL++N LPL++++ +T+AV+GP+++ T MIGNYA
Sbjct: 372 GPRDVCTPAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYA 431
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
G+ C Y +P+ G YA ++ GC VAC+ + AA AA+ ADAT+++ GLD S+E
Sbjct: 432 GVACGYTTPLQGIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIE 491
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AE DR D+ LPG Q +L+++VA ++GP +LV+MS G +D++FA+ + I AI+W GYP
Sbjct: 492 AEFRDRVDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYP 551
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG AIADV+FG+ NPGG+LP+TWY Y++ P+T+M +R + S GYPGRTY+FYNG
Sbjct: 552 GQAGGTAIADVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNG 611
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
P ++PFG+GLSY+ F ++L T+ V+L LQ +N S + + ++ C
Sbjct: 612 PVVFPFGHGLSYSTFAHSLAQAPTTVSVSLASLQTIKNSTIVSSGA------IRISHANC 665
Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ F +D +N G+ DGS ++++S PP + K+++ F++V V AG +R++F
Sbjct: 666 NTQPLGFHIDVKNTGTMDGSHTLLLFSTPPPGTWSPN-KRLLAFEKVHVGAGSQERVRFD 724
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK L++VD+ +P GEH +G+
Sbjct: 725 VHVCKHLSVVDHFGIHRIPMGEHHFHIGD 753
>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
[Cucumis sativus]
Length = 772
Score = 743 bits (1919), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/771 (48%), Positives = 508/771 (65%), Gaps = 30/771 (3%)
Query: 15 IALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTL 74
I +L+ + G + F CDP + +S + FC +LP RVKDL+ R+TL
Sbjct: 9 IPILIILSAIFRHGGGAREPFACDPKDAA-----LSRYPFCRVALPIPERVKDLIGRLTL 63
Query: 75 DEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTAS 134
EKV+ L + A VPRLG+ YEWWSEALHGVSNVGPGT F PGATSFP VI T AS
Sbjct: 64 QEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVAS 123
Query: 135 FNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRY 194
FN SLW+ IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGEDP V G Y
Sbjct: 124 FNVSLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEY 183
Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
A Y++GLQ +G LKV++CCKH+ AYD+DNW G DR+HF+A+VT QDM
Sbjct: 184 AARYIKGLQGNDGDR---------LKVAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMV 234
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
+TF PF CVKEG +SVMCSYN+VNG+P+CADP LL T+R +W L+GYIV+DCDS+
Sbjct: 235 DTFEVPFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVG 294
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
V DN + + + E+A A +KAGLDLDCG + T +AV++ + +T I+ +L T
Sbjct: 295 VFYDNQHYTS-TAEEAAADAIKAGLDLDCGPFLAVHTEDAVKKXLLTQTHINNALANTIT 353
Query: 375 VLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
V MRLG FDG+P Y LG +++CS + +LA +AAR+GIVLLKN LPL++ +
Sbjct: 354 VQMRLGMFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSAXHHR 413
Query: 432 TVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAAS 491
TVAV+GP+++ V MIGNYAG+ C Y++P+ G Y V ++ GCD+VAC ++ S A
Sbjct: 414 TVAVIGPNSDVNVTMIGNYAGVACGYVTPLEGIKRYTTVVHRKGCDNVACATDYSFTDAL 473
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
AA TADAT+++ GLD SVEAE+ DR+ L LPG Q +L+ +VA ++GP ++++MS G +
Sbjct: 474 AAASTADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPI 533
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
D++FA+ + I AILW GYPG+ GG AIADV+FG NPGG+LP+TWY Y+ LP+T+M
Sbjct: 534 DVSFADNDPRISAILWVGYPGQAGGAAIADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNM 593
Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
+R S YPGRTY+FY GP +Y FG+GLSYT F + ++ + ++L+ +
Sbjct: 594 AMRSTSS--YPGRTYRFYAGPVVYEFGHGLSYTNFIHTIVKAPTIVSISLSGHRQ----- 646
Query: 672 YTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-- 728
T AS + V +C VD +N G DG ++V+S PPA AT++
Sbjct: 647 -THSASTLSSKAIRVTHAKCQKLSLVIHVDVENKGDRDGFHTMLVFSTPPAN-GATWVPR 704
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
KQ++ F+++ + + +R++ + CK L++VD +P G+H I +GN
Sbjct: 705 KQLVAFEKLHLASREKRRLQVHVHVCKYLSVVDKLGVRRIPLGDHYIHIGN 755
>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
Length = 774
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/756 (48%), Positives = 500/756 (66%), Gaps = 29/756 (3%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +PVF CD + G + FCD + S R DLVSR+TL +KV L + +
Sbjct: 33 AQTPVFACDAANSTVAG-----YAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALA 87
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN SL++ IG+ VS
Sbjct: 88 RLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSN 147
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + RYAV YV GLQD
Sbjct: 148 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQDAGADA 207
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ PLKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF PF+ CV +G
Sbjct: 208 DG------PLKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVIDGK 261
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNG P+CAD LL+ +RG+W L+GYIV+DCDS+ V+ + + E+
Sbjct: 262 VASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYSQQHY-TKTPEE 320
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A A T+K+GLDL+CG + T AVQ G + E+D+D+++ + +LMRLGFFDG P+
Sbjct: 321 AAAITIKSGLDLNCGDFLAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLGFFDGDPRKL 380
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
Y SLG +D+C+ N ELA E AR+GIVLLKND LPL++ +K++AV+GP+ANA+ M
Sbjct: 381 AYGSLGPKDVCTSSNQELARETARQGIVLLKND-GALPLSAKSIKSMAVIGPNANASFTM 439
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
IGNY G PC+Y +P+ G Y+ GC +V C N+ + AA+ AA +AD T+++ G
Sbjct: 440 IGNYEGTPCKYTTPLHGLGNNVATVYQPGCSNVGCSGNSLQLSAATAAAASADVTVLVVG 499
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
D S+E E+LDR L LPG Q LI+ VA +KG VILV+MS G DI+FA+ + I AI
Sbjct: 500 ADQSIEREALDRTSLLLPGQQPDLISAVANASKGHVILVVMSGGPFDISFAKASDKISAI 559
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LW GYPGE GG AIAD++FGK+NP GRLP+TWY + +P+T M +RP +S GYPGRT
Sbjct: 560 LWVGYPGEAGGAAIADIIFGKYNPSGRLPVTWYPASFADKVPMTDMRMRPDNSTGYPGRT 619
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTS-DASKTRCPG 683
Y+FY G T++ FG GLSYT +NL++ + + + L + C S +A+ C G
Sbjct: 620 YRFYTGETVFAFGDGLSYTTMSHNLVAAPPSEVSMQLAEGHACHTKECASVEAAGDHCEG 679
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
+ FE ++ N G G+ V+++S PPA + K ++GF+++ + G+
Sbjct: 680 MA---------FEVRLRVHNTGEMAGAHTVLLFSSPPA-VHNAPAKHLLGFEKLNLEPGQ 729
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
F + CK L++VD N + G HT+ VG+
Sbjct: 730 AGVAAFKVDVCKDLSVVDELGNRKVALGGHTLHVGD 765
>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
Length = 774
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/771 (47%), Positives = 508/771 (65%), Gaps = 34/771 (4%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
S+ P F CDP S SSFLFC +SLP S RV+DLVSR+TLDEK+ QL A +P
Sbjct: 24 STEPPFSCDPSNPS-----TSSFLFCKTSLPISQRVRDLVSRLTLDEKISQLVSSAPSIP 78
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGV+NVG G HF+ I ATSFP VILT ASF+ W +IGQ +
Sbjct: 79 RLGIPAYEWWSEALHGVANVGRGIHFEGAIKAATSFPQVILTAASFDAYQWYRIGQVIGR 138
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ----- 203
EARA+YN G+A G+T+W+PNIN+ RDPRWGR ETPGEDP V G+YAV+YVRG+Q
Sbjct: 139 EARAVYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDSFQ 198
Query: 204 --DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
++GH L+ S+CCKH+ AYD+DNWKGV+R+ FDARVT QD+ +T+ PF
Sbjct: 199 GGKLKGH----------LQASACCKHFTAYDLDNWKGVNRFVFDARVTMQDLADTYQPPF 248
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
+ CV++G AS +MC+YNRVNGIPSCAD LL++T RG+WD HGYI +DCD++ ++ DN
Sbjct: 249 QSCVQQGKASGIMCAYNRVNGIPSCADFNLLSRTARGQWDFHGYIASDCDAVSIIYDNQG 308
Query: 322 FLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
+ A S EDAV LKAG+D++CG Y T AV+Q K+ E ID++L L++V MRLG
Sbjct: 309 Y-AKSPEDAVVDVLKAGMDVNCGSYLQKHTKAAVEQKKLPEASIDRALHNLFSVRMRLGL 367
Query: 382 FDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGP 438
F+G+P + ++G +CS E+ LA EAAR GIVLLKN LPL +K ++AV+GP
Sbjct: 368 FNGNPTEQPFSNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLQKSKTVSLAVIGP 427
Query: 439 HANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTA 497
+AN+ ++GNYAG PC+ ++P+ Y N Y +GCD V C S+ SI A + AK
Sbjct: 428 NANSVQTLLGNYAGPPCKTVTPLQALQYYVKNTIYYSGCDTVKC-SSASIDKAVDIAKGV 486
Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
D +++ GLD + E E LDR DL LPG Q +LI VA+ AK P++LV++S G VDI+FA+
Sbjct: 487 DRVVMIMGLDQTQEREELDRLDLVLPGKQQELITNVAKSAKNPIVLVLLSGGPVDISFAK 546
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
+ NI +ILWAGYPGE GG A+A+++FG NPGG+LP+TWY ++V+ +P+T M +RP
Sbjct: 547 YDENIGSILWAGYPGEAGGIALAEIIFGDHNPGGKLPMTWYPQEFVK-VPMTDMRMRPDP 605
Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
S GYPGRTY+FY G ++ FGYGLSY+++ Y L ++T ++ LN+ R ++ SD
Sbjct: 606 SSGYPGRTYRFYKGRNVFEFGYGLSYSKYSYELKYVSQT-KLYLNQSSTMRIID-NSDPV 663
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
+ L + + F KV +N G G V+++++ +Q+IGF+ V
Sbjct: 664 RATLVAQLGAEFCKESKFSVKVGVENQGEMAGKHPVLLFARHARHGNGRPRRQLIGFKSV 723
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
+ AG I+F + C+ + + ++ G H + V GG +PI +
Sbjct: 724 ILNAGEKAEIEFELSPCEHFSRANEDGLRVMEEGTHFLMV--GGDKYPISV 772
>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
Length = 767
Score = 735 bits (1898), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/755 (49%), Positives = 502/755 (66%), Gaps = 30/755 (3%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +PVF CD + ++S+ FC+ S S R DLVSR+TL EKV L D +P
Sbjct: 29 AQTPVFACDASNAT-----LASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALP 83
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGVS VGPGT F ++P ATSFP ILT ASFN +L++ IG+ VS
Sbjct: 84 RLGIPLYEWWSEALHGVSYVGPGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSN 143
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +YAV YV GLQD
Sbjct: 144 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDAGS-- 201
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
S LKV++CCKHY AYDVDNWKGV+RY F+A V++QD+++TF PF+ CV +G+
Sbjct: 202 -----GSGSLKVAACCKHYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQPPFKSCVVDGN 256
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNG P+CAD LL+ +RG+W L+GYI +DCDS+ V+ +N + + ED
Sbjct: 257 VASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPED 315
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A A ++KAGLDL+CG + T AVQ GK+ E+D+D+++ + LMRLGFFDG P+
Sbjct: 316 AAAISIKAGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFITLMRLGFFDGDPRKL 375
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ +LG D+C+ N ELA EAAR+GIVLLKN LPL+++ +K++AV+GP+ANA+ M
Sbjct: 376 PFGNLGPSDVCTSSNQELAREAARQGIVLLKN-SGALPLSASSIKSLAVIGPNANASFTM 434
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
IGNY G PC+Y +P+ G Y+ GC +V C N+ + AA++AA +AD T+++ G
Sbjct: 435 IGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVG 494
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
D S+E ESLDR L LPG Q QL++ VA ++GP ILVIMS G DI+FA+++ I AI
Sbjct: 495 ADQSIERESLDRTSLLLPGQQPQLVSAVANASRGPCILVIMSGGPFDISFAKSSDKIAAI 554
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LW GYPGE GG AIADV+FG NP GRLP+TWY + + +P+ M +RP S GYPGRT
Sbjct: 555 LWVGYPGEAGGAAIADVLFGHHNPSGRLPVTWYPESFTK-VPMIDMRMRPDASTGYPGRT 613
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
Y+FY G T+Y FG GLSYT F ++L+S K + + L + C +CP V
Sbjct: 614 YRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQVALQLAEGHTCLT---------EQCPSVE 664
Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
C+ F+ + +N G G+ V ++S PPA + K ++GF++V + G+
Sbjct: 665 AEGAHCEGLAFDVHLRVRNAGDMSGAHTVFLFSSPPA-VHNAPAKHLLGFEKVSLEPGQA 723
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ F + CK L++VD N + G HT+ VG+
Sbjct: 724 GVVAFKVDVCKDLSVVDELGNRKVALGNHTLHVGD 758
>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
Length = 770
Score = 734 bits (1895), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/730 (50%), Positives = 491/730 (67%), Gaps = 25/730 (3%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
FC SL RVKDL+ R+TL EK++ L + A VPRLG+ YEWWSEALHGVSNVGPGT
Sbjct: 46 FCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSEALHGVSNVGPGT 105
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
F PGATSFP VI T ASFN+SLW IG+ VS EARAMYN G AGLTYWSPN+N+ R
Sbjct: 106 KFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTAGLTYWSPNVNIFR 165
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGR ETPGEDP + +YA NYV+GLQ +G + LKV++CCKHY AYD+
Sbjct: 166 DPRWGRGQETPGEDPILAAKYAANYVQGLQGNDGKKR--------LKVAACCKHYTAYDL 217
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
DNW GVDRYHF+A+V++QD+E+T+ PF+ CV EG +SVMCSYN+VNG P+CADP LL
Sbjct: 218 DNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLK 277
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
T+RG W L GYIV+DCDS+ V+ D+ F + E+A A T+KAGLDLDCG + T
Sbjct: 278 NTIRGAWGLDGYIVSDCDSVGVLYDSQHF-TPTPEEAAASTIKAGLDLDCGPFLAVHTAT 336
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAR 410
AV +G +KE D++ +L L +V MRLG FDG P Y +LG +D+C+ + LA EAAR
Sbjct: 337 AVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAAR 396
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANV 470
+GIVLL+N LPL+ + +TVAV+GP+++ATV MIGNYAG+ C Y +P+ G S Y
Sbjct: 397 QGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVKT 456
Query: 471 TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
+ GC +VAC + I A AA+ ADA +++ GLD S+EAES DR + LPG Q +L+
Sbjct: 457 IHAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELV 516
Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
++ KGP ++V+MS G +D++FA+ + I ILW GYPG+ GG AIADV+FG NPG
Sbjct: 517 RRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPG 576
Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
G+LP+TWY Y+ +P+T+M LRP S GYPGRTY+FY GP ++PFG+GLSY++F
Sbjct: 577 GKLPMTWYPQSYLAKVPMTNMGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQ-- 634
Query: 651 LSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
SF + +++L N + T S T C V+DL +D +N G+ DG
Sbjct: 635 -SFAEAPTKISLPLSSLSPNSSATVKVSHTDCAS--VSDL------PIMIDVKNTGTVDG 685
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
S ++V+S P + + K +IGF++V + AG KR++ + C L+ VD +P
Sbjct: 686 SHTILVFSTVPNQTWSPE-KHLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIP 744
Query: 770 AGEHTIFVGN 779
GEH + +G+
Sbjct: 745 MGEHKLHIGD 754
>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 733 bits (1891), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/756 (48%), Positives = 498/756 (65%), Gaps = 27/756 (3%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +PVF CD + ++++ FC+ S R +DLVSR+TL EKV L + +
Sbjct: 32 AQAPVFACDASNAT-----LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALG 86
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN SL++ IG+ VST
Sbjct: 87 RLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVST 146
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +YAV YV GLQD
Sbjct: 147 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA---- 202
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
A + LKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF PF+ CV +G+
Sbjct: 203 GAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGN 262
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNG P+CAD LL +RG+W L+GYIV+DCDS+ V+ + + E+
Sbjct: 263 VASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEE 321
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A A T+K+GLDL+CG + T AVQ G++ E D+D+++ + +LMRLGFFDG P+
Sbjct: 322 AAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQL 381
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ SLG +D+C+ N ELA E AR+GIVLLKN LPL++ +K++AV+GP+ANA+ M
Sbjct: 382 AFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTM 440
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
IGNY G PC+Y +P+ G N Y+ GC +V C N+ + A AA +AD T+++ G
Sbjct: 441 IGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVG 500
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
D S+E ESLDR L LPG QTQL++ VA + GPVILV+MS G DI+FA+ + I AI
Sbjct: 501 ADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAI 560
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LW GYPGE GG A+AD++FG NP GRLP+TWY Y + +T M +RP S GYPGRT
Sbjct: 561 LWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRT 620
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGV 684
Y+FY G T++ FG GLSYT+ ++L+S + + + L + CR C V
Sbjct: 621 YRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASV 671
Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
CDD F+ K+ +N G G+ V+++S PP A K ++GF++V + G
Sbjct: 672 EAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAP-AKHLLGFEKVSLAPGE 730
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ F + C+ L++VD + G HT+ VG+
Sbjct: 731 AGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGD 766
>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 773
Score = 731 bits (1888), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/755 (48%), Positives = 498/755 (65%), Gaps = 30/755 (3%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +P F CD + ++S+ FC+ S + R DLVSR+TL EKV L D +P
Sbjct: 35 AQTPAFACDASNAT-----LASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALP 89
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN +L++ IG+ VS
Sbjct: 90 RLGVPLYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSN 149
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +YAV YV GLQ
Sbjct: 150 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGA 209
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
A LKV++CCKHY AYDVDNWKGV+RY FDA V++QD+++TF PF+ CV +G+
Sbjct: 210 GA-------LKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGN 262
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNG P+CAD LL+ +RG+W L+GYI +DCDS+ V+ +N + + ED
Sbjct: 263 VASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPED 321
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A A ++KAGLDL+CG + T AVQ GK+ E+D+D+++ LMRLGFFDG P+
Sbjct: 322 AAAISIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPREL 381
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ +LG D+C+ N ELA EAAR+GIVLLKN LPL++ +K++AV+GP+ANA+ M
Sbjct: 382 PFGNLGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTM 440
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
IGNY G PC+Y +P+ G Y+ GC +V C N+ + AA++AA +AD T+++ G
Sbjct: 441 IGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVG 500
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
D S+E ESLDR L LPG Q QL++ VA + GP ILV+MS G DI+FA+++ I AI
Sbjct: 501 ADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAI 560
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LW GYPGE GG AIADV+FG NP GRLP+TWY + + +P+T M +RP S GYPGRT
Sbjct: 561 LWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTK-VPMTDMRMRPDPSTGYPGRT 619
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
Y+FY G T+Y FG GLSYT F ++L+S K + + L + C +CP V
Sbjct: 620 YRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLT---------EQCPSVE 670
Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
C+ F+ + +N G G V ++S PPA + K ++GF++V + G+
Sbjct: 671 AEGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPA-VHNAPAKHLLGFEKVSLEPGQA 729
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ F + CK L++VD N + G HT+ VG+
Sbjct: 730 GVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 764
>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 775
Score = 731 bits (1888), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/756 (48%), Positives = 498/756 (65%), Gaps = 27/756 (3%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +PVF CD + ++++ FC+ S R +DLVSR+TL EKV L + +
Sbjct: 32 AQAPVFACDASNAT-----LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALG 86
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN SL++ IG+ VST
Sbjct: 87 RLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVST 146
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +YAV YV GLQD
Sbjct: 147 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA---- 202
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
A + LKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF PF+ CV +G+
Sbjct: 203 GAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGN 262
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNG P+CAD LL +RG+W L+GYIV+DCDS+ V+ + + E+
Sbjct: 263 VASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEE 321
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A A T+K+GLDL+CG + T AVQ G++ E D+D+++ + +LMRLGFFDG P+
Sbjct: 322 AAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQL 381
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ SLG +D+C+ N ELA E AR+GIVLLKN LPL++ +K++AV+GP+ANA+ M
Sbjct: 382 AFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTM 440
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
IGNY G PC+Y +P+ G N Y+ GC +V C N+ + A AA +AD T+++ G
Sbjct: 441 IGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVG 500
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
D S+E ESLDR L LPG QTQL++ VA + GPVILV+MS G DI+FA+ + I AI
Sbjct: 501 ADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAI 560
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LW GYPGE GG A+AD++FG NP G+LP+TWY Y + +T M +RP S GYPGRT
Sbjct: 561 LWVGYPGEAGGAALADILFGSHNPSGKLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRT 620
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGV 684
Y+FY G T++ FG GLSYT+ ++L+S + + + L + CR C V
Sbjct: 621 YRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASV 671
Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
CDD F+ K+ +N G G+ V+++S PP A K ++GF++V + G
Sbjct: 672 EAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAP-AKHLLGFEKVSLAPGE 730
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ F + C+ L++VD + G HT+ VG+
Sbjct: 731 AGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGD 766
>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
Length = 784
Score = 726 bits (1873), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/754 (48%), Positives = 507/754 (67%), Gaps = 24/754 (3%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
+ CD + LG SF FCD+ L +RV+DLVSR+TLDEKV ++ + A G+PRLG+P
Sbjct: 36 YACDVSSNASLG----SFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVP 91
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
Y+WW EALHGV++ PG F + P ATSFP I T ASFN +L+ IG+AVS+EARA+
Sbjct: 92 SYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIATAASFNSTLFYSIGEAVSSEARAL 150
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
+NLGRAGLT+WSPN+N+ RDPRWGR ETPGEDP + ++A YVRGLQ +A+D
Sbjct: 151 HNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYEGSASD- 209
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKVS+CCKH AYDVDNWKG+DRYHF+A V+EQD+ +T+ PF+ C+++G SSVM
Sbjct: 210 --GFLKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVM 267
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYNRVNG+P+CAD LL +TVR W +GYIV+DCD++QV+ ++ + A S EDAVA +
Sbjct: 268 CSYNRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTY-APSAEDAVADS 326
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+ AGLDL+CG + +A+Q GK+ E D+D ++ L MRLG FDG P Y SL
Sbjct: 327 ILAGLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDPNSQPYSSL 386
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G DICS+++ +LA +AA +G+VLLKND +LPL++A +KTVA++GP+ANAT M+GNY
Sbjct: 387 GATDICSNDHQQLALDAALQGVVLLKND-GSLPLSTA-LKTVALIGPNANATYTMLGNYE 444
Query: 452 GIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV 510
GIPC+Y+SP+ G Y +N+ Y GC +VAC + + +A E A ADA +++ GLD S
Sbjct: 445 GIPCKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVLVVGLDQSQ 504
Query: 511 EAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
E E+ DR L LPG Q+QL++ +A P++LVIMSAG VDI+ + N+ I +++W GY
Sbjct: 505 ERETFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRISSVIWLGY 564
Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
PG+ GG A+A VVFG +NPGGRLP TWY+ ++ + + M +RP GYPGR+Y+FY
Sbjct: 565 PGQSGGAALAHVVFGAYNPGGRLPNTWYHEEFTN-VSMLDMQMRPNPLSGYPGRSYRFYT 623
Query: 631 GPTLYPFGYGLSYTQFKYN-LLSFTKT--IQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
G LY FG GLSY+ + Y LL+ TK + N + C +N + +K+ C + +
Sbjct: 624 GTPLYNFGDGLSYSTYFYKFLLAPTKLSFFKSNTGNSRGCPAVNRSK--AKSGCFHLPAD 681
Query: 688 DLR-CDD-YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNK 745
DL C+ F+ V+ N+G GS V+++S PP + +KQ+I FQ+V + + +
Sbjct: 682 DLETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPP-PVEGAPLKQLIAFQKVHLESDTTQ 740
Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
R+ F + CK L+ V L +G H + +GN
Sbjct: 741 RLIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGN 774
>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
Length = 738
Score = 720 bits (1859), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/748 (48%), Positives = 498/748 (66%), Gaps = 56/748 (7%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CDP + G+ + FC SLP R +DLV R+TL EK++ L + A VPRLG+
Sbjct: 27 FACDP----RNGV-TRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIK 81
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPGT F PGATSFP VI T ASFN SLW++IG+ VS EARAM
Sbjct: 82 GYEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAM 141
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN G AGLTYWSPN+N+ RDPRWGR ETPGEDP V +YA YVRGLQ NA D
Sbjct: 142 YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQG-----NARDR 196
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKHY AYD+D+W G+DR+HF+ARV++QD+E+T+ PF+ CV EG+ +SVM
Sbjct: 197 ----LKVAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVM 252
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG P+CADP LL T+RGEW L+GYIV+DCDS+ V D + A + E+A A
Sbjct: 253 CSYNQVNGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTA-TPEEAAAVA 311
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+KAGLDLDCG + T A++ GK+ E D++ +L +V MRLG FDG P Y +L
Sbjct: 312 IKAGLDLDCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNL 371
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ + +LA EAAR+GIVL++N LPL++++ +T+AV+GP+++ T MIGNYA
Sbjct: 372 GPRDVCTPAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYA 431
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
G+ C Y +P+ G YA ++ GC VAC+ + AA AA+ ADAT+++ GLD S+E
Sbjct: 432 GVACGYTTPLQGIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIE 491
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AE DR D+ LPG Q +L+++VA ++GP +LV+MS G +D++FA+ + I AI+W GYP
Sbjct: 492 AEFRDRVDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYP 551
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG AIADV+FG+ NPGG+LP+TWY Y++ P+T+M +R + S GYPGRTY+FYNG
Sbjct: 552 GQAGGTAIADVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNG 611
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
P ++PFG+GLSY+ F ++L T
Sbjct: 612 PVVFPFGHGLSYSTFAHSLAQAPTTP---------------------------------- 637
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
F +D +N G+ DGS ++++S PP + K+++ F++V V AG +R++F
Sbjct: 638 ---LGFHIDVKNTGTMDGSHTLLLFSTPPPGTWSPN-KRLLAFEKVHVGAGSQERVRFDV 693
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK L++VD+ +P GEH +G+
Sbjct: 694 HVCKHLSVVDHFGIHRIPMGEHHFHIGD 721
>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
[Hordeum vulgare]
Length = 777
Score = 720 bits (1859), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/753 (47%), Positives = 491/753 (65%), Gaps = 27/753 (3%)
Query: 33 PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
PVF CD + ++++ FC+ S R +DLVSR+TL EKV L + + RLG
Sbjct: 37 PVFACDASNAT-----LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLG 91
Query: 93 LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN SL++ IG+ VSTEAR
Sbjct: 92 IPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEAR 151
Query: 153 AMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
AM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +YAV YV GLQD A
Sbjct: 152 AMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA----GAG 207
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
+ LKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF PF+ CV +G+ +S
Sbjct: 208 GVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVAS 267
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
VMCSYN+VNG P+CAD LL +RG+W L+GYIV+DCDS+ V+ + + E+A A
Sbjct: 268 VMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEEAAA 326
Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YV 389
T+K+G+DL+CG + T AVQ G++ E D+D+++ + +LMRLGFFDG P+ +
Sbjct: 327 ITIKSGVDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFG 386
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
SLG +D+C+ N ELA E AR+GIVLLKN LPL++ +K++AV+GP+ANA+ MIGN
Sbjct: 387 SLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGN 445
Query: 450 YAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAGLDL 508
Y G PC+Y +P+ G N Y+ GC +V C N+ + A AA +AD T+++ G D
Sbjct: 446 YEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQ 505
Query: 509 SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
S+E ESLDR L LPG QTQL++ VA + GPVILV+MS G DI+FA+ + I A LW
Sbjct: 506 SIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAATLWV 565
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
GYPGE GG A+ D +FG NP GRLP+TWY Y + +T M +RP S GYPGRTY+F
Sbjct: 566 GYPGEAGGAALDDTLFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRF 625
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
Y G T++ FG GLSYT+ ++L+S + + + L + CR C V
Sbjct: 626 YTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHLCR---------AEECASVEAA 676
Query: 688 DLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
CDD + K+ +N G G+ V+++S PP A K ++GF++V + G
Sbjct: 677 GDHCDDLALDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAP-AKHLVGFEKVSLAPGEAGT 735
Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ F + C+ L++VD + G HT+ G+
Sbjct: 736 VAFRVDVCRDLSVVDELGGRKVALGGHTLHDGD 768
>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
Length = 785
Score = 718 bits (1854), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/762 (47%), Positives = 506/762 (66%), Gaps = 30/762 (3%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
++ P + CD + LG SF FCD+ L +RV+DLVSR+TLDEKV ++ + A G+P
Sbjct: 32 TAQPRYACDVSSNASLG----SFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQGIP 87
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WW EALHGV++ PG F + P ATSFP I ASFN +L+ IG+AVS+
Sbjct: 88 RLGVPSYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEAVSS 146
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARA++NLGRAGLT+WSPN+N+ RDPRWGR ETPGEDP + ++A YVRGLQ
Sbjct: 147 EARALHNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYGG 206
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+A+D LKVS+CCKH AYD+DNWKG+DRYHF+A V+EQD+ +T+ PF+ C+++G
Sbjct: 207 SASD---GFLKVSACCKHLTAYDMDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGR 263
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
SSVMCSYNRVNG+P+CAD LL +TVR W +GYIV+DCD++QV+ ++ + A S ED
Sbjct: 264 VSSVMCSYNRVNGVPTCADRSLLTETVRNSWGFNGYIVSDCDALQVLFEDTTY-APSAED 322
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SP 386
AVA ++ AGLDL+CG + +A+Q GKV E D+D ++ L MRLG FDG +
Sbjct: 323 AVADSILAGLDLNCGTFLGKHAKSALQAGKVTEADLDHAISNLMRTRMRLGLFDGDLNTR 382
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
Y SLG DICS+++ +LA +AA +G+VLLKND +LPL++A +KTVA++GP+ANAT M
Sbjct: 383 PYSSLGATDICSNDHQQLALDAALQGVVLLKND-GSLPLSTA-LKTVALIGPNANATYTM 440
Query: 447 IGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
+GNY GIPC+Y+SP+ G Y N+ Y GC DVAC + + +A E A ADA +++ G
Sbjct: 441 LGNYEGIPCKYVSPLQGMQIYNNNILYSPGCRDVACSEGDLVASAVEVATKADAVVLVVG 500
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
LD S E E+ DR L LPG Q+QL++ +A P++LVIMSAG VDI+ + N+ I ++
Sbjct: 501 LDQSQERETFDRTSLLLPGMQSQLVSNIANAVTCPIVLVIMSAGPVDISTFKDNSRISSV 560
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
+W GYPG+ GG A+A VVFG +NPGGRLP TWY+ ++ + + M +RP GYPGR+
Sbjct: 561 IWIGYPGQSGGAALAHVVFGAYNPGGRLPNTWYHEEFTN-VSMLDMRMRPNPPSGYPGRS 619
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNL------LSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
Y+FY G LY FG GLSY+ + Y LSF K+ N + C +N + ++
Sbjct: 620 YRFYTGTPLYNFGDGLSYSTYLYKFLLAPTRLSFFKS---NTRNSRDCPTVNRSE--AEF 674
Query: 680 RCPGVLVNDLR-CDD-YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
C + +DL C+ F+ V+ N+G GS V+++S PP + +KQ+I FQ+V
Sbjct: 675 GCFHLPADDLETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPP-PVEGAPLKQLIAFQKV 733
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ + +R+ F + CK L+ V L +G H + +GN
Sbjct: 734 HLESDTTQRLIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGN 775
>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
Length = 773
Score = 716 bits (1849), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/772 (46%), Positives = 505/772 (65%), Gaps = 36/772 (4%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
S+ P F CD S +F FC+++LP S R +DLVSR+TLDEK+ QL + A +P
Sbjct: 23 STQPPFSCDSSNPS-----TKAFPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIP 77
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGVSN GPG HF+D I GATSFP VILT ASF+ W +IGQA+
Sbjct: 78 RLGIPGYEWWSEALHGVSNAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGK 137
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ----- 203
EARA+YN G+A G+T+W+PNIN+ RDPRWGR ETPGEDP V G YA +YV+G+Q
Sbjct: 138 EARALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQGDSFE 197
Query: 204 --DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
++GH L+ S+CCKH+ AYD+DNWKG++R+ FDARVT QD+ +T+ PF
Sbjct: 198 GGKIKGH----------LQASACCKHFTAYDLDNWKGMNRFVFDARVTMQDLADTYQPPF 247
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
+ CV++G AS +MC+YN+VNG+PSCAD LL++T R +W GYI +DCD++ ++ D+
Sbjct: 248 KSCVEQGRASGIMCAYNKVNGVPSCADSNLLSKTARAQWGFRGYITSDCDAVSIIHDDQG 307
Query: 322 FLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
+ A S EDAV LKAG+D++CG Y AV+Q K+ E+DIDK+L L++V MRLG
Sbjct: 308 Y-AKSPEDAVVDVLKAGMDVNCGSYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMRLGL 366
Query: 382 FDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGP 438
F+G P+ + ++G +CS E+ LA EAAR GIVLLKN LPL+ +K K++AV+GP
Sbjct: 367 FNGRPEGQLFGNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLSKSKTKSLAVIGP 426
Query: 439 HANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTA 497
+AN+ ++GNYAG PCR+++P+ Y T Y CD V C S+ S+ A + AK A
Sbjct: 427 NANSGQMLLGNYAGPPCRFVTPLQALQSYIKQTVYHPACDTVQC-SSASVDRAVDVAKGA 485
Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
D +++ GLD + E E LDR DL LPG Q +LI VA+ AK PV+LV+ S G VDI+FA+
Sbjct: 486 DNVVLMMGLDQTQEREELDRTDLLLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDISFAK 545
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
+ NI +ILWAGYPGE G A+A++VFG NPGGRLP+TWY ++V+ +P+T M +RP
Sbjct: 546 NDKNIGSILWAGYPGEGGAIALAEIVFGDHNPGGRLPMTWYPQEFVK-VPMTDMGMRPEA 604
Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTK-TIQVNLNKLQHCRNLNYTSDA 676
S GYPGRTY+FY G +++ FGYG+SY+++ Y L + ++ T+ +N + H N D+
Sbjct: 605 SSGYPGRTYRFYRGRSVFEFGYGISYSKYSYELTAVSQNTLYLNQSSTMHIIN---DFDS 661
Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
++ L + + ++ +N G G V+++++ KQ+IGFQ
Sbjct: 662 VRSTLISELGTEFCEQNKCRARIGVKNHGEMAGKHPVLLFARQEKHGNGRPRKQLIGFQS 721
Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
V + AG I+F + C+ L+ + ++ G H + V G +PI +
Sbjct: 722 VVLGAGERAEIEFEVSPCEHLSRANEDGLMVMEEGRHFLVV--DGDEYPISV 771
>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
Length = 775
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/790 (46%), Positives = 504/790 (63%), Gaps = 29/790 (3%)
Query: 9 LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
L S I ++ S + V S+ P F CD Q S FC + LP S+RV DL
Sbjct: 3 LHISTLITTILISLSLVSIVQSTQPPFSCDSSN-----PQTKSLKFCQTGLPISVRVLDL 57
Query: 69 VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
VSR+TLDEK+ QL + A +PRLG+P YEWWSE+LHGV + G G F+ I GATSFP V
Sbjct: 58 VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSESLHGVGSAGKGIFFNGSIAGATSFPQV 117
Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGED 187
ILT A+F+E+LW +IGQ + EAR +YN G+A G+T+W+PNIN+ RDPRWGR ETPGED
Sbjct: 118 ILTAATFDENLWYRIGQVIGVEARGVYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 177
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P + G+YA+ YVRG+Q N L L+ S+CCKH+ AYD+D WK +DR+ F+A
Sbjct: 178 PIMTGKYAIRYVRGVQG--DSFNGGQLKKGHLQASACCKHFTAYDLDQWKNLDRFSFNAI 235
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
VT QDM +TF PF+ C+++ AS +MCSYN VNGIPSCA+ LL +T R +W HGYI
Sbjct: 236 VTPQDMADTFQPPFQDCIQKAQASGIMCSYNSVNGIPSCANYNLLTKTARQQWGFHGYIT 295
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
+DCD++QVM DNH++ ++ ED+ A LKAG+D+DCG Y +T +AV + KV + ID+
Sbjct: 296 SDCDAVQVMHDNHRY-GNTPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKKKVSQVHIDR 354
Query: 368 SLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
+L L+++ MRLG F+G P+ Y ++ +C+ ++ +LA EAAR GIVLLKN LP
Sbjct: 355 ALHNLFSIRMRLGLFNGDPRKQLYGNISPSQVCAPQHQQLALEAARNGIVLLKNTGKLLP 414
Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKS 483
L+ AK ++AV+G +AN + GNY G PC+Y+ + GYA +V Y+ GC+ C S
Sbjct: 415 LSKAKTNSLAVIGHNANNAYILRGNYDGPPCKYIEILKALVGYAKSVQYQQGCNAANCTS 474
Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
N I A A+ AD +++ GLD + E E DR+DL LPG Q LIN VA+ AK PVIL
Sbjct: 475 AN-IDQAVNIARNADYVVLIMGLDQTQEREQFDRDDLVLPGQQENLINSVAKAAKKPVIL 533
Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
VI+S G VDI+FA+ N I +ILWAGYPGE GG A+A+++FG+ NPGG+LP+TWY +V
Sbjct: 534 VILSGGPVDISFAKYNPKIGSILWAGYPGEAGGIALAEIIFGEHNPGGKLPVTWYPQAFV 593
Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLN 662
+ +P+T M +RP GYPGRTY+FY GP +Y FGYGLSYT + Y S T TIQ LN
Sbjct: 594 K-IPMTDMRMRPDPKTGYPGRTYRFYKGPKVYEFGYGLSYTTYSYGFHSATPNTIQ--LN 650
Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDD----YFEFKVDFQNVGSTDGSDVVIVYSK 718
+L + + + T V+++ D+ F V +N G DG V+++ K
Sbjct: 651 QLLSVKTVENSDSIRYT-----FVDEIGSDNCEKAKFSAHVSVENSGEMDGKHPVLLFVK 705
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ IKQ++GFQ V ++AG N ++ F + C+ L+ + ++ G + VG
Sbjct: 706 QDKARNGSPIKQLVGFQSVSLKAGENSQLVFEISPCEHLSSANEDGLMMIEEGSRYLVVG 765
Query: 779 NGGVSFPIHL 788
+ PI++
Sbjct: 766 DA--EHPINI 773
>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
Length = 772
Score = 712 bits (1837), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/739 (49%), Positives = 499/739 (67%), Gaps = 24/739 (3%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+++F FC++SLP + RV+D V+R+TL+EK+ QL + A G+PRLG+P+Y+WW EALHGV++
Sbjct: 39 LAAFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVAS 98
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
PG F +P ATSFP I T ASFN SL+ IGQAVSTEARAM+NLG++GLT+WSPN
Sbjct: 99 -SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGLTFWSPN 157
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN+ RDPRWGR ETPGEDP + +A YVRGLQ+ + S LKVS+CCKH
Sbjct: 158 INIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQA-------GSDKLKVSACCKHM 210
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AYDVDNW G DRYHF+A VTEQD+E+T+ PF+ CV++G SSVMCSYNR+NG+P+CAD
Sbjct: 211 TAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRLNGVPTCAD 270
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
+LL TVR W L+GYIV+DCDS+QV DN + A +++ A A L AGL+L+CG +
Sbjct: 271 HELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAED-AAADALLAGLNLNCGTFLA 329
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
T +A+QQ KV E I+++L YL TV MRLG +DG P+ Y SLG D+C+ E+ LA
Sbjct: 330 KHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCTSEHQTLA 389
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
EAAR+G+VLLKN LPL+++K+K++AVVGPHANAT AMIGNYAGIPC+Y SP+ F
Sbjct: 390 LEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYTSPLQAFQ 448
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
YA V+Y GC +VAC S++ I A AA ADA ++ GLDL++EAESLDR L LPG
Sbjct: 449 KYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRTSLLLPGK 508
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +L++QV + AKGPV++VI+SAG +DI FA +++ I ILWAGYPG+ GG AIA+V+FG
Sbjct: 509 QQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAAIAEVIFG 568
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
NP G+LP TWY ++ + + M +RP S GYPGRTY+FY GPT++ FG GLSYT
Sbjct: 569 DHNPSGKLPATWYPQNFTS-ISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFGDGLSYTS 627
Query: 646 FKYNLLSFTKTIQV-NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV--DFQ 702
+ + + + +Q C L +S C + D + + + +V +
Sbjct: 628 LSAKFIKAPSFLSIPSTAPMQPCTGLKKSSS-----CFHLDATDEKSCESLKSQVAISVR 682
Query: 703 NVGSTDGSDVVIVYSKPPAEIA-ATYIKQVIGFQRVFVRAGR-NKRIKFVFNACKSLNIV 760
N G+ S ++++S PP+ + +Q++GF ++ + + + F + C+
Sbjct: 683 NKGAMAISHTLMLFSTPPSAGSDGVPQRQLVGFNKIQIAGDSISNPVIFDLDPCRHFVHA 742
Query: 761 DYAANTLLPAGEHTIFVGN 779
D LL +G H + GN
Sbjct: 743 DRDGKKLLRSGTHVLTAGN 761
>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
Length = 822
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/780 (49%), Positives = 497/780 (63%), Gaps = 60/780 (7%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
++ FC SLP R +DLV+R+T EKV+ L + A GVPRLG+ YEWWSEALHGVS+
Sbjct: 39 ATLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQ------------------------ 145
GPG F PGAT+FP VI T ASFN +LW+ IGQ
Sbjct: 99 GPGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSCIRISVF 158
Query: 146 --------AVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
AVS E RAMYN G+AGLT+WSPN+N+ RDPRWGR ETPGEDP V RYA
Sbjct: 159 MYVYVCAQAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAA 218
Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETF 257
YVRGLQ + +S LK+++CCKH+ AYD+DNW G DR+HF+A VT QD+E+TF
Sbjct: 219 YVRGLQQQQ-------PSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTF 271
Query: 258 LRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV 317
PF CV +G A+SVMCSYN+VNG+P+CAD L T+R W L GYIV+DCDS+ V
Sbjct: 272 NVPFRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFY 331
Query: 318 DNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLM 377
+ + ++EDAVA TL+AGLDLDCG + +T AV QGKV + DID ++ TV M
Sbjct: 332 SDQHY-TRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQM 390
Query: 378 RLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK-TV 433
RLG FDG P + LG Q +C+ + ELA EAAR+GIVLLKND LPL+ A + V
Sbjct: 391 RLGMFDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAV 450
Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACK-SNNSIFAAS 491
AVVGPHA ATVAMIGNYAG PCRY +P+ G + YA ++ GC DVAC S I AA
Sbjct: 451 AVVGPHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAV 510
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
+AA+ ADATI++AGLD +EAE LDR L LPG Q +LI+ VA+ +KGPVILV+MS G +
Sbjct: 511 DAARRADATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPI 570
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
DI FA+ + I ILWAGYPG+ GG+AIADV+FG NPGG+LP+TWY DY+Q +P+T+M
Sbjct: 571 DIGFAQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNM 630
Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL----NKLQHC 667
+R + GYPGRTY+FY GPT++PFG+GLSYT F +++ + V L
Sbjct: 631 AMRANPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASAS 690
Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVY-------SKP 719
+LN T+ S+ V V RC++ VD +NVG DG+ V+VY +
Sbjct: 691 ASLNATARLSRAAA--VRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAE 748
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
A ++Q++ F++V V AG R++ + C L++ D +P GEH + +G
Sbjct: 749 AAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGE 808
>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 757
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/793 (48%), Positives = 501/793 (63%), Gaps = 75/793 (9%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
VS L FSL LL S++ V A SSPVF CD L +SF FC++SL S R
Sbjct: 13 VSVFLFFSLVCFLLFSSSHVVLAQ--SSPVFACDVVSNPSL----ASFGFCNTSLGVSDR 66
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
V DLV R+TL EK+ L + A V RLG+P+YEWWSEALHGVS VGPGTHF V+PGATS
Sbjct: 67 VVDLVKRLTLQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSSVVPGATS 126
Query: 125 FPTVILTTASFNESLWKKIG----QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
FP VILT ASFN SL+ IG Q VSTEARAMYN+G AGLT+WSPNIN+ RDPRWGR
Sbjct: 127 FPQVILTAASFNTSLFVAIGKVISQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 186
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
ETPGEDP + +Y YV+GLQ + D N LKV++CCKHY AYD+DNWKGVD
Sbjct: 187 QETPGEDPLLSSKYGSGYVKGLQQRD------DGNPDGLKVAACCKHYTAYDLDNWKGVD 240
Query: 241 RYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
RYHF+A VT+QDM++TF PF+ CV +G+ +SVMCSYN+VNGIP+CADP LL+ +RGE
Sbjct: 241 RYHFNAVVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYNKVNGIPTCADPDLLSGVIRGE 300
Query: 300 WDLHG--YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA--GLDLDCGQYYTNFTGNAV 355
W L+G YIV DCDSI V ++ + + E+A A+ + A GLDL+CG + T AV
Sbjct: 301 WKLNGYVYIVTDCDSIDVFYNSQHY-TKTPEEAAAKAILAGIGLDLNCGSFLGKHTEAAV 359
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREG 412
G V E+ ID+++ + LMRLGFFDG P Y LG +D+C+ EN ELA EAAR+G
Sbjct: 360 TAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQG 419
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTY 472
IVLLKN G PC+Y +P+ G + TY
Sbjct: 420 IVLLKN--------------------------------TGTPCKYTTPLQGLAALVATTY 447
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQ 532
GC +VAC S + A + A ADAT+++ G DLS+EAES DR D+ LPG Q LI
Sbjct: 448 LPGCSNVAC-STAQVDDAKKIAAAADATVLVMGADLSIEAESRDRVDILLPGQQQLLITA 506
Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFN---- 588
VA + GPVILVIMS GG+D++FA+TN I +ILW GYPGE GG AIAD++FG +N
Sbjct: 507 VANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADIIFGSYNPSTH 566
Query: 589 --PGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
PGGRLP+TWY YV +P+T+M +RP S GYPGRTY+FY G T+Y FG GLSY++F
Sbjct: 567 QPPGGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSEF 626
Query: 647 KYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGS 706
+ L + V L + C Y+S+ C V + C + F+ + +N G+
Sbjct: 627 SHELTQAPGLVSVPLEENHVC----YSSE-----CKSVAAAEQTCQN-FDVHLRIKNTGT 676
Query: 707 TDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
T GS V ++S PP+ + + K ++GF++VF+ A + + F + CK L++VD +
Sbjct: 677 TSGSHTVFLFSTPPS-VHNSPQKHLVGFEKVFLHAQTDSHVGFKVDVCKDLSVVDELGSK 735
Query: 767 LLPAGEHTIFVGN 779
+ GEH + +G+
Sbjct: 736 KVALGEHVLHIGS 748
>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
Length = 779
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/765 (48%), Positives = 485/765 (63%), Gaps = 59/765 (7%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
F FC++ LP S RV+DL+SRMTL EK+ QL + A G+PRLGLP+YEWW EALHGV+ V P
Sbjct: 44 FGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVA-VSP 102
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
G F PGATSFP ILT ASF+ AVSTEARAM+N RAGLTYWSPN+N+
Sbjct: 103 GVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGLTYWSPNVNI 153
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
RDPRWGR ETPGEDP + +YA YVRGLQD T+L LKVS+CCKH AY
Sbjct: 154 YRDPRWGRGQETPGEDPLLSSKYATFYVRGLQD-------TNLGGDKLKVSACCKHMTAY 206
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
DVDNWKG R+ F+A VT+QD+ +T+ PF+ CV++ SSVMCSYNRVNG+P+CAD L
Sbjct: 207 DVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRVNGVPTCADYNL 266
Query: 292 LNQTVRGEWDLHG----------------YIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
L+ TVR W+L+G YIV+DCDS+Q DN + A + ED VA L
Sbjct: 267 LSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNY-AKTAEDVVADAL 325
Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLG 392
AGL+LDCG + T +A+ GK+ E +++++L+YLY V MRLG +DG+P+ Y +LG
Sbjct: 326 LAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGNPRSQPYGNLG 385
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
Q +C+ EN +LA +AA+EGIVLLKN+ N LP + + ++TVA +GPHA AT AMIGNY G
Sbjct: 386 PQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKATRAMIGNYQG 445
Query: 453 IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
IPC+Y +P G S YA V Y GC DVAC SN+ I +A+ A ADA ++ GLDL+ EA
Sbjct: 446 IPCKYTTPHDGLSAYARVVYSAGCSDVACYSNSLIGSAASTASQADAVVLFVGLDLNQEA 505
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E DR L LPG Q +L+ +V + AKGPV+LVI S G VD++FA+ + ++ +LWAGYPG
Sbjct: 506 EGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVSFAKYDKKVQGMLWAGYPG 565
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
E GG AIA V+FG NPGGRLP+TWY + + L M +RP S GYPGRTY+FY G
Sbjct: 566 EAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITML-DMNMRPDASRGYPGRTYRFYTGQ 624
Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV------ 686
++Y FGYG +Y++ + K ++L + + A K C G L
Sbjct: 625 SVYNFGYGKTYSKLSHKF----KEAPLSLGFPE--------AAAVKRSCDGNLTCFHLNA 672
Query: 687 -NDLRCDDYF-EFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGR 743
+++ C + ++ N G + V++YS PP A I+Q+ GF +V V G
Sbjct: 673 HDEITCSTLTSKVRILVHNEGDRPSNRAVLLYSSPPNAGRDGAPIRQLAGFGKVSVAPGA 732
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
+ ++ + CK L+ +L G HT+ VGN PI L
Sbjct: 733 VENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPILL 777
>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
Length = 779
Score = 708 bits (1828), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/765 (48%), Positives = 483/765 (63%), Gaps = 59/765 (7%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
F FC++ LP S RV+DL+SRMTL EK+ QL + A G+PRLGLP+YEWW EALHGV+ V P
Sbjct: 44 FGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVA-VSP 102
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
G F PGATSFP ILT ASF+ AVSTEARAM+N RAGLTYWSPN+N+
Sbjct: 103 GVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGLTYWSPNVNI 153
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
RDPRWGR ETPGEDP + +YA YVRGLQD T+L LKVS+CCKH AY
Sbjct: 154 YRDPRWGRGQETPGEDPLLSSKYATFYVRGLQD-------TNLGGDKLKVSACCKHMTAY 206
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
DVDNWKG R+ F+A VT+QD+ +T+ PF+ CV++ SSVMCSYNRVNG+P+CAD L
Sbjct: 207 DVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRVNGVPTCADYNL 266
Query: 292 LNQTVRGEWDLHG----------------YIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
L+ TVR W+L+G YIV+DCDS+Q DN + A + ED VA L
Sbjct: 267 LSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNY-AKTAEDVVADAL 325
Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLG 392
AGL+LDCG + T +A+ GK+ E +++++L+YLY V MRLG +DG+P+ Y +LG
Sbjct: 326 LAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGNPRSQPYGNLG 385
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
Q +C+ EN +LA +AA+EGIVLLKN+ N LP + + ++TVA +GPHA AT AMIGNY G
Sbjct: 386 PQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKATRAMIGNYQG 445
Query: 453 IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
IPC+Y +P G S YA V Y GC DVAC S++ I +A A ADA ++ GLDL+ EA
Sbjct: 446 IPCKYTTPHDGLSAYARVVYSAGCSDVACYSDSLIGSAVSTASQADAVVLFVGLDLNQEA 505
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E DR L LPG Q +L+ +V + AKGP +LVI S G VD++FA+ N ++ ILWAGYPG
Sbjct: 506 EGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVSFAKYNNKVQGILWAGYPG 565
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
E GG AIA V+FG NPGGRLP+TWY + + L M +RP S GYPGRTY+FY G
Sbjct: 566 EAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITML-DMNMRPDASRGYPGRTYRFYTGQ 624
Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV------ 686
++Y FGYG +Y++ + K ++L + + A K C G L
Sbjct: 625 SVYNFGYGKTYSKLSHKF----KEAPLSLGFPE--------AAAVKRSCDGNLTCFHLNA 672
Query: 687 -NDLRCDDYF-EFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGR 743
+++ C + ++ N G + V++YS PP A I+Q+ GF +V V G
Sbjct: 673 HDEITCSTLTSKVRILVHNKGDRPSNRAVLLYSSPPNAGRDGAPIRQLAGFGKVSVAPGA 732
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
+ ++ + CK L+ +L G HT+ VGN PI L
Sbjct: 733 VENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPILL 777
>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
Length = 772
Score = 707 bits (1826), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/739 (49%), Positives = 497/739 (67%), Gaps = 24/739 (3%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+++F FC++SL + RV+D V+R+TL+EK+ QL + A G+PRLG+P+Y+WW EALHGV++
Sbjct: 39 LAAFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVAS 98
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
PG F +P ATSFP I T ASFN SL+ IGQAVSTEARAM+NLG++GLT+WSPN
Sbjct: 99 -SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGLTFWSPN 157
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN+ RDPRWGR ETPGEDP + +A YVRGLQ+ + S LKVS+CCKH
Sbjct: 158 INIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQA-------GSDKLKVSACCKHM 210
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AYDVDNW G DRYHF+A VTEQD+E+T+ PF+ CV++G SSVMCSYNR+NG+P+CAD
Sbjct: 211 TAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRLNGVPTCAD 270
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
+LL TVR W L+GYIV+DCDS+QV DN + A +++ A A L AGL+L+CG +
Sbjct: 271 HELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAED-AAADALLAGLNLNCGTFLA 329
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
T +A+QQ KV E I+++L YL TV MRLG +DG P+ Y SLG D+C+ E+ LA
Sbjct: 330 KHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCTSEHQTLA 389
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
EAAR+G+VLLKN LPL+++K+K++AVVGPHANAT AMIGNYAGIPC+Y SP+ F
Sbjct: 390 LEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYTSPLQAFQ 448
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
YA V+Y GC +VAC S++ I A AA ADA ++ GLDL++EAESLDR L LPG
Sbjct: 449 KYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRTSLLLPGK 508
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +L++QV + AKGPV++VI+SAG +DI FA +++ I ILWAGYPG+ GG AIA+V+FG
Sbjct: 509 QQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAAIAEVIFG 568
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
NP G+LP TWY ++ + + M +RP S GYPGRTY+FY GPT++ FG GLSYT
Sbjct: 569 DHNPSGKLPATWYPQNFTS-ISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFGDGLSYTS 627
Query: 646 FKYNLLSFTKTIQV-NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV--DFQ 702
+ + + + +Q C L +S C + D + + + +V +
Sbjct: 628 LSAKFIKAPSFLSIPSTAPMQPCTGLKKSSS-----CFHLDATDEKSCESLKSQVAISVR 682
Query: 703 NVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGR-NKRIKFVFNACKSLNIV 760
N G+ S ++++S PP A +Q++GF ++ + + + F + C+
Sbjct: 683 NKGAMAISHTLMLFSTPPNAGSDGVPQRQLVGFNKIQIAGDSISNPVIFDLDPCRHFVHA 742
Query: 761 DYAANTLLPAGEHTIFVGN 779
D LL +G H + GN
Sbjct: 743 DPDGKKLLRSGTHVLTAGN 761
>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
Length = 790
Score = 705 bits (1819), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/762 (47%), Positives = 480/762 (62%), Gaps = 52/762 (6%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
G ++ FC SLP R +DLVSR+T EKV+ L + A GV RLG+ YEWWSEALHG
Sbjct: 39 GGPATTLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEALHG 98
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
VS+ GPG F PGAT+FP VI A+ N +LW+ IG+AVS EARAMYN GRAGLT+W
Sbjct: 99 VSDTGPGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRAGLTFW 158
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPN+N+ RDPRWGR ETPGEDP + RYA YVRGLQ H LK+++CC
Sbjct: 159 SPNVNIFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQQPYDHNR--------LKLAACC 210
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+ AYD+D+W G DR+HF+A V+ QD+E+TF PF CV G A+SVMCSYN+VNG+P+
Sbjct: 211 KHFTAYDLDSWGGTDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAASVMCSYNQVNGVPT 270
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CAD L T+R W L GYIV+DCDS+ V + + + EDAVA TL+AGLDLDCG
Sbjct: 271 CADQGFLRGTIRKAWGLDGYIVSDCDSVDVFFRDQHY-TRTAEDAVAATLRAGLDLDCGP 329
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENI 402
+ +T NAV + KV + D+D +L TV MRLG FDG P + LG D+C+ +
Sbjct: 330 FLALYTENAVARKKVSDADVDAALLNTVTVQMRLGMFDGDPASGPFGHLGAADVCTKAHQ 389
Query: 403 ELAAEAAREGIVLLKN-------DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
+LA +AAR+ +VLLKN D++ LPL A + VAVVGPHA+ATVAMIGNYAG PC
Sbjct: 390 DLALDAARQSVVLLKNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGKPC 449
Query: 456 RYMSPIAGFSGY-ANVTYKTGCDDVACKSNNS-IFAASEAAKTADATIILAGLDLSVEAE 513
RY +P+ G + Y A V ++ GC DVAC+ N I AA +AA+ GL S
Sbjct: 450 RYTTPLQGVAAYAARVVHQAGCADVACQGKNQPIAAAVDAARRLTPPSSSPGLTRS---- 505
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
L LPG Q +LI+ VA+ AKGPVILV+MS G +DIAFA+ + I ILW GYPG+
Sbjct: 506 ------LLLPGRQAELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYPGQ 559
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+AIADV+FG+ NPGG+LP+TWY DY++ +P+T+M +R + GYPGRTY+FY GPT
Sbjct: 560 AGGQAIADVIFGQHNPGGKLPVTWYPQDYLEKVPMTNMAMRANPARGYPGRTYRFYTGPT 619
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN--------LNYTSDASKTRCPGVL 685
++ FG+GLSYTQF + L + V L+ + LN T + R
Sbjct: 620 IHAFGHGLSYTQFTHTLAHAPAQLTVRLSTSSASASASASAASLLNATRPSRAVR----- 674
Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-------IKQVIGFQRV 737
V RC+ VD +NVG DG+ V+VY P+ +++ +Q++ F++V
Sbjct: 675 VAHARCEGLTVPVHVDVRNVGDRDGAHAVLVYHVAPSSSSSSAPAGTDAPARQLVAFEKV 734
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
V AG R++ + C L++ D +P GEH + +G
Sbjct: 735 HVPAGGVARVEMGIDVCDRLSVADRDGVRRIPVGEHRLMIGE 776
>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 709
Score = 702 bits (1812), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/696 (49%), Positives = 466/696 (66%), Gaps = 22/696 (3%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN SL++ IG+ VST
Sbjct: 21 RLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVST 80
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +YAV YV GLQD
Sbjct: 81 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA---- 136
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
A + LKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF PF+ CV +G+
Sbjct: 137 GAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGN 196
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNG P+CAD LL +RG+W L+GYIV+DCDS+ V+ + + E+
Sbjct: 197 VASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEE 255
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A A T+K+GLDL+CG + T AVQ G++ E D+D+++ + +LMRLGFFDG P+
Sbjct: 256 AAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQL 315
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ SLG +D+C+ N ELA E AR+GIVLLKN LPL++ +K++AV+GP+ANA+ M
Sbjct: 316 AFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTM 374
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
IGNY G PC+Y +P+ G N Y+ GC +V C N+ + A AA +AD T+++ G
Sbjct: 375 IGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVG 434
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
D S+E ESLDR L LPG QTQL++ VA + GPVILV+MS G DI+FA+ + I AI
Sbjct: 435 ADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAI 494
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LW GYPGE GG A+AD++FG NP GRLP+TWY Y + +T M +RP S GYPGRT
Sbjct: 495 LWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRT 554
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGV 684
Y+FY G T++ FG GLSYT+ ++L+S + + + L + CR C V
Sbjct: 555 YRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASV 605
Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
CDD F+ K+ +N G G+ V+++S PP A K ++GF++V + G
Sbjct: 606 EAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAP-AKHLLGFEKVSLAPGE 664
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ F + C+ L++VD + G HT+ VG+
Sbjct: 665 AGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGD 700
>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
Length = 772
Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/753 (46%), Positives = 471/753 (62%), Gaps = 45/753 (5%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
SSF FCD SLP RV DLV RM L EK+ Q+ A G+PRLG+P Y+WW EALHGV+
Sbjct: 31 SSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALHGVAE- 89
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
PG F +P ATSFP VILT ASFN SLW KI QA+S EA AMYN GR+GLT+WSPNI
Sbjct: 90 SPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSGLTFWSPNI 149
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA--TDLNSRP--LKVSSCC 225
N+ RDPRWGR ETPGEDP + +YA +VRGLQ+ + E + + RP LKVSSCC
Sbjct: 150 NIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQRRPTRLKVSSCC 209
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+ AYD++ +G D +HF+A+VT QD+++TF PF C+ +G AS +MCSYNRVNG+PS
Sbjct: 210 KHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSYNRVNGVPS 269
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CAD L +TVR W GYIV+DCD++ ++ + + + EDAVA L AG+DL+CG
Sbjct: 270 CADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINY-TTTAEDAVADVLSAGMDLNCGT 328
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIE 403
+ T A++QGKV E +D++L + TV MRLG FDG+ Y S+G +C+ E+ +
Sbjct: 329 FLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNSGETYNSIGPDAVCTREHRQ 388
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
L+ EAA +GIVLLKN N LP + T+AV+GP NAT M+GNYAG+PC+Y++P G
Sbjct: 389 LSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPCQYITPFQG 448
Query: 464 FSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
Y V ++ GC D+ C AA AA+ +DA +I+ GLD E E LDR L L
Sbjct: 449 LQEYTKGVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREGLDRTSLLL 508
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
PGYQ L+ +V++VAKGPVILV+MS G +D+ FA+ N I ++LW GYPGE GG+AIA V
Sbjct: 509 PGYQQDLVLEVSKVAKGPVILVVMSGGPIDVTFAKGNCKISSVLWVGYPGEAGGKAIARV 568
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
+FG NP GRLP+TWY + + + + +M LRP S G+PGRTY+FY G +Y FG+GLS
Sbjct: 569 IFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFPGRTYRFYTGENVYEFGHGLS 628
Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF- 701
YT F Y S I + R P LR D F +D+
Sbjct: 629 YTNFTYTNFSAPSNITAR--------------NTVAIRTP------LREDGARHFPIDYT 668
Query: 702 -------------QNVGSTDGSDVVIVYSKPPAEIAATYI--KQVIGFQRVFVRAGRNKR 746
N G+ D + ++Y+ PPA ++ KQ+I F+R + AGR +
Sbjct: 669 GCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQHLIAGRCAK 728
Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++F + CK L + + A +L G++ + +G+
Sbjct: 729 VEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGD 761
>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
[Cucumis sativus]
Length = 783
Score = 697 bits (1799), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/776 (45%), Positives = 493/776 (63%), Gaps = 38/776 (4%)
Query: 27 ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAH 86
A SS P + CD + FC + LP +R +DLVSR+TLDEKV QL +
Sbjct: 30 AGSSSQPPYACDSSN-----PLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVP 84
Query: 87 GVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQA 146
+PRLG+P YEWWSEALHGV+NVG G + I ATSFP VILT ASF+E+LW +IGQA
Sbjct: 85 PIPRLGIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQA 144
Query: 147 VSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD- 204
+ TEARA+YN G+A G+T+W+PNIN+ RDPRWGR ETPGEDP + G+Y+V YVRG+Q
Sbjct: 145 IGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGD 204
Query: 205 -VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
+EG + LK S+CCKH+ AYD+D W G+ RY FDA+VT QDM +T+ PFE
Sbjct: 205 AIEGGKLGNQ-----LKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFES 259
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
CV+EG AS +MC+YNRVNG+PSCAD LL T R +W +GYI +DCD++ ++ D +
Sbjct: 260 CVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGY- 318
Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
A EDAVA L+AG+D++CG Y T +AV+ KV ID++L+ L++V MRLG FD
Sbjct: 319 AKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFD 378
Query: 384 GSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
G+P + +G+ +CS ++ LA +AAREGIVLLKN LPL+ + ++AV+G +
Sbjct: 379 GNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNG 438
Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
N + GNYAGIPC+ +P G + Y N Y GC+ C + +I+ A + AK+ D
Sbjct: 439 NDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKIAKSVDY 497
Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
+++ GLD + E E DR +L LPG Q +LI +VA+ AK PVILVI+S G VDI+ A+ N
Sbjct: 498 VVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVDISSAKYN 557
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
I +ILWAGYPG+ GG AIA+++FG NPGGRLP+TWY D+++ P+T M +R S
Sbjct: 558 EKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIK-FPMTDMRMRADSST 616
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQ--FKYNLLSFTKTIQVNLNKLQHCRNLN-----Y 672
GYPGRTY+FYNGP +Y FGYGLSY+ +++ +S +K + + Q +N +
Sbjct: 617 GYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRL 676
Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
S+ K C VN V +N G G V+++ KP I + +KQ++
Sbjct: 677 VSELDKKFCESKTVN---------VTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLV 727
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
GF++V + AG + I+F+ + C ++ ++ G +++ VG+ V P+ +
Sbjct: 728 GFKKVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGD--VEHPLDI 781
>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
Length = 783
Score = 697 bits (1799), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/776 (45%), Positives = 493/776 (63%), Gaps = 38/776 (4%)
Query: 27 ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAH 86
A SS P + CD + FC + LP +R +DLVSR+TLDEKV QL +
Sbjct: 30 AGSSSQPPYACDSSN-----PLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVP 84
Query: 87 GVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQA 146
+PRLG+P YEWWSEALHGV+NVG G + I ATSFP VILT ASF+E+LW +IGQA
Sbjct: 85 PIPRLGIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQA 144
Query: 147 VSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD- 204
+ TEARA+YN G+A G+T+W+PNIN+ RDPRWGR ETPGEDP + G+Y+V YVRG+Q
Sbjct: 145 IGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGD 204
Query: 205 -VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
+EG + LK S+CCKH+ AYD+D W G+ RY FDA+VT QDM +T+ PFE
Sbjct: 205 AIEGGKLGNQ-----LKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFES 259
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
CV+EG AS +MC+YNRVNG+PSCAD LL T R +W +GYI +DCD++ ++ D +
Sbjct: 260 CVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGY- 318
Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
A EDAVA L+AG+D++CG Y T +AV+ KV ID++L+ L++V MRLG FD
Sbjct: 319 AKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFD 378
Query: 384 GSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
G+P + +G+ +CS ++ LA +AAREGIVLLKN LPL+ + ++AV+G +
Sbjct: 379 GNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNG 438
Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
N + GNYAGIPC+ +P G + Y N Y GC+ C + +I+ A + AK+ D
Sbjct: 439 NDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKIAKSVDY 497
Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
+++ GLD + E E DR +L LPG Q +LI +VA+ AK PVILVI+S G VDI+ A+ N
Sbjct: 498 VVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVDISSAKYN 557
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
I +ILWAGYPG+ GG AIA+++FG NPGGRLP+TWY D+++ P+T M +R S
Sbjct: 558 EKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIK-FPMTDMRMRADSST 616
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQ--FKYNLLSFTKTIQVNLNKLQHCRNLN-----Y 672
GYPGRTY+FYNGP +Y FGYGLSY+ +++ +S +K + + Q +N +
Sbjct: 617 GYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRL 676
Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
S+ K C VN V +N G G V+++ KP I + +KQ++
Sbjct: 677 VSELDKKFCESKTVN---------VTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLV 727
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
GF++V + AG + I+F+ + C ++ ++ G +++ VG+ V P+ +
Sbjct: 728 GFKKVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGD--VEHPLDI 781
>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
Length = 784
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/764 (46%), Positives = 495/764 (64%), Gaps = 37/764 (4%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+S P + C G + FCD++LP RV DLVSR+T+ EK+ QLGD + +P
Sbjct: 35 ASEPPYTCGAG-------APPNIPFCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIP 87
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WWSEALHGV+N G G H D + ATSFP VILT ASFN LW +IGQ +
Sbjct: 88 RLGVPAYKWWSEALHGVANAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGV 147
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARA+YN G+A GLT+W+PNINV RDPRWGR ETPGEDP + G+YA +VRG+Q G+
Sbjct: 148 EARAVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GY 204
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
A +NS L+ S+CCKH+ AYD++NWKG+ RY +DA+VT QD+E+T+ PF+ CV++G
Sbjct: 205 GVAGPVNSTDLEASACCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYNPPFKSCVEDG 264
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS +MCSYNRVNG+P+CAD LL++T R W +GYI +DCD++ ++ D + A + E
Sbjct: 265 HASGIMCSYNRVNGVPTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHDAQGY-AKTSE 323
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
DAVA LKAG+D++CG Y + +A+QQGK+ E DI+++L L+TV MRLG F+G P+
Sbjct: 324 DAVADVLKAGMDVNCGGYVQKYGASALQQGKITEQDINRALHNLFTVRMRLGLFNGDPRR 383
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y ++G +C+ E+ +LA EAA++GIVLLKND LPL+ + V ++AV+G +AN +
Sbjct: 384 NRYGNIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVIGFNANNATS 443
Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
++GNY G PC ++P+ GY + ++ GC+ AC +I A +AA +AD+ ++
Sbjct: 444 LLGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVT-TIPEAVQAASSADSVVLFM 502
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GLD + E E +DR DL LPG Q LI VA AK PVILV++ G VD++FA+TN I A
Sbjct: 503 GLDQNQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGA 562
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPGE GG AIA V+FG+ NPGGRLP+TWY D+ + +P+T M +R + GYPGR
Sbjct: 563 ILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTK-VPMTDMRMRADPATGYPGR 621
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+FY GPT++ FGYGLSY+++ + ++ N+ L+ A T GV
Sbjct: 622 TYRFYRGPTVFNFGYGLSYSKYSHRFVTKPPPSMSNVAGLK----------ALATTAGGV 671
Query: 685 LVNDLR------CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQ 735
D+ CD F V QN G DG V+V+ + P + + +Q+IGFQ
Sbjct: 672 ATYDVEAIGSETCDRLKFPAVVRVQNHGPMDGKHPVLVFLRWPNATDGSGRPARQLIGFQ 731
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ +RA + ++F + CK + ++ G H + VG+
Sbjct: 732 SLHLRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGD 775
>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
Length = 1411
Score = 696 bits (1796), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/764 (46%), Positives = 489/764 (64%), Gaps = 52/764 (6%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
SSSP F CD S+ FC+++L S R DL+SR+TLDEK+ QL A +P
Sbjct: 693 SSSPPFACDSS-----DPLTKSYAFCNTTLRISQRASDLISRLTLDEKISQLISSAASIP 747
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHG+ + G F+ I ATSFP VILT ASF+ LW +IGQA+
Sbjct: 748 RLGIPAYEWWSEALHGIRDRH-GIRFNGTIRSATSFPQVILTAASFDAHLWYRIGQAIGI 806
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
E RAMYN G+A G+T+W+PNIN+ RDPRWGR ETPGEDP V G+YAV+YVRGLQ
Sbjct: 807 ETRAMYNAGQAMGMTFWAPNINIFRDPRWGRGQETPGEDPVVAGKYAVSYVRGLQGDTFE 866
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
D+ L+ S+CCKH+ AYD+DNW +DRY FDARVT QD+ +T+ PF C++EG
Sbjct: 867 GGKVDV----LQASACCKHFTAYDLDNWTSIDRYTFDARVTMQDLADTYQPPFRSCIEEG 922
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS +MC+YN VNG+P+CAD LL++T RG+W GYIV+DCD++ ++ D + A S E
Sbjct: 923 RASGLMCAYNLVNGVPNCADFNLLSKTARGQWGFDGYIVSDCDAVSLVHDVQGY-AKSPE 981
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
DAVA L AG+D+ CG Y +AV Q K+ E++ID++L L+TV MRLG F+G+P+
Sbjct: 982 DAVAIVLTAGMDVACGGYLQKHAKSAVSQKKLTESEIDRALLNLFTVRMRLGLFNGNPRK 1041
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
+ ++G +CS E+ LA EAAR GIVLLKN LPL+ + ++AV+GP+ANAT
Sbjct: 1042 LPFGNIGPDQVCSTEHQTLALEAARSGIVLLKNSDRLLPLSKGETLSLAVIGPNANATDT 1101
Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
++GNYAG PC+++SP+ G Y N T Y GC+DVAC S+ SI A + AK AD +++
Sbjct: 1102 LLGNYAGPPCKFISPLQGLQSYVNNTMYHAGCNDVAC-SSASIENAVDVAKQADYVVLVM 1160
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GLD + E E DR DL LPG Q QLI VA+ AK PV+LV++ G VDI+FA+ ++NI +
Sbjct: 1161 GLDQTQEREKYDRLDLVLPGKQEQLITGVAKAAKKPVVLVLLCGGPVDISFAKGSSNIGS 1220
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPGE GG AIA+ +FG NPGGRLP+TWY D+++ +P+T M +RP GYPGR
Sbjct: 1221 ILWAGYPGEAGGAAIAETIFGDHNPGGRLPVTWYPKDFIK-IPMTDMRMRPEPQSGYPGR 1279
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
T++FY G T++ FG GLSY+ + Y LS T NKL Y + S T
Sbjct: 1280 THRFYTGKTVFEFGNGLSYSPYSYEFLSVTP------NKL-------YLNQPSTTHV--- 1323
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
+N G G V+++ K + +KQ++GFQ VF+ AG +
Sbjct: 1324 ----------------VENSGKMAGKHPVLLFVKQAKAGNGSPMKQLVGFQNVFLDAGES 1367
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
++F+ + C+ L+ + ++ G H + VG+ +PI +
Sbjct: 1368 SNVEFILSPCEHLSRANKDGLMVMEQGIHLLVVGDK--EYPIAI 1409
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/684 (49%), Positives = 459/684 (67%), Gaps = 37/684 (5%)
Query: 13 LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
L I L+ + V + SP F CD S S+ FC ++LP RV+DLVSR+
Sbjct: 7 LLINLIYVTVILVGVESTQSPPFSCDSSNPS-----TKSYHFCKTTLPIPDRVRDLVSRL 61
Query: 73 TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
TLDEK+ QL + A +PRLG+P YEWWSEALHGV++ GPG F+ I ATSFP VILT
Sbjct: 62 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQVILTA 121
Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVV 191
ASF+ LW +IG+A+ EARA+YN G+ G+T+W+PNIN+ RDPRWGR ETPGEDP V
Sbjct: 122 ASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 181
Query: 192 GRYAVNYVRGLQD--VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
G YAV+YVRG+Q + G + +L + S+CCKH+ AYD+D+WKG+DR+ FDARVT
Sbjct: 182 GSYAVSYVRGVQGDCLRGLKRCGEL-----QASACCKHFTAYDLDDWKGIDRFKFDARVT 236
Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
QD+ +T+ PF C++EG AS +MC+YNRVNG+PSCAD LL T R W+ GYI +D
Sbjct: 237 MQDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSD 296
Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSL 369
CD++ ++ D++ F A + EDAV LKAG+D++CG Y N T +AV Q K+ E+++D++L
Sbjct: 297 CDAVSLIHDSYGF-AKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRAL 355
Query: 370 KYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN 426
+ L+ V MRLG F+G+P+ Y +G +CS E+ LA +AAR+GIVLLKN Q LPL
Sbjct: 356 ENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLP 415
Query: 427 SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNN 485
K ++AV+GP+AN+ +IGNYAG PC++++P+ Y T Y GCD VAC S+
Sbjct: 416 KGKTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVAC-SSP 474
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
SI A E A+ AD +++ GLD + E E+ DR DL LPG Q QLI VA AK PV+LV+
Sbjct: 475 SIEKAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVL 534
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
+S G VDI+FA+ + NI +ILWAGYPG GG AIA+ +FG NPGGRLP+TWY D+ +
Sbjct: 535 LSGGPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDFTK- 593
Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL- 664
+P+T M +RP + GYPGRTY+FY G ++ FGYGLSY+ + +TI V NKL
Sbjct: 594 IPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYSTYS------CETIPVTRNKLY 647
Query: 665 ----------QHCRNLNYTSDASK 678
++ ++ YTS A K
Sbjct: 648 FNQSSTAHVYENTDSIRYTSMAGK 671
>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
Length = 772
Score = 694 bits (1790), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/754 (46%), Positives = 471/754 (62%), Gaps = 47/754 (6%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
SSF FCD SLP RV DLV RM L EK+ Q+ A G+PRLG+P Y+WW EALHGV+
Sbjct: 31 SSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALHGVAE- 89
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
PG F +P ATSFP VILT ASFN SLW KI QA+S EA AMYN GR+GLT+WSPNI
Sbjct: 90 SPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSGLTFWSPNI 149
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA--TDLNSRP--LKVSSCC 225
N+ RDPRWGR ETPGEDP + +YA +VRGLQ+ + E + + P LKVSSCC
Sbjct: 150 NIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQGSPTRLKVSSCC 209
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+ AYD++ +G D +HF+A+VT QD+++TF PF C+ +G AS +MCSYNRVNG+PS
Sbjct: 210 KHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSYNRVNGVPS 269
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CAD L +TVR W GYIV+DCD++ ++ + + + EDAVA L AG+DL+CG
Sbjct: 270 CADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINY-TTTAEDAVADVLSAGMDLNCGT 328
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIE 403
+ T A++QGKV E +D++L + TV MRLG FDG+ Y S+G +C+ E+ +
Sbjct: 329 FLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNSGETYNSIGPDAVCTPEHRQ 388
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
L+ EAA +GIVLLKN N LP + T+AV+GP NAT M+GNYAG+PC+Y++P G
Sbjct: 389 LSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPCQYITPFQG 448
Query: 464 FSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
Y V ++ GC D+ C AA AA+ +DA +I+ GLD E E LDR L L
Sbjct: 449 LQEYTKCVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREGLDRTSLLL 508
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
PG Q L+ +V++VAKGPVILV+MS G +D+ FA+ N I +LW GYPGE GG+AIA V
Sbjct: 509 PGNQQGLVLEVSKVAKGPVILVVMSGGPIDVTFAKENCKISNVLWVGYPGEAGGKAIARV 568
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
+FG NP GRLP+TWY + + + + +M LRP S G+PGRTY+FY G +Y FG+GLS
Sbjct: 569 IFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFPGRTYRFYTGENVYEFGHGLS 628
Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDF 701
YT F Y C N T+ + R P LR D +F +D+
Sbjct: 629 YTNFTYT---------------NFCAPSNITARNTVAIRTP------LREDGARQFPIDY 667
Query: 702 --------------QNVGSTDGSDVVIVYSKPPAEIAATYI--KQVIGFQRVFVRAGRNK 745
N G+ D + ++Y+ PPA ++ KQ+I F+R + AGR
Sbjct: 668 TGCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQHLIAGRCA 727
Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+++F + CK L + + A +L G++ + +G+
Sbjct: 728 KVEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGD 761
>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
sativus]
Length = 767
Score = 692 bits (1786), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/743 (45%), Positives = 484/743 (65%), Gaps = 18/743 (2%)
Query: 51 SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
S+ FC+ SL ++ R + LVS +TLDEK+QQL + A +PRLG+P Y+WWSE LHG++ G
Sbjct: 19 SYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG 78
Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
PG F+ I ATSFP V++T ASFN +LW IG A++ EARAM+N+G+ GLT W+PNIN
Sbjct: 79 PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIWAPNIN 138
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD---VEGHENATDLNSR-----PLKVS 222
+ RDPRWGR ETPGEDP V Y++ +VRGLQ ++ HE ++ L VS
Sbjct: 139 IFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMGSLMVS 198
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+CCKH+ AYD++ W RY FD+ VTEQD+ +T+ PF C+++G AS +MCSYN VNG
Sbjct: 199 ACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSYNAVNG 258
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P+CA+P LL + R +W L GYI +DCD++ + + K+ D+ EDA+A LKAG+D++
Sbjct: 259 VPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKY-TDTPEDAIADVLKAGMDIN 316
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSD 399
CG + T +A+ QGKV+E ++D +L L++V RLGFFDG+P ++ LG QD+C+
Sbjct: 317 CGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQDVCTA 376
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
++ LA EAAR+GIVLLKN+ LPL+ + ++ V+G AN + ++G YAG+PC MS
Sbjct: 377 QHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVPCSPMS 436
Query: 460 PIAGFSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
+ GF YA + + +GC DV C S+N A AK AD I +AGLD S E E LDR
Sbjct: 437 LVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETEDLDRV 496
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
L LPG Q L++ VA V+K P+ILV++ G +DI+FA+ ++ + +ILW G PGE GG+A
Sbjct: 497 SLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGEAGGKA 556
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+A+V+FG +NPGGRLP+TWY + +P+ M +RP S GYPGRTY+FY G +Y FG
Sbjct: 557 LAEVIFGDYNPGGRLPVTWYPQSFTN-VPMNDMHMRPNPSRGYPGRTYRFYTGDRIYGFG 615
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY--FE 696
GLSYT FKY LLS K + + L K + R + V ++ D FE
Sbjct: 616 EGLSYTSFKYRLLSAPKKVNL-LGKAETSRRRIIPQVRDGVNMSYMEVEEVESCDLLRFE 674
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
K+ N+G DGS VV+++S+ P + T +Q+IGF R++V+ ++ + + C
Sbjct: 675 VKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIMVDPCNH 734
Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
+++ D ++P G+HTI +G+
Sbjct: 735 VSLADEYGKRVIPLGDHTISLGD 757
>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
Precursor
gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
Length = 792
Score = 692 bits (1785), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/795 (45%), Positives = 487/795 (61%), Gaps = 40/795 (5%)
Query: 11 FSLSIALLVFSTNAVD---ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
L++ L+F T+A+ N S P F C P FS S+ FC+ SL R
Sbjct: 3 LQLTLISLLFFTSAIAETFKNLDSHPQFPCKPPHFS-------SYPFCNVSLSIKQRAIS 55
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
LVS + L EK+ QL + A VPRLG+P YEWWSE+LHG+++ GPG F+ I ATSFP
Sbjct: 56 LVSLLMLPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQ 115
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
VI++ ASFN +LW +IG AV+ E RAMYN G+AGLT+W+PNINV RDPRWGR ETPGED
Sbjct: 116 VIVSAASFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGED 175
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSR-------------PLKVSSCCKHYAAYDVD 234
P VV Y V +VRG Q+ + + S L +S+CCKH+ AYD++
Sbjct: 176 PKVVSEYGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLE 235
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
W RY F+A VTEQDME+T+ PFE C+++G AS +MCSYN VNG+P+CA LL Q
Sbjct: 236 KWGNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLL-Q 294
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
R EW GYI +DCD++ + + S E+AVA +KAG+D++CG Y T +A
Sbjct: 295 KARVEWGFEGYITSDCDAVATIFAYQGY-TKSPEEAVADAIKAGVDINCGTYMLRHTQSA 353
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAARE 411
++QGKV E +D++L L+ V +RLG FDG P QY LG DICS ++ +LA EA R+
Sbjct: 354 IEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQ 413
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT 471
GIVLLKND LPLN V ++A+VGP AN M G Y G PC+ + Y T
Sbjct: 414 GIVLLKNDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKT 473
Query: 472 -YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
Y +GC DV+C S+ A AK AD I++AGLDLS E E DR L LPG Q L+
Sbjct: 474 SYASGCSDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLV 533
Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
+ VA V+K PVILV+ G VD+ FA+ + I +I+W GYPGE GG+A+A+++FG FNPG
Sbjct: 534 SHVAAVSKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 593
Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
GRLP TWY + + ++ M +R S GYPGRTY+FY GP +Y FG GLSYT+F+Y +
Sbjct: 594 GRLPTTWYPESFTD-VAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKI 652
Query: 651 LSFTKTIQVNLNKL-----QHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNV 704
LS I+++L++L H + L + + + V+VN C+ F +V N
Sbjct: 653 LS--APIRLSLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNS--CESLRFNVRVHVSNT 708
Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
G DGS VV+++SK P ++ KQ+IG+ RV VR+ FV + CK L++ +
Sbjct: 709 GEIDGSHVVMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVG 768
Query: 765 NTLLPAGEHTIFVGN 779
++P G H +F+G+
Sbjct: 769 KRVIPLGSHVLFLGD 783
>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
Length = 777
Score = 691 bits (1784), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/743 (45%), Positives = 484/743 (65%), Gaps = 18/743 (2%)
Query: 51 SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
S+ FC+ SL ++ R + LVS +TLDEK+QQL + A +PRLG+P Y+WWSE LHG++ G
Sbjct: 29 SYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG 88
Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
PG F+ I ATSFP V++T ASFN +LW IG A++ EARAM+N+G+ GLT W+PNIN
Sbjct: 89 PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIWAPNIN 148
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD---VEGHENATDLNSR-----PLKVS 222
+ RDPRWGR ETPGEDP V Y++ +VRGLQ ++ HE ++ L VS
Sbjct: 149 IFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMGSLMVS 208
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+CCKH+ AYD++ W RY FD+ VTEQD+ +T+ PF C+++G AS +MCSYN VNG
Sbjct: 209 ACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSYNAVNG 268
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P+CA+P LL + R +W L GYI +DCD++ + + K+ D+ EDA+A LKAG+D++
Sbjct: 269 VPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKY-TDTPEDAIADVLKAGMDIN 326
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSD 399
CG + T +A+ QGKV+E ++D +L L++V RLGFFDG+P ++ LG QD+C+
Sbjct: 327 CGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQDVCTA 386
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
++ LA EAAR+GIVLLKN+ LPL+ + ++ V+G AN + ++G YAG+PC MS
Sbjct: 387 QHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVPCSPMS 446
Query: 460 PIAGFSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
+ GF YA + + +GC DV C S+N A AK AD I +AGLD S E E LDR
Sbjct: 447 LVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETEDLDRV 506
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
L LPG Q L++ VA V+K P+ILV++ G +DI+FA+ ++ + +ILW G PGE GG+A
Sbjct: 507 SLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGEAGGKA 566
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+A+V+FG +NPGGRLP+TWY + +P+ M +RP S GYPGRTY+FY G +Y FG
Sbjct: 567 LAEVIFGDYNPGGRLPVTWYPQSFTN-VPMNDMHMRPNPSRGYPGRTYRFYTGDRIYGFG 625
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY--FE 696
GLSYT FKY LLS K + + L K + R + V ++ D FE
Sbjct: 626 EGLSYTSFKYRLLSAPKKVNL-LGKAETSRRRIIPQVRDGVNMSYMEVEEVESCDLLRFE 684
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
K+ N+G DGS VV+++S+ P + T +Q+IGF R++V+ ++ + + C
Sbjct: 685 VKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIMVDPCNH 744
Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
+++ D ++P G+HTI +G+
Sbjct: 745 VSLADEYGKRVIPLGDHTISLGD 767
>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
Length = 777
Score = 691 bits (1782), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/739 (47%), Positives = 482/739 (65%), Gaps = 19/739 (2%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
SS FCD LP R DLVS++TL+EK+ QLGD + V RLG+P Y+WWSEALHGV+N
Sbjct: 40 SSAAFCDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGVPAYKWWSEALHGVANA 99
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPN 168
G G H D + ATSFP VILT ASFN LW +IGQ + TEAR +YN G+A GLT+W+PN
Sbjct: 100 GRGVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARGVYNNGQAEGLTFWAPN 159
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
INV RDPRWGR ETPGEDP + G+YA +VRG+Q G+ + +NS L+ S+CCKH+
Sbjct: 160 INVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GYGMSGAINSSDLEASACCKHF 216
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AYD++NWKGV R+ FDA+VTEQD+ +T+ PF+ CV++G AS +MCSYNRVNG+P+CAD
Sbjct: 217 TAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSYNRVNGVPTCAD 276
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
LL++T RG+W +GYI +DCD++ ++ D + A + EDAVA LKAG+D++CG Y
Sbjct: 277 HNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGY-AKAPEDAVADVLKAGMDVNCGGYIQ 335
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
+A QQGK+ DID++L+ L+ + MRLG FDG+P+ Y ++G +CS E+ +LA
Sbjct: 336 THGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFDGNPKYNRYGNIGADQVCSKEHQDLA 395
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
+AAR+GIVLLKND LPL+ +KV ++AV+GP+ N ++GNY G PC ++P+
Sbjct: 396 LQAARDGIVLLKNDGAALPLSKSKVSSLAVIGPNGNNASLLLGNYFGPPCISVTPLQALQ 455
Query: 466 GYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
GY + + GC+ C +N I A AA +AD ++ GLD + E E +DR +L LPG
Sbjct: 456 GYVKDARFVQGCNAAVCNVSN-IGEAVHAAGSADYVVLFMGLDQNQEREEVDRLELGLPG 514
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q L+N VA+ AK PVILV++ G VD+ FA+ N I AI+WAGYPG+ GG AIA V+F
Sbjct: 515 MQESLVNSVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQAGGIAIAQVLF 574
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NPGGRLP+TWY ++ +P+T M +R S GYPGRTY+FY G T+Y FGYGLSY+
Sbjct: 575 GDHNPGGRLPVTWYPKEFT-AVPMTDMRMRADPSTGYPGRTYRFYKGKTVYNFGYGLSYS 633
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL---RCDDY-FEFKVD 700
++ + S T +++ ++ L T+ AS V ++ CD F V
Sbjct: 634 KYSHRFAS-KGTKPPSMSGIE---GLKATARASAAGTVSYDVEEMGAEACDRLRFPAVVR 689
Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
QN G DG +V+++ + P Q+IGFQ V +RA ++F + CK L+
Sbjct: 690 VQNHGPMDGGHLVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEVSPCKHLSRA 749
Query: 761 DYAANTLLPAGEHTIFVGN 779
++ G H + VG+
Sbjct: 750 AEDGRKVIDQGSHFVRVGD 768
>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 796
Score = 688 bits (1776), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/797 (45%), Positives = 494/797 (61%), Gaps = 44/797 (5%)
Query: 13 LSIALLVFSTNAVD---ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
L++ LVF T+A+ N S P F C P FS S+ FC+ SL R LV
Sbjct: 5 LTLISLVFFTSAIAETFKNLDSHPQFPCKPPHFS-------SYPFCNVSLSIKQRAISLV 57
Query: 70 SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVI 129
S +TL EK+ QL A VPRLG+P YEWWSE+LHG+++ GPG F+ I ATSFP VI
Sbjct: 58 SLLTLPEKIGQLSTTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVI 117
Query: 130 LTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF 189
++ ASFN +LW +IG AV+ EARAMYN G+AGLT+W+PNIN+ RDPRWGR ETPGEDP
Sbjct: 118 VSAASFNRTLWYEIGSAVAVEARAMYNGGQAGLTFWAPNINLFRDPRWGRGQETPGEDPK 177
Query: 190 VVGRYAVNYVRGLQDVE---------GHENATDLNSR------PLKVSSCCKHYAAYDVD 234
VV Y V +VRG Q+ + G +N D L +S+CCKH+ AYD++
Sbjct: 178 VVSEYGVEFVRGFQEKKKRKVLKTRFGSDNVDDDARYDDDADGKLMLSACCKHFTAYDLE 237
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
W RY F+A VTEQDME+T+ PFE C+K+G AS +MCSYN VNG+P+CA LL Q
Sbjct: 238 KWGNFTRYDFNAVVTEQDMEDTYQPPFETCIKDGKASCLMCSYNAVNGVPACAQGDLL-Q 296
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
R EW GYI +DCD++ + + + S E+AVA +KAG+D++CG Y T +A
Sbjct: 297 KARVEWGFDGYITSDCDAVATIFEYQGY-TKSPEEAVADAIKAGVDINCGTYMLRNTQSA 355
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAARE 411
++QGKV E +D++L L+ V +RLG FDG P+ Y LG DICS ++ +LA EAAR+
Sbjct: 356 IEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRGGHYGKLGSNDICSSDHRKLALEAARQ 415
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT 471
GIVLLKND LPLN V ++A+VGP AN M G Y G PC+ + Y T
Sbjct: 416 GIVLLKNDYKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKT 475
Query: 472 -YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
Y +GC DV+C S+ A AK AD I++AGLDLS E E DR L LPG Q L+
Sbjct: 476 SYASGCSDVSCVSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRFSLSLPGKQKDLV 535
Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
+ VA V+K PVILV+ G VD+ FA+T+ I +I+W GYPGE GG+A+A+++FG FNPG
Sbjct: 536 SSVAAVSKKPVILVLTGGGPVDVTFAKTDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 595
Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
GRLPITWY + +P++ M +R S GYPGRTY+FY GP +Y FG GLSYT+F Y +
Sbjct: 596 GRLPITWYPESFAD-VPMSDMHMRADSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFDYKI 654
Query: 651 LSFTKTIQVNLNKL-----QHCRNL--NYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQ 702
+S I+++L++L H + L + + V+VN C+ F +V+ +
Sbjct: 655 IS--APIRLSLSELLPQQSSHKKQLLQHGEEQLQYIQLDDVMVNS--CESLRFNVRVNVR 710
Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
N G DGS V++++SK ++ KQ+IGF RV +R+ FV + CK L++ +
Sbjct: 711 NTGEIDGSHVLMLFSKMARVLSGVPEKQLIGFDRVHIRSNEMMETVFVIDPCKYLSVAND 770
Query: 763 AANTLLPAGEHTIFVGN 779
++P G H +F+G+
Sbjct: 771 VGKRVIPLGIHALFLGD 787
>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
Length = 767
Score = 688 bits (1775), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/760 (46%), Positives = 479/760 (63%), Gaps = 46/760 (6%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
S P F CD + FC SLP RV+DL+ R+TL EK++ L + A VPR
Sbjct: 27 SRPAFACDGA--------TRNLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPR 78
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+ YEWWSEALHGVSN PG F PGATSFP VI T ASFN SLW+ IG+ VS E
Sbjct: 79 LGIKGYEWWSEALHGVSNADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDE 138
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
ARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGEDP + G+YA +YVRGLQ G++
Sbjct: 139 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQGNSGNQ- 197
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
LKV++CCKHY AYD+DNW VDRY F+ARV++QD+ +T+ PF+ CV EG
Sbjct: 198 --------LKVAACCKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDVPFKACVVEGK- 248
Query: 271 SSVMCSYNRVNGIPSCADPKLLN----QTVRGEWD--LHGYIVADCDSIQVMVDNHKFLA 324
V C++ I A+P +L Q W LH + + C H L
Sbjct: 249 YQVYCAHT----IKLMANPLVLTLISPQHHPWSWHSWLHCFRLYRCWGFIC----HSTLH 300
Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
+ EDA A T+KAGLDL+CG + T AV+QGK+ E D++ +L +V MRLG FDG
Sbjct: 301 STPEDAAAATIKAGLDLECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMRLGMFDG 360
Query: 385 SPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
P Y +LG +D+C+ + +LA EAAR+GIVLL+N +LPL++ +TVAV+GP+++
Sbjct: 361 EPSSQPYGNLGPRDVCTPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAVIGPNSD 420
Query: 442 ATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASE-AAKTADAT 500
TV M+GNYAG+ C + +P+ G Y +++GCD VAC SNN +F +E AA+ ADAT
Sbjct: 421 VTVTMLGNYAGVACGFTTPLQGIERYVRTIHQSGCDSVAC-SNNQLFGVAETAARQADAT 479
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
+++ GLD S+E E DR L LPG Q +L+++VA ++GPV+LV+MS G +D++FA+ +
Sbjct: 480 VLVMGLDQSIETEFKDRVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVSFAKNDP 539
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
I AILW GYPG+ GG AIADV+FG+ NPGGRLP+TWY DY+ P+T+M +R S G
Sbjct: 540 RIGAILWVGYPGQAGGTAIADVLFGRTNPGGRLPMTWYPQDYLAKAPMTNMAMRANPSSG 599
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
YPGRTY+FY GP ++PFG+G+SYT F + L T+ V L L +N S T
Sbjct: 600 YPGRTYRFYKGPVVFPFGHGMSYTTFAHELAHAPTTVSVPLTSLYGLQN-------STTF 652
Query: 681 CPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
G+ V CD +D +N G DG+ V+V+S PP KQ+IGF++V V
Sbjct: 653 NNGIRVTHTNCDTLILGIHIDVKNTGDMDGTHTVLVFSTPPVGKWGAN-KQLIGFKKVHV 711
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
A +R+K + C L++VD +P GEH++ +G+
Sbjct: 712 VARGRQRVKIHVHVCNQLSVVDQFGIRRIPIGEHSLHIGD 751
>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
Length = 745
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/755 (45%), Positives = 480/755 (63%), Gaps = 46/755 (6%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
S+ P F CD S +F FC ++LP S R DLVSR+TL+EK+ QL + A +P
Sbjct: 23 STQPPFSCDSSNPS-----TKTFPFCKTTLPISQRANDLVSRLTLEEKISQLVNSAQPIP 77
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WWSEALHGV+ GPG F+ I ATSFP VIL+ ASF+ + W +I QA+
Sbjct: 78 RLGIPGYQWWSEALHGVAYAGPGIRFNGTIKRATSFPQVILSAASFDANQWYRISQAIGK 137
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARA+YN G+A G+T+W+PNIN+ RDPRWGR ETPGEDP + G+YAV+YVRGLQ G
Sbjct: 138 EARALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLMTGKYAVSYVRGLQ---GD 194
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
PL+ S+CCKH+ AYD++NW G RY FDA VT QD+ +T+ PF+ CV+EG
Sbjct: 195 SFKGGEIKGPLQASACCKHFTAYDLENWNGTSRYVFDAYVTAQDLADTYQPPFKSCVEEG 254
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS +MC+YNRVNGIP+CAD L++T R +W GYI +DCD++ ++ D + A + E
Sbjct: 255 RASGIMCAYNRVNGIPNCADSNFLSRTARAQWGFDGYIASDCDAVSIIHDAQGY-AKTPE 313
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-- 386
DAV LKAG+D++CG Y T AV Q K+ ++ID++L L++V MRLG F+G+P
Sbjct: 314 DAVVAVLKAGMDVNCGSYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMRLGLFNGNPTG 373
Query: 387 -QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Q+ ++G +CS EN LA +AAR GIVLLKN LPL+ +K ++AV+GP+AN+
Sbjct: 374 QQFGNIGPDQVCSQENQILALDAARNGIVLLKNSAGLLPLSKSKTMSLAVIGPNANSVQT 433
Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVTYK-TGCDDVACKSNNSIFAASEAAKTADATIILA 504
++GNYAG PC+ ++P+ Y T GCD V C S+ SI A AK AD +++
Sbjct: 434 LLGNYAGPPCKLVTPLQALQSYIKHTIPYPGCDSVQC-SSASIVGAVNVAKGADHVVLIM 492
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GLD + E E LDR DL LPG Q +LI VA+ AK PV+LV++S G VDI+FA+ + NI +
Sbjct: 493 GLDDTQEKEGLDRRDLVLPGKQQELIISVAKAAKNPVVLVLLSGGPVDISFAKNDKNIGS 552
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPGE G A+A+++FG NPGG+LP+TWY ++V+ +P+T M +RP S GYPGR
Sbjct: 553 ILWAGYPGEAGAIALAEIIFGDHNPGGKLPMTWYPQEFVK-VPMTDMRMRPETSSGYPGR 611
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+FY GPT++ FGYGLSY+++ Y L + I + + C N+
Sbjct: 612 TYRFYKGPTVFEFGYGLSYSKYTYEL----RAIYIG---EEQCENIK------------- 651
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
F+ V +N G G V+++++ IK+++GFQ V + AG
Sbjct: 652 ----------FKVTVSVKNEGQMAGKHPVLLFARHAKPGKGRPIKKLVGFQTVKLGAGEK 701
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
I++ + C+ L+ + ++ G + VG+
Sbjct: 702 TEIEYELSPCEHLSSANEDGVMVMEEGSQILLVGD 736
>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 776
Score = 685 bits (1768), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/791 (45%), Positives = 498/791 (62%), Gaps = 44/791 (5%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
+SS F I+L + T +V A P F CD S S+ FC+ LP + R
Sbjct: 3 LSSTFTFVTIISLFLTLTYSVLAQ---LPPFACDYSNPST-----RSYPFCNPKLPITQR 54
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
KDLVSR+TLDEK+ QL + A +PRLG+P YEWWSEALHG+ NVG G F+ I ATS
Sbjct: 55 TKDLVSRLTLDEKLAQLVNSAPPIPRLGIPAYEWWSEALHGIGNVGRGIFFNGSITSATS 114
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITET 183
FP VILT ASF+ LW +IGQA+ EARA+YN G+A G+T+W+PNIN+ RDPRWGR ET
Sbjct: 115 FPQVILTAASFDSHLWYRIGQAIGVEARAIYNGGQAMGMTFWAPNINIFRDPRWGRGQET 174
Query: 184 PGEDPFVVGRYAVNYVRGLQ-------DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
GEDP + YAV+YVRGLQ + GH L+ S+CCKH+ AYD+DNW
Sbjct: 175 AGEDPMMTSNYAVSYVRGLQGDSFQGGKLRGH----------LQASACCKHFTAYDLDNW 224
Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
KGV+R+HFDARV+ QD+ +T+ PF C+++G AS +MC+YNRVNGIPSCAD LL TV
Sbjct: 225 KGVNRFHFDARVSLQDLADTYQPPFRSCIEQGRASGIMCAYNRVNGIPSCADFNLLTNTV 284
Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
R +W+ HGYIV+DC ++ ++ D + A S EDAVA L AG+DL+CG Y T+ +AVQ
Sbjct: 285 RKQWEFHGYIVSDCGAVGIIHDEQGY-AKSAEDAVADVLHAGMDLECGSYLTDHAKSAVQ 343
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS---LGKQDICSDENIELAAEAAREGI 413
Q K+ ID++L L+++ +RLG FDG+P + +G +CS+ ++ LA EAAR GI
Sbjct: 344 QKKLPIVRIDRALHNLFSIRIRLGQFDGNPAKLPFGMIGPNHVCSENHLYLALEAARNGI 403
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANAT-VAMIGNYAGIPCRYMSPIAGFSGYA-NVT 471
VLLKN + LPL + ++AV+GP+ANA+ + ++GNYAG PC+ ++ + GF Y N
Sbjct: 404 VLLKNTASLLPLPKTSI-SLAVIGPNANASPLTLLGNYAGPPCKSITILQGFQHYVKNAV 462
Query: 472 YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLIN 531
+ GCD ++ I A + AK AD +++ GLD SVE E DR L LPG Q +LIN
Sbjct: 463 FHPGCDGGPKCASAPIDKAVKVAKNADYVVLVMGLDQSVEREERDRVHLDLPGKQLELIN 522
Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
VA+ +K PVILV++ G +DI+ A+ N I I+WAGYPGE GG A+A ++FG NPGG
Sbjct: 523 SVAKASKRPVILVLLCGGPIDISSAKNNDKIGGIIWAGYPGELGGIALAQIIFGDHNPGG 582
Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
RLPITWY DY++ +P+T M +R + GYPGRTY+FY GPT+Y FG+GLSYT++ Y +
Sbjct: 583 RLPITWYPKDYIK-VPMTDMRMRADPTTGYPGRTYRFYKGPTVYEFGHGLSYTKYSYEFV 641
Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL---RCDDY-FEFKVDFQNVGST 707
S T +KL ++ + + LV++L C V +N G+
Sbjct: 642 SVTH------DKLHFNQSSTHLMTENSETIRYKLVSELDEETCKSMSVSVTVGVKNHGNI 695
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
G ++++ +P + +KQ++GF + + AG + F + C+ L+ + A +
Sbjct: 696 VGRHPILLFMRPQKHRTRSPMKQLVGFHSLLLDAGEMSHVGFELSPCEHLSRANEAGLKI 755
Query: 768 LPAGEHTIFVG 778
+ G H + VG
Sbjct: 756 IEEGSHLLHVG 766
>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
Length = 768
Score = 684 bits (1766), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/783 (44%), Positives = 483/783 (61%), Gaps = 43/783 (5%)
Query: 8 LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
+C L + L +FS + S+ P F C P S + FC++SLP S R +
Sbjct: 9 FICLFLQV-LPLFSISE-----STHPQFPCMPP-------TNSDYPFCNTSLPISTRAQS 55
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
LVS +TL EK+QQL D A +PRL +P YEWWSE+LHG++ GPG F+ + ATSFP
Sbjct: 56 LVSLLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQ 115
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
V+LT ASFN SLW IG A++ EARAMYN+G+AGLT+W+PNIN+ RDPRWGR ETPGED
Sbjct: 116 VLLTAASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGED 175
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P V YAV +VRG Q D + L +S+CCKH AYD++ W RY FDA
Sbjct: 176 PMVASAYAVEFVRGFQG--------DSDGDGLMLSACCKHLTAYDLEKWGNFSRYSFDAV 227
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V+ QD+E+T+ PF CV++G AS +MCSYNRVNG+P+CA L Q + EW GYI
Sbjct: 228 VSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLF-QKAKTEWGFKGYIT 286
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
+DCD++ + + ++ A+S EDAVA LKAG D++CG Y T +A+ QGKVKE DID+
Sbjct: 287 SDCDAVATVYE-YQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDIDR 345
Query: 368 SLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
+L L++V MRLG FDG P Y +LG +D+C+ E+ LA EAAR+GIVLLKND+ LP
Sbjct: 346 ALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKFLP 405
Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKS 483
L+ +++ ++A++GP A+ + G Y GIPC+ S + G Y T + GC DV C S
Sbjct: 406 LDKSRISSLAIIGPQADQPF-LGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPCLS 464
Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
+ A A+ AD +++AGLDLS E E DR L LPG Q LI+ VA + P++L
Sbjct: 465 DTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQKPLVL 524
Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
V+ G +D++FAE + I +ILW GYPGE G +A+A+++FG FNPGGRLP+TWY +
Sbjct: 525 VLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 584
Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
+ +P+ M +R GYPGRTY+FY G +Y FG GLSYT+F Y +S NK
Sbjct: 585 R-VPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQFVSAP-------NK 636
Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLR------CDDY-FEFKVDFQNVGSTDGSDVVIVY 716
L R+ + S + R VN CD F ++ NVG DGS VV+++
Sbjct: 637 LNLLRSSDTVSSKNLPRQRREEVNYFHIEELDTCDSLRFHVEISVTNVGDMDGSHVVMLF 696
Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
S+ P + T KQ+IGF RV + R+ + + C+ +I + ++P G+HTI
Sbjct: 697 SRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDHTIM 756
Query: 777 VGN 779
+G+
Sbjct: 757 LGD 759
>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
Length = 774
Score = 684 bits (1764), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/744 (47%), Positives = 481/744 (64%), Gaps = 28/744 (3%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
SS FC+ LP R DLVSR+TL+EK+ QLGD + V RLG+P Y+WWSEALHGVSN
Sbjct: 36 SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWWSEALHGVSNA 95
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPN 168
G G H D + ATSFP VILT ASFN LW +IGQ + TEARA+YN G+A GLT+W+PN
Sbjct: 96 GRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQAEGLTFWAPN 155
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
INV RDPRWGR ETPGEDP V G+YA +VRG+Q G+ A +NS L+ S+CCKH+
Sbjct: 156 INVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQ---GYALAGAINSTDLEASACCKHF 212
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AYD++NWKGV RY FDA+VT QD+ +T+ PF CV++G AS +MCSYNRVNG+P+CAD
Sbjct: 213 TAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCAD 272
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
LL++T RG+W +GYI +DCD++ ++ D + A + EDAVA LKAG+D++CG Y
Sbjct: 273 YNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGY-AKTAEDAVADVLKAGMDVNCGSYVQ 331
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV---SLGKQDICSDENIELA 405
+A+QQGK+ E DI+++L L+ V MRLG F+G+P+Y ++G +C+ E+ LA
Sbjct: 332 EHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQEHQNLA 391
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
EAA+ G+VLLKND N LPL+ ++V ++AV+G +AN ++GNY G PC ++P+
Sbjct: 392 LEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISVTPLQVLQ 451
Query: 466 GYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
GY T + GC+ AC + SI A++ A + D ++ GLD E E +DR +L LPG
Sbjct: 452 GYVKDTRFLAGCNSAACNVS-SIGEAAQLASSVDYVVLFMGLDQDQEREEVDRLELSLPG 510
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LIN VA AK PVILV++ G VD+ FA+ N I AILWAGYPGE GG AIA V+F
Sbjct: 511 MQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIAIAQVLF 570
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G+ NPGGRLP+TWY ++ +P+T M +R S GYPGRTY+FY G T+Y FGYGLSY+
Sbjct: 571 GEHNPGGRLPVTWYPKEFTS-VPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGYGLSYS 629
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR------CDDY-FEF 697
++ ++ ++ N KL +++ A T G + D+ CD F
Sbjct: 630 KYSHHFVA-------NGTKLPSLSSIDGLK-AMATAAAGTVSYDVEEIGTETCDKLKFPA 681
Query: 698 KVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
V QN G DG V+++ + P A Q+IGFQ + +++ + ++F + CK
Sbjct: 682 LVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCK 741
Query: 756 SLNIVDYAANTLLPAGEHTIFVGN 779
+ ++ G H + VG+
Sbjct: 742 HFSRATEDGKKVIDHGSHFMMVGD 765
>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
Length = 780
Score = 684 bits (1764), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/744 (47%), Positives = 481/744 (64%), Gaps = 28/744 (3%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
SS FC+ LP R DLVSR+TL+EK+ QLGD + V RLG+P Y+WWSEALHGVSN
Sbjct: 42 SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWWSEALHGVSNA 101
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPN 168
G G H D + ATSFP VILT ASFN LW +IGQ + TEARA+YN G+A GLT+W+PN
Sbjct: 102 GRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQAEGLTFWAPN 161
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
INV RDPRWGR ETPGEDP V G+YA +VRG+Q G+ A +NS L+ S+CCKH+
Sbjct: 162 INVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQ---GYALAGAINSTDLEASACCKHF 218
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AYD++NWKGV RY FDA+VT QD+ +T+ PF CV++G AS +MCSYNRVNG+P+CAD
Sbjct: 219 TAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCAD 278
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
LL++T RG+W +GYI +DCD++ ++ D + A + EDAVA LKAG+D++CG Y
Sbjct: 279 YNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGY-AKTAEDAVADVLKAGMDVNCGSYVQ 337
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY---VSLGKQDICSDENIELA 405
+A+QQGK+ E DI+++L L+ V MRLG F+G+P+Y ++G +C+ E+ LA
Sbjct: 338 EHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQEHQNLA 397
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
EAA+ G+VLLKND N LPL+ ++V ++AV+G +AN ++GNY G PC ++P+
Sbjct: 398 LEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISVTPLQVLQ 457
Query: 466 GYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
GY T + GC+ AC + SI A++ A + D ++ GLD E E +DR +L LPG
Sbjct: 458 GYVKDTRFLAGCNSAACNVS-SIGEAAQLASSVDYVVLFMGLDQDQEREEVDRLELSLPG 516
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LIN VA AK PVILV++ G VD+ FA+ N I AILWAGYPGE GG AIA V+F
Sbjct: 517 MQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIAIAQVLF 576
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G+ NPGGRLP+TWY ++ +P+T M +R S GYPGRTY+FY G T+Y FGYGLSY+
Sbjct: 577 GEHNPGGRLPVTWYPKEFTS-VPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGYGLSYS 635
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR------CDDY-FEF 697
++ ++ ++ N KL +++ A T G + D+ CD F
Sbjct: 636 KYSHHFVA-------NGTKLPSLSSIDGLK-AMATAAAGTVSYDVEEIGPETCDKLKFPA 687
Query: 698 KVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
V QN G DG V+++ + P A Q+IGFQ + +++ + ++F + CK
Sbjct: 688 LVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCK 747
Query: 756 SLNIVDYAANTLLPAGEHTIFVGN 779
+ ++ G H + VG+
Sbjct: 748 HFSRATEDGKKVIDHGSHFMMVGD 771
>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 726
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/743 (47%), Positives = 491/743 (66%), Gaps = 39/743 (5%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
FCD+SL IRV DLVSR+TL+EKV QL + A +PRL +P YEWW E LHGV++V
Sbjct: 3 FCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAHVS--- 59
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
F +P ATSFP ILTTASFN+ LW +IGQA STEARA YN G AGLTYWSP IN+AR
Sbjct: 60 -FGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIAGLTYWSPVINIAR 118
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGRI ET GEDP+ YA ++V+G+Q EG D NS+ LK+S+CCKH+ AYDV
Sbjct: 119 DPRWGRIQETSGEDPYTTSAYATHFVQGMQ--EG-----DANSKRLKLSACCKHFTAYDV 171
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
DNW+G+DRYHFDA+ ++ +T+ PF+ CV+EG ++S+MCSYN+VNG+P+CA+ L
Sbjct: 172 DNWEGIDRYHFDAKA---NLADTYNPPFQSCVQEGRSASLMCSYNKVNGVPTCANYDFLE 228
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
TVR W L+GYIV+DCDS+ VM ++ + A + EDA A L AGLDL+CG Y ++T
Sbjct: 229 NTVRRAWGLNGYIVSDCDSVLVMHESTNY-APTTEDAAADALNAGLDLNCGDYLASYTEG 287
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAR 410
AV GKV + +D ++ ++ V MRLG FDG+P ++ ++G D+C+ + ELA EAAR
Sbjct: 288 AVAMGKVNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGVADVCTPAHQELAVEAAR 347
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SG 466
+GIVLLKND N LPL +K AV+GP+ANAT M+GNY GIPC+Y++P+ G SG
Sbjct: 348 QGIVLLKNDGNILPL--SKNINTAVIGPNANATHTMLGNYEGIPCQYITPLQGLVKFGSG 405
Query: 467 -YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
Y V + GC + AC+ ++ I +A A ADA +++ GL E+E+LDR L LPGY
Sbjct: 406 DYHKVWFSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQVQESEALDRTSLLLPGY 465
Query: 526 QTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LI++VA A G PV+LV+M AG VDI FA+ + I++ILW GYPG+ GG+AIA+V+F
Sbjct: 466 QQTLIDEVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILWVGYPGQSGGQAIAEVIF 525
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NPGG+LP++WY DY + + +T+M +RP YPGRTY+FY G +Y FGYGLSYT
Sbjct: 526 GAHNPGGKLPMSWYPEDYTK-ISMTNMNMRPDSRSNYPGRTYRFYTGEKIYDFGYGLSYT 584
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
++K++ T+ Q C + + TS SKT C F+ ++ +N+
Sbjct: 585 EYKHSFALAPTTVMTPSIHSQLC-DPHQTSAGSKT-C---------SSSNFDVHINVENI 633
Query: 705 GSTDGSDVVIV-YSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G+ G+ +++ ++ P A T +KQ+ F V++R+G +++ N C+ L V
Sbjct: 634 GAMAGNHTLLLFFTAPSAGKNGTPLKQLAAFDSVYIRSGSQEKVVLTLNPCQHLGTVAED 693
Query: 764 ANTLLPAGEHTIFVGNGGVSFPI 786
+L AG H + VG+ S +
Sbjct: 694 GTRMLEAGNHILSVGDAKHSLSV 716
>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
Length = 774
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/785 (45%), Positives = 501/785 (63%), Gaps = 44/785 (5%)
Query: 13 LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
L I L+ + V + SP F CD S S+ FC ++LP RV+DLVSR+
Sbjct: 7 LLINLIYVTVILVGVESTQSPPFSCDSSNPS-----TKSYHFCKTTLPIPDRVRDLVSRL 61
Query: 73 TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
TLDEK+ QL + A +PRLG+P YEWWSEALHGV++ GPG F+ I ATSFP VILT
Sbjct: 62 TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQVILTA 121
Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVV 191
ASF+ LW +IG+A+ EARA+YN G+ G+T+W+PNIN+ RDPRWGR ETPGEDP V
Sbjct: 122 ASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 181
Query: 192 GRYAVNYVRGLQD--VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
G YAV+YVRG+Q + G + +L + S+CCKH+ AYD+D+WKG+DR+ FDARVT
Sbjct: 182 GSYAVSYVRGVQGDCLRGLKRCGEL-----QASACCKHFTAYDLDDWKGIDRFKFDARVT 236
Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
QD+ +T+ PF C++EG AS +MC+YNRVNG+PSCAD LL T R W+ GYI +D
Sbjct: 237 MQDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSD 296
Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSL 369
CD++ ++ D++ F A + EDAV LKAG+D++CG Y N T +AV Q K+ E+++D++L
Sbjct: 297 CDAVSLIHDSYGF-AKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRAL 355
Query: 370 KYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN 426
+ L+ V MRLG F+G+P+ Y +G +CS E+ LA +AAR+GIVLLKN Q LPL
Sbjct: 356 ENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLP 415
Query: 427 SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNN 485
K ++AV+GP+AN+ +IGNYAG PC++++P+ Y T Y GCD VAC S+
Sbjct: 416 KGKTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVAC-SSP 474
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
SI A E A+ AD +++ GLD + E E+ DR DL LPG Q QLI VA AK PV+LV+
Sbjct: 475 SIEKAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVL 534
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
+S G VDI+FA+ + NI +ILWAGYPG GG AIA+ +FG NPGGRLP+TWY D+ +
Sbjct: 535 LSGGPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDFTK- 593
Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL- 664
+P+T M +RP + GYPGRTY+FY G ++ FGYGLSY+ + +TI V NKL
Sbjct: 594 IPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYSTYS------CETIPVTRNKLY 647
Query: 665 ----------QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
++ ++ YTS A L +L + + +N G G V+
Sbjct: 648 FNQSSTAHVYENTDSIRYTSVAE-------LGKELCDSNNISISIRVRNDGEMAGKHSVL 700
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
++ + A + IKQ++ FQ V + G + + F+ N C+ + + ++ G H
Sbjct: 701 LFVRRLKASAGSPIKQLVAFQSVHLNGGESADVGFLLNPCEHFSGPNKDGLMVIEEGTHF 760
Query: 775 IFVGN 779
+ VG+
Sbjct: 761 LVVGD 765
>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 772
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/769 (46%), Positives = 490/769 (63%), Gaps = 34/769 (4%)
Query: 22 TNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL 81
T V ++ +P F CD FS + S+ FC+ LP R KDL+SR+TLDEK+ QL
Sbjct: 14 TVTVQSSKPEAP-FACD---FSNPSSR--SYPFCNPKLPIPQRTKDLLSRLTLDEKLSQL 67
Query: 82 GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDD--VIPGATSFPTVILTTASFNESL 139
+ A +PRLG+P Y+WWSEALHGVS VGPG FD+ I ATSFP VILT ASF+ L
Sbjct: 68 VNTAPPIPRLGIPAYQWWSEALHGVSGVGPGILFDNNSTISSATSFPQVILTAASFDSRL 127
Query: 140 WKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNY 198
W +IG A+ EARA++N G+A GLT+W+PNIN+ RDPRWGR ET GEDP + RYAV++
Sbjct: 128 WYRIGHAIGIEARAIFNAGQANGLTFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSF 187
Query: 199 VRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFL 258
VRGLQ L S+CCKH+ AYD+DNWKGVDR+ FDARV+ QD+ +T+
Sbjct: 188 VRGLQ-------GDSFKGAHLLASACCKHFTAYDLDNWKGVDRFVFDARVSLQDLADTYQ 240
Query: 259 RPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD 318
PF+ CV++G AS +MC+YNRVNG+P+CAD LL QT R +WD +GYI +DC ++ + D
Sbjct: 241 PPFQSCVQQGRASGIMCAYNRVNGVPNCADYGLLTQTARNQWDFNGYITSDCGAVGFIHD 300
Query: 319 NHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMR 378
++ A S ED VA L+AG+DL+CG Y T +AV Q K+ ++ID++L+ L+++ MR
Sbjct: 301 RQRY-AKSPEDVVADVLRAGMDLECGSYLTYHAKSAVLQKKLGMSEIDRALQNLFSIRMR 359
Query: 379 LGFFDGSPQYVS---LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL-NSAKVKTVA 434
LG FDG+P +S +G +CS E+ LA EAAR GIVLLKN LPL ++ ++A
Sbjct: 360 LGLFDGNPTRLSFGLIGSNHVCSKEHQYLALEAARNGIVLLKNSPTLLPLPKTSPSISLA 419
Query: 435 VVGPHANAT-VAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASE 492
V+GP+AN++ + ++GNYAG PC+Y++ + GF Y N Y GCD S+ I A E
Sbjct: 420 VIGPNANSSPLTLLGNYAGPPCKYVTILQGFRHYVKNAFYHPGCDGGPKCSSAQIDQAVE 479
Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
AK D +++ GLD S E E DR L LPG Q +LIN VAE +K PVILV++S G +D
Sbjct: 480 VAKKVDYVVLVMGLDQSEEREERDRVHLDLPGKQLELINGVAEASKKPVILVLLSGGPLD 539
Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMP 612
I A+ N I ILWAGYPGE GG A+A ++FG NPGGRLP TWY DY++ +P+T M
Sbjct: 540 ITSAKYNHKIGGILWAGYPGELGGIALAQIIFGDHNPGGRLPTTWYPKDYIK-VPMTDMR 598
Query: 613 LRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
+R S GYPGRTY+FY GP +Y FGYGLSY+++ Y +S T +KL ++ +
Sbjct: 599 MRADPSTGYPGRTYRFYKGPKVYEFGYGLSYSKYSYEFVSVTH------DKLHFNQSSTH 652
Query: 673 TSDASKTRCPGVLVNDL---RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
+ LV++L C V QN GS G V+++ +P + + + +
Sbjct: 653 LMVENSETISYKLVSELDEQTCQSMSLSVTVRVQNHGSMVGKHPVLLFIRPKRQKSGSPV 712
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
KQ++GF+ V + AG ++F + C+ L+ + A ++ G H + V
Sbjct: 713 KQLVGFESVMLDAGEMAHVEFEVSPCEHLSRANEAGAMIIEEGSHMLLV 761
>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
Length = 789
Score = 681 bits (1756), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/796 (44%), Positives = 486/796 (61%), Gaps = 48/796 (6%)
Query: 8 LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
+C L + L +FS + S+ P F C P S + FC++SLP S R +
Sbjct: 9 FICLFLQV-LPLFSISE-----STHPQFPCMPP-------TNSDYPFCNTSLPISTRAQS 55
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
LVS +TL EK+QQL D A +PRL +P YEWWSE+LHG++ GPG F+ + ATSFP
Sbjct: 56 LVSLLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQ 115
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
V+LT ASFN SLW IG A++ EARAMYN+G+AGLT+W+PNIN+ RDPRWGR ETPGED
Sbjct: 116 VLLTAASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGED 175
Query: 188 PFVVGRYAVNYVRGLQ--------DVEGHENAT-----DLNSRPLKVSSCCKHYAAYDVD 234
P V YAV +VRG Q ++ G D + L +S+CCKH AYD++
Sbjct: 176 PMVASAYAVEFVRGFQGGNWKGGDEIRGAVGKKRVLRGDSDGDGLMLSACCKHLTAYDLE 235
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
W RY FDA V+ QD+E+T+ PF CV++G AS +MCSYNRVNG+P+CA L Q
Sbjct: 236 KWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLF-Q 294
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
+ EW GYI +DCD++ + + ++ A+S EDAVA LKAG D++CG Y T +A
Sbjct: 295 KAKTEWGFKGYITSDCDAVATVYE-YQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSA 353
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAARE 411
+ QGKVKE DID++L L++V MRLG FDG P Y +LG +D+C+ E+ LA EAAR+
Sbjct: 354 IDQGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQ 413
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT 471
GIVLLKND+ LPL+ +++ ++A++GP A+ + G Y GIPC+ S + G Y T
Sbjct: 414 GIVLLKNDKKFLPLDKSRISSLAIIGPQADQPF-LGGGYTGIPCKPESLVEGLKTYVEKT 472
Query: 472 -YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
+ GC DV C S+ A A+ AD +++AGLDLS E E DR L LPG Q LI
Sbjct: 473 SFAAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALI 532
Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
+ VA + P++LV+ G +D++FAE + I +ILW GYPGE G +A+A+++FG FNPG
Sbjct: 533 SSVASAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPG 592
Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
GRLP+TWY + + +P+ M +R GYPGRTY+FY G +Y FG GLSYT+F Y
Sbjct: 593 GRLPMTWYPESFTR-VPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQF 651
Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR------CDDY-FEFKVDFQN 703
+S NKL R+ + S + R VN CD F ++ N
Sbjct: 652 VSAP-------NKLNLLRSSDTVSSKNLPRQRREEVNYFHIEELDTCDSLRFHVEISVTN 704
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
VG DGS VV+++S+ P + T KQ+IGF RV + R+ + + C+ +I +
Sbjct: 705 VGDMDGSHVVMLFSRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQ 764
Query: 764 ANTLLPAGEHTIFVGN 779
++P G+HTI +G+
Sbjct: 765 GKRIMPLGDHTIMLGD 780
>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
gi|194688848|gb|ACF78508.1| unknown [Zea mays]
gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 780
Score = 680 bits (1754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/762 (45%), Positives = 481/762 (63%), Gaps = 32/762 (4%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+S P + C G + FCD+ LP RV DLVSRMT+ EK+ QLGD + +P
Sbjct: 30 ASEPPYTCGAG-------APPNIPFCDAGLPIDRRVDDLVSRMTVAEKISQLGDQSPAIP 82
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WWSEALHG+SN G G H D + ATSFP VILT ASFN LW +IGQ +
Sbjct: 83 RLGVPAYKWWSEALHGISNQGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGV 142
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARA+YN G+A GLT+W+PNINV RDPRWGR ETPGEDP + G+YA +VRG+Q G+
Sbjct: 143 EARAVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GY 199
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
A +NS L+ S+CCKH+ AYD++NWKGV RY FDA+VT QD+ +T+ PF+ CV++G
Sbjct: 200 GLAGPVNSTGLEASACCKHFTAYDLENWKGVTRYVFDAKVTAQDLADTYNPPFKSCVEDG 259
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS +MCSYNRVNG+P+CAD LL+ T R +W +GYI +DCD++ ++ D + A + E
Sbjct: 260 HASGIMCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGY-AKTAE 318
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
DAVA LKAG+D++CG Y + +A+QQGK+ E DI+++L L+ V MRLG F+G P+
Sbjct: 319 DAVADVLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRR 378
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKND--QNTLPLNSAKVKTVAVVGPHANAT 443
Y +G +C+ E+ +LA EAA++GIVLLKND LPL+ V ++AV+G +AN
Sbjct: 379 NLYGDIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDA 438
Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
+ + GNY G PC ++P+ GY + ++ GC+ AC +I A +AA +AD+ ++
Sbjct: 439 IRLRGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVT-TIPEAVQAASSADSVVL 497
Query: 503 LAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
GLD E E +DR DL LPG Q LI VA AK PVILV++ G VD++FA+TN I
Sbjct: 498 FMGLDQDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKI 557
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
AILWAGYPGE GG AIA V+FG+ NPGGRLP+TWY D+ + +P+T M +R + GYP
Sbjct: 558 GAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTR-VPMTDMRMRADPATGYP 616
Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ--VNLNKLQHCRNLNYTSDASKTR 680
GRTY+FY GPT++ FGYGLSY+++ + + L ++ + D
Sbjct: 617 GRTYRFYRGPTVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKAVEATAGGMASYDVEA-- 674
Query: 681 CPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRV 737
+ CD F V QN G DG V+V+ + P + + Q+IGFQ +
Sbjct: 675 -----IGSETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSL 729
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+RA + ++F + CK + ++ G H + VG
Sbjct: 730 HLRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGE 771
>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
Length = 776
Score = 677 bits (1748), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/799 (46%), Positives = 483/799 (60%), Gaps = 114/799 (14%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
VCDP RF+ GL M+ F +CD+SLPY+ RV+DLV RMTL+EKV LGD A G PR+GLP+
Sbjct: 46 VCDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPR 105
Query: 96 YEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR--- 152
Y G G T+ PT ++ + +W++ + AR
Sbjct: 106 Y------------CGGGRR-------CTACPT-----SARRDVVWRRRARRHQLPARHQQ 141
Query: 153 ---------------------AMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVV 191
MYNLG A LTYWSPNINV RDPRWGR +ETPGEDPFVV
Sbjct: 142 RRVVQRDAVARHRRRGVDGDQGMYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVV 201
Query: 192 GRYAVNYVRGLQDVEGHENATDLN------SRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
GRYAVN+VRG+QD++G A SRP+KVSSCCKHYAA
Sbjct: 202 GRYAVNFVRGMQDIDGATTAASAAAATDAFSRPIKVSSCCKHYAA--------------- 246
Query: 246 ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY 305
VMCSYNR+NG+P+CAD +LL +TVR +W LHGY
Sbjct: 247 --------------------------CVMCSYNRINGVPACADARLLTETVRRDWQLHGY 280
Query: 306 IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY-------YTNFTGNAVQQG 358
IV+DCDS++VMV + K+L + +A A +KAGLDLDCG + +T + +AV+QG
Sbjct: 281 IVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQG 340
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKN 418
K+KE+ +D +L LY LMRLGFFDG P+ SLG D+C++E+ ELAA+AAR+G+VLLKN
Sbjct: 341 KLKESAVDNALTNLYLTLMRLGFFDGIPELESLGAADVCTEEHKELAADAARQGMVLLKN 400
Query: 419 DQNTLPLNSAKVKTVAVVGP--HANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGC 476
D LPL+ KV +VA+ G H NAT M+G+Y G PCR ++P G + T C
Sbjct: 401 DAALLPLSPEKVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHAC 460
Query: 477 DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
D +C + A+ AAKT DATI++AGL++SVE ES DREDL LP Q IN VAE
Sbjct: 461 DKGSCDT------AAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEA 514
Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
+ P++LVIMSAGGVD++FA+ N I A++WAGYPGEEGG AIADV+FGK+NPGGRLP+T
Sbjct: 515 SPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLT 574
Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTK 655
WY +YV +P+TSM LRP GYPGRTYKFY G LYPFG+GLSYT F Y +
Sbjct: 575 WYKNEYVSKIPMTSMALRPDAEHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAA 634
Query: 656 TIQVNLNKLQHCRNLNYTSD-ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
+ V + ++C+ L Y + +S CP V V C + F V N G DG+ VV
Sbjct: 635 PVTVKVGAWEYCKQLTYKAGVSSPPACPAVNVASHACQEEVSFAVTVANTGGRDGTHVVP 694
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
+Y+ PPAE+ KQ++ F+RV V AG + F N CK+ IV+ A T++P+G
Sbjct: 695 MYTAPPAEVDGAPRKQLVAFRRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSR 754
Query: 775 IFVGNGG--VSFPIHLNFN 791
+ VG+ +SFP+ ++
Sbjct: 755 VLVGDDALSLSFPVQIDLQ 773
>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/717 (50%), Positives = 465/717 (64%), Gaps = 53/717 (7%)
Query: 3 KVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYS 62
KVV L C LVF + V A SSPVF CD L +S FC++S+ +
Sbjct: 14 KVVFLLFCM-----FLVFLSTHVSAQ--SSPVFACDVVSNPSL----ASLGFCNTSIGIN 62
Query: 63 IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
RV DLV R+TL EK+ L + A V RLG+P+YEWWSEALHGVS VGPGTHF D + GA
Sbjct: 63 DRVVDLVKRLTLQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGA 122
Query: 123 TSFPTVILTTASFNESLWKKIG-----QAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
TSFP VILT ASFN SL++ IG Q VSTEARAMYN+G AGLT+WSPNIN+ RDPRW
Sbjct: 123 TSFPQVILTAASFNTSLFEAIGKVYYTQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRW 182
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
GR ETPGEDP + +Y YV+GLQ + D + LKV++CCKHY AYD+DNWK
Sbjct: 183 GRGQETPGEDPLLSSKYGSCYVKGLQQRD------DGDPDKLKVAACCKHYTAYDLDNWK 236
Query: 238 GVDRYHFDARV-TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
G DRYHF+A V T+QDM++TF PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+ +
Sbjct: 237 GSDRYHFNAVVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVI 296
Query: 297 RGEWDLHGY-------IVADCDSIQVMVDNHKFLADSKEDA-----VAQTLKAGLDLDCG 344
RGEW+L+GY IV DCDS+ V + + +E A +L G+DL+CG
Sbjct: 297 RGEWNLNGYQWGCCRYIVTDCDSLDVFYKSQNYTKTPEEAAAAAILAGNSLVTGVDLNCG 356
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDEN 401
+ T AV+ G V E ID ++ + LMRLGFFDG P Y LG +D+C+ EN
Sbjct: 357 SFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAEN 416
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY-AGIPCRYMSP 460
ELA EAAR+GIVLLKN +LPL+ +K +AV+GP+AN T MIGNY G PC+Y +P
Sbjct: 417 QELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGGTPCKYTTP 476
Query: 461 IAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
+ G + TY GC +VAC S + A + A ADAT+++ G DLS+EAES DR D+
Sbjct: 477 LQGLAASVATTYLPGCSNVAC-STAQVDDAKKLAAAADATVLVMGADLSIEAESRDRVDV 535
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
LPG Q LI VA V+ GPVILVIMS GG+D++FA TN I +ILW GYPGE GG AIA
Sbjct: 536 LLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGAAIA 595
Query: 581 DVVFGKFNPG----GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
D++FG +NP GRLP+TWY YV +P+T+M +RP S GYPGRTY+FY G T+Y
Sbjct: 596 DIIFGYYNPSTHQPGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYS 655
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
FG GLSY+QF + L+ + + V L + C + + C V+ ++ C +
Sbjct: 656 FGDGLSYSQFTHELIQAPQLVYVPLEESHVCHS---------SECQSVVASEQTCQN 703
>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
Length = 791
Score = 673 bits (1737), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/747 (44%), Positives = 468/747 (62%), Gaps = 28/747 (3%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ FC+ +LP S RV+ L+S +T+DEK+ L D +PRLGLP YEWWSE+LHG++ GP
Sbjct: 41 YTFCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNGP 100
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
+F+ I G TSFP VILT A+FN +LW I A++ EARAMYNLG+AGLT+W+PNIN+
Sbjct: 101 AVNFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQAGLTFWAPNINI 160
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV-----EGHENATDLNSRPLK------ 220
RDPRWGR ETPGEDP VV YA+ YV G Q + +G+ N R LK
Sbjct: 161 LRDPRWGRGQETPGEDPMVVSAYAIEYVTGFQGLNPKAKKGNRNGYGKKRRVLKEDDNDG 220
Query: 221 ----VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
+S+CCKH+ AYD++ W RY F+A VT+QDME+TF PF C+++G AS +MCS
Sbjct: 221 ERLMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKASCLMCS 280
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
YN VNG+P+CAD +LL++ VR +W GYI +DCD++ + +N K+ + EDAVA LK
Sbjct: 281 YNSVNGVPACADKELLDK-VRTDWGFDGYITSDCDAVATIYENQKY-TKTPEDAVAVALK 338
Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGK 393
AG +++CG Y +A QQG V E D+D++L+YL++V RLG FDG+P Q+ + G
Sbjct: 339 AGTNINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQFANFGA 398
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
QD+C+ ++ LA +AAR+GIVLLKNDQ LPL+ V T+A+VGP AN + + G Y+G+
Sbjct: 399 QDVCTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMANVS-SPGGTYSGV 457
Query: 454 PCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
PC+ S GF + N T Y GC DV C S A K AD I++AG DLS E
Sbjct: 458 PCKLKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGSDLSEET 517
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E DR L LPG QT L+ +A +K P+ILV+ G VD++FAE + I +ILW YPG
Sbjct: 518 EDHDRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASILWVAYPG 577
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
E GG+A+++++FG NPGG+LP+TWY + + +P+T M +R S GYPGRTY+FY G
Sbjct: 578 ETGGKALSEIIFGYQNPGGKLPMTWYLESFTK-VPMTDMNMRADPSNGYPGRTYRFYTGD 636
Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC- 691
LY FG+GLSYT F LLS + ++L K R++ ++R + V+++
Sbjct: 637 VLYGFGHGLSYTSFSSQLLSAPSRLSLSLAKSNRKRSI---LAKGRSRLGYIHVDEVESC 693
Query: 692 -DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
F + N G DGS V++++S+ KQ++GF RV V A + +
Sbjct: 694 HSSKFFVHISVTNDGDMDGSHVLMLFSRVLQNFQGAPQKQLVGFDRVHVPARKYVETSLL 753
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFV 777
+ C+ + + N +L GEHT +
Sbjct: 754 VDPCELFSFANDQGNRILALGEHTFIL 780
>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 782
Score = 671 bits (1732), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/739 (46%), Positives = 472/739 (63%), Gaps = 34/739 (4%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
FCD++LP R DLV+R+T EKV QLGD A GVPRLG+P Y+WWSEALHG++ G G
Sbjct: 52 FCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRLGVPAYKWWSEALHGLATSGRGL 111
Query: 114 HFDDVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSP 167
HFD PG ATSFP V+LT A+F++ LW +IGQA+ TEARA+YN+G+A GLT WSP
Sbjct: 112 HFD--APGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIGTEARALYNIGQAEGLTMWSP 169
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
N+N+ RDPRWGR ETPGEDP + +YAV +V+G+Q G+ +A L+ S+CCKH
Sbjct: 170 NVNIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQ---GNSSAI------LQTSACCKH 220
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
AYD+++W GV RY+F+A+VT QD+E+T+ PF CV + A+ +MC+Y +NG+P+CA
Sbjct: 221 VTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKATCIMCAYTGINGVPACA 280
Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
+ LL +TVRG+W L GYI +DCD++ +M D ++ + EDAVA LKAGLD++CG Y
Sbjct: 281 NADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRY-TQTPEDAVAVALKAGLDMNCGTYM 339
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIE 403
A+QQGK+ E DIDK+LK L+ + MRLG FDG P+ Y LG DIC+ E+
Sbjct: 340 QQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVYGGLGAADICTPEHRS 399
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
LA EAA +GIVLLKND LPL+ V + AV+GP+AN +A+IGNY G PC +P+ G
Sbjct: 400 LALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIGNYFGPPCESTTPLNG 459
Query: 464 FSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
GY NV + GC+ AC + AA+ A+ ++D + GL E+E DR L L
Sbjct: 460 ILGYIKNVRFLAGCNSAACDVAATDQAAAVAS-SSDYVFLFMGLSQKQESEGRDRTSLLL 518
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
PG Q LI VA+ AK PVILV+++ G VD+ FA+TN I AILWAGYPG+ GG AIA V
Sbjct: 519 PGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGLAIARV 578
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
+FG NPGGRLP+TWY ++ + +P+T M +R + GYPGR+Y+FY G T+Y FGYGLS
Sbjct: 579 LFGDHNPGGRLPVTWYPEEFTK-VPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGYGLS 637
Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK---- 698
Y+ + L+S K + N L R TS+ ++ + ++ D + K
Sbjct: 638 YSSYSRQLVSGGKPAESYTNLLASLRTTT-TSEGDES----YHIEEIGTDGCEQLKFPAV 692
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
V+ QN G DG V++Y + P Q+IGF+ ++ G I+F + C+ +
Sbjct: 693 VEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVGEKANIRFDISPCEHFS 752
Query: 759 IVDYAANTLLPAGEHTIFV 777
V ++ G H + V
Sbjct: 753 RVRKDGKKVIDRGSHYLMV 771
>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 780
Score = 671 bits (1732), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/798 (44%), Positives = 502/798 (62%), Gaps = 41/798 (5%)
Query: 11 FSLSIALL-VFSTN-----AVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
FS++I + +F T D+ ++ P + CD SF FC+ +L + R
Sbjct: 4 FSITITFIFLFLTRYHRLVHADSLATNVPPYSCDTSN-----PLTKSFPFCNLNLTITQR 58
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
KD+VSR+TLDEK+ QL + A +PRLG+P Y+WW+EALHGVS VG G + I ATS
Sbjct: 59 AKDIVSRLTLDEKISQLVNTAPAIPRLGIPSYQWWNEALHGVSYVGKGIRLNGSITAATS 118
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITET 183
FP +IL ASF+ LW +I + + TEAR +YN G+A G+T+W+PNIN+ RDPRWGR ET
Sbjct: 119 FPQIILIAASFDPKLWYRISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQET 178
Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYH 243
GEDP V +Y V+YVRGLQ + E + R LK S+CCKH+ AYD++NWKGV+RY
Sbjct: 179 AGEDPLVNSKYGVSYVRGLQG-DSFEGGKLIGGR-LKASACCKHFTAYDLENWKGVNRYV 236
Query: 244 FDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLH 303
FDA+VT QD+ +T+ F CV +G +S +MC+YNRVNG+P+CAD LL T R +W+ +
Sbjct: 237 FDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLLTNTARKKWNFN 296
Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
GYI +DCD+++ + + + A + ED VA L+AG+D++CG Y T +AV Q K+ +
Sbjct: 297 GYIASDCDAVRFIYEKQGY-AKTPEDVVADVLRAGMDVECGNYMTKHAKSAVLQKKIPIS 355
Query: 364 DIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID++L L+T+ +RLG FDG+P QY +G +CS EN++LA EAAR GIVLLKN
Sbjct: 356 QIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTA 415
Query: 421 NTLPLNSAKVKTVAVVGPHAN-ATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDD 478
+ LPL +V T+ V+GP+AN +++ ++GNY G PC+ +S + GF YA+ T Y++GC D
Sbjct: 416 SILPL--PRVNTLGVIGPNANKSSIVLLGNYFGQPCKQVSILKGFYTYASQTHYRSGCTD 473
Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
++ I A E AK +D I++ GLD S E E+LDR+ L LPG Q +LIN VA+ +K
Sbjct: 474 GVKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQKLINSVAKASK 533
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
PVILVI+ G VDI FA+ N I I+WAGYPGE GGRA+A VVFG +NPGGRLP+TWY
Sbjct: 534 KPVILVILCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWY 593
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
D+++ +P+T M +R S GYPGRTY+FY GP +Y FGYGLSY+ + YN +S K
Sbjct: 594 PKDFIK-IPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFIS-VKNNN 651
Query: 659 VNLNK------LQHCRNLNY--TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
+++N+ L++ + Y S+ K C + ++ + N GS G
Sbjct: 652 IHINQSTTHSILENSETIRYKLVSELGKKACKTMSIS---------VTLGITNTGSMAGK 702
Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA 770
V+++ KP +KQ++GF+ V V G + F + C+ L+ + + ++
Sbjct: 703 HPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEE 762
Query: 771 GEHTIFVGNGGVSFPIHL 788
G + VG S I L
Sbjct: 763 GGYLFLVGELEYSINITL 780
>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
Length = 791
Score = 671 bits (1731), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/761 (46%), Positives = 465/761 (61%), Gaps = 35/761 (4%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +P F C P SK FC+ LP S R DLVSRMT EK QLGD A+GVP
Sbjct: 44 AGAPPFSCGPSSPSK------GLPFCNMKLPASQRAADLVSRMTPAEKASQLGDIANGVP 97
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WW+EALHGV+ G G H + + ATSFP V+ T ASFN++LW +IGQA
Sbjct: 98 RLGVPSYKWWNEALHGVAISGKGIHMNQGVRSATSFPQVLHTAASFNDNLWFRIGQATGK 157
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARA YN+G+A GLT WSPN+N+ RDPRWGR ETPGEDP V RY +VRGLQ G
Sbjct: 158 EARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYGAAFVRGLQ---GS 214
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+ T L+ S+CCKH AYD+++WKGV RY F A VT QD+ +TF PF CV +G
Sbjct: 215 SSNTKSVPPVLQTSACCKHATAYDLEDWKGVSRYSFKATVTIQDLADTFNPPFRSCVVDG 274
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS VMC+Y VNG+PSCA+ LL +T RG W L GY+ ADCD++ +M N +F + E
Sbjct: 275 KASCVMCAYTIVNGVPSCANGDLLTKTFRGSWGLDGYVAADCDAVAIM-RNSQFYRPTAE 333
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
D VA TLKAGLD+DCG Y + A+Q+GK+ + D+DK++K L T MRLG FDG P+
Sbjct: 334 DTVAATLKAGLDIDCGPYIQQYAMAAIQKGKLTQQDVDKAVKNLLTTRMRLGHFDGDPKT 393
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y +LG IC+ E+ LA EAA +GIVLLKN LPL V + AV+G +AN +A
Sbjct: 394 NVYGNLGAGHICTAEHKNLALEAALDGIVLLKNSAGVLPLKRGTVNSAAVIGHNANDVLA 453
Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
++GNY G PC +P+ G GY NV + GC+ AC + A+ A ++DA I+
Sbjct: 454 LLGNYWGPPCAPTTPLQGIQGYVKNVKFLAGCNKAACNV-AATPQATALASSSDAVILFM 512
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GL E+E DR L LPG Q LIN VA AK PVILV+++ G VDI FA+ N I A
Sbjct: 513 GLSQEQESEGKDRTTLLLPGNQQSLINAVANAAKRPVILVLLTGGPVDITFAQANPKIGA 572
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPG+ GG AIA V+FG+ NP G+LP TWY ++ + +P+T M +R S YPGR
Sbjct: 573 ILWAGYPGQAGGLAIAKVLFGEKNPSGKLPNTWYPEEFTR-IPMTDMRMRAAGS--YPGR 629
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC------RNLNYTSDASK 678
TY+FYNG T+Y FGYGLSY++F + +++ K N + L NL+Y +
Sbjct: 630 TYRFYNGKTIYKFGYGLSYSKFSHRVVTGRKNPAHNTSLLAAGLAAMTEDNLSYHVEH-- 687
Query: 679 TRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
+ D+ CD F V QN G DG +++ + P+ +Q+IGFQ
Sbjct: 688 -------IGDVVCDQLKFLAVVKVQNHGPIDGKHTALMFLRWPSATDGRPTRQLIGFQSQ 740
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++AG ++F + C+ + V ++ G H + VG
Sbjct: 741 HIKAGEKANLRFEVSPCEHFSRVRQDGRKVIDKGSHFLKVG 781
>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 767
Score = 669 bits (1726), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/758 (47%), Positives = 476/758 (62%), Gaps = 33/758 (4%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ P F C G SS+ FCD++LP + R DLVSR+T EKV QLGD A GVP
Sbjct: 22 AGDPPFSC--------GQASSSYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVP 73
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WW+EALHG++ G G HFD + ATSFP V LT A+F++ LW +IGQA+
Sbjct: 74 RLGVPGYKWWNEALHGLATSGKGLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGR 133
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARA+YNLG+A GLT WSPN+N+ RDPRWGR ETPGEDP RYAV +VRG+Q
Sbjct: 134 EARALYNLGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQG---- 189
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
N+T L L+ S+CCKH AYD+++W GV RY+FDA+VT QD+E+TF PF CV +G
Sbjct: 190 -NSTSL----LQASACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPPFRSCVVDG 244
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS VMC+Y +NG+P+CA+ LL +TVRG+W L GY +DCD++ +M D ++ A S E
Sbjct: 245 KASCVMCAYTGINGVPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQRY-AQSPE 303
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
DAVA LKAGLD+DCG Y A+QQGK+ E DIDK+LK L+ + MRLG FDG P+
Sbjct: 304 DAVALALKAGLDIDCGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGHFDGDPRT 363
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y LG DIC+ E+ LA +AA++GIVLLKND LPL+ A V + AV+GP+AN A
Sbjct: 364 NMYGGLGAADICTAEHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGPNANNPGA 423
Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
+I NY G PC +P+ G GY + + GC AC + AA+ A T+D +
Sbjct: 424 LIANYFGPPCESTTPLKGIQGYVKDARFLAGCSSTACDVATTDQAAA-LASTSDYVFLFM 482
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GL E+E DR L LPG Q LI VA+ A+ PVILV++S G VD+ FA+TN I A
Sbjct: 483 GLGQRQESEGRDRTSLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQTNPKIGA 542
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPG+ GG AIA V+FG NP GRLP+TWY ++ +P+T M +R + GYPGR
Sbjct: 543 ILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEFTN-VPMTDMRMRADPANGYPGR 601
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSF-TKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
+Y+FY G T+Y FGYGLSY+ + LLS T T N + L +L T +++
Sbjct: 602 SYRFYQGKTVYKFGYGLSYSSYSRRLLSSGTSTPAPNADLLA---SLTTTMPSAENILGS 658
Query: 684 VLVNDLRCDD----YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
V + F V+ QN G DG V++Y + P A +Q+IGF++ +
Sbjct: 659 YHVEQIGAQGCEMLKFPAVVEVQNHGPMDGKQSVLMYLRWPNATAGRPERQLIGFKKEHL 718
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
+AG IKF C+ L+ V N ++ G H + V
Sbjct: 719 KAGEKAHIKFEIRPCEHLSRVREDGNKVIDRGSHFLRV 756
>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
Length = 777
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/789 (43%), Positives = 482/789 (61%), Gaps = 35/789 (4%)
Query: 11 FSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVS 70
F I +L+F S+ P F CD SS+ FC+++LP RV DLVS
Sbjct: 13 FIFVILVLLFRRTE-----STKPPFSCDSSN-----PNTSSYPFCNAALPIPQRVNDLVS 62
Query: 71 RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
R+T+DEK+ QL + A +PRLG+ YEWWSE LHG+S G GT F+ I AT FP +IL
Sbjct: 63 RLTVDEKILQLVNGAPEIPRLGISAYEWWSEGLHGISRHGKGTLFNGTIKAATQFPQIIL 122
Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGR-AGLTYWSPNINVARDPRWGRITETPGEDPF 189
T +SF+E+LW +I QA+ EARA+YN G+ G+T W+PNIN+ RDPRWGR ETPGEDP
Sbjct: 123 TASSFDENLWYRIAQAIGREARAVYNAGQLKGITLWAPNINILRDPRWGRGQETPGEDPM 182
Query: 190 VVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
+VG+Y V YVRGLQ EG + L L+ S+CCKH+ A D+DNW RY FDA+
Sbjct: 183 MVGKYGVAYVRGLQGDSFEGGK----LKDGHLQTSACCKHFIAQDMDNWHNFSRYTFDAQ 238
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V +QD+ +++ PF+ CV++G ASSVMC+YN VNGIP+CA+ LL T RG+W L GYIV
Sbjct: 239 VLKQDLADSYEPPFKDCVEQGKASSVMCAYNLVNGIPNCANFDLLTTTARGKWGLQGYIV 298
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
+DCD++ M + A EDAVA TLKAG+D++CG + +T +A+++ KVKE+DID+
Sbjct: 299 SDCDAVDKMYSEQHY-AKEPEDAVAATLKAGMDVNCGSHLKTYTKSALEKQKVKESDIDR 357
Query: 368 SLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
+L L++V MRLG F+G P +Y + ++CS+E+ LA EAAR G VLLKN LP
Sbjct: 358 ALHNLFSVRMRLGLFNGDPSKLEYGDISAAEVCSEEHRALAVEAARSGSVLLKNSNRLLP 417
Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKS 483
L+ K ++AV+GP AN + ++GNY G C+ ++ G GY AN Y GCD + C S
Sbjct: 418 LSKMKTASLAVIGPKANDSEVLLGNYEGFSCKNVTLFQGLQGYVANTMYHPGCDFINCTS 477
Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
+I A AK AD +++ GLD ++E E DR +L LPG Q +LI +AE A PVIL
Sbjct: 478 P-AIDEAVNIAKKADYVVLVMGLDQTLEREKFDRTELGLPGMQEKLITSIAEAASKPVIL 536
Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
V+M G VD+ FA+ N I ILW GYPGE G A+A ++FG+ NPGGR P+TWY ++
Sbjct: 537 VLMCGGPVDVTFAKDNPKIGGILWVGYPGEGGAAALAQILFGEHNPGGRSPVTWYPKEF- 595
Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
+ + M +RP S GYPGRTY+FYNGP ++ FGYGLSYT + Y S +K N+
Sbjct: 596 NKVAMNDMRMRPESSSGYPGRTYRFYNGPKVFEFGYGLSYTNYSYTFASVSK------NQ 649
Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLR---CDD-YFEFKVDFQNVGSTDGSDVVIVYSKP 719
L +N K + V+D+ C+ KV +N G G V+++ K
Sbjct: 650 LLF-KNPKINQSTEKGSVLNIAVSDVGPEVCNSAMITVKVAVKNQGEMAGKHPVLLFLKH 708
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ + K +IGF+ V + AG N ++ F C+ + ++ G+H + +G+
Sbjct: 709 SSTVDEVPKKTLIGFKSVNLEAGANTQVTFDVKPCEHFTRANRDGTLVIDEGKHFLLLGD 768
Query: 780 GGVSFPIHL 788
P+ L
Sbjct: 769 QEYPIPVSL 777
>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
Length = 780
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/739 (45%), Positives = 469/739 (63%), Gaps = 20/739 (2%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
++P F C P + ++ FC+ SLP + R + L+S +TL EK+QQL D A G+PR
Sbjct: 26 ANPQFPCKPPTHN-------TYSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGIPR 78
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIP--GATSFPTVILTTASFNESLWKKIGQAVS 148
LG+P YEWWSE+LHG+S GPG F + P AT FP VI++ ASFN +LW IG A++
Sbjct: 79 LGIPHYEWWSESLHGISINGPGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSAIA 138
Query: 149 TEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARAMYN+G+AGLT+W+PNIN+ RDPRWGR ETPGEDP V YA+ +V+G Q
Sbjct: 139 IEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHWK 198
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
++N L +S+CCKH AYD++ W RY F+A VTEQDME+T+ PF C+++G
Sbjct: 199 NEDGEINDDKLMLSACCKHSTAYDLEKWGNFSRYSFNAVVTEQDMEDTYQPPFRSCIQKG 258
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS +MCSYN VNG+P+CA LL Q R EW GYI +DCD++ + + + + S E
Sbjct: 259 KASCLMCSYNEVNGVPACAREDLL-QKPRTEWGFKGYITSDCDAVATIFEYQNY-SKSPE 316
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-- 386
DAVA LKAG+D++CG Y +AV++GK++E DID++L L++V +RLG FDG P
Sbjct: 317 DAVAIALKAGMDINCGTYVLRNAQSAVEKGKLQEEDIDRALHNLFSVQLRLGLFDGDPRK 376
Query: 387 -QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Q+ LG +++C+ E+ LA EAAR+GIVLLKND+ LPLN V ++A++GP AN +
Sbjct: 377 GQFGKLGPKNVCTKEHKTLALEAARQGIVLLKNDKKLLPLNKKAVSSLAIIGPLANMANS 436
Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
+ G+Y G PC S G Y T Y GC DVAC S+ A AK AD II+A
Sbjct: 437 LGGDYTGYPCDPQSLFEGLKAYVKKTSYAIGCLDVACVSDTQFHKAIIVAKRADFVIIVA 496
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GLDLS E E DR L LPG Q L++ VA +K PVILV+ G +D++FA+ + I +
Sbjct: 497 GLDLSQETEEHDRVSLLLPGKQMSLVSSVAAASKKPVILVLTGGGPLDVSFAKGDPRIAS 556
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILW GYPGE G +A+A+++FG++NPGGRLP+TWY + + + +T M +RP S GYPGR
Sbjct: 557 ILWIGYPGEAGAKALAEIIFGEYNPGGRLPMTWYPESFTE-VSMTDMNMRPNPSRGYPGR 615
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+FY G +Y FG GLSYT F Y +LS + ++ + + R R +
Sbjct: 616 TYRFYTGNRVYGFGGGLSYTNFTYKILSAPSKLSLSGSLSSNSRKRILQQGGE--RLSYI 673
Query: 685 LVNDL-RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
+N++ CD F ++ +NVG+ DG VV+++S+ P KQ++GF RV +
Sbjct: 674 NINEITSCDSLRFYMQILVENVGNMDGGHVVMLFSRVPTVFRGAPEKQLVGFDRVHTISH 733
Query: 743 RNKRIKFVFNACKSLNIVD 761
R+ + + + C+ L++ +
Sbjct: 734 RSTEMSILVDPCEHLSVAN 752
>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 774
Score = 668 bits (1724), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/755 (45%), Positives = 465/755 (61%), Gaps = 20/755 (2%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ P F C P FCD +L + R DLVSR+T EK+ QLGD A GVP
Sbjct: 26 AGDPPFSCGPSSAEA----SEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVP 81
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WW+EALHG++ G G HFD + ATSFP V+LT A+F++ LW +IGQA+
Sbjct: 82 RLGVPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGR 141
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARA++N+G+A GLT WSPN+N+ RDPRWGR ETPGEDP V RYAV +VRG+Q
Sbjct: 142 EARALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG---- 197
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+ +S L+ S+CCKH AYD+++W GV RY F ARVTEQD+E+TF PF CV E
Sbjct: 198 ----NSSSSLLQTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEA 253
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS VMC+Y +NG+P+CA+ LL TVRG+W L GY+ +DCD++ +M D ++ A + E
Sbjct: 254 KASCVMCAYTAINGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRY-APTPE 312
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
DAVA +LKAGLD+DCG Y A+QQGK+ E DIDK+L LY V MRLG FDG P+
Sbjct: 313 DAVAVSLKAGLDIDCGSYVQQHAAAAIQQGKLTEQDIDKALTNLYAVRMRLGHFDGDPRK 372
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y LG DIC+ E+ LA EAA++GIVLLKND LPL+ + V + AV+GP+AN +A
Sbjct: 373 NMYGVLGAADICTPEHRNLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNANDGMA 432
Query: 446 MIGNYAGIPCRYMSPIAGFSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
+I NY G PC +P+ G Y N V + GC+ AC + A + A + D +
Sbjct: 433 LIANYFGPPCESTTPLKGLQSYVNDVRFLAGCNSAACDVAATDQAVALAG-SEDYVFLFM 491
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GL E+E DR L LPG Q LI VA+ +K PVILV++S G VDI FA++N I A
Sbjct: 492 GLSQKQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGA 551
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPG+ GG AIA V+FG NP GRLP+TWY ++ + +P+T M +R + GYPGR
Sbjct: 552 ILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTK-VPMTDMRMRADPTSGYPGR 610
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
+Y+FY G T+Y FGYGLSY+ F L+ T ++ L R D ++
Sbjct: 611 SYRFYQGNTVYKFGYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETMTPQDGDRSYHVDA 670
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
+ + F V+ QN G DG V+++ + P Q+IGF+ ++AG
Sbjct: 671 IGTEGCEQLKFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHLKAGET 730
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+++F + CK + V ++ G H + V N
Sbjct: 731 AKLRFDISPCKHFSRVRADGRKVIDIGSHFLMVDN 765
>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 798
Score = 666 bits (1719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/768 (45%), Positives = 488/768 (63%), Gaps = 46/768 (5%)
Query: 51 SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
S FC+ +L + R KD+VSR+TLDEK+ QL + A +PRLG+P Y+WW EALHGV+N G
Sbjct: 47 SLPFCNLNLTITQRAKDIVSRLTLDEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAG 106
Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNI 169
G + + GATSFP VILT ASF+ LW +I + + TEAR +YN G+A G+T+W+PNI
Sbjct: 107 KGIRLNGSVAGATSFPQVILTAASFDSKLWYQISKVIGTEARGVYNAGQAQGMTFWAPNI 166
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
N+ RDPRWGR ET GEDP V +Y V+YVRGLQ + E + R LK S+CCKH+
Sbjct: 167 NIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQG-DSFEGGKLIGDR-LKASACCKHFT 224
Query: 230 AYDVDNWKGVDRYHFDARV----------------TEQDMEETFLRPFEMCVKEGDASSV 273
AYD+DNWKG+DR+ FDA+V T QD+ +T+ PF C+ +G +S +
Sbjct: 225 AYDLDNWKGLDRFDFDAKVSFLFSMAYSPWMINYVTLQDLADTYQPPFHSCIVQGRSSGI 284
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MC+YNRVNG+P+CAD LL +T R +W+ +GYI +DC++++++ DN + A + EDAVA
Sbjct: 285 MCAYNRVNGVPNCADYNLLTKTARQKWNFNGYITSDCEAVRIIYDNQGY-AKTPEDAVAD 343
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVS 390
L+AG+D++CG Y T AV Q KV + ID++L L+T+ +RLG FDG+P QY
Sbjct: 344 VLQAGMDVECGDYLTKHAKAAVLQKKVPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGR 403
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN-ATVAMIGN 449
+G +CS EN++LA EAAR GIVLLKN + LPL +V T+ V+GP+AN ++ ++GN
Sbjct: 404 IGPNQVCSKENLDLALEAARSGIVLLKNTASILPL--PRVNTLGVIGPNANKSSKVVLGN 461
Query: 450 YAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDL 508
Y G PCR + + GF YA+ T Y++GC D ++ I A E AK +D I++ GLD
Sbjct: 462 YFGRPCRLVPILKGFYTYASQTHYRSGCLDGTKCASAEIDRAVEVAKISDYVILVMGLDQ 521
Query: 509 SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
S E ES DR+DL LPG Q +LIN VA+ +K PVILV++ G VDI FA+ N I I+WA
Sbjct: 522 SQERESRDRDDLELPGKQQELINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWA 581
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
GYPGE GGRA+A VVFG +NPGGRLP+TWY D+++ +P+T M +R S GYPGRTY+F
Sbjct: 582 GYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFIK-IPMTDMRMRADPSSGYPGRTYRF 640
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNK------LQHCRNLNY--TSDASKTR 680
Y GP +Y FGYGLSY+ + YN +S K +++N+ L++ + Y S+ +
Sbjct: 641 YTGPKVYEFGYGLSYSNYSYNFIS-VKNNNLHINQSTTHSILENSETIYYKLVSELGEET 699
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
C + ++ + N GS G V+++ KP +KQ++GF+ V V
Sbjct: 700 CKTMSIS---------VTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVE 750
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
G + F + C+ L+ + + ++ G H + VG S I L
Sbjct: 751 GGGKGEVGFEVSVCEHLSRANESGVKVIEEGGHLLVVGEEEYSINITL 798
>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 779
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/787 (44%), Positives = 475/787 (60%), Gaps = 34/787 (4%)
Query: 13 LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
LS+ ++ + +++P F C PG ++ + FCD +LP R DLVSR+
Sbjct: 14 LSLIAMIMPAALLRTAAAATPPFSCGPGSATQ------GYAFCDKALPVERRAADLVSRL 67
Query: 73 TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
TL EKV QLGD A VPRLG+P Y+WWSE LHG+S G G HFD + TSFP V+LT
Sbjct: 68 TLAEKVSQLGDEADAVPRLGVPAYKWWSEGLHGLSFWGHGMHFDGAVRAITSFPQVLLTA 127
Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVV 191
ASF++ +W +IGQA+ TEARA+YNLG+A GLT WSPN+N+ RDPRWGR ETPGEDP
Sbjct: 128 ASFDQDIWYRIGQAIGTEARALYNLGQAQGLTIWSPNVNIYRDPRWGRGQETPGEDPTTA 187
Query: 192 GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQ 251
+YAV +V+GLQ ++ L+ S+CCKH AYD+++W GV RY+F+A+VT Q
Sbjct: 188 SKYAVAFVKGLQGT---------SATTLQTSACCKHATAYDLEDWNGVVRYNFNAKVTLQ 238
Query: 252 DMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCD 311
D+ +TF PF+ CV+EG A+ VMC+Y +NG+P+CA L+ +T +G+W L+GY+ +DCD
Sbjct: 239 DLADTFNPPFKSCVEEGKATCVMCAYTNINGVPACASSDLITKTFKGDWGLNGYVSSDCD 298
Query: 312 SIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKY 371
++ ++ D ++ A + ED VA LKAGLDL+CG Y +A+QQGK+ E D+D +LK
Sbjct: 299 AVALLRDAQRYRA-TPEDTVAVALKAGLDLNCGNYTQVHGMSALQQGKMTEQDVDNALKN 357
Query: 372 LYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
L+ V MRLG FDG P+ Y SLG D+CS + LA EAA+ GIVLLKND LPL+
Sbjct: 358 LFAVRMRLGHFDGDPRTSALYGSLGAADVCSPAHKNLALEAAQSGIVLLKNDAGILPLDP 417
Query: 428 AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNS 486
+ V + A +G +AN A+ GNY G PC +P+ G GY NV + GCD AC
Sbjct: 418 SAVASAAAIGHNANDPAALNGNYFGPPCETTTPLQGLQGYVKNVKFLAGCDSAACG---- 473
Query: 487 IFAASEAAKT----ADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
FAA+ A T +D I+ GL E E +DR L LPG Q LI VA +K PVI
Sbjct: 474 -FAATGQAVTLASSSDYVILFMGLSQKEEQEGIDRTSLLLPGKQQNLITAVASASKRPVI 532
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
LV+++ G VDI FA++N I AILWAGYPG+ GG AIA V+FG NP GRLP+TWY ++
Sbjct: 533 LVLLTGGSVDITFAKSNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEF 592
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
+ +P+T M +R + GYPGR+Y+FY G T+Y FG GLSY++F L+S T T QV
Sbjct: 593 TK-VPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGDGLSYSKFSRQLVSSTNTHQVPNT 651
Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPA 721
L +D + + CD F V+ QN G DG V+++ + P
Sbjct: 652 NLLTGLTARTATDGGMSYYHVEEIGVEGCDKLKFPAVVEVQNHGPMDGKHSVMMFLRWPN 711
Query: 722 EIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
+ Q++GF+ ++AG + F + C+ ++ G H + VG
Sbjct: 712 STGTGRPVSQLVGFRSQHLKAGEKASLTFDVSPCEHFARAREDGKKVIDRGSHFLVVGKD 771
Query: 781 GVSFPIH 787
H
Sbjct: 772 EREISFH 778
>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
Length = 732
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/729 (46%), Positives = 457/729 (62%), Gaps = 30/729 (4%)
Query: 74 LDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTA 133
L EK+ QL + A VPRLG+P YEWWSE+LHG+++ GPG F+ I ATSFP VI++ A
Sbjct: 2 LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAA 61
Query: 134 SFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGR 193
SFN +LW +IG AV+ E RAMYN G+AGLT+W+PNINV RDPRWGR ETPGEDP VV
Sbjct: 62 SFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSE 121
Query: 194 YAVNYVRGLQDVEGHENATDLNSR-------------PLKVSSCCKHYAAYDVDNWKGVD 240
Y V +VRG Q+ + + S L +S+CCKH+ AYD++ W
Sbjct: 122 YGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFT 181
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
RY F+A VTEQDME+T+ PFE C+++G AS +MCSYN VNG+P+CA LL Q R EW
Sbjct: 182 RYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLL-QKARVEW 240
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
GYI +DCD++ + + S E+AVA +KAG+D++CG Y T +A++QGKV
Sbjct: 241 GFEGYITSDCDAVATIFAYQGY-TKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKV 299
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAREGIVLLK 417
E +D++L L+ V +RLG FDG P QY LG DICS ++ +LA EA R+GIVLLK
Sbjct: 300 SEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLK 359
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGC 476
ND LPLN V ++A+VGP AN M G Y G PC+ + Y T Y +GC
Sbjct: 360 NDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGC 419
Query: 477 DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
DV+C S+ A AK AD I++AGLDLS E E DR L LPG Q L++ VA V
Sbjct: 420 SDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAV 479
Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
+K PVILV+ G VD+ FA+ + I +I+W GYPGE GG+A+A+++FG FNPGGRLP T
Sbjct: 480 SKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPTT 539
Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
WY + + ++ M +R S GYPGRTY+FY GP +Y FG GLSYT+F+Y +LS
Sbjct: 540 WYPESFTD-VAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILS--AP 596
Query: 657 IQVNLNKL-----QHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGS 710
I+++L++L H + L + + + V+VN C+ F +V N G DGS
Sbjct: 597 IRLSLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNS--CESLRFNVRVHVSNTGEIDGS 654
Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA 770
VV+++SK P ++ KQ+IG+ RV VR+ FV + CK L++ + ++P
Sbjct: 655 HVVMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPL 714
Query: 771 GEHTIFVGN 779
G H +F+G+
Sbjct: 715 GSHVLFLGD 723
>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 775
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/757 (45%), Positives = 471/757 (62%), Gaps = 22/757 (2%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+S P+F C P S+ ++ FCD SLP + R DLVSR+T+ EKV QLGD A GVP
Sbjct: 25 ASDPMFSCGPSSASR------AYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVP 78
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WWSE LHG++ G G F+ + TSFP V+LTTASF+ESLW +IGQA+
Sbjct: 79 RLGVPPYKWWSEGLHGLAFWGHGMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGR 138
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARA+YNLG+A GLT WSPN+N+ RDPRWGR ETPGEDP V +YAV +VRG+Q
Sbjct: 139 EARALYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQG---- 194
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
N + PL+ S+CCKH AYD+++W GV RY+FDARVT QD+ +TF PF+ CV +G
Sbjct: 195 SNPAGAAAAPLQASACCKHATAYDLEDWNGVARYNFDARVTLQDLADTFNPPFQSCVVDG 254
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS VMC+Y +NG+P+CA LL +T RG W L GY+ +DCD++ +M D ++ + E
Sbjct: 255 KASCVMCAYTVINGVPACASSDLLTKTFRGAWGLDGYVSSDCDAVAIMRDAQRY-EPTPE 313
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
D VA LKAGLDL+CG Y A+QQGK+ E D+DK+L L+ V MRLG FDG P+
Sbjct: 314 DTVAVALKAGLDLNCGTYTQQHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRG 373
Query: 388 ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
Y LG D+C+ ++ LA EAA++GIVLLKND LPL+ + V + AV+G +AN +
Sbjct: 374 NALYGRLGAADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAVGSAAVIGHNANDPL 433
Query: 445 AMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
+ GNY G C +P+ G Y NV + GC AC + A+ A +A+ +
Sbjct: 434 VLSGNYFGPACETTTPLEGLQSYVRNVRFLAGCSSAAC-GYAATGQAAALASSAEYVFLF 492
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
GL E E LDR L LPG Q L+ VA AK PV+LV+++ G VDI FA++N I
Sbjct: 493 MGLSQDQEKEGLDRTSLLLPGKQQSLVTAVASAAKRPVVLVLLTGGPVDITFAQSNPKIG 552
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AILWAGYPG+ GG AIA V+FG NP GRLP+TWY D+ + +P+T M +R + GYPG
Sbjct: 553 AILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYTEDFTK-VPMTDMRMRADPATGYPG 611
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
RTY+FY G T+Y FGYGLSY++F L++ K + N + L H T A+ +
Sbjct: 612 RTYRFYRGKTIYKFGYGLSYSKFSRQLVTGDKNLAPNTSLLAHLS--AKTQHAATSYYHV 669
Query: 684 VLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
+ + C+ F +V+ N G DG V+++ + P ++Q+IGF+ ++AG
Sbjct: 670 DDIGTVGCEQLKFPAEVEVLNHGPMDGKHSVLMFLRWPNATDGRPVRQLIGFRSQHIKAG 729
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++F + C+ + ++ G H + VG
Sbjct: 730 EKANVRFHVSPCEHFSRTRADGKKVIDRGSHFLMVGK 766
>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
Length = 783
Score = 664 bits (1713), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/761 (44%), Positives = 483/761 (63%), Gaps = 26/761 (3%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
++P + C P S + FC+ SLP S R L+S +TL +K+ QL + A +
Sbjct: 27 TTPDYPCKPPH--------SHYPFCNISLPISTRTTSLISLLTLSDKINQLSNTASSISH 78
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+P Y+WWSEALHG++ GPG +F+ + AT+FP VI++ A+FN SLW IG AV E
Sbjct: 79 LGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAAFNRSLWFLIGYAVGVE 138
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE- 209
RAM+N+G+AGL++W+PN+NV RDPRWGR ETPGEDP V YAV +VRG+Q V+G +
Sbjct: 139 GRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVEFVRGIQGVDGIKK 198
Query: 210 --NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
N D + L VS+CCKH+ AYD++ W RY+F+A VT+QD+E+T+ PF CV++
Sbjct: 199 VLNDHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVVTQQDLEDTYQPPFRGCVQQ 258
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
G AS +MCSYN VNG+P+CA LL VR +W GYI +DCD++ + + K+ A S
Sbjct: 259 GKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGYIASDCDAVATVFEYQKY-AKSA 316
Query: 328 EDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
EDAVA LKAG+D++CG + T +A++QG VKE D+D++L L++V MRLG F+G P+
Sbjct: 317 EDAVADVLKAGMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPE 376
Query: 388 ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
+ LG QD+C+ E+ +LA EAAR+GIVLLKND LPL+ ++A++GP A T
Sbjct: 377 KGKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMA-TTS 435
Query: 445 AMIGNYAGIPCRYMSPIAGFSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
+ G Y+GIPC S G Y ++Y GC DV C S++ A + AK AD +I+
Sbjct: 436 ELGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIV 495
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
AGLD ++E E LDR L LPG Q L+++VA +K PVILV+ G +D++FAE+N I
Sbjct: 496 AGLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLIT 555
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
+ILW GYPGE GG+A+A+++FG+FNP GRLP+TWY + +P+ M +R S GYPG
Sbjct: 556 SILWIGYPGEAGGKALAEIIFGEFNPAGRLPMTWYPESFTN-VPMNDMGMRADPSRGYPG 614
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC---RNLNYTSDASKTR 680
RTY+FY G +Y FG+GLSY+ F Y +LS +++L+K + R+L +
Sbjct: 615 RTYRFYTGSRIYGFGHGLSYSDFSYRVLSAPS--KLSLSKTTNGGLRRSLLNKVEKDVFE 672
Query: 681 CPGVLVNDLR-CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
V V++L+ C+ F + NVG DGS VV+++SK P I + Q++G R+
Sbjct: 673 VDHVHVDELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNIQGSPESQLVGPSRLH 732
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ ++ + + C+ + D +LP G H + VG+
Sbjct: 733 TVSNKSIETSILADPCEHFSFADEQGKRILPLGNHILNVGD 773
>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
Length = 742
Score = 663 bits (1710), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/725 (46%), Positives = 469/725 (64%), Gaps = 32/725 (4%)
Query: 9 LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
LC + I + + +T+ S+ P + CD S + FC + LP S RV+DL
Sbjct: 6 LCLRILILIAIHTTSLHLYVESTQPPYSCDSSDPS-----TKLYPFCQTKLPISQRVEDL 60
Query: 69 VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV---SNVGPGTHFDDVIPGATSF 125
VSR+TLDEKV QL D A +PRLG+P YEWWSEALHGV + V G F+ I ATSF
Sbjct: 61 VSRLTLDEKVSQLVDTAPAIPRLGIPAYEWWSEALHGVALQTTVRQGIRFNGTIRFATSF 120
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETP 184
P VILT ASF+ LW +IGQ + EAR +YN G+A G+T+W+PNIN+ RDPRWGR ETP
Sbjct: 121 PQVILTAASFDAHLWYRIGQVIGKEARGIYNAGQATGMTFWAPNINIFRDPRWGRGQETP 180
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP V G+YAV+YVRG+Q G L+ S+CCKH+ AYD+D WKG++R+ F
Sbjct: 181 GEDPLVAGKYAVSYVRGVQ---GDSFGGGTLGEQLQASACCKHFTAYDLDKWKGMNRFVF 237
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
DA QD+ +T+ PF+ C++EG AS +MC+YNRVNG+P+CAD LL++ RG+W +G
Sbjct: 238 DA----QDLADTYQPPFQSCIQEGKASGIMCAYNRVNGVPNCADYNLLSKKARGQWGFYG 293
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
YI +DCD++ ++ D+ + A S EDAVA LKAG+D++CG Y N+T +AV++ K+ E++
Sbjct: 294 YITSDCDAVAIIHDDQGY-AKSPEDAVADVLKAGMDVNCGDYLKNYTKSAVKKKKLPESE 352
Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
ID++L L+++ MRLG F+G+P Y ++ +CS E+ LA +AA++GIVLLKN
Sbjct: 353 IDRALHNLFSIRMRLGLFNGNPTKQPYGNIAPDQVCSQEHQALALKAAQDGIVLLKNPDK 412
Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVA 480
LPL+ + K++AV+GP+AN + ++GNY G PC+ ++P+ G Y N Y GC VA
Sbjct: 413 LLPLSKLETKSLAVIGPNANNSTKLLGNYFGPPCKTVTPLQGLQNYIKNTRYHPGCSRVA 472
Query: 481 CKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGP 540
C S+ SI A + AK AD I++ GLD + E E DR DL LPG Q +LI VA+ AK P
Sbjct: 473 C-SSASINQAVKIAKGADQVILVMGLDQTQEKEEQDRVDLVLPGKQRELITAVAKAAKKP 531
Query: 541 VILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNG 600
V+LV+ G VD++FA+ + NI +I+WAGYPGE GG A+A ++FG NPGGRLP+TWY
Sbjct: 532 VVLVLFCGGPVDVSFAKYDQNIGSIIWAGYPGEAGGTALAQIIFGDHNPGGRLPMTWYPQ 591
Query: 601 DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
D+ + +P+T M +RP S GYPGRTY+FYNG ++ FGYGLSY+ + Y L S T+
Sbjct: 592 DFTK-VPMTDMRMRPQLSSGYPGRTYRFYNGKKVFEFGYGLSYSNYSYELASDTQ----- 645
Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVN---DLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
NKL + N + S T ++ N +L F V +N G G + I Y
Sbjct: 646 -NKLYLRASSNQITKNSNTIRHKLISNIGKELCEKTKFTVTVRVKNHGEMAGENAEIQYE 704
Query: 718 KPPAE 722
P E
Sbjct: 705 LSPCE 709
>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
Precursor
gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
Length = 767
Score = 662 bits (1707), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/774 (44%), Positives = 486/774 (62%), Gaps = 40/774 (5%)
Query: 30 SSSPVFVCDPGR-FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
S+ P CDP +KL + FC + LP R +DLVSR+T+DEK+ QL + A G+
Sbjct: 19 SAPPPHSCDPSNPTTKL------YQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGI 72
Query: 89 PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
PRLG+P YEWWSEALHGV+ GPG F+ + ATSFP VILT ASF+ W +I Q +
Sbjct: 73 PRLGVPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIG 132
Query: 149 TEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DV 205
EAR +YN G+A G+T+W+PNIN+ RDPRWGR ETPGEDP + G YAV YVRGLQ
Sbjct: 133 KEARGVYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSF 192
Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
+G + S L+ S+CCKH+ AYD+D WKG+ RY F+A+V+ D+ ET+ PF+ C+
Sbjct: 193 DGRKTL----SNHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCI 248
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
+EG AS +MC+YNRVNGIPSCADP LL +T RG+W GYI +DCD++ ++ D + A
Sbjct: 249 EEGRASGIMCAYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGY-AK 307
Query: 326 SKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
S EDAVA LKAG+D++CG Y T +A+QQ KV ETDID++L L++V +RLG F+G
Sbjct: 308 SPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGD 367
Query: 386 PQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
P Y ++ ++CS + LA +AAR GIVLLKN+ LP + V ++AV+GP+A+
Sbjct: 368 PTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHV 427
Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
++GNYAG PC+ ++P+ Y N Y GCD VAC SN +I A AK AD +
Sbjct: 428 VKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVAC-SNAAIDQAVAIAKNADHVV 486
Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
++ GLD + E E DR DL LPG Q +LI VA AK PV+LV++ G VDI+FA N
Sbjct: 487 LIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNK 546
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
I +I+WAGYPGE GG AI++++FG NPGGRLP+TWY +V + +T M +R + GY
Sbjct: 547 IGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVN-IQMTDMRMR--SATGY 603
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQ-HCRNLNYT--SDAS 677
PGRTYKFY GP +Y FG+GLSY+ + Y + +T + +N +K Q + ++ YT S+
Sbjct: 604 PGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQSKAQTNSDSVRYTLVSEMG 663
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQ 735
K C + V+ +N G G V+++++ E KQ++GF+
Sbjct: 664 KEGCDVAKT---------KVTVEVENQGEMAGKHPVLMFARHERGGEDGKRAEKQLVGFK 714
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
+ + G ++F C+ L+ + +L G++ + VG+ P+ +N
Sbjct: 715 SIVLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDS--ELPLIVN 766
>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 755
Score = 660 bits (1703), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/759 (45%), Positives = 472/759 (62%), Gaps = 40/759 (5%)
Query: 33 PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
P F C P Q + + FC+ +LP R DLV+++TL+EKV QLGD A GVPR G
Sbjct: 12 PAFSCGPP-------QQAQYAFCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFG 64
Query: 93 LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
+P Y WWSE LHGVS G G HF+ + G T+FP V+LTTASF++S+W +IGQA+ TEAR
Sbjct: 65 VPGYNWWSEGLHGVSMWGHGMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEAR 124
Query: 153 AMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
AM+NLG+A GLT WSPN+N+ RDPRWGR ETPGEDP +YAV +VRGLQ
Sbjct: 125 AMFNLGQADGLTIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQGT------ 178
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
++ L+ S+CCKH AYD+D+W + RY+F+A+VT QD+EETF PF+ CV EG A+
Sbjct: 179 ---STTTLQTSACCKHATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPPFKSCVVEGKAT 235
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
VMC+Y VNGIP+CAD LL +T++GEW ++GYI +DCD++ ++ + + EDAV
Sbjct: 236 CVMCAYTSVNGIPACADSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR--YSGTPEDAV 293
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG----SPQ 387
A +KAGLD++CG + A+QQ K+ E D+DK+L+ L+ + MRLG FDG SP
Sbjct: 294 AAAIKAGLDMNCGNFSQVHGMAALQQRKMSEQDVDKALRNLFAIRMRLGHFDGDPLQSPL 353
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN--SAKVKTVAVVGPHANATVA 445
Y LG QD+CS + +LA EAA+ GIVLLKND TLPL+ +A + AV+GP+AN A
Sbjct: 354 YGRLGAQDVCSPAHKDLALEAAQNGIVLLKNDAATLPLSRPTAASASFAVIGPNANEPGA 413
Query: 446 MIGNYAGIPCRYMSPIAGFSGY--ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
++GNY G PC +P+ + NV + GCD AC ++ + AS A T+D TI+
Sbjct: 414 LLGNYFGPPCETTTPLQALQKFYSKNVRFVPGCDSAACNVADT-YQASGLAATSDYTILF 472
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
GL E E LDR L LPG Q LI VA AK P+ILV+++ G VDI FA+ N I
Sbjct: 473 MGLSQKQEQEGLDRTSLLLPGKQESLITAVAAAAKRPIILVLLTGGPVDITFAKFNPKIG 532
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AILWAGYPG+ GG AIA V+FG+ NP GRLP+TWY +Y + +P+ M +R + GYPG
Sbjct: 533 AILWAGYPGQAGGLAIAKVLFGEHNPSGRLPVTWYPEEYTK-VPMDDMRMRADPATGYPG 591
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS-DASKTRCP 682
R+Y+FY G +Y FGYGLSY++F L+ + + N+ + L + D +R
Sbjct: 592 RSYRFYKGNAVYKFGYGLSYSKFSRQLVRNSSSN----NRAPNTELLAAAAVDCGASRY- 646
Query: 683 GVLVNDLR---CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
LV ++ C+ F V+ +N G DG V+++ + P Q++GF+
Sbjct: 647 -YLVEEIGGEVCERLKFPAVVEVENHGPMDGKQSVLLFLRWPTATEGRPASQLVGFRSQD 705
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
+RAG + F + C+ + ++ G H + V
Sbjct: 706 LRAGEKASVSFDISPCEHFSRTTVDGTKVIDRGSHFLMV 744
>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 778
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/776 (43%), Positives = 491/776 (63%), Gaps = 42/776 (5%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
S+ P + CD S + FC++ LP + R +DLVSR+TLDEK+ QL + A +P
Sbjct: 26 STRPPYSCDSSSNSPY------YSFCNTKLPITKRAQDLVSRLTLDEKLAQLVNTAPAIP 79
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WWSEALHGV++ G G F+ I ATSFP VILT ASF+ +LW +I + +
Sbjct: 80 RLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGR 139
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVE 206
EARA+YN G+A G+T+W+PNINV RDPRWGR ET GEDP + +Y V YVRGLQ E
Sbjct: 140 EARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSFE 199
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
G + A L + S+CCKH+ AYD+D WKG+DR+ FDARVT QD+ +T+ PF+ C++
Sbjct: 200 GGKLAERL-----QASACCKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQPPFQSCIE 254
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
+G AS +MC+YNRVNG+P+CAD LL +T R +W GYI +DC ++ ++ + + A +
Sbjct: 255 QGRASGIMCAYNRVNGVPNCADFNLLTKTARQQWKFDGYITSDCGAVSIIHEKQGY-AKT 313
Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
EDA+A +AG+D++CG Y T +AV Q K+ + ID++L+ L+++ +RLG FDG+P
Sbjct: 314 AEDAIADVFRAGMDVECGDYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRLGLFDGNP 373
Query: 387 Q---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
+ ++G ++CS ++++LA EAAR+GIVLLKN + LPL T+A++GP+ANA+
Sbjct: 374 TKLPFGTIGPNEVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANAS 432
Query: 444 VAM-IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
+ +GNY G PC ++ + GF GYA Y GCDD + I A E AK D ++
Sbjct: 433 SKVFLGNYYGRPCNLVTLLQGFEGYAKTVYHPGCDDGPQCAYAQIEEAVEVAKKVDYVVL 492
Query: 503 LAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
+ GLD S E ES DRE L LPG Q +LI VA AK PV++V++ G VDI A+ + +
Sbjct: 493 VMGLDQSQERESHDREYLGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDITSAKFDDKV 552
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
ILWAGYPGE GG A+A VVFG NPGG+LPITWY D+++ +P+T M +R + GYP
Sbjct: 553 GGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFIK-VPMTDMRMRADPASGYP 611
Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNK----LQHCRNLNY--TSD 675
GRTY+FY GP +Y FGYGLSYT++ Y LLS + T+ +N + Q+ + Y S+
Sbjct: 612 GRTYRFYTGPKVYEFGYGLSYTKYSYKLLSLSHSTLHINQSSTHLMTQNSETIRYKLVSE 671
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY---SKPPAEIAATYIKQVI 732
++ C +L++ + N G+ G V+++ K +KQ++
Sbjct: 672 LAEETCQTMLLS---------IALGVTNRGNLAGKHPVLLFVRQGKVRNINNGNPVKQLV 722
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
GFQ V V AG ++ F + C+ L++ + A + ++ G + VG+ +PI +
Sbjct: 723 GFQSVKVNAGETVQVGFELSPCEHLSVANEAGSMVIEEGSYLFIVGDQ--EYPIEV 776
>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
Length = 780
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/734 (45%), Positives = 472/734 (64%), Gaps = 12/734 (1%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
FCD+SLP R + LVS +TL EK+ L + A +PRLG+P Y+WWSE+LHG++ GPG
Sbjct: 41 FCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWSESLHGLALNGPGV 100
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
F +P ATSFP VIL+ ASFN SLW + A++ EARAM+N+G+AGLT+W+PNIN+ R
Sbjct: 101 SFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQAGLTFWAPNINLFR 160
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG-HENATDLNSRPLKVSSCCKHYAAYD 232
DPRWGR ETPGEDP + YAV YVRGLQ + G + + L VS+CCKH+ AYD
Sbjct: 161 DPRWGRGQETPGEDPMLASAYAVEYVRGLQGLSGIQDAVVVDDDDTLMVSACCKHFTAYD 220
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+D W RY+F+A V++QD+E+T+ PF C+++G AS +MCSYN VNG+P+CA +LL
Sbjct: 221 LDMWGQFSRYNFNAVVSQQDLEDTYQPPFRSCIQQGKASCLMCSYNEVNGVPACASEELL 280
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
R +W GYI +DCD++ + + K+ A S+EDAVA LKAG+D++CG + T
Sbjct: 281 G-LARDKWGFKGYITSDCDAVATVYEYQKY-AKSQEDAVADVLKAGMDINCGTFMLRHTE 338
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAA 409
+A++QGKVKE D+D++L L++V +RLG FDG P ++ LG +D+C+ E+ LA +AA
Sbjct: 339 SAIEQGKVKEEDLDRALLNLFSVQLRLGLFDGDPIRGRFGKLGPKDVCTQEHKTLALDAA 398
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN 469
R+GIVLLKND+ LPL+ ++AV+GP A T + G Y+GIPC S G +A
Sbjct: 399 RQGIVLLKNDKKFLPLDRDIGASLAVIGPLAT-TTKLGGGYSGIPCSSSSLYEGLGEFAE 457
Query: 470 -VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQ 528
++Y GC DV C S++ A + AK AD +I+AGLD + E E DR L LPG Q
Sbjct: 458 RISYAFGCYDVPCDSDDGFAEAIDTAKQADFVVIVAGLDATQETEDHDRVSLLLPGKQMN 517
Query: 529 LINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFN 588
L++ VA+ +K PVILV++ G +D++FAE N I +I+W GYPGE GG+A+A+++FG+FN
Sbjct: 518 LVSSVADASKNPVILVLIGGGPLDVSFAEKNPQIASIIWLGYPGEAGGKALAEIIFGEFN 577
Query: 589 PGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKY 648
P GRLP+TWY + +P+ M +R S GYPGRTY+FY G +Y FG+GLS++ F Y
Sbjct: 578 PAGRLPMTWYPEAFTN-VPMNEMSMRADPSRGYPGRTYRFYTGGRVYGFGHGLSFSDFSY 636
Query: 649 NLLSFTKTIQVNLNKLQHCRN-LNYTSDASKTRCPGVLVNDLR-CDDY-FEFKVDFQNVG 705
N LS I ++ R L Y + V VN L+ C+ F + N+G
Sbjct: 637 NFLSAPSKISLSRTIKDGSRKRLLYQVENEVYGVDYVPVNQLQNCNKLSFSVHISVMNLG 696
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
DGS VV+++SK P + + Q++GF R+ + + + + C+ L+ D
Sbjct: 697 GLDGSHVVMLFSKGPKVVDGSPETQLVGFSRLHTISSKPTETSILVHPCEHLSFADKQGK 756
Query: 766 TLLPAGEHTIFVGN 779
+LP G HT+ VG+
Sbjct: 757 RILPLGPHTLSVGD 770
>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
Length = 772
Score = 657 bits (1695), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/767 (45%), Positives = 466/767 (60%), Gaps = 39/767 (5%)
Query: 33 PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
P F C P FCD +L + R DLVSR+T EK+ QLGD A GVPRLG
Sbjct: 26 PPFSCGPTSAEA----SEGLAFCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLG 81
Query: 93 LPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
+P Y+WW+EALHG++ G G HFD V + ATSFP V+LT A+F++ LW +IGQA+ E
Sbjct: 82 VPGYKWWNEALHGLATSGKGLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGRE 141
Query: 151 ARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
ARA++N+G+A GLT WSPN+N+ RDPRWGR ETPGEDP V RYAV +VRG+Q
Sbjct: 142 ARALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG----- 196
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ +S L+ S+CCKH AYD+++W GV RY F ARVT QD+E+TF PF CV EG
Sbjct: 197 ---NSSSSLLQTSACCKHATAYDLEDWNGVARYSFVARVTAQDLEDTFNPPFRSCVVEGK 253
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
AS +MC+Y +NG+P+CA+ LL TVRG+W L GY+ +DCD++ +M D ++ A + ED
Sbjct: 254 ASCIMCAYTAINGVPACANTDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRY-APTPED 312
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
AVA +LKAGLD+DCG Y A+QQGK+ E DIDK+L L+ V MRLG FDG P+
Sbjct: 313 AVAVSLKAGLDIDCGSYIQQHATAAIQQGKLTELDIDKALVNLFAVRMRLGHFDGDPRKN 372
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
Y +L DIC+ E+ LA EAA++GIVLLKND LPL+ + V + AV+GP++N +A+
Sbjct: 373 MYGALSAADICTPEHRSLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNSNDGMAL 432
Query: 447 IGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCD----DVACKSNNSIFAASEAAKTADATI 501
I NY G PC +P+ G Y NV + GC DVA + + SE D
Sbjct: 433 IANYFGPPCESTTPLQGLQSYVNNVRFLAGCSSAACDVAVTDQAVVLSGSE-----DYVF 487
Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
+ GL E+E DR L LPG Q LI VA+ +K PVILV++S G VDI FA++N
Sbjct: 488 LFMGLSQQQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPK 547
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
I AILWAGYPG+ GG AIA V+FG NP GRLP+TWY D+ + +P+T M +R + GY
Sbjct: 548 IGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPMTWYPEDFTK-VPMTDMRMRADPTSGY 606
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
PGR+Y+FY G +Y FGYGLSY+ F LL T ++ L R T + ++
Sbjct: 607 PGRSYRFYQGNAVYKFGYGLSYSTFSSRLLYGTSMPALSSTVLAGLRE-TVTEEGDRS-- 663
Query: 682 PGVLVNDLRCDDYFEFK----VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
++D+ D + K V+ QN G DG +++ + P Q+IGF
Sbjct: 664 --YHIDDIGTDGCEQLKFPAMVEVQNHGPMDGKHSALMFLRWPNTNGGRPASQLIGFMSQ 721
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSF 784
++AG ++F + C+ + V ++ G H + V N +
Sbjct: 722 HLKAGETANLRFDISPCEHFSRVRADGMKVIDIGSHFLTVDNHAIEI 768
>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 829
Score = 657 bits (1695), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/756 (46%), Positives = 462/756 (61%), Gaps = 28/756 (3%)
Query: 33 PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
P F C G LGL FC++ LP + R DLVSRMT EK QLGD A+GVPRLG
Sbjct: 84 PPFSCGGG--PSLGLP-----FCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLG 136
Query: 93 LPQYEWWSEALHGVSNVGPGTHFD-DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEA 151
+P Y+WW+EALHGV+ G G H D + ATSFP V+LT ASFN++LW +IGQA EA
Sbjct: 137 VPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEA 196
Query: 152 RAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
RA YN+G+A GLT WSPN+N+ RDPRWGR ETPGEDP V RYA +VRGLQ +
Sbjct: 197 RAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQG-----S 251
Query: 211 ATDLNSRP--LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+++ S P L S+CCKH AYD+++WKGV RY F A VT QD+ +TF PF CV +G
Sbjct: 252 SSNTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDG 311
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS VMC+Y VNG+PSCA+ LL +T RG W L GY+ ADCD++ +M N +F + E
Sbjct: 312 KASCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGYVAADCDAVSIM-RNSQFYRPTAE 370
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
D VA TLKAGLD+DCG Y A+Q+GK+ + D+DK++K L+T MRLG FDG P+
Sbjct: 371 DTVATTLKAGLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKA 430
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y +LG IC+ E+ LA EAA +GIVLLKN LPL V + AV+G +AN +A
Sbjct: 431 HVYGNLGAAHICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLA 490
Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
++GNY G PC +P+ G GY NV + GC AC + AA+ A+ T+D+ I+
Sbjct: 491 LLGNYWGPPCAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALAS-TSDSVILFM 549
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GL E+E DR L LPG Q LI VA AK PVILV+++ G VDI FA+ N I A
Sbjct: 550 GLSQEQESEGKDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGA 609
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPG+ GG AIA V+FG+ NP GRLP+TWY ++ + +P+T M +R S YPGR
Sbjct: 610 ILWAGYPGQAGGLAIAKVLFGEKNPSGRLPVTWYPEEFTK-VPMTDMRMRSAGS--YPGR 666
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
+Y+FY G T+Y FGYGLSY++F + +++ N L + T D
Sbjct: 667 SYRFYKGKTIYKFGYGLSYSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYHVDH- 725
Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
+ D C F V QN G DG +++ + P +Q++GFQ ++AG
Sbjct: 726 -IGDELCRQLKFLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGE 784
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++F + C+ + V ++ G H + VG
Sbjct: 785 KAHLRFEVSPCEDFSRVRDDGRKVIDKGSHFLKVGK 820
>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 766
Score = 657 bits (1695), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/773 (44%), Positives = 485/773 (62%), Gaps = 38/773 (4%)
Query: 30 SSSPVFVCDPGR-FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
S+ P CDP +KL + FC + LP S R +DLVSR+ +DEK+ QLG+ A G+
Sbjct: 18 SAPPPHSCDPSNPTTKL------YQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGI 71
Query: 89 PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
PRLG+P YEWWSEALHGV+ GPG F+ + ATSFP VILT ASF+ W +I Q +
Sbjct: 72 PRLGVPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIG 131
Query: 149 TEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DV 205
EAR +YN G+A G+T+W+PNIN+ RDPRWGR ETPGEDP + G YAV YVRGLQ
Sbjct: 132 KEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDSF 191
Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
+G + S L+ S+CCKH+ AYD+D WKG+ RY F+A+V+ D+ ET+ PF+ C+
Sbjct: 192 DGRKTL----SIHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCI 247
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
+EG AS +MC+YNRVNGIPSCADP LL +T RG W GYI +DCD++ ++ D + A
Sbjct: 248 EEGRASGIMCAYNRVNGIPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHDAQGY-AK 306
Query: 326 SKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
+ EDAVA LKAG+D++CG Y T +A+QQ KV ETDID++L L++V +RLG F+G
Sbjct: 307 TPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGD 366
Query: 386 PQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
P Y ++ D+CS + LA EAAR GIVLLKN+ LP + V ++AV+GP+A+
Sbjct: 367 PTKLPYGNISPNDVCSPAHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHV 426
Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
++GNYAG PC+ ++P+ Y N Y GCD VAC SN +I A A+ AD +
Sbjct: 427 AKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHNGCDSVAC-SNAAIDQAVAIARNADHVV 485
Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
++ GLD + E E +DR DL LPG Q +LI VA AK PV+LV++ G VDI+FA N
Sbjct: 486 LIMGLDQTQEKEDMDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFATNNDK 545
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
I +I+WAGYPGE GG A+A+++FG NPGGRLP+TWY +V + +T M +R + GY
Sbjct: 546 IGSIMWAGYPGEAGGIALAEIIFGDHNPGGRLPVTWYPQSFVN-VQMTDMRMR--SATGY 602
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQ-HCRNLNYT--SDAS 677
PGRTYKFY GP ++ FG+GLSY+ + Y + T + +N +K Q + ++ YT S+
Sbjct: 603 PGRTYKFYKGPKVFEFGHGLSYSTYSYRFKTLGATNLYLNQSKAQLNSDSVRYTLVSEMG 662
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQ 735
+ C + V +N G G V+++++ E KQ++GF+
Sbjct: 663 EEGCNIAKTKVI---------VTVENQGEMAGKHPVLMFARHERGGENGKRAEKQLVGFK 713
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
+ + G ++F C+ L+ + ++ G++ + VG+ + I++
Sbjct: 714 SIVLSNGEKAEMEFEIGLCEHLSRANEVGVMVVEEGKYFLTVGDSELPLTINV 766
>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
Length = 770
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/740 (44%), Positives = 476/740 (64%), Gaps = 24/740 (3%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
S++ FC+++LP+ R + LVS +TLDEK+ QL + A G PRLG+P +EWWSE+LHGV +
Sbjct: 36 SAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGVCDN 95
Query: 110 GPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
GPG +F + AT FP VIL+ A+FN SLW+ +A++ EARAM+N G+AGLT+W+PN
Sbjct: 96 GPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLTFWAPN 155
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
INV RDPRWGR ETPGEDP VV Y+V YV+G Q G E + +S+CCKHY
Sbjct: 156 INVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MMLSACCKHY 208
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AYD++ W+G RY F+A+V QDME+T+ PF+ C++EG AS +MCSYN+VNG+P+CA
Sbjct: 209 IAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGVPACAR 268
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
+L Q R EW GYI +DCD++ ++ +N + A S ED++A LKAG+D++CG +
Sbjct: 269 KDIL-QRARDEWGFQGYITSDCDAVAIIHENQTYTA-SDEDSIAVVLKAGMDINCGSFLI 326
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
T +A+++GKV+E DI+ +L L++V +RLGFFD + + + LG ++C+ E+ ELA
Sbjct: 327 RHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELA 386
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
AEA R+G VLLKND LPL ++V +A++GP AN + G+Y G+PC + + G
Sbjct: 387 AEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQ 446
Query: 466 GYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
Y T+ GC DV C S + A EAAK AD +++AGL+L+ E E DR L LPG
Sbjct: 447 AYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPG 506
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LI+ VA V K PV+LV+M G VD++FA+ + I +ILW GYPGE GG + +++F
Sbjct: 507 RQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILF 566
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
GK+NPGG+LPITWY + +P+ M +R S GYPGRTY+FY G +Y FGYGLSY+
Sbjct: 567 GKYNPGGKLPITWYPESFT-AVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSYS 625
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG---VLVNDLRCDDYFEFKVDF 701
++ Y++L K I ++ + + + + TR G V V D+ + +F V
Sbjct: 626 KYSYSILQAPKKISLSRSSVPDL----ISRKPAYTRRDGVDYVQVEDIASCEALQFPVHI 681
Query: 702 --QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNI 759
N G+ DGS V++++ + IKQ++GF+RV AGR+ ++ + CK ++
Sbjct: 682 SVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSF 741
Query: 760 VDYAANTLLPAGEHTIFVGN 779
+ +L G H + VG+
Sbjct: 742 ANTEGTRVLFLGTHVLMVGD 761
>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
Length = 771
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/739 (45%), Positives = 466/739 (63%), Gaps = 68/739 (9%)
Query: 88 VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQA- 146
+PRLG+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN SL++ IG++
Sbjct: 45 LPRLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGESA 104
Query: 147 -----------------------------------------VSTEARAMYNLGRAGLTYW 165
VSTEARAM+N+G AGLT+W
Sbjct: 105 CNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGLAGLTFW 164
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ETPGEDP + +YAV YV GLQD G +A LKV++CC
Sbjct: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGSDA-------LKVAACC 217
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHY AYDVDNWKGV+RY FDA V++QD+++TF PF+ CV +G+ +SVMCSYN+VNG P+
Sbjct: 218 KHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPT 277
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CAD LL+ +RG+W L+GYIV+DCDS+ V+ +N + + EDA A T+K+GLDL+CG
Sbjct: 278 CADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKN-PEDAAAITIKSGLDLNCGN 336
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENI 402
+ T AVQ GK+ E+D+D+++ + VLMRLGFFDG P+ + SLG +D+C+ N
Sbjct: 337 FLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSNQ 396
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
ELA EAAR+GIVLLKN LPL++ +K++AV+GP+ANA+ MIGNY G PC+Y +P+
Sbjct: 397 ELAREAARQGIVLLKN-TGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQ 455
Query: 463 GFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
G Y+ GC +V C N+ + AA++AA +AD T+++ G D SVE ESLDR L
Sbjct: 456 GLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDRTSLL 515
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
LPG Q QL++ VA ++GPVILV+MS G DI+FA+++ I AILW GYP R
Sbjct: 516 LPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPRRSRWRRPRR 575
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
LP+TWY + + +T M +RP S GYPGRTY+FY G T+Y FG GL
Sbjct: 576 HPLRIPQ--SWLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAFGDGL 633
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
SYT+F ++L+S + + V L + C + ++ +A+ C + F+ +
Sbjct: 634 SYTKFAHSLVSAPEQVAVQLAEGHACHTEHCFSVEAAGEHCGSL---------SFDVHLR 684
Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
+N G G V ++S PP+ + + K ++GF++V + G+ + F + CK L++V
Sbjct: 685 VRNAGGMAGGHTVFLFSSPPS-VHSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVV 743
Query: 761 DYAANTLLPAGEHTIFVGN 779
D N + G HT+ VG+
Sbjct: 744 DELGNRKVALGSHTLHVGD 762
>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
Length = 776
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/775 (43%), Positives = 493/775 (63%), Gaps = 41/775 (5%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
S+ P + CD S + FC++ LP S R +DLVSR+TLDEK+ QL + A +P
Sbjct: 25 STQPPYSCDSSSNSPY------YPFCNTRLPISKRAQDLVSRLTLDEKLAQLVNTAPAIP 78
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WWSEALHGV++ G G F+ I ATSFP VILT ASF+ +LW +I + +
Sbjct: 79 RLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGK 138
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVE 206
EARA+YN G+A G+T+W+PNINV RDPRWGR ET GEDP + +Y V YVRGLQ E
Sbjct: 139 EARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSFE 198
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
G + L R L+ S+CCKH+ AYD+D+WKG+DR+ +DARVT QD+ +T+ PF+ C++
Sbjct: 199 GGK----LGER-LQASACCKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQPPFQSCIE 253
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
+G AS +MC+YNRVNG+P+CA+ LL +T R +W GYI +DC ++ ++ D + A +
Sbjct: 254 QGRASGIMCAYNRVNGVPNCANFNLLTKTARQQWKFDGYITSDCGAVSIIHDEQGY-AKT 312
Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
EDA+A +AG+D++CG Y T +AV Q K+ + ID++L+ L+++ +RLG DG+P
Sbjct: 313 AEDAIADVFRAGMDVECGDYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLGLLDGNP 372
Query: 387 Q---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
+ ++G +CS ++++LA EAAR+GIVLLKN + LPL T+A++GP+ANA+
Sbjct: 373 TKLPFGTIGPDQVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANAS 431
Query: 444 VAM-IGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATI 501
+ +GNY G PC ++ + GF GYA T Y GCDD + I A E AK D +
Sbjct: 432 SKVFLGNYYGRPCNLVTLLQGFEGYAKDTVYHPGCDDGPQCAYAQIEGAVEVAKKVDYVV 491
Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
++ GLD S E ES DRE L LPG Q +LI VA +K PV+LV++ G VDI A+ +
Sbjct: 492 LVMGLDQSQERESHDREYLGLPGKQEELIKSVARASKRPVVLVLLCGGPVDITSAKFDDK 551
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
+ ILWAGYPGE GG A+A VVFG NPGG+LPITWY D+++ +P+T M +R + GY
Sbjct: 552 VGGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFIK-VPMTDMRMRADPASGY 610
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTK-TIQVNLNK----LQHCRNLNY--TS 674
PGRTY+FY GP +Y FGYGLSYT++ Y LLS + T+ +N + Q+ + Y S
Sbjct: 611 PGRTYRFYTGPKVYEFGYGLSYTKYSYKLLSLSHNTLHINQSSTHLTTQNSETIRYKLVS 670
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP-PAEIAATYIKQVIG 733
+ ++ C +L++ + N G+ G V+++ + +KQ++G
Sbjct: 671 ELAEETCQTMLLS---------IALGVTNHGNMAGKHPVLLFVRQGKVRNNGNPVKQLVG 721
Query: 734 FQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
FQ V + AG ++ F + C+ L++ + A + ++ G + + VG+ +PI +
Sbjct: 722 FQSVKLNAGETVQVGFELSPCEHLSVANEAGSMVIEEGSYLLLVGD--QEYPIEI 774
>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
expressed [Oryza sativa Japonica Group]
gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
Length = 771
Score = 652 bits (1683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/756 (45%), Positives = 464/756 (61%), Gaps = 29/756 (3%)
Query: 33 PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
P + C P S S + FCD+ LP + R DLVSR+T EKV QLGD A GVPRLG
Sbjct: 25 PPYSCGPRSPS------SGYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLG 78
Query: 93 LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
+P Y+WWSE LHG+S G G HF+ + TSFP V+LT A+F++ LW +IGQA+ TEAR
Sbjct: 79 VPPYKWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEAR 138
Query: 153 AMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
A+YNLG+A GLT WSPN+N+ RDPRWGR ETPGEDP +YAV +V+GLQ G
Sbjct: 139 ALYNLGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQ---GSTPG 195
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
T L+ S+CCKH AYD++ W GV RY+F+A+VT QD+ +TF PF+ CV + AS
Sbjct: 196 T------LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKAS 249
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
VMC+Y +NG+P+CA LL++T RG+W L GY+ +DCD++ ++ D ++ A + ED V
Sbjct: 250 CVMCAYTDINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRY-APTPEDTV 308
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---- 387
A +KAGLDL+CG Y A+QQGK++E+D+D++L L+ V MRLG FDG P+
Sbjct: 309 AVAIKAGLDLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAA 368
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y LG D+C+ + +LA EAA++GIVLLKND LPL+ A V++ AV+GP+AN A+
Sbjct: 369 YGHLGAADVCTQAHRDLALEAAQDGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALN 428
Query: 448 GNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
GNY G PC +P+ G Y ++V + GCD AC + A+ A ++D I+ GL
Sbjct: 429 GNYFGPPCETTTPLQGVQRYISSVRFLAGCDSPAC-GFAATGQAAALASSSDQVIMFMGL 487
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
E E LDR L LPG Q LI VA A+ PVILV+++ G VD+ FA+ N I AIL
Sbjct: 488 SQDQEKEGLDRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAIL 547
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
WAGYPG+ GG AIA V+FG NP GRLP+TWY ++ + +P+T M +R + GYPGR+Y
Sbjct: 548 WAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTR-IPMTDMRMRADPATGYPGRSY 606
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
+FY G +Y FGYGLSY++F L++ K + N N L +
Sbjct: 607 RFYQGNPVYKFGYGLSYSKFSRRLVAAAKPRRPNRNLLAGVIPKPAGDGGESYHVE--EI 664
Query: 687 NDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY--IKQVIGFQRVFVRAGR 743
+ C+ F V+ N G DG V+V+ + P A +Q++GF VRAG
Sbjct: 665 GEEGCERLKFPATVEVHNHGPMDGKHSVLVFVRWPNATAGASRPARQLVGFSSQHVRAGE 724
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
R+ N C+ L+ ++ G H + VG
Sbjct: 725 KARLTMEINPCEHLSRAREDGTKVIDRGSHFLKVGE 760
>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 830
Score = 652 bits (1683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/757 (46%), Positives = 462/757 (61%), Gaps = 29/757 (3%)
Query: 33 PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
P F C G LGL FC++ LP + R DLVSRMT EK QLGD A+GVPRLG
Sbjct: 84 PPFSCGGG--PSLGLP-----FCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLG 136
Query: 93 LPQYEWWSEALHGVSNVGPGTHFD-DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEA 151
+P Y+WW+EALHGV+ G G H D + ATSFP V+LT ASFN++LW +IGQA EA
Sbjct: 137 VPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEA 196
Query: 152 RAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
RA YN+G+A GLT WSPN+N+ RDPRWGR ETPGEDP V RYA +VRGLQ +
Sbjct: 197 RAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQG-----S 251
Query: 211 ATDLNSRP--LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+++ S P L S+CCKH AYD+++WKGV RY F A VT QD+ +TF PF CV +G
Sbjct: 252 SSNTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDG 311
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG-YIVADCDSIQVMVDNHKFLADSK 327
AS VMC+Y VNG+PSCA+ LL +T RG W L G Y+ ADCD++ +M N +F +
Sbjct: 312 KASCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIM-RNSQFYRPTA 370
Query: 328 EDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
ED VA TLKAGLD+DCG Y A+Q+GK+ + D+DK++K L+T MRLG FDG P+
Sbjct: 371 EDTVATTLKAGLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPK 430
Query: 388 ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
Y +LG IC+ E+ LA EAA +GIVLLKN LPL V + AV+G +AN +
Sbjct: 431 AHVYGNLGAAHICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVL 490
Query: 445 AMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
A++GNY G PC +P+ G GY NV + GC AC + AA+ A+ T+D+ I+
Sbjct: 491 ALLGNYWGPPCAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALAS-TSDSVILF 549
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
GL E+E DR L LPG Q LI VA AK PVILV+++ G VDI FA+ N I
Sbjct: 550 MGLSQEQESEGKDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIG 609
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AILWAGYPG+ GG AIA V+FG+ NP GRLP+TWY ++ + +P+T M +R S YPG
Sbjct: 610 AILWAGYPGQAGGLAIAKVLFGEKNPSGRLPVTWYPEEFTK-VPMTDMRMRSAGS--YPG 666
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
R+Y+FY G T+Y FGYGLSY++F + +++ N L + T D
Sbjct: 667 RSYRFYKGKTIYKFGYGLSYSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYHVDH 726
Query: 684 VLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
+ D C F V QN G DG +++ + P +Q++GFQ ++AG
Sbjct: 727 --IGDELCRQLKFLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAG 784
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++F + C+ + V ++ G H + VG
Sbjct: 785 EKAHLRFEVSPCEDFSRVRDDGRKVIDKGSHFLKVGK 821
>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 784
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/766 (44%), Positives = 486/766 (63%), Gaps = 41/766 (5%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
+ CDP + L F FC++S+ RV+DL+SR+T+ EK++QL + A V RLG+P
Sbjct: 20 YACDPDGPADL-----LFPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAANVSRLGIP 74
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
Y+WW E LHGV+ + P +F P ATSFP L+ S+N +LW KIGQ VSTE RAM
Sbjct: 75 PYQWWGEGLHGVA-ISPSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVVSTEGRAM 133
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN---A 211
YN GR+GLTYWSPNIN+ARDPRWGR ETPGEDP + YAV++V+GLQ+ + +N A
Sbjct: 134 YNQGRSGLTYWSPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDYDQNQPQA 193
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
R LK+S+CCKH+ A+D+D WK DR HFD++VT+QD+E+T+ F+ CVKEG +S
Sbjct: 194 VSRGPRRLKISACCKHFTAHDLDRWKDYDRDHFDSKVTQQDLEDTYNPSFKSCVKEGQSS 253
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
SVMCSYNR+NGIP C +LL TVR +W GYIV+DCD++ ++ D + A + EDAV
Sbjct: 254 SVMCSYNRLNGIPMCTHYELLTLTVRNQWGFDGYIVSDCDAVALIHDYINY-APTSEDAV 312
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---Y 388
+ + AG+DL+CG A+ + + E ID L+ L+ V MRLG FDG+P Y
Sbjct: 313 SYVMLAGMDLNCGSTTLVHGLAALDKKLIWEGLIDMHLRNLFRVRMRLGMFDGNPSTLPY 372
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
SLG +D+C+++N LA EAAR+ +VLLKN++N LP +AV+G HA+AT M+G
Sbjct: 373 GSLGPEDMCTEDNQHLALEAARQSLVLLKNEKNALPWKKTHGLKLAVIGHHADATREMLG 432
Query: 449 NYAGIPCRYMSPIAGFSGYAN-----VTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
NY G PC+++SP+ GF+ + ++++ GC D AC+ I+AA EAA ADA +++
Sbjct: 433 NYEGYPCKFVSPLQGFAKVLSDHSPRISHERGCSDAACEDQFYIYAAKEAAAQADAVVLV 492
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTNI 562
G+ + E E DR+ L LPG Q +L++ V E + G PV+LV++S +D++FA + I
Sbjct: 493 LGISQAQEKEGRDRDSLLLPGRQMELVSSVVEASAGRPVVLVLLSGSPLDVSFANDDPRI 552
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
++I+WAGYPG+ GG AIA+ +FG NPGGRL +WY +Y + +++M +RP S GYP
Sbjct: 553 QSIIWAGYPGQSGGEAIAEAIFGLVNPGGRLAQSWYYENYTN-IDMSNMNMRPNASTGYP 611
Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
GRTY+F+ L+ FG+GLSY+ FKY ++S ++I + Q C +SD +
Sbjct: 612 GRTYRFFTDTPLWEFGHGLSYSDFKYTMVSAPQSIMAPHLRYQLC-----SSDRA----- 661
Query: 683 GVLVNDLRCDDY---------FEFKVDFQNVGSTDGSDVVIVYSKPPAE-IAATYIKQVI 732
V+ +DL C Y F +V N G G V+++SKPP+ I +KQ++
Sbjct: 662 -VMTSDLNCLHYEKEACKESSFHVRVWVINHGPLSGDHSVLLFSKPPSRGIDGIPLKQLV 720
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
F+RV + AG + I F N C+ L V + GEHT+ VG
Sbjct: 721 SFERVHLEAGAGQEILFKVNPCEDLGTVGDDGIRTVELGEHTLMVG 766
>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
Length = 766
Score = 651 bits (1679), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/755 (43%), Positives = 483/755 (63%), Gaps = 36/755 (4%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
S++ FCD+SL R + LVS +TLDEK+ QL + A GVPRLG+P Y+WWSE+LHG+++
Sbjct: 32 SAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLADN 91
Query: 110 GPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
GPG +F + AT+FP VIL+TA+FN SLW+ + +AV+TEA M+N G+AGLTYW+PN
Sbjct: 92 GPGVNFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGLTYWAPN 151
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN+ RDPRWGR ET GEDP V Y++ YV+G Q +G E +++S+CCKHY
Sbjct: 152 INIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQGEEGR-------IRLSACCKHY 204
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AYD++ W+G RY F+A+V QD+E+T+ PF+ C++E AS +MC+YN+VNG+P CA+
Sbjct: 205 TAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCAN 264
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
LL +T R EW GYI +DCD++ ++ +N + S ED++A LKAG+D++CG +
Sbjct: 265 KDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSDEDSIAIVLKAGMDINCGSFLV 322
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDICSDENIELA 405
T +AV++GKV+E DID++L L++V +RLG FD + LG ++C+ E+ ELA
Sbjct: 323 RHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFDKPNNNQWSTQLGPNNVCTKEHRELA 382
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
AEA R+G VLLKND + LPL ++V+ VA++GP AN AM G+Y G+ C + + G
Sbjct: 383 AEAVRQGAVLLKNDHSFLPLKRSEVRHVAIIGPSANDVYAMGGDYTGVACNPTTFLKGIQ 442
Query: 466 GYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
YA T+ GC DV+C S A AAK AD +++AGL+L+ E E DR L LPG
Sbjct: 443 AYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADIVVVVAGLNLTEEREDFDRVSLLLPG 502
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LI+ VA VAK P++LV++ G VD++FA+ + I +ILW GYPGE GG+ + +++F
Sbjct: 503 KQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILF 562
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G++NPGG+L +TWY + +P+T M +R S GYPGRTY+FY G +Y FGYGLSY+
Sbjct: 563 GEYNPGGKLAMTWYPESFT-AIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGYGLSYS 621
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND----LRCDDY------ 694
++ Y++LS K I ++ + + L+ S R P + D ++ +D
Sbjct: 622 KYSYSILSAPKKITMSRSSV-----LDIIS-----RKPSYIRRDGLDFVKTEDIASCEAL 671
Query: 695 -FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
F V N GS DGS V+++++ + + IKQ++GF+RV AG ++ +
Sbjct: 672 AFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFERVHTAAGSASNVEISVDP 731
Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
CK ++ + +L G+H + VG+ I L
Sbjct: 732 CKHMSAANPEGKRVLLLGDHVLTVGDEEFELFIEL 766
>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
distachyon]
Length = 771
Score = 651 bits (1679), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/743 (43%), Positives = 473/743 (63%), Gaps = 32/743 (4%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ FCD+SLP+ +R + LVS +TLDEK+ QL + A GVPRLG+P YEWWSE+LHG+++ GP
Sbjct: 37 YPFCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLADNGP 96
Query: 112 GTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
G +F + AT FP VIL+ ASFN SLW+ + +AV+ EARAM+N G+AGLTYW+PNIN
Sbjct: 97 GVNFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQAGLTYWAPNIN 156
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
V RDPRWGR ETPGEDP V+ Y+V YV+G Q G D + +S+CCKHY A
Sbjct: 157 VFRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQGEYG-----DGKEGRMMLSACCKHYVA 211
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
YD++ W RY F+A+V EQD E+T+ PF+ C++EG AS +MCSYN+VNG+P+CA
Sbjct: 212 YDLEKWGNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPACARKD 271
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL Q VR EW GY+V+DCD++ ++ + +S ED++A LKAG+D++CG +
Sbjct: 272 LL-QKVRDEWGFQGYVVSDCDAVGIIYGYQNY-TNSDEDSIAIVLKAGMDINCGSFLIRH 329
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDICSDENIELAAE 407
T +A+Q+GK+ E DI+ +L L++V +RLG FD G+ + LG +IC+ E+ ELAAE
Sbjct: 330 TKSAIQKGKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQLGPSNICTKEHRELAAE 389
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY 467
AAR+G VLLKND + LPL ++V +A++GP AN M G+Y G+PC + + G
Sbjct: 390 AARQGTVLLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYTGVPCNPTTFLKGMQAV 449
Query: 468 A-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
T GC D++C S + A E AK AD +++AGL+L+ E E LDR L LPG Q
Sbjct: 450 VPQTTIAAGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQETEDLDRVSLLLPGKQ 509
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
LIN +A V K P++LVI G VD++FA+ + I ++LW GYPGE GG+ + +++FG+
Sbjct: 510 MDLINSIASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGYPGEVGGQVLPEILFGE 569
Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
+NPGG+LPITWY + +P+ M +R S YPGRTY+FY G +Y FGYGLSY+++
Sbjct: 570 YNPGGKLPITWYPESFT-AVPMNDMNMRADPSRSYPGRTYRFYTGDVVYGFGYGLSYSKY 628
Query: 647 KYNL------LSFTKTIQVNL--NKLQHCRN--LNYTSDASKTRCPGVLVNDLRCDDYFE 696
YN+ +S +++ V+ K H R L+Y C + F
Sbjct: 629 SYNIIQAPTKISLSRSSAVDFISTKRAHTRRDGLDYVQVEDIASCESI---------KFS 679
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+ N G+ DGS V+++++ + + +KQ++GF+R++ AG+ ++ + CK
Sbjct: 680 VHISVANDGAMDGSHAVLLFTRSKSSVPGFPLKQLVGFERLYAAAGKATNVEITVDPCKL 739
Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
++ + +L G H + VG+
Sbjct: 740 MSSANTEGRRVLLLGSHLLMVGD 762
>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 781
Score = 650 bits (1677), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/755 (45%), Positives = 470/755 (62%), Gaps = 24/755 (3%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
++ P F C P + + FCD++LP + R DLV+R+T EKV QLGD A GVP
Sbjct: 33 AADPPFSCGPSSTAA----TQGYAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVP 88
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P Y+WW+EALHG++ G G HF+ + ATSFP V LT A+F++ LW +IGQA+
Sbjct: 89 RLGVPAYKWWNEALHGLATSGKGLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGR 148
Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EARA+YN+G+A GLT WSPN+N+ RDPRWGR ETPGEDP RY V +V+GLQ
Sbjct: 149 EARALYNVGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQ----- 203
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+S L+ S+CCKH AYD+++W GV RY+FDARVT QD+E+T+ PF CV +G
Sbjct: 204 --GNSTSSSLLQTSACCKHATAYDLEDWGGVARYNFDARVTAQDLEDTYNPPFRSCVVDG 261
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
AS VMC+Y +NG+P+CA+ LL TVR +W L GY+ +DCD++ +M D ++ A + E
Sbjct: 262 KASCVMCAYTAINGVPACANSGLLTNTVRADWGLDGYVASDCDAVAIMRDAQRY-APTPE 320
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
DAVA LKAGLD+DCG Y A+QQGK+ E D+DK+LK L+ + MRLG FDG P+
Sbjct: 321 DAVALALKAGLDIDCGTYMQQHAPAALQQGKITEDDVDKALKNLFAIRMRLGHFDGDPRA 380
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y L IC+ E+ LA EAA++GIVLLKND LPL+ A + + AV+GP+AN
Sbjct: 381 NIYGGLNAAHICTPEHRSLALEAAQDGIVLLKNDAGILPLDRAAIASAAVIGPNANNPGL 440
Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
+IGNY G PC ++P+ G GY +V + GC AC ++ AA+ A ++D ++
Sbjct: 441 LIGNYFGPPCESVTPLKGVQGYVKDVRFMAGCGSAACDVADTDQAATLAG-SSDYVLLFM 499
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GL E+E DR L LPG Q LI VA+ AK PVILV+++ G VD+ FA+ N I A
Sbjct: 500 GLSQQQESEGRDRTSLLLPGQQQSLITAVADAAKRPVILVLLTGGPVDVTFAKNNPKIGA 559
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPG+ GG AIA V+FG NPGGRLP+TWY ++ + +P+T M +R + GYPGR
Sbjct: 560 ILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTK-VPMTDMRMRADPATGYPGR 618
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
+Y+FY G T+Y FGYGLSY+ + LLS N + L + ++ V
Sbjct: 619 SYRFYQGETVYKFGYGLSYSSYSRRLLSSGTP---NTDLLAGLSTMPTPAEEGGVASYHV 675
Query: 685 LVNDLRCDDYFEFK--VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
R + +F V+ +N G DG V++Y + A KQ+IGF+R ++AG
Sbjct: 676 EHIGARGCEQLKFPAVVEVENHGPMDGKHSVLMYLRWANATAGRPAKQLIGFRRQHLKAG 735
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
+ F + C+ + V N ++ G H + V
Sbjct: 736 EKASLTFDISPCEHFSRVRKDGNKVVDRGSHFLMV 770
>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 772
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/742 (43%), Positives = 482/742 (64%), Gaps = 22/742 (2%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ +S+ FCD SLP+ +R + LVS +TLDEK+ QL + A GVPRLG+P YEWWSE+LHG++
Sbjct: 34 EANSYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYEWWSESLHGLA 93
Query: 108 NVGPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
+ GPG +F + AT FP VIL+ A+FN SLW+ + +AV+ EARAM+N G+AGLTYW+
Sbjct: 94 DNGPGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHNAGQAGLTYWA 153
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
PNINV RDPRWGR ETPGEDP ++ Y+V YV+G Q G D + +S+CCK
Sbjct: 154 PNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQGEYG-----DGREGRMMLSACCK 208
Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
HY AYD++ W RY F+A V QD E+T+ PF+ C++EG AS +MCSYN+VNG+P+C
Sbjct: 209 HYIAYDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPAC 268
Query: 287 ADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
A LL Q +R EW GYIV+DCD++ ++ +N + + S ED+VA LKAG+D++CG +
Sbjct: 269 ARKDLL-QKIRDEWGFKGYIVSDCDAVAIIHENQTYTS-SDEDSVAIVLKAGMDVNCGSF 326
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIE 403
T +A+++GK++E DI+ +L L++V +RLG F+ + + + LG ++C+ E+ E
Sbjct: 327 LIRHTKSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKANENQWFTRLGPSNVCTKEHRE 386
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
LAAEA R+G VLLKND + LPL +KV +A++G AN M G+Y G+PC ++ + G
Sbjct: 387 LAAEAVRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDAYIMGGDYTGVPCDPITFLKG 446
Query: 464 FSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
+ T GC DV+C S + A EAAK AD +++AGL+L+ E+E LDR L L
Sbjct: 447 MQAFVPQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVVIAGLNLTQESEDLDRVTLLL 506
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
PG Q L+N +A V K P++LVI G VD+AFA+ + I ++LW GYPGE GG+ + ++
Sbjct: 507 PGRQQDLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRIASVLWIGYPGEVGGQVLPEI 566
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
+FG++NPGG+LP+TWY + +P+ M +R S GYPGRTY+FY G +Y FGYGLS
Sbjct: 567 LFGEYNPGGKLPMTWYPESFT-AVPMNDMNMRADPSRGYPGRTYRFYTGEVVYGFGYGLS 625
Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG---VLVNDL-RCDDY-FEF 697
Y+++ YN++ + I ++ + + + + TR G V V D+ C+ F
Sbjct: 626 YSKYSYNIVQAPQRISLSHSPVPGL----ISRKPAYTRRDGLDYVQVEDIASCESLVFSV 681
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
+ N G+ DGS V+++++ + + +KQ++GF+RV+ AG +K + + CK +
Sbjct: 682 HISVANDGAMDGSHAVLLFARSKSSVPGFPLKQLVGFERVYTAAGSSKNVAITVDPCKYM 741
Query: 758 NIVDYAANTLLPAGEHTIFVGN 779
+ + +L G H + VG+
Sbjct: 742 SAANTEGRRVLLLGSHHLMVGD 763
>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
Length = 779
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/755 (43%), Positives = 461/755 (61%), Gaps = 29/755 (3%)
Query: 32 SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
+P F C P K F FC+++LP R DLV+R+T EKV QLGD A GVPRL
Sbjct: 34 NPGFTCGPASAQK------GFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRL 87
Query: 92 GLPQYEWWSEALHGVSNVGPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
G+P Y+WWSEALHG++ G G HF + ATSFP VI T A+F++ LW +IGQA+ E
Sbjct: 88 GIPVYKWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKE 147
Query: 151 ARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
RA YNLG+A GL WSPN+N+ RDPRWGR ETPGEDP +Y +V+GLQ
Sbjct: 148 GRAFYNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ------ 201
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ L + L+ S+CCKH AYD++ WKGV RY+F+A+VT QD+ +T+ PF CV +G
Sbjct: 202 -GSSLTN--LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGK 258
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
AS +MC+Y +NG+P+CA LL +TVRGEW L GY +DCD++ ++ + F + E+
Sbjct: 259 ASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHF-TRTAEE 317
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
AVA LKAGLD++CG Y +A+QQGK+ E D+DK+LK L+ + MRLG FDG P+
Sbjct: 318 AVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGN 377
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y LG D+C+ + LA EAAR G+VLLKND LPL + V + AV+G +AN +A
Sbjct: 378 KLYGRLGAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVSSAAVIGHNANDILA 437
Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
++GNY G+PC +P G Y + + GC AC + A+ AK++D ++
Sbjct: 438 LLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALAKSSDYVFLVM 496
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GL E E LDR L LPG Q LI VA +K PVIL++++ G VDI FA+TN I A
Sbjct: 497 GLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGA 556
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPG+ GG+AIADV+FG+FNP G+LP+TWY ++ + +T M +RP + GYPGR
Sbjct: 557 ILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFT-MTDMRMRPDPATGYPGR 615
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
+Y+FY G T+Y FGYGLSY++F ++S + L R + R
Sbjct: 616 SYRFYKGKTVYKFGYGLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVYRVD- 674
Query: 684 VLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
+ D RC+ F V+ QN G DG V+++ + + ++Q+IGF+ ++ G
Sbjct: 675 -EIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVG 733
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
K++K + C+ L+ ++ G H + V
Sbjct: 734 EKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMV 768
>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
Length = 785
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/756 (44%), Positives = 480/756 (63%), Gaps = 35/756 (4%)
Query: 51 SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
S+ FC+ +L R KD+VSR+TLDEK+ QL + A +PRLG+ Y+WWSEALHGV++ G
Sbjct: 47 SYTFCNLNLTTIQRAKDIVSRLTLDEKLAQLVNTAPAIPRLGIHSYQWWSEALHGVADYG 106
Query: 111 PGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSP 167
G + I AT FP VILT ASF+ LW +I + + TEARA+YN G+A G+T+W+P
Sbjct: 107 KGIRLNGNVTIKAATIFPQVILTAASFDSKLWYRISKVIGTEARAVYNAGQAEGMTFWAP 166
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKVSSCC 225
NIN+ RDPRWGR ET GEDP V +YAV++VRGLQ EG + LN LK S+CC
Sbjct: 167 NINIFRDPRWGRGQETAGEDPLVSAKYAVSFVRGLQGDSFEGGK----LNEDRLKASACC 222
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+ AYD+DNWKGVDR+ FDA VT QD+ +T+ PF C+ +G +S +MC+YNRVNGIP+
Sbjct: 223 KHFTAYDLDNWKGVDRFDFDANVTLQDLADTYQPPFHSCIVQGRSSGIMCAYNRVNGIPN 282
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CAD LL T R +W+ +GYI +DC ++ ++ D + A + EDAVA L+AG+D++CG
Sbjct: 283 CADYNLLTNTARKKWNFNGYITSDCSAVDIIHDRQGY-AKAPEDAVADVLQAGMDVECGD 341
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENI 402
Y+T+ + +AV Q KV + ID++L L+++ +RLG FDG P +Y +G +CS +N+
Sbjct: 342 YFTSHSKSAVLQKKVPISQIDRALHNLFSIRIRLGLFDGHPTKLKYGKIGPNRVCSKQNL 401
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI-GNYAGIPCRYMSPI 461
+A EAAR GIVLLKN + LPL + ++ V+GP+AN++ ++ GNY G PC ++ +
Sbjct: 402 NIALEAARSGIVLLKNAASILPLPKS-TDSIVVIGPNANSSSQVVLGNYFGRPCNLVTIL 460
Query: 462 AGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
GF Y+ N+ Y GC D + I A E AK D +++ GLD S E+E DR+DL
Sbjct: 461 QGFENYSDNLLYHPGCSDGTKCVSAEIDRAVEVAKVVDYVVLVMGLDQSQESEGHDRDDL 520
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
LPG Q +LIN VA+ +K PVILV+ G VDI+FA+ + I ILWAGYPGE GG A+A
Sbjct: 521 ELPGKQQELINSVAKASKRPVILVLFCGGPVDISFAKVDDKIGGILWAGYPGELGGMALA 580
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
VVFG +NPGGRLP+TWY D+++ +P+T M +R S GYPGRTY+FY GP +Y FGYG
Sbjct: 581 QVVFGDYNPGGRLPMTWYPKDFIK-IPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYG 639
Query: 641 LSYTQFKYNLLSFTKTIQVNLNK------LQHCRNLNY--TSDASKTRCPGVLVNDLRCD 692
LSY+ + YN +S K +++N+ L+ + ++Y S+ K C + ++
Sbjct: 640 LSYSNYSYNFIS-VKNNNLHINQSTTYSILEKSQTIHYKLVSELGKKACKTMSIS----- 693
Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
+ N GS G V+++ KP +KQ++GF+ V V G + F +
Sbjct: 694 ----VTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVS 749
Query: 753 ACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
C+ L+ + + ++ G + VG S I L
Sbjct: 750 VCEHLSRANESGVKVIEEGGYLFLVGELEYSINITL 785
>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
sativa Japonica Group]
gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 853
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/755 (43%), Positives = 460/755 (60%), Gaps = 29/755 (3%)
Query: 32 SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
+P F C P K F FC+++LP R DLV+R+T EKV QLGD A GVPRL
Sbjct: 108 NPGFTCGPASAQK------GFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRL 161
Query: 92 GLPQYEWWSEALHGVSNVGPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
G+P Y+WWSEALHG++ G G HF + ATSFP VI T A+F++ LW +IGQA+ E
Sbjct: 162 GIPVYKWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKE 221
Query: 151 ARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
RA YNLG+A GL WSPN+N+ RDPRWGR ETPGEDP +Y +V+GLQ
Sbjct: 222 GRAFYNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ------ 275
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ L + L+ S+CCKH AYD++ WKGV RY+F+A+VT QD+ +T+ PF CV +G
Sbjct: 276 -GSSLTN--LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGK 332
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
AS +MC+Y +NG+P+CA LL +TVRGEW L GY +DCD++ ++ + F + E+
Sbjct: 333 ASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHF-TRTAEE 391
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
AVA LKAGLD++CG Y +A+QQGK+ E D+DK+LK L+ + MRLG FDG P+
Sbjct: 392 AVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGN 451
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y L D+C+ + LA EAAR G+VLLKND LPL + V + AV+G +AN +A
Sbjct: 452 KLYGRLSAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILA 511
Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
++GNY G+PC +P G Y + + GC AC + A+ AK++D ++
Sbjct: 512 LLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALAKSSDYVFLVM 570
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GL E E LDR L LPG Q LI VA +K PVIL++++ G VDI FA+TN I A
Sbjct: 571 GLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGA 630
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPG+ GG+AIADV+FG+FNP G+LP+TWY ++ + +T M +RP + GYPGR
Sbjct: 631 ILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTK-FTMTDMRMRPDPATGYPGR 689
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
+Y+FY G T+Y FGYGLSY++F ++S + L R + R
Sbjct: 690 SYRFYKGKTVYKFGYGLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVYRVD- 748
Query: 684 VLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
+ D RC+ F V+ QN G DG V+++ + + ++Q+IGF+ ++ G
Sbjct: 749 -EIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVG 807
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
K++K + C+ L+ ++ G H + V
Sbjct: 808 EKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMV 842
>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
Length = 771
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/756 (45%), Positives = 465/756 (61%), Gaps = 29/756 (3%)
Query: 33 PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
P + C P R LG + FCD+ LP + R DLVSR+T EKV QLGD A GV RLG
Sbjct: 25 PPYSCGP-RSPSLG-----YAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLG 78
Query: 93 LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
+P Y+WWSE LHG+S G G HF+ + TSFP V+LT A+F++ LW +IGQA+ TEAR
Sbjct: 79 VPPYKWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEAR 138
Query: 153 AMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
A+YNLG+A GLT WSPN+N+ RDPRWGR ETPGEDP +YAV +V+GLQ G
Sbjct: 139 ALYNLGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQ---GSTPG 195
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
T L+ S+CCKH AYD++ W GV RY+F+A+VT QD+ +TF PF+ CV + AS
Sbjct: 196 T------LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKAS 249
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
VMC+Y +NG+P+CA LL++T RG+W L GY+ +DCD++ ++ D ++ A + ED V
Sbjct: 250 CVMCAYTDINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRY-APTPEDTV 308
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---- 387
A +KAGLDL+CG Y A+QQGK++E+D+D++L L+ V MRLG FDG P+
Sbjct: 309 AVAIKAGLDLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAA 368
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y LG D+C+ + +LA EAA+ GIVLLKND LPL+ A V++ AV+GP+AN A+
Sbjct: 369 YGHLGAADVCTQAHRDLALEAAQNGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALN 428
Query: 448 GNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
GNY G PC +P+ G Y ++V + GCD AC + AA+ A+ ++D I+ GL
Sbjct: 429 GNYFGPPCETTTPLQGVQRYISSVRFLAGCDSPACGFAATGQAAALAS-SSDQVIMFMGL 487
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
E E LDR L LPG Q LI VA A+ PVILV+++ G VD+ FA+ N I AIL
Sbjct: 488 SQDQEKEGLDRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAIL 547
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
WAGYPG+ GG AIA V+FG NP GRLP+TWY ++ + +P+T M +R + GYPGR+Y
Sbjct: 548 WAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTR-IPMTDMRMRADPATGYPGRSY 606
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
+FY G +Y FGYGLSY++F L++ K + N N L +
Sbjct: 607 RFYQGNPVYKFGYGLSYSKFTRRLVAAAKPRRPNRNLLAGVIPKPAGDGGESYHVE--EI 664
Query: 687 NDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY--IKQVIGFQRVFVRAGR 743
+ C+ F V+ N G DG V+V+ + P A +Q++GF VRAG
Sbjct: 665 GEEGCERLKFPATVEVHNHGPMDGKHSVLVFVQWPNATAGASRPARQLVGFSSQHVRAGE 724
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
R+ N C+ L+ ++ G H + VG
Sbjct: 725 KARLTMEINPCEHLSRARDDGTKVIDRGSHFLKVGE 760
>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
Length = 779
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/755 (43%), Positives = 460/755 (60%), Gaps = 29/755 (3%)
Query: 32 SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
+P F C P K F FC+++LP R DLV+R+T EKV QLGD A GVPRL
Sbjct: 34 NPGFTCGPASAQK------GFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRL 87
Query: 92 GLPQYEWWSEALHGVSNVGPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
G+P Y+WWSEALHG++ G G HF + ATSFP VI T A+F++ LW +IGQA+ E
Sbjct: 88 GIPVYKWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKE 147
Query: 151 ARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
RA YNLG+A GL WSPN+N+ RDPRWGR ETPGEDP +Y +V+GLQ
Sbjct: 148 GRAFYNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ------ 201
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ L + L+ S+CCKH AYD++ WKGV RY+F+A+VT QD+ +T+ PF CV +G
Sbjct: 202 -GSSLTN--LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGK 258
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
AS +MC+Y +NG+P+CA LL +TVRGEW L GY +DCD++ ++ + F + E+
Sbjct: 259 ASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHF-TRTAEE 317
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
AVA LKAGLD++CG Y +A+QQGK+ E D+DK+LK L+ + MRLG FDG P+
Sbjct: 318 AVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGN 377
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y L D+C+ + LA EAAR G+VLLKND LPL + V + AV+G +AN +A
Sbjct: 378 KLYGRLSAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILA 437
Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
++GNY G+PC +P G Y + + GC AC + A+ AK++D ++
Sbjct: 438 LLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALAKSSDYVFLVM 496
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GL E E LDR L LPG Q LI VA +K PVIL++++ G VDI FA+TN I A
Sbjct: 497 GLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGA 556
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILWAGYPG+ GG+AIADV+FG+FNP G+LP+TWY ++ + +T M +RP + GYPGR
Sbjct: 557 ILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFT-MTDMRMRPDPATGYPGR 615
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
+Y+FY G T+Y FGYGLSY++F ++S + L R + R
Sbjct: 616 SYRFYKGKTVYKFGYGLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVYRVD- 674
Query: 684 VLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
+ D RC+ F V+ QN G DG V+++ + + ++Q+IGF+ ++ G
Sbjct: 675 -EIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVG 733
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
K++K + C+ L+ ++ G H + V
Sbjct: 734 EKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMV 768
>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 655
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 323/644 (50%), Positives = 420/644 (65%), Gaps = 21/644 (3%)
Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
MYN GRAGLT+WSPN+N+ RDPRWGR ETPGEDP V RYA YVRGLQ N
Sbjct: 1 MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH 60
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
N LK+++CCKH+ AYD+D W G DR+HF+A V QD+E+TF PF CV++G A+SV
Sbjct: 61 RNR--LKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASV 118
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MCSYN+VNG+P+CAD L T+RG W L GYIV+DCDS+ V + + + EDA A
Sbjct: 119 MCSYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHY-TRTPEDAAAA 177
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
TL+AGLDLDCG + + G+AV GKV + D+D +L TV MRLG FDG P +
Sbjct: 178 TLRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGR 237
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKN------DQNTLPLNSAKVKTVAVVGPHANATV 444
LG D+C+ E+ +LA +AAR+G+VLLKN +++ LPL A + VAVVGPHA+ATV
Sbjct: 238 LGPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATV 297
Query: 445 AMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
AMIGNYAG PCRY +P+ G + YA V ++ GC DVAC+ N I AA EAA+ ADAT+++
Sbjct: 298 AMIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVV 357
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
AGLD VEAE LDR L LPG Q +LI+ VA+ +KGPVILV+MS G +DIAFA+ + I
Sbjct: 358 AGLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRID 417
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
ILW GYPG+ GG+AIADV+FG NPG +LP+TWY+ DY+Q +P+T+M +R + GYPG
Sbjct: 418 GILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPG 477
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP- 682
RTY+FY GPT+YPFG+GLSYTQF + L + V L+ H + + P
Sbjct: 478 RTYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPV 537
Query: 683 -GVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP-----AEIAATYIKQVIGFQ 735
V V RC+ VD NVG DG+ V+VY P A A +Q++ F+
Sbjct: 538 RAVRVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFE 597
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+V V AG R++ C L++ D +P GEH + +G
Sbjct: 598 KVHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGE 641
>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
Length = 839
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 312/646 (48%), Positives = 436/646 (67%), Gaps = 24/646 (3%)
Query: 139 LWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNY 198
++ I VSTEARAM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +YAV Y
Sbjct: 204 MYNLIVLVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGY 263
Query: 199 VRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFL 258
V GLQD G +A LKV++CCKHY AYDVDNWKGV+RY FDA V++QD+++TF
Sbjct: 264 VTGLQDAGGGSDA-------LKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQ 316
Query: 259 RPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD 318
PF+ CV +G+ +SVMCSYN+VNG P+CAD LL+ +RG+W L+GYIV+DCDS+ V+ +
Sbjct: 317 PPFKSCVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYN 376
Query: 319 NHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMR 378
N + + EDA A T+K+GLDL+CG + T AVQ GK+ E+D+D+++ + VLMR
Sbjct: 377 NQHY-TKNPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMR 435
Query: 379 LGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
LGFFDG P+ + SLG +D+C+ N ELA EAAR+GIVLLKN LPL++ +K++AV
Sbjct: 436 LGFFDGDPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKN-TGALPLSAKSIKSMAV 494
Query: 436 VGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAA 494
+GP+ANA+ MIGNY G PC+Y +P+ G Y+ GC +V C N+ + AA++AA
Sbjct: 495 IGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAA 554
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
+AD T+++ G D SVE ESLDR L LPG Q QL++ VA ++GPVILV+MS G DI+
Sbjct: 555 ASADVTVLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDIS 614
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
FA+++ I AILW GYPGE GG A+AD++FG NPGGRLP+TWY + + +T M +R
Sbjct: 615 FAKSSDKISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMR 674
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YT 673
P S GYPGRTY+FY G T+Y FG GLSYT+F ++L+S + + V L + C + ++
Sbjct: 675 PDSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHACHTEHCFS 734
Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
+A+ C + F+ + +N G G V ++S PP+ + + K ++G
Sbjct: 735 VEAAGEHCGSL---------SFDVHLRVRNAGGMAGGHTVFLFSSPPS-VHSAPAKHLLG 784
Query: 734 FQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
F++V + G+ + F + CK L++VD N + G HT+ VG+
Sbjct: 785 FEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 830
Score = 123 bits (308), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 60/116 (51%), Positives = 77/116 (66%), Gaps = 5/116 (4%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +PVF CD + +S + FCD + + R DL+ R+TL EKV L + +P
Sbjct: 26 AQTPVFACDASNAT-----VSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALP 80
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQ 145
RLG+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN SL++ IG+
Sbjct: 81 RLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGE 136
>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
Length = 764
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 331/746 (44%), Positives = 451/746 (60%), Gaps = 29/746 (3%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
G Q FCD+ L R DLV+ +TL EKV QLGD A GV RLG+P YEWWSE LHG
Sbjct: 23 GQQQQPHRFCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHG 82
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTY 164
+S G G F+ + TSFP VILT A+F+ LW+++G+AV EARA+YNLG+A GLT
Sbjct: 83 LSIWGRGIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTI 142
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPN+N+ RDPRWGR ETPGEDP RYAV +V GLQ + G + S+C
Sbjct: 143 WSPNVNIFRDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------------EASAC 190
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
CKH AYD+D W V RY++D++VT QD+E+T+ PF+ CV EG A+ +MC YN +NG+P
Sbjct: 191 CKHATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVP 250
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+CA LL + VR EW ++GY+ +DCD++ + D H + S ED VA ++K G+D++CG
Sbjct: 251 ACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTL-SPEDTVAVSIKVGMDVNCG 309
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDE 400
Y AVQ+G + E DID++L L+ V MRLG FDG P+ Y LG D+CS
Sbjct: 310 NYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPA 369
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ LA EAA++GIVLLKND LPL + V ++AV+GP+A+ A+ GNY G PC +P
Sbjct: 370 HKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTP 429
Query: 461 IAGFSGYA--NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
+ G GY + GCD AC + AA+ A+ ++D ++ GL E + LDR
Sbjct: 430 LQGIKGYLGDRARFLAGCDSPACAVAATNEAAALAS-SSDHVVLFMGLSQKQEQDGLDRT 488
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
L LPG Q LI VA A+ PVILV+++ G VD+ FA+ N I AILWAGYPG+ GG A
Sbjct: 489 SLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLA 548
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
IA V+FG NP GRLP+TWY ++ + +P+T M +R + GYPGR+Y+FY G T+Y FG
Sbjct: 549 IAKVLFGDHNPSGRLPVTWYPEEFTK-VPMTDMRMRADPATGYPGRSYRFYQGNTVYNFG 607
Query: 639 YGLSYTQFKYNLL-SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL---RCDDY 694
YGLSY++F + SF+ + NL+ L D LV ++ RC
Sbjct: 608 YGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSS--YLVKEIGVERCSRL 665
Query: 695 -FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
F V+ QN G DG V++Y + P +Q+IGF+ V+ G + F +
Sbjct: 666 VFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSP 725
Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGN 779
C+ + V ++ G H + VG+
Sbjct: 726 CEHFSWVGEDGERVIDGGAHFLMVGD 751
>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
distachyon]
Length = 1026
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 318/649 (48%), Positives = 434/649 (66%), Gaps = 29/649 (4%)
Query: 13 LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
L + L++ +T A D P F C SS+ FCD LP R DL SR+
Sbjct: 12 LPLCLVLQATMATD------PPFSCG---------SPSSYPFCDRKLPIGQRAADLASRL 56
Query: 73 TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV---GPGTHFDD-VIPGATSFPTV 128
T++EKV LGD + GVPRLG+P Y+WWSEALHGV+N G FDD + ATSFP V
Sbjct: 57 TVEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVANAPADRAGVRFDDGPVRAATSFPQV 116
Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGED 187
++T ASFN LW +IGQ + EAR +YN G+A GLT+W+PNINV RDPRWGR ETPGED
Sbjct: 117 LVTAASFNPHLWYRIGQVIGREARGIYNSGQAEGLTFWAPNINVFRDPRWGRGQETPGED 176
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P + G+YA +VRG+Q G+ + +NS L+ S+CCKH+ AYD++NW GV R+ F+A+
Sbjct: 177 PTMTGKYAAVFVRGVQ---GYGASGAVNSSGLEASACCKHFTAYDLENWNGVTRFAFNAK 233
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V+EQD+ +T+ PF CV++G AS +MCSYNRVNG+P+CAD LL++T RG+W +GYI
Sbjct: 234 VSEQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCADHNLLSKTARGDWRFNGYIT 293
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
+DCD++ ++ D + A EDAVA LKAG+D++CG Y +A QGK+ E DID+
Sbjct: 294 SDCDAVAIIHDVQGY-AKEPEDAVADVLKAGMDVNCGDYVQKHGVSAFHQGKITEQDIDR 352
Query: 368 SLKYLYTVLMRLGFFDGSPQYV---SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
+L+ L+ + MRLG FDG+P+Y ++G +C E+ +LA EAA++GIVLLKND TLP
Sbjct: 353 ALQNLFAIRMRLGLFDGNPKYNRYGNIGADQVCKKEHQDLALEAAQDGIVLLKNDAGTLP 412
Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKS 483
L K+ ++AV+G +AN + GNY G PC +SP+ GY T + GC+ C
Sbjct: 413 LPKQKISSLAVIGHNANDAQRLQGNYFGPPCISVSPLQALQGYVRETKFVAGCNAAVCNV 472
Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
++ I A++AA A+ ++ GLD E E LDR +L LPG Q L+N VA+ AK PV+L
Sbjct: 473 SD-IAGAAKAASEAEYVVLFMGLDQDQEREDLDRIELGLPGMQESLVNAVADAAKKPVVL 531
Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
V++ G VD+ FA+ N I AI+WAGYPG+ GG AIA V+FG+ NPGGRLP+TWY +Y
Sbjct: 532 VLLCGGPVDVTFAKGNPKIGAIIWAGYPGQAGGIAIAQVLFGEHNPGGRLPVTWYPKEYA 591
Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
+ +T M +R S GYPGRTY+FY G T+Y FGYGLSY+++ ++ +S
Sbjct: 592 TAVAMTDMRMRADASTGYPGRTYRFYKGKTVYNFGYGLSYSKYSHSFVS 640
>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 769
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 328/750 (43%), Positives = 478/750 (63%), Gaps = 26/750 (3%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
S++ FCD+SL R + LVS +TLDEK+ QL + A GVPRLG+P Y+WWSE+LHG+++
Sbjct: 35 SAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLADN 94
Query: 110 GPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
GPG +F + AT FP VIL+TA+FN SLW+ + +AV+TEA M+N G+AGLTYW+PN
Sbjct: 95 GPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGLTYWAPN 154
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN+ RDPRWGR ET GEDP V Y++ YV+G Q + +++S+CCKHY
Sbjct: 155 INIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQ-------GEEGEEGRIRLSACCKHY 207
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AYD++ W+G RY F+A+V QD+E+T+ PF+ C++E AS +MC+YN+VNG+P CA
Sbjct: 208 TAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCAH 267
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
LL +T R EW GYI +DCD++ ++ +N + S ED++A LKAG+D++CG +
Sbjct: 268 KDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSGEDSIAIVLKAGMDINCGSFLV 325
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
T +A+++GK++E DID++L L++V +RLG FD + LG +C+ E+ ELA
Sbjct: 326 RHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCTKEHRELA 385
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
AEA R+G VLLKND N LPL ++V+ VA++GP AN AM G+Y G+PC + + G
Sbjct: 386 AEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPTTFLKGIQ 445
Query: 466 GYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
YA T + GC D +C S + A EAAK AD +++AGL+L+ E E DR L LPG
Sbjct: 446 AYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDRVSLLLPG 505
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LI+ +A VAK P++LV++ G VD++FA+ + I +ILW GYPGE GG+ + +++F
Sbjct: 506 KQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILF 565
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G++NPGG+LPITWY + +P+T M +R S GYPGRTY+FY G +Y FGYGLSY+
Sbjct: 566 GEYNPGGKLPITWYPESFT-AIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGYGLSYS 624
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTS-DASKTRCPG---VLVNDL-RCDDY-FEFK 698
++ Y++ S K I V+ + +L S + TR G V D+ C+ F
Sbjct: 625 KYSYSISSAPKKITVSRSS-----DLGIISRKPAYTRRDGLGSVKTEDIASCEALVFSVH 679
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
V N GS DGS V+++++ + + IKQ++GF+ V AG ++ + CK ++
Sbjct: 680 VAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASNVEITVDPCKQMS 739
Query: 759 IVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
+ +L G H + VG+ I L
Sbjct: 740 AANPEGKRVLLLGAHVLTVGDEEFELSIEL 769
>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
Length = 635
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 310/639 (48%), Positives = 426/639 (66%), Gaps = 26/639 (4%)
Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
Q VS EARAM+N G AGLTYWSPN+N+ RDPRWGR ETPGEDP VVG+YA +YVRGLQ
Sbjct: 2 QVVSDEARAMFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQG 61
Query: 205 VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMC 264
+G+ LKV++CCKH+ AYD+DNW GVDR+HF+A V++QDME+TF PF MC
Sbjct: 62 SDGNR---------LKVAACCKHFTAYDLDNWNGVDRFHFNAEVSKQDMEDTFDVPFRMC 112
Query: 265 VKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA 324
VKEG +SVMCSYN+VNGIP+CADP LL +TVRG + ++ ++ ++ L
Sbjct: 113 VKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGT------LFQTVTLLEFIMGSNTILQ 166
Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
++ +A LDLDCG + T +AV++G + E +I+ +L TV MRLG FDG
Sbjct: 167 PRRKQPRMLLKQASLDLDCGPFLGQHTEDAVKKGLLNEAEINNALLNTLTVQMRLGMFDG 226
Query: 385 SPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
P Y +LG D+C+ + ELA EAAR+GIVLLKN +LPL++ + +VA+VGP++N
Sbjct: 227 EPSSQLYGNLGPNDVCTPAHQELALEAARQGIVLLKNHGPSLPLSTRRHLSVAIVGPNSN 286
Query: 442 ATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
T MIGNYAG+ C Y +P+ G YA ++ GC DVAC S+ AA +AA+ ADAT+
Sbjct: 287 VTATMIGNYAGLACGYTTPLQGIQRYAQTIHRQGCADVACVSDQQFSAAIDAARQADATV 346
Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
++ GLD S+EAE DR L LPG Q +L+++VA +KGP ILV+MS G +D++FAE +
Sbjct: 347 LVMGLDQSIEAEFRDRTGLLLPGRQQELVSKVAAASKGPTILVLMSGGPIDVSFAENDPK 406
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
I +I+WAGYPG+ GG AI+DV+FG NPGG+LP+TWY DY+ LP+T+M +R S GY
Sbjct: 407 IGSIVWAGYPGQAGGAAISDVLFGITNPGGKLPMTWYPQDYITNLPMTNMAMRSSKSKGY 466
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
PGRTY+FY G +YPFG+G+SYT F + + S + V L+ +H N T R
Sbjct: 467 PGRTYRFYKGKVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRHGSG-NATISGKAIR- 524
Query: 682 PGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
V RC+ +VD +N GS DG+ ++VYS+PPA A + KQ++ F++V V
Sbjct: 525 ----VTHARCNRLSLGMQVDVKNTGSMDGTHTLLVYSRPPARHWAPH-KQLVAFEKVHVA 579
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
AG +R+ + CKSL++VD + +P GEH++ +G+
Sbjct: 580 AGTQQRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGD 618
>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
Length = 750
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 337/771 (43%), Positives = 466/771 (60%), Gaps = 46/771 (5%)
Query: 19 VFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKV 78
VF ++AV +S P+F C P S+ ++ FCD SLP + R DLVSR+T+ EKV
Sbjct: 7 VFFSSAV----ASDPLFSCGPSSPSR------AYPFCDRSLPAARRAADLVSRLTVAEKV 56
Query: 79 QQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNES 138
QLGD A GVPRLG+P Y+WWSE LHG++ G G F+ + G TSFP V+LTTASF++
Sbjct: 57 SQLGDEAAGVPRLGVPPYKWWSEGLHGLAFWGHGMRFNGTVTGVTSFPQVLLTTASFDDG 116
Query: 139 LWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
LW +IGQA+ EARA+YNLG+A GLT WSPN+N+ RDPRWGR ETPGEDP V +YAV
Sbjct: 117 LWFRIGQAIGREARALYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVA 176
Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETF 257
+VRG+Q ++ + PL+ S+CCKH AYD+++W GV RY+FDARVT QD+ +TF
Sbjct: 177 FVRGIQ-----GSSAAGAAAPLQASACCKHATAYDLEDWNGVARYNFDARVTAQDLADTF 231
Query: 258 LRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV 317
PF+ CV +G A+ VMC+Y +NG+P+CA LL +T RG W GY+ +DCD++ +M
Sbjct: 232 NPPFQSCVVDGKATCVMCAYTGINGVPACASSDLLTKTFRGAWGHDGYVSSDCDAVAIMH 291
Query: 318 DNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLM 377
D +++ + ED VA LK A+QQGK+ E D+DK+L L+ V M
Sbjct: 292 DAQRYVP-TPEDTVAVALK------------EHGMAAIQQGKMTEKDVDKALTNLFAVRM 338
Query: 378 RLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
RLG FDG P+ Y LG D+C+ ++ LA EAA++GIVLLKND LPL+ + + +
Sbjct: 339 RLGHFDGDPRGNALYGHLGAADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAMGSA 398
Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASE 492
AV+G +AN + + GNY G C +P+ G Y +NV + GC AC + A+
Sbjct: 399 AVIGHNANDALVLRGNYFGPACETTTPLQGVQSYVSNVRFLAGCSSAAC-GYAATGQAAA 457
Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
A +++ + GL E E LDR L LPG Q LI VA AK PVILV+++ G VD
Sbjct: 458 LASSSEYVFLFMGLSQDQEKEGLDRTSLLLPGKQQSLITAVASAAKRPVILVLLTGGPVD 517
Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMP 612
I FA++N I AILWAGYPG+ GG AIA V+FG NP GRLP+TWY ++ + +P+T M
Sbjct: 518 ITFAQSNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEFTK-VPMTDMR 576
Query: 613 LRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
+R + GYPGR+Y+FY G T+Y FGYGLSY++F L++ K +L L
Sbjct: 577 MRADPANGYPGRSYRFYRGNTIYKFGYGLSYSKFSRQLVTGGKNQLASL--LAGLSATTK 634
Query: 673 TSDASKTRCPGVLVNDLRCDD----YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
DA+ V+D+ D F +V+ QN G DG V+++ + P +
Sbjct: 635 DDDATSY----YHVDDIGADGCEQLRFPAEVEVQNHGPMDGKHSVLMFLRWPNATDGRPV 690
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
Q+IGF ++AG ++F C+ + ++ G H + VG
Sbjct: 691 SQLIGFTSQHIKAGEKANVRFDVRPCEHFSRARADGKKVIDRGSHFLMVGK 741
>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
[Vitis vinifera]
Length = 464
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 293/458 (63%), Positives = 356/458 (77%), Gaps = 2/458 (0%)
Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
MYNLG AGLT+WSPNINV RD RWGR ET EDPF+VG +AVNYVRGLQDVEG EN TD
Sbjct: 1 MYNLGHAGLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQDVEGTENVTD 60
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
LNSRPLKVSSCCKHYAAYD+D+W +DR+ FDARV+EQDM+ETF+ PFE CV+EGD SSV
Sbjct: 61 LNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 120
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MCS+N++NGIP C+DP+LL +R EWDLHGYIV+DC ++V+VDN +L DSK DAVA+
Sbjct: 121 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 180
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
TL+AGLDL+CG YYT+ V GKV + ++D++LK +Y +LMR+G+FDG P Y SLG
Sbjct: 181 TLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGL 240
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
+DIC+ ++IELA EAAR+GIVLLKND PL K +A+VGPHANAT MIGNYAG+
Sbjct: 241 KDICAADHIELAREAARQGIVLLKNDYEVFPLKPG--KKLALVGPHANATEVMIGNYAGL 298
Query: 454 PCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
P +Y+SP+ FS NVTY TGC D +C ++ A EAAK+A+ TII G DLS+EAE
Sbjct: 299 PRKYVSPLEAFSAIGNVTYTTGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSIEAE 358
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
+DR D LPG QT+LI QVAEV+ GPVILV++S +DI FA+ N I AILW G+PGE
Sbjct: 359 FVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGE 418
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
+GG AIADVVFGK+NPGGRLP+TWY DYV L M
Sbjct: 419 QGGHAIADVVFGKYNPGGRLPVTWYEADYVACLETHIM 456
>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
var. deliciosa]
Length = 634
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 310/636 (48%), Positives = 426/636 (66%), Gaps = 25/636 (3%)
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAMYN G AGLT+WSPN+N+ RDPRWGR ETPGEDP + G YA +YVRGLQ +G
Sbjct: 2 EARAMYNGGMAGLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQGNDGER 61
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
LKV++CCKHY AYD+DNW+GVDR+HF+ARV++QD+++TF PF CV G
Sbjct: 62 ---------LKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQDIKDTFEIPFRECVLGGK 112
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNGIP+CA+PKLL T+RG W L+GYIV+DCDS+ V +N + + E+
Sbjct: 113 VASVMCSYNQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCDSVGVFFENQHYTS-KPEE 171
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--- 386
AVA +KAGLDLDCG + T AV++G V + +I+ +L T MRLG FDG P
Sbjct: 172 AVAAAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTAQMRLGMFDGEPSAH 231
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
QY +LG +D+C+ + +LA EAAR+GIVLL+N +LPL+ + +TVAV+GP+++ TV M
Sbjct: 232 QYGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTM 291
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
IGNYAG+ C Y +P+ G Y ++ GC DV C N AA AA+ ADAT+++ GL
Sbjct: 292 IGNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGL 351
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
D S+EAE +DR LPG+Q +L+++VA ++GP ILV+MS G +D+ FA+ + I AI+
Sbjct: 352 DQSIEAEFVDRAGPLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAII 411
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
W GYPG+ GG AIADV+FG NPGG+LP+TWY +YV LP+T M +R + GYPGRTY
Sbjct: 412 WVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTY 471
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
+FY GP ++PFG GLSYT F +NL + V L L+ N S A V V
Sbjct: 472 RFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKA-------VRV 524
Query: 687 NDLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGR 743
+ C+ + VD +N GS DG+ ++V++ PP + AA+ KQ++GF ++ + AG
Sbjct: 525 SHADCNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWAAS--KQLVGFHKIHIAAGS 582
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
R++ + CK L++VD +P GEH + +G+
Sbjct: 583 ETRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 618
>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Glycine max]
Length = 613
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 305/558 (54%), Positives = 401/558 (71%), Gaps = 16/558 (2%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CD G+ + + + FCD SL RVKDLV R+TL EK+ L + A V RLG+P
Sbjct: 30 FACDVGKSPAV----AGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLGIP 85
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
+YEWWSEALHGVSNVG GT F +V+PGATSFP ILT ASFN SL++ IG+ VSTEA AM
Sbjct: 86 RYEWWSEALHGVSNVGLGTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAGAM 145
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
YN+G AGLTYWSPNIN+ RDPRWGR ETPGEDP + +YA YV+GLQ +G +
Sbjct: 146 YNVGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGD----- 200
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKHY AYDVD WKG+ RY F+A +T+QD+E+TF PF+ CV +G+ +SVM
Sbjct: 201 -PNKLKVAACCKHYTAYDVDKWKGIQRYTFNAVLTKQDLEDTFQPPFKSCVIDGNVASVM 259
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNG P+CADP LL VRGEW L+GY+V+DCDS++V+ ++ + E+A A +
Sbjct: 260 CSYNKVNGKPTCADPDLLKGVVRGEWKLNGYMVSDCDSVEVLY-KYQHYTKTPEEAAAIS 318
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
+ AGLDL+CG++ +T AV+QG + E+ I+ ++ + LMRLGFFDG P+ Y +L
Sbjct: 319 ILAGLDLNCGRFLGQYTEGAVKQGLIDES-INNAVSNNFATLMRLGFFDGDPRKQPYGNL 377
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +D+C+ N ELA EAAR+GIV LKN +LPLN+ +K++AV+GP+ANAT MIGNY
Sbjct: 378 GPKDVCTPANQELAREAARQGIVSLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYE 437
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
GIPC+Y+SP+ G + + +Y GC DV C N + A + + + DAT+I+ G L++E
Sbjct: 438 GIPCKYISPLQGLTAFVPTSYAAGCLDVRC-PNPVLDDAKKISASGDATVIVVGASLAIE 496
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
AESLDR ++ LPG Q L+ +VA +KGPVILVIMS GG+D++FA+ N I +ILW GYP
Sbjct: 497 AESLDRVNILLPGQQQLLVTEVANASKGPVILVIMSGGGMDVSFAKDNNKITSILWVGYP 556
Query: 572 GEEGGRAIADVVFGKFNP 589
GE GG AIADV+FG NP
Sbjct: 557 GEAGGAAIADVIFGFHNP 574
>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
[Oryza sativa Japonica Group]
Length = 793
Score = 619 bits (1597), Expect = e-174, Method: Compositional matrix adjust.
Identities = 331/774 (42%), Positives = 451/774 (58%), Gaps = 57/774 (7%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
G Q FCD+ L R DLV+ +TL EKV QLGD A GV RLG+P YEWWSE LHG
Sbjct: 24 GQQQQPHRFCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHG 83
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTY 164
+S G G F+ + TSFP VILT A+F+ LW+++G+AV EARA+YNLG+A GLT
Sbjct: 84 LSIWGRGIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTI 143
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPN+N+ RDPRWGR ETPGEDP RYAV +V GLQ + G + S+C
Sbjct: 144 WSPNVNIFRDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------------EASAC 191
Query: 225 CKHYAAYDVDNWKGVDRYHFDAR----------------------------VTEQDMEET 256
CKH AYD+D W V RY++D++ VT QD+E+T
Sbjct: 192 CKHATAYDLDYWNNVVRYNYDSKDGASTGKSGETSSQVEKKHGPYEKGYFAVTLQDLEDT 251
Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
+ PF+ CV EG A+ +MC YN +NG+P+CA LL + VR EW ++GY+ +DCD++ +
Sbjct: 252 YNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATI 311
Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVL 376
D H + S ED VA ++K G+D++CG Y AVQ+G + E DID++L L+ V
Sbjct: 312 RDAHHYTL-SPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVR 370
Query: 377 MRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
MRLG FDG P+ Y LG D+CS + LA EAA++GIVLLKND LPL + V +
Sbjct: 371 MRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTS 430
Query: 433 VAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA--NVTYKTGCDDVACKSNNSIFAA 490
+AV+GP+A+ A+ GNY G PC +P+ G GY + GCD AC + AA
Sbjct: 431 LAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVAATNEAA 490
Query: 491 SEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
+ A+ ++D ++ GL E + LDR L LPG Q LI VA A+ PVILV+++ G
Sbjct: 491 ALAS-SSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGP 549
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
VD+ FA+ N I AILWAGYPG+ GG AIA V+FG NP GRLP+TWY ++ + +P+T
Sbjct: 550 VDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTK-VPMTD 608
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL-SFTKTIQVNLNKLQHCRN 669
M +R + GYPGR+Y+FY G T+Y FGYGLSY++F + SF+ + NL+ L
Sbjct: 609 MRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMA 668
Query: 670 LNYTSDASKTRCPGVLVNDL---RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAA 725
D LV ++ RC F V+ QN G DG V++Y + P
Sbjct: 669 RRAGDDGGGMSS--YLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGG 726
Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+Q+IGF+ V+ G + F + C+ + V ++ G H + VG+
Sbjct: 727 RPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFLMVGD 780
>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
Group]
Length = 883
Score = 616 bits (1588), Expect = e-173, Method: Compositional matrix adjust.
Identities = 330/653 (50%), Positives = 430/653 (65%), Gaps = 28/653 (4%)
Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
QAVS E RAMYN G+AGLT+WSPN+N+ RDPRWGR ETPGEDP V RYA YVRGLQ
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQ 286
Query: 205 VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMC 264
+ +S LK+++CCKH+ AYD+DNW G DR+HF+A VT QD+E+TF PF C
Sbjct: 287 QQ-------PSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSC 339
Query: 265 VKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA 324
V +G A+SVMCSYN+VNG+P+CAD L T+R W L GYIV+DCDS+ V + +
Sbjct: 340 VVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHY-T 398
Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
++EDAVA TL+AGLDLDCG + +T AV QGKV + DID ++ TV MRLG FDG
Sbjct: 399 RTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDG 458
Query: 385 SPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK-TVAVVGPHA 440
P + LG Q +C+ + ELA EAAR+GIVLLKND LPL+ A + VAVVGPHA
Sbjct: 459 DPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHA 518
Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACK-SNNSIFAASEAAKTAD 498
ATVAMIGNYAG PCRY +P+ G + YA ++ GC DVAC S I AA +AA+ AD
Sbjct: 519 EATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRAD 578
Query: 499 ATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
ATI++AGLD +EAE LDR L LPG Q +LI+ VA+ +KGPVILV+MS G +DI FA+
Sbjct: 579 ATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQN 638
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
+ I ILWAGYPG+ GG+AIADV+FG NPGG+LP+TWY DY+Q +P+T+M +R +
Sbjct: 639 DPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPA 698
Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL----NKLQHCRNLNYTS 674
GYPGRTY+FY GPT++PFG+GLSYT F +++ + V L +LN T+
Sbjct: 699 KGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASASASLNATA 758
Query: 675 DASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVY-------SKPPAEIAAT 726
S+ V V RC++ VD +NVG DG+ V+VY + A
Sbjct: 759 RLSRAAA--VRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGA 816
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++Q++ F++V V AG R++ + C L++ D +P GEH + +G
Sbjct: 817 PVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGE 869
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/113 (53%), Positives = 74/113 (65%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
G ++ FC SLP R +DLV+R+T EKV+ L + A GVPRLG+ YEWWSEALHG
Sbjct: 35 GGPAATLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHG 94
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
VS+ GPG F PGAT+FP VI T ASFN +LW+ IGQ S+ + LG
Sbjct: 95 VSDTGPGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRSSLSSMDKTLG 147
>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
Length = 885
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 330/655 (50%), Positives = 430/655 (65%), Gaps = 30/655 (4%)
Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
QAVS E RAMYN G+AGLT+WSPN+N+ RDPRWGR ETPGEDP V RYA YVRGLQ
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQ 286
Query: 205 VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMC 264
+ +S LK+++CCKH+ AYD+DNW G DR+HF+A VT QD+E+TF PF C
Sbjct: 287 QQ-------PSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSC 339
Query: 265 VKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA 324
V +G A+SVMCSYN+VNG+P+CAD L T+R W L GYIV+DCDS+ V + +
Sbjct: 340 VVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHY-T 398
Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
++EDAVA TL+AGLDLDCG + +T AV QGKV + DID ++ TV MRLG FDG
Sbjct: 399 RTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDG 458
Query: 385 SPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK-TVAVVGPHA 440
P + LG Q +C+ + ELA EAAR+GIVLLKND LPL+ A + VAVVGPHA
Sbjct: 459 DPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHA 518
Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACK-SNNSIFAASEAAKTAD 498
ATVAMIGNYAG PCRY +P+ G + YA ++ GC DVAC S I AA +AA+ AD
Sbjct: 519 EATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRAD 578
Query: 499 ATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
ATI++AGLD +EAE LDR L LPG Q +LI+ VA+ +KGPVILV+MS G +DI FA+
Sbjct: 579 ATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQN 638
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
+ I ILWAGYPG+ GG+AIADV+FG NPGG+LP+TWY DY+Q +P+T+M +R +
Sbjct: 639 DPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPA 698
Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL------NKLQHCRNLNY 672
GYPGRTY+FY GPT++PFG+GLSYT F +++ + V L +LN
Sbjct: 699 KGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLAAHHAAASASASASLNA 758
Query: 673 TSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVY-------SKPPAEIA 724
T+ S+ V V RC++ VD +NVG DG+ V+VY + A
Sbjct: 759 TARLSRAAA--VRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGH 816
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++Q++ F++V V AG R++ + C L++ D +P GEH + +G
Sbjct: 817 GAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGE 871
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 61/113 (53%), Positives = 74/113 (65%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
G ++ FC SLP R +DLV+RMT EKV+ L + A GVPRLG+ YEWWSEALHG
Sbjct: 35 GGPAATLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHG 94
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
VS+ GPG F PGAT+FP VI T ASFN +LW+ IGQ S+ + LG
Sbjct: 95 VSDTGPGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRSSLSSMDKTLG 147
>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
Length = 771
Score = 602 bits (1553), Expect = e-169, Method: Compositional matrix adjust.
Identities = 316/748 (42%), Positives = 460/748 (61%), Gaps = 39/748 (5%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR---------LGLPQYEWWS 100
S++ FC+++LP+ R + LVS +TLDEK+ QL G P +G+P +
Sbjct: 36 SAYPFCNATLPFPARARALVSLLTLDEKIAQLLQHRRGRPPPRRPALRVVVGVPSTASAT 95
Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
S GP + AT FP VIL+ A+FN SLW+ +A++ EARAM+N G+A
Sbjct: 96 TGPGSTSPRGP-------VRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQA 148
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
GLT+W+PNINV RDPRWGR ETPGEDP VV Y+V YV+G Q G E +
Sbjct: 149 GLTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MM 201
Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
+S+CCKHY AYD++ W+G RY F+A+V QDME+T+ PF+ C++EG AS +MCSYN+V
Sbjct: 202 LSACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQV 261
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
NG+P+CA +L Q R EW GYI +DCD++ ++ +N + A S ED++A LKAG+D
Sbjct: 262 NGVPACARKDIL-QRARDEWGFQGYITSDCDAVAIIHENQTYTA-SDEDSIAVVLKAGMD 319
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDIC 397
++CG + T +A+++GKV+E DI+ +L L++V +RLGFFD + + + LG ++C
Sbjct: 320 INCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVC 379
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
+ E+ ELAAEA R+G VLLKND LPL ++V +A++GP AN + G+Y G+PC
Sbjct: 380 TTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHS 439
Query: 458 MSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLD 516
+ + G Y T+ GC DV C S + A EAAK AD +++AGL+L+ E E D
Sbjct: 440 TTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHD 499
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
R L LPG Q LI+ VA V K PV+LV+M G VD++FA+ + I +ILW GYPGE GG
Sbjct: 500 RVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGG 559
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
+ +++FGK+NPGG+LPITWY + +P+ M +R S GYPGRTY+FY G +Y
Sbjct: 560 NVLPEILFGKYNPGGKLPITWYPESFT-AVPMDDMNMRADASRGYPGRTYRFYTGDVVYG 618
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG---VLVNDLRCDD 693
FGYGLSY+++ Y++L K I ++ + + + + TR G V V D+ +
Sbjct: 619 FGYGLSYSKYSYSILQAPKKISLSRSSVPDL----ISRKPAYTRRDGVDYVQVEDIASCE 674
Query: 694 YFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
+F V N G+ DGS V++++ + IKQ++GF+RV AGR+ ++
Sbjct: 675 ALQFPVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITV 734
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK ++ + +L G H + VG+
Sbjct: 735 DPCKLMSFANTEGTRVLFLGTHVLMVGD 762
>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
Length = 900
Score = 602 bits (1551), Expect = e-169, Method: Compositional matrix adjust.
Identities = 336/771 (43%), Positives = 471/771 (61%), Gaps = 47/771 (6%)
Query: 29 GSSSPVFVCD--PGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAH 86
GS +P CD PG+ S FC+++L Y R++DL+SR+ + L + A
Sbjct: 167 GSPTPR-TCDVEPGK---------SLPFCNTALSYDDRIRDLISRINDSDLPGLLVNSAT 216
Query: 87 GVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQA 146
GV L LP Y+WWSEALHGV + PG HF +P ATSFP VI T A+FN++L++KIG
Sbjct: 217 GVEHLNLPAYQWWSEALHGVGH-SPGVHFGGDVPAATSFPQVIHTGATFNKTLYRKIGTV 275
Query: 147 VSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
+STEARAM N+ RAG T+W+PNIN+ RDPRWGR ETPGEDPF G YA N+V G QD E
Sbjct: 276 ISTEARAMNNVQRAGNTFWAPNINIIRDPRWGRGQETPGEDPFATGEYAANFVSGFQDGE 335
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
D+N +K SSCCKH+ Y+++NW GVDR+H++A T+QD+ +T+L FE CV+
Sbjct: 336 ------DMNY--IKASSCCKHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVR 387
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
G AS +MCSYN VNG+PSCA+ ++ R W GYI +DC ++ ++++HKF ++
Sbjct: 388 YGRASGLMCSYNAVNGVPSCANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNT 447
Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--G 384
E + L+AG+D DCG + + A+Q+G V ++ +L L+ V RLG FD
Sbjct: 448 SE-TIRAVLEAGMDTDCGSFVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLFDPVS 506
Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
Y + + + N +LA EAA++GIVLLKN LPL + VA++GP+A+AT
Sbjct: 507 KQPYTNYSVARVNTPANQQLALEAAQQGIVLLKNTNARLPLKTGL--HVALIGPNADATT 564
Query: 445 AMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
M GNY G +SP+ GF Y A VTY GCD VACK + AA AAK ADA +++
Sbjct: 565 VMQGNYQGTAPFLISPVRGFKNYSAAVTYAKGCD-VACKDTSGFDAAVAAAKEADAVVVV 623
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
GLD E+E DR + LPG+Q L+ QVA AK P+++ +M+ G VD++ + N N+
Sbjct: 624 VGLDQGQESEGHDRTSITLPGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVA 683
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
ILW GYPG+ GG+A+ADVVFG +PGGRLP T Y G YV + +RP + G PG
Sbjct: 684 GILWCGYPGQSGGQAMADVVFGAVSPGGRLPYTIYPGSYVDACSMLDNGMRPNKTSGNPG 743
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
RTY+FY G +Y +G GLSYT F Y+ + + T+ +L +Q Y DA +
Sbjct: 744 RTYRFYTGKPVYEYGTGLSYTSFSYH-IHYLNTMDTSLATVQ-----TYVQDAKQNH--- 794
Query: 684 VLVNDLRCD--DYFEFKVDFQNVGSTDGSDVVIVYSKP--PAEIAATYIKQVIGFQRVFV 739
+R D ++ +V+ NVG G+DVV V+ +P PAE+ A IK +IGF+RVF+
Sbjct: 795 ---KFIRYDAPEFTRVEVNVTNVGRVAGADVVQVFVEPKTPAELGAP-IKTLIGFERVFL 850
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG-VSFPIHLN 789
G+ ++F NA L VD + + AGE + +G+ ++FP+H+N
Sbjct: 851 NPGQWTIVQFSVNA-HDLTFVDASGKRVARAGEWLVHIGHDSRLTFPVHVN 900
>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
Length = 738
Score = 598 bits (1543), Expect = e-168, Method: Compositional matrix adjust.
Identities = 311/740 (42%), Positives = 451/740 (60%), Gaps = 56/740 (7%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
S++ FC+++LP+ R + LVS +TLDEK+ QL + A G PRLG+P +EWWSE+LHGV +
Sbjct: 36 SAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGVCDN 95
Query: 110 GPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
GPG +F + AT FP VIL+ A+FN SLW+ +A++ EARAM+N G+AGLT+W+PN
Sbjct: 96 GPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLTFWAPN 155
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
INV RDPRWGR ETPGEDP VV Y+V YV+G Q G E + +S+CCKHY
Sbjct: 156 INVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MMLSACCKHY 208
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AYD++ W+G RY F+A+V NG+P+CA
Sbjct: 209 IAYDLEKWRGFTRYTFNAKV--------------------------------NGVPACAR 236
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
+L Q R EW GYI +DCD++ ++ +N + A S ED++A LKAG+D++CG +
Sbjct: 237 KDIL-QRARDEWGFQGYITSDCDAVAIIHENQTYTA-SDEDSIAVVLKAGMDINCGSFLI 294
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
T +A+++GKV+E DI+ +L L++V +RLGFFD + + + LG ++C+ E+ ELA
Sbjct: 295 RHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELA 354
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
AEA R+G VLLKND LPL ++V +A++GP AN + G+Y G+PC + + G
Sbjct: 355 AEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQ 414
Query: 466 GYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
Y T+ GC DV C S + A EAAK AD +++AGL+L+ E E DR L LPG
Sbjct: 415 AYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPG 474
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LI+ VA V K PV+LV+M G VD++FA+ + I +ILW GYPGE GG + +++F
Sbjct: 475 RQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILF 534
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
GK+NPGG+LPITWY + +P+ M +R S GYPGRTY+FY G +Y FGYGLSY+
Sbjct: 535 GKYNPGGKLPITWYPESFTA-VPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSYS 593
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG---VLVNDLRCDDYFEFKVDF 701
++ Y++L K I ++ + + + + TR G V V D+ + +F V
Sbjct: 594 KYSYSILQAPKKISLSRSSVPDL----ISRKPAYTRRDGVDYVQVEDIASCEALQFPVHI 649
Query: 702 --QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNI 759
N G+ DGS V++++ + IKQ++GF+RV AGR+ ++ + CK ++
Sbjct: 650 SVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSF 709
Query: 760 VDYAANTLLPAGEHTIFVGN 779
+ +L G H + VG+
Sbjct: 710 ANTEGTRVLFLGTHVLMVGD 729
>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
Length = 685
Score = 595 bits (1535), Expect = e-167, Method: Compositional matrix adjust.
Identities = 310/692 (44%), Positives = 440/692 (63%), Gaps = 30/692 (4%)
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNIN 170
G + IP ATSFP VILT ASF+ LW +I + + TEAR +YN G+A G+ +W+PNIN
Sbjct: 2 GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
+ RDPRWGR ET GEDP V +Y V+YVRGLQ + E + R LK S+CCKH+ A
Sbjct: 62 IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQG-DSFEGGKLIGGR-LKASACCKHFTA 119
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
YD++NWKGV+RY FDA+VT QD+ +T+ F CV +G +S +MC+YNRVNG+P+CAD
Sbjct: 120 YDLENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYN 179
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL T R +W+ +GYI +DCD+++ + + + A + ED VA L+AG+DL+CG Y T
Sbjct: 180 LLTNTARKKWNFNGYIASDCDAVRFIYEKQGY-AKTPEDVVADVLRAGMDLECGNYMTKH 238
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAE 407
+AV Q K+ + ID++L L+T+ +RLG FDG+P QY +G +CS EN++LA E
Sbjct: 239 AKSAVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALE 298
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN-ATVAMIGNYAGIPCRYMSPIAGFSG 466
AAR GIVLLKN + LPL +V T+ V+GP+AN +++ ++GNY G PC+ +S + GF
Sbjct: 299 AARSGIVLLKNTASILPL--PRVNTLGVIGPNANKSSIVLLGNYIGPPCKNVSILKGFYT 356
Query: 467 YANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
YA+ T Y +GC D ++ I A E AK +D I++ GLD S E E+LDR+ L LPG
Sbjct: 357 YASQTHYHSGCTDGTKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGK 416
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +LIN VA+ +K PVILV++ G VDI FA+ N I I+WAGYPGE GGRA+A VVFG
Sbjct: 417 QQKLINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFG 476
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
+NPGGRLP+TWY D+++ +P+T M +R S GYPGRTY+FY GP +Y FGYGLSY+
Sbjct: 477 DYNPGGRLPMTWYPKDFIK-IPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSN 535
Query: 646 FKYNLLSFTKTIQVNLNK------LQHCRNLNY--TSDASKTRCPGVLVNDLRCDDYFEF 697
+ YN +S K +++N+ L++ +NY S+ + C + ++
Sbjct: 536 YSYNFIS-VKNNNLHINQSTTYSILENSETINYKLVSELGEETCKTMSIS---------V 585
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
+ N GS G V+++ KP +KQ++GF+ V V G + F + C+ L
Sbjct: 586 TLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHL 645
Query: 758 NIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
+ + + ++ G + VG S I L+
Sbjct: 646 SRANESGVKVIEEGGYLFLVGQEEYSINIMLD 677
>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
Length = 753
Score = 592 bits (1527), Expect = e-166, Method: Compositional matrix adjust.
Identities = 318/746 (42%), Positives = 439/746 (58%), Gaps = 40/746 (5%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
G Q FCD+ L R DLV+ +TL EKV QLGD A GV RLG+P YEWWSE LHG
Sbjct: 23 GQQQQPHRFCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHG 82
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTY 164
+S G G F+ + TSFP VILT A+F+ LW+++G+AV EARA+YNLG+A GLT
Sbjct: 83 LSIWGRGIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTI 142
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPN+N+ RDP R PG+ R + G Q + G + S+C
Sbjct: 143 WSPNVNIFRDPSGTR----PGD-----ARRGPRH--GEQGIGG------------EASAC 179
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
CKH AYD+D W V RY++D++VT QD+E+T+ PF+ CV EG A+ +MC YN +NG+P
Sbjct: 180 CKHATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVP 239
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+CA LL + VR EW ++GY+ +DCD++ + D H + S ED VA ++K G+D++CG
Sbjct: 240 ACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTL-SPEDTVAVSIKVGMDVNCG 298
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDE 400
Y AVQ+G + E DID++L L+ V MRLG FDG P+ Y LG D+CS
Sbjct: 299 NYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPA 358
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ LA EAA++GIVLLKND LPL + V ++AV+GP+A+ A+ GNY G PC +P
Sbjct: 359 HKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTP 418
Query: 461 IAGFSGYA--NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
+ G GY + GCD AC + + AA+ A ++D ++ GL E + LDR
Sbjct: 419 LQGIKGYLGDRARFLAGCDSPACAVDATNEAAA-LASSSDHVVLFMGLSQKQEQDGLDRT 477
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
L LPG Q LI VA A+ PVILV+++ G VD+ FA+ N I AILWAGYPG+ GG A
Sbjct: 478 SLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLA 537
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
IA V+FG NP GRLP+TWY ++ + +P+T M +R + GYPGR+Y+FY G T+Y FG
Sbjct: 538 IAKVLFGDHNPSGRLPVTWYPEEFTK-VPMTDMRMRADPATGYPGRSYRFYQGNTVYNFG 596
Query: 639 YGLSYTQFKYNLL-SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL---RCDDY 694
YGLSY++F + SF+ + NL+ L D LV ++ RC
Sbjct: 597 YGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSS--YLVKEIGVERCSRL 654
Query: 695 -FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
F V+ QN G DG V++Y + P +Q+IGF+ V+ G + F +
Sbjct: 655 VFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSP 714
Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGN 779
C+ + V ++ G H + VG+
Sbjct: 715 CEHFSWVGEDGERVIDGGAHFLMVGD 740
>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
Length = 633
Score = 587 bits (1513), Expect = e-165, Method: Compositional matrix adjust.
Identities = 296/642 (46%), Positives = 424/642 (66%), Gaps = 24/642 (3%)
Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGL 202
IG+ VSTE RAMYN+G+AGLTYWSPN+N+ RDPRWGR ET GEDP + RY V YV+GL
Sbjct: 2 IGKVVSTEGRAMYNVGQAGLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKGL 61
Query: 203 QDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
Q + D LKV+SCCKHY AYDVD+WKG+ RY+F+A+VT+QD+++TF PF+
Sbjct: 62 QQRD------DGKKDMLKVASCCKHYTAYDVDDWKGIQRYNFNAKVTQQDLDDTFNPPFK 115
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
CV +G+ +SVMCSYN+V+G P+C D LL +RG+W L+GYIV DCDS+ M +
Sbjct: 116 SCVLDGNVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNGYIVTDCDSLNEMYWAQHY 175
Query: 323 LADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
+ E+ A +L AGL L+CG + +T AV QG V E+ ID+++ + LMRLGFF
Sbjct: 176 -TKTPEETAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVIDRAVTNNFATLMRLGFF 234
Query: 383 DGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
DG+P+ Y +LG +DIC++++ ELA EAAR+GIVLLKN +LPL+ +K++AV+GP+
Sbjct: 235 DGNPKNQLYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGSLPLSPKSIKSLAVIGPN 294
Query: 440 ANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
AN M+G+Y G PC+Y +P+ G + Y+ GCD +AC + + A + A ADA
Sbjct: 295 ANLAYTMVGSYEGSPCKYTTPLDGLGASVSTVYQQGCD-IAC-ATAQVDNAKKVAAAADA 352
Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
+++ G D ++E ES DR ++ LPG Q+ L+ +VA V+KGPVILVIMS GG+D+ FA N
Sbjct: 353 VVLVMGSDQTIERESKDRFNITLPGQQSLLVTEVASVSKGPVILVIMSGGGMDVKFAVDN 412
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
+ +ILW G+PGE GG A+ADVVFG NPGGRLP+TWY YV + +T+M +R
Sbjct: 413 PKVTSILWVGFPGEAGGAALADVVFGYHNPGGRLPMTWYPQSYVDKVDMTNMNMRADPKT 472
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
G+PGR+Y+FY GPT++ FG GLSYTQ+K++L+ K + + L + CR+ T
Sbjct: 473 GFPGRSYRFYKGPTVFNFGDGLSYTQYKHHLVKAPKFVSIPLEEGHACRS---------T 523
Query: 680 RCPGV-LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
+C + VN+ C++ + + QNVG GS V++++ PP+ A K ++ FQ++
Sbjct: 524 KCKSIDAVNEQGCNNLGLDIHLKVQNVGKMRGSHTVLLFTSPPSVHNAPQ-KHLLDFQKI 582
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ +KF + CK L++VD N + G H + +G+
Sbjct: 583 HLTPQSEGVVKFNLDVCKHLSVVDEVGNRKVALGLHVLHIGD 624
>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
Length = 751
Score = 573 bits (1478), Expect = e-161, Method: Compositional matrix adjust.
Identities = 306/739 (41%), Positives = 444/739 (60%), Gaps = 59/739 (7%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
++SS FCD SLP RV DLV+R+ L++ V L + A P + +P YEWW+EALHGV+
Sbjct: 28 KVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWNEALHGVA 87
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
+ PG F + ATSFP V+ T ASFN +L+ +I +A+STEARA YN AGLT+W+P
Sbjct: 88 -LSPGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNAGLTFWTP 146
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD--VEGHENATDLNSRPLKVSSCC 225
N+N+ RDPRWGR ETPGEDP++ G YAV +VRGLQ +EGHEN D ++ LK+SSCC
Sbjct: 147 NVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEAMEGHENKDD--NKFLKISSCC 204
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH++AY + V R+ DA VT+QD +T+ FE CVK G SS+MCSYN VNGIPS
Sbjct: 205 KHFSAYSQE----VPRHRNDAIVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPS 260
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CAD LL VR +W GYI +DC+++ ++ H F S E A TL AG+DL+CG+
Sbjct: 261 CADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHF-TQSPEQTCATTLDAGMDLNCGE 319
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD-GSPQYVSLGKQDICSDENIEL 404
+ +A++QG V + +LK + V+MRLG F+ G+ + ++ K + + + +L
Sbjct: 320 FLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMFEKGTQPFSNITKDAVDTAAHRQL 379
Query: 405 AAEAAREGIVLLKNDQNTLPLNS---AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
A EAAR+ +VLLKN+ NTLPL + +K ++A++GPH NA+ A++GNY GIP ++P+
Sbjct: 380 ALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIGPHFNASTALLGNYFGIPSHIVTPL 439
Query: 462 AGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
G S Y NV Y GC V+ + A E K AD ++ GLD S E E +DR L
Sbjct: 440 KGVSSYVPNVAYSLGCK-VSGEVLPDFDEAIEVVKKADRVVVFMGLDQSQEREEIDRYHL 498
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
LPG+Q L+N++ A P++LV++S G VD++ + + + AI++ GY G+ GG+A+A
Sbjct: 499 KLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLYKNHPKVGAIVFGGYLGQAGGQALA 558
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
D++FGK++P GRL T+Y+ DYV +P+ M +RP G PGRTY+F++G +Y FG+G
Sbjct: 559 DMLFGKYSPAGRLTQTFYDSDYVNTMPIYDMHMRPTFVTGNPGRTYRFFSGAPVYEFGFG 618
Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
LSYT F + CR+ C FE V
Sbjct: 619 LSYTTFH-----------------KACRS---------------------CVASFEITV- 639
Query: 701 FQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRV-FVRAGRNKRIKFVFNACKSLN 758
N+G +G D +++Y++PP A ++ ++ F+R V G+ F A K+
Sbjct: 640 -TNLGDVEGEDAILIYAEPPHAGEGGRPLRSLVAFERTALVTTGKTATADFCLEA-KAFA 697
Query: 759 IVDYAANTLLPAGEHTIFV 777
+ + + ++ G TI V
Sbjct: 698 LANAEGSWVVEQGNWTIHV 716
>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
Length = 776
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 309/773 (39%), Positives = 452/773 (58%), Gaps = 56/773 (7%)
Query: 31 SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
++P + C P S + FC+ SLP S R L+S +TL +K+ QL + A +
Sbjct: 26 TTPDYPCKPPH--------SHYPFCNISLPISTRTTSLISLLTLSDKINQLSNTASSISH 77
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
LG+P Y+WWSEALHG++ GPG +F+ + AT+FP VI++ A+FN SLW IG AV E
Sbjct: 78 LGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAAFNRSLWFLIGYAVGVE 137
Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE- 209
RAM+N+G+AGL++W+PN+NV RDPRWGR ETPGEDP V YAV +VRG+Q V+G +
Sbjct: 138 GRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVEFVRGIQGVDGIKK 197
Query: 210 --NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
N D + L VS+CCKH+ AYD++ W RY+F+A V T+ PF CV++
Sbjct: 198 VLNDHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVVN------TYQPPFRGCVQQ 251
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY-IVADCDSIQVMVDNHKFLADS 326
G AS +MCSYN VNG+P+CA LL VR +W G I+ + ++ + K + +
Sbjct: 252 GKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGVGILPQTVMLWLLFLSIKSMQNL 310
Query: 327 KEDAVAQTLKA-----------GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTV 375
+ + LK +D++CG + T +A++QG VKE D+D++L L++V
Sbjct: 311 PKMLLLMFLKQVFFYVFENLWFCMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSV 370
Query: 376 LMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
MRLG F+G P+ + LG QD+C+ E+ +LA EAAR+GIVLLKND LPL+ +
Sbjct: 371 QMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVS 430
Query: 433 VAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN-VTYKTGCDDVACKSNNSIFAAS 491
+A++GP A T + G Y+GIPC S G Y ++Y GC DV C S++ A
Sbjct: 431 LAIIGPMA-TTSELGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAI 489
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
+ AK AD +I+AGLD ++E E LDR L LPG Q L+++VA +K PVILV+ G +
Sbjct: 490 DIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPL 549
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
D++FAE+N I +ILW GYP + F+ GRLP+TWY + +P+ M
Sbjct: 550 DVSFAESNQLITSILWIGYPVD-------------FDAAGRLPMTWYPESFTN-VPMNDM 595
Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC---R 668
+R S GYPGRTY+FY G +Y FG+GLSY+ F Y +LS +++L+K + R
Sbjct: 596 GMRADPSRGYPGRTYRFYTGSRIYGFGHGLSYSDFSYRVLSAPS--KLSLSKTTNGGLRR 653
Query: 669 NLNYTSDASKTRCPGVLVNDLR-CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
+L + V V++L+ C+ F + NVG DGS VV+++SK P I +
Sbjct: 654 SLLNKVEKDVFEVDHVHVDELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNIQGS 713
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
Q++G R+ + ++ + + C+ + D +LP G H + VG+
Sbjct: 714 PESQLVGPSRLHTVSNKSIETSILADPCEHFSFADEQGKRILPLGNHILNVGD 766
>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 694
Score = 558 bits (1439), Expect = e-156, Method: Compositional matrix adjust.
Identities = 283/623 (45%), Positives = 396/623 (63%), Gaps = 30/623 (4%)
Query: 171 VARDPRWGRI--------TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
V + P GR+ +ETPGEDP + +YAV YV GLQD A + LKV+
Sbjct: 79 VNKQPALGRLGIPAYEWWSETPGEDPLLASKYAVGYVTGLQDA----GAGGVTDGALKVA 134
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF PF+ CV +G+ +SVMCSYN+VNG
Sbjct: 135 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNG 194
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
P+CAD LL +RG+W L+GYIV+DCDS+ V+ + + E+A A T+K+GLDL+
Sbjct: 195 KPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEEAAAITIKSGLDLN 253
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSD 399
CG + T AVQ G++ E D+D+++ + +LMRLGFFDG P+ + SLG +D+C+
Sbjct: 254 CGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTS 313
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
N ELA E AR+GIVLLKN LPL++ +K++AV+GP+ANA+ MIGNY G PC+Y +
Sbjct: 314 SNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 372
Query: 460 PIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
P+ G N Y+ GC +V C N+ + A AA +AD T+++ G D S+E ESLDR
Sbjct: 373 PLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRT 432
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
L LPG QTQL++ VA + GPVILV+MS G DI+FA+ + I AILW GYPGE GG A
Sbjct: 433 SLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAA 492
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+AD++FG NP GRLP+TWY Y + +T M +RP S GYPGRTY+FY G T++ FG
Sbjct: 493 LADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFG 552
Query: 639 YGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FE 696
GLSYT+ ++L+S + + + L + CR C V CDD F+
Sbjct: 553 DGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASVEAAGDHCDDLAFD 603
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
K+ +N G G+ V+++S PP A K ++GF++V + G + F + C+
Sbjct: 604 VKLQVRNAGEVAGAHSVLLFSSPPPAHNAP-AKHLLGFEKVSLAPGEAGTVAFRVDVCRD 662
Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
L++VD + G HT+ VG+
Sbjct: 663 LSVVDELGGRKVALGGHTLHVGD 685
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 43/72 (59%), Gaps = 5/72 (6%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +PVF CD + ++++ FC+ S R +DLVSR+TL EKV L + +
Sbjct: 32 AQAPVFACDASNAT-----LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALG 86
Query: 90 RLGLPQYEWWSE 101
RLG+P YEWWSE
Sbjct: 87 RLGIPAYEWWSE 98
>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
Length = 709
Score = 546 bits (1406), Expect = e-152, Method: Compositional matrix adjust.
Identities = 294/720 (40%), Positives = 421/720 (58%), Gaps = 55/720 (7%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R ++R+ LD+ V L + A P + +P YEWW+EALHGV+ + PG F I AT
Sbjct: 7 RSLHCLTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVA-LSPGVTFKGSITAAT 65
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITET 183
SFP V+ T ASFN SL+ +I +STEARA +N AGLT+W+PN+N+ RDPRWGR ET
Sbjct: 66 SFPQVLSTAASFNRSLFYQIADVISTEARAFHNAKDAGLTFWTPNVNIFRDPRWGRGQET 125
Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYH 243
PGEDP++ G YAV +VRGLQ EG E NS+ LK+SSCCKH++AY + V R+
Sbjct: 126 PGEDPYLTGEYAVAFVRGLQG-EGMEGREVENSKFLKISSCCKHFSAYSQE----VPRHR 180
Query: 244 FDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLH 303
+A VT+QD +T+ FE CVK G SS+MCSYN VNGIPSCAD LL VRG+W
Sbjct: 181 NNAMVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPSCADKGLLTDLVRGQWKFD 240
Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
GYI +DC+++ ++D+H + S E A TL AG+DL+CG++ A++QG V
Sbjct: 241 GYIASDCEAVADVIDHHHY-TQSPEQTCATTLDAGMDLNCGEFLRQHLPKALEQGIVTTE 299
Query: 364 DIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTL 423
I +LK + VLMRLG F+ + ++ K + + + +LA EAAR+ IVLLKND NTL
Sbjct: 300 MIHNALKNQFRVLMRLGMFEKVEPFANITKDSVDTTMHRQLALEAARQSIVLLKNDGNTL 359
Query: 424 PLNS---AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDV 479
PL + + +++A++GPH NA+ A++GNY GIP ++P+ G S + NV + GC V
Sbjct: 360 PLATKDFTRDRSLALIGPHFNASAALLGNYFGIPSHIVTPLEGISQFVPNVAHSLGCK-V 418
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
+ + A AK AD I+ GLD S E E +DR + LP +Q+ L+ +V EVA
Sbjct: 419 SGEVLPDFDDAIAVAKKADRLIVFVGLDQSQEREEIDRYHIGLPAFQSTLLKRVLEVASH 478
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
P++ V++S G VD++ + + + AI++ GY G+ GG+A+ADV+FGK+NP G+LP T+Y+
Sbjct: 479 PIVFVVISGGCVDLSAYKNHPKVGAIVFGGYLGQAGGQALADVLFGKYNPSGKLPQTFYD 538
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
+YV + + M +RP G GRTY+F+ G +Y FG+GLSYT F N + T
Sbjct: 539 SEYVNAMSIYDMHMRPTPVTGNSGRTYRFFTGVPVYEFGFGLSYTTFHKNCHACVAT--- 595
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
F + N G+ G DV++ Y +P
Sbjct: 596 -------------------------------------FNITVTNAGAISGEDVILTYVEP 618
Query: 720 P-AEIAATYIKQVIGFQRV-FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
P A +K ++ F+R + AG+ K A K+ + + A N ++ G TI V
Sbjct: 619 PLAGEGGRPLKSLVAFERTPLIAAGQRATAKICLEA-KAFALANEAGNWVVEPGNWTIHV 677
>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 525
Score = 541 bits (1394), Expect = e-151, Method: Compositional matrix adjust.
Identities = 268/505 (53%), Positives = 355/505 (70%), Gaps = 15/505 (2%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +PVF CD + ++++ FC+ S R +DLVSR+TL EKV L + +
Sbjct: 32 AQAPVFACDASNAT-----LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALG 86
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN SL++ IG+ VST
Sbjct: 87 RLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVST 146
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +YAV YV GLQD G
Sbjct: 147 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDA-GAG 205
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
TD LKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF PF+ CV +G+
Sbjct: 206 GVTD---GALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGN 262
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNG P+CAD LL +RG+W L+GYIV+DCDS+ V+ + + E+
Sbjct: 263 VASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEE 321
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A A T+K+GLDL+CG + T AVQ G++ E D+D+++ + +LMRLGFFDG P+
Sbjct: 322 AAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQL 381
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ SLG +D+C+ N ELA E AR+GIVLLKN LPL++ +K++AV+GP+ANA+ M
Sbjct: 382 AFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTM 440
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
IGNY G PC+Y +P+ G N Y+ GC +V C N+ + A AA +AD T+++ G
Sbjct: 441 IGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVG 500
Query: 506 LDLSVEAESLDREDLWLPGYQTQLI 530
D S+E ESLDR L LPG QTQL+
Sbjct: 501 ADQSIERESLDRTSLLLPGQQTQLV 525
>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
Length = 690
Score = 540 bits (1391), Expect = e-150, Method: Compositional matrix adjust.
Identities = 304/725 (41%), Positives = 418/725 (57%), Gaps = 56/725 (7%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R + LV+ +TL EK+ +G A V RL +P+Y+WWSEALHGV+ PG F + P AT
Sbjct: 4 RARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVA-ASPGVVFQEPTPFAT 62
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITET 183
+FP V LT SF++ L+ I +STEAR M N RA LTYWSPN+NV RDPRWGR ET
Sbjct: 63 AFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERANLTYWSPNVNVYRDPRWGRGQET 122
Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYH 243
PGEDPF+V YAV +VRGLQ+ E + R LKVS+CCKHY+AYD++NW GV+R+
Sbjct: 123 PGEDPFLVATYAVEFVRGLQEGE--------DPRYLKVSACCKHYSAYDLENWHGVERFE 174
Query: 244 FDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLH 303
FDA V+++DM +TF PFE CVK+G SS+MCSYN +NGIP+CAD +LL T RG W
Sbjct: 175 FDAIVSDRDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPACADRELLYGTARGGWGFE 234
Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
GYI +DC +I ++ NH + D+ A+ ++A DLDCG +Y ++V+ G++KE
Sbjct: 235 GYITSDCGAIDTIIYNHHYTNDTDTTAML-GVRATCDLDCGGFYQQHILHSVESGRLKEA 293
Query: 364 DIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
++D +L L+ V MRLG FD Q Y G + + E+ +A AAREGI LLKN +
Sbjct: 294 EVDDALANLFKVQMRLGLFDPVEQQVYTHYGLDKLNTKEHQAMALRAAREGIALLKNQND 353
Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVAC 481
LPL S K K V V+GP+A M+GNY GIP ++ +A G NV CD V
Sbjct: 354 FLPL-SLKDKHVVVMGPYAEDAGVMLGNYNGIP-EFIVTVA--QGLRNV-----CDHVDV 404
Query: 482 KSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPV 541
+ + E D ++ GL+ +E E LDREDL LP Q L++ + PV
Sbjct: 405 VKSLEALSKLEG---VDLIVVTVGLNQEIEREGLDREDLLLPASQRALLDGLLAQTDVPV 461
Query: 542 ILVIMSAGG-VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNG 600
+L ++S GG VDI+ E N ++ +L GY G GG+AIA+V+ G NP GRL T Y
Sbjct: 462 VLTLLSGGGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIAEVIVGDVNPSGRLVNTMYYN 521
Query: 601 DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
DYV L M +RP + G+PGRTY+F+ GP ++PFG+GLSYT F + V
Sbjct: 522 DYVTNLDYFDMNMRPKEETGFPGRTYRFFAGPVIHPFGFGLSYTTFAH---------AVE 572
Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
+ ++++ R L + L D Y V N GS G + V+++ K P
Sbjct: 573 IGQMRNHR----------------LRSALAIDVY----VKVTNTGSRQGDESVLLFVKSP 612
Query: 721 AEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
Y +K + F RV + G + + FV + L++ + A +L GE + V
Sbjct: 613 LAGKQGYPLKSLADFSRVSLAPGETQTVHFVLGE-EQLHLANEQAKYVLLRGEWKVEVEE 671
Query: 780 GGVSF 784
F
Sbjct: 672 ASARF 676
>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
Length = 779
Score = 537 bits (1384), Expect = e-150, Method: Compositional matrix adjust.
Identities = 311/801 (38%), Positives = 449/801 (56%), Gaps = 64/801 (7%)
Query: 10 CFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
C ++++A LV + A C+ L FC+ +L + R DLV
Sbjct: 8 CITIAVAALVVAPTA--------RALTCEDAALRNLP-------FCNPNLAWEQRADDLV 52
Query: 70 SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVI 129
R+TL EK+ Q G A GV RLG+ YEWWSEALHGV+ PG +F P +T FP +I
Sbjct: 53 GRLTLQEKISQFGTTAPGVARLGVNAYEWWSEALHGVAE-SPGVNFTGNTPVSTCFPQII 111
Query: 130 --------LTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
A+FN + Q +STEARA N G AGLTY++PNIN+ RDPRWGR
Sbjct: 112 GNNCSSLSRVGATFNLDSVAAMAQVISTEARAFANAGHAGLTYFTPNINIFRDPRWGRGQ 171
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ETPGEDP++ RY V+ LQ+ E ++R LKV + CKHY AYD+++W G+DR
Sbjct: 172 ETPGEDPYLTSRYVETLVQNLQNGE--------DARYLKVVATCKHYTAYDMEDWGGIDR 223
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
+HF+A V++QD+ ETF+ PFE CV+ G +S+MCSYN VNGIPSCAD + N+ R +W
Sbjct: 224 FHFNAVVSDQDLVETFMPPFEACVRVGKGASLMCSYNAVNGIPSCADDFINNEIAREQWG 283
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
GYIV+DC +I + H + ++ + A ++ G DLDCG +Y + +A+ +
Sbjct: 284 FDGYIVSDCGAIDCIQYTHNY-TNTTQATCAAGIQGGCDLDCGDFYQSHLMDAIGNATLH 342
Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAAREGIVLLKND 419
E D+D SL+ L+ +RLG FD + Y + I S E+ ELA + ARE IVLL ND
Sbjct: 343 EADLDFSLRRLFGHRIRLGEFDAASIQPYRQIPVSAINSQEHQELALQIARESIVLLGND 402
Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGC 476
NTLP + A V+ +A++GP+A+ ++GNY G ++P+ GF ++T+ GC
Sbjct: 403 NNTLPFSLATVRKLAIIGPNADDAETLLGNYYGDAPYLITPLKGFQQLDPTLSITFVKGC 462
Query: 477 DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
DV + AA+ AAK ADATI++ GL+ +VE+E+LDR L LPG Q +LI +
Sbjct: 463 -DVNSTDTSGFVAAAAAAKAADATIVVVGLNQTVESENLDRTTLVLPGVQAELILALTAA 521
Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
A+GPVILV+MS +D+ + ++A LW GYPG+ GGRA+A+ VFG F+P GRLP T
Sbjct: 522 ARGPVILVVMSGSPIDL--SNVIHPVRAALWIGYPGQAGGRALAEAVFGVFSPAGRLPFT 579
Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
Y DYV LP+T+M +R PGRTY+FY G L+ FG+GLSY+ F+Y + + +
Sbjct: 580 VYPADYVNQLPMTNMDMR-----AGPGRTYRFYTGTPLFEFGHGLSYSTFQYTWSNSSSS 634
Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
+ + R P V+ F+V QN G DVV+ +
Sbjct: 635 SSSSATSQHSLSTAALAAQHLAARAPVEAVS---------FRVLVQNTGKMASDDVVLAF 685
Query: 717 SKPPA---------EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
+ A + A+ I+ ++GF+R+ + G ++ I F + + + A TL
Sbjct: 686 ASFNASSIIDQSSSQFASPPIRSLVGFRRIHLAPGASQEIFFAVTSSQLAQVDSTGAQTL 745
Query: 768 LPAGEHTIFVGNGGVSFPIHL 788
+P+ F + + I L
Sbjct: 746 VPSRLQVAFGSDARLVAEIQL 766
>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
queenslandica]
Length = 728
Score = 536 bits (1381), Expect = e-149, Method: Compositional matrix adjust.
Identities = 294/741 (39%), Positives = 433/741 (58%), Gaps = 60/741 (8%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+++ +CD + RV DL+SRMT+ +K+ QL A +P L +P Y+WWSE LHGV+
Sbjct: 26 FNTYKYCDYTQSIPERVNDLLSRMTILDKIPQLITSAPAIPSLDIPAYQWWSEGLHGVAG 85
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
PG HF P ATSFP VI A+FN SL + Q +STEARA N G+AGLTY++PN
Sbjct: 86 -SPGVHFGGNFPNATSFPQVIGLGATFNMSLVLAMAQVISTEARAFANGGQAGLTYFAPN 144
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN+ RDPRWGR ETPGEDP++ +YA N+V+G+Q E A D +R LK + CKHY
Sbjct: 145 INIFRDPRWGRGQETPGEDPYLSSQYAANFVKGMQ-----EGADD--TRYLKTIATCKHY 197
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AAYD++N+ + R+ F+A V++QD EET+ F CV+EG S+MCSYN VNG+PSCA+
Sbjct: 198 AAYDLENYLNLSRHTFNAIVSDQDFEETYFPAFRSCVEEGKVGSIMCSYNAVNGVPSCAN 257
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
+ N+ RG+W GY+V+DC +I ++++HK+ +++ +D VA L+ G DL+CG +Y+
Sbjct: 258 DFINNEVARGKWGFEGYVVSDCGAISDIINSHKYTSNT-DDTVAAGLRGGCDLNCGHFYS 316
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAA 406
+ A G + + DID+++ L+T MRLG FD + + + ++ LA
Sbjct: 317 DHAQAAYDNGAITDDDIDRAMTRLFTYRMRLGMFDPPSMQPFRDYTNDKVDTKQHEALAL 376
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
+A+RE IVLL+N+++ LPL+ + +A+VGPH A AM GNY G +SP+ G
Sbjct: 377 DASRESIVLLQNNKDILPLSLTTHRKIALVGPHGQAQGAMQGNYKGTAPYLISPMQGLQD 436
Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAK-----TADATIILAGLDLSVEAESLDREDL 520
+VT+ GC VAC +I SE K + +A I + GLD S E+E DR L
Sbjct: 437 LGLSVTFAAGCTQVACP---TIAGFSEVTKLVEEHSIEAIIAVIGLDESQESEGHDRTSL 493
Query: 521 WLPGYQTQLINQVAE--VAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
LPG Q QL+ + + V P I+V+MS G VD++ + + AILWAGYPG+ GG+A
Sbjct: 494 TLPGQQVQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGVKDIAD--AILWAGYPGQSGGQA 551
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
IA+V++GK NP GRLP+T+Y Y+ +P T+M +R PGR+YKFY G ++PFG
Sbjct: 552 IAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRVP-----PGRSYKFYTGTPVFPFG 606
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
+GLSYT F+ + + V K H ++NY +
Sbjct: 607 FGLSYTTFE---MKWKNPPNVTHLKTTHDVDVNY-------------------------E 638
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
V N G GS V+ Y + + +K++ GFQ+++++ ++ + FV K
Sbjct: 639 VVVTNAGKRSGSVSVLAYIT--STVPGAPMKELFGFQKIYLKPEQSMTLSFVAEP-KVFT 695
Query: 759 IVDYAANTLLPAGEHTIFVGN 779
VD + G + I +G+
Sbjct: 696 TVDKHGERKIRPGTYKITIGD 716
>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 755
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 298/738 (40%), Positives = 434/738 (58%), Gaps = 55/738 (7%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+++L+C+ S + RVKDL+SR+T+ EK+ Q A + RL +P Y+WWSE LHG++
Sbjct: 53 FNAYLYCNYSASITERVKDLLSRLTVLEKMSQTATNASAIERLDIPAYDWWSECLHGLAQ 112
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
PG F++ + ATSFP VI A+FN SL +GQ +STEARA N G++GLT+++PN
Sbjct: 113 -SPGVFFENDLTSATSFPQVIGLGATFNMSLVLAMGQVISTEARAFANNGQSGLTFFAPN 171
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN+ RDPRWGR ETPGEDP++ +YA N+V+G+Q EG E + R LK + CKHY
Sbjct: 172 INIYRDPRWGRGQETPGEDPYLTSQYAANFVKGIQ--EGSE-----DRRYLKAIATCKHY 224
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AAY+++ + V R +F+A V++QD+EET+L F+ CV+EG S+MCSYN +NG+P+CA+
Sbjct: 225 AAYNLERYLDVRRVNFNAIVSDQDLEETYLPAFKACVQEGQVGSIMCSYNAINGVPNCAN 284
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
+ N+ R W GYIV+DC +I + H + +D+ VA LK G DL+CG +Y
Sbjct: 285 DFINNKIARDTWGFEGYIVSDCGAILDIQYKHNYTSDTN-ITVADALKGGCDLNCGHFYE 343
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ----DICSDENIEL 404
+ +A + E DIDKSL L+T MRLG FD P + +Q D+ + E +L
Sbjct: 344 KYMEDAFDNSTITEEDIDKSLTRLFTSRMRLGMFD--PPEIQPFRQYSVKDVNTPEAQDL 401
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
A AAREGIVLL+N + LPL+ K +A +GP+A+AT M GNY GI +SP+ GF
Sbjct: 402 ALNAAREGIVLLQNKGSVLPLDIVKHSNIAAIGPNADATHIMQGNYHGIAPYLISPLQGF 461
Query: 465 SGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
S N TY+ GC VAC A +A + DA I + GL+ + E ES DR + LP
Sbjct: 462 SNLGINATYQIGC-PVACNDTEGFPDAVKAVQGVDAVIAVIGLNNTQEGESHDRTSIALP 520
Query: 524 GYQTQLINQVAE-VAKG-PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
G+Q L+ ++ + AKG P+I+V+MS G VD+ + + AILWAGYPG+ GG+AIA+
Sbjct: 521 GHQEDLLLELKKNAAKGTPLIVVVMSGGSVDLTGVKDIAD--AILWAGYPGQSGGQAIAE 578
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
V++GK NP GRLP+T+Y Y+ +P T+M +R PGR+YKFY G ++PFG+GL
Sbjct: 579 VIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRVP-----PGRSYKFYTGTPVFPFGFGL 633
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
SYT F+ T T + K H +NY + +
Sbjct: 634 SYTTFEIKWKD-TSTAKDYYLKTTHDEVVNYEATVT------------------------ 668
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
N GS GS V+ + + + +K++ F+++++ + + FV K VD
Sbjct: 669 -NSGSRPGSVSVLAFIT--SSVPGAPMKELFAFKKIYLEPTESVDVSFVAEP-KVFTTVD 724
Query: 762 YAANTLLPAGEHTIFVGN 779
+ G + I +G+
Sbjct: 725 IYGIRKIRPGAYKIIIGD 742
>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
gi|223944757|gb|ACN26462.1| unknown [Zea mays]
Length = 630
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 272/645 (42%), Positives = 401/645 (62%), Gaps = 25/645 (3%)
Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
M+N G+AGLTYW+PNIN+ RDPRWGR ET GEDP V Y++ YV+G Q +
Sbjct: 1 MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQ-------GEE 53
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
+++S+CCKHY AYD++ W+G RY F+A+V QD+E+T+ PF+ C++E AS +
Sbjct: 54 GEEGRIRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCL 113
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MC+YN+VNG+P CA LL +T R EW GYI +DCD++ ++ +N + S ED++A
Sbjct: 114 MCAYNQVNGVPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSGEDSIAI 171
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
LKAG+D++CG + T +A+++GK++E DID++L L++V +RLG FD +
Sbjct: 172 VLKAGMDINCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQ 231
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
LG +C+ E+ ELAAEA R+G VLLKND N LPL ++V+ VA++GP AN AM G+Y
Sbjct: 232 LGPNSVCTKEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDY 291
Query: 451 AGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
G+PC + + G YA T + GC D +C S + A EAAK AD +++AGL+L+
Sbjct: 292 TGVPCNPTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLT 351
Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
E E DR L LPG Q LI+ +A VAK P++LV++ G VD++FA+ + I +ILW G
Sbjct: 352 EEREDFDRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLG 411
Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
YPGE GG+ + +++FG++NPGG+LPITWY + +P+T M +R S GYPGRTY+FY
Sbjct: 412 YPGEVGGQVLPEILFGEYNPGGKLPITWYPESFT-AIPMTDMNMRADPSRGYPGRTYRFY 470
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS-DASKTRCPG---VL 685
G +Y FGYGLSY+++ Y++ S K I V+ + +L S + TR G V
Sbjct: 471 TGDVVYGFGYGLSYSKYSYSISSAPKKITVSRSS-----DLGIISRKPAYTRRDGLGSVK 525
Query: 686 VNDL-RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
D+ C+ F V N GS DGS V+++++ + + IKQ++GF+ V AG
Sbjct: 526 TEDIASCEALVFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGS 585
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
++ + CK ++ + +L G H + VG+ I L
Sbjct: 586 ASNVEITVDPCKQMSAANPEGKRVLLLGAHVLTVGDEEFELSIEL 630
>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
Length = 1620
Score = 527 bits (1358), Expect = e-147, Method: Compositional matrix adjust.
Identities = 292/747 (39%), Positives = 440/747 (58%), Gaps = 51/747 (6%)
Query: 47 LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
L +F FC++SL R++D++SR+++ +KV + A GLP Y+WWSEALHGV
Sbjct: 919 LPAKNFPFCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADAGLPAYQWWSEALHGV 978
Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
PG F + ATSFP VI T+ASFN++LW IG +STEARAM N+ +AGLT+W+
Sbjct: 979 G-FSPGVTFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEARAMNNVNQAGLTFWA 1037
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
PNIN+ RDPRWGR ETPGEDP+ G YA N+V G+Q+ E ++R +K SSCCK
Sbjct: 1038 PNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQEGE--------DTRYIKASSCCK 1089
Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
H+ Y++++W VDR+HF+A T+QD+ +T+L FE CV+ G ASS+MCSYN VNG+PSC
Sbjct: 1090 HFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFESCVRFGRASSLMCSYNAVNGVPSC 1149
Query: 287 ADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
A+ ++ R W GYI +DC +++ + NHK+ ++ V L AG+D+DCG +
Sbjct: 1150 ANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYY-NTTGATVNGVLSAGMDVDCGSF 1208
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIEL 404
+ +A+ G V +D++L L+ V RLG FD + Y++L + + E+ +L
Sbjct: 1209 LSQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMFDPAEDQPYLNLTTDAVNTPEHQQL 1268
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
A EAAR+G+ LL+N + LPL+++ +K +A++GP+ANAT M GNY G +SP G
Sbjct: 1269 ALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNANATGVMQGNYNGKAPFLISPQQGV 1328
Query: 465 SGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
Y +NV + G A AAK AD +++ GLD + E+E DRE + LP
Sbjct: 1329 QQYVSNVALELG--------------AVTAAKAADTVVMVIGLDQTQESEGHDREIIALP 1374
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q +L+ QVA + P+++V+M+ G VD+ + N+ G+ GG+A+A+ +
Sbjct: 1375 GMQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDLDNV---------GQAGGQALAETL 1425
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
FG NPGGRLP T Y D V + + +RP + G PGRTY+FY G +Y +G GLSY
Sbjct: 1426 FGDNNPGGRLPYTLYPADLVNQVSMFDDGMRPNATSGNPGRTYRFYTGTPVYAYGTGLSY 1485
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T F Y + T +++V+ +++ + + +T + +++ +DY V QN
Sbjct: 1486 TSFSYE--TSTPSLRVSAERVRA-----WVAARGQT---SFIRDEVDAEDYITVTV--QN 1533
Query: 704 VGSTDGSDVVIVYSKPPAEIA-ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
G+ G+DVV V+ K A IK + GF+RVF++ G I+F L++V+
Sbjct: 1534 NGTVAGADVVQVFIKTTTPGADGNPIKSLCGFERVFLKPGETTSIQFPVTP-HDLSVVNS 1592
Query: 763 AANTLLPAGEHTIFVGNGG-VSFPIHL 788
+ G T+ V + +S PI +
Sbjct: 1593 RGERVAVPGTWTVEVHHEARLSIPISV 1619
>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
Length = 732
Score = 516 bits (1330), Expect = e-143, Method: Compositional matrix adjust.
Identities = 309/755 (40%), Positives = 423/755 (56%), Gaps = 76/755 (10%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CDP + L FC +LP RV DL+ RMTL EKV L + A VPRLG+
Sbjct: 27 FACDPKDGTNRDLP-----FCQVNLPIHTRVNDLIGRMTLQEKVGLLVNNAAAVPRLGIK 81
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPGT F P ATSFP VI T ASFN +LW+ IG+ VS EARAM
Sbjct: 82 GYEWWSEALHGVSNVGPGTKFGGAFPVATSFPQVITTAASFNATLWEAIGRVVSDEARAM 141
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
+N G AGLTYWSPN+ + PRWGR ETPGEDP VVG+YA +YVRGLQ +G
Sbjct: 142 FNGGVAGLTYWSPNVTYSVYPRWGRGQETPGEDPVVVGKYAASYVRGLQGSDGIR----- 196
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKH+ AYD+DNW GVDR+HF+A+V++QDM +TF PF MCVKEG +SVM
Sbjct: 197 ----LKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPFRMCVKEGKVASVM 252
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN+VNGIP+CADP LL +TVRG+W L+GYIV+DCDS V F S +
Sbjct: 253 CSYNQVNGIPTCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQHF--TSPRRSSLGC 310
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGK 393
KAGLDLDCG + +AV++ +E +I+ + T + LG FDGSP Q V
Sbjct: 311 YKAGLDLDCGPFLVTHR-DAVKKA-AEEAEINNAWLKTLTFQISLGIFDGSPLQAVGDVV 368
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA--NATVAMIGNYA 451
+ N +LA A + + + KN L S + + GP A + M+GNY
Sbjct: 369 PTMGPPTNQDLAVNAPKR-LFIFKN--RAFLLYSPR----HIFGPVALFKSLPFMLGNYE 421
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
G+PC+Y+ P+ G +G+ ++ Y GC +V C + + +A + A +ADA +++ G D S+E
Sbjct: 422 GLPCKYLFPLQGLAGFVSLLYLPGCSNVICAVAD-VGSAVDLAASADAVVLVVGADQSIE 480
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E DR D +LPG Q +L+ +VA AKGPV+LVIM D+A + + +
Sbjct: 481 REGHDRVDFYLPGKQQELVTRVAMAAKGPVLLVIM-----DLAISGGGCSYNQV------ 529
Query: 572 GEEGGRAIADVVFGK-------FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
G I+DV G N G +P Y+ + L T + P S +
Sbjct: 530 ---NGIPISDVCEGSSYRWPSFSNCHGYMPWISYSRAIWETLRFTKVNWVPTWSW---NK 583
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
+KF G +++ + + L K H + S+ V
Sbjct: 584 LHKF-----------GSHHSKCTDDGFGTPRRPPPWLRKCNH-----FQGRQSELHMLDV 627
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
+ D +VD +N GS DG+ ++VY +PPA A + KQ++ F++V V AG
Sbjct: 628 I------DSLLGMQVDVKNTGSMDGTHTLLVYFRPPARHWAPH-KQLVAFEKVHVAAGTQ 680
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+R+ + CKSL++VD + +P GEH++ +G+
Sbjct: 681 QRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGD 715
>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 732
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 296/751 (39%), Positives = 434/751 (57%), Gaps = 74/751 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ SF +C+ SLP S RVKDL+SRMTL EK+ QLG+ A + RL +P Y+WWSE LHGV+
Sbjct: 28 KFQSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGLHGVA 87
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
+ PG HF+ + ATSFP VI T +SFN++L+ +I +STEARA N G+ Y+
Sbjct: 88 D-SPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARAFAN---QGIVYFKQ 143
Query: 168 NINV--------ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
+ + RDPRWGR ETPGEDP++ +YA+ +V G Q +S+ L
Sbjct: 144 HQQLLSNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQG----------DSKYL 193
Query: 220 KVSSCCKHYAAYDVDNW-KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
KV + CKH+A YD++++ G R+ F+A++T QD EET+ F+ CV+E + +S+MCSYN
Sbjct: 194 KVVTTCKHFAGYDLEDYVDGETRHSFNAKITPQDFEETYYPAFKACVEEANVASIMCSYN 253
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
VNG+PSCAD ++ N+ R W G+I +DC +I + + H + ++ +D VA LK G
Sbjct: 254 EVNGVPSCADGQINNKLARDTWGFDGFIASDCGAIDDIQNKHHY-TNNTDDTVAAALKGG 312
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQD 395
DL+CG YY + +A G + +I+ +L L+T M+LG FD P+ Y ++
Sbjct: 313 CDLNCGSYYQSHAQSAFLNGTITIGEINLALTRLFTARMKLGMFD-PPELQPYNAISPDV 371
Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
+ S E+ LA AARE IVLL+N+ + LPLN K T+AVVGPHA AT M GNY G+
Sbjct: 372 VNSLEHQALALNAARESIVLLQNNNDVLPLNFEKHSTIAVVGPHAMATDVMQGNYNGVAP 431
Query: 456 RYMSPIAGFS--GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
+SP+ GF G +V +GC DV C+ + A + A ADA I + GLD S E+E
Sbjct: 432 YLISPVEGFENLGIDSVLTASGC-DVNCEVTDGFQDAFDIAVKADAVIAVLGLDQSHESE 490
Query: 514 SLDREDLWLPGYQTQLINQVAEVAK-----GPVILVIMSAGGVDIAFAETNTNIKAILWA 568
DREDL+LP Q + + + K P+I+V+MS VD+ T + AILWA
Sbjct: 491 GHDREDLFLPNLQDKFVQDLKNTLKAAGTNAPLIVVVMSGSSVDLTV--TKKHADAILWA 548
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
GYPG+ GG+AIA++++GK NP GRLP+T+Y G Y+ ++ M +R YPGRTYKF
Sbjct: 549 GYPGQSGGQAIAEIIYGKVNPSGRLPVTFYPGSYIDLVAFRHMSMRE-----YPGRTYKF 603
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
YN + FG GLSYT F L ++K + + R+++Y P V+ N
Sbjct: 604 YNDTPDFSFGDGLSYTTF---YLEWSKPV-----NMSGVRSVSY---------PTVVYN- 645
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
V N G G+ V+ Y A K++ GF++VF+ ++ +
Sbjct: 646 ----------VTVTNTGKMPGAISVLAYISYNNSGAPK--KKLFGFEKVFLNPLQSVSVT 693
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
F ++ K+ + VD + + G++ + +G+
Sbjct: 694 FPADS-KAFSTVDKSGKRSVNPGDYHVTIGD 723
>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
Length = 618
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 261/620 (42%), Positives = 373/620 (60%), Gaps = 21/620 (3%)
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPN+N+ RDPRWGR ETPGEDP +Y +V+GLQ + L + L+ S+C
Sbjct: 2 WSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ-------GSSLTN--LQTSAC 52
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
CKH AYD++ WKGV RY+F+A+VT QD+ +T+ PF CV +G AS +MC+Y +NG+P
Sbjct: 53 CKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVP 112
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+CA LL +TVRGEW L GY +DCD++ ++ + F + E+AVA LKAGLD++CG
Sbjct: 113 ACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHF-TRTAEEAVAVALKAGLDINCG 171
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDE 400
Y +A+QQGK+ E D+DK+LK L+ + MRLG FDG P+ Y L D+C+
Sbjct: 172 VYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPV 231
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ LA EAAR G+VLLKND LPL + V + AV+G +AN +A++GNY G+PC +P
Sbjct: 232 HKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTP 291
Query: 461 IAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRED 519
G Y + + GC AC + A+ AK++D ++ GL E E LDR
Sbjct: 292 FGGIQKYVKSAKFLPGCSSAACDV-AATDQATALAKSSDYVFLVMGLSQKQEQEGLDRTS 350
Query: 520 LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAI 579
L LPG Q LI VA +K PVIL++++ G VDI FA+TN I AILWAGYPG+ GG+AI
Sbjct: 351 LLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAI 410
Query: 580 ADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
ADV+FG+FNP G+LP+TWY ++ + +T M +RP + GYPGR+Y+FY G T+Y FGY
Sbjct: 411 ADVLFGEFNPSGKLPVTWYPEEFTK-FTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 469
Query: 640 GLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEF 697
GLSY++F ++S + L R + R + D RC+ F
Sbjct: 470 GLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVYRVDE--IGDDRCERLRFPV 527
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
V+ QN G DG V+++ + + ++Q+IGF+ ++ G K++K + C+ L
Sbjct: 528 MVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCEHL 587
Query: 758 NIVDYAANTLLPAGEHTIFV 777
+ ++ G H + V
Sbjct: 588 SRARVDGEKVIDRGSHFLMV 607
>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
Length = 401
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 243/434 (55%), Positives = 313/434 (72%), Gaps = 36/434 (8%)
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLL 416
QGK +E D+D SL+ LY VL ++GFFDG P Y SL K+D+C+ E+IELAA+AAR+GIVLL
Sbjct: 2 QGKAREEDVDTSLRNLYIVLTQVGFFDGIPSYESLDKKDLCTKEHIELAADAARQGIVLL 61
Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGC 476
KN TLPL+ AK+K +A++GPHANAT+ M+GNYAG+PC+Y SP+ GFS Y VTY+ GC
Sbjct: 62 KNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAYGKVTYEMGC 121
Query: 477 DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
++V C + I A EA+K ADATI+L GLD +VE E LDR DL LPGYQT+LI QV
Sbjct: 122 NNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQVIVA 181
Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
+KGP+ILVIMS VDI+F++T+ +KAILWAGYPGEEGGRAIADVV+GK+NPGGRLP+T
Sbjct: 182 SKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPGGRLPLT 241
Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
W+ DY+ MLP+TSM LRPV++ YPGRTYKF+NG +YPFG+GLSYT+F Y L S
Sbjct: 242 WHQNDYLSMLPMTSMSLRPVNN--YPGRTYKFFNGSVVYPFGHGLSYTKFNYTLRS---- 295
Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
+++ C D+FE ++ +N+G+ G++VV+VY
Sbjct: 296 ------------------------------SNMSCKDHFELDIEVKNIGAKHGNEVVLVY 325
Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
SKPP I T+ KQVIGF+RVFV AG ++ +KF FN CKSL IV Y A LLP+GEH I
Sbjct: 326 SKPPTGIVGTHAKQVIGFKRVFVPAGGSQNVKFEFNVCKSLGIVGYNAYKLLPSGEHKII 385
Query: 777 VGNGGVSFPIHLNF 790
+G+ S PI ++F
Sbjct: 386 IGDSPTSLPIDISF 399
>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 733
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 293/736 (39%), Positives = 431/736 (58%), Gaps = 59/736 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+C+ L + RVKDL+SR+TL+EK+ QLG+ A + RLG+P Y+WWSE LHGV+ V PG
Sbjct: 37 YCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPGYQWWSEGLHGVA-VSPGL 95
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
H + TSFP +I T +SFN+SL+ +IG+AVSTEAR + G+ GLTY++PNIN+ R
Sbjct: 96 HLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFADNGQGGLTYFTPNINIVR 155
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGR ET GEDP++ +YAVN VRG Q G++ S K+ + CKH+AAYD+
Sbjct: 156 DPRWGRGQETAGEDPYLTSQYAVNLVRGAQ---GND------SEYKKIIATCKHFAAYDL 206
Query: 234 DNWKGVD-RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+++ D R F+A VT+QD+EET+ F CV G S+MCSYN VNG+PSC D
Sbjct: 207 ESYINGDVRDSFNAEVTKQDLEETYFPAFRSCVTAGGVGSIMCSYNSVNGVPSCVDGVFN 266
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
N+ R +W GY+V+DC +I +++ H + + + D VA LK G DL+CG +Y
Sbjct: 267 NKIARNKWKFDGYLVSDCGAIDDVMNKHHYTS-TPTDTVAAGLKGGTDLNCGSFYQTHAM 325
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY--VSLGKQDIC-SDENIELAAEAA 409
+A G + E DID+++ L+T MRLG FD P+Y S D+ + ++ +LA +AA
Sbjct: 326 DAFLNGSITEVDIDRAVGRLFTARMRLGLFD-LPKYQPYSYFNTDVVNTKQHQDLALQAA 384
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SGYA 468
RE IVLL+N+ LPL+ +AVVGP+ A V M G I +SP+ GF S
Sbjct: 385 RESIVLLQNN-GKLPLSYEDHHKIAVVGPNILANVTMQGISQVIAPYLISPVDGFKSKGL 443
Query: 469 NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQ 528
+VTY GC DV C + A + K A A + + GLD +E E++DRED++LPG Q +
Sbjct: 444 HVTYSLGC-DVKCIVTDGFHDAFKLVKDAKAVVAVMGLDQGIERETVDREDIFLPGLQDK 502
Query: 529 LINQVAEVAKG-----PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
+ + + P+I+VIMS VD+ +E+ + AILW GYPG+ GG+AIA+V+
Sbjct: 503 FLLGLRDTLTNLQSPVPLIVVIMSGSSVDL--SESKSLADAILWVGYPGQSGGQAIAEVI 560
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
+G+ NP GRLP+T+Y G+Y+ ++ M +R PGRTY+FY ++PFG+GLSY
Sbjct: 561 YGEVNPSGRLPLTFYPGEYIDLVAYRHMSMREP-----PGRTYRFYTENPVFPFGHGLSY 615
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T F+ LS+T NK+ + ++++D D +F + N
Sbjct: 616 TTFE---LSWT-------NKMNNVTE--------------IVISD-SVDINIDFDITVVN 650
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G G+ V+ Y + I ++++ F +VF+ +K+I +F + VD
Sbjct: 651 TGYLSGAVSVLGYVS--SNIPDAPLRELFDFDKVFIDKYESKKIS-LFATNDAFTTVDEK 707
Query: 764 ANTLLPAGEHTIFVGN 779
+ GE+ I + N
Sbjct: 708 GRRNILPGEYDIAIEN 723
>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
Length = 800
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 289/735 (39%), Positives = 414/735 (56%), Gaps = 40/735 (5%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L CDSS R LV +TL+EK+ G+ + GVPRLG+P+Y+WW+EALHGV+ PG
Sbjct: 39 LVCDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNEALHGVAFTYPG 98
Query: 113 THFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
+ ATSFP IL A+F++ L ++ VSTEARA N GR+GL YW+PNIN
Sbjct: 99 QPMTESGNFSSATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGRSGLDYWTPNIN 158
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSCCKHYA 229
+DPRWGR ETPGEDPF + Y N +RGL EG++N P K+ + CKH+
Sbjct: 159 PYKDPRWGRGQETPGEDPFHLASYVQNLIRGL---EGNQN------DPYKKIVATCKHFT 209
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
YD++NW G RY FDA++ +DM E ++ PF+ C +E + MCSYN VNG+P+CADP
Sbjct: 210 GYDMENWNGNFRYQFDAQINMRDMVEYYMPPFQACAREAKVGAFMCSYNAVNGVPTCADP 269
Query: 290 KLLNQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
LL +R W + ++V+DCD+IQ + H++ A+S+E AVA TL AG DL+CG Y
Sbjct: 270 WLLQTVLREHWGWNQEDQWVVSDCDAIQNVYLPHEW-AESREQAVADTLNAGTDLNCGTY 328
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIEL 404
Y + A +QG + +T +D++L Y+ L++LG+FD S Y +G QD+ S EL
Sbjct: 329 YQRYLPGAYEQGLINDTTLDRALTRTYSSLIKLGYFDNADSQPYRQIGWQDVNSQHAQEL 388
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AG 463
A +AA+EGIVLLKND LPL+ V ++A++G ANAT M GNYAG+ SP+ A
Sbjct: 389 ALKAAQEGIVLLKND-GLLPLSLDGVSSIALIGSWANATEQMQGNYAGVAPYLHSPLYAA 447
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
V Y G + + + A AA+ +D I++ G+D +E+E LDR +
Sbjct: 448 EQLGVKVNYAEGASQ-SNPTTDQWGAEYTAAENSDVIIVVGGIDNDIESEELDRVAIAWS 506
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q +I ++A K PVI+V M AG +D +N NI A+LW GYPG++GG A+ D++
Sbjct: 507 GPQLDMITKLATYGK-PVIVVQMGAGQLDSTPLVSNANISALLWGGYPGQDGGTALFDII 565
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
G P GRLPIT Y Y + + +T M LRP + GRTYK+YNG ++PFG+GL Y
Sbjct: 566 TGAVAPAGRLPITQYPARYTKEVAMTDMSLRPSSTSA--GRTYKWYNGTAVFPFGFGLHY 623
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR-CPGVLVNDLRCDDYFEFKVDFQ 702
T F + S + + + C +D SK CP + VD
Sbjct: 624 TNFSAAIPSPPASSFAISDLVASCS----ANDTSKLDLCP-----------FTSLAVDIA 668
Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
N G+ V + + + ++ +QR+ A + + SL VD
Sbjct: 669 NDGTRASDFVALAFLTGEFGPSPHPKSSLVAYQRLHAIAAGETQTARLNLTLGSLVRVDE 728
Query: 763 AANTLLPAGEHTIFV 777
+ LL G++++ +
Sbjct: 729 NGDKLLYPGDYSVLI 743
>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
Length = 699
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 266/646 (41%), Positives = 378/646 (58%), Gaps = 87/646 (13%)
Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
S + ++ + VSTEARAMYN+G AGLT+WSPN+N+ +DPRWGR ETPGEDP + +YA
Sbjct: 128 SKFMRLRKVVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASG 187
Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETF 257
YVRGLQ + D + LKV++CCKHY AYD+DNWKGVD +HF+A VT QDM++TF
Sbjct: 188 YVRGLQ------QSDDGSPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTF 241
Query: 258 LRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV 317
PF+ CV +G+ +SV+ YIV+DCDS+ V
Sbjct: 242 QPPFKSCVIDGNVASVI------------------------------YIVSDCDSVDVFY 271
Query: 318 DNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLM 377
++ + + E+A A+ + AGLDL+CG + T AV+ G V E+ +DK++ + LM
Sbjct: 272 NSQHY-TKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLM 330
Query: 378 RLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
RLGFFDG+P Y LG +D+C+ E+ E A EA R+GIV
Sbjct: 331 RLGFFDGNPSKAIYGKLGPKDVCTSEHQERAREAPRQGIV-------------------- 370
Query: 435 VVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAA 494
+AG PC+Y +P+ G + TY GC +VAC + I A + A
Sbjct: 371 ---------------FAGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQ-IDEAKKIA 414
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
ADAT+++ G+D S+EAE DR ++ LPG Q LI +VA+ +KG VILV+MS GG DI+
Sbjct: 415 AAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKXSKGNVILVVMSGGGFDIS 474
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
FA+ + I +I W GYPGE GG AIADV+FG +NP G+LP+TWY YV +P+T+M +R
Sbjct: 475 FAKNDDKITSIQWVGYPGEAGGAAIADVIFGFYNPSGKLPMTWYPQSYVDKVPMTNMNMR 534
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P + GYPGRTY+FY G T+Y FG GLSYTQF ++L+ K++ + + + C +
Sbjct: 535 PDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEAHSCHS----- 589
Query: 675 DASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
++C V C + F+ + N G+ GS V ++S PP+ + + K ++G
Sbjct: 590 ----SKCKSVDAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPS-VHNSPQKHLLG 644
Query: 734 FQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
F++VFV A ++F + CK L+IVD + G H + VGN
Sbjct: 645 FEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGN 690
>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
Length = 522
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 256/525 (48%), Positives = 345/525 (65%), Gaps = 19/525 (3%)
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
+NG+P+CAD +LL +TVR +W LHGYIV+DCDS++VMV + K+L + +A A +KAGL
Sbjct: 1 INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGL 60
Query: 340 DLDCGQY-------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG 392
DLDCG + +T + +AV+QGK+KE+ +D +L LY LMRLGFFDG P+ SLG
Sbjct: 61 DLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIPELESLG 120
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGP--HANATVAMIGNY 450
D+C++E+ ELAA+AAR+G+VLLKND LPL+ KV +VA+ G H NAT M+G+Y
Sbjct: 121 AADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDVMLGDY 180
Query: 451 AGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV 510
G PCR ++P G + T CD +C + + AAKT DATI++AGL++SV
Sbjct: 181 RGKPCRVVTPYDGVRKVVSSTSVHACDKGSCDTAAA------AAKTVDATIVVAGLNMSV 234
Query: 511 EAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
E ES DREDL LP Q IN VAE + P++LVIMSAGGVD++FA+ N I A++WAGY
Sbjct: 235 ERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGY 294
Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
PGEEGG AIADV+FGK+NPGGRLP+TWY +YV +P+TSM LRP GYPGRTYKFY
Sbjct: 295 PGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEHGYPGRTYKFYG 354
Query: 631 GP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD-ASKTRCPGVLVND 688
G LYPFG+GLSYT F Y + + V + ++C+ L Y + +S CP V V
Sbjct: 355 GADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVSSPPACPAVNVAS 414
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
C + F V N G DG+ VV +Y+ PPAE+ KQ++ F+RV V AG +
Sbjct: 415 HACQEEVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRVRVAAGAAVEVA 474
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG--VSFPIHLNFN 791
F N CK+ IV+ A T++P+G + VG+ +SFP+ ++
Sbjct: 475 FALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPVQIDLQ 519
>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
Length = 573
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 249/571 (43%), Positives = 360/571 (63%), Gaps = 13/571 (2%)
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
NS L+ S+CCKH+ AYD++NWKGV R+ FDA+VTEQD+ +T+ PF+ CV++G AS +M
Sbjct: 1 NSSDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIM 60
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYNRVNG+P+CAD LL++T RG+W +GYI +DCD++ ++ D + A + EDAVA
Sbjct: 61 CSYNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGY-AKAPEDAVADV 119
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
LKAG+D++CG Y +A QQGK+ DID++L+ L+ + MRLG F+G+P+ Y ++
Sbjct: 120 LKAGMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNI 179
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
G +C E+ +LA +AA++GIVLLKND LPL+ +KV +VAV+GP+ N ++GNY
Sbjct: 180 GADQVCKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYF 239
Query: 452 GIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV 510
G PC ++P GY + T+ GC+ C +N I A AA +AD ++ GLD +
Sbjct: 240 GPPCISVTPFQALQGYVKDATFVQGCNAAVCNVSN-IGEAVHAASSADYVVLFMGLDQNQ 298
Query: 511 EAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
E E +DR +L LPG Q L+N+VA+ AK PVILV++ G VD+ FA+ N I AI+WAGY
Sbjct: 299 EREEVDRLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGY 358
Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
PG+ GG AIA V+FG+ NPGGRLP+TWY ++ +P+T M +R S GYPGRTY+FY
Sbjct: 359 PGQAGGIAIAQVLFGEHNPGGRLPVTWYPKEFT-AVPMTDMRMRADPSTGYPGRTYRFYK 417
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
G T+Y FGYGLSY+++ + S T +++ ++ L T+ A+ T V
Sbjct: 418 GKTVYNFGYGLSYSKYSHRFAS-EGTKPPSMSGIE---GLKATASAAGTVSYDVEEMGAE 473
Query: 691 CDDYFEFK--VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
D F V QN G DG V+++ + P Q+IGFQ V +RA ++
Sbjct: 474 ACDRLRFPAVVRVQNHGPMDGRHPVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVE 533
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
F + CK + ++ G H + VG+
Sbjct: 534 FEVSPCKHFSRAAEDGRKVIDQGSHFVKVGD 564
>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
Length = 591
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 256/597 (42%), Positives = 364/597 (60%), Gaps = 26/597 (4%)
Query: 190 VVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
+ +YAV +V+G+Q G+ +A L+ S+CCKH AYD+++W GV RY+F+A+VT
Sbjct: 1 MASKYAVAFVKGMQ---GNSSAI------LQTSACCKHVTAYDLEDWNGVQRYNFNAKVT 51
Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
QD+E+T+ PF CV + A+ +MC+Y +NG+P+CA+ LL +TVRG+W L GYI +D
Sbjct: 52 AQDLEDTYNPPFRSCVVDAKATCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASD 111
Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSL 369
CD++ +M D ++ + EDAVA LKAGLD++CG Y A+QQGK+ E DIDK+L
Sbjct: 112 CDAVAIMRDAQRY-TQTPEDAVAVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKAL 170
Query: 370 KYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
K L+ + MRLG FDG P+ Y LG DIC+ E+ LA EAA +GIVLLKND LPL
Sbjct: 171 KNLFAIRMRLGHFDGDPRSNSVYGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPL 230
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSN 484
+ V + AV+GP+AN +A+IGNY G PC +P+ G GY NV + GC+ AC
Sbjct: 231 DRTAVASAAVIGPNANDGLALIGNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVA 290
Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
+ AA+ A+ ++D + GL E+E DR L LPG Q LI VA+ AK PVILV
Sbjct: 291 ATDQAAAVAS-SSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILV 349
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
+++ G VD+ FA+TN I AILWAGYPG+ GG AIA V+FG NPGGRLP+TWY ++ +
Sbjct: 350 LLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTK 409
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
+P+T M +R + GYPGR+Y+FY G T+Y FGYGLSY+ + L+S K + N L
Sbjct: 410 -VPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGYGLSYSSYSRQLVSGGKPAESYTNLL 468
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK----VDFQNVGSTDGSDVVIVYSKPP 720
R TS+ ++ + ++ D + K V+ QN G DG V++Y + P
Sbjct: 469 ASLRTTT-TSEGDES----YHIEEIGTDGCEQLKFPAVVEVQNHGPMDGKHSVLMYLRWP 523
Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
Q+IGF+ ++ G I+F + C+ + V ++ G H + V
Sbjct: 524 NAKGGRPTTQLIGFRSQHLKVGEKANIRFDISPCEHFSRVRKDGKKVIDRGSHYLMV 580
>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
Length = 785
Score = 487 bits (1253), Expect = e-134, Method: Compositional matrix adjust.
Identities = 286/731 (39%), Positives = 399/731 (54%), Gaps = 40/731 (5%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD + R L++ T++EK+ G A GVPRLGLP Y WW EALHGV+ PG +
Sbjct: 39 CDFTADPLTRATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQEALHGVAQ-SPGVN 97
Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F D ATSFP IL A+F++ L K + +STEARA N R+GL YW+PNIN
Sbjct: 98 FSDSGDFRYATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDARSGLDYWTPNINPF 157
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
+D RWGR ETPGEDP+ + Y + + GLQ + + KV + CKH+ AYD
Sbjct: 158 KDSRWGRGQETPGEDPYHLSSYVKSLIAGLQG----------DGKYKKVVATCKHFVAYD 207
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
++ W G RY FD V Q++ E ++ PF+ C ++ + + MCSYN +NGIP+CADP LL
Sbjct: 208 LETWNGNFRYQFDPHVGSQELVEYYMPPFQACARDANVGAFMCSYNSLNGIPTCADPYLL 267
Query: 293 NQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+R W+ ++ +DCDSIQ + H++ + ++E+AVA +LKAG D++CG YY
Sbjct: 268 QTILREHWNWTSEEQWVTSDCDSIQNVYLPHEYTS-TREEAVAVSLKAGTDVNCGTYYQE 326
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGKQDICSDENIELAAEA 408
F A+ G V E DID +L Y+ L+RLG+FDG+ +Y SL +D+ + +LA +A
Sbjct: 327 FLPGALSLGLVTEKDIDMALIRQYSSLVRLGYFDGTAVEYRSLSWKDVSTPYAQQLALKA 386
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AGFSGY 467
A EGI LLKND LPL K +AV+G ANAT M+GNY GIP SP+ A
Sbjct: 387 AVEGITLLKND-GILPLAITKDTKIAVIGDWANATEQMLGNYDGIPPYLHSPLWAAQQTG 445
Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
ANVTY + N+ A AD + G+D VEAE +DR + G Q
Sbjct: 446 ANVTYSGNPGGQGDPTTNNWLHIWTAVDEADVILFAGGIDNGVEAEGMDRVSIAWTGAQL 505
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
+I Q+A K PVI+ M GVD N NI A+LW GYPG++GG A+ D++ GK
Sbjct: 506 DVIGQLASRGK-PVIVAQMGTNGVDSTPLLNNQNISALLWGGYPGQDGGVALLDIIQGKS 564
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
P GRLP T Y Y+ +P+T M LRP + G+PGRTY +YN ++ FGYGL YT F
Sbjct: 565 APAGRLPTTQYPASYISKVPMTDMHLRPNSTTGFPGRTYMWYNEKPVFEFGYGLHYTNFS 624
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
+S T T ++ L +Y RCP + + K+ N G+
Sbjct: 625 AT-ISPTDTTSFSIADLTKDCTEHYMD-----RCP-----------FADMKIAVTNTGNV 667
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYAANT 766
V + + A K+++ +QR+ + AG ++ SL VD NT
Sbjct: 668 TSDYVTLGFLAGEHGPAPCPNKRLVNYQRLHNITAGASQTTSLNL-TLASLARVDDMGNT 726
Query: 767 LLPAGEHTIFV 777
+L G + + +
Sbjct: 727 VLYPGSYALLI 737
>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 790
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 284/732 (38%), Positives = 399/732 (54%), Gaps = 38/732 (5%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD++ R K L++ TL EK+ G + GVPRLGL YEWW EALHGV++ PG +
Sbjct: 39 CDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALHGVAS-SPGVN 97
Query: 115 FDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
F + G ATSFP IL A+F++ L + +STEARA N RAGL +W+PNIN
Sbjct: 98 FS--VSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRAGLDFWTPNIN 155
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
+DPRWGR ETPGEDP+ + Y + +RGLQ N KV + CKH+ A
Sbjct: 156 PFKDPRWGRGQETPGEDPYHLSSYVHSLIRGLQGD---------NPSYKKVVATCKHFVA 206
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
YDV+NW G RY DA + QD+ E ++ PF C ++ + + MCSYN +NG+P+CADP
Sbjct: 207 YDVENWNGNFRYQLDAHINSQDLVEYYMPPFRSCARDSNVGAFMCSYNSLNGVPTCADPY 266
Query: 291 LLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
LL +R W+ ++ +DCDS+Q + H + A S+E+A A +LKAG D++CG YY
Sbjct: 267 LLQTVLREHWNWTAEEQWVTSDCDSVQNVFLYHNY-ASSREEAAAISLKAGTDINCGTYY 325
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGKQDICSDENIELAA 406
A +QG + ETD+D SL Y L+RLG+FDG Y +L D+ + +LA
Sbjct: 326 QEHLPRAYEQGLINETDVDTSLIRQYGSLIRLGYFDGDRVPYRNLTWNDVSTPYAQDLAL 385
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AGFS 465
+AA GI LLKND LPL +A++G ANAT M+GNY GIP + SP+ A
Sbjct: 386 KAATSGITLLKND-GILPLQITNGTKIALIGDWANATDQMLGNYHGIPPYFHSPLWAAQQ 444
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
A VTY G + + + AA +D I + G+D VEAE DR + G
Sbjct: 445 TGAEVTYVQGPGGQSDPTTYTWRPIWSAANKSDVIIYIGGMDERVEAEEKDRVSIAWSGP 504
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +I Q+A+ P I+V M G +D + N NI+A+LW GYPG++GG+AI D++ G
Sbjct: 505 QLDVIGQLADYYDKPTIVVQMGGGSLDSSPLVKNPNIRALLWGGYPGQDGGKAIFDILQG 564
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
P GRLPIT Y DY+ +P+T LRP + G PGRTY + N ++ FGYGL YT
Sbjct: 565 ISAPAGRLPITQYRADYISKVPMTDTSLRPNATSGSPGRTYIWLNEEPVFEFGYGLHYT- 623
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
+FT TI + Y+ D+ + C ++ RC + F +D N G
Sbjct: 624 ------NFTATI-----PDAESSDTTYSIDSLASDCTESYLD--RC-PFKTFSIDVTNTG 669
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
S V + + K+++ +QR+ + + + SL+ VD N
Sbjct: 670 SVTSDYVTLGFLTGAHGPEPCPNKRLVSYQRLHNITAGSTQTAALNLTLGSLSRVDDKGN 729
Query: 766 TLLPAGEHTIFV 777
T+L G + + V
Sbjct: 730 TVLFPGSYALLV 741
>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
Length = 805
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 277/736 (37%), Positives = 412/736 (55%), Gaps = 37/736 (5%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ CD S R K LV+ T+ EK+ G+ + GVPRLGLP Y+WW EALHGV++
Sbjct: 38 LSNNTVCDKSADPVARAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQEALHGVAS 97
Query: 109 VGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
PG F+ ATSFP IL A+F+++L + + + VSTEARA N GRAGL +W+
Sbjct: 98 -SPGVTFNATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAFNNYGRAGLDFWT 156
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
PNIN RDPRWGR ETPGEDP+ + Y + + GLQ E E K+++ CK
Sbjct: 157 PNINPYRDPRWGRGQETPGEDPYHLSSYVHSLIMGLQGGEDPEIR--------KITATCK 208
Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
H+A YD+++W G RY D ++ ++D+ E +L F C ++ + + MC+Y+ +NG+P+C
Sbjct: 209 HFAGYDIESWNGNLRYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFMCTYSALNGVPTC 268
Query: 287 ADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
ADP LLN +R W + ++ +DCDSIQ + H F +D+++ A A L AG DLDC
Sbjct: 269 ADPWLLNDVLREHWGWTNEEQWVTSDCDSIQNIFLPHNF-SDTRQGAAAAALNAGTDLDC 327
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENI 402
G YY + A QG + +T +D++L LYT L+R G+FDG + Y +L D+ +
Sbjct: 328 GTYYQHHLPLAYSQGLINQTTVDQALVRLYTSLVRTGYFDGPNAMYRNLTWSDVGTTHAQ 387
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI- 461
+LA +AA EG+VLLKND LPL+ + +A++G ANAT M GNY G+P SP+
Sbjct: 388 QLALQAAEEGMVLLKND-GLLPLSISNGTKIALIGSWANATTQMQGNYYGVPTYLHSPLY 446
Query: 462 AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
A A V Y G + + AA+ AD I + G+D+SVEAE +DRED+
Sbjct: 447 AAQQTGAQVFYAQGPGGQGDPTTDHWLPVWTAAEKADIIIYIGGVDISVEAEGMDREDIN 506
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
G Q +I ++A K P++L M +D N NI A++W GYPG++GG A+ +
Sbjct: 507 WTGAQLDIIGELAMYGK-PMVLAQM-GDQLDNTPIVNNANISALIWGGYPGQDGGVALFN 564
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
++ GK P GRLP+T Y Y+ +P+T M LRP + G PGRTYK+YNG ++ FGYG+
Sbjct: 565 IITGKTAPAGRLPVTQYPAHYIADIPMTDMTLRPNATTGSPGRTYKWYNGTAVFEFGYGM 624
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
YT+F ++ +K+ + L C ++ K RC + V+
Sbjct: 625 HYTKFSADISPMSKSSYDISSLLSGC------NETYKDRCA-----------FESISVNV 667
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
N G+ + + + K ++ +QR+ AG + + + SL+ VD
Sbjct: 668 HNTGNVTSDYAALGFIAGQFGPSPYPKKSLVNYQRLHNIAGGSSQTATLNLTLGSLSRVD 727
Query: 762 YAANTLLPAGEHTIFV 777
NT L G++ + +
Sbjct: 728 DHGNTYLYPGDYALMI 743
>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 765
Score = 480 bits (1236), Expect = e-132, Method: Compositional matrix adjust.
Identities = 278/655 (42%), Positives = 383/655 (58%), Gaps = 46/655 (7%)
Query: 2 AKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPY 61
++ +S+L +S+ L F+ SP C+ G SK + CD++L
Sbjct: 6 SRRAASILACIVSLTQLGFA---------QSPFPDCENGPLSKNAV-------CDTTLDP 49
Query: 62 SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--I 119
R + L++ MTL+EK+ + GVPRLGLP Y WWSEALHGV+ PG HF D
Sbjct: 50 VTRAQALLAAMTLEEKINNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SPGVHFADSGNF 108
Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
ATSFP+ I A+F++ L K+I + TE RA N G AGL YW+PNIN RDPRWGR
Sbjct: 109 SYATSFPSPITLGAAFDDDLVKQIATVIGTEGRAFGNAGHAGLDYWTPNINPYRDPRWGR 168
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
ETPGEDPF RY + + GLQD G E K+ + CKH+A YD+++W+G
Sbjct: 169 GQETPGEDPFHTSRYVYHLIDGLQDGIGPEKP--------KIVATCKHFAGYDIEDWEGN 220
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
+RY FDA +++QDM E + PF+ C ++ +VMCSYN VNGIP+CADP LL +R
Sbjct: 221 ERYAFDAVISDQDMAEYYFPPFKTCTRDAKVDAVMCSYNSVNGIPTCADPWLLQTVLREH 280
Query: 300 WDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
W+ G ++ +DC +I + +HK++A A A + AG DLDCG Y F G+A+
Sbjct: 281 WEWEGVGHWVTSDCGAIDNIYKDHKYVA-DGAHAAAVAVNAGTDLDCGSVYPQFLGSAIS 339
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIV 414
QG + +D++L LY+ L++LG+FD + Y S+G D+ + + +LA AA EG V
Sbjct: 340 QGLLGNRTLDRALTRLYSSLVKLGYFDPAADQPYRSIGWSDVATPDAEQLAHTAAVEGTV 399
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY---MSPIAGFSGYANVT 471
LLKND TLPL K TVA+VGP+ANAT + GNY G +Y M A GY V
Sbjct: 400 LLKND-GTLPLK--KNGTVAIVGPYANATTQLQGNYEGT-AKYIHTMLSAAAQQGY-KVK 454
Query: 472 YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLIN 531
Y G + S + A AAK +D I G+D VEAE+LDR + PG Q LI
Sbjct: 455 YAPGT-GINSNSTSGFEQALNAAKGSDLVIYFGGIDHEVEAEALDRTSIAWPGNQLDLIQ 513
Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
Q++++ K P+++V G VD + +N + +LWAGYP + GG A+ D++ GK P G
Sbjct: 514 QLSDLKK-PLVVVQFGGGQVDDSSLLSNAGVNGLLWAGYPSQAGGAAVFDILTGKTAPAG 572
Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
RLP+T Y +YV +P+T M LRP S PGRTY++Y+ + PFGYG+ YT F
Sbjct: 573 RLPVTQYPEEYVDQVPMTDMNLRPGPS--NPGRTYRWYDKAVI-PFGYGMHYTTF 624
>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 286/733 (39%), Positives = 408/733 (55%), Gaps = 37/733 (5%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV--SNVGPG 112
C+ LP R K +V T+DE + +G+ + GV RLGLP Y+WWSEALHG+ SN
Sbjct: 61 CNPHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHGIARSNFTAS 120
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
+ ATSFP IL +FN L+K++G + TEARA N+GRAGL ++SPNIN
Sbjct: 121 GEYSH----ATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRAGLDFYSPNINPF 176
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RD RWGR E E P +VG YA+NYV+GLQ G + ++ N L+V++ CKH+ YD
Sbjct: 177 RDARWGRGQEVASESPVLVGNYALNYVQGLQG--GLD--SNQNDDTLQVAATCKHFVGYD 232
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+++W R ++A +++QD+ + +L F+ CV++ A+ MCSYN VNG+P+CA L
Sbjct: 233 MESWNQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMCSYNAVNGVPACASEFFL 292
Query: 293 NQTVRGEWDLH-GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
N +R +D G I +DCD+I + + H + D A A +KAG+D++CG Y N
Sbjct: 293 NTVLRDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLG-GAAADAIKAGVDVNCGDTYQNNL 351
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEA 408
G A+ + E I S+ Y+ L+RLG+FD SPQ Y D+ + + +LA +A
Sbjct: 352 GYALGNKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYDWNDVSTPQANQLAYQA 410
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA 468
A EGI LLKND TLP N KV+ VAV+GP ANAT M+G+YAG P +SP+ G
Sbjct: 411 AVEGIALLKND-GTLPFNKQKVRKVAVIGPWANATTQMLGDYAGTPPYMISPLQGAQSEG 469
Query: 469 -NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
V Y G + + AA AAK ADA + G+D SVE E+LDRE L PG Q
Sbjct: 470 FQVEYALGT-QINTTDTSGYTAALNAAKGADAIVYFGGIDNSVENEALDRESLAWPGNQL 528
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
L+++++ + K P++++ G +D + N N+ AI++AGYPG+ GG AI D++ GK+
Sbjct: 529 DLVSKLSGLKK-PLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYPGQSGGTAIWDILSGKY 587
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
P GRL T Y Y +P+T M LRP GYPGRT+ +YNG +Y FGYGL YT F
Sbjct: 588 APAGRLTTTQYPASYADQVPMTDMTLRPRQ--GYPGRTFMWYNGEPVYEFGYGLHYTTFS 645
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
+L + + + N Q + S+ G++ F V+ +N G T
Sbjct: 646 ASLANAPRGGHQSFNIEQVVA----AAKRSQYVDTGLITT---------FDVNIKNTGKT 692
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYAANT 766
++YSK A K ++ F ++ + AG+ + K SL D N
Sbjct: 693 TSDYAALLYSKTTAGPGPHPNKILVSFDKLHQIHAGQTQTAKLPV-TIGSLLQTDTNGNK 751
Query: 767 LLPAGEHTIFVGN 779
L G +T FV N
Sbjct: 752 WLYPGTYTFFVDN 764
>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
Length = 748
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 292/747 (39%), Positives = 405/747 (54%), Gaps = 98/747 (13%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ FC++SL R DLVSR+TLD+ + Q+G A VP LG+P Y WW+E LHGV
Sbjct: 10 LKDLPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTECLHGVLT 69
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
GT+ TSFP A+FN L K+ +A+S EARA+ N G GL +W+PN
Sbjct: 70 KC-GTNC------PTSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIGGLDFWAPN 122
Query: 169 I-----------------------NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
I ++ RDPRWGR E PGEDPF+ +Y +++RGLQ+
Sbjct: 123 IKYSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFMRGLQEG 182
Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
E +SR +V CKH+AAY ++ WK DR+ FDA V++ D ET+L F+ C+
Sbjct: 183 E--------DSRYPQVVGTCKHFAAYSLEAWKDYDRFMFDAIVSDYDFVETYLPAFKGCI 234
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
EG A S+MCSYN VNG+PSCA+ LL +R W GY+V+DCD++ + +NH F
Sbjct: 235 VEGRARSIMCSYNSVNGVPSCANDFLLRTILRDSWSFDGYVVSDCDAVDTIYNNHHF-TK 293
Query: 326 SKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
+ E A A L AG DL+CG +Y G A +G+V E ++ ++K L+ M LG +D
Sbjct: 294 TPEGACAVALHAGTDLNCGDFYQKHLGKAHSEGRVTEDEVRLAVKRLFRQRMELGMWDPP 353
Query: 386 PQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
+ Y + S E+ +LA +AARE +VLL+N + LPL + V+ VAV+GP+ANAT
Sbjct: 354 AEQPYKQYPPSVVGSREHSDLALQAARESMVLLQNRRGVLPLRKS-VRRVAVIGPNANAT 412
Query: 444 VAMIGNYAGIPCR------YMSPIAGFSG---YANVTYKTGCDDVACKSNNSIFAASEAA 494
M+GNY G C +SP A VTY GC DV + I A +AA
Sbjct: 413 ETMLGNYYGSRCHDGTYDCIVSPYLAIKAKLPQALVTYNLGC-DVDSTNTTGIPEAVKAA 471
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
+ AD I++ GL+ SVE+E DR + LPG Q LI + P ++V+M G V I
Sbjct: 472 QAADVAIVVLGLNTSVESEGKDRVAITLPGMQDHLIKSIV-ATNTPTVVVMMHGGAVAIE 530
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG----------GRLPITWYNGDYVQ 604
+ + + I+ A YPGE GG+AIADV+FG +NPG GRLP+T +YV
Sbjct: 531 WIK--DQVDGIVDAFYPGENGGQAIADVLFGDYNPGDNKTDGTTLLGRLPVTVLPANYVD 588
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
M+PLT+M +R S PGRTY++Y GP L+ FG+GLSYT FK LS
Sbjct: 589 MVPLTNMSMRA--SGNNPGRTYRYYTGPAPLWEFGFGLSYTTFKTEWLS----------- 635
Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAE 722
T P L + R D+ F+V NVG G +VV+ + ++ A+
Sbjct: 636 ---------------TPQPSALKSYAR-DEAVSFRVRVTNVGPVAGDEVVLAFVTRDNAD 679
Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKF 749
+KQ+ F+RV + G +K I F
Sbjct: 680 RGP--LKQLFAFERVHLNPGESKEIFF 704
>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 451
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 233/429 (54%), Positives = 304/429 (70%), Gaps = 17/429 (3%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
+ +P F CD + ++S+ FC+ S + R DLVSR+TL EKV L D +P
Sbjct: 35 AQTPAFACDASNAT-----LASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALP 89
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P YEWWSEALHGVS VGPGT F ++PGATSFP ILT ASFN +L++ IG+ VS
Sbjct: 90 RLGVPLYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSN 149
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAM+N+G AGLT+WSPNIN+ RDPRWGR ETPGEDP + +YAV YV GLQ
Sbjct: 150 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGA 209
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
A LKV++CCKHY AYDVDNWKGV+RY FDA V++QD+++TF PF+ CV +G+
Sbjct: 210 GA-------LKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGN 262
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNG P+CAD LL+ +RG+W L+GYI +DCDS+ V+ +N + + ED
Sbjct: 263 VASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPED 321
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A A ++KAGLDL+CG + T AVQ GK+ E+D+D+++ LMRLGFFDG P+
Sbjct: 322 AAAISIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPREL 381
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ +LG D+C+ N ELA EAAR+GIVLLKN LPL++ +K++AV+GP+ANA+ M
Sbjct: 382 PFGNLGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTM 440
Query: 447 IGNYAGIPC 455
IGNY G C
Sbjct: 441 IGNYEGTSC 449
>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
Length = 789
Score = 476 bits (1225), Expect = e-131, Method: Compositional matrix adjust.
Identities = 284/737 (38%), Positives = 399/737 (54%), Gaps = 42/737 (5%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
C++S R K LV+ TL+EK+ + GVPRLG+P Y+WWSE LHG++ GP T
Sbjct: 34 ICNTSASPLDRAKSLVTLYTLEEKINATSSGSPGVPRLGIPPYQWWSEGLHGIA--GPYT 91
Query: 114 HFDDV---IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
+F +TSFP IL A+F++ L + + +STEARA N R GL +W+PNIN
Sbjct: 92 NFSTSGIEYSYSTSFPQPILMGAAFDDHLITDVAKVISTEARAFNNANRTGLDFWTPNIN 151
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
RDPRWGR ETPGED F + Y + GLQ TD R V + CKH+A
Sbjct: 152 PFRDPRWGRGQETPGEDAFHLSSYVKALIAGLQG-----ETTDPYKR---VVATCKHFAG 203
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
YD+++W G RY FDA++++QD+ E +L+PF+ CV + + + MCSYN VNG+P+CADP
Sbjct: 204 YDIEDWNGNLRYQFDAQISQQDLVEYYLQPFQACV-QANVGAFMCSYNAVNGVPTCADPY 262
Query: 291 LLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
LL +R W + ++ +DCD++Q + H++ A ++E AVA L AG DLDCG Y
Sbjct: 263 LLQTILREHWGWTNEEQWVTSDCDAVQNIYLPHQWSA-TREQAVADALIAGTDLDCGTYM 321
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELA 405
A QG V E +D++L Y+ L+RLG+FD + Y G + +D + LA
Sbjct: 322 QEHLPGAFAQGLVNENVLDQALVRQYSSLVRLGWFDDAADQPYRQFGWDSVATDASQALA 381
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
AA EGIVLLKND LPL+ ++ V G ANAT ++GNYAG+P SP+
Sbjct: 382 RRAAVEGIVLLKND-GVLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLHSPLWALQ 440
Query: 466 GYANVTYKTGCDDVACK---SNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
N+T + + + N + S A T+D I + G+D S+E E DR L
Sbjct: 441 -QENLTINYAGGNPGGQGDPTTNRWSSLSGAIATSDILIYIGGIDNSIEEEGHDRTSLAW 499
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
G Q +I Q+A K P I+V+M G +D A N NI AILWAGYPG++GG AI D+
Sbjct: 500 TGAQLDVIFQLAATGK-PTIVVVMGGGQIDSAPLANNANISAILWAGYPGQDGGPAIVDI 558
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
+ GK P GRLP T Y Y ++P+T M LRP ++ PGRTYK+YNG Y FG+GL
Sbjct: 559 LTGKSPPAGRLPQTQYPASYTSLVPMTDMGLRPSEN--NPGRTYKWYNGTATYEFGHGLH 616
Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
YT F + S + + + C+N + + RC + +
Sbjct: 617 YTNFSATVTSPMQQSYRIADLMSTCKN---ATSITLERCA-----------FTSVDISVT 662
Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
N G+ V + Y A K ++G+QR+F A + +SL VD
Sbjct: 663 NTGAVASDYVTLCYISGSHGPAPHPKKSLVGYQRLFGIAAGASDTARIDLTLESLARVDE 722
Query: 763 AANTLLPAGEHTIFVGN 779
N +L GE+++ V N
Sbjct: 723 VGNKVLYPGEYSLMVDN 739
>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
SS1]
Length = 770
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 285/747 (38%), Positives = 416/747 (55%), Gaps = 45/747 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
++S L C++S + R K LV+ MTL+E V + + GVPRLGLP YEWWSEALHGV++
Sbjct: 30 LASNLVCNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSEALHGVAS 89
Query: 109 VGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
PG F+ GATSFP IL +A+F++ L + +STEARA N +GL +++
Sbjct: 90 -SPGVTFETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNHSGLDFFT 148
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
PNIN +DPRWGR ETPGEDP RY + GLQ G S K+ + CK
Sbjct: 149 PNINPFKDPRWGRGQETPGEDPLHTSRYVYQLITGLQGGVGP-------SPYYKIIADCK 201
Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
H+AAYD++NW+G +R F+A V+ QD+ E + F+ CV++ SVMCSYN VNG+P+C
Sbjct: 202 HFAAYDLENWEGNNRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVMCSYNAVNGVPAC 261
Query: 287 ADPKLLNQTVRGEWDLHG--YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
P LL VR ++L +I +DCD++ + D H + + +A A L AG D+DCG
Sbjct: 262 GSPYLLQDLVRDYFELGNDTWITSDCDAVGNIFDPHNYTT-TLTNASAVALLAGTDVDCG 320
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
Y+ G AV +G V ++D++++L LY L+RLG+FD S Y +LG D+ +
Sbjct: 321 TSYSETLGEAVSEGLVSKSDVERALVRLYGSLVRLGYFDPEDSVPYRALGASDVNTPAAQ 380
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA AA EGIVLLKND LPL S+ V +A++GP ANAT M GNY GI +SP+
Sbjct: 381 TLAYTAAVEGIVLLKND-GLLPL-SSNVSHIALIGPWANATTQMQGNYEGIAPLLISPLD 438
Query: 463 GFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
GF+ NV++ G ++ S + A A AD + + G+D +VEAE DR +
Sbjct: 439 GFTSAGFNVSFTNGTT-ISGNSTSGFADALSMASAADVIVYIGGIDDTVEAEGQDRTSIT 497
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
PG Q +LI ++ K P +++ M G VD + N+++ A+LW GYPG+ GG+A+AD
Sbjct: 498 WPGNQLELIGELGAFGK-PFVVIQMGGGQVDDTELKANSSVNALLWGGYPGQAGGKALAD 556
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
++ G P GRL T Y YV + +T M +RP +S G PGRTYK+Y G ++ FG+GL
Sbjct: 557 IITGVQAPAGRLTTTQYPASYVDQVAMTDMSVRPSNSTGSPGRTYKWYTGTPVFEFGFGL 616
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
YT F + ++ L N + ++ A V+ D F V
Sbjct: 617 HYTTFDVEWAEGSPAASYSIQDLVASANSSSSAVAH--------VDSAILD---TFTVQV 665
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNI-- 759
N G+ V +++S A + +++++ + RV K I +A SLN+
Sbjct: 666 TNTGNVTSDYVALLFSNTTAGPSPAPLQELVSYARV-------KGITPGVSATASLNVTL 718
Query: 760 -----VDYAANTLLPAGEHTIFVGNGG 781
VD N+++ G + ++V G
Sbjct: 719 GTIARVDEDGNSIIYPGVYNLWVDTTG 745
>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
Length = 760
Score = 473 bits (1217), Expect = e-130, Method: Compositional matrix adjust.
Identities = 288/734 (39%), Positives = 405/734 (55%), Gaps = 45/734 (6%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD+S R L+ TL+EK+ G+ + GVPRLGLP Y+WW EALHGV+ PG
Sbjct: 34 CDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQEALHGVAE-SPGVI 92
Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F + ATSFP IL A+F++ L ++ VSTEARA N R+GL +W+PNIN
Sbjct: 93 FAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANRSGLDFWTPNINPF 152
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
+DPRWGR ETPGEDPF + Y N + GLQ L+ ++ + CKHYA YD
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQG--------GLDPEYKRIVATCKHYAGYD 204
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
++NW+G RY FDA ++ QD+ E + R FE C ++ + + MCSYN VNG+PSCA+ LL
Sbjct: 205 LENWEGNVRYGFDALISIQDLSEFYTRSFETCARDANVGAFMCSYNAVNGVPSCANSYLL 264
Query: 293 NQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+RG W+ +I +DCD+IQ + + H + A ++E VA L AG DLDCG YY
Sbjct: 265 QDILRGHWNWTSDDQWITSDCDAIQNIYEPH-YYAPTRELTVADALNAGADLDCGTYYPE 323
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
G A +G E+ +D++L Y L++LG+FD + Y +G ++ + E ELA
Sbjct: 324 NLGAAYDEGLFAESTLDRALIRQYASLVKLGYFDPAENQPYRQIGWANVSTPEAEELAYR 383
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY 467
AA EGI L+KND TLPL S +K++A++GP ANAT M GNY G P +SP+
Sbjct: 384 AAVEGITLIKND-GTLPL-SPSIKSLALIGPWANATTQMQGNYYGQPPYLISPLMAAEAL 441
Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
Y + V + +S AA AA+ ADA I + G+D +VEAE++DR L PG Q
Sbjct: 442 NYTVYYSPGPGVDDPTTSSFPAAFAAAQAADAIIYIGGIDTTVEAEAMDRYTLDWPGVQP 501
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
I+Q+++ K P++++ M G VD + NTN+ A++W GYPG+ GG A+ D++ G
Sbjct: 502 DFIDQLSQFGK-PLVVLQMGGGQVDDSCLLPNTNVNALIWGGYPGQSGGTALMDIIVGNA 560
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
P GRLP T Y DYV + +T M LRP S PGRTY +Y G + FG+GL YT F
Sbjct: 561 APAGRLPTTQYPLDYVYQVAMTDMSLRP--SATNPGRTYMWYTGTPIVEFGFGLHYTNFS 618
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
+L +Y + C GV DL + + V+ N+GS
Sbjct: 619 --------------AELSQPSAPSYDIASLVGACEGVAHLDLCA--FESYTVNVTNIGSK 662
Query: 708 DGSDVV----IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
SD V + PA I K + + R+ A + + + SL+ VD
Sbjct: 663 VTSDYVALLFVAGEHGPAPIPN---KVLAAYDRLHTIAPLSSQQATLNLTLGSLSRVDEY 719
Query: 764 ANTLLPAGEHTIFV 777
N +L GE+T+ +
Sbjct: 720 GNRVLYPGEYTLIL 733
>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
NRRL Y-27907]
Length = 788
Score = 473 bits (1217), Expect = e-130, Method: Compositional matrix adjust.
Identities = 286/737 (38%), Positives = 409/737 (55%), Gaps = 45/737 (6%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV--SNVGPG 112
C+ LP + R K +V T+DE + +G+ + GV RLGLP Y+WWSE LHG+ SN
Sbjct: 61 CNPYLPNNQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEGLHGIARSNFTAS 120
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
+ ATSFP IL +FN L+K++G + TEARA N+GRAGL Y+SPNIN
Sbjct: 121 GEYSH----ATSFPQPILMGGAFNSDLYKQVGNVIGTEARAFNNVGRAGLDYYSPNINPF 176
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
+DPRWGR E E P +VG YA+NYV+GLQ G + ++ N L+V++ CKH+A YD
Sbjct: 177 KDPRWGRGQEVASESPVLVGNYALNYVQGLQG--GID--SNPNDDTLQVAATCKHFAGYD 232
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+++WK R ++A +++QD+ + + F+ CV++ A+ MCSYN +NGIP CA L
Sbjct: 233 MESWKQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAMCSYNAINGIPVCASEFFL 292
Query: 293 NQTVRGEWDLH-GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
+R +D G I +DCDS+ + + H ++ D A A +KAG+D++CG Y N
Sbjct: 293 GTVIREGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGA-AAADGIKAGVDVNCGDTYQNNL 351
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEA 408
G A+ + E I S+ Y+ L+RLG+FD SPQ Y + D+ + + +LA +A
Sbjct: 352 GYALGNKTINEDQIRASVTRQYSNLIRLGYFD-SPQTNKYRTYNWSDVSTSQANQLAYQA 410
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF--SG 466
A EGI LLKND TLP N KVK VAV+GP ANAT M+G+YAG P +SP+ G SG
Sbjct: 411 AVEGITLLKND-GTLPFNKDKVKNVAVIGPWANATTDMLGDYAGTPPYLISPLQGAQDSG 469
Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
+ V Y G + N AA AAK ADA + G+D S+E E+LDRE L PG Q
Sbjct: 470 F-KVQYAYGTQINTTLTTNYT-AALNAAKGADAIVYFGGIDNSIENEALDRESLAWPGNQ 527
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
L+++++ + K P+++V AG VD + N N+ +I++AGYPG+ GG AI DV+ G
Sbjct: 528 LDLVSKLSGLNK-PLVVVQFGAGQVDDTEIKNNNNVNSIVYAGYPGQSGGTAIWDVLNGI 586
Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
+ P GRL T Y Y +P+T M LRP D GYPGRT+ +YNG +Y FGYGL YT F
Sbjct: 587 YAPAGRLSTTQYPASYADQVPMTDMTLRPRD--GYPGRTFMWYNGEPVYEFGYGLHYTTF 644
Query: 647 KYNLLSFTKT---IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
+L + N+++ ++ Y + T F V+ +N
Sbjct: 645 SVSLANAPPKGAPQSFNIDQFIAAKSSQYVDTSLITT----------------FDVNIKN 688
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDY 762
G ++YS + K ++ F ++ + G+ + SL D
Sbjct: 689 TGKVTSDYAALLYSNTTSGPGPHPNKILVSFDKLHQIHPGQIQTASLPV-TIGSLLQTDT 747
Query: 763 AANTLLPAGEHTIFVGN 779
N L G +T FV N
Sbjct: 748 NGNKWLYPGAYTFFVDN 764
>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
Length = 771
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 303/761 (39%), Positives = 423/761 (55%), Gaps = 56/761 (7%)
Query: 37 CDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
C G SKL + CD+SL + R + LV+ MT +EKV + GVPRLGLP Y
Sbjct: 32 CSSGPLSKLAV-------CDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAY 84
Query: 97 EWWSEALHGVSNVGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
WWSEALHGV+ PG F D P ATSFP IL A+F++ L K++ VSTE RA
Sbjct: 85 NWWSEALHGVAG-SPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAF 143
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
N GR+GL +W+PNIN RD RWGR ETPGEDP V RY + V GLQ+ G N
Sbjct: 144 GNAGRSGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANP--- 200
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
KV + CKH+AAY +++W GV R+ F+A V+ QD+ E +L PF+ C ++ +VM
Sbjct: 201 -----KVVATCKHFAAYGLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVM 255
Query: 275 CSYNRVNGIPSCADPKLLNQTVRG--EWDLHG-YIVADCDSIQVMVDNHKFLADSKEDAV 331
CSYN +NG+P+CAD LL +R +WD G +I +DC +I + + H F + +A
Sbjct: 256 CSYNALNGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTT-TPAEAA 314
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
A L AG DLDCG + + G A +G +D++L LY+ ++LG+FD + Y
Sbjct: 315 ATALNAGTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSFVKLGYFDPAEDQPYR 374
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
S+G D+ + LA +AA EGIVLLKND+ TLPL + T+A++GP+ANAT M GN
Sbjct: 375 SIGWTDVDTPAVEALAHKAAGEGIVLLKNDK-TLPLKAK--GTLALIGPYANATKQMQGN 431
Query: 450 YAGIPCRYMSPI---AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
Y G P +Y+ + A +GY +V Y G + S AA AAK AD + G+
Sbjct: 432 YEG-PAKYIRTLLWAATQAGY-DVKYAAGT-AINTNSTAGFDAALSAAKQADVVVYAGGI 488
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
D ++EAE DR + PG Q LI+Q++++ K P+++V G VD + +N + A+L
Sbjct: 489 DNTIEAEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALL 547
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
WAGYP +EGG AI D++ GK P GRLP+T Y DYV +P+T M LRP + PGRTY
Sbjct: 548 WAGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMALRPGSNT--PGRTY 605
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YTSDASKTRCPGVL 685
++Y+ L PFG+GL YT FK +S+ + R L Y + A +R P +
Sbjct: 606 RWYDKAVL-PFGFGLHYTTFK---ISWPR------------RALGPYNTAALVSRSPKNV 649
Query: 686 VNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRN 744
D D F +V N G T V +++ K Y +K ++G+ R
Sbjct: 650 PIDRAAFDTFHIQV--TNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRAKQIKPGE 707
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
KR + + SL + +L G +T+ V G +P
Sbjct: 708 KRSVDIEVSLGSLARTAENGDLVLYPGRYTLEVDVGESQYP 748
>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 764
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 296/744 (39%), Positives = 416/744 (55%), Gaps = 52/744 (6%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L C+ S + R LV +TL+E V + + GVPRLGLP Y WWSEALHGV+ + PG
Sbjct: 36 LVCNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSEALHGVA-LSPG 94
Query: 113 THFDDVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
T+F +PG ATSFP I+ A+F++ L I +STEARA N GRAGL +++P
Sbjct: 95 TNFS--VPGSPFSSATSFPQPIILGATFDDDLVTSIATVISTEARAFNNAGRAGLDFFTP 152
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCK 226
NIN +DPRWGR ETPGEDPF + +Y V GLQ L+ P KV + CK
Sbjct: 153 NINPFKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQG--------GLSPDPYYKVIADCK 204
Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
H+A YD++NW+G R F+A ++ QD+ E + F+ CV++ SVMCSYN VNGIPSC
Sbjct: 205 HFAGYDLENWEGNSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSVMCSYNAVNGIPSC 264
Query: 287 ADPKLLNQTVRGEWDL-HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A+ LL +RG + L G+I +DCD++ + H++ + +A A LKAG D+DCG
Sbjct: 265 ANSYLLQDIIRGHFGLGDGWITSDCDAVANIFSPHQYTT-TLVNASAVALKAGTDVDCGT 323
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIE 403
Y+ +AV Q V E DI S+ LY L+RLG+FD + + LG D+ + +
Sbjct: 324 TYSQTLVDAVDQNLVTEDDIKNSMIRLYRSLVRLGYFDSPAEQPFRQLGWSDVNTPSSQA 383
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-- 461
LA AA EG+ LLKND TLPL+SA +K +A+VGP ANAT M GNY GI +SP+
Sbjct: 384 LALTAAEEGVTLLKND-GTLPLSSA-IKRIALVGPWANATTQMQGNYQGIAPFLVSPLQA 441
Query: 462 ---AGFS-GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
AGF +AN T DD + AA A + ADA I G+D ++E+E DR
Sbjct: 442 LQDAGFQVTFANGTAINSTDD------SGFAAAVSAVQVADAVIYAGGIDETIESEGNDR 495
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
E + PG Q L++Q+A V K P +++ M G VD + ++N + A++W GYPG+ GG
Sbjct: 496 EIITWPGNQLDLVSQLAAVGK-PFVVLQMGGGQVDSSSLKSNKAVNALIWGGYPGQSGGA 554
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
AI +++ GK P GRLPIT Y DYV +P+T M LRP + PGRTYK++ G ++ F
Sbjct: 555 AIVNILTGKIAPAGRLPITQYPADYVNEIPMTDMALRPNGT--SPGRTYKWFTGTPIFGF 612
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
G+GL YT F L + T + ++ S+ GV +L F F
Sbjct: 613 GFGLHYTTFS---LDWAPT---------PPSSFAISTLVSEANTAGVSFTNLA--PLFTF 658
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
+V+ +N G V +++S A +KQ++ + RV A + S+
Sbjct: 659 RVNVKNTGKVGSDYVALLFSNTTAGPQPAPLKQLVSYTRVKGIAPGQTETAELKVTLGSI 718
Query: 758 NIVDYAANTLLPAGEHTIFVGNGG 781
+D ++ L G + I+V G
Sbjct: 719 ARIDENGDSALYPGRYNIWVDTTG 742
>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
Length = 771
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 305/761 (40%), Positives = 425/761 (55%), Gaps = 56/761 (7%)
Query: 37 CDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
C G SKL + CD+SL + R + LV+ MT +EKV + GVPRLGLP Y
Sbjct: 32 CSSGPLSKLAV-------CDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAY 84
Query: 97 EWWSEALHGVSNVGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
WWSEALHGV+ PG F D P ATSFP IL A+F++ L K++ VSTE RA
Sbjct: 85 NWWSEALHGVAG-SPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAF 143
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
N GR+GL +W+PNIN RD RWGR ETPGEDP V RY + V GLQ+ G N
Sbjct: 144 GNAGRSGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANP--- 200
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
KV + CKH+AAYD+++W GV R+ F+A V+ QD+ E +L PF+ C ++ +VM
Sbjct: 201 -----KVVATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVM 255
Query: 275 CSYNRVNGIPSCADPKLLNQTVRG--EWDLHG-YIVADCDSIQVMVDNHKFLADSKEDAV 331
CSYN +NG+P+CAD LL +R +WD G +I +DC +I + + H F + +A
Sbjct: 256 CSYNALNGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTT-TPAEAA 314
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
A L AG DLDCG + + G A +G +D++L LY+ L++LG+FD + Y
Sbjct: 315 ATALNAGTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSLVKLGYFDPAEDQPYR 374
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
S+G D+ + LA +AA EGIVLLKND+ TLPL + T+A++GP+ANAT M GN
Sbjct: 375 SIGWTDVDTPAAEALAHKAAGEGIVLLKNDK-TLPLKAK--GTLALIGPYANATKQMQGN 431
Query: 450 YAGIPCRYMSPI---AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
Y G P +Y+ + A +GY +V Y G + S AA AAK AD + G+
Sbjct: 432 YEG-PAKYIRTLLWAATQAGY-DVKYAAGT-AINTNSTAGFDAALSAAKQADVVVYAGGI 488
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
D ++EAE DR + PG Q LI+Q++++ K P+++V G VD + +N + A+L
Sbjct: 489 DNTIEAEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALL 547
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
WAGYP +EGG AI D++ GK P GRLP+T Y DYV +P+T M LRP + PGRTY
Sbjct: 548 WAGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMALRPGSNT--PGRTY 605
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YTSDASKTRCPGVL 685
++Y+ L PFG+GL YT FK +S+ + R L Y + A +R P +
Sbjct: 606 RWYDKAVL-PFGFGLHYTTFK---ISWPR------------RALGPYNTAALVSRSPKNV 649
Query: 686 VNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRN 744
D D F +V N G T V +++ K Y +K ++G+ R
Sbjct: 650 PIDRAAFDTFHIQV--TNTGKTTSDYVALLFLKTTDAGPKPYPLKTLVGYTRAKQIKPGE 707
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
KR + + SL + +L G +T+ V G +P
Sbjct: 708 KRSVDIEVSLGSLARTAENGDLVLYPGRYTLEVDVGESQYP 748
>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
FP-91666 SS1]
Length = 772
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 289/740 (39%), Positives = 414/740 (55%), Gaps = 46/740 (6%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L C+++ + R L+ L + V + + GV RLGLP Y+WW+EALHGV + PG
Sbjct: 37 LVCNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNEALHGVGS-SPG 95
Query: 113 THF----DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
++ D ATSFP IL A+FN+SL I +STEARA N AGLT+++PN
Sbjct: 96 VNWGSGPDANFTSATSFPAPILLGATFNDSLIASIADVISTEARAFNNFNYAGLTFFTPN 155
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKH 227
IN RDPRWGR ETPGEDP+ + RY YV GLQ L+ P KV + CKH
Sbjct: 156 INPFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQG--------GLSPDPYYKVLANCKH 207
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
AYDV+NW+G DR F+A VT QD+ E + F+ C+++ +S MCSYN VNG+PSCA
Sbjct: 208 VLAYDVENWEGNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGASAMCSYNAVNGVPSCA 267
Query: 288 DPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+L VR W L G+I DC ++Q + H + D+ +A A + AG DLDCG
Sbjct: 268 SSYILKDLVRDFWGLGEREGWITGDCGAVQNIYQPHGY-TDTLVNATAVAMDAGTDLDCG 326
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENI 402
Y+ AV +G + I +L LY L+RLG+FD + Q Y S ++ + +
Sbjct: 327 DVYSPNLWTAVVEGLITAGQIQTALIRLYGSLIRLGYFDPAEQQPYRSFDWSNVNTPSSQ 386
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
+LA AA +GIVLL+ND LPL S VK +A++GP ANAT+++ GNYAGI +SP
Sbjct: 387 DLAYNAAVQGIVLLEND-GLLPL-STNVKNIALIGPMANATLSLQGNYAGIAPFVISPQQ 444
Query: 463 GF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
F +GY NVT+ G ++ N+ A EAA+ AD + + G+D S+EAE DR +
Sbjct: 445 AFETAGY-NVTFAFGT-GISNSDNSGYSEALEAAQGADVVVFVGGIDNSIEAEGQDRTSI 502
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
PG Q LI Q+ E+ K P+++V M G D + + N + A+LWAGYPG+ GG A+
Sbjct: 503 EWPGSQLDLIGQLGELGK-PLVVVRMGGGQCDDSTLKANATVNALLWAGYPGQSGGTALV 561
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
D++ GK +P GRLP+T Y YV + +T M +RP +S G PGRTYK+Y G +YPFGYG
Sbjct: 562 DIISGKQSPSGRLPVTQYPSSYVSEIDMTDMAIRP-NSSGSPGRTYKWYTGAPIYPFGYG 620
Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
+ YT F+ L+++ + N + N + + D D F V
Sbjct: 621 IHYTTFR---LAWSDSSSTTYNIQDIVSSANKSGGFA----------DTEILDTFSLLV- 666
Query: 701 FQNVGSTDGSD-VVIVYSKPPAEIAATYIKQVIGFQRV-FVRAGRNKRIKFVFNACKSLN 758
N GS SD V ++++ + + +++++G+ RV + G + S++
Sbjct: 667 -TNTGSNYTSDYVALLFANSTSGPSPAPLQELVGYTRVPHITPGGTATAELNV-TLGSIS 724
Query: 759 IVDYAANTLLPAGEHTIFVG 778
VD N +L G + ++VG
Sbjct: 725 RVDENGNWILYPGTYNLWVG 744
>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
NZE10]
Length = 802
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 274/736 (37%), Positives = 401/736 (54%), Gaps = 36/736 (4%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD++ R L++ TL EK+ G + GVPRLGLP Y WW EALHGV++ PG +
Sbjct: 39 CDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQEALHGVAS-SPGVN 97
Query: 115 FDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F D P ATSFP IL A+F++ L + + +STEARA N RAGL +W+PNIN
Sbjct: 98 FSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDKRAGLDFWTPNINPF 157
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
+D RWGR ETPGEDP+ + Y + GLQ + + +V + CKH+ AYD
Sbjct: 158 KDSRWGRGQETPGEDPYHLSSYVAALIEGLQGSP--------DDKYKRVVATCKHFVAYD 209
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+++W G RY FDA+V+ QD+ E ++ PF+ C ++ + + MCSYN +NG+P+CADP LL
Sbjct: 210 MESWNGNFRYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAFMCSYNALNGVPTCADPWLL 269
Query: 293 NQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+R +W+ ++ +DCD++Q + H + A ++E+A A +LKAG D++CG YY +
Sbjct: 270 QTVLREKWNWTSEQQWVTSDCDAVQNVFLPHDY-ASTREEAAALSLKAGTDINCGTYYQD 328
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAAEA 408
A QG + TD+D SL Y+ L+RLG+FDG + Y +L D+ + +LA +A
Sbjct: 329 HLPAAYDQGLINTTDLDISLIRQYSSLVRLGYFDGLAVPYRNLTWNDVSTPHAQQLAYKA 388
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AGFSGY 467
A EGI LLKND LPL + ++A++G ANAT M+GNY GIP + SP+ A
Sbjct: 389 AAEGITLLKND-GVLPLTISNGTSIALIGDWANATDQMLGNYDGIPPFFHSPLYAAQQTG 447
Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
A V + TG + + AA +D I G+D SVE+E +DR L G Q
Sbjct: 448 ATVNFATGPGGQGDPTTDHWLPVWAAANKSDVIIYAGGIDNSVESEGMDRVSLTWTGAQL 507
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
+I Q+A K PVI++ M G +D + N N+ A++W GYPG++GG A+ D++ G
Sbjct: 508 DMIGQLAMYGK-PVIVLQMGGGQIDSSPLVNNPNVSALIWGGYPGQDGGVALFDIIRGIT 566
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
P GRLP T Y Y+ +P+T M LRP + G PGRTY +YN ++P+G GL YT F
Sbjct: 567 APAGRLPTTQYPAKYISQVPMTDMTLRPNSTTGSPGRTYIWYNENAVFPYGLGLHYTNFT 626
Query: 648 YNLL-SFTKTIQVNLNKLQ----HCRNLNYTSDAS-KTRCPGVLVNDLRCDDYFEFKVDF 701
+ SF T + + L A+ K CP + F V
Sbjct: 627 AAIKPSFPSTYDSSSSNSGSASYDISTLTSNCTATYKDLCP-----------FTSFSVSI 675
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
N G V + + A K+++ +QR+ + + ++ SL VD
Sbjct: 676 TNTGEIMSDYVTLGFLAGIHGPAPHPNKRLVSYQRLHNITAGSSQTAWLNLTLGSLARVD 735
Query: 762 YAANTLLPAGEHTIFV 777
N +L G++ + V
Sbjct: 736 EMGNKVLYPGDYALLV 751
>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
Length = 523
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 239/528 (45%), Positives = 334/528 (63%), Gaps = 11/528 (2%)
Query: 265 VKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA 324
V +G+ +SVMCSYN+VNG P+CADP LL+ +RGEW L+GYIV+DCDS+ V+ N +
Sbjct: 3 VVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTK 62
Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
E A A ++ AGLDL+CG + T AV+ G V E IDK++ + LMRLGFFDG
Sbjct: 63 TPAE-AAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDG 121
Query: 385 SPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
+P+ Y LG D+C+ N ELAA+AAR+GIVLLKN LPL+ +KT+AV+GP+AN
Sbjct: 122 NPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKN-TGCLPLSPKSIKTLAVIGPNAN 180
Query: 442 ATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
T MIGNY G PC+Y +P+ G +G + TY GC +VAC + + A++ A TAD ++
Sbjct: 181 VTKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACAVAD-VAGATKLAATADVSV 239
Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
++ G D S+EAES DR DL LPG Q +L+ QVA+ AKGPV+LVIMS GG DI FA+ +
Sbjct: 240 LVIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPK 299
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
I ILW GYPGE GG AIAD++FG++NP G+LP+TWY YV+ +P+T M +RP + GY
Sbjct: 300 IAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKASGY 359
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS-DASKTR 680
PGRTY+FY G T+Y FG GLSYT+F + L+ + + L + CR+ S DA
Sbjct: 360 PGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEENHVCRSSECQSLDAIGPH 419
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
C + FE + +N G +G V +++ PPA I + K ++GF+++ +
Sbjct: 420 CENAVSGG---GSAFEVHIKVRNGGDREGIHTVFLFTTPPA-IHGSPRKHLVGFEKIRLG 475
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
++F CK L++VD + G+H + VG+ S I +
Sbjct: 476 KREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIRI 523
>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
Length = 771
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 307/760 (40%), Positives = 423/760 (55%), Gaps = 54/760 (7%)
Query: 37 CDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
C G SKL + CD+SL + R + LV+ MT +EKV + GVPRLGLP Y
Sbjct: 32 CSSGPLSKLAV-------CDTSLDVTTRARSLVNAMTFEEKVNNTQYNSPGVPRLGLPAY 84
Query: 97 EWWSEALHGVSNVGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
WWSEALHGV+ PG F D P ATSFP IL A+F++ L K++ VSTE RA
Sbjct: 85 NWWSEALHGVAG-SPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAF 143
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
N GRAGL +W+PNIN RD RWGR ETPGEDP V RY + V GLQ+ G N
Sbjct: 144 GNAGRAGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANP--- 200
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
KV + CKH+AAYD+++W GV R+ F+A V+ QD+ E +L PF+ C ++ +VM
Sbjct: 201 -----KVVATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVM 255
Query: 275 CSYNRVNGIPSCADPKLLNQTVRG--EWDLHG-YIVADCDSIQVMVDNHKFLADSKEDAV 331
CSYN +NG+P+CAD LL +R +WD G +I DC +I + + H + + +A
Sbjct: 256 CSYNALNGVPACADSYLLQTILREHWKWDEPGHWITGDCGAIDDIYNGHNY-TKTPAEAA 314
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
A L AG DLDCG + + G A +G +DK+L LY+ L++LG+FD + Y
Sbjct: 315 ATALNAGTDLDCGTVFPKYLGQAADEGLYTNKTLDKALVRLYSSLVKLGYFDPAEDQPYR 374
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
S+G +D+ S LA +AA EGIVLLKND+ TLPL + T+A++GP+ANAT M GN
Sbjct: 375 SIGWKDVDSPAAEALAHKAAVEGIVLLKNDK-TLPLKAK--GTLALIGPYANATKQMQGN 431
Query: 450 YAGIP--CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
Y G P R + A +GY +V Y G + S AA AAK AD + G+D
Sbjct: 432 YEGPPKYIRTLLWAATQAGY-DVKYVAGT-AINANSTAGFDAALSAAKQADVVVYAGGID 489
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
++EAE DR + PG Q LI+Q++++ K P+++V G VD + +N ++ A+LW
Sbjct: 490 NTIEAEGHDRTTIVWPGNQLDLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPHVNALLW 548
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
GYP +EGG AI D++ GK P GRLP+T Y DYV +PLT M LRP + PGRTY+
Sbjct: 549 TGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPLTDMALRPGSNT--PGRTYR 606
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YTSDASKTRCPGVLV 686
+Y+ L PFG+GL YT FK +S+ + R L Y + A +R P +
Sbjct: 607 WYDKAVL-PFGFGLHYTTFK---ISWPR------------RALGPYDTAALVSRSPKNVP 650
Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNK 745
D D F +V N G T V +++ K Y +K ++G+ R K
Sbjct: 651 IDRAAFDTFHIQV--TNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRAKQIKPGEK 708
Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
R + + SL + +L G +T+ V G +P
Sbjct: 709 RSVDIKVSLGSLARTAENGDLVLYPGRYTLEVDVGENQYP 748
>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
Length = 763
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 273/694 (39%), Positives = 394/694 (56%), Gaps = 42/694 (6%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L C+++ + R K L+ T +E V + + GVPRLGLP Y+WWSEALHGV+ PG
Sbjct: 35 LVCNTTANFMDRAKALIDEFTTEELVNNTVNGSPGVPRLGLPPYQWWSEALHGVAGANPG 94
Query: 113 THF---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
HF + ATSFP IL A+F++ L ++ +STEARA N G +G+ +++PNI
Sbjct: 95 VHFAPAGEDFDHATSFPQPILMGAAFDDELIHEVATVISTEARAFNNFGFSGIDFFTPNI 154
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
N RDPRWGR ETPGEDP + RY V LQ G S K+ + CKH+A
Sbjct: 155 NPFRDPRWGRGQETPGEDPLHISRYVFQLVTALQGGLGP-------SPYYKIVADCKHFA 207
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
YD+++W+G+DR+HFDA +T QD+ E + F+ CV++ SVMCSYN VNG+P+CA
Sbjct: 208 GYDLESWEGIDRFHFDAVITTQDLAEFYTPSFQSCVRDAKVGSVMCSYNSVNGVPACASS 267
Query: 290 KLLNQTVRGEWDL-HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
LL VR + L G+I +DCD++Q + H F ++ +A A +LKAG D+DCG Y
Sbjct: 268 YLLQDIVRDFYGLGDGWITSDCDAVQNVFTTHNFTT-TQANASAISLKAGTDVDCGNVYA 326
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
G+A+ QG V+E D+ ++L LY L+R G+FD SP+ + LG D+ + + LA
Sbjct: 327 QSLGDALDQGLVEEDDLKQALVRLYGSLVRTGYFD-SPEEQPFRQLGWADVDTPASRRLA 385
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF- 464
AA EGIVLLKND LPL+S V V +VGP NAT M GNY G +SP GF
Sbjct: 386 LLAAEEGIVLLKND-GLLPLSSRDVPNVIMVGPWGNATTMMQGNYFGNAPYLVSPRQGFV 444
Query: 465 -SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
+G+ NVT+ G + A AA D + + G D VE ES DR ++ P
Sbjct: 445 DAGF-NVTFFNGTVGTNGTDTSGFDEAVAAAGDTDLIVFVGGPDNVVERESRDRINITWP 503
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q LI ++A V K P+I++ M AG VD + + + I A++W GYPG+ GG A+A++V
Sbjct: 504 GVQLDLIKELAGVGK-PMIVLQMGAGQVDDTWLKESDAINALIWGGYPGQSGGTALANIV 562
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GK P RLPIT Y DY+ LP+T M +RP +S PGRTYK++ G ++ FG+GL Y
Sbjct: 563 TGKTAPAARLPITQYPEDYIS-LPMTDMNVRPSNS--SPGRTYKWFTGEPIFEFGFGLHY 619
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
++F + ++ + + N + D + + F+V+ N
Sbjct: 620 SKFDF---AWAEEPPASFAIGDLVANASSPVDLAT---------------FHTFQVNVTN 661
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
+G V +++ A + +K+++G+ R+
Sbjct: 662 LGPVASDFVAMLFGNTTAGPSPAPLKELVGYTRL 695
>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
Length = 741
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 289/730 (39%), Positives = 402/730 (55%), Gaps = 39/730 (5%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD+S R L+S TL+EK+ G+ A GVPRLGLP Y+WW EALHGV+ PG
Sbjct: 34 CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALHGVAE-SPGVI 92
Query: 115 F--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F ATSFP IL A+F+++L + VSTEARA N R+G+ +W+PNIN
Sbjct: 93 FAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGIDFWTPNINPF 152
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
+DPRWGR ETPGEDPF + Y N + GLQ L+ ++ + CKH+AAYD
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQG--------GLDPEYKRIVATCKHFAAYD 204
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
++NW+G RY FDA V+ QD+ E + R F C ++ + S MCSYN VNG+PSCA+ LL
Sbjct: 205 LENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGVPSCANSYLL 264
Query: 293 NQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+R W + YI +DCD+IQ + + H + A ++ + VA L AG DLDCG+YY
Sbjct: 265 QDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTA-TRAETVADALNAGTDLDCGEYYPE 323
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS--PQYVSLGKQDICSDENIELAAE 407
G A QG E+ ++++L Y L++LG+FD + Y +G ++ + E ELA
Sbjct: 324 NLGAAYDQGLFTESTLNRALIRQYAALVKLGYFDPADIQPYRQIGWANVSTPEAEELAYT 383
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY 467
AA EGI LLKND TLPL S +KT+A++GP ANAT M GNY G+ +SP+
Sbjct: 384 AAVEGITLLKND-GTLPL-SPSIKTIALIGPWANATTQMQGNYYGVAPYLISPLMAAEEL 441
Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
Y + V + +S AA AA+ ADA I G+D++VEAE++DR L PG Q
Sbjct: 442 GFTVYYSAGPGVDDPTTSSFPAAFAAAEAADAIIYAGGIDITVEAEAMDRYTLDWPGVQP 501
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
I+Q++ + K P+I++ G +D + N + A++W GYPG+ GG+AI D++ G
Sbjct: 502 DFIDQLSLLGK-PLIVLQFGGGQIDDSALLPNPGVNALVWGGYPGQSGGKAIMDIIVGNA 560
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
P GRLPIT Y DYV + +T M LRP S PGRTY +Y G + FG+GL YT F
Sbjct: 561 APAGRLPITQYPLDYVYQVAMTDMSLRP--SPTNPGRTYMWYTGTPIVEFGFGLHYTTFT 618
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
L +Y + C GV DL C + + + N GS+
Sbjct: 619 --------------ASLSQPSAPSYDIATLVSLCSGVAHPDL-C-PFASYTANVTNTGSS 662
Query: 708 DGSDVVIVYSKPPAEIAATYIKQV-IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
SD V + A Y +V + + R+ A + + SL+ VD NT
Sbjct: 663 VTSDFVSLLFLAGEHGPAPYPNKVLVAYDRLHAIAPLASQTTTLNLTLGSLSRVDDYGNT 722
Query: 767 LLPAGEHTIF 776
+L GE+T+
Sbjct: 723 ILYPGEYTLI 732
>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
Length = 534
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 242/531 (45%), Positives = 342/531 (64%), Gaps = 17/531 (3%)
Query: 253 MEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDS 312
+E+T+ PF+ CV EG +SVMCSYN+VNG P+CADP LL T+RG W L GYIV+DCDS
Sbjct: 1 LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDS 60
Query: 313 IQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYL 372
+ V+ D+ F + E+A A T+KAGLDLDCG + T AV +G +KE D++ +L L
Sbjct: 61 VGVLYDSQHF-TPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANL 119
Query: 373 YTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
+V MRLG FDG P Y +LG +D+C+ + LA EAAR+GIVLL+N LPL+ +
Sbjct: 120 LSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTR 179
Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
+TVAV+GP+++ATV MIGNYAG+ C Y +P+ G S Y + GC +VAC + I
Sbjct: 180 HRTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVKTIHAKGCANVACVGDQLIGE 239
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A AA+ ADA +++ GLD S+EAES DR + LPG Q +L+ ++ KGP ++V+MS G
Sbjct: 240 AEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGG 299
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+D++FA+ + I ILW GYPG+ GG AIADV+FG NPGG+LP+TWY Y+ +P+T
Sbjct: 300 PIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPMT 359
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCR 668
+M LRP S GYPGRTY+FY GP ++PFG+GLSY++F SF + +++L
Sbjct: 360 NMGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQ---SFAEAPTKISLPLSSLSP 416
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
N + T S T C V+DL +D +N G+ DGS ++V+S P + +
Sbjct: 417 NSSATVKVSHTDCAS--VSDL------PIMIDVKNTGTVDGSHTILVFSTVPNQTWSPE- 467
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
K +IGF++V + AG KR++ + C L+ VD +P GEH + +G+
Sbjct: 468 KHLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEHKLHIGD 518
>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
Length = 771
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 305/760 (40%), Positives = 420/760 (55%), Gaps = 54/760 (7%)
Query: 37 CDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
C G SKL + CD+S + R + LV M+ EKV A GVPRLGLP Y
Sbjct: 32 CTSGPLSKLAV-------CDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAY 84
Query: 97 EWWSEALHGVSNVGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
WWSEALHGV+ PG HF D P ATSF IL ASF++ L K++ V TE RA
Sbjct: 85 NWWSEALHGVAGA-PGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAF 143
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
N GRAGL YW+PNIN RDPRWGR ETPGEDP V RY + V GLQ G
Sbjct: 144 GNAGRAGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIG------- 196
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
+RP ++++ CKH+AAYD+++W GV R+ FDARV+ QD+ E +L F+ CV++ +VM
Sbjct: 197 PARP-QIAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVM 255
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAV 331
CSYN +NG+P+CADP LL +R WD ++V+DC +I + H + + +A
Sbjct: 256 CSYNALNGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNY-TKTGAEAA 314
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
A L AG DLDCG + G A +QG +D++L LY+ L++LG+FD + + Y
Sbjct: 315 AVALNAGTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFDPAEKQPYG 374
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
S+G +D+ + +LA +AA EGIVLLKNDQ TLPL + T+A++GP+ANAT M GN
Sbjct: 375 SIGWKDVDTPAAEQLAHKAAVEGIVLLKNDQ-TLPLKAK--GTLALIGPYANATKQMQGN 431
Query: 450 YAGIP--CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
Y G P R + A GY V Y G + S AA AAK AD + G+D
Sbjct: 432 YQGPPKYIRTLEWAATQHGY-QVQYSPGT-AINNSSTAGFAAALAAAKDADVVLYAGGID 489
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
++E+E+LDR + PG Q LI++++ + K P+I++ G VD TN ++ A+LW
Sbjct: 490 NTIESETLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLW 548
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
AGYP +EGG AI D++ GK P GRLPIT Y Y +P+T M LR PGRTY+
Sbjct: 549 AGYPSQEGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGLRAGGD--NPGRTYR 606
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
+Y+ + PFG+GL YT F +V+ ++ R Y + A R PG
Sbjct: 607 WYD-KAVVPFGFGLHYTSF-----------EVSWDR---GRLGPYNTAALVNRAPGGSHV 651
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRV-FVRAGRNK 745
D D F+V QN G+ V +++ K Y +K ++G+ RV V+ G +
Sbjct: 652 DRALFD--TFRVQVQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLVGYTRVQQVKPGERR 709
Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
++ L P G++T+ V G +P
Sbjct: 710 SVEIEVTLGAMARTAANGDLVLYP-GKYTLQVDVGERGYP 748
>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
Length = 803
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 305/760 (40%), Positives = 420/760 (55%), Gaps = 54/760 (7%)
Query: 37 CDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
C G SKL + CD+S + R + LV M+ EKV A GVPRLGLP Y
Sbjct: 64 CTSGPLSKLAV-------CDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAY 116
Query: 97 EWWSEALHGVSNVGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
WWSEALHGV+ PG HF D P ATSF IL ASF++ L K++ V TE RA
Sbjct: 117 NWWSEALHGVAGA-PGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAF 175
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
N GRAGL YW+PNIN RDPRWGR ETPGEDP V RY + V GLQ G
Sbjct: 176 GNAGRAGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIG------- 228
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
+RP ++++ CKH+AAYD+++W GV R+ FDARV+ QD+ E +L F+ CV++ +VM
Sbjct: 229 PARP-QIAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVM 287
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAV 331
CSYN +NG+P+CADP LL +R WD ++V+DC +I + H + + +A
Sbjct: 288 CSYNALNGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNY-TKTGAEAA 346
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
A L AG DLDCG + G A +QG +D++L LY+ L++LG+FD + + Y
Sbjct: 347 AVALNAGTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFDPAEKQPYG 406
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
S+G +D+ + +LA +AA EGIVLLKNDQ TLPL + T+A++GP+ANAT M GN
Sbjct: 407 SIGWKDVDTPAAEQLAHKAAVEGIVLLKNDQ-TLPLKAK--GTLALIGPYANATKQMQGN 463
Query: 450 YAGIP--CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
Y G P R + A GY V Y G + S AA AAK AD + G+D
Sbjct: 464 YQGPPKYIRTLEWAATQHGY-QVQYSPGT-AINNSSTAGFAAALAAAKDADVVLYAGGID 521
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
++E+E+LDR + PG Q LI++++ + K P+I++ G VD TN ++ A+LW
Sbjct: 522 NTIESETLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLW 580
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
AGYP +EGG AI D++ GK P GRLPIT Y Y +P+T M LR PGRTY+
Sbjct: 581 AGYPSQEGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGLRAGGD--NPGRTYR 638
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
+Y+ + PFG+GL YT F +V+ ++ R Y + A R PG
Sbjct: 639 WYD-KAVVPFGFGLHYTSF-----------EVSWDR---GRLGPYNTAALVNRAPGGSHV 683
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRV-FVRAGRNK 745
D D F+V QN G+ V +++ K Y +K ++G+ RV V+ G +
Sbjct: 684 DRALFD--TFRVQVQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLVGYTRVQQVKPGERR 741
Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
++ L P G++T+ V G +P
Sbjct: 742 SVEIEVTLGAMARTAANGDLVLYP-GKYTLQVDVGERGYP 780
>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
bisporus H97]
Length = 767
Score = 467 bits (1201), Expect = e-128, Method: Compositional matrix adjust.
Identities = 296/807 (36%), Positives = 436/807 (54%), Gaps = 77/807 (9%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVD-ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSL 59
M ++S++C ++ +AL F+ + D NG S VCDP +
Sbjct: 1 MNPFLASIVCAAIHVALGQFNYSFPDCVNGPLSSTAVCDPTKAP---------------- 44
Query: 60 PYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF--DD 117
+ R K L+ T +E +Q + + GVPRLG+P Y+WWSEALHGV+ PG F
Sbjct: 45 --AARAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVSFAPSG 101
Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
ATSFP I+ ++F+ L K + +STEARA N RAGL Y++PNIN +DPRW
Sbjct: 102 EFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRAGLDYFTPNINPFKDPRW 161
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAYDVDNW 236
GR ETPGEDPF V +Y + + GLQ ++ RP KV++ CKHYAAYD+D+W
Sbjct: 162 GRGQETPGEDPFHVSQYVYSLIDGLQG--------GIDPRPYFKVAADCKHYAAYDLDSW 213
Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
+G+DR+HFDA+V+ QD+ E +L F+ CV++ +SVMCSYN VNGIP+CA+P LL +
Sbjct: 214 EGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDIL 273
Query: 297 RGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
R W D ++ +DCD+I + H F D+ +AVA LKAG D+DCG Y+ +A
Sbjct: 274 RDFWGFDDDRWVTSDCDAIGNIFTTHNF-TDTFAEAVADALKAGTDVDCGTSYSTHLPDA 332
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREG 412
+ Q + D++++L YT LMRLG+FD S L D+ + LA AA EG
Sbjct: 333 LNQSLITRDDLERALTRQYTSLMRLGYFDPPESQPLRQLAWSDVNKPDAQALAHTAAVEG 392
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTY 472
+VLLKND LP+ SA KT+A++GP+ANAT M GNY G ++P G
Sbjct: 393 LVLLKND-GFLPV-SASGKTIAIIGPYANATKDMQGNYFGTAPFIVTPFQG-------AV 443
Query: 473 KTGCDDVACKSNNSIFAASE--------AAKTADATIILAGLDLSVEAESLDREDLWLPG 524
G ++V + SI SE A ++D I G++ S+E+E+ DR + G
Sbjct: 444 DAGFNEVVSAAGTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTG 503
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q L+ Q+A + K PV++V G +D + N ++A++WAGYPG+ GG AI DV+
Sbjct: 504 NQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVIT 562
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G P GRL +T Y D+V + +T M LRP + PGRTYK+Y G + FG+GL +T
Sbjct: 563 GAVAPAGRLSVTQYPEDFVNQVGMTDMALRPGSA--NPGRTYKWYTGRPVLEFGHGLHFT 620
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F ++ + N+ L H + + P ++ D F V+ +N
Sbjct: 621 TFDFSWRG-RPGRKYNIQHLLHTADKKF---------PDLIPLD-------TFHVNIRNT 663
Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYA 763
G+ V +++ + A A K ++ F R + AG + + N S+ VD
Sbjct: 664 GNITSDYVALLFLRSNAGFAPHPKKSLVSFARAHRIDAGSSATVDLGVN-LGSIARVDEH 722
Query: 764 ANTLLPAGEHTIF--VGNGGVSFPIHL 788
++ L AG++ + +G+G +S L
Sbjct: 723 GDSWLFAGDYQLVLDIGDGVLSHSFSL 749
>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 767
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 296/807 (36%), Positives = 439/807 (54%), Gaps = 77/807 (9%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVD-ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSL 59
M ++S++C ++ +AL F+ + D NG S VCDP +
Sbjct: 1 MNPFLASIVCAAIHVALGQFNYSFPDCVNGPLSSTAVCDPTKAP---------------- 44
Query: 60 PYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF--DD 117
+ R L+ T +E +Q + + GVPRLG+P Y+WWSEALHGV+ PG F
Sbjct: 45 --AARATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVSFAPSG 101
Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
ATSFP I+ ++F+ L K + +STEARA N RAGL Y++PNIN +DPRW
Sbjct: 102 EFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRAGLDYFTPNINPFKDPRW 161
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAYDVDNW 236
GR ETPGEDPF V +Y + + GLQ ++ RP KV++ CKHYAAYD+D+W
Sbjct: 162 GRGQETPGEDPFHVSQYVYSLIDGLQG--------GIDPRPYFKVAADCKHYAAYDLDSW 213
Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
+G+DR+HFDA+V+ QD+ E +L F+ CV++ +SVMCSYN VNGIP+CA+P LL +
Sbjct: 214 EGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDIL 273
Query: 297 RGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
R W D ++ +DCD+I + H F D+ +AVA LKAG D+DCG Y+ +A
Sbjct: 274 RDFWGFDDDRWVTSDCDAIGNIFTTHNF-TDTFAEAVADALKAGTDVDCGTSYSTHLPDA 332
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREG 412
+ Q + D++++L YT LMRLG+FD S L D+ + LA AA EG
Sbjct: 333 LNQSLITRDDLERALTRQYTSLMRLGYFDPPESQPLRQLAWSDVNKPDAQALAHTAAVEG 392
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTY 472
+VLLKND LP+ SA KT+A++GP+ANAT M GNY G ++P G
Sbjct: 393 LVLLKND-GFLPV-SASGKTIAIIGPYANATKDMQGNYFGTAPFIVTPFQG-------AV 443
Query: 473 KTGCDDVACKSNNSIFAASE--------AAKTADATIILAGLDLSVEAESLDREDLWLPG 524
G ++V + SI SE A ++D I G++ S+E+E+ DR + G
Sbjct: 444 DAGFNEVVSAAGTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTG 503
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q L+ Q+A + K PV++V G +D + N ++A++WAGYPG+ GG AI DV+
Sbjct: 504 NQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVIT 562
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G P GRL +T Y D+V + +T M LRP + PGRTYK+Y G + FG+GL +T
Sbjct: 563 GAVAPAGRLSVTQYPEDFVNQVGMTDMALRPGSA--NPGRTYKWYTGRPVLEFGHGLHFT 620
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F ++ + + + ++L +T+D + P ++ D F V+ +N
Sbjct: 621 TFDFSW-------RGRPGRKYNIQHLLHTAD---KKFPDLIPLD-------TFHVNIRNT 663
Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYA 763
G+ V +++ K A A K ++ F R + AG + + N S+ VD
Sbjct: 664 GNITSDYVALLFLKSNAGFAPHPKKSLVSFARAHRIDAGSSATVDLGVN-LGSIARVDEH 722
Query: 764 ANTLLPAGEHTIF--VGNGGVSFPIHL 788
++ L AG++ + +G+G +S L
Sbjct: 723 GDSWLFAGDYQLVLDIGDGVLSHSFSL 749
>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 786
Score = 463 bits (1191), Expect = e-127, Method: Compositional matrix adjust.
Identities = 260/611 (42%), Positives = 361/611 (59%), Gaps = 29/611 (4%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CDS+ + R + L+ T DE +Q + + GVPRLGLP YEWWSEALHGV +
Sbjct: 32 LKSTPVCDSAKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEALHGVGH 91
Query: 109 VGPGTHF--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
PG F ATSFP I+ A+F++ L K + VSTEARA N GRAGL Y++
Sbjct: 92 -SPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRAGLNYFT 150
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCC 225
PNIN +DPRWGR ETPGEDPF + +Y + V GLQ ++ P +KV++ C
Sbjct: 151 PNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG--------GIDPWPYIKVAADC 202
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+AAYD++NW+G+DR+HFDA+V++QD+ E +L PF+ CV++ A+SVMCSYN VNG+P+
Sbjct: 203 KHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNGVPA 262
Query: 286 CADPKLLNQTVRGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CA LL +R W D ++ +DC ++ + D+H F S +A A +LKAG D+DC
Sbjct: 263 CASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNF-TRSFAEAAAISLKAGTDIDC 321
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
G + + A+ Q + D+ ++ YT L+RLG+FD S Y D+ + E
Sbjct: 322 GSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSDSQTYRQFDWSDVNTPEA 381
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
L+ AA EG+VLLKND LPL + KT+A++GP+ NAT +M GNY G SP
Sbjct: 382 QALSRRAAVEGLVLLKND-GLLPL-APDGKTIAIIGPYTNATSSMQGNYFGNAPIITSP- 438
Query: 462 AGFSGYANVTYK---TGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
F G +V +K V S+ A AK AD + + G+D ++E E LDR
Sbjct: 439 --FQGAQDVGFKVVSAAGTTVNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLDRS 496
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
+ PG Q L+ +A + K P+I+V G VD N ++AI+WAGYPG+ GG A
Sbjct: 497 SISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGGTA 555
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
I D++ G P GRLP+T Y DY + +T M LRP S PGRTYK+Y P L +G
Sbjct: 556 IFDIIVGSTAPAGRLPVTQYPADYTHQVRMTDMSLRP--SSHNPGRTYKWYKTPVL-EYG 612
Query: 639 YGLSYTQFKYN 649
+GL +T F ++
Sbjct: 613 HGLHFTTFDFS 623
>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
SS2]
Length = 770
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 259/604 (42%), Positives = 368/604 (60%), Gaps = 25/604 (4%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD+SL + R LV T++E + + + GVPRLGLP Y+WWSE LHGV++ PG +
Sbjct: 37 CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVAD-SPGVN 95
Query: 115 FDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F P ATSFP I+ +A+F+++L K +G V E R+ N G AGL +W+PNIN
Sbjct: 96 FSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHAGLDFWTPNINPF 155
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAY 231
+DPRWGR ETPGEDP+ + +Y N ++GLQ +N P +V + CKH+A Y
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG--------GVNPEPYFQVVATCKHFAGY 207
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
D+++W+ RY FDA +T QD+ E +L F+ C ++ A + MCSYN VNGIP+CAD L
Sbjct: 208 DLEDWENNFRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASMCSYNAVNGIPTCADTYL 267
Query: 292 LNQTVRGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
L +R W D ++ +DCD+++ + + H + A ++ A A L+AG DLDCG +YT
Sbjct: 268 LQDILRDYWNFDETRWVTSDCDAVENIYNPHNYTALPQQ-AAADALRAGTDLDCGTFYTE 326
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
+ A Q + ET++ +L Y L+RLG+FD + Q Y G ++ + +LA
Sbjct: 327 YLPLAYNQSLITETELRAALTRQYASLVRLGYFDPAAQQPYRQYGWSNVDTPYAQQLAYT 386
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG--FS 465
AA EGI LLKND TLPL S +K +A++GP ANAT M GNY G+ +SP+ G +
Sbjct: 387 AATEGITLLKND-GTLPLPS-TLKNIALIGPWANATNQMQGNYFGVAPYLVSPLQGALAA 444
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
GY NVTY G ++ S AA AA+ ADA + G+D++VEAE++DR ++ PG
Sbjct: 445 GY-NVTYVFGT-NITSNSTAGFAAAIAAAREADAVVYAGGIDVTVEAEAMDRYNVTWPGN 502
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q QLI ++A + K P ++ G VD + N ++ +++WAGYPG+ GG+A+ D++ G
Sbjct: 503 QLQLIGELAALGK-PFVVAQFGGGQVDDTEIKANASVNSLIWAGYPGQSGGQALFDIISG 561
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
K P GRL T Y DYV +P+T M LRP + PGRTYK+Y G +Y FGYGL YT
Sbjct: 562 KVAPAGRLVTTQYPADYVYEIPMTDMNLRPNANGTTSPGRTYKWYTGAPVYEFGYGLHYT 621
Query: 645 QFKY 648
F Y
Sbjct: 622 NFTY 625
>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 757
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 259/622 (41%), Positives = 373/622 (59%), Gaps = 33/622 (5%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV---IP 120
RVK L+ +TL+EK+ L D + G RLGLP YEWW+EA HGV + PG F +
Sbjct: 25 RVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGVGSA-PGVQFTEKPVNFS 83
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
ATSFP ILT ASF+++L ++I + E RA N G +G +W+PNIN RDPRWGR
Sbjct: 84 YATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFSGFDFWAPNINPFRDPRWGRG 143
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
ETPGED FVV Y N++ GLQ + + +V + CKHYAAYD++
Sbjct: 144 QETPGEDSFVVQSYIRNFIPGLQGDDPEDK---------QVIATCKHYAAYDLE----TG 190
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
RY D T+QD+ + FL PF+ CV++ S+MC+YN V+GIP+CA LL+Q +R W
Sbjct: 191 RYGNDYNPTQQDLADYFLAPFKTCVRDTGVGSIMCAYNAVDGIPTCASEYLLDQVLRKHW 250
Query: 301 DL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN-AVQ 356
+ + Y+V+DC ++ + H F D++E A + +L AG+DL+CG Y + A
Sbjct: 251 NFTADYNYVVSDCGAVTDIWQYHNF-TDTEEAAASVSLNAGVDLECGSSYLKLNESLAAN 309
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLL 416
Q V+ +D++L LY+ L +GFFDG +Y +LG D+ + E LA EAA EG+ LL
Sbjct: 310 QTTVQA--LDQALTRLYSALFTVGFFDGG-KYTALGFADVSTPEAQSLAYEAAVEGMTLL 366
Query: 417 KNDQNTLPLNSA-KVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKT 474
KND+ LP+ S+ K K+VA++GP ANAT M G+Y+GIP +SP+ F G+ V Y
Sbjct: 367 KNDKRLLPIRSSHKYKSVALIGPFANATTQMQGDYSGIPPFLISPLEAFKGHDWEVNYAM 426
Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVA 534
G + ++ +A AA+ +D I L G+D S+EAE+LDR L PG Q L+ Q++
Sbjct: 427 GT-GINNQTTTGFASALAAAEKSDLVIYLGGIDNSIEAETLDRTSLTWPGNQLDLVTQLS 485
Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
++ K P+I+V G +D + N ++A++WAGYP + GG A+ DV+ GK + GRLP
Sbjct: 486 KLHK-PLIVVQFGGGQLDDSALLQNEGVQALVWAGYPSQSGGSALLDVLLGKRSIAGRLP 544
Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
+T Y Y + + + +RP DS YPGRTYK+Y G + PFGYGL YT+F++ T
Sbjct: 545 VTQYPASYADQVSIFDINIRPNDS--YPGRTYKWYTGMPVVPFGYGLHYTKFEFEWAQ-T 601
Query: 655 KTIQVNLNKL-QHCRNLNYTSD 675
+ N+ +L C++ SD
Sbjct: 602 LNHEYNIQQLVASCQSTGPISD 623
>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
Length = 374
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 219/382 (57%), Positives = 278/382 (72%), Gaps = 13/382 (3%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLGL YEWWSEALHGVSNVGPGT F P ATSFP VI T ASFN SLW+ IGQAVS
Sbjct: 1 RLGLQGYEWWSEALHGVSNVGPGTKFQGAFPAATSFPQVITTAASFNASLWQAIGQAVSD 60
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGEDP + +YA +YV GLQ G+
Sbjct: 61 EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQGNYGNR 120
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
LKV++CCKHY AYD+DNW G+DR+HF+A+V++QD+E+T+ PF+ CV EG
Sbjct: 121 ---------LKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQDLEDTYNVPFKACVLEGK 171
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+SVMCSYN+VNG P+CADP +L T+RG+W L+GYIV+DCDS+ V+ D+ + + E+
Sbjct: 172 VASVMCSYNQVNGKPTCADPDILRNTIRGQWHLNGYIVSDCDSVGVLYDDQHY-TRTPEE 230
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
A A T+ AGLDLDCG + T A++QG V E ++++L TV MRLG FDG P
Sbjct: 231 AAADTINAGLDLDCGPFLAVHTEGAIRQGLVTEAAVNQALANTITVQMRLGMFDGEPSAQ 290
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ +LG +D+C+ + +LA +AAREGIVLLKN +LPL++ + + +AV+GP+A AT M
Sbjct: 291 PFGNLGPRDVCTPAHQDLALQAAREGIVLLKNQVGSLPLSTVRHRNIAVIGPNAQATTTM 350
Query: 447 IGNYAGIPCRYMSPIAGFSGYA 468
IGNYAGI C Y SP+ G S YA
Sbjct: 351 IGNYAGIACGYTSPLQGISRYA 372
>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 797
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 285/706 (40%), Positives = 396/706 (56%), Gaps = 47/706 (6%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD+SL R K LV+ MTL+EK+ + G PRLGLP Y WW+EALHGV+ G G
Sbjct: 62 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120
Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F D ATSFP IL A+F++ L K++ +STEARA N G AGL YW+PNIN
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ETPGEDP + RY + V GLQD G E RP KV + CKH+AAYD
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVVATCKHFAAYD 232
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
++NW+G++RY FDA V+ QD+ E +L F+ C ++ +VMCSYN +NGIP+CAD LL
Sbjct: 233 LENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLL 292
Query: 293 NQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+R W ++ DC +I + +H ++A A A L AG DLDCG +
Sbjct: 293 QTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGTDLDCGSVFPE 351
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
+ G+A+QQG ++ +L LY+ L++LG+FD + Y S+G ++ + ELA +
Sbjct: 352 YLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVFTPAAEELAHK 411
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFS 465
A EGIV+LKND TLPL S TVA++GP ANAT + GNY G P R + A +
Sbjct: 412 ATVEGIVMLKND-GTLPLKSN--GTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHN 468
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
GY V + G D+ S+ A AAK AD I G+D ++E ES DR + PG
Sbjct: 469 GY-KVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGN 526
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q LI Q++++ K P+I+V G VD + N + A+LWAGYP + GG A+ D++ G
Sbjct: 527 QLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTG 585
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
K P GRLP+T Y YV +P+T M LRP + PGRTY++Y+ L PFG+GL YT
Sbjct: 586 KSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSN--NPGRTYRWYDKAVL-PFGFGLHYTT 642
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F V+ N H Y +D+ + V+ D F + N G
Sbjct: 643 FN-----------VSWN---HAEYGPYNTDSVASGTTNAPVDTELFD---TFSITVTNTG 685
Query: 706 STDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIKF 749
+ + +++ Y IK ++G+ R + G+++++K
Sbjct: 686 NVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKL 731
>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
bisporus H97]
Length = 763
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 259/611 (42%), Positives = 366/611 (59%), Gaps = 29/611 (4%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CDS+ + R + L+ T DE +Q + + GVPRLGLP YEWWSEALHGV +
Sbjct: 32 LKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEALHGVGH 91
Query: 109 VGPGTHF--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
PG F ATSFP I+ A+F++ L K + VSTEARA N GRAGL Y++
Sbjct: 92 -SPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRAGLNYFT 150
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCC 225
PNIN +DPRWGR ETPGEDPF + +Y + V GLQ ++ P +KV++ C
Sbjct: 151 PNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG--------GIDPWPYIKVAADC 202
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+AAYD++NW+G+DR+HFDA+V++QD+ E +L PF+ CV++ A+SVMCSYN VNG+P+
Sbjct: 203 KHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNGVPA 262
Query: 286 CADPKLLNQTVRGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CA LL +R W D ++ +DC ++ + D+H F S +A A +LKAG D+DC
Sbjct: 263 CASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNF-TRSFAEAAAISLKAGTDIDC 321
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
G + + A+ Q + D+ ++ YT L+RLG+FD S Y D+ + E
Sbjct: 322 GSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSHSQTYRQFDWSDVNTPEA 381
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
L+ AA EG+VLLKND LPL + KT+A++GP+ NAT +M GNY G SP
Sbjct: 382 QALSRRAAVEGLVLLKND-GLLPL-APDGKTIAIIGPYTNATSSMQGNYFGNAPFITSP- 438
Query: 462 AGFSGYANVTYK--TGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESLDRE 518
F G +V +K + + ++++ FA A A+ AD + + G+D ++E E LDR
Sbjct: 439 --FQGAQDVGFKVVSAAGTIVNGTSSAGFAEAINTARAADVVVFVGGIDNTLEREGLDRS 496
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
+ PG Q L+ +A + K P+I+V G VD N ++AI+WAGYPG+ GG A
Sbjct: 497 SISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANEKVQAIIWAGYPGQSGGTA 555
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
I D++ G P GRLP+T Y DY + +T M LRP S PGRTYK+Y P L +G
Sbjct: 556 IFDIIVGATAPAGRLPVTQYPADYTHQVRMTDMSLRP--SSHNPGRTYKWYKTPVL-EYG 612
Query: 639 YGLSYTQFKYN 649
+GL +T F ++
Sbjct: 613 HGLHFTTFDFS 623
>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 776
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 285/706 (40%), Positives = 396/706 (56%), Gaps = 47/706 (6%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD+SL R K LV+ MTL+EK+ + G PRLGLP Y WW+EALHGV+ G G
Sbjct: 41 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99
Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F D ATSFP IL A+F++ L K++ +STEARA N G AGL YW+PNIN
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ETPGEDP + RY + V GLQD G E RP KV + CKH+AAYD
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVVATCKHFAAYD 211
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
++NW+G++RY FDA V+ QD+ E +L F+ C ++ +VMCSYN +NGIP+CAD LL
Sbjct: 212 LENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLL 271
Query: 293 NQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+R W ++ DC +I + +H ++A A A L AG DLDCG +
Sbjct: 272 QTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGTDLDCGSVFPE 330
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
+ G+A+QQG ++ +L LY+ L++LG+FD + Y S+G ++ + ELA +
Sbjct: 331 YLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVFTPAAEELAHK 390
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFS 465
A EGIV+LKND TLPL S TVA++GP ANAT + GNY G P R + A +
Sbjct: 391 ATVEGIVMLKND-GTLPLKSN--GTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHN 447
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
GY V + G D+ S+ A AAK AD I G+D ++E ES DR + PG
Sbjct: 448 GY-KVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGN 505
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q LI Q++++ K P+I+V G VD + N + A+LWAGYP + GG A+ D++ G
Sbjct: 506 QLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTG 564
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
K P GRLP+T Y YV +P+T M LRP + PGRTY++Y+ L PFG+GL YT
Sbjct: 565 KSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSN--NPGRTYRWYDKAVL-PFGFGLHYTT 621
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F V+ N H Y +D+ + V+ D F + N G
Sbjct: 622 FN-----------VSWN---HAEYGPYNTDSVASGTTNAPVDTELFD---TFSITVTNTG 664
Query: 706 STDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIKF 749
+ + +++ Y IK ++G+ R + G+++++K
Sbjct: 665 NVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKL 710
>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 797
Score = 460 bits (1184), Expect = e-126, Method: Compositional matrix adjust.
Identities = 285/706 (40%), Positives = 395/706 (55%), Gaps = 47/706 (6%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD+SL R K LV+ MTL+EK+ + G PRLGLP Y WW+EALHGV+ G G
Sbjct: 62 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120
Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F D ATSFP IL A+F++ L K++ +STEARA N G AGL YW+PNIN
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ETPGEDP + RY + V GLQD G E RP KV + CKH+AAYD
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVVATCKHFAAYD 232
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
++NW+G++RY FDA V+ QD+ E +L F+ C ++ +VMCSYN +NGIP+CAD LL
Sbjct: 233 LENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLL 292
Query: 293 NQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+R W ++ DC +I + +H ++A A A L AG DLDCG +
Sbjct: 293 QTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGTDLDCGSVFPE 351
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
+ G+A+QQG + +L LY+ L++LG+FD + Y S+G ++ + ELA +
Sbjct: 352 YLGSALQQGLYNNQTLYNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVFTPAAEELAHK 411
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFS 465
A EGIV+LKND TLPL S TVA++GP ANAT + GNY G P R + A +
Sbjct: 412 ATVEGIVMLKND-GTLPLKSN--GTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHN 468
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
GY V + G D+ S+ A AAK AD I G+D ++E ES DR + PG
Sbjct: 469 GY-KVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGN 526
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q LI Q++++ K P+I+V G VD + N + A+LWAGYP + GG A+ D++ G
Sbjct: 527 QLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTG 585
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
K P GRLP+T Y YV +P+T M LRP + PGRTY++Y+ L PFG+GL YT
Sbjct: 586 KSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSN--NPGRTYRWYDKAVL-PFGFGLHYTT 642
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F V+ N H Y +D+ + V+ D F + N G
Sbjct: 643 FN-----------VSWN---HAEYGPYNTDSVASGTTNAPVDTELFD---TFSITVTNTG 685
Query: 706 STDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIKF 749
+ + +++ Y IK ++G+ R + G+++++K
Sbjct: 686 NVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKL 731
>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
Length = 893
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 266/635 (41%), Positives = 377/635 (59%), Gaps = 35/635 (5%)
Query: 21 STNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQ 80
S+N + +GS P DP + S CD+SL R K LV MT +EKVQ
Sbjct: 140 SSNPIPLSGSVKPNCTLDP---------LCSNPICDTSLDPLTRAKGLVDAMTFEEKVQN 190
Query: 81 LGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNES 138
+ + G RLGLP Y+WW+EALHGV+ PG F ATSFP IL +A+F+++
Sbjct: 191 TQNGSPGAARLGLPAYQWWNEALHGVAG-SPGVTFQPSGNFSYATSFPQPILMSAAFDDA 249
Query: 139 LWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNY 198
L K++G VS E RA N G AGL +W+PNIN RDPRWGR ETPGEDP+ + RY N
Sbjct: 250 LIKEVGTVVSIEGRAFNNYGNAGLDFWTPNINPFRDPRWGRGQETPGEDPYHIARYVYNL 309
Query: 199 VRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFL 258
V GLQ+ N +V + CKH+A YD+++W+G RY F+A ++ QD+ E +L
Sbjct: 310 VDGLQNGIAPANP--------RVVATCKHFAGYDIEDWEGNSRYGFNAIISTQDLSEYYL 361
Query: 259 RPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG---YIVADCDSIQV 315
PF+ C ++ ++MCSYN VNGIP+CAD LL+ +R W+ + ++ +DCD++
Sbjct: 362 PPFKSCARDAQVDAIMCSYNAVNGIPTCADSYLLDTILRDHWNWNQTGHWVTSDCDAVDN 421
Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTV 375
+ +H++ + S A A L AG +LDCG +N A Q K ++ +L YLY+
Sbjct: 422 IYSDHRYTS-SLAAAAADALNAGTNLDCGTTMSNNLAAAAAQDLFKNATLNSALVYLYSS 480
Query: 376 LMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLKND-QNTLPLNSAKVKTV 433
L+RLG+FD QY SLG D+ + + +LA AA EGIVLLKND + LPL S +T+
Sbjct: 481 LVRLGWFDSEDSQYSSLGWSDVGTTASQQLANRAAVEGIVLLKNDHKKVLPL-SQHGQTI 539
Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFS--GYANVTYKTGCDDVACKSNNSIFAAS 491
A++GP+ANAT + GNY G P + + G GY V Y+ G + + AA
Sbjct: 540 ALIGPYANATTQLQGNYYGTPAYIRTLVWGAEQMGYT-VQYEAGT-GINSTDTSGFAAAV 597
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
AAKTAD I G+D S+EAE++DR + G Q QLI+Q+++V K P++++ G +
Sbjct: 598 AAAKTADIVIYAGGIDNSIEAEAMDRNTIAWTGNQLQLIDQLSQVGK-PLVVLQFGGGQL 656
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
D + N N+ A+LW GYP + GG+A+ D++ G+ P GRLP+T Y +Y +P+T M
Sbjct: 657 DDSALLQNENVNALLWCGYPSQTGGQAVFDILTGQSAPAGRLPVTQYPANYTNAIPMTDM 716
Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
LRP S PGRTY++Y+ + PFG+GL YT F
Sbjct: 717 SLRPNGST--PGRTYRWYDDAVI-PFGFGLHYTTF 748
>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
Length = 776
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 284/706 (40%), Positives = 395/706 (55%), Gaps = 47/706 (6%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD+SL R K LV+ MTL+EK+ + G PRLGLP Y WW+EALHGV+ G G
Sbjct: 41 CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99
Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F D ATSFP IL A+F++ L K++ +STEARA N G AGL YW+PNIN
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ETPGEDP + RY + V GLQD G E RP KV + CKH+AAYD
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVVATCKHFAAYD 211
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
++NW+G++RY FDA V+ QD+ E +L F+ C ++ +VMCSYN +NGIP+CAD LL
Sbjct: 212 LENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLL 271
Query: 293 NQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
+R W ++ DC +I + +H ++A A A L AG DLDCG +
Sbjct: 272 QTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGTDLDCGSVFPE 330
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
+ +A+QQG ++ +L LY+ L++LG+FD + Y S+G ++ + ELA +
Sbjct: 331 YLRSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVFTPAAEELAHK 390
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFS 465
A EGIV+LKND TLPL S TVA++GP ANAT + GNY G P R + A +
Sbjct: 391 ATVEGIVMLKND-GTLPLKSN--GTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHN 447
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
GY V + G D+ S+ A AAK AD I G+D ++E ES DR + PG
Sbjct: 448 GY-KVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGN 505
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q LI Q++++ K P+I+V G VD + N + A+LWAGYP + GG A+ D++ G
Sbjct: 506 QLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTG 564
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
K P GRLP+T Y YV +P+T M LRP + PGRTY++Y+ L PFG+GL YT
Sbjct: 565 KSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSN--NPGRTYRWYDKAVL-PFGFGLHYTT 621
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F V+ N H Y +D+ + V+ D F + N G
Sbjct: 622 FN-----------VSWN---HAEYGPYNTDSVASGTTNAPVDTELFD---TFSITVTNTG 664
Query: 706 STDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIKF 749
+ + +++ Y IK ++G+ R + G+++++K
Sbjct: 665 NVASDYIALLFLTADRVGPEPYPIKTLVGYSRAKGIEPGQSQQVKL 710
>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
SS1]
Length = 778
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 260/606 (42%), Positives = 356/606 (58%), Gaps = 28/606 (4%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CDS+L R + LV +T+ EK + + GVPRLGLP Y WWSE LHGV++ PG
Sbjct: 42 CDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSEGLHGVAS-SPGVT 100
Query: 115 FDDV---IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
F ATSFP IL A+F+++L I +STEARA N +GL +W+PNIN
Sbjct: 101 FAPAGQNFSYATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFNHSGLDFWTPNINP 160
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
RDPRWGR ETPGEDPF + Y V GLQ G + + K+ + CKHYA Y
Sbjct: 161 VRDPRWGRSLETPGEDPFHLASYVAKLVTGLQ-FGGDD------PKYQKLVATCKHYAGY 213
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
D++NW G RY FDA ++ QD+ E FL PF+ C ++ + +SVMCSYN VNGIPSCA+ L
Sbjct: 214 DLENWGGYARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSVMCSYNAVNGIPSCANDYL 273
Query: 292 LNQTVRGEWDLH--------GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
L +R W Y+ +DCD++ + H + + E AVA +LKAG DLDC
Sbjct: 274 LQSLLRTYWGWEPDSESLNAHYVTSDCDAVSNIYYPHNYTI-TPEQAVAVSLKAGTDLDC 332
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDEN 401
G +Y + ++ +QG +TDID++L Y L LG+FD + Y +I +D
Sbjct: 333 GTFYAEWLPSSYEQGLFHQTDIDRALIRSYAALFLLGYFDPAEGQIYRQYNWANINTDYA 392
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
+LA AA EGI LLKN + LPL S + +A++GP ANAT M GNY GI SP+
Sbjct: 393 QQLAYTAAWEGITLLKNIDDMLPLPS-TMTNIALIGPWANATTQMQGNYQGIAPFLHSPL 451
Query: 462 AGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
NVTY G ++ S AA AA+TAD T+ + G+D++VEAE++DR ++
Sbjct: 452 YALQQRGINVTYVLGT-NITSNSTAGFAAALAAAQTADLTLYIGGIDITVEAEAMDRVNI 510
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
PG Q LI Q+A V+ +I+ M G +D N + +LW GYPG++GG A+
Sbjct: 511 TWPGNQLDLIAQLANVSTH-LIVYQMGGGQIDDTVLLENPKVHGLLWGGYPGQDGGTAMI 569
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
D+++G P GRLP++ Y +++ +P+T M L P +LG PGRTYK+Y+G + PFGYG
Sbjct: 570 DILYGSRAPAGRLPLSQYPANFINEVPMTDMRLHP--ALGTPGRTYKWYSGDLVLPFGYG 627
Query: 641 LSYTQF 646
L YT F
Sbjct: 628 LHYTTF 633
>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 822
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 273/652 (41%), Positives = 378/652 (57%), Gaps = 43/652 (6%)
Query: 34 VFVCDPGRFSKLGLQ---MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
V + + S + Q + S CD+SL + RV LV +TL+EK+ L D + G R
Sbjct: 56 VTILTAAKLSTIACQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTR 115
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAV 147
LGLP YEWWSEA HGV + PG F ATSFP ILT ASF+++L +KI + +
Sbjct: 116 LGLPSYEWWSEATHGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVI 174
Query: 148 STEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
E RA N G +G +W+PNIN RDPRWGR ETPGEDP V Y N+V GLQ +
Sbjct: 175 GREGRAFGNNGFSGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDP 234
Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
+V + CKHYA YD++ RY + T+QD+ + FL PF+ CV++
Sbjct: 235 KNK---------QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRD 281
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLA 324
D S+MCSYN V+GIP+CA+ LL++ +R W+ + Y+V+DC ++ + H F
Sbjct: 282 TDVGSIMCSYNSVSGIPACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-T 340
Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGN-AVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
D++E A + L AG+DL+CG Y + A Q VK +D+SL LY+ L +GFFD
Sbjct: 341 DTEEAAASVALNAGVDLECGSSYLKLNESLAANQTSVKV--MDQSLARLYSALFTVGFFD 398
Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANA 442
G +Y L D+ + + LA EAA EG+ LLKND + LPL+S K K+VAV+GP ANA
Sbjct: 399 GG-KYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVIGPFANA 456
Query: 443 TVAMIGNYAGIPCRYMSPIAGF-SGYANVTYKTGCDDVACKSNNSIF-AASEAAKTADAT 500
T M G+Y+G +SP+ F V Y G N S F A AA +D
Sbjct: 457 TTQMQGDYSGDAPYLISPLEAFGDSRWKVNYALGT--AMNNQNTSGFEEALAAANKSDLI 514
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
I L G+D S+E+E+LDR L PG Q LI +++++K P+++V G VD + N
Sbjct: 515 IYLGGIDNSLESETLDRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSDILKNK 573
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
+I+A++WAGYP + GG A+ DV+ GK +P GRLP+T Y Y + + + LRP DS
Sbjct: 574 DIQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDS-- 631
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI--QVNLNKL-QHCRN 669
YPGRTYK+Y G + PFGYGL YT+F ++ + KT+ + N+ L CRN
Sbjct: 632 YPGRTYKWYTGKPVLPFGYGLHYTKFMFD---WEKTLNREYNIQDLVASCRN 680
>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 732
Score = 457 bits (1176), Expect = e-125, Method: Compositional matrix adjust.
Identities = 266/625 (42%), Positives = 369/625 (59%), Gaps = 27/625 (4%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD+SL R +V T+DE + + GVPRLGLP Y+WWSE LHGV++ PG +
Sbjct: 22 CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVN 80
Query: 115 FD--DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F ATSFP I+ A+F++ L K +G V E R+ N GRAGL +W+PNIN
Sbjct: 81 FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPF 140
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAY 231
+DPRWGR ETPGEDP+ + +Y N V+GLQ L+ +P +V S CKH+AAY
Sbjct: 141 KDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG--------GLDPKPYYQVISTCKHFAAY 192
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
D+++W G RY FDA VT QD+ E +L F+ C ++ + MCSYN VNGIPSCA+ L
Sbjct: 193 DLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCANTYL 252
Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
L +R W ++ +DCD++ + D H + + E+AVA LKAG D+DCG +Y+
Sbjct: 253 LQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNY-TKTPEEAVADALKAGTDIDCGTFYSE 311
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS--PQYVSLGKQDICSDENIELAAE 407
+ A Q + ET++ ++L Y L+RLG+FD + Y ++ + + +LA +
Sbjct: 312 YLPGAYNQSLITETELRQALIRQYASLVRLGYFDPTDIQPYRQYNWNNVDTPQAQQLAYQ 371
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG--FS 465
AA EGIVLLKND TLPL S+ +K +A++GP NAT M GNY G+ +SP+ G +
Sbjct: 372 AAAEGIVLLKND-GTLPL-SSDIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMGAVAT 429
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
GY NVTY G ++ + AA AA+ AD I G+D +VE+E DR + PG
Sbjct: 430 GY-NVTYVFGT-NITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWPGN 487
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q L+ ++A V K P+++V G VD + N+ + A+LWAGYPG+ GG A+ D++ G
Sbjct: 488 QLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDIISG 546
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
K P GRLP+T Y DYV +P+T M LRP + PGRTYK+Y G +Y FGYGL YT
Sbjct: 547 KVAPAGRLPVTQYPADYVYEIPMTDMDLRP--NATSPGRTYKWYTGTPIYDFGYGLHYTT 604
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNL 670
F Y + N+ L NL
Sbjct: 605 FSYKWAK-APSSTYNIQTLVQSGNL 628
>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 766
Score = 457 bits (1175), Expect = e-125, Method: Compositional matrix adjust.
Identities = 266/625 (42%), Positives = 369/625 (59%), Gaps = 27/625 (4%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD+SL R +V T+DE + + GVPRLGLP Y+WWSE LHGV++ PG +
Sbjct: 37 CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVN 95
Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F ATSFP I+ A+F++ L K +G V E R+ N GRAGL +W+PNIN
Sbjct: 96 FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPF 155
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAY 231
+DPRWGR ETPGEDP+ + +Y N V+GLQ L+ +P +V S CKH+AAY
Sbjct: 156 KDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG--------GLDPKPYYQVISTCKHFAAY 207
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
D+++W G RY FDA VT QD+ E +L F+ C ++ + MCSYN VNGIPSCA+ L
Sbjct: 208 DLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCANTYL 267
Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
L +R W ++ +DCD++ + D H + + E+AVA LKAG D+DCG +Y+
Sbjct: 268 LQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNY-TKTPEEAVADALKAGTDIDCGTFYSE 326
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS--PQYVSLGKQDICSDENIELAAE 407
+ A Q + ET++ ++L Y L+RLG+FD + Y ++ + + +LA +
Sbjct: 327 YLPGAYNQSLITETELRQALIRQYASLVRLGYFDPTDIQPYRQYNWNNVDTPQAQQLAYQ 386
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG--FS 465
AA EGIVLLKND TLPL S+ +K +A++GP NAT M GNY G+ +SP+ G +
Sbjct: 387 AAAEGIVLLKND-GTLPL-SSDIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMGAVAT 444
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
GY NVTY G ++ + AA AA+ AD I G+D +VE+E DR + PG
Sbjct: 445 GY-NVTYVFGT-NITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWPGN 502
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q L+ ++A V K P+++V G VD + N+ + A+LWAGYPG+ GG A+ D++ G
Sbjct: 503 QLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDIISG 561
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
K P GRLP+T Y DYV +P+T M LRP + PGRTYK+Y G +Y FGYGL YT
Sbjct: 562 KVAPAGRLPVTQYPADYVYEIPMTDMDLRP--NATSPGRTYKWYTGTPIYDFGYGLHYTT 619
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNL 670
F Y + N+ L NL
Sbjct: 620 FSYKWAK-APSSTYNIQTLVQSGNL 643
>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 761
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 270/612 (44%), Positives = 366/612 (59%), Gaps = 31/612 (5%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
++S CD+SL + R + LV +T+ E + A GVPRLGLP Y WW+EALHGV+
Sbjct: 29 LASNAVCDTSLGHVERARALVEELTVAEMINNTVHTAPGVPRLGLPPYNWWNEALHGVA- 87
Query: 109 VGPGTHFDDVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
PG F PG ATSFP I ++F+++L +G STEARA N G AGL
Sbjct: 88 ASPGVVF--TSPGEEFSSATSFPMPINMGSAFDDALMLAVGNVTSTEARAFNNAGLAGLD 145
Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
YW+PNIN +DPRWGR ETPGEDP RY V GLQ ++ LKV++
Sbjct: 146 YWTPNINPFKDPRWGRGAETPGEDPLHAARYVRTLVEGLQ--------GGIDPPSLKVAA 197
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
CKH+AAYD+++W GV RY FDA VT QD+ E + PF+ CV++ A+SVMCSYN VNG+
Sbjct: 198 DCKHWAAYDLEDWGGVARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASVMCSYNAVNGV 257
Query: 284 PSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
P+CA P LL +R W L ++ +DCD++ + D H + D + A +LKAG DL
Sbjct: 258 PACASPYLLKTVLRDAWGLAEDRWVTSDCDAVGNVYDPHGYTEDFV-NGSAVSLKAGSDL 316
Query: 342 DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICS 398
DCG Y+ + A +G + E D+ +L LY L+ LG+FD +P+ Y + D+ +
Sbjct: 317 DCGTTYSQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFD-APEDQPYRQISWADVNT 375
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT-VAMIGNYAGIPCRY 457
LA AA E VLLKND TLPL + + ++A++GP ANA+ V + GNY GIP
Sbjct: 376 PAAQALAYTAAIESFVLLKND-GTLPLTDSSL-SIALIGPMANASAVQLQGNYNGIPPFA 433
Query: 458 MSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
++P+ GF +G+ NVTY G + V + I A AA+ AD I + G+D +VE E+
Sbjct: 434 IAPLQGFLDAGF-NVTYVLGTN-VTGNDADDIDGAVAAAEAADVVIYVGGIDSTVEEEAK 491
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
DR ++ P Q L++ + E K P+++V M G +D + + + AILWAGYPG+ G
Sbjct: 492 DRTEISWPDNQLALLSALEEAGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSG 550
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
G AIAD V GK P GRL IT Y YV + +T M LRP +S G PGRTYK+Y G +Y
Sbjct: 551 GTAIADTVMGKVAPAGRLSITQYPASYVDAVAMTDMTLRPDNSTGNPGRTYKWYTGTPVY 610
Query: 636 PFGYGLSYTQFK 647
P+GYGL YT F
Sbjct: 611 PYGYGLHYTNFS 622
>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 822
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 269/651 (41%), Positives = 378/651 (58%), Gaps = 41/651 (6%)
Query: 34 VFVCDPGRFSKLGLQ---MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
V + + S + Q + S CD+SL + RV LV +TL+EK+ L D + G R
Sbjct: 56 VTILTAAKLSTIACQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTR 115
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAV 147
LGLP YEWWSEA HGV + PG F ATSFP ILT ASF+++L +KI + +
Sbjct: 116 LGLPSYEWWSEATHGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVI 174
Query: 148 STEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
E RA N G +G +W+PNIN RDPRWGR ETPGEDP V Y N+V GLQ +
Sbjct: 175 GREGRAFGNNGFSGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDP 234
Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
+V + CKHYA YD++ RY + T+QD+ + FL PF+ CV++
Sbjct: 235 KNK---------QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRD 281
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLA 324
D S+MCSYN V+GIP+CA+ LL++ +R W+ + Y+V+DC ++ + H F
Sbjct: 282 TDVGSIMCSYNSVSGIPACANEYLLDEVLRKHWNFNSDYYYVVSDCGAVTDIWQYHNF-T 340
Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGN-AVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
D++E A + L AG+DL+CG Y + A Q VK +D+SL LY+ L +GFFD
Sbjct: 341 DTEEAAASVALNAGVDLECGSSYLKLNESLAANQTSVKV--MDRSLARLYSALFTVGFFD 398
Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN-SAKVKTVAVVGPHANA 442
G +Y L D+ + + LA EAA EG+ LLKND + LPL+ K K+VAV+GP ANA
Sbjct: 399 GG-KYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDFPHKYKSVAVIGPFANA 456
Query: 443 TVAMIGNYAGIPCRYMSPIAGF-SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
T M G+Y+G +SP+ F V Y G + ++ + A AA +D I
Sbjct: 457 TTQMQGDYSGDAPYLISPLEAFGDSRWKVNYALGT-AINNQNTSGFEEALAAANKSDLII 515
Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
L G+D S+E+E+LDR L PG Q LI +++++K P+++V G VD + N +
Sbjct: 516 YLGGIDNSLESETLDRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSAILKNKD 574
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
I+A++WAGYP + GG A+ DV+ GK +P GRLP+T Y Y + + + LRP DS Y
Sbjct: 575 IQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDS--Y 632
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI--QVNLNKL-QHCRN 669
PGRTYK+Y G + PFGYGL YT+F ++ + KT+ + N+ L CRN
Sbjct: 633 PGRTYKWYTGKPVLPFGYGLHYTKFMFD---WEKTLNREYNIQDLVASCRN 680
>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
Length = 882
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 270/634 (42%), Positives = 372/634 (58%), Gaps = 40/634 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CD+SL + RV LV +TL+EK+ L D + G RLGLP YEWWSEA HGV +
Sbjct: 134 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS 193
Query: 109 VGPGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
PG F ATSFP ILT ASF+++L +KI + + E RA N G +G +W
Sbjct: 194 -APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFW 252
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN RDPRWGR ETPGEDP V Y N+V GLQ + +V + C
Sbjct: 253 APNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNK---------QVIATC 303
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++ RY + T+QD+ + FL PF+ CV++ D S+MCSYN V+GIP+
Sbjct: 304 KHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPA 359
Query: 286 CADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
CA+ LL++ +R W+ + Y+V+DC ++ + H F D++E A + L AG+DL+
Sbjct: 360 CANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-TDTEEAAASVALNAGVDLE 418
Query: 343 CGQYYTNFTGN-AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDEN 401
CG Y + A Q VK +D+SL LY+ L +GFFDG +Y L D+ + +
Sbjct: 419 CGSSYLKLNESLAANQTSVKV--MDQSLARLYSALFTVGFFDGG-KYDKLDFSDVSTPDA 475
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
LA EAA EG+ LLKND + LPL+S K K+VAV+GP ANAT M G+Y+G +SP
Sbjct: 476 QALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISP 534
Query: 461 IAGFS-GYANVTYKTGCDDVACKSNNSIF-AASEAAKTADATIILAGLDLSVEAESLDRE 518
+ F V Y G N S F A AA +D I L G+D S+E+E+LDR
Sbjct: 535 LEAFGDSRWKVNYALGT--AMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRT 592
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
L PG Q LI +++++K P+++V G VD + N +I+A++WAGYP + GG A
Sbjct: 593 SLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTA 651
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+ DV+ GK +P GRLP+T Y Y + + + LRP DS YPGRTYK+Y G + PFG
Sbjct: 652 LLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDS--YPGRTYKWYTGKPVLPFG 709
Query: 639 YGLSYTQFKYNLLSFTKTI--QVNLNKL-QHCRN 669
YGL YT+F ++ + KT+ + N+ L CRN
Sbjct: 710 YGLHYTKFMFD---WEKTLNREYNIQDLVASCRN 740
>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 781
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 287/742 (38%), Positives = 403/742 (54%), Gaps = 55/742 (7%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
CD S R K LV+ TL+EK+ + A GV RLG+P Y+WW+E LHG++ GP T
Sbjct: 36 ICDPSASTLARAKSLVALYTLEEKINATSNSAPGVARLGVPPYQWWNEGLHGIA--GPFT 93
Query: 114 HFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
F +TSFP IL A+F++ L ++ + +STEARA N R GL +W+PNIN
Sbjct: 94 SFAKQGDYSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRTGLDFWTPNINP 153
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
RDPRWGR ETPGED + + Y + GLQ NATD R V + CKHYA Y
Sbjct: 154 FRDPRWGRGQETPGEDSYHLSSYVKALIHGLQG-----NATDPYRR---VVATCKHYAGY 205
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
D++NW G RY D ++++QD+ E +L PFE CV + + + MCSYN VNG P CADP L
Sbjct: 206 DIENWNGNLRYQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYL 264
Query: 292 LNQTVRGEW----DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
L +R W D H ++ +DCD+IQ + H++ + ++E A A +L AG DLDCG Y
Sbjct: 265 LQTVLREHWGWSSDDH-WVTSDCDAIQNVYLPHQW-SSTREGAAADSLNAGTDLDCGTYL 322
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIEL 404
AV+QG ET +DK+L Y+ L++LG+FD +P+ Y LG + + + L
Sbjct: 323 QTHLPGAVKQGLTDETTLDKALIRQYSSLIKLGYFD-APENQPYRQLGFDAVATSASQAL 381
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
A +AA EGIVLLKND LP+N K V + G ANAT + GNY G+ SP+
Sbjct: 382 ALKAAEEGIVLLKND-GVLPINLGS-KQVGIYGDWANATSQLQGNYFGVAKFLTSPLMAL 439
Query: 465 SGYA-NVTYK----TGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRED 519
+V Y G D + +S+ S T+D I + G+D VE+E DR
Sbjct: 440 QNLGVDVKYAGNLPGGQGDPTTGAWSSL---SGVITTSDVHIWVGGIDNGVESEDRDRSW 496
Query: 520 LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAI 579
L L G Q +I Q+A+ K PVI+VIM G +D + N I A+LWAGYPG++GG AI
Sbjct: 497 LTLTGGQLDVIGQLADTGK-PVIVVIMGGGQIDTSPLIRNPKISAVLWAGYPGQDGGTAI 555
Query: 580 ADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
+++ GK P GRLP T Y YV +P+T M +RP D PGRTYK+Y G ++ FGY
Sbjct: 556 VNILTGKAAPAGRLPQTQYPSKYVSEVPMTDMAMRPSDK--NPGRTYKWYTGEPIFEFGY 613
Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP--GVLVNDLRCDDYFEF 697
GL YT F ++ + K + ++ C ++ RCP G+ V+ + +
Sbjct: 614 GLHYTNFSASITNQPKQSYAISDLVKGCN----STGGFLERCPFTGITVS---VQNTGKI 666
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
D+ +G GS Y K K ++ + R+F A + + SL
Sbjct: 667 SSDYVTLGFLTGSFGPKPYPK----------KSLVAYDRLFNIAAGSSSTATLNLTLASL 716
Query: 758 NIVDYAANTLLPAGEHTIFVGN 779
VD + N +L G++ + + N
Sbjct: 717 ARVDESGNKVLYPGDYELQIDN 738
>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
Length = 752
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 295/751 (39%), Positives = 415/751 (55%), Gaps = 53/751 (7%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
++S CD+SL + R + LV T+ E + + A GVPRLGLP YEWW+EALHGV
Sbjct: 30 LASNPVCDASLGHVERARALVEEFTVPEMINNTVNAAFGVPRLGLPPYEWWNEALHGV-G 88
Query: 109 VGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
+ PG F + P ATSFP I ++F+++L +G +STEARA N GRAGL YW+P
Sbjct: 89 LSPGVVFFEPEPAVATSFPMPINMGSAFDDALMLAMGDVISTEARAFSNAGRAGLDYWTP 148
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
NIN +DPRWGR ETPGEDP +A YVR L VEG + D S LKV++ CKH
Sbjct: 149 NINPFKDPRWGRGAETPGEDPL----HAARYVRSL--VEGLQGGIDPPS--LKVAAACKH 200
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
+AAYD++NW GV RY FDA VT QD+ E + PF CV++ A+S MCSYN VNG+P+CA
Sbjct: 201 WAAYDLENWGGVTRYAFDAVVTPQDLAEYYAPPFRSCVRDARAASAMCSYNAVNGVPACA 260
Query: 288 DPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
P LL +R W L ++ +DC ++ + D H + D +A +LKAG DL+CG
Sbjct: 261 SPYLLKTVLRDAWGLAEDRWVTSDCGAVGNVYDPHGYTED-LVNASTVSLKAGTDLNCGT 319
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENI 402
YT + A +G + E D+ +L LY L+ LG+FD +P+ Y + D+ + E
Sbjct: 320 NYTQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFD-APEDQPYRQITWADVNTPEAQ 378
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT-VAMIGNYAGIPCRYMSPI 461
LA AA + VLLKND TLPL + + ++A++GP ANA+ + M+GNY GIP ++P+
Sbjct: 379 ALAYTAAIKSFVLLKND-GTLPLTDSTL-SLALIGPMANASALQMLGNYFGIPPFVIAPL 436
Query: 462 AGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRED 519
GF +G+ NVTY G +V S AA AA+ AD I + G+D ++E E DR +
Sbjct: 437 QGFLDAGF-NVTYVLGT-NVTGNDAGSFDAAVAAAEAADVVIYVGGIDNTLEMEEKDRTE 494
Query: 520 LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAI 579
+ P Q L++ + V K P+++V M G +D + + + AILWAGYPG+ GG AI
Sbjct: 495 ISWPDNQLALLSALEGVGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTAI 553
Query: 580 ADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
AD V GK P GRL YV + +T M LRP ++ G PGRTYK+Y G +YP+GY
Sbjct: 554 ADTVTGKVAPAGRL--------YVDEVAMTDMTLRPDNATGNPGRTYKWYTGTPVYPYGY 605
Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL-NYTSDASKTRCPGVLVNDLRCDDYFEFK 698
GL YT S + + C ++ + T +AS DL D F+
Sbjct: 606 GLHYTNISVAWAS---------DAPEACYSIQDLTGEASG-------FVDLAPLD--TFR 647
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSL 757
V N G V +++ A A IK+++ + R V+ G + ++
Sbjct: 648 VTVTNEGDIASDFVALLFVSTQAGPAPAPIKEMVAYARASDVQPGNSTEVELEVTLGALA 707
Query: 758 NIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
+ +L P F +G +S L
Sbjct: 708 RTDESGDASLYPGKYELTFDYDGALSLSFEL 738
>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 775
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 266/633 (42%), Positives = 371/633 (58%), Gaps = 38/633 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CD+SL + RV LV +TL+EK+ L D + G RLGLP YEWWSEA HGV +
Sbjct: 27 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS 86
Query: 109 VGPGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
PG F ATSFP ILT ASF+++L +KI + + E R N G +G +W
Sbjct: 87 -APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGFSGFDFW 145
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN RDPRWGR ETPGEDP V Y N+V GLQ + +V + C
Sbjct: 146 APNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNK---------QVIATC 196
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++ RY + T+QD+ E FL PF+ CV++ D S+MCSYN V+GIP+
Sbjct: 197 KHYAVYDLE----TGRYGNNYNPTQQDLSEYFLAPFKTCVRDTDVGSIMCSYNSVSGIPA 252
Query: 286 CADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
CA+ LL++ +R W+ + Y+V+DC ++ + H F D++E A + L AG+DL+
Sbjct: 253 CANEYLLDEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-TDTEEAAASVALNAGVDLE 311
Query: 343 CGQYYTNFTGN-AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDEN 401
CG Y + A Q VK +D+SL LY+ L +GFFDG +Y L D+ + +
Sbjct: 312 CGSSYLKLNESLAANQTSVKV--MDQSLARLYSALFTVGFFDGG-KYDKLDFSDVSTPDA 368
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
LA EAA EG+ LLKND + LPL+S K K+VAV+GP ANAT M G+Y+G +SP
Sbjct: 369 QALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISP 427
Query: 461 IAGF-SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRED 519
+ F V Y G + ++ + A AA +D I L G+D S+E+E+LDR
Sbjct: 428 LEAFGDSRWKVNYALGT-AINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTS 486
Query: 520 LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAI 579
L PG Q LI +++++K P+++V G VD + N +I+A++WAGYP + GG A+
Sbjct: 487 LAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTAL 545
Query: 580 ADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
DV+ GK +P GRLP+T Y Y + + + LRP D YPGRTYK+Y G + PFGY
Sbjct: 546 LDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDL--YPGRTYKWYTGKPVLPFGY 603
Query: 640 GLSYTQFKYNLLSFTKTI--QVNLNKL-QHCRN 669
GL YT+F ++ + KT+ + N+ L CRN
Sbjct: 604 GLHYTKFMFD---WEKTLNREYNIQDLVASCRN 633
>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
Length = 774
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 290/712 (40%), Positives = 401/712 (56%), Gaps = 43/712 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ CD+S R L+ T +E + + GVPRLGLP Y WWSE LHGV+
Sbjct: 35 LSNNTVCDTSKDPITRATALIDLWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ 94
Query: 109 VGPGTHF--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
PG F ATSFP IL A+F++ L + + VSTE RA N+GRAGL YW+
Sbjct: 95 -SPGVTFAPSGNFSYATSFPQPILMGAAFDDPLIQAVASVVSTEGRAFNNVGRAGLDYWT 153
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCC 225
PNIN +DPRWGR ETPGEDPF + Y N + GLQ L+ P KV + C
Sbjct: 154 PNINPFKDPRWGRGQETPGEDPFHLQGYVYNLILGLQG--------GLDPTPYFKVVADC 205
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+AAYD+DNW+G RY F+A VT+QD+ E +L F+ CV++ +SVMCSYN VNGIPS
Sbjct: 206 KHFAAYDMDNWEGNVRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVMCSYNAVNGIPS 265
Query: 286 CADPKLLNQTVRGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CA+ LL +R W D ++ +DCD++Q + H + D+ A A L AG D+DC
Sbjct: 266 CANSFLLQDILRDYWGFDDTRWVTSDCDAVQNIYTPHNY-TDNPAQAAADALLAGTDIDC 324
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDEN 401
G + + + +A+ QG V TD+ ++ Y L+RLG+FD S Y LG D+ + E
Sbjct: 325 GTFSSTYLPDALSQGLVNATDLKRAAIRQYASLVRLGYFDPPESQPYRQLGWSDVNTPEA 384
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
+LA AA EG+VLLKND TLPL S V+ +A++GP ANAT M GNYAGI +SP+
Sbjct: 385 QQLAHTAAVEGMVLLKND-GTLPL-SKHVRKLALIGPWANATTLMQGNYAGIAPYLISPL 442
Query: 462 AGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA-GLDLSVEAESLDRE 518
G +G+ +V Y G +V ++ S FAA+ AA +I A GLD +VE E +DR
Sbjct: 443 LGAQQAGF-DVEYVFGT-NVTTTNDTSGFAAAVAAAKRADAVIFAGGLDETVEREEVDRL 500
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
++ PG Q L+ ++A V K P+I+ G +D + ++ ++ AI+W GYPG+ GG A
Sbjct: 501 NVTWPGNQLDLVAELASVGK-PLIVAQFGGGQLDDSALKSKRSVNAIIWGGYPGQSGGTA 559
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+ D++ GK P GRLPIT Y +Y +P+T M LRP S PGRTYK+Y G ++ FG
Sbjct: 560 LFDILTGKAAPAGRLPITQYPAEYANQVPMTDMTLRP--SATNPGRTYKWYTGTPVFEFG 617
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD---ASKTRCPGVLVNDLRCDDYF 695
+GL YT F + S N + +Y+ D AS + L DL D F
Sbjct: 618 FGLHYTTFSFAWAS---------NAHANTPAASYSIDALMASGNKSAAFL--DLAPLDTF 666
Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+V N G V ++++ A KQ++ + RV A + I
Sbjct: 667 AVRV--TNTGKMTSDYVALLFASGTFGPAPHPNKQLVAYTRVHGVAPKQSTI 716
>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
gi|223975771|gb|ACN32073.1| unknown [Zea mays]
Length = 507
Score = 453 bits (1165), Expect = e-124, Method: Compositional matrix adjust.
Identities = 231/511 (45%), Positives = 325/511 (63%), Gaps = 18/511 (3%)
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MCSYN+VNG P+CAD LL+ +RG+W L+GYI +DCDS+ V+ +N + + EDA A
Sbjct: 1 MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPEDAAAI 59
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
++KAGLDL+CG + T AVQ GK+ E+D+D+++ LMRLGFFDG P+ + +
Sbjct: 60 SIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGN 119
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
LG D+C+ N ELA EAAR+GIVLLKN LPL++ +K++AV+GP+ANA+ MIGNY
Sbjct: 120 LGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNY 178
Query: 451 AGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAGLDLS 509
G PC+Y +P+ G Y+ GC +V C N+ + AA++AA +AD T+++ G D S
Sbjct: 179 EGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQS 238
Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
+E ESLDR L LPG Q QL++ VA + GP ILV+MS G DI+FA+++ I AILW G
Sbjct: 239 IERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVG 298
Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
YPGE GG AIADV+FG NP GRLP+TWY + + +P+T M +RP S GYPGRTY+FY
Sbjct: 299 YPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTK-VPMTDMRMRPDPSTGYPGRTYRFY 357
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
G T+Y FG GLSYT F ++L+S K + + L + C +CP V
Sbjct: 358 TGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLT---------EQCPSVEAEGA 408
Query: 690 RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
C+ F+ + +N G G V ++S PPA + K ++GF++V + G+ +
Sbjct: 409 HCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPA-VHNAPAKHLLGFEKVSLEPGQAGVVA 467
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
F + CK L++VD N + G HT+ VG+
Sbjct: 468 FKVDVCKDLSVVDELGNRKVALGSHTLHVGD 498
>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 714
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 284/749 (37%), Positives = 396/749 (52%), Gaps = 95/749 (12%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D SL + R KDLVSRMTL EK+ Q+ A +PRL +P Y WW+E LHGV+ G
Sbjct: 13 YKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGVARAGI-- 70
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYW 165
AT FP I A+FN L K+ +A+S EARA ++ GLT+W
Sbjct: 71 --------ATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGIYKGLTFW 122
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDP++ R V +V+GLQ + + LKV +
Sbjct: 123 SPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQGD---------DPKYLKVVATP 173
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA V + R+ FDARV+++D+ ET+L FE CVKEG A S+M +YNR NG P
Sbjct: 174 KHYA---VHSGPESQRHSFDARVSQKDLRETYLPAFEECVKEGKAVSIMGAYNRTNGEPC 230
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CA LL +R EW GY+V+DC +I + +HK + E A A + G +L+CG+
Sbjct: 231 CASKTLLKDILRDEWGFDGYVVSDCGAIDDIHMHHKVTKTAAESA-ALAVNNGCELNCGK 289
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDIC-SDENI 402
Y + AV+QG + E ID+++ L+T MRLG FD P+ V D+ S E+
Sbjct: 290 TY-EYLCQAVEQGLISEETIDQAVIKLFTARMRLGMFD-PPEMVRYAHIPYDVNDSPEHR 347
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
ELA E AR+ IVLLKND+N LPL S K+KT+AV+GP+A+ ++ NY G P +Y++P+
Sbjct: 348 ELALETARQSIVLLKNDENILPL-SKKLKTIAVIGPNADDLDVLLANYFGTPSKYVTPLE 406
Query: 463 GF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES---- 514
G S V Y GC +V S + A A+ AD I+ GL +E E
Sbjct: 407 GIKNKVSPDTKVLYAKGC-EVTGNSVDGFDEAVNIAEMADIVIMCLGLSPRIEGEEGDVA 465
Query: 515 -----LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
DR + LPG Q QL+ + K P++LV+++ + I +A + ++ AI+ A
Sbjct: 466 DSDGGGDRLHIDLPGMQEQLLETIYGTGK-PIVLVLLNGSAIAINWA--HEHVPAIIEAW 522
Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
YPGEEGG AIADV+FG +NP GRLPIT+ L +P P GRTY+++
Sbjct: 523 YPGEEGGTAIADVLFGDYNPAGRLPITFVRS-------LDDLP--PFTDYNMKGRTYRYF 573
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
LYPFGYGLSYT FKY+ L S R P
Sbjct: 574 EKEPLYPFGYGLSYTSFKYSNLRL-----------------------SAMRLP------- 603
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
+ + VD +N G G +VV +Y ++Q+ G Q + + G+ + + F
Sbjct: 604 -AGNNLDINVDVENTGKLAGREVVQLYISDVEASVEVPMRQLCGIQCITLEPGQKQTVSF 662
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ +++ DY +L G+ I VG
Sbjct: 663 TVEP-QHMSLFDYDGKRILEPGQFIIAVG 690
>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
Length = 581
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 224/403 (55%), Positives = 275/403 (68%), Gaps = 45/403 (11%)
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
DVEG EN TDLNSRPLKVSSCCKHYA YD+D+W V+EQDM+ETF PFE
Sbjct: 4 DVEGTENVTDLNSRPLKVSSCCKHYATYDIDSW---------LNVSEQDMKETFFSPFE- 53
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
R EWDLHGYIV+DC ++V+VDN +L
Sbjct: 54 ---------------------------------RDEWDLHGYIVSDCYGLEVIVDNQNYL 80
Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
+SK DAVA+TL+AGLDL+CG YYT+ +V GKV + ++D++LK +Y +LMR+G+FD
Sbjct: 81 NESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFD 140
Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
G P Y SLG +DIC+ ++IELA EAAR+GIVLLKND LPL K + +VGPHANAT
Sbjct: 141 GIPAYESLGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPGK--KLVLVGPHANAT 198
Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
MIGNYAG+P +Y+SP+ FS NVTY TGC D +C ++ A EAAK A+ TII
Sbjct: 199 EVMIGNYAGLPYKYVSPLEAFSAIGNVTYATGCLDASCSNDTYFSEAKEAAKFAEVTIIF 258
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
G DLS+EAE +DR D LPG QT+LI QVAEV+ GPVILV++S +DI FA+ N I
Sbjct: 259 VGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRIS 318
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
AILW G+PGE+GG AIADVVFGK+NPGGRLP+TWY DYV L
Sbjct: 319 AILWVGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVACL 361
>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
RWD-64-598 SS2]
Length = 770
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 252/603 (41%), Positives = 359/603 (59%), Gaps = 23/603 (3%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN-VGPGT 113
CD+SL + R L+ T+DE + ++A GVPRLGLP YEWWSE LHGV+N G
Sbjct: 37 CDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPRLGLPAYEWWSEGLHGVANSAGVTW 96
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
ATSFP IL +A+F+++L K +G + E RA N G AGL +W+PNIN +
Sbjct: 97 SITGPFSYATSFPQPILMSAAFDDALIKAVGGVIGMEGRAFNNYGHAGLDFWTPNINPFK 156
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAYD 232
DPRWGR ETPGEDP+ + +Y N ++GLQ L+ P +V + CKH+A YD
Sbjct: 157 DPRWGRGQETPGEDPYHIAQYVYNLIQGLQG--------GLDPEPYFQVVATCKHFAGYD 208
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+++W RY ++A ++ QD+ E +L F+ C ++ A + MCSYN +NGIP+CAD LL
Sbjct: 209 LEDWDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASMCSYNAINGIPTCADTYLL 268
Query: 293 NQTVRGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
+RG W D ++ DCDS++ + D H + A ++ A A LKAG D+DCG +YT +
Sbjct: 269 QDILRGFWGFDQTRWVTGDCDSVEDIYDFHHYTALPQQ-AAADALKAGSDIDCGIFYTTW 327
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEA 408
A + + E D+ +L Y L+RLG+FD + + Y ++ + ELA A
Sbjct: 328 LPLAYTESLITEQDLRAALTRQYASLVRLGYFDPASEQPYRQYNWSNVDTSYAQELAYTA 387
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG--FSG 466
A EGI LLKND TLP +SA +K +A++GP AT M GNY G +SP G +G
Sbjct: 388 AVEGITLLKND-GTLPFSSA-IKNIALIGPWTFATTQMQGNYYGNAPYLISPYQGAQLAG 445
Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
Y N++Y +V + + AA AA+ ADA + + G+D +VEAE++DR D+ P +Q
Sbjct: 446 Y-NISYVLET-NVTSNTTDGYAAAFTAAQGADAIVFVGGIDNTVEAEAMDRNDITWPAFQ 503
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
LI ++ ++ K P+++V G VD N ++ A+LW GYPG+ GG+A+ D++ GK
Sbjct: 504 LWLIGELGKLGK-PLVVVQFGGGQVDDTEINANPDVNALLWGGYPGQSGGQALFDIISGK 562
Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
P GRL T Y DYV +P+T+M LRP + PGRTYK+Y G +Y FGYGL YT
Sbjct: 563 VAPAGRLVSTQYPADYVNEIPMTNMNLRPDANGTTSPGRTYKWYTGTPVYEFGYGLHYTN 622
Query: 646 FKY 648
F Y
Sbjct: 623 FTY 625
>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 849
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 276/726 (38%), Positives = 389/726 (53%), Gaps = 43/726 (5%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPG 121
R K LV+ TL+EK+ + A GV RLG+P Y+WW+E LHG++ GP T F
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIA--GPFTSFAKQGDYSY 171
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
+TSFP IL A+F+++L ++ +STEARA N+ R GL +W+PNIN RDPRWGR
Sbjct: 172 STSFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRTGLDFWTPNINPFRDPRWGRGQ 231
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ETPGED + + Y + GLQ N TD R V + CKHYA YD++NW G R
Sbjct: 232 ETPGEDSYHLSSYVKALIHGLQG-----NETDPYRR---VVATCKHYAGYDIENWNGNLR 283
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW- 300
Y D ++++QD+ E +L PFE CV + + + MCSYN VNG P CADP +L +R W
Sbjct: 284 YQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYMLQTVLREHWG 342
Query: 301 ---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
D H ++ +DCDSIQ + H++ + ++E A A +L AG DLDCG Y + AV+Q
Sbjct: 343 WSSDEH-WVTSDCDSIQNVYLPHQW-SSTREGAAADSLNAGTDLDCGTYLQSHLPGAVKQ 400
Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
G ET +D +L Y+ L++LG+FD + Y LG + + + LA +AA EGIVL
Sbjct: 401 GLTNETTLDNALIRQYSSLIKLGYFDIPENQPYRQLGFDAVATSASQALALKAAEEGIVL 460
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKT 474
LKND LP+N K V + G ANAT + GNY G+ SP NV Y
Sbjct: 461 LKND-GVLPINFGS-KNVGIYGDWANATSQLQGNYFGVAKFLTSPYMALEKLGVNVRYAG 518
Query: 475 GC-DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQV 533
+ S S T+D I + G+D +E+E DR L L G Q +I Q+
Sbjct: 519 NLPGGQGDPTTGSWPRLSGVITTSDVHIWVGGMDNGIESEDRDRSWLTLTGSQLDVIGQL 578
Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
A+ K PVI++IM G +D + N I A+LWAGYPG++GG AI +++ GK P GRL
Sbjct: 579 ADTGK-PVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGGTAIVNILTGKAAPAGRL 637
Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
P T Y YV +P+T M +RP + PGRTYK+Y G ++ FGYGL YT F ++ +
Sbjct: 638 PQTQYLYKYVSEVPMTDMAMRPSNK--NPGRTYKWYTGKPIFEFGYGLHYTNFSASITNQ 695
Query: 654 TKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVV 713
K + ++ C ++ RCP +N V QN G T V
Sbjct: 696 PKQSYAISDLVKGCN----STGGFLERCPFTGIN-----------VSVQNTGKTSSDYVT 740
Query: 714 IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
+ + K ++ + R+F A + + SL VD + N +L G++
Sbjct: 741 LGFLTGSFGPKPYPKKSLVAYDRLFNIAASSSSTATLNLTLASLARVDESGNKVLYPGDY 800
Query: 774 TIFVGN 779
+ + N
Sbjct: 801 ELQIDN 806
>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 758
Score = 446 bits (1148), Expect = e-122, Method: Compositional matrix adjust.
Identities = 256/604 (42%), Positives = 355/604 (58%), Gaps = 31/604 (5%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD++ R LVS TL EK+ G+ + GVPR+GLP Y+WW+EALHG++ GTH
Sbjct: 34 CDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNEALHGIAY---GTH 90
Query: 115 FDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
F ATSFP IL A+F+++L + +STEARA N R GL +W+PNIN
Sbjct: 91 FAAAGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNANRYGLNFWTPNINP 150
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS-SCCKHYAA 230
+DPRWGR ETPGEDPF V Y V GLQ L+ P K + CKHYA
Sbjct: 151 YKDPRWGRGQETPGEDPFHVSSYVNALVTGLQG--------GLDDLPYKKGVATCKHYAG 202
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
YD++N G+ RY FDA + QD+ + +L F+ C ++ + S+MCSYN VNG+P+CAD
Sbjct: 203 YDLENGGGIQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCSYNAVNGVPTCADDW 262
Query: 291 LLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
LL +R W + ++ +DCD++Q + D+H + + + E A A L AG DLDCG ++
Sbjct: 263 LLQSLLREHWGWVEEDQWVTSDCDAVQNIWDSHNYTS-TPEQAAADALNAGTDLDCGGFW 321
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELA 405
+ G+A Q + +D+SL Y L+RLG+FD + Y LG D+ + +LA
Sbjct: 322 PTYLGSAYNQSLYNISTLDRSLTRRYASLVRLGYFDPASIQPYRQLGWSDVSTPSAEQLA 381
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP-IAGF 464
+AA +GIVLLKND LPL S + VA++GP ANAT M GNY G SP IA
Sbjct: 382 LQAAEDGIVLLKND-GILPLPS-NITNVALIGPWANATTQMQGNYYGQAPYLHSPLIAAQ 439
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
+ +VTY G D+ + AA AAK AD I + G+D S+EAE+ DR+ + P
Sbjct: 440 NAGFHVTYVQGA-DIDSTNTTEFTAAIAAAKKADVIIYIGGIDNSIEAEAKDRKTIAWPS 498
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGG-VDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
Q L+NQ+A ++ + L+I G +D + TN + I+WAGYPG++GG AI +++
Sbjct: 499 SQISLVNQLANLS---IPLIISQMGTMIDSSSLLTNRGVNGIIWAGYPGQDGGTAIFNIL 555
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GK P GRLPIT Y DYV + + +M L P PGRTYK++NG +++ FG+GL Y
Sbjct: 556 TGKTAPAGRLPITQYPSDYVNEVSMNNMNLHP--GANNPGRTYKWFNGTSIFDFGFGLHY 613
Query: 644 TQFK 647
T F
Sbjct: 614 TTFN 617
>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 796
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 257/610 (42%), Positives = 362/610 (59%), Gaps = 32/610 (5%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD++ + R +V MTL+EKV +G A G RLGLP Y+W +EALHGV+ G
Sbjct: 75 CDTTKSIAERAAAIVKPMTLNEKVANVGSSASGSARLGLPAYQWQNEALHGVAG-STGVQ 133
Query: 115 FDDVI----PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
F + ATSFP IL +A+F+++L K + A+STEARA N G AGL +W+PNIN
Sbjct: 134 FQSPLGANFSAATSFPMPILLSAAFDDALVKSVATAISTEARAFANYGFAGLDFWTPNIN 193
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
RDPRWGR ETPGED F + Y + V GLQ ++ + S CKH+AA
Sbjct: 194 PFRDPRWGRGMETPGEDAFRIQGYVLALVDGLQG--------GIDPDFYRTLSTCKHFAA 245
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
YD++N + + + T+QDM + +L FE CV++ +S+MC+YN V+G+P+CAD
Sbjct: 246 YDIENGRTAN----NLSPTQQDMADYYLPMFETCVRDAKVASIMCAYNAVDGVPACADSY 301
Query: 291 LLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
LL +R + Y+V+DCD+++ + D H + A+ + A A ++ AG DLDCG Y
Sbjct: 302 LLQDVLRDTYGFTEDFNYVVSDCDAVENVFDPHHYAANLTQ-AAAMSINAGTDLDCGSSY 360
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAE 407
N +VQ G E +DKSL LY+ L+++G+FD +Y SLG ++ + ++ LA +
Sbjct: 361 -NVLNASVQAGLTTEATLDKSLIRLYSALVKVGYFDQPAEYNSLGWGNVNTTQSQALAHD 419
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SG 466
AA EG+ LLKND TLPL S + VAV+GP AN T M GNYAG ++P++ F
Sbjct: 420 AATEGMTLLKND-GTLPL-SRTLSNVAVIGPWANVTTQMQGNYAGTAPLLVNPLSVFQQK 477
Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
+ NV Y G + + + AA AA ++D + L G+D+SVE E DR + PG Q
Sbjct: 478 WRNVKYAQGT-AINSQDTSGFNAALSAASSSDVIVYLGGIDISVENEGFDRSSITWPGNQ 536
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
LI+Q+A + K P+++V G +D + +N+ + +ILWAGYPG++GG AI DV+ G
Sbjct: 537 LNLISQLANLGK-PLVIVQFGGGQIDDSALLSNSKVNSILWAGYPGQDGGNAIFDVLTGA 595
Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
P GRLP+T Y +YV + M LRP S G PGRTY +Y G + PFGYGL YT F
Sbjct: 596 NPPAGRLPVTQYPANYVNNNNIQDMNLRP--SNGIPGRTYAWYTGTPVLPFGYGLHYTNF 653
Query: 647 KYNLLSFTKT 656
LSF T
Sbjct: 654 S---LSFQST 660
>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
Length = 698
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 258/619 (41%), Positives = 360/619 (58%), Gaps = 26/619 (4%)
Query: 47 LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
L +S+ CD+SL + R LV++ T E + + A GVPRLG+PQY+WW+EALHGV
Sbjct: 27 LPLSTLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWTEALHGV 86
Query: 107 SNVGPGTHFDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGL 162
+ PG +F+ G ATSFP VI A+F+++L++ + ++ E RA N GRAGL
Sbjct: 87 AR-SPGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSNAGRAGL 145
Query: 163 TYWSP-NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
+SP NIN RDPRWGR ET GEDP + RYAV VRGLQ + A N R L +
Sbjct: 146 NMYSPLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQGPAAQDEA---NPR-LTL 201
Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
++ CKHY AYD++ GV+RY FDA V+ QD+ + L F CV++G A+++M SYN VN
Sbjct: 202 AATCKHYLAYDLEASAGVERYQFDALVSNQDLADLHLPQFRACVRDGGATTLMTSYNAVN 261
Query: 282 GIPSCADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
G+P A L R W L H Y+ +DCD++ + D H + A A A +L AG
Sbjct: 262 GVPPSASKYYLETLARDTWGLDKHHNYVTSDCDAVANVYDAHHYAA-DYVHAAAASLNAG 320
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDI 396
DLDCG Y + A+ Q I +++ +Y L+RLG+FD + LG +D+
Sbjct: 321 TDLDCGATYRDSLAAALAQNLTDVATIRRAVTRMYGSLVRLGYFDAAEAQPLRQLGWKDV 380
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
+ +LA EAA I LLKN Q+TLPL KT+A++GP+ NAT A+ GNYAG
Sbjct: 381 NAPAAQKLAYEAAAASITLLKNRQSTLPLRETAGKTIALIGPYTNATFALRGNYAGPSPL 440
Query: 457 YMSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
++P FS A++ G + AA AK+AD + G+D +VE
Sbjct: 441 VITPFDAARRTFSD-AHIVSANGTSIAGPYDTATASAALATAKSADIIVYAGGIDPTVEG 499
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG-VDIAFAETNTNIKAILWAGYP 571
ESLDR D+ P Q +LI ++A + K V++V+ GG VD A + + + A++WAGYP
Sbjct: 500 ESLDRRDIAWPANQLRLIQELAALGK--VLVVVQFGGGQVDGALLKGDDGVGALVWAGYP 557
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ G A+ D++ GK P GRLPIT Y +Y L T+M LRP + YPGRTYK+Y G
Sbjct: 558 GQSGALALMDILAGKRAPAGRLPITQYPANYTHALRETTMALRPTAT--YPGRTYKWYTG 615
Query: 632 PTLYPFGYGLSYTQFKYNL 650
+PFG+GL YT F+ ++
Sbjct: 616 TPTFPFGFGLHYTTFRASI 634
>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
Length = 799
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 250/604 (41%), Positives = 357/604 (59%), Gaps = 28/604 (4%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ CD+S Y R + L++ TL+E + + GVPRLGLP YE WSE LHG+
Sbjct: 62 IVCDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSEGLHGLDRA--- 118
Query: 113 THF---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
HF D ATSFP IL+ A+ N +L +I ++T+ARA N+GR GL ++PNI
Sbjct: 119 -HFVKSGDEWTWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNI 177
Query: 170 NVARDPRWGRITETPGEDP-FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
N R P WGR ETPGED F+ YA Y+ GLQ +N LK+++ KH+
Sbjct: 178 NGFRSPLWGRGQETPGEDANFLTSSYAYEYITGLQGGIDPDN--------LKIAATAKHF 229
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
A YD++NW G R FDAR+T+QD+ E + F + A S MCSYN VN IPSC+
Sbjct: 230 AGYDLENWGGNSRLGFDARITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNAIPSCSS 289
Query: 289 PKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
LL +R +WD +GY+ +DCD++ + + H + A ++ A A++L+AG D+DCGQ
Sbjct: 290 SFLLQTLLREQWDFPEYGYVSSDCDAVYNVFNPHGY-ASNQSSAAAESLRAGTDIDCGQT 348
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGKQDICSDENIELA 405
Y+ + +G V +I++S+ LY+ L++LG+FDG +Y LG D+ + + ++
Sbjct: 349 YSWHLNQSFIEGSVTRGEIERSILRLYSNLVKLGYFDGDKNEYRQLGWNDVVTTDAWNIS 408
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
EAA EGIVLLKND LPL S VK+VA+VGP ANAT + GNY G ++P+ G S
Sbjct: 409 YEAAVEGIVLLKND-GVLPL-SKNVKSVALVGPWANATKQLQGNYFGTAPYLITPLQGAS 466
Query: 466 --GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
GY V Y G +++ + + A AAK +D + L G+D ++EAE DR ++ P
Sbjct: 467 DAGY-KVNYALGT-NISGNTTDGFANALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWP 524
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
Q LI Q+++ K P++++ M G VD + ++N+ + A++W GYPG+ GG+AI D++
Sbjct: 525 RNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKSNSKVNALIWGGYPGQSGGKAIFDIL 583
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GK P GRL T Y +Y P T M LRP D PG+TY +Y G +Y FGYGL Y
Sbjct: 584 KGKRAPAGRLVSTQYPAEYATQFPATDMSLRP-DGKSNPGQTYMWYIGKPVYEFGYGLFY 642
Query: 644 TQFK 647
T FK
Sbjct: 643 TTFK 646
>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 781
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 261/602 (43%), Positives = 361/602 (59%), Gaps = 27/602 (4%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD + R L+S T +E + + GVPRLGLP Y WWSE LHGV+ PG
Sbjct: 41 CDVTKDPITRATALISIWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVT 99
Query: 115 F--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F ATSFP IL A+F++ L + I VSTE RA N GRAGL YW+PNIN
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAIATIVSTEGRAFNNAGRAGLDYWTPNINPF 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAY 231
+DPRWGR ETPGEDPF + +Y N + GLQ L+ +P KV + CKH+AAY
Sbjct: 160 KDPRWGRGQETPGEDPFHLSQYVYNLILGLQG--------GLDPKPYFKVVADCKHFAAY 211
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
D+DNW+GV RY F+A V++QD+ E +L PF+ CV++ +SVMCSYN VNGIPSCA+ L
Sbjct: 212 DMDNWEGVVRYGFNAVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFL 271
Query: 292 LNQTVRGEWDLHG--YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
L +R W ++ +DCD++Q + H + D + A A L AG D+DCG + +
Sbjct: 272 LQDVLRDHWGFTDDRWVTSDCDAVQNIFTPHNYTTDPAQ-AAADALLAGTDIDCGTFSST 330
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAE 407
+ A+Q+G V TD+ ++ Y L+RLG+FD + Y LG D+ + + +LA
Sbjct: 331 YLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFDDPAAQPYRQLGWSDVNTLQAQQLAHT 390
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF--S 465
AA EG+VLLKND LPL S +V+ +A++GP ANAT + GNY GI +SP+ G +
Sbjct: 391 AAVEGMVLLKND-GLLPL-SKRVRKLALIGPWANATRLLQGNYFGIAPYLVSPVQGAQQA 448
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA-GLDLSVEAESLDREDLWLPG 524
G+ V Y G +V +++ S FAA+ AA ++ A GLD +VE E +DR ++ PG
Sbjct: 449 GF-EVEYVFGT-NVTTRNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDRLNVTWPG 506
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q L+ ++ V K P+I+ G +D + + + AI+W GYPG+ GG A+ D++
Sbjct: 507 NQLDLVAELERVGK-PLIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGTALFDILT 565
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
GK P GRLPIT Y Y + +P+T M LRP S PGRTYK+Y+G ++ FG+GL YT
Sbjct: 566 GKAAPAGRLPITQYPAAYAEQVPMTDMTLRP--SATNPGRTYKWYSGTPVFEFGFGLHYT 623
Query: 645 QF 646
F
Sbjct: 624 TF 625
>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
Length = 783
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 284/750 (37%), Positives = 404/750 (53%), Gaps = 45/750 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ C+++ + R K LV+ +T +EK G+ + GVPRLGL Y+WW EALHGV++
Sbjct: 29 LSNNTVCNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQEALHGVAS 88
Query: 109 VGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
PG +F ATSFP IL +A+F+++L + VSTEARA N+ R+GL +W+
Sbjct: 89 -SPGVNFSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNRSGLDFWT 147
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
PNIN +DPRWGR ETPGED F + Y + GLQ LN KV + CK
Sbjct: 148 PNINPYKDPRWGRGQETPGEDTFHLKSYVAALIDGLQG--------GLNPPIKKVIATCK 199
Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
H+ AYD+++W DRY+FDA V+ QD+ E +++PF+ C ++ S+MCSYN +NG+P+C
Sbjct: 200 HFVAYDLEDWITTDRYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIMCSYNAMNGVPTC 259
Query: 287 ADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
ADP +L +R W D Y+ +DCD+IQ + H + ++E AVA L AG DL+C
Sbjct: 260 ADPYILQTVLREHWNWTDDGQYVTSDCDAIQNIYAPH-YYEPTREQAVADALTAGTDLNC 318
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
G YY A +G +T ID+++ LY+ L++LG+FD + Y SL D+ +
Sbjct: 319 GTYYQTHLPAAFSEGLFNQTVIDQTITRLYSALIKLGYFDPPSATPYRSLNWSDVSTPAA 378
Query: 402 IELAAEAAREGIVLLKNDQNTLPLN--SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
LA +AA EGIVLLKND LPL+ + K TVA++G ANAT M GNY GI S
Sbjct: 379 EALALKAAEEGIVLLKND-GLLPLSFPTDKNTTVAIIGGWANATTTMQGNYFGIAPYLHS 437
Query: 460 PIAGFSGYANVT--YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
P+ N+ Y G + + AA AD II GL S E+ES DR
Sbjct: 438 PLYALQQLPNINAVYGGGFGVPTTDGWDELLG---AAGEADLIIIADGLTTSDESESNDR 494
Query: 518 EDL-WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+ W P +INQ++ + K P + + M +D N NI A++W GYPG GG
Sbjct: 495 YTIGWQPA-AIDIINQLSGMGK-PTVFLQM-GDQLDNTPLLNNPNISALIWGGYPGMAGG 551
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
A+ +++ GK P GRLP+T Y DYV + +T M LRP + G PGRTYK+YN L P
Sbjct: 552 DALINILTGKAAPAGRLPVTQYPADYVNQVNMTDMELRPNATSGNPGRTYKWYNNAVL-P 610
Query: 637 FGYGLSYTQFKY--NLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
FGYGL YT F + +T + + +Y + + C L +
Sbjct: 611 FGYGLHYTNFSVAASAQGQAQTQSGPSSNSSQGQGTSYNISSLVSSCDRSQYAYLDLCPF 670
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY------IKQVIGFQRVF-VRAGRNKRI 747
F V+ N GS SD V + I+ +Y IKQ++ +QR+F + AG +
Sbjct: 671 ESFNVNVTNTGSKLASDFVAL-----GFISGSYGPQPYPIKQLVAYQRLFNISAGASATA 725
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
SL D N +L G++ + +
Sbjct: 726 TLNL-TLGSLARHDENGNAVLYPGDYGLLI 754
>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 923
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 265/611 (43%), Positives = 353/611 (57%), Gaps = 38/611 (6%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
C++SLP + RV+ LV ++TL EK+ L D A G R+GLP YEWWSEALHGV+ PG
Sbjct: 165 CNTSLPIADRVRWLVGQLTLQEKITNLVDGASGSARVGLPPYEWWSEALHGVA-ASPGVT 223
Query: 115 F----DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
F ATSFP I +A+F++ L +I V E RA N G +G +W+PNIN
Sbjct: 224 FAGPNGTAFSYATSFPMPITISAAFDDDLVSQIAAVVGREGRAFANHGLSGFDFWTPNIN 283
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL--KVSSCCKHY 228
RDPRWGR ETPGED F + +Y + + GLQ S PL ++ + CKHY
Sbjct: 284 PFRDPRWGRGPETPGEDAFRIQQYIRHLIPGLQ-----------GSDPLDKQIIATCKHY 332
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
A YDV+ RY +D D+ E +L PF+ CV++ SVMCSYN V+GIP+CA
Sbjct: 333 AVYDVE----TGRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVDGIPACAS 388
Query: 289 PKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
LL +R W + Y+V+DCD+++ + H F DS A A L AG DL+CG
Sbjct: 389 EYLLQSVLRDHWGFTEPYQYVVSDCDAVRFIYSPHNF-TDSPAAAAAVALNAGTDLECGS 447
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y N ++ E +D++L LYT L +GFFDGS +Y LG + + + LA
Sbjct: 448 TYLNLN-QSLASNMTTEAALDRALTRLYTALHTIGFFDGSARYGGLGWDAVGTGDAQVLA 506
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
+AA +G VLLKN+++ LPL+S +++ +AV+GP ANAT M GNY G +SP+A F
Sbjct: 507 YQAAVDGAVLLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFGQAAYLVSPLAAFQ 566
Query: 466 ---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
G NV + G +A S AA AAK ADA + L G+D SVE+ESLDR +
Sbjct: 567 SAWGADNVLFANGT-GIAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESLDRTAISW 625
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
PG Q LI Q+A V K P+++V G +D + N + A+LWAGYPG+ GG AIAD+
Sbjct: 626 PGNQLDLIAQLAAVGK-PLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAGGAAIADL 684
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG------YPGRTYKFYNGPTLYP 636
+ GK P GRLP+T Y Y + L LRP S G +PGRTYK+Y G + P
Sbjct: 685 LTGKQAPAGRLPVTQYAASYTSEVSLFDPSLRPRRSGGSKSHSTFPGRTYKWYTGKPVLP 744
Query: 637 FGYGLSYTQFK 647
FGYGL YT F+
Sbjct: 745 FGYGLHYTTFR 755
>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
Length = 860
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 256/610 (41%), Positives = 362/610 (59%), Gaps = 30/610 (4%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD++ + R +V MTL+EKV +G A G RLGLP Y+W +EALHGV+ G
Sbjct: 139 CDTTKSIAARAAAIVKPMTLNEKVANVGSSASGSGRLGLPAYQWQNEALHGVAG-STGVQ 197
Query: 115 FDDVI----PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
F + ATSFP IL +A+F+++L + + A+STEARA N G AGL +W+PNIN
Sbjct: 198 FQSPLGANFSAATSFPMPILLSAAFDDALVQSVATAISTEARAFANYGFAGLDFWTPNIN 257
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
RDPRWGR ETPGED F + Y ++ + GLQ ++ + S CKH+AA
Sbjct: 258 PFRDPRWGRGMETPGEDAFRIQGYVLSLINGLQG--------GIDPDFFRTISTCKHFAA 309
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
YD++N + + + T+QDM + +L FE CV++ S+MC+YN VNG+P+CAD
Sbjct: 310 YDIENGRTAN----NLSPTQQDMADYYLPMFETCVRDAKVGSIMCAYNSVNGVPACADSY 365
Query: 291 LLNQTVR---GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
LL +R G + Y+V+DCD+++ + D H + A+ + A A +L AG DLDCG Y
Sbjct: 366 LLQSVLRDGYGFTEDFNYVVSDCDAVENVYDPHHYAANLTQ-AAAMSLNAGTDLDCGSSY 424
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAE 407
N +VQ G E +DKSL LY+ L+++G+FD +Y SLG ++ + + LA +
Sbjct: 425 -NVLNASVQAGMTTEATLDKSLIRLYSALIKVGWFDQPAKYSSLGWGNVNTTQTRALAHD 483
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SG 466
AA G+ LLKND TLPL S ++ VAV+GP NAT + GNYAG ++P+ F
Sbjct: 484 AATGGMTLLKND-GTLPL-SPTLQNVAVIGPWVNATTQLQGNYAGTAPVLVNPLTVFQQK 541
Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
+ NV Y G + + + AA AA ++D + L G+D+SVE E DR + PG Q
Sbjct: 542 WRNVKYAQGT-AINSQDTSGFNAAISAASSSDVIVYLGGIDISVENEGFDRTAITWPGNQ 600
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
LI+Q+A + K P+++V G +D + +N+ + +ILWAGYPG+EGG A+ DV+ G
Sbjct: 601 LSLISQLANLGK-PLVIVQFGGGQIDDSSLLSNSKVNSILWAGYPGQEGGNALFDVLTGA 659
Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
P GRLPIT Y +YV + M LRP S+ PGRTY +Y G + PFGYGL YT F
Sbjct: 660 NPPAGRLPITQYPANYVNNNNIQDMNLRPSGSI--PGRTYAWYTGTPVLPFGYGLHYTNF 717
Query: 647 KYNLLSFTKT 656
+ S TKT
Sbjct: 718 SVSFQS-TKT 726
>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 761
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 276/740 (37%), Positives = 392/740 (52%), Gaps = 48/740 (6%)
Query: 57 SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFD 116
+S P R + LV+ TL+EK+ A GVPRLG+P Y+WWSE LHG++ GP T+F
Sbjct: 3 TSRPPLARAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIA--GPYTNFS 60
Query: 117 DV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARD 174
D +TSFP IL A+F++ L + + +STEARA N R GL +W+PNIN RD
Sbjct: 61 DSGEWSYSTSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRD 120
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ETPGED + + Y + GLQ +TD R V + CKH+A YDV+
Sbjct: 121 PRWGRGQETPGEDAYHLSSYVQALIHGLQG-----ESTDPYKR---VVATCKHFAGYDVE 172
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+W G RY D ++T+Q++ E +L PF+ CV + + + MCSYN VNG P CADP LL
Sbjct: 173 DWNGNLRYQNDVQITQQELVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQT 231
Query: 295 TVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
+R W + ++ DCD++Q + H++ + ++ A A +L AG D+ CG Y
Sbjct: 232 ILREHWGWTNEEQWVTGDCDAVQNVYLPHQW-SPTRAGAAADSLVAGTDVTCGTYMQEHL 290
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAA 409
A QQ + E+ +D++L Y+ L+RLG+FD S Y LG + ++ + LA AA
Sbjct: 291 PAAFQQKLLNESSLDQALIRQYSSLVRLGYFDASENQPYRQLGFDAVATNASQALARRAA 350
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---- 465
EGIVLLKND TLPL+ TV + G ANAT ++GNYAG+ SP+
Sbjct: 351 AEGIVLLKND-GTLPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLHSPLYALEQTGV 409
Query: 466 --GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
YA D + +N A S T+D I + G+D SVE E DR L
Sbjct: 410 KINYAGGNPGGQGDPTTNRWSNLYGAYS----TSDVLIYVGGIDNSVEEEGRDRGYLTWT 465
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q +I Q+A+ K PVI+V+ G +D + N NI AI+WAGYPG++GG AI D++
Sbjct: 466 GAQLDVIGQLADTGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDII 524
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GK P GRLP T Y +Y + + +M LRP ++ PGRTYK+YNG + FGYG+ Y
Sbjct: 525 GGKTAPAGRLPQTQYPANYTAAVSMMNMNLRPGEN--SPGRTYKWYNGSATFEFGYGMHY 582
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T F + T +Q + N T + RCP VN V N
Sbjct: 583 TNFSAEI---TTQMQQSYAISSLASGCNSTGGFLE-RCPFASVN-----------VQVHN 627
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G+ + + Y A K ++ ++R+ AG + SL VD
Sbjct: 628 TGNVTSDYITLGYMAGTFGPAPHPRKTLVSYKRLHSIAGGATSTATLNLTLASLARVDEH 687
Query: 764 ANTLLPAGEHTIFVGNGGVS 783
N +L G++++ + N ++
Sbjct: 688 GNKVLYPGDYSLQIDNNALA 707
>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
Length = 797
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 274/733 (37%), Positives = 405/733 (55%), Gaps = 44/733 (6%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG-- 110
L CDSS Y R + L+S TL+E + + GVPRLGLP Y+ W+EALHG+
Sbjct: 61 LVCDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFA 120
Query: 111 -PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
G F+ ATSFP ILTTA+ N +L +I +ST+ARA N GR GL ++PN+
Sbjct: 121 TKGGQFE----WATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNV 176
Query: 170 NVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
N R P WGR ETPGED F + Y Y+ G+Q ++ LKV++ KH+
Sbjct: 177 NGFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG--------GVDPEHLKVAATVKHF 228
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
A YD++NW R FDA +T+QD+ E + F + + S+MC+YN VNG+PSCA+
Sbjct: 229 AGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCAYNSVNGVPSCAN 288
Query: 289 PKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
L +R W GY+ +DCD++ + + H + ++ A A +L+AG D+DCGQ
Sbjct: 289 SFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSS-AAASSLRAGTDIDCGQT 347
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAA 406
Y + G+V +I++S+ LY L+RLG+FD QY SLG +D+ + ++
Sbjct: 348 YPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNQYRSLGWKDVVKTDAWNISY 407
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGF 464
EAA EGIVLLKND TLPL S KV+++A++GP ANAT M GNY G +SP+ A
Sbjct: 408 EAAVEGIVLLKND-GTLPL-SKKVRSIALIGPWANATTQMQGNYYGPAPYLISPLEAAKK 465
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
+GY +V ++ G ++A S A AAK +DA I L G+D ++E E DR D+ PG
Sbjct: 466 AGY-HVNFELGT-EIAGNSTTGFAKAIAAAKKSDAIIYLGGIDNTIEQEGADRTDIAWPG 523
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LI Q++EV K P++++ M G VD + ++N + +++W GYPG+ GG A+ D++
Sbjct: 524 NQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILS 582
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
GK P GRL T Y +YV P M LRP D PG+TY +Y G +Y FG GL YT
Sbjct: 583 GKRAPAGRLVTTQYPAEYVHQFPQNDMNLRP-DGKSNPGQTYIWYTGKPVYEFGSGLFYT 641
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
FK L S K+++ N + + + YT + P F F+ + +N
Sbjct: 642 TFKETLASHPKSLKFNTSSILSAPHPGYT---YSEQIP-----------VFTFEANIKNS 687
Query: 705 GSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDY 762
G T+ +++ + A Y K ++GF R+ ++ G + ++ +L VD
Sbjct: 688 GKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSSKLSIPI-PVSALARVDS 746
Query: 763 AANTLLPAGEHTI 775
N ++ G++ +
Sbjct: 747 HGNRIVYPGKYEL 759
>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
Length = 759
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 252/621 (40%), Positives = 360/621 (57%), Gaps = 36/621 (5%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
G +S+ CD+SL Y R LV+ T E + + A GVPRLG+P Y+WW+EALHG
Sbjct: 27 GTPLSANAVCDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWWTEALHG 86
Query: 106 VSNVGPGTHFDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG 161
V+ PG +F D + AT+FP +I A+F+++L++++ ++ E RA N G+AG
Sbjct: 87 VAG-SPGVNFADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFNNAGKAG 145
Query: 162 LTYWSP-NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
L +SP NIN RDPRWGR ET GEDP + RYAV V+GLQ N L+
Sbjct: 146 LNMYSPLNINCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGP---------NQDELR 196
Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
+++ CKHY AYD++ W GV+RY FDA+V+ Q++ E +L F CV++G A ++M SYN V
Sbjct: 197 LAATCKHYLAYDLEKWDGVERYQFDAQVSRQELAEFYLPQFRACVRDGKAVTLMTSYNAV 256
Query: 281 NGIPSCADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
N +P A L R EW L H Y+ +DCD++ + D H + ADS A A ++ A
Sbjct: 257 NNVPPSASRYYLETLARKEWGLDKKHNYVTSDCDAVANVFDGHHY-ADSYVQAAADSINA 315
Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQ 394
G DL+CG Y++ G A++Q I ++ +Y +RLG FD G P LG +
Sbjct: 316 GTDLNCGATYSDNLGQALEQNLTDVETIRTAVARMYASQVRLGLFDPKQGQP-LRELGWE 374
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
+ + +LA +A + LLKN+ TLP++ A VAV+GP++NAT A+ GNYAG P
Sbjct: 375 HVNTKAAQDLAYSSAAASVTLLKNN-GTLPVDGA--TKVAVIGPYSNATFALRGNYAG-P 430
Query: 455 CRY---MSPIAG--FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
+ M+ A FS A ++ G ++ AA + AK AD I G+D +
Sbjct: 431 GPFAITMTEAAQRVFS-QATISSANGTTISGTYNHTDAEAAMQLAKEADLVIFAGGIDPT 489
Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
+E+E LDR + P Q QLI+ + +AK + +V G +D A + + NI A+LWAG
Sbjct: 490 IESEELDRATIAWPPNQLQLIHALGGMAK-KMAVVQFGGGQIDGASIKADGNIGALLWAG 548
Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
YPG+ G A+ DV+ G P GRLPIT Y +Y+ L T+M LRP + YPGRTYK+Y
Sbjct: 549 YPGQSGALAVMDVIAGNTAPAGRLPITQYPAEYIDGLAETTMALRP--NATYPGRTYKWY 606
Query: 630 NGPTLYPFGYGLSYTQFKYNL 650
+G YP+ +GL YT+FK L
Sbjct: 607 SGTPTYPYAHGLHYTEFKAEL 627
>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 792
Score = 437 bits (1124), Expect = e-119, Method: Compositional matrix adjust.
Identities = 278/806 (34%), Positives = 429/806 (53%), Gaps = 60/806 (7%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFV---------CDPGRFSKLGLQMSS 51
+A V++++L L+ A ++ +AN +P V CD G SK
Sbjct: 7 IATVLAAILPSVLAQANTSYADYNTEANPDLTPQSVATIDLSFPDCDNGPLSKT------ 60
Query: 52 FLFCDS-SLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
+ CD+ + PY R L+S TL+E V G+ + GVPRLGLP Y+ W+EALHG+
Sbjct: 61 -IVCDTLTSPYD-RAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLDRA- 117
Query: 111 PGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
+F D +TSFP ILT ++ N +L ++ +ST+ RA N GR GL +SPN
Sbjct: 118 ---YFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPN 174
Query: 169 INVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
IN R P WGR ETPGED + + YA Y+ G+Q ++ + LK+ + KH
Sbjct: 175 INSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG--------GVDPKSLKLVATAKH 226
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
YA YD++NW G R D +T+QD+ E + F + ++ SVMCSYN VNG+PSCA
Sbjct: 227 YAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVNGVPSCA 286
Query: 288 DPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+ L +R + GYI +DCDS + + H++ A+ A A +++AG D+DCG
Sbjct: 287 NSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSS-AAADSIRAGTDIDCGT 345
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIEL 404
Y + AV Q + DI++ + LY+ LMRLG+FDG S Y +L D+ + + +
Sbjct: 346 TYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDGNSSAYRNLTWNDVVTTNSWNI 405
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
+ E EG VLLKND TLPL S ++++A+VGP N + + GNY G +SP+ F
Sbjct: 406 SYEV--EGTVLLKND-GTLPL-SESIRSIALVGPWMNVSTQLQGNYFGPAPYLISPLDAF 461
Query: 465 -SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
+ +V Y G +++ S + A AAK +DA I G+D S+EAE+LDR ++ P
Sbjct: 462 RDSHLDVNYAFGT-NISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDRMNITWP 520
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q +LI+Q++++ K P+I++ M G VD + ++N N+ +++W GYPG+ GG+A+ D++
Sbjct: 521 GKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQALLDII 579
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GK P GRL +T Y +Y P T M LRP + PG+TY +Y G +Y FG+GL Y
Sbjct: 580 TGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEFGHGLFY 637
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T F+ +S + + K++ N+ D PG + + + F VD N
Sbjct: 638 TTFR---VSHARAV-----KIKPTYNIQ---DLLAQPHPGYI--HVEQMPFLNFTVDITN 684
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G ++++ A A K ++GF R+ ++ + S+ D
Sbjct: 685 TGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTIPVTINSMARTDEL 744
Query: 764 ANTLLPAGEHTIFVGNG-GVSFPIHL 788
N +L G++ + + N V P+ L
Sbjct: 745 GNRVLYPGKYELALNNERSVVLPLSL 770
>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
Length = 797
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 248/601 (41%), Positives = 350/601 (58%), Gaps = 22/601 (3%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ CD+S Y R + L++ TL+E + + A GVPRLGLP Y+ WSEALHG+
Sbjct: 62 IVCDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEALHGLDRANFA 121
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T D+ ATSFP IL+ A+ N +L +I + T+ARA N GR GL ++PNIN
Sbjct: 122 TSGDEWT-WATSFPMPILSMAALNRTLINQIAGIIGTQARAFNNAGRYGLDAYAPNINGF 180
Query: 173 RDPRWGRITETPGEDP-FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
R P WGR ETPGED F+ YA Y+ GLQ ++ LKV + KH+A Y
Sbjct: 181 RSPLWGRGQETPGEDANFLSSSYAYEYITGLQG--------GVDPDHLKVVATAKHFAGY 232
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
D++NW G R FDA +T+QD+ E + F + A S MCSYN VNG+PSC+ L
Sbjct: 233 DLENWGGNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFL 292
Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
L +R WD +GY+ +DCD++ + + H + A ++ A A +L+AG D+DCGQ Y
Sbjct: 293 LQTLLRDNWDFPEYGYVSSDCDAVYNVFNPHGY-ASNQSAAAADSLRAGTDIDCGQTYPW 351
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAAEA 408
+ +G V +I++S+ LY+ L++LG+FDG +Y LG D+ + + ++ EA
Sbjct: 352 NLNQSFIEGSVTRGEIERSIVRLYSNLVKLGYFDGDKSEYRQLGWNDVVTTDAWNISYEA 411
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS--G 466
A EGIVLLKND LPL S VK++A++GP ANAT + GNY G ++P+ G S G
Sbjct: 412 AVEGIVLLKND-GILPL-SKHVKSIALIGPWANATEQLQGNYYGTAPYLITPLQGASDAG 469
Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
Y V Y G ++ + A AAK +D + L G+D ++EAE DR ++ PG Q
Sbjct: 470 Y-KVNYALGT-NILGNTTEGFADALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPGNQ 527
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
LI Q+++ K P++++ M G VD + + N+ + A++W GYPG+ GG AI D++ GK
Sbjct: 528 LDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKANSKVNALVWGGYPGQSGGTAIFDILSGK 586
Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
P GRL T Y +Y P T M LRP D PG+TY +Y G +Y FGYGL YT F
Sbjct: 587 RVPAGRLVTTQYPAEYATQFPATDMNLRP-DGASNPGQTYMWYTGTPVYDFGYGLFYTTF 645
Query: 647 K 647
K
Sbjct: 646 K 646
>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
SS1]
Length = 783
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 257/611 (42%), Positives = 363/611 (59%), Gaps = 27/611 (4%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CD + R L+ T +E + + GVPRLGLP Y WWSE LHGV+
Sbjct: 35 LKSNAVCDITKDPITRATALIGLWTDEELTSNTVNASPGVPRLGLPAYNWWSEGLHGVAQ 94
Query: 109 VGPGTHF--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
PG F ATSFP IL A+F+++L + I VSTE RA N GRAGL YW+
Sbjct: 95 -SPGVTFAPSGNFSHATSFPQPILMGAAFDDTLIQAIATIVSTEGRAFNNAGRAGLDYWT 153
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCC 225
PNIN +DPRWGR ETPGEDPF + +Y N + GLQ L+ +P KV + C
Sbjct: 154 PNINPFKDPRWGRGQETPGEDPFHLSQYVYNLILGLQG--------GLDPKPYFKVVADC 205
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+AAYD++NW+G+ R FDA V++QD+ E +L PF+ CV++ +SVMCSYN VNGIPS
Sbjct: 206 KHFAAYDLENWEGIVRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPS 265
Query: 286 CADPKLLNQTVRGEWDLHG--YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CA+ LL +R W ++ +DCD+++ ++ HK+ D + A A L AG D+DC
Sbjct: 266 CANSFLLQDVLRDHWGFTDDRWVTSDCDAVENILTPHKYTTDPAQ-AAADALLAGTDIDC 324
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
G + + + A+Q+G V TD+ ++ Y L+RLG+FD + Y LG D+ + +
Sbjct: 325 GTFSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFDDPAAQPYRQLGWSDVNTPQA 384
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
+LA AA EGIVLLKND LP S V+ +A++GP ANAT + G+Y G+ +SP+
Sbjct: 385 QQLAHTAAVEGIVLLKND-GVLPF-SKHVRKLALIGPWANATSLLQGSYIGVAPYLVSPL 442
Query: 462 AGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA-GLDLSVEAESLDRE 518
G +G+ V Y G +V +++ S FAA+ AA ++ A GLD +VE E DR
Sbjct: 443 QGAQEAGF-EVEYVLGT-NVTTQNDMSGFAAAVAAVRRADAVVFAGGLDETVECEGTDRL 500
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
++ PG Q L+ ++ V K P+I+ G +D + + + AI+W GYPG+ GG A
Sbjct: 501 NVTWPGNQLDLVAELERVGK-PLIVAQFGGGQLDDTALKHSKAVNAIIWGGYPGQSGGTA 559
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+ D++ GK P GRLPIT Y Y + +P+T M LRP S PGRTYK+Y+G ++ FG
Sbjct: 560 LFDILTGKAAPAGRLPITQYPAAYTKQVPMTDMSLRP--SATNPGRTYKWYSGTPVFEFG 617
Query: 639 YGLSYTQFKYN 649
+GL YT F ++
Sbjct: 618 FGLHYTTFVFS 628
>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 865
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 252/613 (41%), Positives = 357/613 (58%), Gaps = 29/613 (4%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CD++L + R +V MTLDEKV +G A G RLGLP Y+W +EALHGV+
Sbjct: 138 LCSNAICDTTLSMAERAAAIVKPMTLDEKVANVGSSASGSARLGLPAYQWQNEALHGVAG 197
Query: 109 VGPGTHFDDVI----PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
G F + ATSFP IL +A+F+++L + + A+STEARA N G AGL +
Sbjct: 198 -STGVQFQSPLGANFSAATSFPMPILLSAAFDDALVQNVATAISTEARAFANYGFAGLDF 256
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
W+PNIN RDPRWGR ETPGED F + Y + + GLQ +N ++ +
Sbjct: 257 WTPNINPFRDPRWGRGMETPGEDAFRIQGYVLALISGLQG--------GINPDFFRIIAT 308
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
CKH+AAYD++N R + T+QDM + +L FE CV++ SVMC+YN V+GIP
Sbjct: 309 CKHFAAYDIEN----GRTGNNLNPTQQDMADYYLPMFETCVRDAKVGSVMCAYNAVDGIP 364
Query: 285 SCADPKLLNQTVR---GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
+CA LL +R G + Y+V+DCD++ + D H + ++ E A A +L AG DL
Sbjct: 365 ACASEYLLQDVLRDGFGFTEDFNYVVSDCDAVDNVFDPHHYASNLTE-AAALSLNAGTDL 423
Query: 342 DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDEN 401
DCG Y N +V+ E +++SL LY+ L+++G+FD +Y SL ++ + +N
Sbjct: 424 DCGSSY-NVLNASVEAALTSEAALNQSLVRLYSALIKVGYFDQPSEYKSLSWANVNTTQN 482
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
LA +AA G+ LLKND TLPL S + VA++GP NAT M GNYAG ++P+
Sbjct: 483 QALAHDAATGGMTLLKND-GTLPL-SRTLSNVAIIGPWVNATTQMQGNYAGTAPFLVNPL 540
Query: 462 AGF-SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
F + NV Y G + + + AA AA ++D + L G+D++VE E DR +
Sbjct: 541 DVFQQKWGNVKYAQGT-AINSQDTSGFSAALSAASSSDVIVYLGGIDITVENEGFDRGSI 599
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
PG Q LI+Q+A + K P+++V G +D + +N N+++ILWAGYPG++GG A+
Sbjct: 600 VWPGNQLDLISQLANLGK-PLVIVQFGGGQIDDSSLLSNPNVRSILWAGYPGQDGGNAVF 658
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
DV+ G P GRLPIT Y Y+ + M LRP S G PGRTY +Y G + PFGYG
Sbjct: 659 DVLTGANPPAGRLPITQYPASYINNNNIQDMNLRP--SNGIPGRTYAWYTGTPVLPFGYG 716
Query: 641 LSYTQFKYNLLSF 653
L YT F + S
Sbjct: 717 LHYTNFSVSFQSI 729
>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
Length = 772
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 259/603 (42%), Positives = 359/603 (59%), Gaps = 29/603 (4%)
Query: 55 CD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
CD SS PY+ R L+S TL EKV G+ + GVPR+GLP YEWW+EALHG++ PGT
Sbjct: 34 CDTSSDPYT-RAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIAR-SPGT 91
Query: 114 HFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
F +TSFP IL A+F++ L K+ VSTEARA N+ R GL +W+PNIN
Sbjct: 92 TFAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNIN 151
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS-SCCKHYA 229
+DPRWGR ETPGEDPF Y + GLQ L+ P K + CKH+A
Sbjct: 152 PYKDPRWGRGQETPGEDPFHTSSYVNALITGLQG--------GLDDLPYKKGVATCKHFA 203
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
YD+++ G RY FDA + QD+ + +L PF+ C ++ + SVMCSYN +NG+P+CAD
Sbjct: 204 GYDLESSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADD 263
Query: 290 KLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
LL +R W + ++ +DCD+++ + D H + + E + A L AG DLDCG +
Sbjct: 264 WLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTL-TPEQSAADALNAGTDLDCGTF 322
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIEL 404
+ + G+A QG + +D+SL Y L+RLG+FD Y L ++ + +L
Sbjct: 323 WPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFDPPSVQPYRQLNWDNVSTPAAQQL 382
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP-IAG 463
A +AA +GIVLLKND LPL S+ + VA++GP ANAT M GNY G SP IA
Sbjct: 383 ALQAAEDGIVLLKND-GILPL-SSNITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAA 440
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
+ VTY G D+ ++ AA AA++AD I + G+D S+EAE +DR + P
Sbjct: 441 QNAGFKVTYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEIDRTSISWP 499
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
Q LINQ+A ++ P+I+ M +D + +NT + A+LWAGYPG++GG AI +++
Sbjct: 500 SSQLSLINQLANLST-PLIISQMGC-MIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNIL 557
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GK P GRLPIT Y +YV + +T M L+P S PGRTYK+YNG ++ +GYGL Y
Sbjct: 558 TGKTAPAGRLPITQYPSNYVNQVTMTDMNLQP--SRFNPGRTYKWYNGEPVFEYGYGLQY 615
Query: 644 TQF 646
T F
Sbjct: 616 TTF 618
>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
Length = 774
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 267/741 (36%), Positives = 401/741 (54%), Gaps = 45/741 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
++S CD++L R LV+ MT +EK+Q L + G PR+GLP Y WWSEALHGV+
Sbjct: 36 LASLKVCDATLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVA- 94
Query: 109 VGPGTHF---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
PGT F D +TSFP +L A+F++ L +K+G+ + TE RA N G +G YW
Sbjct: 95 YAPGTQFRSGDGPFNSSTSFPMPLLMAATFDDELIEKVGEVIGTEGRAFGNAGFSGFDYW 154
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N +DPRWGR +ETPGED + RYA + +RGLQ L R +V + C
Sbjct: 155 TPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQG--------PLPER--RVVATC 204
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYAA D ++W G R+ FDA+VT QD+ E +L PF+ C ++ S+MCSYN VNG+P+
Sbjct: 205 KHYAANDFEDWNGSTRHDFDAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPA 264
Query: 286 CADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
CA+ L+ +R W+ YI +DC+++ + NH + A + + A +AG D
Sbjct: 265 CANTYLMQTILREHWNWTAPGNYITSDCEAVLDIFANHHY-AKTNAEGTALAFEAGTDSS 323
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDEN 401
C ++ A QG ++++ +D++L LY L+R+G+FDG+ +Y SLG +D+ S ++
Sbjct: 324 CEYESSSDIPGAWTQGLLEQSTVDRALTRLYEGLVRVGYFDGNHSEYASLGWKDVNSPKS 383
Query: 402 IELAAEAAREGIVLLKNDQNTLP--LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+A + A EGIVLLKNDQ TLP L + +A++G AN + G Y+G P S
Sbjct: 384 QEVALQTAVEGIVLLKNDQ-TLPLGLKTDPKSKLAMIGFWANDPKTLSGGYSGKPAFEHS 442
Query: 460 PIAGFSGYA-NVTYKTGCDDVACKSNNS-IFAASEAAKTADATIILAGLDLSVEAESLDR 517
P+ NVT G SN++ AA EAA+ A+ + GLD S E+ DR
Sbjct: 443 PVYAAEAMGFNVTTAGGPVLQNSTSNDTWTQAALEAAQDANYILYFGGLDTSAAGETKDR 502
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
+ P Q QLI + ++ K P+++V M + T T + +ILWA +PG++GG
Sbjct: 503 TTINWPEAQLQLIKTLTKLGK-PLVVVQMGDQLDNTPLLATKT-VNSILWANWPGQDGGT 560
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT-LYP 636
A+ ++ G +P GRLP+T Y +Y +P+T M LRP D L PGRTY++Y PT + P
Sbjct: 561 AVMQILTGLKSPAGRLPVTQYPANYTAAVPMTDMNLRPSDRL--PGRTYRWY--PTAVQP 616
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FG+GL YT F+ + + + + + L C N + P
Sbjct: 617 FGFGLHYTTFQAKIAAPLPRLAIQ-DLLSRCGGDNANAYPDTCALP-------------P 662
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
KV+ N G+ VV+ + A IK ++ + R+ + +K +
Sbjct: 663 LKVEVTNSGNRSSDYVVLAFLAGDAGPRPYPIKTLVSYTRLRDVSPGHKTTAHLEWTLGD 722
Query: 757 LNIVDYAANTLLPAGEHTIFV 777
+ D NT+L G +T+ V
Sbjct: 723 IARYDEQGNTVLYPGTYTVTV 743
>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
Length = 798
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 273/749 (36%), Positives = 393/749 (52%), Gaps = 49/749 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ + CD S R K LV+ TL+EK+ A GVPRLG+P Y+WW+E LHG++
Sbjct: 31 LKNVTICDPSASPLARAKSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWNEGLHGIA- 89
Query: 109 VGPGTHFDDV---IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
GP T+F +TSFP IL A+F++ L ++ + +STEARA N R GL +W
Sbjct: 90 -GPYTNFSHSGVEWSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRTGLDFW 148
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN RDPRWGR ETPGED + + Y + GLQ ATD R V + C
Sbjct: 149 TPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQG-----EATDPYKR---VVATC 200
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A YDV++W G RY D ++T+QD+ E +L PF+ CV + + + MCSYN VNG P
Sbjct: 201 KHFAGYDVEDWNGNLRYQNDVQITQQDLVEYYLAPFQACV-QANVGAFMCSYNAVNGAPP 259
Query: 286 CADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
CADP LL +R W + ++ DCD++Q + H++ + ++ A A +L AG D+
Sbjct: 260 CADPYLLQTILREHWGWNKEEQWVTGDCDAVQNVYFPHQW-SSTRAGAAADSLVAGTDIT 318
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSD 399
CG Y A +Q + E+ +D +L Y+ L+RLG+FD +P+ Y LG + ++
Sbjct: 319 CGTYMQEHLPAAFRQKLLNESSLDLALIRQYSSLVRLGYFD-APENQPYRQLGFDAVATN 377
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
+ LA AA EGIVLLKND TLPL+ TV + G ANAT ++GNYAG+ S
Sbjct: 378 ASQALARRAAAEGIVLLKND-GTLPLSLDSSMTVGLFGDWANATTQLLGNYAGVATYLHS 436
Query: 460 PIAGFSGYA-NVTYK----TGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
P+ + Y G D ++++ A T+D I + G+D VE E
Sbjct: 437 PLYALKQTGVKINYAGGKPGGQGDPTTNRWSNLYGAY---STSDVLIYVGGIDNGVEEEG 493
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
DR L G Q +I Q+AE K PVI+V+ G +D + N NI AI+WAGYPG++
Sbjct: 494 HDRGYLTWTGPQLDVIGQLAETGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQD 552
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
GG AI D++ GK P GRLP T Y Y + + +M LRP ++ PGRTYK+YNG +
Sbjct: 553 GGSAIIDIISGKTAPAGRLPQTQYPASYAAAVSMMNMNLRPGEN--NPGRTYKWYNGSAV 610
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
+ FGYG+ YT F + Q ++ +S AS G + RC +
Sbjct: 611 FEFGYGMHYTNFSAAI------------STQMQQSYAISSLASGCNSTGGFLE--RC-PF 655
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
V N G V + Y A K ++ ++R+ AG +
Sbjct: 656 ASVDVQVHNTGKVTSDYVTLGYMAGTFGPAPHPRKTLVSYKRLHNIAGGATSTAKLNLTL 715
Query: 755 KSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
S+ VD N +L G +++ + N ++
Sbjct: 716 ASVARVDEYGNKVLYPGHYSLQIDNNALA 744
>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
Length = 908
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 256/610 (41%), Positives = 353/610 (57%), Gaps = 31/610 (5%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CD+SL + RV LV +TL+EK+ L D A G RLGLP YEWW+EA HGV +
Sbjct: 157 LCSHRVCDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNEATHGVGS 216
Query: 109 VGPGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
PG F ATSFP IL ASF+ +L +KI + + E RA N G +G +W
Sbjct: 217 A-PGVQFTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNGFSGFDFW 275
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN RDPRWGR ETPGED FV Y N++ GLQ + + +V + C
Sbjct: 276 APNINGFRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQGDD---------PKNKQVIATC 326
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++ RY + T+QD+ + FL PF+ CV++ D S+MCSYN V+GIP+
Sbjct: 327 KHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPA 382
Query: 286 CADPKLLNQTVRGEW----DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
CA+ LL++ +R W D H Y+V+DC+++ + H F D++E A A L AG+DL
Sbjct: 383 CANEYLLDEVLRKHWGFNADYH-YVVSDCNAVTDIWQYHNF-TDTEEAAAAVALNAGVDL 440
Query: 342 DCGQYYTNFTGN-AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
+CG Y + A Q VK +D+SL LY+ L +GFFDG +Y L D+
Sbjct: 441 ECGSSYLKLNESLAANQTSVKA--MDQSLARLYSALFTIGFFDGG-KYDHLDFSDVSIPA 497
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
LA EAA EG+ LLKND LPL+S K K+VAV+GP ANAT M G Y+G +S
Sbjct: 498 AQALAYEAAVEGMTLLKND-GLLPLHSQHKYKSVAVIGPFANATTQMQGGYSGNAPYLIS 556
Query: 460 PIAGFSGYANVTYKTGCDDVACKSNNSIFAAS-EAAKTADATIILAGLDLSVEAESLDRE 518
P+ F N + F AS AAK +D + L G+D S+E+E++DR
Sbjct: 557 PLVAFESDHRWKVNYAVGTAINDQNTTGFEASLAAAKKSDLIVYLGGIDNSIESETIDRT 616
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
L PG Q LI ++ ++K P+++V G VD + N +I+A++WAGYP + GG A
Sbjct: 617 SLAWPGNQLDLIKSLSNLSK-PMVVVQFGGGQVDDSALLENKDIQALIWAGYPSQSGGTA 675
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+ D++ GK +P GRLP+T Y Y + + + LRP +PGRTYK+Y G + PFG
Sbjct: 676 LLDILVGKRSPAGRLPVTQYPASYADQINIFDINLRPNSKDSHPGRTYKWYTGKPVIPFG 735
Query: 639 YGLSYTQFKY 648
+GL YT+FK+
Sbjct: 736 HGLHYTKFKF 745
>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
Length = 797
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 274/733 (37%), Positives = 404/733 (55%), Gaps = 44/733 (6%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG-- 110
L CDSS Y R + L+S TL+E + + GVPRLGLP Y+ W+EALHG+
Sbjct: 61 LVCDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFA 120
Query: 111 -PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
G F+ ATSFP ILTTA+ N +L +I +ST+ARA N GR GL ++PN+
Sbjct: 121 TKGGQFE----WATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNV 176
Query: 170 NVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
N R P WGR ETPGED F + Y Y+ G+Q ++ LKV++ KH+
Sbjct: 177 NGFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG--------GVDPEQLKVAATVKHF 228
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
A YD++NW R FDA +T+QD+ E + F + + S+MCSYN VNG+PSCA+
Sbjct: 229 AGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCSYNSVNGVPSCAN 288
Query: 289 PKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
L +R W GY+ +DCD++ + + H + ++ A A +L+AG D+DCGQ
Sbjct: 289 SFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSS-AAASSLRAGTDIDCGQT 347
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAA 406
Y + G+V +I++S+ LY L+RLG+FD QY SLG +D+ + ++
Sbjct: 348 YPWHLNESFVAGEVTRGEIERSVTRLYANLVRLGYFDKKNQYRSLGWKDVVKTDAWNISY 407
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGF 464
EAA EGIVLLKND TLPL S KV+++A++GP ANAT M GNY G +SP+ A
Sbjct: 408 EAAVEGIVLLKND-GTLPL-SKKVRSIALIGPWANATTQMQGNYFGPAPYLISPLEAAKK 465
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
+GY +V ++ G ++A S A AAK +DA + L G+D ++E E DR D+ PG
Sbjct: 466 AGY-HVNFELGT-EIAGNSTAGFAKAIAAAKKSDAIVYLGGIDNTIEQEGADRTDIAWPG 523
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LI Q++EV K P++++ M G VD + ++N + +++W GYPG+ GG A+ D++
Sbjct: 524 NQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILS 582
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
GK P GRL T Y +YV P M LRP D PG+TY +Y G +Y FG GL YT
Sbjct: 583 GKRAPAGRLITTQYPAEYVHQFPQNDMNLRP-DGKSNPGQTYIWYTGKPVYEFGSGLFYT 641
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
FK L S K ++ N + + + YT + P F F+ + +N
Sbjct: 642 TFKETLASHPKCLKFNTSSILSAPHPGYTYSE---QIP-----------VFTFEANIKNS 687
Query: 705 GSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDY 762
G T+ +++ + A Y K ++GF R+ ++ G + ++ +L VD
Sbjct: 688 GKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSSKLSIPI-PVSALARVDS 746
Query: 763 AANTLLPAGEHTI 775
N ++ G++ +
Sbjct: 747 YGNRIVYPGKYEL 759
>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
2508]
gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 770
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 260/741 (35%), Positives = 399/741 (53%), Gaps = 45/741 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
++S CD +L R LV+ MT +EK+Q L + G PR+GLP Y WWSEALHGV+
Sbjct: 36 LASLKVCDVTLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVA- 94
Query: 109 VGPGTHF---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
PGT F D +TSFP +L A+F++ L +K+G+ + TE RA N G +G YW
Sbjct: 95 YAPGTQFWSGDGPFNASTSFPMPLLMAATFDDELIEKVGEVIGTEGRAFGNAGFSGFDYW 154
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N +DPRWGR +ETPGED + RYA + +RGLQ +R +V + C
Sbjct: 155 TPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQG----------PARERRVVATC 204
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYAA D ++W G R+ F+A+VT QD+ E +L PF+ C ++ S+MCSYN VNG+P+
Sbjct: 205 KHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPA 264
Query: 286 CADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
CA+ L+ +R W+ YI +DC+++ + NH + A++ + A +AG+D
Sbjct: 265 CANTYLMQTILREHWNWTAPGNYITSDCEAVLDISANHHY-AETNAEGTALAFEAGIDSS 323
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDEN 401
C ++ A QG ++++ +D++LK +Y L+R+G+FDG+ +Y SLG +D+ S ++
Sbjct: 324 CEYESSSDIPGAWTQGLLEQSTVDRALKRIYEGLVRVGYFDGNHSEYASLGWKDVNSPKS 383
Query: 402 IELAAEAAREGIVLLKNDQNTLPLN--SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+A +AA EGIVLLKND+ TLPL+ + +A++G AN + G Y+G P S
Sbjct: 384 QEVALQAAVEGIVLLKNDK-TLPLDLRTDPKSKLAMIGFWANDPKTLSGGYSGKPAFEHS 442
Query: 460 PIAGFSGYANVTYKTGCDDVACKSNNSIF--AASEAAKTADATIILAGLDLSVEAESLDR 517
P+ G + ++N + AA EAAK A+ + G D S E+ DR
Sbjct: 443 PVYAAQAMGFSVTTAGGPVLQNSTSNDTWTQAALEAAKDANYILYFGGQDTSAAGETKDR 502
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
+ P Q QLI ++++ K P+++V M +D + AILWA + G++GG
Sbjct: 503 TTINWPEAQLQLITTLSKLGK-PLVVVQM-GDQLDNTPLLAAKAVNAILWANWLGQDGGT 560
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT-LYP 636
A+ ++ G NP GRLP+T Y +Y +P+T M LRP D L PGRTY++Y PT + P
Sbjct: 561 AVMQILTGLKNPAGRLPVTQYPANYTAAVPMTDMNLRPSDKL--PGRTYRWY--PTAVQP 616
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FG+GL YT F+ + + + + L C N + P
Sbjct: 617 FGFGLHYTTFQTKIAVPLPRLAIQ-DLLSRCGGDNANAYPDTCALP-------------P 662
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
KV+ N G+ VV+ + IK ++ + R+ + +K +
Sbjct: 663 LKVEVTNSGNRSSDYVVLAFLAGDVGPKPYPIKTLVSYTRLRDLSPGHKTTAHLKWTLGD 722
Query: 757 LNIVDYAANTLLPAGEHTIFV 777
+ D NT+L G +T+ V
Sbjct: 723 IARYDEQGNTVLYPGTYTVTV 743
>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
Length = 796
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 274/740 (37%), Positives = 401/740 (54%), Gaps = 43/740 (5%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ L C++S R + LVS TL+E + + A GVPRLGLPQY+ W+EALHG+
Sbjct: 58 LSTNLVCNTSADPWARAEALVSLFTLEELINNTQNTAPGVPRLGLPQYQVWNEALHGLDR 117
Query: 109 VGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
+F D ATSFP IL+ ASFN +L +I ++T+ARA N GR GL ++
Sbjct: 118 A----NFSDSGEYSWATSFPMPILSMASFNRTLINQIASIIATQARAFNNAGRYGLDSYA 173
Query: 167 PNINVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
PNIN R P WGR ETPGED F + YA Y+ GLQ ++ +K+ +
Sbjct: 174 PNINGFRSPLWGRGQETPGEDAFFLSSAYAYEYITGLQG--------GVDPEHVKIVATA 225
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A YD++NW V R +A +T+QD+ E + F + S+MCSYN VNG+PS
Sbjct: 226 KHFAGYDLENWGNVSRLGSNAIITQQDLSEYYTPQFLASARYAKTRSLMCSYNAVNGVPS 285
Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
C++ L +R ++ GY+ +DCD++ + + H + A ++ A A +L AG D+DC
Sbjct: 286 CSNSFFLQTLLRESFNFVDDGYVSSDCDAVYNVFNPHGY-ALNQSGAAADSLLAGTDIDC 344
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENI 402
GQ + + V DI+KSL LY L+RLG+FDG+ Y +L D+ + +
Sbjct: 345 GQTMPWHLNESFYERYVSRGDIEKSLTRLYANLVRLGYFDGNNSVYRNLNWNDVVTTDAW 404
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI- 461
++ EAA EGI LLKND TLPL S KV+++A++GP ANATV M GNY G P +SP+
Sbjct: 405 NISYEAAVEGITLLKND-GTLPL-SKKVRSIALIGPWANATVQMQGNYYGTPPYLISPLE 462
Query: 462 -AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
A SG+ V Y G +++ S A AAK +D I G+D ++EAE DR DL
Sbjct: 463 AAKASGFT-VNYAFGT-NISTDSTQWFAEAISAAKKSDVIIYAGGIDNTIEAEGQDRTDL 520
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
PG Q LI Q+++V K P++++ M G VD + + N N+ A++W GYPG+ GG A+
Sbjct: 521 KWPGNQLDLIEQLSKVGK-PLVVLQMGGGQVDSSSLKANKNVNALVWGGYPGQSGGAALF 579
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
D++ GK P GRL T Y +Y P M LRP S PG+TY +Y G +Y FG+G
Sbjct: 580 DILTGKRAPAGRLVSTQYPAEYATQFPANDMNLRPNGS--NPGQTYIWYTGTPVYEFGHG 637
Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
L YT+F+ ++ NK L D T PG +L + VD
Sbjct: 638 LFYTEFQ-------ESAAAGTNKTSTLDIL----DLVPTPHPGYEYIELV--PFLNVTVD 684
Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV-FVRAGRNKRIKFVFNACKSLNI 759
+NVG T ++++ A K ++GF R+ + + ++ F ++
Sbjct: 685 VKNVGHTPSPYTGLLFANTTAGPKPYPNKWLVGFDRLATIHPAKTAQVTFPV-PLGAIAR 743
Query: 760 VDYAANTLLPAGEHTIFVGN 779
D N ++ GE+ + + N
Sbjct: 744 ADENGNKVIFPGEYELALNN 763
>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
Length = 792
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 275/794 (34%), Positives = 423/794 (53%), Gaps = 49/794 (6%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQ--------MSSF 52
+AK ++++L L AL +T+ VD N ++P P + + L +S
Sbjct: 3 VAKSIAAVLVALLPGALAQANTSYVDYNVEANPDLT--PQSVATIDLSFPDCENGPLSKT 60
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L CD+S R LVS T +E V G+ + GVPRLGLP Y+ WSEALHG+
Sbjct: 61 LVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRA--- 117
Query: 113 THFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
+F D ATSFP ILT ++ N +L +I ++T+ RA N+GR GL ++PNIN
Sbjct: 118 -NFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNIN 176
Query: 171 VARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
R WGR ETPGED + + YA Y+ G+Q ++ LK+ + KHYA
Sbjct: 177 AFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--------GVDPEHLKLVATAKHYA 228
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
YD++NW G R D +T+Q++ E + F + ++ SVMCSYN VNG+PSCA+
Sbjct: 229 GYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANS 288
Query: 290 KLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
L +R + GY+ +DCDS + + H+F A+ A A +++AG D+DCG Y
Sbjct: 289 FFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGTDIDCGTTY 347
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAA 406
+ G A + +V +I++ + LY+ L+RLG+FDG+ Y L D+ + + ++
Sbjct: 348 QYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVTTDAWNISY 407
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
EAA EGIVLLKND TLPL + V++VA++GP N T + GNY G +SP+ F
Sbjct: 408 EAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQN 465
Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
+V Y G + ++ S + A AAK +D I G+D ++EAE++DR ++ PG
Sbjct: 466 SDFDVNYAFGTN-ISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGN 524
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q QLI+Q++++ K P+I++ M G VD + ++N N+ +++W GYPG+ GG+A+ D++ G
Sbjct: 525 QLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITG 583
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
K P GRL +T Y +Y P T M LRP + PG+TY +Y G +Y FG+GL YT
Sbjct: 584 KRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEFGHGLFYTT 641
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F +L K + + N +Q + A+ + P L+N F V N G
Sbjct: 642 FHASLPGTGKD-KTSFN-IQDLLTQPHPGFANVEQMP--LLN---------FTVTITNTG 688
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
++++ A A K ++GF R+ + + S+ D A N
Sbjct: 689 KVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVARTDEAGN 748
Query: 766 TLLPAGEHTIFVGN 779
+L G++ + + N
Sbjct: 749 RVLYPGKYELALNN 762
>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
Length = 792
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 275/794 (34%), Positives = 423/794 (53%), Gaps = 49/794 (6%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQ--------MSSF 52
+AK ++++L L AL +T+ VD N ++P P + + L +S
Sbjct: 3 VAKSIAAVLVALLPGALAQANTSYVDYNVEANPNLT--PQSVATIDLSFPDCENGPLSKT 60
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L CD+S R LVS T +E V G+ + GVPRLGLP Y+ WSEALHG+
Sbjct: 61 LVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRA--- 117
Query: 113 THFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
+F D ATSFP ILT ++ N +L +I ++T+ RA N+GR GL ++PNIN
Sbjct: 118 -NFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNIN 176
Query: 171 VARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
R WGR ETPGED + + YA Y+ G+Q ++ LK+ + KHYA
Sbjct: 177 AFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--------GVDPEHLKLVATAKHYA 228
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
YD++NW G R D +T+Q++ E + F + ++ SVMCSYN VNG+PSCA+
Sbjct: 229 GYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANS 288
Query: 290 KLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
L +R + GY+ +DCDS + + H+F A+ A A +++AG D+DCG Y
Sbjct: 289 FFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGTDIDCGTTY 347
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAA 406
+ G A + +V +I++ + LY+ L+RLG+FDG+ Y L D+ + + ++
Sbjct: 348 QYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVTTDAWNISY 407
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
EAA EGIVLLKND TLPL + V++VA++GP N T + GNY G +SP+ F
Sbjct: 408 EAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQN 465
Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
+V Y G + ++ S + A AAK +D I G+D ++EAE++DR ++ PG
Sbjct: 466 SDFDVNYAFGTN-ISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGN 524
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q QLI+Q++++ K P+I++ M G VD + ++N N+ +++W GYPG+ GG+A+ D++ G
Sbjct: 525 QLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITG 583
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
K P GRL +T Y +Y P T M LRP + PG+TY +Y G +Y FG+GL YT
Sbjct: 584 KRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEFGHGLFYTT 641
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F +L K + + N +Q + A+ + P L+N F V N G
Sbjct: 642 FHASLPGTGKD-KTSFN-IQDLLTQPHPGFANVEQMP--LLN---------FTVTITNTG 688
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
++++ A A K ++GF R+ + + S+ D A N
Sbjct: 689 KVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVARTDEAGN 748
Query: 766 TLLPAGEHTIFVGN 779
+L G++ + + N
Sbjct: 749 RVLYPGKYELALNN 762
>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
Length = 792
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 275/794 (34%), Positives = 423/794 (53%), Gaps = 49/794 (6%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQ--------MSSF 52
+AK ++++L L AL +T+ VD N ++P P + + L +S
Sbjct: 3 VAKSIAAVLVALLPGALAQANTSYVDYNVEANPDLT--PQSVATIDLSFPDCENGPLSKT 60
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L CD+S R LVS T +E V G+ + GVPRLGLP Y+ WSEALHG+
Sbjct: 61 LVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRA--- 117
Query: 113 THFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
+F D ATSFP ILT ++ N +L +I ++T+ RA N+GR GL ++PNIN
Sbjct: 118 -NFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNIN 176
Query: 171 VARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
R WGR ETPGED + + YA Y+ G+Q ++ LK+ + KHYA
Sbjct: 177 AFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--------GVDPEHLKLVATAKHYA 228
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
YD++NW G R D +T+Q++ E + F + ++ SVMCSYN VNG+PSCA+
Sbjct: 229 GYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANS 288
Query: 290 KLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
L +R + GY+ +DCDS + + H+F A+ A A +++AG D+DCG Y
Sbjct: 289 FFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGTDIDCGTTY 347
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAA 406
+ G A + +V +I++ + LY+ L+RLG+FDG+ Y L D+ + + ++
Sbjct: 348 QYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVTTDAWNISY 407
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
EAA EGIVLLKND TLPL + V++VA++GP N T + GNY G +SP+ F
Sbjct: 408 EAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQN 465
Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
+V Y G + ++ S + A AAK +D I G+D ++EAE++DR ++ PG
Sbjct: 466 SDFDVNYAFGTN-ISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGN 524
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q QLI+Q++++ K P+I++ M G VD + ++N N+ +++W GYPG+ GG+A+ D++ G
Sbjct: 525 QLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITG 583
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
K P GRL +T Y +Y P T M LRP + PG+TY +Y G +Y FG+GL YT
Sbjct: 584 KRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEFGHGLFYTT 641
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F +L K + + N +Q + A+ + P L+N F V N G
Sbjct: 642 FHASLPGTGKD-KTSFN-IQDLLTQPHPGFANVEQMP--LLN---------FTVTITNTG 688
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
++++ A A K ++GF R+ + + S+ D A N
Sbjct: 689 KVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVARTDEAGN 748
Query: 766 TLLPAGEHTIFVGN 779
+L G++ + + N
Sbjct: 749 RVLYPGKYELALNN 762
>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
JBW45]
Length = 750
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 262/760 (34%), Positives = 394/760 (51%), Gaps = 97/760 (12%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+M F + D +L + R KDLVSRMTL+EKV Q+ + +PRLG+P Y WWSEALHGV+
Sbjct: 26 RMEIFDYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVA 85
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-------- 159
G AT FP I A+F+E L + + +S E RA ++ +
Sbjct: 86 RAGV----------ATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRKGDHGIY 135
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+WSPN+N+ RDPRWGR ET GEDP++ GR V++++GLQ + + L
Sbjct: 136 KGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQG---------QDKKYL 186
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+ ++C KH+A V + +R+ FDA V+ +D+ ET+L F+ CVKE + +VM +YNR
Sbjct: 187 RAAACAKHFA---VHSGPESERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNR 243
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
VNG P C LL +T+R EW G++V+DC +I+ +NH+ + S ++VA L G
Sbjct: 244 VNGEPCCGSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHR-VTSSAPESVALALNNGC 302
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDIC 397
DL+CG Y N A Q+G V E I+ ++ L M+LG FD + Y ++G
Sbjct: 303 DLNCGNMYLNLL-IAYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENVPYTNIGFHQND 361
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
E+ E A E +++ +VLLKN+ N LPL+ + ++AV+GP+AN+ A+ GNY G Y
Sbjct: 362 CQEHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNY 421
Query: 458 MSPIAGFSGYAN----VTYKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLD 507
++ + G V+Y GC K+ N A A+ AD ++ GLD
Sbjct: 422 ITVLEGIREAVGKDTIVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCMGLD 481
Query: 508 LSVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
S+E E S D+ L LPG Q +L+ + + K P+ILV+++ + + +A
Sbjct: 482 ASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAAE 540
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
+ AI+ A YPG EGG+A+A +FG+++P G+LPIT+Y T+ L
Sbjct: 541 K--VPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYR---------TTEELPEFTD 589
Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
RTY++ LYPFGYGL YT F Y Q+ LN+ Q S
Sbjct: 590 YSMKNRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTQ-------ISAGEN 634
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
+C VLV +N G+ + V +Y K I ++ G Q+V
Sbjct: 635 VQC-SVLV---------------KNTGNFASDETVQLYIKDVKASVEVPILELQGIQKVH 678
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ G + + F + L +++ N +L G I+VG
Sbjct: 679 LLPGTEQEVFFTLTP-RQLALINEEGNCILEPGAFEIYVG 717
>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6578]
gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
thermophila DSM 6578]
Length = 693
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 270/736 (36%), Positives = 395/736 (53%), Gaps = 92/736 (12%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R+ L+S+M+++EK + A G+PRLG+P Y WW+EALHGV+N G AT
Sbjct: 6 RMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSGE----------AT 55
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGRA-------GLTYWSPNINVARDP 175
FP I A+F+ L +++ +A+STEARA +N +G+ GLT+WSPNIN+ RDP
Sbjct: 56 VFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPNINIYRDP 115
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDPF+ + V++V+GLQ + ++V++C KHYA +
Sbjct: 116 RWGRGQETYGEDPFLTSKIGVSFVKGLQGDHPYY---------MRVAACAKHYAVH--SG 164
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
+G+ R+ FDARV+E+D+ ET+L FE VK G +VM +YNRVNG P+C +LL++
Sbjct: 165 PEGL-RHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLDEI 222
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+R W G++V+DC +I +HK D E ++A L+AG DL+CG Y + +AV
Sbjct: 223 LRKRWGFKGHVVSDCWAIADFHLHHKVTKDPIE-SIAMALEAGCDLNCGNTYEHLL-DAV 280
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
+ G V E +D+S+ L + L RLG F Y L DI + + LA EAA + +VL
Sbjct: 281 KAGVVSEELVDRSVARLLSTLDRLGLFTDDHPYARLSLSDIDWEAHRALAREAAEKSVVL 340
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVT 471
LKN+ LP + K++ + V GP+A VA++GNYAG+ R ++ + G +GYA VT
Sbjct: 341 LKNN-GILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITVT 399
Query: 472 YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR---------EDLWL 522
YK GC + N I AS A+ AD T+ + G D +VE E D DL L
Sbjct: 400 YKIGC-PLQGNKINPIDWASGVARYADVTVAVMGRDSTVEGEEGDAIFSDNYGDLSDLDL 458
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
P Q + + ++ E+ K P+++V++S G + E AI++A YPGEEGG AIA V
Sbjct: 459 PREQIEYLRRIKEIGK-PLVVVLLS--GAPVCSPELEELADAIVYAWYPGEEGGNAIARV 515
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
+FG+ +P GRLPIT+ G V LP P GRTY++ LYPFG+GLS
Sbjct: 516 LFGEISPSGRLPITFPRG--VDQLP-------PFTDYSMEGRTYRYMREEPLYPFGFGLS 566
Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
Y F Y R L ++ R + E + +
Sbjct: 567 YATFSY-------------------RGLQSSASRWDKR------------ETLELVCEVE 595
Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
N S +VV +Y + + + GF RV + AG K+++FV + + L+ +D
Sbjct: 596 NTSSIPADEVVQLYVRWEDAPFRVPLWSLKGFTRVSLGAGERKQVRFVLSP-EELSFIDE 654
Query: 763 AANTLLPAGEHTIFVG 778
+LP G VG
Sbjct: 655 EGRKVLPEGRLHFHVG 670
>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
Length = 732
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 268/751 (35%), Positives = 394/751 (52%), Gaps = 93/751 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ + L + R + LV+ MT+DEK+ QL +PRL +PQY WW+EALHG++ G
Sbjct: 29 IWFNPELSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVPQYNWWNEALHGIARNGK- 87
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTY 164
AT FP I A+F+ L +++ A+S EARA Y + + AGLT+
Sbjct: 88 ---------ATIFPQAIGLGATFDPELAQEVANAISDEARAKYAIAQSIGNQGQYAGLTF 138
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
W+PN+N+ RDPRWGR ET GEDP + + +V+GLQ + + LK +
Sbjct: 139 WTPNVNIFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQGDD---------PKYLKSAGV 189
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + R+ FD +++D+ ET+L FE V + + VMC+YN V G P
Sbjct: 190 AKHFA---VHSGPESLRHQFDVEPSKKDLYETYLPAFEALVTQAKVAGVMCAYNGVYGQP 246
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
SCA LL + ++ +W +GY+V+DC ++ HK + E A A L+AG+DL+CG
Sbjct: 247 SCASEFLLGEMLKKKWQFNGYVVSDCGALHDFHSGHKVTHNRVESA-ALALRAGVDLNCG 305
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENI 402
Y A ++G + ++ ID+ LK L + RLG FD S + ++G++ I S E+I
Sbjct: 306 FTYEKSLKAAFEEGLITQSLIDQRLKNLLMIRFRLGLFDPSELNPHNAIGQEVIHSLEHI 365
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
ELA + A + IVLLKN++ LPL S +K V GP A ++ ++GNY GI ++ +
Sbjct: 366 ELARKVAAKSIVLLKNEKQVLPL-SKDIKVPYVTGPFAASSDMLMGNYYGISDSLVTVLE 424
Query: 463 GFSGY----ANVTYKTGCDDVACKSN-NSIFAASEAAKTADATIILAGLDLSVEAESLD- 516
G +G +++ Y+ G + SN N + A E AKTADA I + G+ +E E +D
Sbjct: 425 GIAGKVSLGSSLNYRAGA--LPFHSNINPLNWAPEVAKTADAVIAVVGISADMEGEEVDA 482
Query: 517 --------REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
R + LP Q + Q+AE KGP+ILV+ + VDI+ E + AILW
Sbjct: 483 IASADRGDRVAITLPQNQVDYVKQLAENKKGPLILVVAAGSPVDIS--ELDPLADAILWI 540
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
YPGE+GG A+ADV+FG NP G LP+T+ T L P D GRTYKF
Sbjct: 541 WYPGEQGGNAVADVIFGDTNPSGHLPLTFVK---------TIDDLPPFDDYTMTGRTYKF 591
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
LYPFG+GLSYTQFK+ LS +K Q N+N +
Sbjct: 592 LKKLPLYPFGFGLSYTQFKFGKLSLSKRAP------QEGENINIS--------------- 630
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
V+ +N + DG VV VY P + I + F+RV + A + I+
Sbjct: 631 ----------VEVENSTALDGETVVQVYLSPQVPLKNEAITNLKAFKRVHIGAYEKRLIE 680
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
F K+L V+ A + P+G +T+ VG+
Sbjct: 681 FTIEG-KNLYRVNDAGENVWPSGAYTLAVGD 710
>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
Length = 462
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 211/449 (46%), Positives = 295/449 (65%), Gaps = 13/449 (2%)
Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLG 392
+A LDLDCG + T +AV++G + E +I+ +L TV MRLG FDG P Y +LG
Sbjct: 5 QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
D+C+ + ELA EAAR+GIVLLKN LPL++ ++VA++GP++N TV MIGNYAG
Sbjct: 65 PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124
Query: 453 IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
+ C Y +P+ G YA Y+ GC DVAC S+ AA +AA+ ADAT+++ GLD S+EA
Sbjct: 125 VACGYTTPLQGIGRYAKTIYQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIEA 184
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
ES DR +L LPG Q +LI++VA +KGP ILV+MS G +D++FAE + I I+WAGYPG
Sbjct: 185 ESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYPG 244
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
+ GG AI+DV+FG NPGG+LP+TWY DYV LP+T+M +RP S GYPGRTY+FY G
Sbjct: 245 QAGGAAISDVLFGTTNPGGKLPMTWYPQDYVTNLPMTNMAMRPSKSNGYPGRTYRFYKGK 304
Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLN-KLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
+YPFG+G+SYT F + + S + V L+ Q RN + A + V RC
Sbjct: 305 VVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRQASRNATISGKA-------IRVTHARC 357
Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ F +VD +N GS DG+ ++VYSKPPA A +KQ++ F++V V AG +R+
Sbjct: 358 NRLSFGVQVDVKNTGSMDGTHTLLVYSKPPAGHWAP-LKQLVAFEKVHVAAGTQQRVGIN 416
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK L++VD + +P G H++ +G+
Sbjct: 417 VHVCKFLSVVDRSGIRRIPMGAHSLHIGD 445
>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
8126]
Length = 758
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 283/748 (37%), Positives = 395/748 (52%), Gaps = 68/748 (9%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDF------AHGVPRLGLPQYEWWSEALHGVSN 108
CD++ R LV M + EK+ L ++ + G PRLGLP YEWWSEALHGV+
Sbjct: 11 CDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWWSEALHGVA- 69
Query: 109 VGPGTHFD---DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
PG F+ ATSF I +A+F++ L +K+ +STEARA N G AGL +W
Sbjct: 70 ASPGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFANAGSAGLDFW 129
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN RDPRWGR +ETPGEDP + Y + +RGL EG E+ KV + C
Sbjct: 130 TPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGL---EGEESIK-------KVIATC 179
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYAAYD++ W + RY FDA V+ QD+ E +L PF+ C ++ S+MCSYN +NG P+
Sbjct: 180 KHYAAYDLERWHNITRYEFDAIVSLQDLSEYYLPPFQQCARDSKVGSIMCSYNSLNGTPA 239
Query: 286 CADPKLLNQTVRGEW---DLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGL-- 339
CA+ L++ +R W + + YI +DC++I+ + D H F + E A A
Sbjct: 240 CANTYLMDDILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFTQTAAEAAAAAYTAGTDTV 299
Query: 340 -DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQD 395
++ YT+ G A Q + E ID++L+ LY L+R G+FD SP Y +G D
Sbjct: 300 CEVAGSPPYTDVVG-AYDQKLLSEEVIDRALRRLYEGLVRAGYFDPASASP-YRDIGWSD 357
Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
+ + E LA ++A +G+VLLKND TLP+ + KTVA++G A+ T +M+G Y+GIP
Sbjct: 358 VNTAEAQALALQSASDGLVLLKND-GTLPIK-LEGKTVALIGHWASGTRSMLGGYSGIPP 415
Query: 456 RYMSPIAGFSGYANVTYKTGCDDVACKS---NNSIFAASEAAKTADATIILAGLDLSVEA 512
Y SP+ +G N+TYK VA S + A AA +D + GLD SV +
Sbjct: 416 YYHSPVYA-AGQLNLTYKYASGPVAPASAARDTWTADALSAANKSDVILYFGGLDQSVAS 474
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E DR+ + P Q LI +A + K ++VI VD TN N+ AILWAGYPG
Sbjct: 475 EDKDRDSIAWPPAQLTLIQTLAGLGK--PLVVIQLGDQVDDTPLLTNPNVSAILWAGYPG 532
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY-NG 631
+ GG A+ + + G P GRLP+T Y Y LPLT M LRP + G PGRTY++
Sbjct: 533 QSGGTAVLNAITGVSPPAGRLPVTQYPSSYTSQLPLTDMSLRPDPASGRPGRTYRWLPRN 592
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
T+ PFGYGL YT F N Q N+T S P L + C
Sbjct: 593 ATVLPFGYGLHYTNFT-----------ARPNPAQ-----NFTLTPSALLAPCKLAHRDLC 636
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSK-----PPAEIAATYIKQVIGFQRVF-VRAGRNK 745
+ V+ N G+ V +V++ PP +K ++ + R+ + GR
Sbjct: 637 PLPYPVTVEVTNTGARTSDYVGLVFATTRDAGPPPHP----LKTLVAYARLRGIAPGRTA 692
Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEH 773
R + V A L VD A N +L G +
Sbjct: 693 RAQ-VQVALGDLARVDAAGNRVLYPGRY 719
>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 918
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 267/733 (36%), Positives = 396/733 (54%), Gaps = 34/733 (4%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD SL R LV+ +T+ EK+ L + A G+PRL +P YEWWSE LHGV+ PGT
Sbjct: 170 CDESLSDKQRAAALVAELTIWEKLDNLVNEAPGIPRLRVPPYEWWSEGLHGVAR-SPGTK 228
Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
F ATSFP IL ++F++ L + +G+ VS EARA N GR+GL +SPNIN
Sbjct: 229 FTSKGNFSYATSFPQPILLGSAFDDELVRAVGEVVSREARAFSNAGRSGLDLYSPNINAF 288
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
+DPRWGR ETPGED F + +Y + GL+ + + K+ + CKHYAA D
Sbjct: 289 KDPRWGRGQETPGEDTFHLQKYVSAMLSGLEGDDPDK----------KLIATCKHYAAND 338
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+N+KGVDR F+A ++ QD+ E +L PF+ C E + S MCSYN +NG P CA+ L+
Sbjct: 339 FENYKGVDRSGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPLCANSYLI 398
Query: 293 NQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY-T 348
+R W +G Y+ DCD + +MV H + D A A +++AG DL+C + +
Sbjct: 399 EDILRKHWGWNGDGQYVSTDCDCVALMVSYHHYAPDLGH-AAAWSMQAGTDLECNAFPGS 457
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAA 406
+A Q + E D+DK+L +YT L+ +G FD + SLG ++ + E +LA
Sbjct: 458 EALQSAWNQSLISEKDVDKALTRMYTSLVSVGLFDLDRKDPLRSLGWDEVNTKEAQDLAY 517
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
AA EG VL+KND LPL+ K A++GP +AT M GNY G +SP
Sbjct: 518 RAAVEGAVLMKND-GILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLISPRKAAKD 576
Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
+ TY G KS++S A +AA+ AD I + G+D ++E E+LDR L P
Sbjct: 577 LGLDFTYFLGSR--TNKSDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNTLAWPEP 634
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q QL+ ++EV K P++++ G VD N ++ AILW GYPG+ GG+AI D+VFG
Sbjct: 635 QLQLLRALSEVGK-PLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAILDIVFG 693
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
+ P GRL +T Y Y +P T M LRP GRTY++Y G T P+G+GL YT+
Sbjct: 694 RAAPAGRLSVTQYPASYNDAVPATDMNLRPGPGNSGLGRTYRWYTGETPVPYGFGLHYTK 753
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F ++ + +++ ++ N D + + P R V +N G
Sbjct: 754 FSVDMKPASNVHNIDIAQMAAEAN-----DDAASEIPSWQRGLER--RMVTVTVSAKNEG 806
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYAA 764
+ V +V+ + A K ++G+ R+ ++ G ++ + + + L VD
Sbjct: 807 NVISDYVALVFLRSEAGPKPWPQKTLVGYTRLRNIKPGEERKEEIIIKM-EQLVRVDEVG 865
Query: 765 NTLLPAGEHTIFV 777
N +L G +++F+
Sbjct: 866 NRVLYEGLYSLFL 878
>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
XB6B4]
Length = 710
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 267/747 (35%), Positives = 387/747 (51%), Gaps = 99/747 (13%)
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
Y R +LV +MTL+EKV Q A V RL + Y WW+EALHGV+ G
Sbjct: 13 YRKRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT--------- 63
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG--------LTYWSPNINVA 172
AT FP I A+F+E L +++G AVSTEARA +N+ + G LT+W+PN+N+
Sbjct: 64 -ATVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIF 122
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ET GEDP++ R V Y+ GLQ GH + LK ++C KH+A
Sbjct: 123 RDPRWGRGHETFGEDPYLTSRLGVRYIEGLQ---GH------DENYLKAAACAKHFA--- 170
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
V + R+ FDA VTEQD+ ET+L FE CVKEG +VM +YNR NG+P C + +LL
Sbjct: 171 VHSGPEAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLL 230
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
+R EW G++ +DC +I+ + H + E +VA + G DL+CG + F
Sbjct: 231 IDILRKEWGFSGHVTSDCWAIRDFHEGHHVTGTAIE-SVAMAMNNGCDLNCGTLF-GFLV 288
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAR 410
AV+QG VKE +D+++ L+ M+LG FD + Y + S E +L AR
Sbjct: 289 QAVRQGLVKEERLDEAVTNLFMARMKLGVFDKKEENPYDKIPYLAADSREMKKLNEAVAR 348
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY--- 467
+VLLKN ++ LPL+ K+KTV V+GP+A++ A++GNY G RY++ + G Y
Sbjct: 349 RTVVLLKNKEHILPLDKNKIKTVGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGD 408
Query: 468 -ANVTYKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE------- 513
V Y GC D + + N+ + K +D + + GLD +E E
Sbjct: 409 DVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNE 468
Query: 514 --SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
S D+ DL LPG Q +++ K PVILV++S + + +A + ++ AI+ YP
Sbjct: 469 YGSGDKPDLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIVQGWYP 525
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G GG AIAD++FG+ NP G+LP+T+Y T+ L + GRTY++
Sbjct: 526 GARGGAAIADILFGEANPEGKLPVTFYR---------TTEELPDFEDYSMQGRTYRYMEQ 576
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
LYPFGYGLSYT++ Y Q+ R L S+ G+ V
Sbjct: 577 EALYPFGYGLSYTEYAY----------------QNVRFLEQEPVVSEGVTIGLSV----- 615
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
+N G DG++ V VY K AE + Q+ ++ + AG K I
Sbjct: 616 ----------KNTGKMDGTETVQVYVK--AEHSKMPHGQLKKIVKLPLCAGEEKEINIRL 663
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
+ ++ + D +LP+G IFVG
Sbjct: 664 ES-EAFMLYDENGEKILPSGHFEIFVG 689
>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
cellulolyticum H10]
Length = 712
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 267/760 (35%), Positives = 394/760 (51%), Gaps = 102/760 (13%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
M+ + D SL + R DLVSRMTL+EK QL A V RLG+P+Y WW+EALHGV+
Sbjct: 1 MNKPKYLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVAR 60
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-------- 160
G AT FP I A F++ +KI ++TE RA YN
Sbjct: 61 AGV----------ATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIYK 110
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
G+T+WSPN+N+ RDPRWGR ET GEDP++ R V +V+GLQ + + LK
Sbjct: 111 GITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG----------DGKYLK 160
Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
++C KH+A V + DR+HF+A +++DM ET+L FE VKE SVM +YNR
Sbjct: 161 SAACAKHFA---VHSGPEDDRHHFNAVASQKDMYETYLPAFEALVKEAKVESVMGAYNRT 217
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
NG P LL +R +W G++V+DC +I+ + H + + ++VA LK G D
Sbjct: 218 NGEPCNGSKTLLKDILRDDWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKNGCD 276
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
L+CG Y A+++GK+ E DID++ L T M+LG FD ++ + + S E
Sbjct: 277 LNCGNMYL-LILLALKEGKITEEDIDRAAIRLMTTRMKLGMFDDDCEFDKIPYEVNDSIE 335
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ +L+ EAAR+ +VLLKN+ LPL+S K+K +AV+GP+A++++A+ NY+G P ++
Sbjct: 336 HNKLSLEAARKSMVLLKNN-GLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSHNITI 394
Query: 461 IAGF----SGYANVTYKTGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
+ G S V Y G +D+A + ++ + A A+ +D ++ GLD S
Sbjct: 395 LDGVRSRVSEDTRVWYSLGSHLFMNREEDLA-QPDDRLKEAVSMAERSDVVVLCLGLDAS 453
Query: 510 VEAESLD-----------REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
VE E D + DL LP Q L+N V K P I+ ++S + I A
Sbjct: 454 VEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD 512
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
AI+ YPG +GG A A+++FG ++P GRLP+T+Y ++ L P +
Sbjct: 513 KA--AAIVQCWYPGSKGGLAFAEMIFGDYSPAGRLPVTFYK---------STEELPPFED 561
Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
RTYKF G LYPFG+GLSYT F+Y S
Sbjct: 562 YSMENRTYKFMKGEALYPFGFGLSYTNFEY----------------------------SN 593
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
CP + N + VD QN GS D +VV VY K + GF+R+F
Sbjct: 594 IVCPQAVNNG----ESLSVSVDVQNAGSVDSDEVVQVYIKDMEASVRVPNHSLCGFKRIF 649
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+++G K + F ++ +++ IVD + G+ T++VG
Sbjct: 650 LKSGEKKTVTFEIDS-RAMTIVDEEGKRYIENGDFTLYVG 688
>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
Length = 712
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 269/760 (35%), Positives = 392/760 (51%), Gaps = 102/760 (13%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
M + D SL + R DLVSRMTL+EK QL A V RLG+P+Y WW+EALHGV+
Sbjct: 1 MEKPKYLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVAR 60
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-------- 160
G AT FP I A F++ +KI ++TE RA YN
Sbjct: 61 AGV----------ATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYNENAKKGDRDIYK 110
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
G+T+WSPN+N+ RDPRWGR ET GEDP++ R V +V+GLQ + + LK
Sbjct: 111 GITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG----------DGKYLK 160
Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
++C KH+A V + DR+HFDA V+++D+ ET+L FE VKE SVM +YNR
Sbjct: 161 TAACAKHFA---VHSGPEDDRHHFDAVVSQKDLYETYLPAFEALVKEAKVESVMGAYNRT 217
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
NG P LL +R W G++V+DC +I+ + H + + ++VA LK+G D
Sbjct: 218 NGEPCNGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKSGCD 276
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
L+CG Y A+++G++ E DID++ L T MRLG FD ++ + + S E
Sbjct: 277 LNCGNMYL-LILLALKEGRITEEDIDRAAIRLMTTRMRLGMFDDDCEFDKIPYELNDSVE 335
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ +L+ EAA++ +VLLKND LPL+S K+K +AV+GP+A++++A+ NY+G P + ++
Sbjct: 336 HNKLSLEAAKKSMVLLKND-GLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSQNITI 394
Query: 461 IAGF----SGYANVTYKTGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
+ G S V Y G +D+A + ++ + A A+ +D ++ GLD S
Sbjct: 395 LDGIRKRVSEDTRVWYSVGSHLFMNREEDLA-QPDDRLKEAVSVAERSDVVVLCLGLDAS 453
Query: 510 VEAESLD-----------REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
VE E D + DL LP Q L+N V K P I+ ++S + I A
Sbjct: 454 VEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD 512
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
AI+ YPG GG A A+++FG ++P GRLP+T+Y ++ L P
Sbjct: 513 KA--AAIVQCWYPGSRGGLAFAEMIFGDYSPAGRLPVTFYK---------STEELPPFAD 561
Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
RTYKF G LYPFG+GLSYT F+Y S
Sbjct: 562 YSMENRTYKFMKGEALYPFGFGLSYTNFEY----------------------------SN 593
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
CP + N + VD QN GS D +VV VY K + GF+R+
Sbjct: 594 IVCPQNVNNG----ENLSVSVDVQNAGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIH 649
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+++G K + F ++ ++ IVD A + GE T++VG
Sbjct: 650 LKSGEKKTVTFEIDS-NAMTIVDEAGKRYIENGEFTLYVG 688
>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
Length = 797
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 247/601 (41%), Positives = 354/601 (58%), Gaps = 22/601 (3%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ C++S+ Y R + L+S TL+E + + A GVPRLGLP Y+ WSE LHG+
Sbjct: 62 IVCNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEGLHGLDRANWA 121
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
++ ATSFP IL+ A+ N +L +I ++T+ARA N+GR GL ++PNIN
Sbjct: 122 KSGEE-WKWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNINGF 180
Query: 173 RDPRWGRITETPGEDP-FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
R P WGR ETPGED F+ YA Y+ GLQ ++ LK+ + KH+A Y
Sbjct: 181 RSPLWGRGQETPGEDAGFLSSSYAYEYITGLQG--------GVDPEHLKIVATAKHFAGY 232
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
D++NW R FDA +T+QD+ E + F + A S MCSYN VNG+PSC+ L
Sbjct: 233 DLENWNNNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFL 292
Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
L +R WD +GY+ +DCD+ + + H + A + A A +L+AG D+DCGQ Y
Sbjct: 293 LQTLLRENWDFPDYGYVSSDCDAAYNVFNPHGY-AINISAAAADSLRAGTDIDCGQTYPW 351
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEA 408
+ + +G V +I++SL LY+ L++LG+FDG+ +Y LG D+ + + ++ EA
Sbjct: 352 YLNQSFIEGSVTRGEIERSLIRLYSNLVKLGYFDGNQSEYRQLGWNDVVATDAWNISYEA 411
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGFSG 466
A EGIVLLKND LPL S K+K+VAV+GP ANAT + GNY G ++P+ A +G
Sbjct: 412 AVEGIVLLKND-GVLPL-SEKLKSVAVIGPWANATQQLQGNYFGPAPYLITPLQAARDAG 469
Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
Y V Y G ++ + + AA AAK +D I L G+D ++EAE DR ++ PG Q
Sbjct: 470 Y-KVNYAFGT-NILGNTTDGFAAALSAAKKSDVIIYLGGIDNTIEAEGTDRMNVTWPGNQ 527
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
LI Q+++ K P++++ M G VD + ++N N+ A++W GYPG+ GG+AI D++ GK
Sbjct: 528 LDLIQQLSQTGK-PLVVLQMGGGQVDSSSLKSNNNVNALVWGGYPGQSGGKAIFDILSGK 586
Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
P GRL T Y +Y P T M LRP D PG+TY +Y G +Y FGY L YT F
Sbjct: 587 RAPAGRLVTTQYPAEYATQFPATDMNLRP-DGKSNPGQTYIWYTGKPVYEFGYALFYTTF 645
Query: 647 K 647
K
Sbjct: 646 K 646
>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
M50/1]
Length = 710
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 266/747 (35%), Positives = 387/747 (51%), Gaps = 99/747 (13%)
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
Y R +LV +MTL+EKV Q A V RL + Y WW+EALHGV+ G
Sbjct: 13 YRKRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT--------- 63
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG--------LTYWSPNINVA 172
AT FP I A+F+E L +++G AVSTEARA +N+ + G LT+W+PN+N+
Sbjct: 64 -ATVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIF 122
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ET GEDP++ R V Y+ GLQ GH + LK ++C KH+A
Sbjct: 123 RDPRWGRGHETFGEDPYLTSRLGVRYIEGLQ---GH------DENYLKAAACAKHFA--- 170
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
V + R+ FDA VTEQD+ ET+L FE CVKEG +VM +YNR NG+P C + +LL
Sbjct: 171 VHSGPEAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLL 230
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
+R EW G++ +DC +I+ + H + E +VA + G DL+CG + F
Sbjct: 231 IDILRKEWGFSGHVTSDCWAIRDFHEGHHVTGTAIE-SVAMAMNNGCDLNCGTLF-GFLV 288
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAR 410
AV+QG VKE +D+++ L+ M+LG FD + Y + S E +L AR
Sbjct: 289 QAVRQGLVKEERLDEAVTNLFMARMKLGVFDKKEENPYDKIPYLAADSREMKKLNEAVAR 348
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY--- 467
+VLLKN ++ LPL+ K+KT+ V+GP+A++ A++GNY G RY++ + G Y
Sbjct: 349 RTVVLLKNKEHILPLDKNKIKTIGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGD 408
Query: 468 -ANVTYKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE------- 513
V Y GC D + + N+ + K +D + + GLD +E E
Sbjct: 409 DVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNE 468
Query: 514 --SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
S D+ DL LPG Q +++ K PVILV++S + + +A + ++ AI+ YP
Sbjct: 469 YGSGDKPDLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIVQGWYP 525
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G GG AIAD++FG+ NP G+LP+T+Y T+ L + GRTY++
Sbjct: 526 GARGGAAIADILFGEANPEGKLPVTFYR---------TTEELPDFEDYSMQGRTYRYMEQ 576
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
LYPFGYGLSYT++ Y Q+ R L S+ G+ V
Sbjct: 577 EALYPFGYGLSYTEYAY----------------QNVRFLEQEPVVSEGVTIGLSV----- 615
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
+N G DG++ V VY K AE + Q+ ++ + AG K I
Sbjct: 616 ----------KNTGKMDGTETVQVYVK--AEHSKMPHGQLKKIVKLPLCAGEEKEINIRL 663
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
+ ++ + D +LP+G IFVG
Sbjct: 664 ES-EAFMLYDENGEKILPSGHFEIFVG 689
>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A11]
gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
B3]
gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
A12]
Length = 724
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 259/759 (34%), Positives = 389/759 (51%), Gaps = 97/759 (12%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
M F + D +L + R KDLVSRMTL+EKV Q+ + +PRLG+P Y WWSEALHGV+
Sbjct: 1 MEIFAYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVAR 60
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------A 160
G AT FP I A+F+E L + + +S E RA ++ +
Sbjct: 61 AGV----------ATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHGIYK 110
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
GLT+WSPN+N+ RDPRWGR ET GEDP++ GR V++++GLQ + + L+
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQG---------QDKKYLR 161
Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
++C KH+A V + +R+ FDA V+ +D+ ET+L F+ CVKE + +VM +YNRV
Sbjct: 162 AAACAKHFA---VHSGPESERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRV 218
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
NG P C LL +T+R EW G++V+DC +I+ +NH+ + S ++VA L G D
Sbjct: 219 NGEPCCGSNMLLKETLRREWGFTGHVVSDCWAIKDFHENHR-VTSSAPESVAMALNNGCD 277
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICS 398
L+CG Y N A Q+G V E I+ ++ L M+LG FD + Y +G
Sbjct: 278 LNCGNMYLNLL-IAYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENVPYTKIGFHQNDC 336
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
E+ E A E +++ +VLLKN+ N LPL+ + ++AV+GP+AN+ A+ GNY G Y+
Sbjct: 337 QEHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYI 396
Query: 459 SPIAGFSGYAN----VTYKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDL 508
+ + G V+Y GC K+ N A A+ AD ++ GLD
Sbjct: 397 TVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCMGLDA 456
Query: 509 SVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
S+E E S D+ L LPG Q +L+ + + K P+ILV+++ + + +A
Sbjct: 457 SIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAA-- 513
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
I AI+ A YPG EGG+A+A +FG+++P G+LPIT+Y T+ L
Sbjct: 514 EKIPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYR---------TTEELPEFTDY 564
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
RTY++ LYPFGYGL YT F Y Q+ LN+ Q
Sbjct: 565 SMKNRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTQ-------------- 602
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+ + + V +N G+ + V +Y K I + G Q+V +
Sbjct: 603 ---------ISVGENVQGSVLVKNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKVHL 653
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
G + + F + L +++ N +L G I+VG
Sbjct: 654 LPGTEQEVFFTLTP-RQLALINEEGNCILEPGVFEIYVG 691
>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
Length = 793
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 272/738 (36%), Positives = 395/738 (53%), Gaps = 39/738 (5%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S L CD S R LVS TL+E V G+ GVPRLGLP+Y+ WSE+LHGV
Sbjct: 57 LSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYR 116
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
+ D ATSFP ILT A+ N +L +IG +ST+ARA N+GR GL ++PN
Sbjct: 117 ANWASEGD--YSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPN 174
Query: 169 INVARDPRWGRITETPGEDPF-VVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
IN R P WGR ETPGED + + YA Y+ G+Q ++ LK+ + KH
Sbjct: 175 INSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG--------GVDPETLKLVATAKH 226
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
YA YD++NW G R D ++T+QD+ E + F + ++ SVMCSYN VNG+PSC+
Sbjct: 227 YAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNGVPSCS 286
Query: 288 DPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+ L +R + GY+ DC ++ + H++ A+ + A A +++AG D+DCG
Sbjct: 287 NSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAAN-ESSASADSIRAGTDIDCGT 345
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIEL 404
Y NA +G++ DI++ + LYT L+RLG+FDG S QY L D+ + + +
Sbjct: 346 SYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQYRDLTWSDVQTTDAWNI 405
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM-SPIAG 463
+ EAA EG VLLKND TLPL + +++VA++GP ANAT M GNY G P Y+ SP+A
Sbjct: 406 SHEAAVEGTVLLKND-GTLPL-ADSIRSVALIGPWANATTQMQGNYYG-PAPYLTSPLAA 462
Query: 464 FSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
+V Y G +++ + A AA+ ADA I G+D ++E E+LDR ++
Sbjct: 463 LEASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDRMNITW 521
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
PG Q LINQ++ + K P++++ M G VD + + NTN+ A+LW GYPG+ GG A+ D+
Sbjct: 522 PGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGTALLDI 580
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
+ G P GRL T Y Y P M LRP + PG+TY +Y G +Y FG+GL
Sbjct: 581 IRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGT--NPGQTYMWYTGTPVYEFGHGLF 638
Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
YT F+ S T T + N D PG LR + F
Sbjct: 639 YTTFEAKRAS-TATNHSSFN----------IEDLLTAPHPGYAYPQLR--PFLNFTAHIT 685
Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV-FVRAGRNKRIKFVFNACKSLNIVD 761
N G T ++++ A A K ++GF R+ + G ++ + F ++ D
Sbjct: 686 NTGRTTSDYTAMLFANTTAGPAPHPNKWLVGFDRLGALEPGASQTMTFPI-TIDNVARTD 744
Query: 762 YAANTLLPAGEHTIFVGN 779
N +L G + + + N
Sbjct: 745 ELGNRVLYPGRYELALNN 762
>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
Length = 804
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 266/748 (35%), Positives = 387/748 (51%), Gaps = 45/748 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S L CDS+ R LVS TL+E + G+ + GVPRLGLP Y+ WSEALHG++
Sbjct: 54 LSKNLVCDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLAR 113
Query: 109 VGPGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
+F D ATSFP+ IL+ A+FN +L +I +ST+ RA N GR GL +S
Sbjct: 114 A----NFTDNGAYSWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYS 169
Query: 167 PNINVARDPRWGRITETPGEDPFVV-GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
PNIN R P WGR ETPGED + + YA Y+ G+Q +N LK+++
Sbjct: 170 PNINTFRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQG--------GVNPEHLKLAATA 221
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A YD++NW R D +T+QD+ E + F + ++ S MCSYN VNG+PS
Sbjct: 222 KHFAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPS 281
Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
C++ L +R + HGY+ DC ++ + + H + A+ + A A + AG D+DC
Sbjct: 282 CSNTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAAN-EPSAAADAILAGTDIDC 340
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG----SPQYVSLGKQDICSD 399
G Y ++ G V DI++ LY L+ LG+FDG S Y SLG D+
Sbjct: 341 GTSYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKT 400
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNS---AKVKTVAVVGPHANATVAMIGNYAGIPCR 456
+ ++ EAA EGIVLLKND TLPL S K K++A++GP ANAT + GNY G
Sbjct: 401 DAWNISYEAAVEGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPY 459
Query: 457 YMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLD 516
+SP+ F+ + +++ S + AA AA+ AD + L G+D ++EAE+ D
Sbjct: 460 LISPVDAFTAAGYTVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQD 519
Query: 517 REDLWLPGYQTQLINQVA--EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R + PG Q +LI+Q+A + P+++ M G VD + ++N + A+LW GYPG+
Sbjct: 520 RSSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSALKSNAKVNALLWGGYPGQS 579
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
GG A+ D++ G P GRL T Y Y + M LRP ++ PG+TY +Y G +
Sbjct: 580 GGLALRDILTGARAPAGRLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPV 639
Query: 635 YPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
Y FG+GL YT F + KT N+ L + + T+ +T
Sbjct: 640 YAFGHGLFYTTFNASSAQAAKTKYTFNITDLTSAAHPDTTTVGQRT-------------- 685
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVFVRAGRNKRIKF-VF 751
F F N G D +VY+ + Y K ++GF R+ A + V
Sbjct: 686 LFNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVP 745
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGN 779
A L VD A NT+L G + + + N
Sbjct: 746 VAVDRLARVDEAGNTVLFPGRYEVALNN 773
>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
[Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
nidulans FGSC A4]
Length = 803
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 267/731 (36%), Positives = 389/731 (53%), Gaps = 38/731 (5%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV--SNVGPG 112
CD SL R LVS T DE V G+ GV RLGLP Y+ W EALHGV +N
Sbjct: 61 CDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVGRANFVES 120
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
+F ATSFP I A+ N++L +IG VST+ RA N G G+ +SPNIN
Sbjct: 121 GNFS----WATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGVDVYSPNINTF 176
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
R P WGR ETPGED F+ Y Y+ LQ ++ LK+ + KHYA YD
Sbjct: 177 RHPVWGRGQETPGEDAFLTSVYGYEYITALQG--------GVDPETLKIIATAKHYAGYD 228
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+++W R D ++T+Q++ E + PF + ++ SVMCSYN VNG+PSCA+ L
Sbjct: 229 IESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFL 288
Query: 293 NQTVRG--EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
+R E+ GY+ DC ++ + + H + A ++ A A ++ AG D+DCG Y
Sbjct: 289 QTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGY-ASNEAAASADSILAGTDIDCGTSYQWH 347
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAA 409
+ +A + V +DI++ + LY+ L++ G+FDG Y + D+ S + +A EAA
Sbjct: 348 SEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDAPYRDITWDDVLSTDAWNIAYEAA 407
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA- 468
EGIVLLKND+ TLPL S +K+VAV+GP AN T + GNY G +SP+ GF
Sbjct: 408 VEGIVLLKNDE-TLPL-SKDIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSGL 465
Query: 469 NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQ 528
+V Y G + + S + A AAK ADA I G+D ++EAE++DRE++ PG Q
Sbjct: 466 DVHYALGTN-LTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLD 524
Query: 529 LINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFN 588
LI++++E+ K P++++ M G VD + + N N+ A++W GYPG+ GG A+AD++ GK
Sbjct: 525 LISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRA 583
Query: 589 PGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKY 648
P GRL T Y +Y ++ P M LRP ++ G PG+TY +Y G +Y FG+GL YT F+
Sbjct: 584 PAGRLVTTQYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFGHGLFYTTFE- 642
Query: 649 NLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTD 708
T N+ + + Y KT L+N F +N G +
Sbjct: 643 ESTETTDAGSFNIQTVLTTPHSGYEHAQQKT-----LLN---------FTATVKNTGERE 688
Query: 709 GSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL 768
+VY A A K V+GF R+ + + V +S+ D N +L
Sbjct: 689 SDYTALVYVNTTAGPAPYPKKWVVGFDRLGGLEPGDSQTLTVPVTVESVARTDEQGNRVL 748
Query: 769 PAGEHTIFVGN 779
G + + + N
Sbjct: 749 YPGSYELALNN 759
>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
Length = 776
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 265/707 (37%), Positives = 381/707 (53%), Gaps = 36/707 (5%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S L CD S R LVS TL+E V G+ GVPRLGLP+Y+ WSE+LHGV
Sbjct: 75 LSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYR 134
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
+ D ATSFP ILT A+ N +L +IG +ST+ARA N+GR GL ++PN
Sbjct: 135 ANWASEGD--YSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPN 192
Query: 169 INVARDPRWGRITETPGEDPF-VVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
IN R P WGR ETPGED + + YA Y+ G+Q ++ LK+ + KH
Sbjct: 193 INSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG--------GVDPETLKLVATAKH 244
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
YA YD++NW G R D ++T+QD+ E + F + ++ SVMCSYN VNG+PSC+
Sbjct: 245 YAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNGVPSCS 304
Query: 288 DPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+ L +R + GY+ DC ++ + H++ A+ + A A +++AG D+DCG
Sbjct: 305 NSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAAN-ESSASADSIRAGTDIDCGT 363
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIEL 404
Y NA +G++ DI++ + LYT L+RLG+FDG S QY L D+ + + +
Sbjct: 364 SYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQYRDLTWSDVQTTDAWNI 423
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
+ EAA EG VLLKND TLPL + +++VA++GP ANAT M GNY G SP+A
Sbjct: 424 SHEAAVEGTVLLKND-GTLPL-ADSIRSVALIGPWANATTQMQGNYYGPAPYLTSPLAAL 481
Query: 465 SGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
+V Y G +++ + A AA+ ADA I G+D ++E E+LDR ++ P
Sbjct: 482 EASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDRMNITWP 540
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q LINQ++ + K P++++ M G VD + + NTN+ A+LW GYPG+ GG A+ D++
Sbjct: 541 GNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGTALLDII 599
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
G P GRL T Y Y P M LRP + PG+TY +Y G +Y FG+GL Y
Sbjct: 600 RGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGT--NPGQTYMWYTGTPVYEFGHGLFY 657
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T F+ S T T + N D PG LR + F N
Sbjct: 658 TTFEAKRAS-TATNHSSFN----------IEDLLTAPHPGYAYPQLR--PFLNFTAHITN 704
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV-FVRAGRNKRIKF 749
G T ++++ A A K ++GF R+ + G ++ + F
Sbjct: 705 TGRTTSDYTAMLFANTTAGPAPHPNKWLVGFDRLGALEPGASQTMTF 751
>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 728
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 234/585 (40%), Positives = 346/585 (59%), Gaps = 30/585 (5%)
Query: 72 MTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF-DDVIPG--ATSFPTV 128
M+++EKV+ L D + GV LGLP + WW+E LHGV PG F D P ATSFP
Sbjct: 1 MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGVG-FSPGVLFAQDSEPFGYATSFPLP 59
Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
ILT ASF++ L+ IGQ + E RA N G AG +W+PN+N RDPRWGR ETPGED
Sbjct: 60 ILTAASFDDDLFNAIGQVIGREGRAFSNYGYAGFNFWTPNMNAFRDPRWGRGQETPGEDV 119
Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
VV Y +YV GLQ + + + + CKH+AAYD++ + + Y+
Sbjct: 120 LVVSNYVQSYVTGLQGSDPTDKV---------IIAACKHFAAYDIETARRANNYN----P 166
Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL---HGY 305
T+QD+++ +L F CV++ +VMCSYN V+GIP+C+ LL + +R W + +
Sbjct: 167 TQQDLQDYYLPAFRRCVRDSHVGTVMCSYNSVDGIPACSSEYLLKEVLRDTWGFTNDYQF 226
Query: 306 IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDI 365
+V+DC ++ + H F ++++DA + ++ AG DL+CG Y + G+ + +V + +
Sbjct: 227 VVSDCGAVTDVWLLHNF-TNTEQDAASVSMAAGTDLECGSSYLHLNGSLADK-QVTQERV 284
Query: 366 DKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
D++L LY L +G+FDGS + SLG D+ + + ++A EAAR G+ LLKND LPL
Sbjct: 285 DEALTRLYKALFTVGYFDGS-SHSSLGWSDVSTIDAQQIACEAARAGMTLLKND-GVLPL 342
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN--VTYKTGCDDVACKS 483
K K+VA++GP ANAT M GNY G SP+ F+ ++ V Y G D+ S
Sbjct: 343 ADGKYKSVALIGPFANATTQMQGNYFGRAPFVRSPLWAFTQQSSLQVNYAAGT-DINSTS 401
Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
++ A AAK +D I G+D ++EAE+LDR + PG Q LI+Q++ + K P+++
Sbjct: 402 DSGFADALAAAKNSDIVIFCGGIDTTIEAETLDRVSITWPGNQLDLISQLSMLGK-PLVV 460
Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
G VD N N+ A+ WAG PG+ GG A+ D+V GK + GRLP T Y Y
Sbjct: 461 AQFGGGQVDDTALVDNANVNALFWAGLPGQAGGLAMYDLVVGKASFAGRLPTTQYPASYA 520
Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKY 648
++ + ++ LRP + +PGRTYK+Y G ++PFG+GL YT+F +
Sbjct: 521 DLVSIFNINLRPNGT--FPGRTYKWYIGEPVFPFGFGLHYTKFNF 563
>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
Flags: Precursor
gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
Length = 763
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 281/743 (37%), Positives = 397/743 (53%), Gaps = 57/743 (7%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S CD+SL R K LVS +TL+EK+ G A G RLGLP Y WW+EALHGV+
Sbjct: 33 LSELPICDTSLSPLERAKSLVSALTLEEKINNTGHEAAGSSRLGLPAYNWWNEALHGVAE 92
Query: 109 VGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
G F++ ATSFP I+ A+FN++L +++ + +STEARA N AG+ YW+
Sbjct: 93 KH-GVSFEESGDFSYATSFPAPIVLGAAFNDALIRRVAEIISTEARAFSNSDHAGIDYWT 151
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
PN+N +DPRWGR ETPGEDP RY +V GLQ D +P KV + CK
Sbjct: 152 PNVNPFKDPRWGRGQETPGEDPLHCSRYVKEFVGGLQG--------DDPEKP-KVVATCK 202
Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
H AAYD++ W GV R+ FDA+V+ D+ E +L PF+ C + + MCSYN +NG+P+C
Sbjct: 203 HLAAYDLEEWGGVSRFEFDAKVSAVDLLEYYLPPFKTCAVDASVGAFMCSYNALNGVPAC 262
Query: 287 ADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
AD LL +R W G ++ DC +++ + H ++ +S +A A L AG+DLDC
Sbjct: 263 ADRYLLQTVLREHWGWEGPGHWVTGDCGAVERIQTYHHYV-ESGPEAAAAALNAGVDLDC 321
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDICSDE 400
G + ++ G A +QG + +D +L LYT L++LG+FD G P SLG D+ + E
Sbjct: 322 GTWLPSYLGEAERQGLISNETLDAALTRLYTSLVQLGYFDPAEGQP-LRSLGWDDVATSE 380
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--- 457
ELA A +G VLLKN TLPL + T+A++GP N T + NYAG P ++
Sbjct: 381 AEELAKTVAIQGTVLLKNIDWTLPLKAN--GTLALIGPFINFTTELQSNYAG-PAKHIPT 437
Query: 458 MSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
M A GY NV G +V S + A A ADA I G+D +VE ESLDR
Sbjct: 438 MIEAAERLGY-NVLTAPGT-EVNSTSTDGFDDALAIAAEADALIFFGGIDNTVEEESLDR 495
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
+ PG Q +LI ++AE+ + P+ +V G VD + + + AI+WAGYP + GG
Sbjct: 496 TRIDWPGNQEELILELAELGR-PLTVVQFGGGQVDDSALLASAGVGAIVWAGYPSQAGGA 554
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
+ DV+ GK P GRLPIT Y YV +P+T M L+P PGRTY++Y L PF
Sbjct: 555 GVFDVLTGKAAPAGRLPITQYPKSYVDEVPMTDMNLQP--GTDNPGRTYRWYEDAVL-PF 611
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
G+GL YT F +S+ K + R N +S+ T F
Sbjct: 612 GFGLHYTTFN---VSWAKKAFGPYDAATLARGKNPSSNIVDT-----------------F 651
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAA--TYIKQVIGFQRV-FVRAGRNKRIKFVFNAC 754
+ N G V +V++ P E+ A IK ++G+ R ++ G +++
Sbjct: 652 SLAVTNTGDVASDYVALVFASAP-ELGAQPAPIKTLVGYSRASLIKPGETRKVDVEVTVA 710
Query: 755 KSLNIVDYAANTLLPAGEHTIFV 777
+ L P GE+T+ V
Sbjct: 711 PLTRATEDGRVVLYP-GEYTLLV 732
>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
Length = 526
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 215/487 (44%), Positives = 314/487 (64%), Gaps = 12/487 (2%)
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
YIV+DCDS+ ++ + + + E+A A+++ AGLDL+CG + N T NAV++G + E
Sbjct: 45 YIVSDCDSLGILYGSQHY-TKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEAA 103
Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
I+K++ + LMRLGFFDG+P+ Y LG +D+C+ EN ELA E AR+GIVLLKN
Sbjct: 104 INKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAG 163
Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVA 480
+LPL+ + +KT+AV+GP+AN T MIGNY G+ C+Y +P+ G T Y GC +V
Sbjct: 164 SLPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVT 223
Query: 481 CKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGP 540
C + + +A A +ADAT+++ G D ++E E+LDR DL LPG Q +L+ QVA+ A+GP
Sbjct: 224 C-TEADLDSAKTLAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGP 282
Query: 541 VILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNG 600
V+LVIMS GG DI FA+ + I +I+W GYPGE GG AIADV+FG+ NP G+LP+TWY
Sbjct: 283 VVLVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQ 342
Query: 601 DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
YV+ +P+T+M +RP S GY GRTY+FY G T+Y FG GLSYT F + L+ K + +N
Sbjct: 343 SYVEKVPMTNMNMRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLN 402
Query: 661 LNKLQHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
L++ Q CR+ S DA C + R D FE ++ +NVG +G++ V +++ P
Sbjct: 403 LDESQSCRSPECQSLDAIGPHCEKAVGE--RSD--FEVQLKVRNVGDREGTETVFLFTTP 458
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
P E+ + KQ++GF+++ + ++F + CK L +VD L G H + VG+
Sbjct: 459 P-EVHGSPRKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGS 517
Query: 780 GGVSFPI 786
SF I
Sbjct: 518 LKHSFNI 524
>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
Length = 804
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 269/719 (37%), Positives = 389/719 (54%), Gaps = 50/719 (6%)
Query: 49 MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ S L CD S+ PY R L+S TLDE + G+ GV RLGLP Y+ WSEALHG+
Sbjct: 63 LRSHLICDESATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121
Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
+F D+ ATSFP ILTTA+ N +L +I +ST+ RA N GR GL +
Sbjct: 122 RA----NFSDLGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN R P WGR ETPGED + YA Y+ G+Q + N LK+++
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAIYAYEYITGIQGPDPDSN--------LKLAATA 229
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++NW R D +T+QD+ E + F + ++ SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPA 289
Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CAD L +R + HGY+ +DCD+ + + H + + S+ A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
G Y ++ G + DI+K + LYT L++ G+FD + Y L D+
Sbjct: 349 GTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLE 408
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
+ ++ +AA +GIVLLKN N LPL TVA++GP ANAT ++GNY G
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468
Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
+SP A F +GY NV + G ++ S + AA AA++AD I G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-NVNFAEGT-GISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEA 526
Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+LDRE + PG Q LI ++A A P+I++ M G VD + + NTN+ A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYP 586
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG A+ D++ GK NP GRL T Y Y + P T M LRP PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
+Y FG+GL YT F + S T T ++ LN +Q + + AS T+ P
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTREIKLN-IQDILSQTHEDLASITQLP--------- 693
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIK 748
F + +N G + +V++ A Y +K ++G+ R+ V+ G + ++
Sbjct: 694 --VLNFTANIKNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 750
>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
Length = 802
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 267/731 (36%), Positives = 388/731 (53%), Gaps = 38/731 (5%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV--SNVGPG 112
CD SL R LVS T DE V G+ GV RLGLP Y+ W EALHGV +N
Sbjct: 60 CDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVGRANFVES 119
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
+F ATSFP I A+ N++L +IG VST+ RA N G G+ +SPNIN
Sbjct: 120 GNFS----WATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGVDVYSPNINTF 175
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
R P WGR ETPGED F+ Y Y+ LQ E + K+ + KHYA YD
Sbjct: 176 RHPVWGRGQETPGEDAFLTSVYGYEYITALQGAVDPETS--------KIIATAKHYAGYD 227
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+++W R D ++T+Q++ E + PF + ++ SVMCSYN VNG+PSCA+ L
Sbjct: 228 IESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFL 287
Query: 293 NQTVRG--EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
+R E+ GY+ DC ++ + + H + A ++ A A ++ AG D+DCG Y
Sbjct: 288 QTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGY-ASNEAAASADSILAGTDIDCGTSYQWH 346
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAA 409
+ +A + V +DI++ + LY+ L++ G+FDG Y + D+ S + +A EAA
Sbjct: 347 SEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDAPYRDITWDDVLSTDAWNIAYEAA 406
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA- 468
EGIVLLKND+ TLPL S +K+VAV+GP AN T + GNY G +SP+ GF
Sbjct: 407 VEGIVLLKNDE-TLPL-SKDIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSGL 464
Query: 469 NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQ 528
+V Y G + + S + A AAK ADA I G+D ++EAE++DRE++ PG Q
Sbjct: 465 DVHYALGTN-LTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLD 523
Query: 529 LINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFN 588
LI++++E+ K P++++ M G VD + + N N+ A++W GYPG+ GG A+AD++ GK
Sbjct: 524 LISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRA 582
Query: 589 PGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKY 648
P GRL T Y +Y ++ P M LRP ++ G PG+TY +Y G +Y FG+GL YT F+
Sbjct: 583 PAGRLVTTQYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFGHGLFYTTFE- 641
Query: 649 NLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTD 708
T N+ + + Y KT L+N F +N G +
Sbjct: 642 ESTETTDAGSFNIQTVLTTPHSGYEHAQQKT-----LLN---------FTATVKNTGERE 687
Query: 709 GSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL 768
+VY A A K V+GF R+ + + V +S+ D N +L
Sbjct: 688 SDYTALVYVNTTAGPAPYPKKWVVGFDRLGGLEPGDSQTLTVPVTVESVARTDEQGNRVL 747
Query: 769 PAGEHTIFVGN 779
G + + + N
Sbjct: 748 YPGSYDVALNN 758
>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
DSM 8532]
Length = 715
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 265/755 (35%), Positives = 388/755 (51%), Gaps = 98/755 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ D S + R KDLVSRMT++EKV Q+ + + RLG+P Y WW+EALHGV+ G
Sbjct: 6 VYLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT- 64
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTY 164
AT FP I A+F+E L K+ +STE RA Y+ GLT+
Sbjct: 65 ---------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTF 115
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPNIN+ RDPRWGR ET GEDP++ R V +V+GLQ + + LK ++C
Sbjct: 116 WSPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGN---------HPKYLKAAAC 166
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + R+ F+A V+++D+ ET+L F+ V+E SVM +YNR NG P
Sbjct: 167 AKHFA---VHSGPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEP 223
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
C LL+ +RGEW G++V+DC +I+ +H A + E A A ++ G DL+CG
Sbjct: 224 CCGSKTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESA-ALAVRNGCDLNCG 282
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENI 402
+ N A+++G + E +ID+++ L M+LG FD Q Y S+ + E+
Sbjct: 283 NMFGNLL-IALKEGLITEEEIDRAVTRLMITRMKLGMFDPEDQVPYASISYDFVDCKEHR 341
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
ELA + A++ IVLLKND LPL+ K++++AV+GP+A++ A+IGNY G Y++ +
Sbjct: 342 ELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLD 400
Query: 463 GFSGYA----NVTYKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDLSVEA 512
G A + Y GC + N I A A+ AD I+ GLD ++E
Sbjct: 401 GIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEG 460
Query: 513 ESL---------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
E + D+ DL LPG Q +L+ V K P++LV+++ + + +A + +I
Sbjct: 461 EEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWA--DEHIP 517
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AIL A YPG GGRAIA V+FG+ NP G+LP+T+Y T+ L
Sbjct: 518 AILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYR---------TTEELPDFTDYSMEN 568
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
RTY+F LYPFG+GLSYT F Y+ L +K
Sbjct: 569 RTYRFMKNEALYPFGFGLSYTTFDYSDLKLSK---------------------------- 600
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
+ +R + F V N G G +VV VY K Q+ G +RV + +G
Sbjct: 601 ---DTIRAGEGFNVSVKVTNTGKMAGEEVVQVYIKDLEASWRVPNWQLSGMKRVRLESGE 657
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
I F + L +V +++ GE I+VG
Sbjct: 658 TAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691
>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
papyrosolvens DSM 2782]
Length = 712
Score = 420 bits (1080), Expect = e-114, Method: Compositional matrix adjust.
Identities = 266/754 (35%), Positives = 388/754 (51%), Gaps = 100/754 (13%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D SL + R DLVS+MTL+EK QL A V RLG+P+Y WW+EALHGV+ G
Sbjct: 6 YLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
AT FP I A F++ +KI ++TE RA YN G+T+W
Sbjct: 64 --------ATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIYKGITFW 115
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPN+N+ RDPRWGR ET GEDP++ R V +V+GLQ + + LK ++C
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG----------DGKYLKTAACA 165
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA V + DR+ FDA V+++D+ ET+L FE VKE S+M +YNR NG P
Sbjct: 166 KHYA---VHSGPEDDRHFFDAIVSQKDLYETYLPAFEALVKEAKVESIMGAYNRTNGEPC 222
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
LL +R W G++V+DC +I+ + H + + ++VA LK+G DL+CG
Sbjct: 223 NGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKSGCDLNCGN 281
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y A+++G + E DID++ L T M+LG FD ++ ++ + S E+ +++
Sbjct: 282 MYL-LILLALKEGLITEEDIDRAAIRLMTTRMKLGMFDDDCEFDNIPYELNDSAEHNKIS 340
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF- 464
EAA++ +VLLKND LPL+S K+K VAV+GP+A++++A+ NY+G P + ++ I G
Sbjct: 341 LEAAKKSMVLLKND-GLLPLDSKKIKNVAVIGPNADSSLALRANYSGTPSQNVTIIEGIR 399
Query: 465 ---SGYANVTYKTGC------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
S V Y G D+ + ++ + A AA+ +D ++ GLD SVE E
Sbjct: 400 KRVSENTRVWYAMGSHLFLNRDEDLAQPDDRLKEAVSAAERSDVVVLCLGLDASVEGEQN 459
Query: 516 -----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
D+ DL LP Q L+N V K P I+ ++S + I A A
Sbjct: 460 DQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAADKA--AA 516
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
I+ YPG GG A A+++FG ++P GRLP+T+Y ++ L P R
Sbjct: 517 IVQCWYPGAIGGLAFAEMIFGDYSPAGRLPVTFYK---------STEELPPFADYSMENR 567
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TYKF G LYPFG+GLSYT F+Y S CP
Sbjct: 568 TYKFMKGDALYPFGFGLSYTSFEY----------------------------SNMVCPQT 599
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
+ N + VD QN GS D +VV VY K + GF+R+ +++G
Sbjct: 600 VNN----GENLSVSVDVQNTGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHLKSGEK 655
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
K + F A +++IVD A + GE T++ G
Sbjct: 656 KTVTFEV-ASNAMSIVDEAGKRHIENGEFTLYAG 688
>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
Length = 782
Score = 420 bits (1080), Expect = e-114, Method: Compositional matrix adjust.
Identities = 232/615 (37%), Positives = 350/615 (56%), Gaps = 26/615 (4%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
CD ++P+ R LV++ T +E + ++A GVPRLG+P Y+WW+EALHGV+ PG
Sbjct: 36 ICDPTIPFYTRATSLVNQFTTEELLNNTINYAPGVPRLGIPNYQWWTEALHGVAK-SPGV 94
Query: 114 HFDDVIP-----GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP- 167
+FD P AT FP I A+F++ L+++I +++E RA N G+AGL +SP
Sbjct: 95 NFDLSDPHAEFTSATQFPQTINLGATFDDDLYQQIASVIASEVRAYNNAGKAGLNLYSPL 154
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
NIN RDPRWGR ET GEDP + R+AV+ V GLQ G + L V++ CKH
Sbjct: 155 NINCFRDPRWGRGQETVGEDPLHMSRFAVSIVHGLQ---GPHAQNEAEGNKLTVAATCKH 211
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
+ AYD++ + +RY FDA V++QD+ + L F CV++G A+++M SYN VN +P A
Sbjct: 212 FLAYDLEQYDRGERYQFDAIVSKQDLSDFHLPQFRACVRDGGATTLMTSYNAVNNVPPSA 271
Query: 288 DPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
L R W L H Y+ +DCD++ + D H++ A + +A A+++ AG DLDCG
Sbjct: 272 SKYYLQTLARQAWGLDKTHNYVTSDCDAVANVYDGHRY-AQNYVEAAAKSINAGTDLDCG 330
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
Y+ G A++Q I +++ +Y L+RLG+FD S L +D+ S +
Sbjct: 331 ATYSENLGAALKQKLTDIATIRRAVIRMYASLVRLGYFDDPASQPLRQLTWKDVNSPSSQ 390
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA +A I LLKN +TLP+ K +A++GP+ N + + GNYAG M+ +
Sbjct: 391 RLAYTSALSSITLLKNLDSTLPIKQKPTK-IAIIGPYTNVSTSFSGNYAGPAAFNMTMVH 449
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRED 519
S A + + G D + A + AD+ + G+D S+E ES DR+D
Sbjct: 450 AASQVFPDAKIVWVNGTDISGPYIPSDAQDAVKLTSDADSVVFAGGIDASIERESHDRKD 509
Query: 520 LWLPGYQTQLINQVAEV----AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
+ P Q +LI+++++ K +++V G +D A +++ + A++WAGYPG+
Sbjct: 510 IAWPPNQLRLIHELSQSRKKDKKSKLVVVQFGGGQLDGASLKSDDAVGALVWAGYPGQSA 569
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
A+ D++ GK P GRLP+T Y Y+ LP ++M LRP GYPGRTYK+Y G Y
Sbjct: 570 SLAVWDILAGKAVPAGRLPVTQYPASYIDGLPESAMSLRP--KAGYPGRTYKWYKGVPTY 627
Query: 636 PFGYGLSYTQFKYNL 650
PFG+GL YT F +L
Sbjct: 628 PFGHGLHYTTFSASL 642
>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
DSM 17108]
Length = 724
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 259/759 (34%), Positives = 394/759 (51%), Gaps = 97/759 (12%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
M F + D +L + R KDLVSRMT++EKV Q+ + + RLG+P Y WWSEALHGV+
Sbjct: 1 MEIFDYQDETLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVAR 60
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------A 160
G AT FP I A+F+E L + + +S EARA ++ +
Sbjct: 61 AGV----------ATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHGIYK 110
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
GLT+WSPN+N+ RDPRWGR ET GEDP++ GR V++++GLQ + + L+
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQ---------DKKYLR 161
Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
++C KH+A V + +R+ FDA V+ +D+ ET+L F+ CVKE + +VM +YNRV
Sbjct: 162 AAACAKHFA---VHSGPESERHRFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRV 218
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
NG P C LL +T+R EW G++V+DC +I+ +NH+ + S ++VA L G D
Sbjct: 219 NGEPCCGSNILLKETLRQEWGFTGHVVSDCWAIKDFHENHR-VTSSAPESVALALNNGCD 277
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICS 398
L+CG Y N A Q+G V E I+ ++ L M+LG FD + Y ++G
Sbjct: 278 LNCGNMYLNLL-IAYQEGLVTEEAINTAVTRLMLTRMKLGLFDAAENVPYTNIGFHQNDC 336
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
E+ E A E +++ +VLLKN+ + LPL+ + ++AV+GP+AN+ A+ GNY G Y+
Sbjct: 337 QEHREFALEVSKKTLVLLKNENHLLPLDRNTISSIAVIGPNANSREALTGNYFGTASNYI 396
Query: 459 SPIAGFSGYAN----VTYKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDL 508
+ + G V+Y GC K+ N A A+ AD ++ GLD
Sbjct: 397 TVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEERDRFAEAVSTAERADLVVMCMGLDA 456
Query: 509 SVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
S+E E S D+ L LPG Q +L+ + + K P+ILV+++ + + +A
Sbjct: 457 SIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYKTGK-PIILVLLAGSALAVTWAA-- 513
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
+ AI+ A YPG EGG+A+A +FG+++P G+LPIT+Y T+ L
Sbjct: 514 EKVPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYR---------TTEELPEFTDY 564
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
RTY++ LYPFGYGL YT F Y Q+ LN+ + C N
Sbjct: 565 SMKNRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTKICAGEN-------V 609
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+C +LV +N G+ + V +Y K I + G Q++ +
Sbjct: 610 QC-SILV---------------KNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKIHL 653
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
G + I F + + L +++ N +L G I+VG
Sbjct: 654 LPGAEQEISFTLTS-RQLALINEKGNCILEPGIFEIYVG 691
>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
1015]
Length = 804
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 268/719 (37%), Positives = 388/719 (53%), Gaps = 50/719 (6%)
Query: 49 MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ S L CD ++ PY R L+S TLDE + G+ GV RLGLP Y+ WSEALHG+
Sbjct: 63 LRSHLICDETATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121
Query: 108 NVGPGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
+F D ATSFP ILTTA+ N +L +I +ST+ RA N GR GL +
Sbjct: 122 RA----NFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN R P WGR ETPGED + YA Y+ G+Q + N LK+++
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN--------LKLAATA 229
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++NW R D +T+QD+ E + F + ++ SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPA 289
Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CAD L +R + HGY+ +DCD+ + + H + + S+ A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
G Y ++ G + DI++ + LYT L++ G+FD + Y L D+
Sbjct: 349 GTTYQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLE 408
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
+ ++ +AA +GIVLLKN N LPL TVA++GP ANAT ++GNY G
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468
Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
+SP A F +GY V + G ++ S + AA AA++AD I G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEA 526
Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+LDRE + PG Q LI ++A A K P+I++ M G VD + + NTN+ A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYP 586
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG A+ D++ GK NP GRL T Y Y + P T M LRP PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
+Y FG+GL YT F + S T T +V LN +Q + + AS T+ P
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSQTHEDLASITQLP--------- 693
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ-VIGFQRV-FVRAGRNKRIK 748
F + +N G + +V++ A Y K+ ++G+ R+ V+ G + ++
Sbjct: 694 --VLNFTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELR 750
>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
Length = 810
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 268/719 (37%), Positives = 388/719 (53%), Gaps = 50/719 (6%)
Query: 49 MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ S L CD ++ PY R L+S TLDE + G+ GV RLGLP Y+ WSEALHG+
Sbjct: 63 LRSHLICDETATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121
Query: 108 NVGPGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
+F D ATSFP ILTTA+ N +L +I +ST+ RA N GR GL +
Sbjct: 122 RA----NFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN R P WGR ETPGED + YA Y+ G+Q + N LK+++
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN--------LKLAATA 229
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++NW R D +T+QD+ E + F + ++ SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPA 289
Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CAD L +R + HGY+ +DCD+ + + H + + S+ A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
G Y ++ G + DI++ + LYT L++ G+FD + Y L D+
Sbjct: 349 GTTYQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLE 408
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
+ ++ +AA +GIVLLKN N LPL TVA++GP ANAT ++GNY G
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468
Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
+SP A F +GY V + G ++ S + AA AA++AD I G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEA 526
Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+LDRE + PG Q LI ++A A K P+I++ M G VD + + NTN+ A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYP 586
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG A+ D++ GK NP GRL T Y Y + P T M LRP PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
+Y FG+GL YT F + S T T +V LN +Q + + AS T+ P
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSQTHEDLASITQLP--------- 693
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ-VIGFQRV-FVRAGRNKRIK 748
F + +N G + +V++ A Y K+ ++G+ R+ V+ G + ++
Sbjct: 694 --VLNFTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELR 750
>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 711
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 266/758 (35%), Positives = 380/758 (50%), Gaps = 90/758 (11%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
F + Q + +F + P RV DL+ ++TL EK+ LG + V RLG+P Y WW+E
Sbjct: 5 FIVINTQAQTSVFRNPQQPMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYNWWNE 64
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA- 160
ALHGV+ G AT FP I A+FN+ L K+ +STEARA YNL A
Sbjct: 65 ALHGVARAGV----------ATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNLSLAQ 114
Query: 161 -------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
GLT+WSPNIN+ RDPRWGR ET GEDPF+ +V+GLQ +
Sbjct: 115 GRHLQYMGLTFWSPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQGND------- 167
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
R LK S+C KH+A V + R+ F+A V E+D+ ET+L F V G SV
Sbjct: 168 --PRYLKASACAKHFA---VHSGPENGRHTFNAIVDEKDLRETYLYAFHALVDAG-VESV 221
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MC+YNRVN P C+ LLN +R EW G++V DC ++ + HK + E A A
Sbjct: 222 MCAYNRVNDQPCCSGNFLLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGVEVAAA- 280
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVS 390
+KAG++LDC AV+Q + E DID SL +L ++LGF+D +P Y
Sbjct: 281 AIKAGVNLDCSNVLQKDVEKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYDDPTANPFY-K 339
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
G + + + LA A++ +VLLKN LPL+ K + VVG ++ + A++GNY
Sbjct: 340 YGADSVANTAHATLARAMAQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDALLGNY 399
Query: 451 AGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL---- 506
G+ R +S + G + + + D + ++ + F AA AD T+ + GL
Sbjct: 400 HGVSNRAVSFVEGITNAVDAGTRVEYDQGSDYNDTTHFGGIWAAGNADITVAVIGLTPVY 459
Query: 507 -----DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
D + A+ D+ D+ LP + + + K P+I VI + VDI+ E +
Sbjct: 460 EGEEGDAFLAAKGGDKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVDISAIEPYAD 519
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
AIL A YPGE+GG A+AD++FGK +P GRLP+T+Y +P D+
Sbjct: 520 --AILLAWYPGEQGGNALADILFGKVSPAGRLPVTFYQS-------FADVP--AYDNYAM 568
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
GRTY+++NG YPFGYGLSYT F Y I+
Sbjct: 569 KGRTYRYFNGKVQYPFGYGLSYTSFAYEWQQMPANIRT---------------------- 606
Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRA 741
D F + +N GS DG +VV VY + PA + +K++ F+RV V+A
Sbjct: 607 ---------AKDSVSFSIKVKNTGSMDGDEVVQVYVEYPA-VERMPLKELKAFKRVHVKA 656
Query: 742 GRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVG 778
G + ++ A L D A ++ L G + IF G
Sbjct: 657 GGEETVQLTIPAS-DLQKWDLATSSWKLYPGSYNIFAG 693
>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
Length = 805
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 265/749 (35%), Positives = 385/749 (51%), Gaps = 46/749 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S L CDS+ R LVS TL+E + G+ + GVPRLGLP Y+ WSEALHG+
Sbjct: 54 LSKNLVCDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLGR 113
Query: 109 VGPGTHFDD---VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
+F D + G SFP+ IL+ A+FN +L +I +ST+ RA N GR GL +
Sbjct: 114 A----NFTDNGALHAGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVY 169
Query: 166 SPNINVARDPRWGRITETPGEDPFVV-GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
SPNIN R P WGR ETPGED + + YA Y+ G+Q +N LK+++
Sbjct: 170 SPNINTFRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQG--------GVNPEHLKLAAT 221
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A YD++NW R D +T+QD+ E + F + ++ S MCSYN VNG+P
Sbjct: 222 AKHFAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVP 281
Query: 285 SCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
SC++ L +R + HGY+ DC ++ + + H + A+ + A A + AG D+D
Sbjct: 282 SCSNTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAAN-EPSAAADAILAGTDID 340
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG----SPQYVSLGKQDICS 398
CG Y ++ G V DI++ LY L+ LG+FDG S Y SLG D+
Sbjct: 341 CGTSYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQK 400
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNS---AKVKTVAVVGPHANATVAMIGNYAGIPC 455
+ ++ EAA EGIVLLKND TLPL S K K++A++GP ANAT + GNY G
Sbjct: 401 TDAWNISYEAAVEGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAP 459
Query: 456 RYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
+SP+ F+ + +++ S + AA AA+ AD + L G+D ++EAE+
Sbjct: 460 YLISPVDAFTAAGYTVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQ 519
Query: 516 DREDLWLPGYQTQLINQVA--EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR + PG Q +LI+Q+A + P+++ M G VD + + N + A+LW GYPG+
Sbjct: 520 DRSSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSSLKFNAKVNALLWGGYPGQ 579
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG A+ D++ G P GRL T Y Y + M LRP ++ PG+TY +Y G
Sbjct: 580 SGGLALRDILTGARAPAGRLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEP 639
Query: 634 LYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
+Y FG+GL YT F + KT N+ L + + T+ +T
Sbjct: 640 VYAFGHGLFYTTFNASSAQAAKTKYTFNITDLTSAAHPDTTTVGQRT------------- 686
Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVFVRAGRNKRIKF-V 750
F F N G D +VY+ + Y K ++GF R+ A + V
Sbjct: 687 -LFNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNV 745
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
A L VD A NT+L G + + + N
Sbjct: 746 PVAVDRLARVDEAGNTVLFPGRYEVALNN 774
>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 767
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 271/735 (36%), Positives = 396/735 (53%), Gaps = 49/735 (6%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD +L R LV +T++EK+Q L A G PR+GLP Y WWSEALHGV+ PGT+
Sbjct: 43 CDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHGVA-YAPGTY 101
Query: 115 F---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
F D +TS+P +L A+F++ L ++IG A+ EARA N G AGL YW+PN+N
Sbjct: 102 FPEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWAGLDYWTPNVNP 161
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAA 230
+DPRWGR +ETPGED V RYA RGL V G + +V S CKHYA
Sbjct: 162 FKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGPVPGEQR---------RVISTCKHYAG 212
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
D ++W G R+ FDA++T QD+ E +L PF+ C ++ S+MC+YN VNG+PSCA+
Sbjct: 213 NDFEDWNGTSRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSIMCAYNAVNGVPSCANEY 272
Query: 291 LLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
LL +R W+ + Y+ +DC+++ + NHK+ A + A +AG+D C
Sbjct: 273 LLQNILREHWNWTEHNNYVTSDCEAVLDVSANHKY-APTNAAGTAICFEAGMDTSCEYTG 331
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAA 406
++ A QG +KE +D++L LY L+R G+FDG Y LG +D+ S E LA
Sbjct: 332 SSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGHEAIYAKLGWKDVNSAEAQSLAL 391
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AGFS 465
+AA EGIVLLKN+ TLPL+ VA++G A+A + G Y+G +P A
Sbjct: 392 QAAVEGIVLLKNN-GTLPLDLKPSHKVAMIGFWADAPDKLQGGYSGRAAHLHTPAYAARQ 450
Query: 466 GYANVTYKTG-CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
++T +G S+N AA EAA+ AD + GLD S E+LDR DL P
Sbjct: 451 LGLDITLASGPVLQRNNASDNWTAAALEAAEGADYILYFGGLDTSAAGETLDRTDLEWPE 510
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LI +++ + K P+++ ++ D + + + +ILWA +PG++GG AI ++
Sbjct: 511 AQLMLIKKLSALGK-PLVVNLLGDQLDDTPLLQLD-EVSSILWANWPGQDGGVAIMKLIT 568
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G+ +P GRLP+T Y +Y ++P+TSM LRP YPGRTY++Y+ P + FG+GL YT
Sbjct: 569 GEKSPAGRLPVTQYPSNYTDLIPMTSMDLRPTSQ--YPGRTYRWYDKP-IKRFGFGLHYT 625
Query: 645 QFKYNL-LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
FK + +F KT+++ L C N + + CP V N
Sbjct: 626 TFKAEVGGAFPKTLRI--ADLVGCGNEHPDT------CPAP-----------PLPVSITN 666
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDY 762
G+ V + Y IK + ++R+ V G + + + D
Sbjct: 667 TGNRTSDYVALAYLSGEYGPRPYPIKTLSAYKRLRDVAPGETATVDLAWT-LGDIARHDE 725
Query: 763 AANTLLPAGEHTIFV 777
NT+L GE+TI +
Sbjct: 726 QGNTVLYPGEYTITI 740
>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
Length = 795
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 270/730 (36%), Positives = 397/730 (54%), Gaps = 38/730 (5%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L CDSS Y+ R + L+S TL+E + + GVPRLGLP Y+ W+EALHG+
Sbjct: 61 LVCDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFA 120
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T ATSFP IL+ A+ N +L +I +ST+ARA N GR GL ++PNIN
Sbjct: 121 TK-GGQFQWATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNINGF 179
Query: 173 RDPRWGRITETPGEDPFVV-GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
R P WGR ETPGED V+ Y Y+ G+Q EN LK+++ KH+A Y
Sbjct: 180 RSPLWGRGQETPGEDANVLTSAYTYEYITGMQGGVDPEN--------LKIAATAKHFAGY 231
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
D++NW R FDA +T+QD+ E + F + + S MC+YN VNG+PSCA+
Sbjct: 232 DLENWNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCANSFF 291
Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
L +R W GY+ +DCD++ + + H + ++ A A +L+AG D+DCGQ Y
Sbjct: 292 LQTLLRESWGFPEWGYVSSDCDAVYNVWNPHDYASNQSS-AAASSLRAGTDIDCGQTYPW 350
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAA 409
+ G+V +I++S+ LY L+RLG+FD +Y SLG +D+ + ++ EAA
Sbjct: 351 HLNESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLGWKDVVKTDAWNISYEAA 410
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGFSGY 467
EGIVLLKND TLPL S KV+++A++GP ANAT M GNY G +SP+ A +GY
Sbjct: 411 VEGIVLLKND-GTLPL-SKKVRSIALIGPWANATTQMQGNYFGAAPYLISPLEAAKKAGY 468
Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
V ++ G + A S A AAK +DA I G+D +VE E DR D+ PG Q
Sbjct: 469 -QVNFELGT-ETASTSTAGFAKAIAAAKKSDAIIFAGGIDNTVEQEGADRTDIAWPGNQL 526
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
LI Q++E+ K P++++ M G VD + ++N + +++W GYPG+ GG A+ D++ GK
Sbjct: 527 DLIKQLSELGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKR 585
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
P GRL T Y DYV P M LRP D PG+TY +Y G +Y FG G+ YT FK
Sbjct: 586 APAGRLVSTQYPADYVHQFPQNDMNLRP-DGKSNPGQTYIWYTGKPVYQFGDGIFYTTFK 644
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
L +K ++ N++ + + YT + P F + +N G T
Sbjct: 645 ETLSGSSKGLKFNVSSVLAAPHPGYT---YSEQTP-----------VLTFTANIENSGKT 690
Query: 708 DGSDVVIVYSKPPAEIAATYI-KQVIGFQRV-FVRAGRNKRIKFVFNACKSLNIVDYAAN 765
D +++ + A Y K ++GF R+ ++ G + ++ +L VD N
Sbjct: 691 DSPYSAMLFVRTANAGPAPYPNKWLVGFDRLATIKPGHSSKLSIPI-PVSALARVDSLGN 749
Query: 766 TLLPAGEHTI 775
++ G++ +
Sbjct: 750 RIVYPGKYEL 759
>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 269/719 (37%), Positives = 387/719 (53%), Gaps = 50/719 (6%)
Query: 49 MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ S L CD S+ PY R L+S TLDE + G+ GV RLGLP Y+ WSEALHG+
Sbjct: 63 LRSHLICDESATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121
Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
+F D ATSFP ILTTA+ N +L +I +ST+ RA N GR GL +
Sbjct: 122 RA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN R P WGR ETPGED + YA Y+ G+Q + N LK+++
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSN--------LKLAATA 229
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++NW R D +T+QD+ E + F + ++ SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPA 289
Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CAD L +R + HGY+ +DCD+ + + H + + S+ A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
G Y ++ G + DI+K + LYT L++ G+FD + Y L D+
Sbjct: 349 GTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLE 408
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
+ ++ +AA +GIVLLKN N LPL TVA++GP ANAT ++GNY G
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468
Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
+SP A F +GY V + G ++ S + AA AA++AD I G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAARSADVIIYAGGIDNTLEA 526
Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+LDRE + PG Q LI ++A A P+I++ M G VD + + NTN+ A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYP 586
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG A+ D++ GK NP GRL T Y Y + P T M LRP PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
+Y FG+GL YT F + S T T +V LN +Q + + AS T+ P
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSQTHEELASITQLP--------- 693
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIK 748
F + +N G + +V++ A Y +K ++G+ R+ V+ G + ++
Sbjct: 694 --VLNFTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 750
>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
22836]
Length = 737
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 273/772 (35%), Positives = 400/772 (51%), Gaps = 99/772 (12%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
FS LG ++ F +++L RV DLVS++TL+EKV Q+ + + RL +P Y WW+E
Sbjct: 18 FSLLG---QNYPFQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNE 74
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA- 160
LHG+ T + T FP I A++N+ L K++ A+S E RA+YN +
Sbjct: 75 CLHGIGR----TDYK-----VTVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSK 125
Query: 161 -------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
GLTYW+PNIN+ RDPRWGR ET GEDPF+ G ++V GLQ +
Sbjct: 126 GNREIYYGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQGDD------- 178
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
++ LK ++C KHYA V + R+ F+ VT+ D+ +T+L F V E + V
Sbjct: 179 --TKYLKAAACAKHYA---VHSGPENTRHTFNTFVTDYDLWDTYLPAFRNLVVEAKVAGV 233
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MC+YN NG P C + L+ + +R +W+ GY+ +DC +I +HK D+K A A
Sbjct: 234 MCAYNAYNGEPCCGNNFLMQEILREKWNFTGYVTSDCGAIDDFYQHHKTHPDAKY-AAAD 292
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSL 391
+ G D+DCG +AV+ G + E ID SLK L+T+ RLG FD + +Y +
Sbjct: 293 AVYNGTDIDCGNEAYKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMFDPAENVKYSQI 352
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S ++ +LA + RE IVLLKN+ NTLPL S K+K VAVVGP+AN V+++GNY
Sbjct: 353 STSVLESQKHKDLALKITRESIVLLKNENNTLPL-SKKLKKVAVVGPNANNEVSVLGNYN 411
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNN--SIFAASEAAKTADATIILAGL 506
G P ++P A V Y+ G D V +N+ + A + K D I + G+
Sbjct: 412 GFPTEIVTPYEAVKQKLKGAEVIYEKGIDFVTPSTNSKEEVSALVKRLKDVDVVIFVGGI 471
Query: 507 DLSVEAESL----------DREDLWLPGYQTQLINQ-VAEVAKGPVILVIMSAGGVDIAF 555
+E E + DR + LP QT + VAE K P + V+M+ G IA
Sbjct: 472 SPELEGEEMPVKIEGFTGGDRTSIKLPKIQTDFMKALVAE--KIPTVFVMMT--GSAIAT 527
Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
+ NI AI+ A Y G++ G AIADV+FG +NP G+LP+T+Y D + +P
Sbjct: 528 EWESQNIPAIVNAWYGGQDAGTAIADVLFGDYNPSGKLPVTFYAKD-------SDLP--A 578
Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
+S RTY+++NG LYPFGYGLSYT+F+Y+ + TI N
Sbjct: 579 FNSYEMKNRTYRYFNGEVLYPFGYGLSYTKFEYSPIQVPSTIDTGNNA------------ 626
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
+ V +N G +G +VV +Y P + + GF
Sbjct: 627 --------------------KVSVSIKNTGKVEGEEVVQLYISYPDTKGQKPLYALKGFN 666
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIH 787
RV ++AG +K ++F + + L +VD A + AG+ IF+G G P H
Sbjct: 667 RVSLKAGESKTVEFNLSP-RELGLVDDAGILKVSAGKRKIFIG-GSSPTPTH 716
>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
Length = 804
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 267/719 (37%), Positives = 387/719 (53%), Gaps = 50/719 (6%)
Query: 49 MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ S L CD ++ PY R L+S TLDE + G+ GV RLGLP Y+ WSEALHG+
Sbjct: 63 LRSHLICDETATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121
Query: 108 NVGPGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
+F D ATSFP ILTTA+ N +L +I +ST+ RA N GR GL +
Sbjct: 122 RA----NFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN R P WGR ETPGED + YA Y+ G+Q + N LK+++
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN--------LKLAATA 229
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++NW R D +T+QD+ E + F + ++ SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPA 289
Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CAD L +R + HGY+ +DCD+ + + H + + S+ A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
G Y ++ G + DI++ + LYT L++ G+FD + Y L D+
Sbjct: 349 GTTYQWHLNESITAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLE 408
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
+ ++ +AA +GIVLLKN N LPL TVA++GP ANAT ++GNY G
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468
Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
+SP A F +GY V + G ++ S + AA AA++AD I G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEA 526
Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+LDRE + PG Q LI ++A A K P+I++ M G VD + + NT + A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTKVSALLWGGYP 586
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG A+ D++ GK NP GRL T Y Y + P T M LRP PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
+Y FG+GL YT F + S T T +V LN +Q + + AS T+ P
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSRTHEELASITQLP--------- 693
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ-VIGFQRV-FVRAGRNKRIK 748
F + +N G + +V++ A Y K+ ++G+ R+ V+ G + ++
Sbjct: 694 --VLNFTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELR 750
>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
206040]
Length = 794
Score = 417 bits (1071), Expect = e-113, Method: Compositional matrix adjust.
Identities = 267/732 (36%), Positives = 396/732 (54%), Gaps = 39/732 (5%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L CDS+ Y R + L+S TL+E + + GVPRLGLP Y+ W+EALHG+
Sbjct: 62 LVCDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFA 121
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T + G TSFP IL+ A+ N +L +I +ST+ARA N GR GL ++PNIN
Sbjct: 122 TKGGEFEWG-TSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGRYGLDVYAPNINGF 180
Query: 173 RDPRWGRITETPGEDPFVV-GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
R P WGR ETPGED V+ Y Y+ G+Q EN LK+++ KH+A Y
Sbjct: 181 RSPLWGRGQETPGEDANVLTSAYTYEYITGMQGGVDPEN--------LKIAATAKHFAGY 232
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
D++N+ R FDA +T+QD+ E + F + + S MC+YN VNG+PSC++
Sbjct: 233 DLENYNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCSNSFF 292
Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
L +R W +GY+ +DCD+I + + H + A+S+ A A +LKAG D+DCGQ Y
Sbjct: 293 LQTLLRESWGFPEYGYVSSDCDAIYNVWNPHNY-ANSQSSAAADSLKAGTDIDCGQTYPW 351
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAA 409
+ G V +I++S+ LY L+RLG+FD +Y SLG +D+ + ++ EAA
Sbjct: 352 HLNESFVAGTVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLGWKDVVKTDAWNISYEAA 411
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGFSGY 467
EGIVLLKND TLPL S KV+++A++GP NAT + GNY G +SP+ A +GY
Sbjct: 412 VEGIVLLKND-GTLPL-SKKVRSIALIGPWVNATEQLQGNYFGTAPYLISPLQAAKKAGY 469
Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
V Y+ G + ++ A AAK +DA I + G+D ++E E DR D+ PG Q
Sbjct: 470 -EVNYELGT-GINNQTTAGFAKAIAAAKKSDAIIFIGGIDNTIEQEGADRTDIAWPGNQL 527
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
LI Q++EV K P++++ M G VD + ++N + +++W GYPG+ GG A+ D++ GK
Sbjct: 528 DLIKQLSEVGK-PLVVLQMGGGQVDSSSIKSNKKVNSLVWGGYPGQSGGYALFDILSGKR 586
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
P GRL T Y +YV M LRP D PG+TY +Y G +Y FG GL YT FK
Sbjct: 587 APAGRLVSTQYPAEYVHQFAQNDMNLRP-DGKKNPGQTYIWYTGKPVYQFGDGLFYTTFK 645
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
L T++ N +++ + YT + P F F + QN G T
Sbjct: 646 -ETLGKQSTLKFNASQILGAGHPGYTYSE---QTP-----------VFTFTANIQNSGKT 690
Query: 708 DGSDVVIVYSKPPAEIAATYI-KQVIGFQRV-FVRAGRNKRIKFVFNACKSLNIVDYAAN 765
+ + + Y K ++GF R+ ++ G + + +L+ VD N
Sbjct: 691 ASPYSAMAFVRTSNAGPKPYPNKWLVGFDRLATIKPGHSSTLSIPI-PLNALSRVDSNGN 749
Query: 766 TLLPAGEHTIFV 777
++ G++ + +
Sbjct: 750 KIVYPGKYELVL 761
>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
Length = 778
Score = 417 bits (1071), Expect = e-113, Method: Compositional matrix adjust.
Identities = 268/719 (37%), Positives = 387/719 (53%), Gaps = 50/719 (6%)
Query: 49 MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ S L CD S+ PY R L+S TLDE + G+ GV RLGLP Y+ WSEALHG+
Sbjct: 37 LRSHLICDESATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 95
Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
+F D ATSFP ILTTA+ N +L +I +ST+ RA N GR GL +
Sbjct: 96 RA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 151
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN R P WGR ETPGED + YA Y+ G+Q + N LK+++
Sbjct: 152 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSN--------LKLAATA 203
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++NW R D +T+QD+ E + F + ++ SVMC+YN V+G+P+
Sbjct: 204 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPA 263
Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CAD L +R + HGY+ +DCD+ + + H + + S+ A A+ + AG D+DC
Sbjct: 264 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 322
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
G Y ++ G + DI+K + LYT L++ G+FD + Y L D+
Sbjct: 323 GTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLE 382
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
+ ++ +AA +GIVLLKN N LPL TVA++GP ANAT ++GNY G
Sbjct: 383 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 442
Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
+SP A F +GY V + G ++ S + AA AA++AD I G+D ++EA
Sbjct: 443 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAARSADVIIYAGGIDNTLEA 500
Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+LDRE + PG Q LI ++A A P+I++ M G VD + + NTN+ A+LW GYP
Sbjct: 501 EALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYP 560
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG A+ D++ GK NP GRL T Y Y + P T M LRP PG+TYK+Y G
Sbjct: 561 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 618
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
+Y FG+GL YT F + S T T +V LN +Q + + AS T+ P
Sbjct: 619 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSQTHEELASITQLP--------- 667
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIK 748
F + +N G + +V++ A Y +K ++G+ R+ V+ G + ++
Sbjct: 668 --VLNFTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 724
>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
Length = 804
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 268/719 (37%), Positives = 387/719 (53%), Gaps = 50/719 (6%)
Query: 49 MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ S L CD S+ PY R L+S TLDE + G+ GV RLGLP Y+ WSEALHG+
Sbjct: 63 LRSHLICDESATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121
Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
+F D ATSFP ILTTA+ N +L +I +ST+ RA N GR GL +
Sbjct: 122 RA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN R P WGR ETPGED + YA Y+ G+Q + N LK+++
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSN--------LKLAATA 229
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++NW R D +T+QD+ E + F + ++ SVMC+YN V+G+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPA 289
Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CAD L +R + HGY+ +DCD+ + + H + + S+ A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
G Y ++ G + DI+K + LYT L++ G+FD + Y L D+
Sbjct: 349 GTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLE 408
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
+ ++ +AA +GIVLLKN N LPL TVA++GP ANAT ++GNY G
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468
Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
+SP A F +GY V + G ++ S + AA AA++AD I G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAARSADVIIYAGGIDNTLEA 526
Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+LDRE + PG Q LI ++A A P+I++ M G VD + + NTN+ A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYP 586
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+ GG A+ D++ GK NP GRL T Y Y + P T M LRP PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
+Y FG+GL YT F + S T T +V LN +Q + + AS T+ P
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSQTHEELASITQLP--------- 693
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIK 748
F + +N G + +V++ A Y +K ++G+ R+ V+ G + ++
Sbjct: 694 --VLNFTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 750
>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
6192]
gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
Length = 693
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 273/736 (37%), Positives = 392/736 (53%), Gaps = 92/736 (12%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R+ L+SRM+++EK + A GVPRLG+P Y WW+EALHGV+N G AT
Sbjct: 6 RMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSGE----------AT 55
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGRA-------GLTYWSPNINVARDP 175
FP I A+F+ L +++ A+S EARA +N +G+ GLT+WSPNIN+ RDP
Sbjct: 56 VFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPNINIYRDP 115
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDPF+ + V +V+GLQ + L+V++C KHYA +
Sbjct: 116 RWGRGQETYGEDPFLTSKIGVAFVKGLQGDHPYY---------LRVAACAKHYAVH--SG 164
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
+G+ R+ FDARV+E+D+ ET+L FE VK G +VM +YNRVNG P+C +LL +
Sbjct: 165 PEGL-RHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLEEI 222
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+R +W G++V+DC +I +HK D E ++A L+AG DL+CG Y + +AV
Sbjct: 223 LRKKWGFKGHVVSDCWAIADFHLHHKVTKDPIE-SIAMALEAGCDLNCGNTYEHLL-DAV 280
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
+ G V E +D+S+ L + L RLG F YV L DI + + LA EAA + +VL
Sbjct: 281 KAGAVSEELVDRSVARLLSTLDRLGLFTDDHPYVRLSLADIDWEAHRALAREAAEKSVVL 340
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVT 471
LKN+ LPL+ K++ + V GP+A VA++GNYAG+ R ++ + G +GYA VT
Sbjct: 341 LKNN-GILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITVT 399
Query: 472 YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR---------EDLWL 522
YK GC + N I AS A+ AD T+ + G D +VE E D DL L
Sbjct: 400 YKIGC-PLQGNKINPIDWASGVARYADVTVAVMGRDSAVEGEEGDAIFSDNYGDLSDLNL 458
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
Q + ++ E+ K P+++V++S G + E AI++A YPGEEGG AIA V
Sbjct: 459 SREQIDYLRRIKEIGK-PLVVVLLS--GAPVCSPELEELADAIVYAWYPGEEGGNAIARV 515
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
+FG+ +P GRLPIT+ G V LP P GRTY++ LYPFG+GLS
Sbjct: 516 LFGEVSPSGRLPITFPKG--VDQLP-------PFTDYSMEGRTYRYMKEEPLYPFGFGLS 566
Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
Y F Y + S AS+ D R + E + +
Sbjct: 567 YATFSYR---------------------DPKSSASRW--------DKR--ETLEVVCEVE 595
Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
N S +VV +Y + + + GF RV + G +++FV + + L+ +D
Sbjct: 596 NTSSIPADEVVQLYVRWEDAPFRVPLWSLKGFTRVSLGTGERIQVRFVLSP-EDLSFIDE 654
Query: 763 AANTLLPAGEHTIFVG 778
+LP G VG
Sbjct: 655 KGRKVLPEGRLRFHVG 670
>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Glomerella graminicola M1.001]
Length = 824
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 270/755 (35%), Positives = 388/755 (51%), Gaps = 56/755 (7%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD +L R LV+ +T+ EK+ L + A GVPRL +P YEWWSE LHGV++ PGT
Sbjct: 65 CDETLSPKERAAALVAELTIWEKLDNLVNEAPGVPRLAIPPYEWWSEGLHGVAS-SPGTK 123
Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW------- 165
F ATSFP I+ ++F++ L K IG+ VS EARA N GR+GL +
Sbjct: 124 FAKSGNFSYATSFPQPIVLGSAFDDDLVKAIGEVVSKEARAFSNRGRSGLDLYVSSISRH 183
Query: 166 --------------SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
SPNIN +DPRWGR ETPGEDPF + Y + GL EG + +
Sbjct: 184 IEPEVRDDMLTEPESPNINAFKDPRWGRGQETPGEDPFHLQNYVAAMLTGL---EGGDPS 240
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
K+ + CKHYAA D +N+KGVDR FDA +T QD+ E +L PF+ C +
Sbjct: 241 K-------KLIATCKHYAANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCAVDKKVG 293
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKE 328
S MCSYN +NG P CA+P LL +R W +G Y+ DCD + +MV +H + D
Sbjct: 294 SFMCSYNAINGEPLCANPYLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHYAPDLGH 353
Query: 329 DAVAQTLKAGLDLDCGQYY-TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---G 384
A A +KAG DL+C + + A Q + E ++DKSL +YT L+ +G FD G
Sbjct: 354 -AAAWAMKAGTDLECNAFPGSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQFDSARG 412
Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANAT 443
P SL D+ + E +LA +A EG VLLKND LPL++A + K A++GP NAT
Sbjct: 413 QP-LRSLSWDDVNTKEAQKLAYQAVIEGAVLLKND-GILPLSAAWREKKYALIGPWINAT 470
Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
M GNY G P Y+ + + + + +++S A ++A A +
Sbjct: 471 TQMQGNYFG-PAPYLISLYQAAKEFGLDFTYSLGSRINSTDDSFKQALDSAHAAALIVFA 529
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
G+D ++EAE+ DR+ L P Q L+ V+ + K PVI++ G VD N +I
Sbjct: 530 GGVDNTLEAETRDRKTLAWPESQLDLLRAVSALGK-PVIVLQFGGGQVDDTELLANHSIN 588
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
A+LW GYPG+ GG+A+ D++FG+ P GRL +T Y Y + +P T M LRP G
Sbjct: 589 ALLWGGYPGQSGGKAVIDLLFGRAAPAGRLSVTQYPASYNEDVPSTDMNLRPGPGNSGLG 648
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
RTY +YNG + P+G+GL YT F L + + + ++ + +Y S G
Sbjct: 649 RTYMWYNGDAVVPYGFGLHYTTFDAKLKARQASALIKTEEVSSLLSNDYVS--------G 700
Query: 684 VLV-NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
LV + + N G+ V +++ + A K + G+ R
Sbjct: 701 TLVWQQILTKPVVSVLITVSNTGNVASDYVALLFLRSNAGPTPQPTKTLAGYHRFRNIQP 760
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
++ + V + L VD N +L G + +FV
Sbjct: 761 GDRSEREVSITIERLVRVDELGNRVLHPGSYELFV 795
>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
Length = 717
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 262/761 (34%), Positives = 399/761 (52%), Gaps = 106/761 (13%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
M+ + D + + R + LV MTL+EKV Q A + RLG+P Y +W+EALHGV+
Sbjct: 1 MTDKAWLDETKTFEERAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVAR 60
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-------- 160
G AT FP I ASF+E L ++ +STEARA +N+ +
Sbjct: 61 AGV----------ATVFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFGDRDIYK 110
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
GLT+WSPN+N+ RDPRWGR ET GEDPF+ GR V+++RG+Q + R +K
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGHETFGEDPFLSGRLGVSFIRGMQGD---------DERYMK 161
Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
V++C KH+A V + R+ F+A V+EQD+ ET+L F CV E +VM +YNR
Sbjct: 162 VAACAKHFA---VHSGPEDQRHSFNAVVSEQDLRETYLPAFHACVTEAGVEAVMGAYNRT 218
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF--LADSKEDAVAQTLKAG 338
NG C KLL +RGEW G++ +DC +++ D H+F + ++E+ VA + +G
Sbjct: 219 NGEACCGSKKLLVDILRGEWGFRGHVTSDCWALK---DFHEFHMVTKNQEETVALAMNSG 275
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDI 396
DL+CG Y + AV+ G V+E+ ID+++ L+T M+LG FD S + Y +G +
Sbjct: 276 CDLNCGNLYVHLL-QAVRDGLVEESVIDRAVTRLFTTRMKLGLFDRSEEVPYNGIGYDRV 334
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
++ N +L EA+R + LLKN LPL+ +K++T+ VVGP+A+ A++GNY G
Sbjct: 335 DTEANRKLNREASRRTVCLLKNADGLLPLDISKLRTIGVVGPNADNRKALVGNYEGTASE 394
Query: 457 YMSPIAGFSGYA----NVTYKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGL 506
Y++ + G A V Y GC D V + N+ I A A+ +D I + GL
Sbjct: 395 YVTVLDGIRELAGDDVRVVYSEGCHLFRDRVQGLGQPNDRIAEARAVAELSDVVIAVMGL 454
Query: 507 DLSVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
D +E E S D+ +L LPG Q +++ + E K PV+LV++ + I +AE
Sbjct: 455 DPGLEGEEGDQGNEFASGDKPNLELPGLQGEVLKALVESGK-PVVLVLLGGSALAIPWAE 513
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
++ AIL A YPG +GGRA+ADV+FG+ P G+LP+T+Y TS L
Sbjct: 514 --EHVPAILDAWYPGAQGGRAVADVLFGRACPEGKLPVTFYR---------TSEELPAFT 562
Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
RTY++ P LYPFGYGLSYT ++ N T++ S
Sbjct: 563 DYSMKNRTYRYMKQPALYPFGYGLSYTSWELT---------------------NTTAEGS 601
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
DD + +N G+ G+ V VY K P +A Q+ G +++
Sbjct: 602 -------------VDDGVVCRAVLRNTGAMAGAQTVQVYVKAP--LATGPNAQLKGLRKI 646
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++ G + + + ++ + + +L GE+ I++G
Sbjct: 647 RLQPGESAEVAISLDK-EAFGVYNEKGLRVLLPGEYKIYIG 686
>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
Length = 739
Score = 414 bits (1064), Expect = e-112, Method: Compositional matrix adjust.
Identities = 266/757 (35%), Positives = 389/757 (51%), Gaps = 93/757 (12%)
Query: 47 LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
L F F D LP RV+DLVSR+TL+EKV+Q+ + V RLG+P Y WW+E LHG+
Sbjct: 20 LAQEKFPFRDPQLPVEQRVEDLVSRLTLEEKVKQMLNSTPPVERLGIPAYNWWNECLHGI 79
Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------ 160
T + T FP I A++N++L K++ +++ E RA+YN +
Sbjct: 80 GR----TKYH-----VTVFPQAIGMAAAWNDALIKEVASSIADEGRAIYNDTQRKEDYSQ 130
Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
LTYW+PNIN+ RDPRWGR ET GEDP++ R +V+GLQ N R
Sbjct: 131 YHALTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGEAFVQGLQGD---------NPRY 181
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
LK S+C KHYA V + +R+ F++ V+ D+ +T+L F V + S VMC+YN
Sbjct: 182 LKASACAKHYA---VHSGPEKNRHSFNSDVSTYDLWDTYLPAFRTLVVDAKVSGVMCAYN 238
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
G P C + L+ +R +W+ GY+ +DC +I + ++HK D+ A G
Sbjct: 239 AFQGQPCCGNDLLMQSILRDKWNFTGYVTSDCGAIDDIFNHHKTHPDAATAAADAVFH-G 297
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDI 396
DLDCG AV+ G + E +D S+K L+T+ RLG FD Y + +
Sbjct: 298 TDLDCGHSAYLALVKAVKDGIITEKQLDVSVKRLFTIRFRLGLFDPVELVDYARIPISIL 357
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
++ +LA + ARE +VLLKNDQ LPL K+K V V+GP+A++ +++GNY G P R
Sbjct: 358 ECRKHQDLAKQLARESMVLLKNDQ-LLPLQKNKLKKVVVMGPNADSRESLLGNYNGNPSR 416
Query: 457 YMSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
++P+ G+ V Y G D V S + + AK ADA I + G+ +E
Sbjct: 417 MLTPLQAIRERLGGWTEVEYIEGVDHVNTISADDLKQYVNRAKGADAVIFIGGISPRLEG 476
Query: 513 ESL----------DREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTN 561
E + DR + LP QTQ++ A VA+ P + V+M+ + I + N
Sbjct: 477 EEMPVSKDGFDGGDRTTIALPAVQTQMMK--AWVAEHIPTVFVMMTGSALAIPWEAQN-- 532
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
+ AIL A Y G+ GG AIADV+FG +NP G+LP+T+Y D + +P +S
Sbjct: 533 VPAILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKD-------SDLP--DFESYDM 583
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
GRTY+++NG LYPFGYGLSYT F Y+ L K CR
Sbjct: 584 QGRTYRYFNGKALYPFGYGLSYTSFAYSSLKLPKV----------CRT------------ 621
Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRA 741
D E V +N G T+G +VV +Y P + + + GF+R+ ++A
Sbjct: 622 ---------TDKEIEVTVTVKNTGHTEGEEVVQLYVSHPDKKILVPLTALKGFKRIQLKA 672
Query: 742 GRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
G +R+ F ++ + L+ VD + AG I VG
Sbjct: 673 GEAQRVTFSLSS-EDLSCVDENGIRKVWAGTVKIQVG 708
>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
Length = 884
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 271/730 (37%), Positives = 386/730 (52%), Gaps = 55/730 (7%)
Query: 32 SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
SPV + DP +K CD+SL R+ L+S+MT++EK L D A G+PR+
Sbjct: 132 SPVCLTDPFCANKA---------CDTSLSQDDRIAALISQMTVEEKATNLVDGALGLPRI 182
Query: 92 GLPQYEWWSEALHGVSNVGPGTHFDDV----IPGATSFPTVILTTASFNESLWKKIGQAV 147
GLP YEWW+EALHGV+ G FD ATSFP IL A+F++ L + +
Sbjct: 183 GLPPYEWWNEALHGVAG-SRGVSFDSPNGSDFSYATSFPLPILMGAAFDDPLIYDVASII 241
Query: 148 STEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
EARA N +G +W+PN+N DPRWGR E P ED F RY + V GLQ G
Sbjct: 242 GKEARAFANYAHSGYDFWTPNMNTFLDPRWGRGLEVPTEDSFHAQRYVASLVPGLQ---G 298
Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
+ TD ++ + CKH+A YDV+ +R+ + T QD+ E +L F+ CV++
Sbjct: 299 GKEKTDHK----QIIATCKHFAVYDVE----TNRHAQNYEPTPQDLGEYYLPAFKTCVRD 350
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLA 324
+ S+MCSYN V G+P+CA L +R +W+ + Y+ +DC++++ + H F
Sbjct: 351 VNVGSIMCSYNAVYGVPACASEYFLQDVLRDQWNFNEPYHYVTSDCEAVKDIWTPHNF-T 409
Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
D++ A A L AG D +CG Y +V E +D SL LY L +G+FDG
Sbjct: 410 DTEPAAAAVALNAGTDTNCGTSYLQLN-TSVANNWTTEAQMDISLTRLYNALFTVGYFDG 468
Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
P+Y L D+ + A AA EGI LLKND LPL + +VA++GP ANAT
Sbjct: 469 QPEYDGLSFADVSTPFAQATAYRAASEGITLLKND-GLLPLKKS-YNSVALIGPWANATT 526
Query: 445 AMIGNYAGIPCRYMSPIAGFSG-YANVTYKTGCDDVACKSNNSIFAAS--EAAKTADATI 501
M G Y GI +SP+A + ++++ G A S N+ AS AA+ AD I
Sbjct: 527 QMQGIYQGIAPYLVSPLAAAQAQWGHISFTNG---TAINSTNTTGFASALSAARDADVII 583
Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
G+D S+E ES DR + PG Q L+ Q++E+ K P+++V G VD + N N
Sbjct: 584 YAGGIDSSIEKESRDRTSISWPGNQLDLVQQLSELGK-PLVVVQFGGGQVDDSALLRNKN 642
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
+ +++WAGYPG++GG A+ DV+ GK +P GRL IT Y DY+ + L LRP DS
Sbjct: 643 VNSLVWAGYPGQDGGSALIDVLVGKQSPAGRLTITQYPADYINQISLFDPNLRPSDSS-- 700
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
PGRTYK+YN + PFGYGL YT F+++ + K Q + + S AS T
Sbjct: 701 PGRTYKWYNKEPVLPFGYGLHYTTFEFD---WAKAPQASYDIASLVD-----STASYTTS 752
Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVF-V 739
P ND + E + N GS V +V+ + P A Y K + + R+ +
Sbjct: 753 PK--KND--ASPWTELSIKVHNSGSLGSDYVGLVFLRTPNAGPAPYPNKWLASYARLHGL 808
Query: 740 RAGRNKRIKF 749
AG + + F
Sbjct: 809 SAGASAELSF 818
>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
Length = 715
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 265/758 (34%), Positives = 389/758 (51%), Gaps = 104/758 (13%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ D S + R KDLVSRMT++EKV Q+ + + RLG+P Y WW+EALHGV+ G
Sbjct: 6 VYLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT- 64
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTY 164
AT FP I A+F+E L K+ +STE RA Y+ GLT+
Sbjct: 65 ---------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTF 115
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPNIN+ RDPRWGR ET GEDP++ R V +V+GLQ + + LK
Sbjct: 116 WSPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGN---------HPKYLKAGGM 166
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
CK+ + V R+ F+A V+++D+ ET+L F+ V+E SVM +YNR NG P
Sbjct: 167 CKNILPFTV--VPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEP 224
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
C LL+ +RGEW G++V+DC +I+ +H A + E A A ++ G DL+CG
Sbjct: 225 CCGSKTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESA-ALAVRNGCDLNCG 283
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENI 402
+ N A+++G + E +ID+++ L M+LG FD Q Y S+ C E+
Sbjct: 284 NMFGNLL-IALKEGLITEEEIDRAVTRLMITRMKLGMFDPEDQVPYASISSFVDCK-EHR 341
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
ELA + A++ IVLLKND LPL+ K++++AV+GP+A++ A+IGNY G Y++ +
Sbjct: 342 ELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLD 400
Query: 463 GFSGYA----NVTYKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDLSVEA 512
G A + Y GC + N I A A+ AD I+ GLD ++E
Sbjct: 401 GIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEG 460
Query: 513 ESL---------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
E + D+ DL LPG Q +L+ V K P++LV+++ + + +A + +I
Sbjct: 461 EEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWA--DEHIP 517
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AIL A YPG GGRAIA V+FG+ NP G+LP+T+Y T+ L
Sbjct: 518 AILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYR---------TTEELPDFTDYSMEN 568
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
RTY+F LYPFG+GLSYT F Y+ L +K
Sbjct: 569 RTYRFMKNEALYPFGFGLSYTTFDYSDLKLSK---------------------------- 600
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK---QVIGFQRVFVR 740
+ +R + F V N G G +VV VY K ++ A++ Q+ G +RV +
Sbjct: 601 ---DTIRAGEGFNVSVKVTNTGKMAGEEVVQVYIK---DLEASWRVPNWQLSGMKRVRLE 654
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+G I F + L +V +++ GE I+VG
Sbjct: 655 SGETAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691
>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
Length = 755
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 252/603 (41%), Positives = 350/603 (58%), Gaps = 46/603 (7%)
Query: 55 CD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
CD SS PY+ R L+S TL EKV G+ + GVPR+GLP YEWW+EALHG++ PGT
Sbjct: 34 CDTSSDPYT-RAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIAR-SPGT 91
Query: 114 HFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
F +TSFP IL A+F++ L K+ VSTEARA N+ R GL +W+PNIN
Sbjct: 92 TFAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNIN 151
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS-SCCKHYA 229
+DPRWGR ETPGEDPF Y + GLQ L+ P K + CKH+A
Sbjct: 152 PYKDPRWGRGQETPGEDPFHTSSYVNALITGLQG--------GLDDLPYKKGVATCKHFA 203
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
YD++N G RY FDA + QD+ + +L PF+ C ++ + SVMCSYN +NG+P+CAD
Sbjct: 204 GYDLENSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADD 263
Query: 290 KLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
LL +R W + ++ +DCD+++ + D H + + E + A L AG DLDCG +
Sbjct: 264 WLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTL-TPEQSAADALNAGTDLDCGTF 322
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIEL 404
+ + G+A QG + +D+SL Y L+RLG+FD Y L ++ + +L
Sbjct: 323 WPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFDPPSVQPYRQLNWDNVSTPAAQQL 382
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP-IAG 463
A +AA +GIVLLKND LPL S+ + VA++GP ANAT M GNY G SP IA
Sbjct: 383 ALQAAEDGIVLLKND-GILPL-SSNITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAA 440
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
+ VTY G D+ ++ AA AA++AD I + G+D S+EAE +
Sbjct: 441 QNAGFKVTYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEI-------- 491
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
+A ++ P+I+ M +D + +NT + A+LWAGYPG++GG AI +++
Sbjct: 492 ---------LANLST-PLIISQMGC-MIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNIL 540
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GK P GRLPIT Y +YV + +T M L+P S PGRTYK+YNG ++ +GYGL Y
Sbjct: 541 TGKTAPAGRLPITQYPSNYVNQVTMTDMNLQP--SRFNPGRTYKWYNGEPVFEYGYGLQY 598
Query: 644 TQF 646
T F
Sbjct: 599 TTF 601
>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
16/4]
Length = 713
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 252/746 (33%), Positives = 393/746 (52%), Gaps = 103/746 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R K+LVS+MT++EK Q+ A + RLG+P+Y WW+EALHGV+ G AT
Sbjct: 8 RAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAGD----------AT 57
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I A+F+E L +K+ STE RA YN GLTYW+PN+N+ RDP
Sbjct: 58 VFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPNVNIFRDP 117
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ G+ + YVRGLQ DL++ K ++C KH+A V +
Sbjct: 118 RWGRGHETYGEDPYLTGQLGMAYVRGLQ-------GDDLDNP--KSAACAKHFA---VHS 165
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
+R+HFDA+V +QD+ +T+L F+ VK+ +VM +YNRVNG P+C +LL
Sbjct: 166 GPEAERHHFDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAYNRVNGEPACGSKRLLKDI 225
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+RG+W G++V+DC +I+ +NHK E A A + G DL+CG Y A
Sbjct: 226 LRGDWGFEGHVVSDCWAIRDFHENHKVTGCEVESA-ALAVNNGCDLNCGCVYEKLL-YAY 283
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFF-DGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
+ V E I +S++ L + +RLG + +Y + + + E+ ELA EAA+ +V
Sbjct: 284 KANLVTEETITESVERLIELRLRLGTLPERRSKYDDIPYEVVECKEHKELAIEAAKRSMV 343
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKT 474
LLKND LPL ++KT+ V+GP++N+ +A++GNY GI Y++ + G Y +
Sbjct: 344 LLKND-GLLPLKKDEIKTIGVIGPNSNSRMALVGNYEGISSEYITVLEGIQQYVGDDVRV 402
Query: 475 GCDD----------VACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SL 515
D V ++ ++ A A+ +D ++ GLD ++E E S
Sbjct: 403 FHSDGTPLWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMGLDSTIEGEEGDAGNEFGSG 462
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
D++ L LPG Q +L+ ++ + K PV+L++++ +D+++A N N+ AI+ YPG G
Sbjct: 463 DKKGLKLPGLQQELLEKITAIGK-PVVLLVLAGSAMDLSWA--NENVNAIMHCWYPGARG 519
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
G+AIA V+FG+ +P G+LP+T+Y D L P + GRTY+++ G LY
Sbjct: 520 GKAIAQVLFGEDSPSGKLPLTFYKSD---------ADLPPFEDYSMEGRTYRYFKGTPLY 570
Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
PFGYGLSY+ +Y S+A + G + D F
Sbjct: 571 PFGYGLSYSDIQY-------------------------SNAGIDKTEGAI------GDKF 599
Query: 696 EFKVDFQNVGSTDGSDVVIVYSK---PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
KV +N G + V VY K +A ++++ +V + G +K + +
Sbjct: 600 TVKVTVKNAGDYKAHETVQVYVKDVEASTRVANCSLRKI---AKVELLPGESKEVSLELS 656
Query: 753 ACKSLNIVDYAANTLLPAGEHTIFVG 778
A + I+D + ++ G+ +FVG
Sbjct: 657 A-RDFAIIDEKGHCIVEPGKFKVFVG 681
>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
Length = 731
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 259/751 (34%), Positives = 384/751 (51%), Gaps = 93/751 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ D + ++ R LV+ MT+DEK+ QL + RL +PQY WW+EALHG++ G
Sbjct: 28 VWFDPDISFAQRANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNG-- 85
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTY 164
AT FP I A+F+ L ++ A+S EARA Y + + AGLT+
Sbjct: 86 --------KATIFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQYAGLTF 137
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
W+PN+N+ RDPRWGR ET GEDPF+ + +V+GLQ + + LK +
Sbjct: 138 WTPNVNIFRDPRWGRGQETYGEDPFLTAQMGTAFVKGLQGDD---------PKYLKSAGV 188
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + R+HFD +++D+ ET+L FE V + + VMC+YN VNG P
Sbjct: 189 AKHFA---VHSGPESLRHHFDVEPSQKDLYETYLPAFEALVTQAKVAGVMCAYNAVNGEP 245
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+CA +LL+ ++ +W HGYIV+DC ++ HK E A A L++G++L+CG
Sbjct: 246 ACASAQLLDGILKKQWGFHGYIVSDCGALNDFQAGHKVTKSGPESA-ALALQSGVNLNCG 304
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
Y +F A++Q V ID+ L L + +LGFFD G Y + I S E+I
Sbjct: 305 STYEHFLKAALEQNLVPLELIDQRLTQLLMIRFQLGFFDPAGLNPYNEVTPDVIHSPEHI 364
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
L+ + AR+ IVLLKND + LPL S +K V GP A ++ +IGNY GI +S +
Sbjct: 365 NLSRDVARKSIVLLKNDNHVLPL-SKDIKVPYVTGPFAASSDMLIGNYYGISDSLVSVLE 423
Query: 463 GFSGY----ANVTYKTGCDDVACKSN-NSIFAASEAAKTADATIILAGLDLSVEAESL-- 515
G +G +++ Y++G + +N N + A + AKTADA I + G+ +E E +
Sbjct: 424 GIAGKVSLGSSLNYRSGS--LPFHNNINPLNWAPQVAKTADAVIAVVGVSADMEGEEVDA 481
Query: 516 -------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
DR + LP Q + Q+A KGP+ILV+ + VDI+ E + AILW
Sbjct: 482 IASADRGDRVAITLPQNQVDYVKQLAAHKKGPLILVVAAGSPVDISDLEPLAD--AILWI 539
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
YPGE+GG A+ADV+FG NP G LP+T+ + LP P D GRTYKF
Sbjct: 540 WYPGEQGGNAVADVLFGDTNPSGHLPLTFVKS--IDDLP-------PFDDYAMTGRTYKF 590
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
LYPFG+G SYT+F +N L+ ++ + L
Sbjct: 591 LEKAPLYPFGFGRSYTEFSFNDLTVSQGKAIEGEAL------------------------ 626
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
V+ +N G G VV Y P A + I + F+R+ + + ++
Sbjct: 627 -------TLSVEVENRGDIAGETVVQAYLSPIARMNNEAISSLKSFKRIHLAPKETRWVE 679
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
K L V+ A T+ P G +++ VG+
Sbjct: 680 LTIQG-KDLYQVNNAGETVWPQGRYSLAVGD 709
>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 791
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 260/746 (34%), Positives = 393/746 (52%), Gaps = 49/746 (6%)
Query: 7 SLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQ--------MSSFLFCDSS 58
+LL F+ AL +T+ D N + P PG +K+ +S + CD++
Sbjct: 8 ALLAFA-PTALSQANTSYADYNTQAQPDLY--PGTTAKVDFSFPDCSNGPLSKTMVCDTT 64
Query: 59 LPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV 118
R L++ T +E V G+ +PRLGLP Y+ W+EALHG+ T F D
Sbjct: 65 AKPHDRAAALIAMFTFEELVNSTGNVMPAIPRLGLPPYQVWNEALHGLDRANL-TEFGD- 122
Query: 119 IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWG 178
ATSFP+ ILT A+ N +L +IG VST+ RA N GR GL +SPNIN R P WG
Sbjct: 123 YSWATSFPSPILTMAALNRTLINQIGGIVSTQGRAFNNGGRYGLDVYSPNINSFRHPVWG 182
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R ETPGED + Y + Y+ GLQ L+ + LK+++ KH+A YD++NW
Sbjct: 183 RGQETPGEDIQLCSVYGLEYITGLQG--------GLDPKELKLAATAKHFAGYDIENWGN 234
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
R D ++ D + F V++ SVM SYN VNG+P+ A+ LL +R
Sbjct: 235 HSRLGNDMSISAFDFASYYAPQFVTAVRDARVHSVMASYNAVNGVPASANSFLLQTLLRD 294
Query: 299 EWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
W+ GY+ +DCDS+ + + H + + S A A++++AG D+DCG Y + +
Sbjct: 295 TWNFVEDGYVSSDCDSVYNVFNPHGYAS-SASLAAAKSIQAGTDIDCGATYQLYLNQSFT 353
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVL 415
QG++ ++I+++ Y+ L+ LG+FDG + +Y L D+ + + ++ EAA EGIVL
Sbjct: 354 QGEISRSEIERAATRFYSNLVSLGYFDGDNSKYRDLDWSDVVATDAWNISYEAAVEGIVL 413
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKT 474
LKND TLPL S +VA++GP AN T M GNY G P+A +V Y
Sbjct: 414 LKND-GTLPL-SKDTHSVALIGPWANVTTTMQGNYYGAAPYLTGPLAALQASDLDVNYAF 471
Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVA 534
G +++ ++ + AA AA+ +D I G+D SVEAE +DRE + PG Q QLI Q++
Sbjct: 472 GT-NISSETTSGFEAALSAARKSDVVIFAGGIDNSVEAEGVDRETITWPGNQLQLIEQLS 530
Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
E+ K P++++ M G VD + + N N+ +++W GYPG+ GG AI D++ GK P GRL
Sbjct: 531 ELGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAILDILTGKRAPAGRLT 589
Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS-- 652
+T Y +Y P T M LRP S PG+TY +Y G +Y FG+GL YT F+ +L +
Sbjct: 590 VTQYPAEYALQFPATDMSLRPKGS--NPGQTYMWYTGKPVYEFGHGLFYTTFETSLANSH 647
Query: 653 -FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
++ KL N Y N + + + ++ +N G+
Sbjct: 648 GANNGASFDIVKLLSRSNAGY--------------NVIEQVPFMNYTIEVENTGTVTSDY 693
Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRV 737
+ + A + K ++GF R+
Sbjct: 694 TAMAFVNTKAGPSPHPNKWLVGFDRL 719
>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
Length = 788
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 261/699 (37%), Positives = 377/699 (53%), Gaps = 48/699 (6%)
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSF 125
L+S TLDE + G+ GV RLGLP Y+ WSEALHG+ +F D ATSF
Sbjct: 66 LISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLDRA----NFSDSGSYNWATSF 121
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPG 185
P ILTTA+ N +L +I +ST+ RA N GR GL ++PNIN R P WGR ETPG
Sbjct: 122 PQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPNINTFRHPVWGRGQETPG 181
Query: 186 EDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
ED + YA Y+ G+Q + N LK+++ KHYA YD++NW R D
Sbjct: 182 EDVSLAAVYAYEYITGIQGPDPDSN--------LKLAATAKHYAGYDIENWHNHSRLGND 233
Query: 246 ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--H 303
+T+QD+ E + F + ++ SVMC+YN VNG+P+CAD L +R + H
Sbjct: 234 MNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDH 293
Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
GY+ +DCD+ + + H + + S+ A A+ + AG D+DCG Y ++ G +
Sbjct: 294 GYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRD 352
Query: 364 DIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
DI+K + LYT L++ G+FD + Y L D+ + ++ +AA +GIVLLKN
Sbjct: 353 DIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKN 412
Query: 419 DQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTY 472
N LPL TVA++GP ANAT ++GNY G +SP A F +GY V +
Sbjct: 413 SNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGY-KVNF 471
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQ 532
G ++ S + AA AA++AD I G+D ++EAE+LDRE + PG Q LI +
Sbjct: 472 AEGT-GISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQK 530
Query: 533 VAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
+A A P+I++ M G VD + + NTN+ A+LW GYPG+ GG A+ D++ GK NP G
Sbjct: 531 LASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAG 590
Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
RL T Y Y + P T M LRP PG+TYK+Y G +Y FG+GL YT F +
Sbjct: 591 RLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTGEAVYEFGHGLFYTTFAES-S 647
Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
S T T +V LN +Q + + AS T+ P F + +N G +
Sbjct: 648 SNTTTKEVKLN-IQDILSQTHEELASITQLP-----------VLNFTANIKNTGKLESDY 695
Query: 712 VVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIK 748
+V++ A Y +K ++G+ R+ V+ G + ++
Sbjct: 696 TAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 734
>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
Length = 797
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 262/764 (34%), Positives = 393/764 (51%), Gaps = 47/764 (6%)
Query: 25 VDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF 84
++ G+S P C+ G SK L CD+S R LVS +T +E V +
Sbjct: 42 LETGGTSFPD--CESGPLSKT-------LVCDTSAKPHDRAAALVSLLTFEELVNNTANT 92
Query: 85 AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKK 142
HG PR+GLP Y+ W+EALHGV++ F D +TSFP I T A+ N +L +
Sbjct: 93 GHGAPRIGLPAYQVWNEALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQ 148
Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF-VVGRYAVNYVRG 201
I +ST+ RA N GR GL +SPNIN R P WGR ETPGED + + YA Y+ G
Sbjct: 149 IATIISTQGRAFMNAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITG 208
Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
+Q +++ PLK+ + KHYA YD++NW R D ++T+QD+ E + F
Sbjct: 209 IQG--------GVDANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQF 260
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDN 319
+ ++ SVMCSYN VNG+PSC++ L +R +D GY+ DC ++ + +
Sbjct: 261 LVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNP 320
Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
H + A ++ A A +++AG D+DCG Y + +V D+++ + LY L+R
Sbjct: 321 HGY-ATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRA 379
Query: 380 GFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL-NSAKVKTVAVVG 437
G+FDG + Y ++ D+ S L+ EAA + IVLLKND LPL S+ KT+A++G
Sbjct: 380 GYFDGKTSPYRNITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTTSSSTKTIALIG 438
Query: 438 PHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
P ANAT M+GNY G +SP+ F S Y +TY G + + S A AK
Sbjct: 439 PWANATTQMLGNYYGPAPYLISPLQAFQDSEY-KITYTIGTNTTTDPDSTSQSTALTTAK 497
Query: 496 TADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAF 555
AD I G+D ++E E+ DR ++ P Q LI ++A++ K P+I++ M G VD +
Sbjct: 498 EADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSA 556
Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
+ N N+ A++W GYPG+ GG+A+AD++ GK P RL T Y +Y ++ P M LRP
Sbjct: 557 LKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRP 616
Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
S PG+TY +Y G +Y FG+GL YT F + + + T ++ + N
Sbjct: 617 NGS--NPGQTYMWYTGTPVYEFGHGLFYTNFTASASAGSGT--------KNRTSFNIDEV 666
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
+ LV + F VD +N G + + A A K ++GF
Sbjct: 667 LGRPHPGYKLVEQMPL---LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFD 723
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
R+ + + + SL D N +L G + + + N
Sbjct: 724 RLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNN 767
>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
Length = 800
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 262/744 (35%), Positives = 399/744 (53%), Gaps = 50/744 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ C+++L R LV+ +T +EK+Q + + G PR+GLP Y WWSEALHGV+
Sbjct: 34 LSTNQVCNTTLSPPERAAALVAALTPEEKLQNIVSKSLGAPRIGLPAYNWWSEALHGVA- 92
Query: 109 VGPGTHF---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
PGT F D +TSFP +L A+F++ L +KI + + E RA N G +GL YW
Sbjct: 93 YAPGTQFWQGDGPFNSSTSFPMPLLMAATFDDELLEKIAEVIGIEGRAFGNAGFSGLDYW 152
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N +DPRWGR +ETPGED +V RYA ++GL+ + + +V + C
Sbjct: 153 TPNVNPFKDPRWGRGSETPGEDVLLVKRYAAAMIKGLEG--------PVPEKERRVVATC 204
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYAA D ++W G R++F+A+++ QDM E + PF+ CV++ S+MC+YN VNG+PS
Sbjct: 205 KHYAANDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSIMCAYNAVNGVPS 264
Query: 286 CADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
CA P LL +R W+ + YI +DC+++ + NHK+ A + E A + +AG+D
Sbjct: 265 CASPYLLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKYAATNAE-GTAISFEAGMDTS 323
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDEN 401
C ++ A QG +KE+ +D++L LY ++R G+FDG Y SLG D+
Sbjct: 324 CEYEGSSDIPGAWSQGLLKESTVDRALLRLYEGIVRAGYFDGKQSLYSSLGWADVNKPSA 383
Query: 402 IELAAEAAREGIVLLKNDQNTLP----LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
+L+ +AA +G VLLKND TLP L+ ++ K VA++G ++A + G Y+G Y
Sbjct: 384 QKLSLQAAVDGTVLLKND-GTLPLSDLLDKSRPKKVAMIGFWSDAKDKLRGGYSGT-AAY 441
Query: 458 MSPIAGFSGYANVTYKTGCDDVA---CKSNNSIF-AASEAAKTADATIILAGLDLSVEAE 513
+ A + + + T + SN S A AAK AD + G+D S E
Sbjct: 442 LHTPAYAASQLGIPFSTASGPILHSDLASNQSWTDNAMAAAKDADYILYFGGIDTSAAGE 501
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
+ DR DL PG Q LIN + ++K P+I++ M +D +N I AILWA +PG+
Sbjct: 502 TKDRYDLDWPGAQLSLINLLTTLSK-PLIVLQM-GDQLDNTPLLSNPKINAILWANWPGQ 559
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
+GG A+ ++V G +P GRLP+T Y ++ +++P+T M LRP GRTY++Y P
Sbjct: 560 DGGTAVMELVTGLKSPAGRLPVTQYPSNFTELVPMTDMALRPSAGNSQLGRTYRWYKTP- 618
Query: 634 LYPFGYGLSYTQFKYNL-LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
+ FG+GL YT F F I V+ L+ C D CP + DL
Sbjct: 619 VQAFGFGLHYTTFSPKFGKKFPAVIDVD-EVLEGC------DDKYLDTCP---LPDL--- 665
Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVF 751
V +N G+ V + + P + IK + F R+ G KR +
Sbjct: 666 -----PVVVENRGNRTSDYVALAFVSAPGVGPGPWPIKTLGAFTRLRGVKGGEKREGGLK 720
Query: 752 NACKSLNIVDYAANTLLPAGEHTI 775
+L D NT++ G++ +
Sbjct: 721 WNLGNLARHDEEGNTVVYPGKYEV 744
>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39B_FAA]
gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
[Ruminococcus sp. 5_1_39BFAA]
Length = 706
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 273/749 (36%), Positives = 382/749 (51%), Gaps = 102/749 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ + LVS+MTL EK QL A V RLG+P Y +W+EALHGV+ G AT
Sbjct: 14 KAEKLVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAGV----------AT 63
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I A F++ KK+G ++TE RA YN A GLT+WSPN+N+ RDP
Sbjct: 64 MFPQAIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPNVNIFRDP 123
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ R V +V G+Q + +K ++C KHYA V +
Sbjct: 124 RWGRGHETYGEDPYLTSRLGVKFVEGIQG----------DGPVMKAAACAKHYA---VHS 170
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
R+ FDA+ + +DM ET+L FE V E D +VM +YNR NG P CA L+
Sbjct: 171 GPESLRHEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAYNRTNGEPCCAHKYLMEDV 230
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+RG+W G+ +DC +I+ ++H + ++ A A L AG DL+CG Y + G A
Sbjct: 231 LRGKWKFEGHYTSDCWAIRDFHEHHMVTSTPRQSA-AMALNAGCDLNCGNTYLHMMG-AY 288
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
Q G V E I +S L T LG FDGS +Y + + E+I+ A + AR+ VL
Sbjct: 289 QDGLVTEEKITESAVRLLTTRYLLGLFDGS-EYDKIPYSVVECKEHIDEALKMARKSCVL 347
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVT 471
LKND LP++ KV T+ V+GP+A++ A+IGNY G Y++ + G A +
Sbjct: 348 LKND-GVLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSSEYITVLEGIREEAGDDVRIL 406
Query: 472 YKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
Y GCD K N I A A+ +D I+ GL+ ++E E S D
Sbjct: 407 YSQGCDLYKDKVENLAWDQDRISEAVITAENSDVVILCVGLNETLEGEEGDTGNSDASGD 466
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+ DL LP Q +LI +V V K P I+V+M+ +D+ +A+ N N IL A YPG GG
Sbjct: 467 KVDLHLPKVQEELIEKVTAVGK-PTIVVLMAGSAIDLNYAQDNCN--GILLAWYPGARGG 523
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
RAIAD++FGK +P G+LPIT+Y D M T ++ RTY++ LYP
Sbjct: 524 RAIADLLFGKESPSGKLPITFYK-DLEGMPEFTDYSMK--------NRTYRYMEKEALYP 574
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FGYGL+Y SD T +V ++ +
Sbjct: 575 FGYGLTY------------------------------SDTCVTEAE--VVGEVSAESDIV 602
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
K +N G+ D +VV VY K A + GF+RV ++AG K ++F + K+
Sbjct: 603 LKATVKNNGTVDTDEVVQVYIKDLDSPLAVRNYSLCGFKRVSLKAGEEKSVEFTISN-KA 661
Query: 757 LNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
+NIVD N + AG+H F GVS P
Sbjct: 662 MNIVDEDGNRYI-AGKH--FRLFAGVSQP 687
>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Subdoligranulum variabile DSM 15176]
Length = 717
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 263/748 (35%), Positives = 380/748 (50%), Gaps = 102/748 (13%)
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
Y R + LV++MTL EK+ Q+ +A +PRLG+P Y WW+E +HGV G
Sbjct: 11 YRERARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGVGRAGT--------- 61
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVA 172
AT FP I ASF+E L ++G+AV EAR YN+ R+ GLT W+PN+N+
Sbjct: 62 -ATVFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWAPNVNIF 120
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ET GEDP++ R V +V G+Q + L+ ++C KH+A
Sbjct: 121 RDPRWGRGHETYGEDPYLTSRLGVRFVEGMQGD---------DPDYLRAAACAKHFA--- 168
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
V + R++FDA+V++QD+ ET+L F VKE +VM +YNR NG P C LL
Sbjct: 169 VHSGPEDQRHYFDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAYNRTNGEPCCGSKTLL 228
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
+RG+W+ G++ +DC +I+ + H + D+VA + G DL+CG Y +
Sbjct: 229 VDILRGKWNFQGHVTSDCWAIKDFHEGH-MVTSGPVDSVALAVNNGCDLNCGDLYA-YLE 286
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAR 410
AV +GKVKE ID+SL L+T M+LG FD + Y +G + S E L E A
Sbjct: 287 EAVAEGKVKEETIDRSLVRLFTTRMKLGMFDAEEKVPYNKIGYDAVDSREMQALNLEVAE 346
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY--- 467
+ +VLLKN+ +TLPL+ +K+ VAVVGP+A+ A++GNY G RY++ + G Y
Sbjct: 347 KILVLLKNENHTLPLDKSKLHRVAVVGPNADNRKALVGNYEGTASRYVTVLDGIQEYLGE 406
Query: 468 -ANVTYKTGCDDVA------CKSNNSIFAASEAAKTADATIILAGLDLSVEAE------- 513
V Y GC A KSN I D I GLD +E E
Sbjct: 407 DVQVRYSEGCHLYADKIQGLAKSNELISEVRGVCAECDVVICCLGLDAGLEGEEGDQGNQ 466
Query: 514 --SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
S D++ L LPG Q ++ E K PV++V++S G +A A+L A YP
Sbjct: 467 FASGDKQSLSLPGNQESVLKACIESGK-PVVVVVLS--GSALALGTAQEGAAAVLQAWYP 523
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPGRTYKFYN 630
G +GGRA+A +FG+ NP G+LP+T+Y+ D + LP T ++ GRTY++
Sbjct: 524 GAQGGRAVARALFGECNPQGKLPVTFYHSD--EDLPAFTDYAMK--------GRTYRYME 573
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
LYPFGYGLSY+ F + + +DA++ GV
Sbjct: 574 KEPLYPFGYGLSYSHFTFR---------------------DAKADAAQIGPDGV------ 606
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ +V N G G + V VY K AE T Q+ +V + G K +
Sbjct: 607 -----DVRVTVVNDGQYRGRETVEVYVK--AERPGTPNAQLKALAKVDLMPGEEKCVTLH 659
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
C + + +LP GE+T+++G
Sbjct: 660 LPQCAFALCNEEGISEVLP-GEYTVWLG 686
>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
Length = 804
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 264/720 (36%), Positives = 383/720 (53%), Gaps = 52/720 (7%)
Query: 49 MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ S L CD S+ PY R L+S TLDE + G+ GV RLGLP Y+ WSEALHG+
Sbjct: 63 LRSHLICDESATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPVYQVWSEALHGLD 121
Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
+F D ATSFP ILTTA+ N +L +I +ST+ RA N GR GL +
Sbjct: 122 RA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN R P GR ETPGED + YA Y+ G+Q + N LK+++
Sbjct: 178 APNINTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSN--------LKLAATA 229
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA YD++NW R D +T+QD+ E + F + ++ SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPA 289
Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
CAD L +R + HGY+ +DCD+ + + H + + A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAA-AAAEAILAGTDIDC 348
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
G Y ++ G + DI+K + LYT L++ G+FD + Y L D+
Sbjct: 349 GTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLE 408
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
+ ++ +AA +GIVLLKN LPL TVA++GP ANAT ++GNY G
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNKVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468
Query: 455 CRYMSPIAGF--SGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
+SP F +GY N +TG ++ + + AA AA++AD I G+D ++E
Sbjct: 469 PYMISPRVAFEEAGYNVNFAERTG---ISSTNTSGFAAALSAAQSADVIIYAGGIDNTLE 525
Query: 512 AESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
AE+LDRE + PG Q LI ++A A P+I++ M G VD + + NTN+ A+LW GY
Sbjct: 526 AEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGY 585
Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
PG+ GG A+ D++ GK NP GRL T Y Y + P T M LRP PG+TYK+Y
Sbjct: 586 PGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYT 643
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
G +Y FG+GL YT F + S T T ++ LN +Q + + AS T+ P
Sbjct: 644 GEAVYEFGHGLFYTTFAES-SSNTTTREIKLN-IQDILSQTHEDLASITQLP-------- 693
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRV-FVRAGRNKRIK 748
F + +N G + +V++ A Y +K ++G+ R+ V+ G + ++
Sbjct: 694 ---VLNFTANIKNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGEVKVGETRELR 750
>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 835
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 250/623 (40%), Positives = 353/623 (56%), Gaps = 33/623 (5%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S CD +LP + R LV+ +T +EK+Q L A G PR+GLP Y WWSEALHGV++
Sbjct: 23 LSDIKVCDRTLPEAERAAALVAALTDEEKLQNLVSKAPGAPRIGLPAYNWWSEALHGVAH 82
Query: 109 VGPGTHFDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
PGT F D PG +TSFP +L A+F++ L + +G + TEARA N G +GL Y
Sbjct: 83 A-PGTQFRDG-PGDFNSSTSFPMPLLMAAAFDDELIEAVGDVIGTEARAFGNAGWSGLDY 140
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS--RPLKVS 222
W+PN+N RDPRWGR +ETPGED + RYA + +RGL+ ++ S P +V
Sbjct: 141 WTPNVNPFRDPRWGRGSETPGEDVVRLKRYAASMIRGLEGRSSSSSSCSFGSGGEPPRVI 200
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S CKHYA D ++W G R+ FDA ++ QD+ E +L PF+ C ++ SVMC+YN VNG
Sbjct: 201 STCKHYAGNDFEDWNGTTRHDFDAVISAQDLAEYYLAPFQQCARDSRVGSVMCAYNAVNG 260
Query: 283 IPSCADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
+PSCA+ L+N +RG W+ Y+ +DC+++ + V H AD+ + +AG+
Sbjct: 261 VPSCANSYLMNTILRGHWNWTEHDNYVTSDCEAV-LDVSAHHHYADTNAEGTGLCFEAGM 319
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDIC 397
D C ++ A G + +D++L LY L+R+G+FDG SP + SLG D+
Sbjct: 320 DTSCEYEGSSDIPGASAGGFLTWPAVDRALTRLYRSLVRVGYFDGPESP-HASLGWADVN 378
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPL---------NSAKVKTVAVVGPHANATVAMIG 448
E ELA AA EGIVLLKND +TLPL + VA++G A+A + G
Sbjct: 379 RPEAQELALRAAVEGIVLLKNDNDTLPLPLPDDVVVTADGGRRRVAMIGFWADAPDKLFG 438
Query: 449 NYAGIPCRYMSPIAGFSGYA-NVTYKTGC---DDVACKSNNSIFAASEAAKTADATIILA 504
Y+G P SP + NVT G D + + A EAA AD +
Sbjct: 439 GYSGAPPFARSPASAARQLGWNVTVAGGPVLEGDSDEEEDTWTAPAVEAAADADYIVYFG 498
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GLD S E+ DR + P Q LI+++A + K PV++V M D E + + A
Sbjct: 499 GLDTSAAGETKDRMTIGWPAAQLALISELARLGK-PVVVVQMGDQLDDTPLFELD-GVGA 556
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
+LWA +PG++GG A+ ++ G +P GRLP+T Y +Y +PLT M LRP S PGR
Sbjct: 557 VLWANWPGQDGGTAVVRLLSGAESPAGRLPVTQYPANYTDAVPLTDMTLRP--SATNPGR 614
Query: 625 TYKFYNGPTLYPFGYGLSYTQFK 647
TY++Y P + PFG+GL YT F+
Sbjct: 615 TYRWYPTP-VRPFGFGLHYTTFR 636
>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
Length = 735
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 260/705 (36%), Positives = 381/705 (54%), Gaps = 55/705 (7%)
Query: 87 GVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQA 146
GV RLGL Y+WW+EALHGV++ G + AT FP I ++A+F++ L ++IG
Sbjct: 47 GVSRLGLSAYQWWNEALHGVAH-NRGITWGGQFSAATQFPQAITSSAAFDDHLIERIGVI 105
Query: 147 VSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
+STEARA N GRA L +W+PN+N RDPRWGR ETPGED F ++A +V+G+Q E
Sbjct: 106 ISTEARAFANNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFRNKKWAEAFVQGMQGTE 165
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
+V + CKHYAAYD++N R++FDA+V+ QD+ E +L PF+ C +
Sbjct: 166 STH----------RVIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCAR 215
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVD---NH 320
+ S+MCSYN VNG+P+CA P L++ +R W D + Y+V+DCD++ + + H
Sbjct: 216 DSKVGSIMCSYNAVNGVPACASPYLMDTILRKHWNWTDQNQYVVSDCDAVYYLGNANGGH 275
Query: 321 KFLADSKEDAVAQTLKAGLDLDCGQYYTNFT----GNAVQQGKVKETDIDKSLKYLYTVL 376
++ S A+ +L+AG D C + T T +A + + +DK++ L
Sbjct: 276 RY-KSSYAAAIGASLEAGCDNMC--WATGGTTPDPASAFNSRQFTQATLDKAMLRQMQGL 332
Query: 377 MRLGFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT-VA 434
++ G+FDG + Y +L D+ + + A +AA EGIVLLKND N LPL T VA
Sbjct: 333 VKAGYFDGPNSLYRNLTAADVNTQVARDTALKAAEEGIVLLKND-NILPLTLGGSNTQVA 391
Query: 435 VVGPHANATVAMIGNYAGIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIFAASEA 493
++G ANA M+G Y+G P P+ A S V Y G ++N AA A
Sbjct: 392 MIGFWANAADKMLGGYSGSPPFSHDPVTAARSMGITVNYVNGP---LTQTNADTSAAVNA 448
Query: 494 AKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDI 553
A+ + I G+D +VE ES DR + P Q +I ++A+ K PVI+V M VD
Sbjct: 449 AQKSSVVIFFGGIDNTVEKESQDRTSIAWPSGQLTMIQRLAQTGK-PVIVVRMGT-HVDD 506
Query: 554 AFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPL 613
+ N+KAILWAGYPG++GG A+ +++ G +P GRLP+T Y Y P T+M L
Sbjct: 507 TPLLSIPNVKAILWAGYPGQDGGTAVMNLITGLASPAGRLPVTVYPSSYTNQAPYTNMAL 566
Query: 614 RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
RP S YPGRTY++Y P ++PFG+GL YT F L F T + + L C+ + Y
Sbjct: 567 RPSSS--YPGRTYRWYKDP-VFPFGHGLHYTNFSVAPLDFPATFSI-ADLLASCKGVTYL 622
Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
CP + V N GS VV+ + IK +
Sbjct: 623 E-----LCP-----------FPSVSVSVTNTGSRASDYVVLGFLAGDFGPTPRPIKSLAT 666
Query: 734 FQRVF-VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
++RVF V+ G+ + + + +SL VD N +L G +T+ +
Sbjct: 667 YKRVFDVQPGKTQSAELDWK-LESLARVDGKGNRVLYPGTYTLLL 710
>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
Length = 461
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 199/382 (52%), Positives = 270/382 (70%), Gaps = 11/382 (2%)
Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
MYN+G AGLT+WSPN+N+ RDPRWGR ETPGEDP + +YA YVRGLQ + D
Sbjct: 1 MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQ------QSDD 54
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
+ LK+++CCKHY AYD+DNWKGVDR+HF+A VT+QDM++TF PF+ CV +G+ +SV
Sbjct: 55 GSPDRLKIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASV 114
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MCSYN+VNG P+CADP LL+ VRGEW L+GYIV+DCDS+ V ++ + + E+A A+
Sbjct: 115 MCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAK 173
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
+ AGLDL+CG + T AV+ G V E+ +DK++ + LMRLGFFDG+P Y
Sbjct: 174 AILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGK 233
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
LG +D+C+ E+ ELA EAAR+GI+LLKN + +LPL+ +KT+A++GP+AN T MIGNY
Sbjct: 234 LGPKDVCTLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNY 293
Query: 451 AGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV 510
G PC+Y +P+ G TY +GC +VAC S I A + A ADAT+++ G+D S+
Sbjct: 294 EGTPCKYTTPLQGLMALVATTYLSGCSNVAC-STAQIDEAKKIAAAADATVLIVGIDQSI 352
Query: 511 EAESLDREDLWLPGYQTQLINQ 532
EAE DR ++ LPG Q LI +
Sbjct: 353 EAEGRDRVNIQLPGQQPLLITE 374
>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 798
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 264/768 (34%), Positives = 396/768 (51%), Gaps = 54/768 (7%)
Query: 25 VDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF 84
++ G+S P C+ G SK L CD+S R LVS +T +E V +
Sbjct: 42 LETGGTSFPD--CESGPLSKT-------LVCDTSAKPHDRAAALVSLLTFEELVNNTANT 92
Query: 85 AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKK 142
HG PR+GLP Y+ W+EALHGV++ F D +TSFP I T A+ N +L +
Sbjct: 93 GHGAPRIGLPAYQVWNEALHGVAHA----DFSDAGGFSWSTSFPQPISTMAALNRTLIHQ 148
Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF-VVGRYAVNYVRG 201
I +ST+ RA N GR GL +SPNIN R P WGR ETPGED + + YA Y+ G
Sbjct: 149 IATIISTQGRAFMNAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITG 208
Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
+Q +++ PLK+ + KHYA YD++NW R D ++T+QD+ E + F
Sbjct: 209 IQG--------GVDANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQF 260
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDN 319
+ ++ SVMCSYN VNG+PSC++ L +R +D GY+ DC ++ + +
Sbjct: 261 LVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNP 320
Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
H + A ++ A A +++AG D+DCG Y + +V D+++ + LY L+R
Sbjct: 321 HGY-ATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRA 379
Query: 380 GFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL--NSAKVKTVAVV 436
G+FDG + Y ++ D+ S L+ EAA + IVLLKND LPL S+ KT+A++
Sbjct: 380 GYFDGKTSPYRNITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALI 438
Query: 437 GPHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAA 494
GP ANAT M+GNY G +SP+ F S Y +TY G + + S A A
Sbjct: 439 GPWANATTQMLGNYYGPAPYLISPLQAFQDSEY-KITYTIGTNTTTDPDSTSQSTALTTA 497
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
K AD I G+D ++E E+ DR ++ P Q LI ++A++ K P+I++ M G VD +
Sbjct: 498 KEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSS 556
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+ N N+ A++W GYPG+ GG+A+AD++ GK P RL T Y +Y ++ P M LR
Sbjct: 557 ALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLR 616
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT---IQVNLNKLQHCRNLN 671
P S PG+TY +Y G +Y FG+GL YT F + + + T N++++ +L
Sbjct: 617 PNGS--NPGQTYMWYTGTPVYEFGHGLFYTNFTASASASSGTKNRTSFNIDEVLGRPHLG 674
Query: 672 YTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
Y LV + F VD +N G + + A A K +
Sbjct: 675 YK-----------LVEQMPL---LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWL 720
Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+GF R+ + + + SL D N +L G + + + N
Sbjct: 721 VGFDRLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNN 768
>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 786
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 237/604 (39%), Positives = 341/604 (56%), Gaps = 25/604 (4%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ C+ SL R LV TL+E G+ A GVPRLGLP YE W+EALHG+S+
Sbjct: 54 LSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWNEALHGISH 113
Query: 109 VGPGTHF--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
HF + ATSFP+ IL+ AS N +L +IG +ST+ RA N GR GL ++
Sbjct: 114 ----GHFATNGTWSWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAGRYGLDSYA 169
Query: 167 PNINVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
PNIN R P WGR ETPGED F + YA Y+ G+Q G A K+ +
Sbjct: 170 PNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQG--GKAPAVP------KLVAVP 221
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A YD++NW R D +T+QD+ + F ++ A +MCSYN VNG+PS
Sbjct: 222 KHFAGYDIENWNNNSRLGLDVNITQQDLAGYYTPQFRSAIQNAKALGLMCSYNAVNGVPS 281
Query: 286 CADPKLLNQTVRGEWDL-HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
C++ L R W +G++ +DCD++ + + H + A++ AVA +L+AG D+DCG
Sbjct: 282 CSNSFFLQTLARDTWGFGNGFVSSDCDAVYNVYNPHGYAANTT-GAVADSLRAGTDIDCG 340
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIE 403
Y + A G V DI+ +L Y+ L+ G+FDG S Y +LG D+ + +
Sbjct: 341 TSYPFYLVPAFNAGLVSRNDIELALTRYYSGLVMQGYFDGNSSLYRNLGWNDVLTTDAWN 400
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
++ EAA EGI LLKND TLPL S ++VA++GP ANAT+ + GNY +SP+
Sbjct: 401 ISYEAAVEGITLLKND-GTLPL-SKSTRSVALIGPWANATLQLQGNYYAAAPYLISPLQA 458
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
F + +T +N S FA A A+ +D I G+D S+EAE LDR+++
Sbjct: 459 FRA-SGMTVNFVNGTTISSTNTSGFAEAITLAQQSDVIIYAGGIDNSIEAEGLDRQNITW 517
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
PG Q LI Q+++V K P++++ M G VD + + N+ + A++W GYPG+ GG+A+ D+
Sbjct: 518 PGNQLDLIYQLSQVGK-PLVVLQMGGGQVDSSALKNNSKVNALVWGGYPGQSGGQALFDI 576
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
+ G P GRL T Y Y +M + PV+ G G+TY +Y G +YPFG+GL
Sbjct: 577 IMGNRAPAGRLVTTQYPASYATSFNQLNMNMAPVN--GSLGQTYMWYTGTPVYPFGHGLF 634
Query: 643 YTQF 646
YT F
Sbjct: 635 YTNF 638
>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
ATCC 42464]
Length = 760
Score = 404 bits (1038), Expect = e-109, Method: Compositional matrix adjust.
Identities = 268/744 (36%), Positives = 397/744 (53%), Gaps = 56/744 (7%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CD+S R LVS M +EK+ L + + GV RLGL Y+WW+EALHGV++
Sbjct: 33 LKSNTVCDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWWNEALHGVAH 92
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
G + AT FP I T+A+F+++L ++IG +STEARA N GRA L +W+PN
Sbjct: 93 -NRGITWGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGRAHLDFWTPN 151
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
+N RDPRWGR ETPGED F ++A +V+G+Q +V + CKHY
Sbjct: 152 VNPFRDPRWGRGHETPGEDAFKNKKWAEAFVKGMQGPGPTH----------RVIATCKHY 201
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AAYD++N R++FDA+V+ QD+ E +L PF+ C ++ S+MCSYN VN IP+CA+
Sbjct: 202 AAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNEIPACAN 261
Query: 289 PKLLNQTVRGEW---DLHGYIVADCDSIQVMVD---NHKFLADSKEDAVAQTLKAGLDLD 342
P L++ +R W D H YIV+DCD++ + + H++ S A+ +L+AG D
Sbjct: 262 PYLMDTILRKHWNWTDEHQYIVSDCDAVYYLGNANGGHRY-KPSYAAAIGASLEAGCDNM 320
Query: 343 CGQYYTNFT----GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDIC 397
C + T T +A G+ +T +D ++ L+ G+FDG Y +L D+
Sbjct: 321 C--WATGGTAPDPASAFNSGQFSQTTLDTAILRQMQGLVLAGYFDGPGGMYRNLSVADVN 378
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPL--NSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
+ + A +AA GIVLLKND LPL N + + VA++G ANA M+G Y+G P
Sbjct: 379 TQTAQDTALKAAEGGIVLLKND-GILPLSVNGSNFQ-VAMIGFWANAADKMLGGYSGSPP 436
Query: 456 RYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
P+ A S V Y G + N AA AA+ ++A + G+D +VE ES
Sbjct: 437 FNHDPVTAARSMGITVNYVNG---PLTQPNGDTSAALNAAQKSNAVVFFGGIDNTVEKES 493
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
DR + P Q LI ++AE K PVI+V + VD + N++AILWAGYPG++
Sbjct: 494 QDRTSIEWPSGQLALIRRLAETGK-PVIVVRLGT-HVDDTPLLSIPNVRAILWAGYPGQD 551
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
GG A+ ++ G +P GRLP T Y Y P T+M LRP S YPGRTY++Y+ +
Sbjct: 552 GGTAVVKIITGLASPAGRLPATVYPSSYTSQAPFTNMALRPSSS--YPGRTYRWYSN-AV 608
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
+PFG+GL YT F ++ F + + + L C + S A CP +
Sbjct: 609 FPFGHGLHYTNFSVSVRDFPASFAI-ADLLASCGD----SVAYLDLCP-----------F 652
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNA 753
++ N G+ V + + + IK + ++RVF + G + + +
Sbjct: 653 PSVSLNVTNTGTRVSDYVALGFLSGDFGPSPHPIKTLATYKRVFNIEPGETQVAELDWK- 711
Query: 754 CKSLNIVDYAANTLLPAGEHTIFV 777
+SL VD N +L G +T+ V
Sbjct: 712 LESLVRVDEKGNRVLYPGTYTLLV 735
>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 730
Score = 404 bits (1037), Expect = e-109, Method: Compositional matrix adjust.
Identities = 245/651 (37%), Positives = 361/651 (55%), Gaps = 72/651 (11%)
Query: 44 KLGLQMSSFLFCDSSLPYSIRVKD--LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
K G QM +++L R K LV +MTL+EKV Q + A + RLG+ Y WW+E
Sbjct: 2 KRGFQMKETSEKETALDRQRREKAEYLVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNE 61
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-- 159
LHGV+ G AT FP I A+F+E L + +G+AVSTEARA Y++ +
Sbjct: 62 GLHGVARAGV----------ATIFPQAIGLAATFDEDLIETVGEAVSTEARAKYHMQQRY 111
Query: 160 ------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
GLT W+PNIN+ RDPRWGR ET GEDP++ R + Y+RGLQ HE
Sbjct: 112 GDTDIYKGLTLWAPNINIFRDPRWGRGHETYGEDPWLTSRLGIRYIRGLQG--SHE---- 165
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
+ LK ++C KH+A V + R+ FDA V+E+D+ ET+L FE CVK+GD +V
Sbjct: 166 ---KYLKTAACVKHFA---VHSGPEELRHSFDAEVSEKDLRETYLPAFEACVKDGDVEAV 219
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
M +YNRVNG+P C + LL +R EW HG++V+DC +I+ + H + DS ++V+
Sbjct: 220 MGAYNRVNGVPCCGNEYLLETILRKEWGFHGHVVSDCWAIKDFHEGHG-VTDSPVESVSM 278
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
+ G DL+CG +T AV++GKVKE +D+++ L+T ++LG + Y
Sbjct: 279 AMNHGCDLNCGNLFTYLI-QAVKEGKVKEERLDEAVIRLFTTRLKLGALGKMEEDDPYAG 337
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
+ ++ S +L AA + +VLLKN + LP+++ + KT+ V+GP+A++ A++GNY
Sbjct: 338 ISYLEVDSPAMKKLNRSAAGKSVVLLKNTEGLLPIDTKRYKTIGVIGPNADSRRALVGNY 397
Query: 451 AGIPCRYMSPIAGFSG----YANVTYKTGCDDVACKSNNSIFAA-----SEA---AKTAD 498
G Y++ + G A V Y GC KSN S A SE + +D
Sbjct: 398 EGTASEYVTVLEGIREAAEPEARVLYSEGCH--LYKSNVSGLGARNDRLSEVKGICRESD 455
Query: 499 ATIILAGLDLSVEAES---------LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
I GLD ++E E D+ DL LPG Q +++ + K PV+LV+++
Sbjct: 456 IVIACMGLDSTLEGEQGDTGNIYAGGDKPDLMLPGLQQKILETAYDSGK-PVVLVLLAGS 514
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+ + +A + ++ AIL A YPG EGGR +ADV+FG NP GRLP+T+Y T
Sbjct: 515 AMAVTWA--DEHLPAILTAWYPGAEGGRGVADVLFGTVNPEGRLPVTFYR---------T 563
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
+ L + GRTY+F LYPFG+GLSYT+F + L ++ V+
Sbjct: 564 TEELPDFTNYSMEGRTYRFMKQKALYPFGFGLSYTEFSCSGLEVSERDSVD 614
>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
Length = 1241
Score = 404 bits (1037), Expect = e-109, Method: Compositional matrix adjust.
Identities = 198/325 (60%), Positives = 241/325 (74%), Gaps = 17/325 (5%)
Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
QAVSTEARAMYN+G+ GLTYWSPNINV RDPRWGR ETPGEDP+VVGRYAVN+VRG+QD
Sbjct: 916 QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 975
Query: 205 VEGHENAT---DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
+ GHE D N+RPLK S+CCKHYAAYD+D+W R+ FDARV E+DM ETF RPF
Sbjct: 976 IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 1035
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
EMCV++GD SSVMCSYNRVNGIP+CAD +LL+QT+R +W LHGYIV+DCD+++VM DN
Sbjct: 1036 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 1095
Query: 322 FLADSKEDAVAQTLKAGLDLDCGQYYTNFTG-------------NAVQQGKVKETDIDKS 368
+L + +A A LKAGLDLDCG+ + N T AV +GK++E+DID +
Sbjct: 1096 WLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEAVNKGKMRESDIDNA 1155
Query: 369 LKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA 428
L Y LMRLG+FD QY SLG+QDIC+D++ LA + AR+GIVLLKND LPL++
Sbjct: 1156 LTNQYMTLMRLGYFDDIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDAN 1215
Query: 429 KVKTVAVVGPHANA-TVAMIGNYAG 452
KV V V GPH A M G+Y G
Sbjct: 1216 KVGFVNVRGPHVQAPEKIMDGDYTG 1240
>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
1_4_56FAA]
Length = 717
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 254/742 (34%), Positives = 390/742 (52%), Gaps = 89/742 (11%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ ++LV +MTL EK QL A +PRL +P Y WW+E+LHGV+ G AT
Sbjct: 13 QAEELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGGT----------AT 62
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYWSPNINVARDP 175
FP I ASF+ + ++IG+A++ E RA YN GLT+W+PN+N+ RDP
Sbjct: 63 VFPQAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKGLTFWAPNVNIFRDP 122
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ R V+Y+RGLQ + +K ++C KH+A V +
Sbjct: 123 RWGRGHETYGEDPYLSSRLGVSYIRGLQG----------DGETMKAAACAKHFA---VHS 169
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
R+ FDA V+E+D+ ET+L F+ CV+EG +VM +YN VNG P C LL +
Sbjct: 170 GPEALRHEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAYNCVNGEPCCGSETLLKKI 229
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+R EW G++V+DC +I+ +NH + + + A ++AG DL+CG Y + +A
Sbjct: 230 LREEWGFDGHVVSDCWAIKDFHENH-LVTGTPVQSAALAMEAGCDLNCGVTYLHLV-HAC 287
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
Q+G V E I ++ L+T LG FDGS +Y S+ + E+ +L+ AARE IVL
Sbjct: 288 QEGLVTEAQITEAAIRLFTTRFLLGMFDGS-EYDSVPYTVVECKEHRDLSERAARESIVL 346
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVT 471
LKN+ LPL+ K+KT+ ++GP+A++ A+IGNY G Y++ + G +
Sbjct: 347 LKNN-GILPLDREKLKTIGIIGPNADSRKALIGNYHGTSSEYITVLEGVRRLVGDEVRIL 405
Query: 472 YKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
Y GC K+ N + A A+ +D I+ GLD ++E E S D
Sbjct: 406 YSDGCHLYENKTENLAREQDRLSEARIVARESDVVILCLGLDETLEGEEGDTGNSYASGD 465
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+ DL LP Q L+ VA + K P +L +M+ +D++FAE + + LW YPG GG
Sbjct: 466 KVDLRLPKSQRMLMEAVA-MEKKPTVLCLMAGSDIDLSFAEKHFDAIVDLW--YPGAYGG 522
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
A AD++FGK +P G+LPIT+Y +++LP + GRTY++ YP
Sbjct: 523 AAAADILFGKCSPSGKLPITFYES--LEVLP-------SFEDYSMRGRTYRYLEQKAQYP 573
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FGYGL+YT+ K + + + + ++ ++ N ++A+ C V
Sbjct: 574 FGYGLTYTKMKIRNV-WLENAEKDMKEVTDGEN----AEAAVIVCAEV------------ 616
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N G D +V+ +Y + T + GF+R+FV G K +K N +
Sbjct: 617 -----ENCGGMDSQEVLQIYIRDTESEHETPHPHLAGFERIFVEKGVKKLVKIPVNR-SA 670
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
+VD + +G++ IF G
Sbjct: 671 FTVVDESGRRFTDSGKYEIFAG 692
>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
Length = 516
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 217/520 (41%), Positives = 313/520 (60%), Gaps = 29/520 (5%)
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MCSYNRVNG+P+CAD LL+ T R +W +GYI +DCD++ ++ D + A + EDAVA
Sbjct: 1 MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGY-AKTAEDAVAD 59
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
LKAG+D++CG Y + +A+QQGK+ E DI+++L L+ V MRLG F+G P+ Y
Sbjct: 60 VLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGD 119
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKND--QNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+G +C+ E+ +LA EAA++GIVLLKND LPL+ V ++AV+G +AN + + G
Sbjct: 120 IGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRG 179
Query: 449 NYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
NY G PC ++P+ GY + ++ GC+ AC +I A +AA +AD+ ++ GLD
Sbjct: 180 NYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVT-TIPEAVQAASSADSVVLFMGLD 238
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
E E +DR DL LPG Q LI VA AK PVILV++ G VD++FA+TN I AILW
Sbjct: 239 QDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILW 298
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
AGYPGE GG AIA V+FG+ NPGGRLP+TWY D+ + +P+T M +R + GYPGRTY+
Sbjct: 299 AGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTR-VPMTDMRMRADPATGYPGRTYR 357
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
FY GPT++ FGYGLSY+++ + + N+ L+ A + G+
Sbjct: 358 FYRGPTVFNFGYGLSYSKYSHRFATKPPPTS-NVAGLK----------AVEATAGGMASY 406
Query: 688 DLR------CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRVF 738
D+ CD F V QN G DG V+V+ + P + + Q+IGFQ +
Sbjct: 407 DVEAIGSETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLH 466
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+RA + ++F + CK + ++ G H + VG
Sbjct: 467 LRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVG 506
>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
Length = 792
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 232/609 (38%), Positives = 346/609 (56%), Gaps = 21/609 (3%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S + CD++ R L S TL+E V G+ VPRLGLP Y+ WSEALHG+
Sbjct: 56 LSKTIVCDTTAKPHDRAAALTSMFTLEELVNSTGNVIPAVPRLGLPPYQVWSEALHGLDR 115
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
D ATSFP+ IL A+ N +L +IG+ +ST+ RA N GR GL ++PN
Sbjct: 116 ANLTESGD--YSWATSFPSPILIMAALNRTLINQIGEIISTQGRAFNNGGRYGLDVYAPN 173
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN R P WGR ETPGED + Y V Y+ G+Q LN R LK+++ KH+
Sbjct: 174 INSFRHPVWGRGQETPGEDVQLCSIYGVEYITGIQG--------GLNPRDLKLAATAKHF 225
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
A YD++NW R + ++ D+ + F V++ SVM SYN VNG+PS A+
Sbjct: 226 AGYDLENWGNHSRLGNNVAISSFDLASYYTPQFITAVRDARVHSVMSSYNAVNGVPSSAN 285
Query: 289 PKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
LL +R W+ GY+ +DCD++ + + H + + S A A++++AG D+DCG
Sbjct: 286 SFLLQTLLRETWNFVEDGYVSSDCDAVFNVFNPHGYAS-SASLAAAKSIQAGTDIDCGAT 344
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELA 405
Y + ++ ++ ++I++++ Y+ L+ LG+FDG + +Y L D+ + + ++
Sbjct: 345 YQLYLNESLSHDEISRSEIERAVTRFYSTLVSLGYFDGDNSKYRHLHWPDVVATDAWNIS 404
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF- 464
EAA EGIVLLKND TLPL S ++VA++GP AN T + GNY G P+A
Sbjct: 405 YEAAVEGIVLLKND-GTLPL-SNNTRSVALIGPWANVTTTLQGNYYGAAPYLTGPLAALQ 462
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
+ +V Y G +++ S + AA AA ++ I G+D +VEAE +DRE + PG
Sbjct: 463 ASNLDVNYAFGT-NISSDSTSGFEAALSAAGKSEVIIFAGGIDNTVEAEGVDRESITWPG 521
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q QLI Q++++ K P++++ M G VD + + N N+ +++W GYPG+ GG AI D++
Sbjct: 522 NQLQLIEQLSKLGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAILDILT 580
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
GK P GRL +T Y +Y P T M LRP + PG+TY +Y G +Y FG+GL YT
Sbjct: 581 GKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGN--NPGQTYMWYTGKPVYEFGHGLFYT 638
Query: 645 QFKYNLLSF 653
FK +L F
Sbjct: 639 TFKVSLAHF 647
>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
Length = 798
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 262/765 (34%), Positives = 393/765 (51%), Gaps = 48/765 (6%)
Query: 25 VDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF 84
++ G+S P C+ G SK L CD+S R LVS +T +E V +
Sbjct: 42 LETGGTSFPD--CESGPLSKT-------LVCDTSAKPHDRAAALVSLLTFEELVNNTANT 92
Query: 85 AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKK 142
HG PR+GLP Y+ W+EALHGV++ F D +TSFP I T A+ N +L +
Sbjct: 93 GHGAPRIGLPAYQVWNEALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQ 148
Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF-VVGRYAVNYVRG 201
I +ST+ RA N GR GL +SPNIN R P WGR ETPGED + + YA Y+ G
Sbjct: 149 IATIISTQGRAFMNAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITG 208
Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
+Q +++ PLK+ + KHYA YD++NW R D ++T+QD+ E + F
Sbjct: 209 IQG--------GVDANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQF 260
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDN 319
+ ++ SVMCSYN VNG+PSC++ L +R +D GY+ DC ++ + +
Sbjct: 261 LVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNP 320
Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
H + A ++ A A +++AG D+DCG Y + +V D+++ + LY L+R
Sbjct: 321 HGY-ATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRA 379
Query: 380 GFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL--NSAKVKTVAVV 436
G+FDG + Y ++ D+ S L+ EAA + IVLLKND LPL S+ KT+A++
Sbjct: 380 GYFDGKTSPYRNITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALI 438
Query: 437 GPHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAA 494
GP ANAT M+GNY G +SP+ F S Y +TY G + + S A A
Sbjct: 439 GPWANATTQMLGNYYGPAPYLISPLQAFQDSEY-KITYTIGTNTTTDPDSTSQSTALTTA 497
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
K AD I G+D ++E E+ DR ++ P Q LI ++A++ K P+I++ M G VD +
Sbjct: 498 KEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSS 556
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+ N N+ A++W GYPG+ GG+A+AD++ GK P RL T Y +Y ++ P M LR
Sbjct: 557 ALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLR 616
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S PG+TY +Y G +Y FG+GL YT F + + + T ++ + N
Sbjct: 617 PNGS--NPGQTYMWYTGTPVYEFGHGLFYTNFTASASAGSGT--------KNRTSFNIDE 666
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+ LV + F VD +N G + + A A K ++GF
Sbjct: 667 VLGRPHPGYKLVEQMPL---LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGF 723
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
R+ + + + SL D N +L G + + + N
Sbjct: 724 DRLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNN 768
>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
Length = 709
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 258/745 (34%), Positives = 386/745 (51%), Gaps = 104/745 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R K+LV++MT++EK QL A + RLG+P Y WW+EALHGV+ G AT
Sbjct: 9 RAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAGT----------AT 58
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I A+F+E L ++G+ ++ EARA YN GLT+W+PN+N+ RDP
Sbjct: 59 MFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPNVNIFRDP 118
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDPF+ R AV +V+ +Q + +K ++C KH+A V +
Sbjct: 119 RWGRGHETYGEDPFLTSRLAVPFVKAMQG----------DGEYMKAAACAKHFA---VHS 165
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
+R+ FDA+ +++D+EET+L FE VKE + +VM +YNR NG P CA+ L+ T
Sbjct: 166 GPEGERHFFDAKASKKDLEETYLPAFEALVKEAEVEAVMGAYNRTNGEPCCANKPLMVDT 225
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+RG+W G+ V+DC +I+ +NHK + S E++ L+ G DL+CG Y + N V
Sbjct: 226 LRGKWGFQGHFVSDCWAIKDFHENHK-VTSSPEESAKLALEMGCDLNCGCTYQSIM-NGV 283
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
+ G + E I +S + L+T LG FD + ++ + + + E++ +A AARE +VL
Sbjct: 284 RAGLIDEKLITESCERLFTTRFLLGMFDKT-EFDEIPYEKVECKEHLAVAKRAARESVVL 342
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKND LPLN +KT+ VVGP+AN+ +++IGNY G RY++ + G V
Sbjct: 343 LKND-GLLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSSRYITVLEGIQDKVGDDVRVL 401
Query: 472 YKTGCDDVACKSNN--------SIFAASEAAKTADATIILAGLDLSVEAE---------S 514
Y GCD +N + A A +D +++ GLD ++E E S
Sbjct: 402 YSEGCDIFQNNISNLADPNLPDRLSEAQAVADHSDVVVVVVGLDENLEGEEGDAGNQFAS 461
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
D+ +L LP Q QL+N V + K P I++ M+ +D++ A+ N A+L A YPG
Sbjct: 462 GDKINLNLPLSQRQLLNAVLDCGK-PTIVIDMAGSAIDLSKAQDEAN--AVLQAFYPGAR 518
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
GG +AD++FG +P G+LP+T+Y ++ L RTYK++ G L
Sbjct: 519 GGADVADILFGDVSPSGKLPVTFYK---------SADDLPDFKDYSMKNRTYKYFTGTPL 569
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFGYGL+Y K + N+ Y +DA K
Sbjct: 570 YPFGYGLTYGDCYV--------------KPDYDFNVKY-ADADKVSGA------------ 602
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
E V N G D +VV +Y K AT ++GF+RV V AG R+
Sbjct: 603 -EITVTVVNDGKLDTDEVVQLYIKDMDSYFATTNPSLVGFKRVHVPAGGETRV------- 654
Query: 755 KSLNIVDYAANTLLPAGEHTIFVGN 779
+L + + A ++ GE +F N
Sbjct: 655 -TLTVSEKAFTSVNEEGERAVFGKN 678
>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
Length = 798
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 262/765 (34%), Positives = 393/765 (51%), Gaps = 48/765 (6%)
Query: 25 VDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF 84
++ G+S P C+ G SK L CD+S R LVS +T +E V +
Sbjct: 42 LETGGTSFPD--CESGPLSKT-------LVCDTSAKPHDRAAALVSLLTFEELVNNTANT 92
Query: 85 AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKK 142
HG PR+GLP Y+ W+EALHGV++ F D +TSFP I T A+ N +L +
Sbjct: 93 GHGAPRIGLPAYQVWNEALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQ 148
Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF-VVGRYAVNYVRG 201
I +ST+ RA N GR GL +SPNIN R P WGR ETPGED + + YA Y+ G
Sbjct: 149 IATIISTQGRAFMNAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITG 208
Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
+Q +++ PLK+ + KHYA YD++NW R D ++T+QD+ E + F
Sbjct: 209 IQG--------GVDANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQF 260
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDN 319
+ ++ SVMCSYN VNG+PSC++ L +R +D GY+ DC ++ + +
Sbjct: 261 LVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNP 320
Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
H + A ++ A A +++AG D+DCG Y + +V D+++ + LY L+R
Sbjct: 321 HGY-ATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRA 379
Query: 380 GFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL--NSAKVKTVAVV 436
G+FDG + Y ++ D+ S L+ EAA + IVLLKND LPL S+ KT+A++
Sbjct: 380 GYFDGKTSPYRNITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALI 438
Query: 437 GPHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAA 494
GP ANAT M+GNY G +SP+ F S Y +TY G + + S A A
Sbjct: 439 GPWANATTQMLGNYYGPAPYLISPLQAFQDSEY-KITYTIGTNTTTDPDSTSQSTALTTA 497
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
K AD I G+D ++E E+ DR ++ P Q LI ++A++ K P+I++ M G VD +
Sbjct: 498 KEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSS 556
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+ N N+ A++W GYPG+ GG+A+AD++ GK P RL T Y +Y ++ P M LR
Sbjct: 557 ALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLR 616
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S PG+TY +Y G +Y FG+GL YT F + + + T ++ + N
Sbjct: 617 PNGS--NPGQTYMWYTGTPVYEFGHGLFYTNFTASASAGSGT--------KNRTSFNIDE 666
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+ LV + F VD +N G + + A A K ++GF
Sbjct: 667 VLGRPHPGYKLVEQMPL---LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGF 723
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
R+ + + + SL D N +L G + + + N
Sbjct: 724 DRLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNN 768
>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 705
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 250/711 (35%), Positives = 360/711 (50%), Gaps = 96/711 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ +LVS+MTL+EK QL A +PRLG+P Y WW+EALHGV+ G AT
Sbjct: 10 KAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAGV----------AT 59
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGR-------AGLTYWSPNINVARDP 175
SFP I A+F++ L K +G AV+ E RA YN R GLT+WSPN+N+ RDP
Sbjct: 60 SFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPNVNIFRDP 119
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ R V YV GLQ + + +K ++C KH+A V +
Sbjct: 120 RWGRGHETYGEDPYLTSRLGVAYVEGLQGSQ--------DDDFMKTAACAKHFA---VHS 168
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
R+ FDA+ +++DM ET+L FE CVKE +VM +YNR NG P C P L+
Sbjct: 169 GPESVRHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGAYNRTNGEPCCGSPTLIQNI 228
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+R EWD G+ V+DC +I H + + E++ A LK+G D++CG Y + A
Sbjct: 229 LREEWDFQGHYVSDCWAI-ADFHMHHMVTKTPEESAALALKSGCDVNCGVTYLHLL-KAY 286
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
QQG V E +I ++ + L+T LG FD + +Y + + + E++ELA + A+E +VL
Sbjct: 287 QQGLVTEEEITQAAERLFTTRFLLGCFDKN-EYDDIPYEVVECKEHLELAQKMAKESMVL 345
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKND LPLN +KT+ V+GP+A++ ++GNY G RY++ + G + V
Sbjct: 346 LKND-GILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTSSRYITLLEGIQDFVGEDVRVY 404
Query: 472 YKTGCDDVACK------SNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
Y GC + + I A A+ +D ++ GLD ++E E S D
Sbjct: 405 YSEGCHIYKDRVEGLGWKQDRISEALTVAEHSDVVVLCLGLDENLEGEEGDTGNSYASGD 464
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
++DL LP Q +L+ VA K PV+L +MS +D+ FA + N AIL YPG GG
Sbjct: 465 KKDLELPESQRELLEAVAGCGK-PVVLCMMSGSAIDMQFAAEHVN--AILQVWYPGARGG 521
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
+A A+++FG +P G+LP+T+Y L P + GRTY++ LYP
Sbjct: 522 KAAAEILFGACSPSGKLPVTFYK-------DLEGFP--AFEDYSMKGRTYRYLEKEPLYP 572
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FGYGL+Y Q T ++
Sbjct: 573 FGYGLTYGQVCVKAAELTGAVEEGKE--------------------------------LT 600
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
K +N G D DV+ VY K A + F+RV ++ G I
Sbjct: 601 IKAMVENSGKYDTDDVIQVYIKDLDSKNAVPNHSLCAFKRVSLKKGEKAEI 651
>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
Length = 714
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 234/619 (37%), Positives = 348/619 (56%), Gaps = 66/619 (10%)
Query: 66 KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
K LVS+MT+DEK+ Q+ + + RLG+P+Y WW+EALHGV+ G AT F
Sbjct: 10 KKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVF 59
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
P I A+F+ L +KIG VSTE R +N GLT+W+PN+N+ RDPRW
Sbjct: 60 PQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRW 119
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
GR ET GEDP++ G+ Y+RGLQ D H LK ++C KH+A V +
Sbjct: 120 GRGHETYGEDPYLTGKLGCAYIRGLQGDDPDH----------LKSAACAKHFA---VHSG 166
Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
R+ FDA+ ++ DM +T+L F+ CVK+ +VM +YNRVNG P+C LL +
Sbjct: 167 PEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDIL 226
Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
R E+ G++V+DC +I + H + D+ E++ A + G DL+CG + + +A
Sbjct: 227 RDEFGFEGHVVSDCWAI-LDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFLHLK-DAYD 284
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVL 415
+G V + I +++ L V +RLG P Y + + + E++EL+ EAAR +VL
Sbjct: 285 KGMVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVEAARRSLVL 344
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKN N LPL+ VKT+AV+GP+AN+ A+IGNY G RY++P+ G Y V
Sbjct: 345 LKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRVL 404
Query: 472 YKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
Y GC D V + + A A+ +D ++ GLD ++E E S D
Sbjct: 405 YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGD 464
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+ L LPG Q +L+ VA V K PVILV+ + +D+++AE ++ AI+ + YPG GG
Sbjct: 465 KLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE--EHVDAIIDSWYPGARGG 521
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
+A+A+ +FG+++P G+LP+T+Y G ++P S+ + RTY++ N LYP
Sbjct: 522 KAVAEAIFGEYSPNGKLPVTFYQG-------TENLPEFTDYSMAH--RTYRYTNENVLYP 572
Query: 637 FGYGLSYTQFKYNLLSFTK 655
FGYGL Y + Y+ LS K
Sbjct: 573 FGYGLHYGETNYDGLSVDK 591
>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
Length = 716
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 228/620 (36%), Positives = 346/620 (55%), Gaps = 62/620 (10%)
Query: 58 SLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDD 117
SL K LV +MTL+EK+ Q+ + + RL +P Y WW+EALHGV+ G
Sbjct: 2 SLETKEYAKRLVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSGV------ 55
Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--GRA------GLTYWSPNI 169
AT FP I A+F+E L +KIG VSTE RA + GR GLT+W+PNI
Sbjct: 56 ----ATMFPQAIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTFWAPNI 111
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
N+ RDPRWGR ET GEDP + + Y+RG+Q + LK ++C KH+A
Sbjct: 112 NIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQGK---------DPDHLKAAACAKHFA 162
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
V + R+ FDA+V+ D+ +T+L F+ CVK+ +VM +YNRVNG P+C
Sbjct: 163 ---VHSGPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVEAVMGAYNRVNGEPACGSK 219
Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
LL +R ++ G++V+DC +I + H + + E++ A + G DL+CG+ +
Sbjct: 220 TLLQDILREQFGFEGHVVSDCWAI-LDFHEHHHVTKTVEESAAMAVNHGCDLNCGKAFL- 277
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEA 408
+ A +QG V+E I ++++ L V +RLG + P Y ++ + E+I L+ EA
Sbjct: 278 YLSRACEQGLVEEKTITEAVERLMDVRIRLGMMEDYPSPYANIPYDVVECPEHIALSLEA 337
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY- 467
++ +VLLKND + LPL +V T+AV+GP+AN+ A++GNY G RY++P+ G Y
Sbjct: 338 SKRSMVLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGNYEGTSSRYITPLEGIQEYT 397
Query: 468 ---ANVTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE----- 513
V Y GC + + + A AA+ AD ++ GLD +E E
Sbjct: 398 GEKTRVLYAQGCHLYKDQVEFLGEPKDRFKEALIAAERADVIVMCLGLDAGIEGEEGDAG 457
Query: 514 ----SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
S D+ L LPG Q +L+ VA V K P++L +++ +D+++A+ + I+AIL
Sbjct: 458 NEYASGDKLGLKLPGLQQELLEAVAAVGK-PIVLTVLAGSALDLSWAQEHAQIRAILDCW 516
Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
YPG GG+AIA+ +FG+F+P G+LP+T+Y G + LP GRTY++
Sbjct: 517 YPGARGGKAIAEALFGEFSPCGKLPVTFYEG--TEFLP-------DFTDYSMAGRTYRYT 567
Query: 630 NGPTLYPFGYGLSYTQFKYN 649
+ LYPFGYGL+Y+Q +Y+
Sbjct: 568 DRHVLYPFGYGLTYSQIRYS 587
>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
Length = 798
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 262/765 (34%), Positives = 393/765 (51%), Gaps = 48/765 (6%)
Query: 25 VDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF 84
++ G+S P C+ G SK L CD+S R LVS +T +E V +
Sbjct: 42 LETGGTSFPD--CESGPLSKT-------LVCDTSAKPHDRAAALVSLLTFEELVNNTANT 92
Query: 85 AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKK 142
HG PR+GLP Y+ W+EALHGV++ F D +TSFP I T A+ N +L +
Sbjct: 93 GHGAPRIGLPAYQVWNEALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQ 148
Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF-VVGRYAVNYVRG 201
I +ST+ RA N GR GL +SPNIN R P WGR ETPGED + + YA Y+ G
Sbjct: 149 IATIISTQGRAFMNAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITG 208
Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
+Q +++ PLK+ + KHYA YD++NW R D ++T+QD+ E + F
Sbjct: 209 IQG--------GVDANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQF 260
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDN 319
+ ++ SVMCSYN VNG+PSC++ L +R +D GY+ DC ++ + +
Sbjct: 261 LVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNP 320
Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
H + A ++ A A +++AG D+DCG Y + +V D+++ + LY L+R
Sbjct: 321 HGY-ATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRA 379
Query: 380 GFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL--NSAKVKTVAVV 436
G+FDG + Y ++ D+ S L+ EAA + IVLLKND LPL S+ KT+A++
Sbjct: 380 GYFDGKTSPYRNITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALI 438
Query: 437 GPHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAA 494
GP ANAT M+GNY G +SP+ F S Y +TY G + + S A A
Sbjct: 439 GPWANATTQMLGNYYGPAPYLISPLQAFQDSEY-KITYTIGTNTTTDPDSTSQSTALTTA 497
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
K AD I G+D ++E E+ DR ++ P Q LI ++A++ K P+I++ M G VD +
Sbjct: 498 KEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSS 556
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+ N N+ A++W GYPG+ GG+A+AD++ GK P RL T Y +Y ++ P M LR
Sbjct: 557 ALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLR 616
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S PG+TY +Y G +Y FG+GL YT F + + + T ++ + N
Sbjct: 617 PNGS--NPGQTYMWYTGTPVYEFGHGLFYTNFTASASAGSGT--------KNRTSFNIDE 666
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+ LV + F VD +N G + + A A K ++GF
Sbjct: 667 VLGRPHPGYKLVEQMPL---LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGF 723
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
R+ + + + SL D N +L G + + + N
Sbjct: 724 DRLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNN 768
>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
Length = 709
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 248/723 (34%), Positives = 376/723 (52%), Gaps = 97/723 (13%)
Query: 66 KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
+ +VSRMTLDEK+ Q+ A +PRL +P+Y WW+EALHGV+ G AT F
Sbjct: 15 RRIVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGI----------ATVF 64
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYWSPNINVARDPRW 177
P I A F+ + ++IG +STE RA YN GLT+WSPN+N+ RDPRW
Sbjct: 65 PQAIGLAAMFDSDMMERIGAVISTEGRAKYNEAVRHGDRDIYKGLTFWSPNVNIFRDPRW 124
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
GR ET GEDP++ R AV ++RG+Q + + LK ++C KH+A V +
Sbjct: 125 GRGQETYGEDPYLTARLAVAFIRGIQG----------DGKYLKAAACAKHFA---VHSGP 171
Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
R+ FDARV+++D+ ET+L F+ VKE VM +YNRVNG+P+CA +LL+ +R
Sbjct: 172 EALRHEFDARVSQKDLHETYLSAFKAAVKEAQVEIVMGAYNRVNGVPACASHELLSDILR 231
Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
EW G++V+D ++++ + +H ++AD +A LKAG +L G+ + ++V +
Sbjct: 232 SEWGFEGHVVSDYEALEDIFKHHHYVADEAH-TMAVALKAGCNLCAGKIARHLR-SSVDE 289
Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
G + E +I ++++ L+T + +G Y S+G ++ + E+ +LA EAA VLLK
Sbjct: 290 GLISEDEITEAVERLFTTRIMMGMMADDCPYDSIGYEENDTPEHHQLAVEAASRSFVLLK 349
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYK 473
ND LPL K+ ++AV+GP+AN+ + GNY G RY++ + G V Y
Sbjct: 350 ND-GLLPLEMEKISSIAVIGPNANSRKMLEGNYNGTASRYVTVLEGIQDLVGDSVRVWYS 408
Query: 474 TGC------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLDRE 518
GC N+ + A AA+ AD ++ GLD ++E E S D+
Sbjct: 409 EGCHLYKNFHSSLSGRNDRLAEAVSAAQHADVVVLCLGLDATLEGEEGDVEVGFGSGDKP 468
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
+L LPG Q L++ + V K PVIL++ S + + E + N+KAIL YPG GG+A
Sbjct: 469 NLSLPGRQQLLLDTMLTVGK-PVILLLASGSALTLGGRENDENLKAILQIWYPGAMGGKA 527
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+ADV+FG+ P G+LP+T+Y ++ L + GRTY++ G LYPFG
Sbjct: 528 VADVLFGRRAPAGKLPVTFYA---------SADELPAFEDYSMAGRTYRYMKGNALYPFG 578
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
YGL+Y+ C ++ + KT GV E
Sbjct: 579 YGLTYSP---------------------C-SIVSAGISGKTADGGV-----------EIT 605
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
VD +N G +VV VY K A + GF+R+ + G V ++
Sbjct: 606 VDIRNDGGRTTEEVVQVYVKDMDSPLAVINHALAGFRRITLAPGEKTSRTIVIEP-EAFT 664
Query: 759 IVD 761
+VD
Sbjct: 665 VVD 667
>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 1425
Score = 400 bits (1029), Expect = e-108, Method: Compositional matrix adjust.
Identities = 260/757 (34%), Positives = 388/757 (51%), Gaps = 95/757 (12%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ F + L RV DLVSR+TL+EKV+Q+ + A + RLG+P Y WW+E LHGV
Sbjct: 712 YPFRNPQLSIEQRVDDLVSRLTLEEKVRQMLNNAPAIKRLGIPAYNWWNECLHGVGR--- 768
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLT 163
T + T FP I AS+N+ L K++ +++ E RA+YN + LT
Sbjct: 769 -TKYH-----VTVFPQAIGMAASWNDVLMKEVASSIADEGRAIYNDAQKRGDYSQYHALT 822
Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
YW+PNIN+ RDPRWGR ET GEDP++ + +V GLQ + R LK S+
Sbjct: 823 YWTPNINIFRDPRWGRGQETYGEDPYLTSKIGKAFVLGLQGDD---------PRYLKASA 873
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
C KHYA V + +R+ F++ V+ D+ +T+L F V + + S VMC+YN G
Sbjct: 874 CAKHYA---VHSGPEKNRHSFNSDVSTYDLWDTYLPAFRTLVVDANVSGVMCAYNAFKGQ 930
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
P C + L+ +R +W+ GY+ +DC +I + ++HK D+ A G DLDC
Sbjct: 931 PCCGNDLLMQSILRDKWNFKGYVTSDCGAIDDIFNHHKAHPDAATAAADAVFH-GTDLDC 989
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDEN 401
GQ AV+ G + E +D S+K L+T+ RLG FD + Q Y + + ++
Sbjct: 990 GQSAYLALVKAVKNGIITEKQLDVSVKRLFTIRFRLGLFDPAEQVDYAHIPISVLECKKH 1049
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
+LA + ARE +VLLKND+ LPL K+K V V+GP+A+ A++GNY G P R ++P+
Sbjct: 1050 QDLAKQLARESMVLLKNDR-LLPLQKNKLKKVVVMGPNADCKDALLGNYNGHPSRMLTPL 1108
Query: 462 AG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-- 515
G A V Y +G D + S + + AK ADA I + G+ +E E +
Sbjct: 1109 QAIRERLKGVAEVVYVSGIDYINTVSEDELKRYVNQAKGADAVIFIGGISPRLEGEEMSV 1168
Query: 516 --------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
DR + LP QTQL+ + + P + V+M+ + I + + + AIL
Sbjct: 1169 NKDGFDGGDRTSIALPTVQTQLMKALV-AGRIPTVFVMMTGSALAIPWEAKH--VPAILN 1225
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
A Y G+ GG AIADV+FG +NP G+LP+T+Y D + +P +S GRTY+
Sbjct: 1226 AWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKD-------SDLP--DFESYDMQGRTYR 1276
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
++ G LYPFGYGLSYT F+Y+ L T C
Sbjct: 1277 YFKGKALYPFGYGLSYTDFRYSSLKMP------------------------TACN----- 1307
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
D V +N G DG +VV +Y P + + + GF+R++++AG K+I
Sbjct: 1308 --TTDKEIPVTVTVKNTGKMDGEEVVQLYVSHPDKKILVPVTALKGFKRIYLKAGEAKQI 1365
Query: 748 KFVFNACKSLNIVDY-AANTLLPAGEHTIFVGNGGVS 783
F ++ + L+ VD +LP T+ + GG S
Sbjct: 1366 TFSLSS-EDLSCVDENGIRKVLPG---TVKIQVGGCS 1398
>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
Length = 714
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 234/619 (37%), Positives = 348/619 (56%), Gaps = 66/619 (10%)
Query: 66 KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
K LVS+MT+DEK+ Q+ + + RLG+P+Y WW+EALHGV+ G AT F
Sbjct: 10 KKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVF 59
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
P I A+F+ L +KIG VSTE R +N GLT+W+PN+N+ RDPRW
Sbjct: 60 PQAIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRW 119
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
GR ET GEDP++ G+ Y+RGLQ D H LK ++C KH+A V +
Sbjct: 120 GRGHETYGEDPYLTGKLGCAYIRGLQGDDPDH----------LKSAACAKHFA---VHSG 166
Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
R+ FDA+ ++ DM +T+L F+ CVK+ +VM +YNRVNG P+C LL +
Sbjct: 167 PEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDIL 226
Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
R E+ G++V+DC +I + H + D+ E++ A + G DL+CG + + +A
Sbjct: 227 RDEFGFEGHVVSDCWAI-LDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFLHLK-DAYD 284
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVL 415
+G V + I +++ L V +RLG P Y + + + E++EL+ EAAR +VL
Sbjct: 285 KGLVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVEAARRSLVL 344
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKN N LPL+ VKT+AV+GP+AN+ A+IGNY G RY++P+ G Y V
Sbjct: 345 LKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGDDTRVL 404
Query: 472 YKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
Y GC D V + + A A+ +D ++ GLD ++E E S D
Sbjct: 405 YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGD 464
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+ L LPG Q +L+ VA V K PVILV+ + +D+++AE ++ AI+ + YPG GG
Sbjct: 465 KLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE--EHVDAIIDSWYPGARGG 521
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
+A+A+ +FG+++P G+LP+T+Y G ++P S+ + RTY++ N LYP
Sbjct: 522 KAVAEAIFGEYSPSGKLPVTFYQG-------TENLPEFTDYSMAH--RTYRYTNENVLYP 572
Query: 637 FGYGLSYTQFKYNLLSFTK 655
FGYGL Y + Y+ LS K
Sbjct: 573 FGYGLHYGETNYDGLSVDK 591
>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
Gv29-8]
Length = 722
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 268/724 (37%), Positives = 385/724 (53%), Gaps = 58/724 (8%)
Query: 72 MTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG-------THFDDVIPGATS 124
+TLDEK L + A GV RLGLP YEW +EALHG++ V PG T + +T
Sbjct: 12 LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINSTFTQGNVAFNSSTQ 71
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP+ I+ A+F++ L I AVSTEARA N +AGL YW+PNIN RDPRWGR ETP
Sbjct: 72 FPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKAGLDYWAPNINPYRDPRWGRGQETP 131
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP+ V +YA NYV GL+ G + KV S CKH+A YD+++ GV R +
Sbjct: 132 GEDPYHVAQYAYNYVVGLKGGVGPAKS--------KVVSTCKHFAGYDIEDSDGVVRGSY 183
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
+A ++ QD+ E +L F C ++ +VMCSYN VNG PSCA+ +L+ +R W
Sbjct: 184 NAIISTQDLAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSCANSYMLDTVLRDHWGWGS 243
Query: 305 ---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
++ DC ++ + + H + S VA + G DLDCG Y + +AVQ
Sbjct: 244 SAHWVTGDCGAVDGVFNQHH-VGQSAAQGVAFAINNGTDLDCGTAYASNIASAVQNNYTT 302
Query: 362 ETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKND 419
E +D++L LY+ L+ LG+FD +Y +LG D+ + +LA A EGI
Sbjct: 303 EAQLDQALSRLYSSLIVLGYFDPPEGQEYRTLGVSDVNTPSTQKLAYTALVEGI------ 356
Query: 420 QNTLPLNSAKVKTVAVVGPHA-NATVAMIGNYAGI-PCRYMS-PIAGFSGYA-NVTYKTG 475
N LP+ +TV VGP A NA+V+M GNY G+ P + + P A S Y NVTY G
Sbjct: 357 -NILPIRPMG-QTVLFVGPWANNASVSMFGNYNGVAPYKTIPVPTANSSAYNWNVTYSQG 414
Query: 476 CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAE 535
V + AA AA+ AD + + G+D VEAE+ DR + PG Q LI Q+A
Sbjct: 415 LQYVLSNDTSQFAAAVSAAQEADVVVYIGGIDEQVEAEAHDRTSIDWPGAQLNLIKQLAA 474
Query: 536 VAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPI 595
V PV++V + G VD + N N+K +LW GYPG+E G + D++ G P GRLP+
Sbjct: 475 VK--PVVVVQVGGGQVDDSSLLQNKNVKGLLWMGYPGQEFGSGLIDILSGASAPAGRLPV 532
Query: 596 TWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTK 655
T Y +Y+ +P+T LRP S PGRTY++YNG ++ PFG G+ YT+F
Sbjct: 533 TQYPANYITQVPMTDQSLRPSSS--NPGRTYRWYNG-SVIPFGTGIHYTKFN-------- 581
Query: 656 TIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIV 715
++ R T+D P L ++ F+++ +NVGST V ++
Sbjct: 582 ---ISWKTGGSGRGTYDTADFINAEDPKDLA------EFDVFQINVENVGSTTSDYVALL 632
Query: 716 YSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
+ K Y +K ++ + R + G +I N + + D + N +L G +
Sbjct: 633 FVKSSDSGPQPYPLKTLVSYARAHGTQPGETTKIDLRVNVGQ-IARNDSSGNLVLYPGAY 691
Query: 774 TIFV 777
T+ +
Sbjct: 692 TLEI 695
>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
Length = 723
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 257/743 (34%), Positives = 389/743 (52%), Gaps = 103/743 (13%)
Query: 66 KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
K+LV++MTL EK +QL + V RL +P+Y WW+E LHGV+ G AT F
Sbjct: 30 KELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAGT----------ATVF 79
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
P I A F+E K+ ++TE RA YN GLTYWSPN+N+ RDPRW
Sbjct: 80 PQAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSPNVNIFRDPRW 139
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
GR ET GEDP++ R V +V+GLQ + + LK+S+C KH+A V +
Sbjct: 140 GRGHETYGEDPYLTSRLGVAFVKGLQG----------DGKYLKLSACAKHFA---VHSGP 186
Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
R+ F+A V+++D+ ET+L FE CVKE + SVM +YNR NG P C LL +R
Sbjct: 187 ESLRHEFNAVVSQKDLHETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKALLKDILR 246
Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
G+W G++V+DC ++ +HK + + E +VA ++ G DL+CG Y N A ++
Sbjct: 247 GKWGFKGHVVSDCWALADFHMHHKVTSTATE-SVALAIENGCDLNCGNMYLNLL-LAYKE 304
Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
G V E I + + L T +LG FD +Y + + E+ +++ EA+R+ +VLLK
Sbjct: 305 GLVTEEQITTAAERLMTTRFKLGMFDEDCEYNQIPYEVNDCKEHNQVSLEASRKSMVLLK 364
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN----VTYK 473
N+ LPL+ +K+K VAV+GP+AN+ + + GNY+G +Y + + G + V Y
Sbjct: 365 NN-GILPLDKSKLKAVAVIGPNANSEIMLKGNYSGTASKYTTILDGIHDVLDDDVRVYYS 423
Query: 474 TGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLDR 517
GC +D+A + ++ + A A+ AD I+ GLD ++E E + D+
Sbjct: 424 EGCHLYKEKVEDLA-RRDDRLAEAVSVAERADVVILCLGLDSTIEGEQGDAGNGYGAGDK 482
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
DL LPG Q +L+ +V E K PV++V+ + G+ + AE AIL A YPG GG
Sbjct: 483 LDLNLPGIQQELLEKVLETGK-PVVVVLGTGSGLTLNGAEERC--AAILNAWYPGSHGGT 539
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT-LYP 636
A AD++FGK +P G+LP+T+Y D ++ T ++ GRTY++ + LYP
Sbjct: 540 AAADILFGKCSPSGKLPVTFYK-DTDKLPEFTDYAMK--------GRTYRYMDESNCLYP 590
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD-DYF 695
FGYGL+Y+ T + S + P V R + D
Sbjct: 591 FGYGLTYS----------------------------TVELSNLQVPAV-----RGEFDGI 617
Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
+ V+ +N GS D +VV Y K A + GF+RV ++ G +K + N +
Sbjct: 618 DISVEIENTGSYDIEEVVQCYIKDLESKYAVLNHSLAGFKRVSLKKGESKTVTMKLNR-R 676
Query: 756 SLNIVDYAANTLLPAGEHTIFVG 778
+ VD A +L + + +FVG
Sbjct: 677 AFEAVDDAGERILDSKKFKLFVG 699
>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
17629]
Length = 714
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 233/619 (37%), Positives = 348/619 (56%), Gaps = 66/619 (10%)
Query: 66 KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
K LVS+MT+DEK+ Q+ + + RLG+P+Y WW+EALHGV+ G AT F
Sbjct: 10 KKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVF 59
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
P I A+F+ L +KIG VSTE R +N GLT+W+PN+N+ RDPRW
Sbjct: 60 PQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRW 119
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
GR ET GEDP++ G+ Y+RGLQ D H LK ++C KH+A V +
Sbjct: 120 GRGHETYGEDPYLTGKLGCAYIRGLQGDDPDH----------LKSAACAKHFA---VHSG 166
Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
R+ FDA+ ++ DM +T+L F+ CVK+ +VM +YNRVNG P+C LL +
Sbjct: 167 PEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDIL 226
Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
R E+ G++V+DC +I + H + D+ E++ A + G DL+CG + + +A
Sbjct: 227 RDEFGFEGHVVSDCWAI-LDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFLHLK-DAYD 284
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVL 415
+G V + I +++ L V +RLG P Y + + + E++EL+ EAAR +VL
Sbjct: 285 KGLVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVEAARRSLVL 344
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKN N LPL+ VKT+AV+GP+AN+ A+IGNY G RY++P+ G Y V
Sbjct: 345 LKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRVL 404
Query: 472 YKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
Y GC D V + + A A+ +D ++ GLD ++E E S D
Sbjct: 405 YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGD 464
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+ L LPG Q +L+ VA V K PVILV+ + +D+++AE ++ AI+ + YPG GG
Sbjct: 465 KLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE--EHVDAIIDSWYPGARGG 521
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
+A+A+ +FG+++P G+LP+T+Y G ++P S+ + RTY++ N LYP
Sbjct: 522 KAVAEAIFGEYSPSGKLPVTFYQG-------TENLPEFTDYSMAH--RTYRYTNENVLYP 572
Query: 637 FGYGLSYTQFKYNLLSFTK 655
FGYGL Y + Y+ +S K
Sbjct: 573 FGYGLHYGETNYDGMSVDK 591
>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 718
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 268/767 (34%), Positives = 395/767 (51%), Gaps = 99/767 (12%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
FS + SF D SL R + +V ++T++EK+ QL + A V RL +P+Y+WW+E
Sbjct: 7 FSLKAQEDCSFRNPDISL--DERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWWNE 64
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL---- 157
LHGV+ G AT FP I A+++ +L ++G A+STEARA YN+
Sbjct: 65 CLHGVARAGR----------ATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNVFSKH 114
Query: 158 ----GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
GLT+W+PN+N+ RDPRWGR ET GEDPF+ R V++V+GLQ G+
Sbjct: 115 GYRGQYKGLTFWTPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQ---GN----- 166
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
+ + LKV++ KHYA V N R+ FDA+V+ +D+ ET+L FE VKE V
Sbjct: 167 -HPKYLKVAALAKHYA---VHNGPEALRHEFDAKVSMKDLWETYLPAFEALVKEAGVEGV 222
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
M +YNR NG P CA P L+ + +R +W GY V+DC +I HK + D+ E+A A
Sbjct: 223 MGAYNRTNGDPCCAHPYLMQEVLREKWGFDGYYVSDCGAIMDFYTGHK-IVDTPEEAAAM 281
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF--DGSPQYVSL 391
L AG +L+CG Y + ++++G E +ID+S+K L+ +RLG F +G+ Y ++
Sbjct: 282 ALNAGCNLNCGDTYASLL-KSLEKGLTTEEEIDRSVKQLFKTRLRLGLFAPEGAVPYDTI 340
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
I S E+ +LA EAAR+ +VLLKN+ NTLP+ + VK V V GP A A++ NY
Sbjct: 341 STDVIRSKEHQKLALEAARKSVVLLKNEANTLPV-ARDVKKVYVTGPTATHVQALLANYY 399
Query: 452 GIPCRYMSPIAGFSG----YANVTYKTGCDDVACKSN-NSIFAASEAAKTADATIILAGL 506
G+ + + G G +V Y+ G + ++N N++ S AA +AD T+ G+
Sbjct: 400 GVSEDMTTILEGIVGKVSPQTSVQYRQGA--LLYEANRNTMDWFSGAAASADVTVACLGI 457
Query: 507 DLSVEAES---------LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
+E E DRE LP Q + ++ AK LV++ G I+ E
Sbjct: 458 SQLIEGEEGEAIASEHRGDRERTRLPQNQIDFLKRIRASAKK---LVVVITSGSAISLPE 514
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
A+L+ YPGE+GG+A+ADV+FG P GRLP+T V LP P +
Sbjct: 515 IYDMADALLYVWYPGEQGGKAVADVLFGDAVPSGRLPVTVVKS--VDDLP-------PYE 565
Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
+ GRTY++ +PFG+GLSYT F Y+ L+
Sbjct: 566 NYDMKGRTYRYMEVSPQFPFGFGLSYTDFTYSNLTLES---------------------- 603
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ-VIGFQR 736
N ++ + D N G D +VV Y E + KQ +IGF+R
Sbjct: 604 ---------NKVKSGESVRLSFDLTNEGEYDADEVVQFYIT-DVEASVNVPKQSLIGFKR 653
Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
V + AG + +I+F + IVD +L +GE I++G S
Sbjct: 654 VGLAAGESTKIEFTVTP-DMMKIVDNNGEKILESGEFKIYIGGSSYS 699
>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
Length = 729
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 260/758 (34%), Positives = 387/758 (51%), Gaps = 95/758 (12%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q + + D+SL + R+ LV MTL EK+ QL + V RL +P+Y WW+EALHGV+
Sbjct: 20 QKKAQKWLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVA 79
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-------- 159
G +T FP I A+F+ L K++ A+S EARA +N+ +
Sbjct: 80 RNGK----------STVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQY 129
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
AGLT+W+PN+N+ RDPRWGR ET GEDP++ + V +V+GLQ G+ + + L
Sbjct: 130 AGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQ---GN------HPKYL 180
Query: 220 KVSSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
K ++C KH+A + G + R+HF+A +++D+ ET+L FE VK+ + VM +Y
Sbjct: 181 KSAACAKHFAVHS-----GPEELRHHFNANPSKKDLYETYLPAFEALVKQANVEGVMSAY 235
Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
N V G+P+ + LL +T+R W GYIV+DC ++ + HK + E A A LKA
Sbjct: 236 NAVYGVPAGSSEFLLKETLRKSWGFDGYIVSDCGALGDIFKGHKQVKTMPE-AAAVALKA 294
Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQD 395
G++L+CG Y AVQQG V E ID LK L +LGFFD Y ++
Sbjct: 295 GVNLNCGYVYNGALEKAVQQGLVSEELIDTRLKQLLKTRFKLGFFDPKEANPYNAIPTSV 354
Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
I SD++I LA + A++ IVLLKN +TLPL+ +K V GP A+++ ++ NY G+
Sbjct: 355 IHSDDHIALARKTAQKSIVLLKNKNHTLPLDK-NIKVPYVTGPFASSSDVLLANYYGMTT 413
Query: 456 RYMSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
+S + G + ++ Y+ G K+ N A AKTADA I + GL E
Sbjct: 414 NLVSVLEGIADKVSLGTSLNYRMGALPF-NKNLNPKNWAPNVAKTADAVIAVVGLSADFE 472
Query: 512 AESL---------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
E + D++DL LP Q + ++A KGP+ILV+ A G +A E
Sbjct: 473 GEEVDAIASPNKGDKKDLKLPQNQIDYVKEMAAKKKGPLILVV--ASGSAVALGELYDLA 530
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
AI+ YPGE+GG A+ADV+FG +P G LP+T+ P + L P +
Sbjct: 531 DAIVLMWYPGEQGGNAVADVLFGDVSPSGHLPVTF---------PKSVAQLPPFEDYSMQ 581
Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
GRTYK+ L+PFG+GLSYT FK++ +Q++ K
Sbjct: 582 GRTYKYMEEEPLFPFGFGLSYTDFKFS------NVQISEEK------------------- 616
Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
++ D F N G DG +VV +Y P Q++ F+R+ ++
Sbjct: 617 ------IKKKDSFTVSCSVANNGKVDGEEVVQLYLVPLNSNKDLPKYQLLKFKRIEIQKN 670
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
+K + F A K L V+ G++ + V N
Sbjct: 671 TSKTVSFNLEA-KDLFQVNKEGKKTWIKGKYKLVVANA 707
>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
Length = 705
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 250/773 (32%), Positives = 386/773 (49%), Gaps = 116/773 (15%)
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
Y + ++LV++MTL+EK QL + + RLG+P Y WW+EALHGV+ G
Sbjct: 7 YKKKAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAGT--------- 57
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVA 172
AT FP I A F++ KI A++ EARA YN GLT WSPNIN+
Sbjct: 58 -ATVFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWSPNINIF 116
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ET GEDPF+ G+ V +++GLQ + + ++C KH+AAY
Sbjct: 117 RDPRWGRGHETYGEDPFLSGKLGVAFIKGLQG----------DKDVMMTAACVKHFAAYS 166
Query: 233 VDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
G + R+ F+A VT++D+ ET+L FE CVK+ +VM YNR NG P C
Sbjct: 167 -----GPEDLRHGFNAEVTKKDLWETYLPAFETCVKDAKVEAVMGGYNRTNGEPCCGSYT 221
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL +R +W G++V+DC +I+ +H + + E++VA + AG DL+CG Y
Sbjct: 222 LLRDILREKWGFEGHVVSDCWAIKDFHTDH-MVTKTPEESVALAIDAGCDLNCGNMYLML 280
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAR 410
A+Q+G + E I ++ ++T +LG F+GS ++ ++ + + E+ E+A EAAR
Sbjct: 281 L-IALQEGLITEEHITRAAVRIFTTRFKLGLFEGS-EFDNIPYEVVECSEHKEMAIEAAR 338
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----G 466
+ VLLKND LP+N +KT+ V+GP+AN+ +A+ GNY G RY++ + G
Sbjct: 339 KSAVLLKND-GILPINKGAIKTIGVIGPNANSRIALKGNYHGTSSRYITLLEGIQDEVGD 397
Query: 467 YANVTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE------- 513
V Y GC+ +V +N+ + A A+ +D ++ GLD ++E E
Sbjct: 398 EVRVLYSNGCELVKDRTEVLAYANDRLAEAVTVAEHSDLVVLCLGLDETIEGEQSDEGNN 457
Query: 514 --SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
S D++DL LP Q L+ ++ K P +L +M+ +++++A + N IL YP
Sbjct: 458 GGSGDKKDLDLPEVQKSLLEKIVATGK-PTVLCLMAGSAINLSYAHEHCN--GILLTWYP 514
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G GG+A+AD++FG +P G+LP+T+Y L ++P P+ RTY++
Sbjct: 515 GARGGKAVADILFGNASPSGKLPVTFYRS-------LDNLP--PITDYSMKNRTYRYIEE 565
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
LYPFGYGL+Y + + T+++ +
Sbjct: 566 APLYPFGYGLTYGDVELKHVEIKGTVEIEKD----------------------------- 596
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
V QN GS +VV Y K + A + F RV + A K++
Sbjct: 597 ---IYITVTLQNRGSVAVEEVVQAYIKDEQSMYAVTNTSLCAFMRVGLGANEEKQVSMRI 653
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGNGG-------------VSFPIHLNFN 791
SL +V+ +L + + T+F G G +S I L FN
Sbjct: 654 -PFDSLKVVNLDGEKVLDSKKFTLFAGLCGPDKRSVELTGKEPISILIELEFN 705
>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
Length = 785
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 258/734 (35%), Positives = 380/734 (51%), Gaps = 40/734 (5%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L CD + R L S MTL+E + G+ +PRLGLP Y+ W+EALHG+
Sbjct: 60 LVCDRTASAHDRAAALTSMMTLEELMNSTGNRIPAIPRLGLPPYQIWNEALHGLYLA--- 116
Query: 113 THFDDVIP--GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
+F + P +TSFP+ ILT A+ N +L +I Q ++T+ RA N GR GL +SPNIN
Sbjct: 117 -NFTESGPFSWSTSFPSPILTMATLNRTLIHQIAQIIATQGRAFNNAGRYGLNAFSPNIN 175
Query: 171 VARDPRWGRITETPGEDP-FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
R P WGR ETPGED + YA Y+ GLQ NAT+ K+ + KHYA
Sbjct: 176 AFRHPVWGRGQETPGEDANCLCSAYAYEYITGLQG-----NATNP-----KIIATAKHYA 225
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
YD++NW+ R+ D +T+QD+ E F F + V++ SVM SYN VNG+PS A+
Sbjct: 226 GYDIENWRQRSRFGNDLNITQQDLAEYFTPQFVVAVRDAQVRSVMPSYNAVNGVPSSANT 285
Query: 290 KLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
LL VR W GY+ +DCD++ + + H + A+ A A +L+AG D+DCG Y
Sbjct: 286 FLLQTLVRDSWGFIQDGYMASDCDAVYNVFNPHGYAAN-LSSASAMSLRAGTDIDCGISY 344
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAA 406
++ QG++ ++I++++ Y+ L+ G+FDG Y L D+ +A
Sbjct: 345 LTTLNESLTQGQISRSEIERAVTRFYSNLVSAGYFDGPDAPYRDLSWSDVVRTNRWNVAY 404
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
EAA G+VLLKND LPL S V+ VA++GP ANAT M GNY G+ SP+A
Sbjct: 405 EAAVAGVVLLKND-GVLPL-SKSVQRVALIGPWANATEQMQGNYHGVAPYLTSPLAAVQA 462
Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
V Y G ++ N AA AA+ +D I G+D ++EAE LDR ++ PG
Sbjct: 463 SGLEVNYAFGT-NITSNVTNCFAAALAAAEKSDIIIFAGGIDNTLEAEELDRANITWPGN 521
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +LI+++ E+ K P++++ M G VD + + + + A+LW GYPG+ GG+A+ D++ G
Sbjct: 522 QLELIHRLGELGK-PLVVLQMGGGQVDSSALKASEKVGALLWGGYPGQAGGQALWDILTG 580
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
+ P GRL T Y +Y P T M LRP PG+TY +Y G +Y FG+GL YT
Sbjct: 581 QRAPAGRLTTTQYPAEYALQFPATDMSLRPRGD--NPGQTYMWYTGEPVYAFGHGLFYTT 638
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F L + + R+ + + ++ LV L + F V N G
Sbjct: 639 FATALAGPGQEPE---------RSFDIGALLARPHAGYNLVEQL---PFLNFTVKVTNTG 686
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
+ ++ A K ++GF R+ R V + SL D N
Sbjct: 687 EVISDYTAMAFANTTAGPRPHPNKWLVGFDRIGPLDPRVSARMSVPVSLDSLARTDAQGN 746
Query: 766 TLLPAGEHTIFVGN 779
++ G + + + N
Sbjct: 747 RVIYPGPYELALNN 760
>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
Length = 905
Score = 397 bits (1021), Expect = e-107, Method: Compositional matrix adjust.
Identities = 269/764 (35%), Positives = 385/764 (50%), Gaps = 114/764 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ DSS P +R DL+ RM+L EKV QL + A G+PRLGLP Y++W+EA HG++N G
Sbjct: 204 IWRDSSKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEAAHGIANNGI- 262
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGR--------AGL 162
AT FP I A++N +L + G + E RA +N R GL
Sbjct: 263 ---------ATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRHNGDSKWWTGL 313
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
TYW+PNIN+ RDPRWGR ET GEDPF+ + +V+G+Q + R +
Sbjct: 314 TYWAPNINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQGDD---------PRYMLAM 364
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+C KHYA V + R+ F+A + E+D+ +T+L FE V+EG + VM +YN VNG
Sbjct: 365 ACAKHYA---VHSGPERTRHSFNAEIPERDLFDTYLPHFERVVREGKVAGVMSAYNAVNG 421
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDL 341
+P+ A+ LL + +R W GY+ +DCD+I+ + + + E+A A +KAG +L
Sbjct: 422 VPASANSFLLTELLRKRWGFEGYVPSDCDAIRDIYGEKQHHYVKTAEEAAALAVKAGCNL 481
Query: 342 DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY----VSLGKQDIC 397
CG Y N AVQQG V E D+D +L + RLG FD + Q +L D+
Sbjct: 482 CCGGDY-NALVRAVQQGLVTEKDLDGALYHTLWTRFRLGLFDPAEQVPFSGYTLKDNDLP 540
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
+ + L E AR+ IVLLKND TLPL+ K+K +AV+GP+A + + GNY G R
Sbjct: 541 AHSQVAL--ELARQAIVLLKND-GTLPLDRTKLKQIAVIGPNAASKSMLEGNYHGSASRS 597
Query: 458 MSPIAGFSGYAN------------VTYKTGCDDVACKSNNS-------IFAASEAAKTAD 498
+S + VT K G + + N + A + A AD
Sbjct: 598 ISILDDIRNLVGSEIKITHAMGSPVTTKPGTAPWSGQDNTTDRPVAELKAEALKLAAEAD 657
Query: 499 ATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
A I + G+ + E ES DRE + LP Q LI + K PV++V S G +A
Sbjct: 658 AIIYVGGITPAQEGESFDRESIELPSEQEDLIRALHATGK-PVVMVNCS--GSAMALTWQ 714
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
+ N+ AI+ A YPG+EGGRA+A+V+FG+ NP G LPIT+Y ++ L
Sbjct: 715 DENLPAIVQAWYPGQEGGRAVAEVLFGETNPSGHLPITFYR---------STADLPDFSD 765
Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
RTY+++ G LY FG+GLSY+ F+Y NL A+
Sbjct: 766 YSMKNRTYRYFTGRPLYAFGHGLSYSTFEYA-------------------NLRVAPAAN- 805
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
G L L D N G DG DVV +Y+ PPA ++ + GF+R
Sbjct: 806 ----GALTVTL----------DLTNSGKRDGDDVVQLYATPPASSQPQELRALCGFRRTH 851
Query: 739 VRAGRNKRIKFVFNAC--KSLNIV--DYAANTLLPAGEHTIFVG 778
V+AG + + A + +I DYA +P+G+ TI G
Sbjct: 852 VKAGETRTVTVTVPAVALRRWDIAKKDYA----IPSGDWTIAAG 891
>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 792
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 237/628 (37%), Positives = 349/628 (55%), Gaps = 45/628 (7%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ + CD + + R LV M LDEK++ L + + G PR+GLP YEWWSEALHGV+
Sbjct: 35 LSTNIVCDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAK 94
Query: 109 VGPGTHFDD----VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
PG F+ ATSF I+ +A+F++ L + + +STEARA N G AGL +
Sbjct: 95 -SPGVTFNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDW 153
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
W+PNIN +DPRWGR ETPGED + +Y +RGL+ +D +R K+ +
Sbjct: 154 WTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLE-------GSDPTTR--KMVAN 204
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV---- 280
CKHYAA D++ W GV RY+FDA VT QD+ E +L F+ C ++ + S MC+YN +
Sbjct: 205 CKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKG 264
Query: 281 -----NGIPSCADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
NG P CA L+N +R W + + +I +DC+++ M + H + +D++E+A
Sbjct: 265 KDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHW-SDTREEAAG 323
Query: 333 QTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYV 389
AG D C Y A +G + E +D++LK LY L+R G+FDG Y
Sbjct: 324 SAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDAPYR 383
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN----SAKVKTVAVVGPHANATVA 445
++ D+ + E +LA +A EG+VL KN+ LP+ K KTVA++G +
Sbjct: 384 NITWADVNTPEARKLAHRSAVEGMVLTKNN-GVLPIKLEELQKKGKTVALIGNWVDNGEQ 442
Query: 446 MIGNYAGIPCRYMSPIAGFSG--YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
M+G Y+GI +P+A VT + ++ A AA AD +
Sbjct: 443 MLGTYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYF 502
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
G+DLSVEAE DR L P Q +L++ ++ + K P ++V + D A + N NI
Sbjct: 503 GGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGK-PTVVVQLGTMLDDTALLD-NKNIS 560
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPV-DSLG-- 620
AI+WAGYPG++GG A D++ GK P GRLP+T Y Y +P+T M +RP D+ G
Sbjct: 561 AIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGA 620
Query: 621 --YPGRTYKFYNGPTLYPFGYGLSYTQF 646
PGRTY++Y+ ++PFG+GL +T F
Sbjct: 621 ASNPGRTYRWYD-EAVHPFGFGLHFTNF 647
>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
Length = 847
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 237/628 (37%), Positives = 349/628 (55%), Gaps = 45/628 (7%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ + CD + + R LV M LDEK++ L + + G PR+GLP YEWWSEALHGV+
Sbjct: 90 LSTNIVCDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAK 149
Query: 109 VGPGTHFDD----VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
PG F+ ATSF I+ +A+F++ L + + +STEARA N G AGL +
Sbjct: 150 -SPGVTFNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDW 208
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
W+PNIN +DPRWGR ETPGED + +Y +RGL+ +D +R K+ +
Sbjct: 209 WTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLE-------GSDPTTR--KMVAN 259
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV---- 280
CKHYAA D++ W GV RY+FDA VT QD+ E +L F+ C ++ + S MC+YN +
Sbjct: 260 CKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKG 319
Query: 281 -----NGIPSCADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
NG P CA L+N +R W + + +I +DC+++ M + H + +D++E+A
Sbjct: 320 KDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHW-SDTREEAAG 378
Query: 333 QTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYV 389
AG D C Y A +G + E +D++LK LY L+R G+FDG Y
Sbjct: 379 SAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDAPYR 438
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN----SAKVKTVAVVGPHANATVA 445
++ D+ + E +LA +A EG+VL KN+ LP+ K KTVA++G +
Sbjct: 439 NITWADVNTPEARKLAHRSAVEGMVLTKNN-GVLPIKLEELQKKGKTVALIGNWVDNGEQ 497
Query: 446 MIGNYAGIPCRYMSPIAGFSG--YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
M+G Y+GI +P+A VT + ++ A AA AD +
Sbjct: 498 MLGTYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYF 557
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
G+DLSVEAE DR L P Q +L++ ++ + K P ++V + D A + N NI
Sbjct: 558 GGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGK-PTVVVQLGTMLDDTALLD-NKNIS 615
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPV-DSLG-- 620
AI+WAGYPG++GG A D++ GK P GRLP+T Y Y +P+T M +RP D+ G
Sbjct: 616 AIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGA 675
Query: 621 --YPGRTYKFYNGPTLYPFGYGLSYTQF 646
PGRTY++Y+ ++PFG+GL +T F
Sbjct: 676 ASNPGRTYRWYD-EAVHPFGFGLHFTNF 702
>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 256/744 (34%), Positives = 379/744 (50%), Gaps = 102/744 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ K+LV +MTL+EK +QL + V RL +P+Y WW+E LHGV+ G AT
Sbjct: 15 KAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAGT----------AT 64
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I A F++ L I + +STE RA YN G+T+WSPN+N+ RDP
Sbjct: 65 VFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPNVNIFRDP 124
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ R V +V+GLQ EG + LK ++C KH+A +
Sbjct: 125 RWGRGHETYGEDPYLTSRLGVAFVKGLQG-EG---------KYLKAAACAKHFAVHS--G 172
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
+G+ R+ FDA V+++D+ ET+L FE CVKEGD +VM +YNR NG P C LL
Sbjct: 173 PEGL-RHEFDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAYNRTNGEPCCGSKTLLRDI 231
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+RG+W+ G++V+DC +I +H+ + + E A A +K G DL+CG Y A
Sbjct: 232 LRGKWNFKGHVVSDCWAIADFHLHHRVTSTATESA-ALAMKNGCDLNCGNVYLQLL-LAY 289
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ-DICSDENIELAAEAAREGIV 414
++G V E DI + + L +RLG FD +Y + + + C + N EL+ +AAR +V
Sbjct: 290 KEGLVTEEDITTAAERLMATRIRLGMFDEECEYNKIPYELNDCKEHN-ELSLKAARNSMV 348
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANV 470
LLKN+ LPLN +K++AV+GP+A++ + + GNY+G RY++ + G V
Sbjct: 349 LLKNN-GILPLNKNNLKSIAVIGPNADSQIMLKGNYSGTASRYITVLEGIHEAVGEDVRV 407
Query: 471 TYKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SL 515
Y GC + + N+ + A A+ +D I+ GLD ++E E +
Sbjct: 408 YYSEGCHLFRDRVEELAEPNDRLKEAISIAERSDVAILCLGLDSTIEGEQGDAGNSEGAG 467
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
D+ L LPG Q +L+ ++ E PVILVI G + F AIL A YPG G
Sbjct: 468 DKASLNLPGRQQELLEKIIETGT-PVILVI--GAGSALTFNNAEDKCSAILDAWYPGSRG 524
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
GRA+AD++FGK +P G+LPIT+Y + L RTY++ + +LY
Sbjct: 525 GRAVADLIFGKCSPSGKLPITFYR---------NTKDLPEFIDYSMKDRTYRYMSCESLY 575
Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD-DY 694
PFGYGL+Y+ K + L V D++ D +
Sbjct: 576 PFGYGLTYSTVKLSELH---------------------------------VPDVKSDFED 602
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
E V N G+ D +V+ Y K A + GF+RV ++ G +K K
Sbjct: 603 VEVSVKITNTGNFDIEEVIQCYIKDLESKYAVRNHSLAGFKRVRLKIGESKIAKMKIKK- 661
Query: 755 KSLNIVDYAANTLLPAGEHTIFVG 778
S +V+ +L + +FVG
Sbjct: 662 SSFEVVNDDGERILDSKRFKLFVG 685
>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
Length = 461
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 199/451 (44%), Positives = 286/451 (63%), Gaps = 9/451 (1%)
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QY 388
A +KAGLDLDCG + T AV++G V + +I+ +L TV MRLG FDG P QY
Sbjct: 1 ADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQY 60
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+LG +D+C+ + +LA EAAR+GIVLL+N +LPL++ + +TVAV+GP+++ TV MIG
Sbjct: 61 GNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIG 120
Query: 449 NYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDL 508
NYAG+ C Y +P+ G Y ++ GC DV C N AA AA+ ADAT+++ GLD
Sbjct: 121 NYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQ 180
Query: 509 SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
S+EAE +DR L LPG+Q +L+++VA ++GP ILV+MS G +D+ FA+ + I AI+W
Sbjct: 181 SIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWV 240
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
GYPG+ GG AIA+V+FG NPGG+LP+TWY +YV LP+T M +R + GYPGRTY+F
Sbjct: 241 GYPGQAGGTAIANVLFGTANPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRF 300
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
Y GP ++PFG GLSYT F +NL + V L L+ N S + P D
Sbjct: 301 YIGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKTVRVSHP-----D 355
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
+ VD +N GS DG+ ++V++ PP A+ KQ++GF ++ + G KR++
Sbjct: 356 CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWASS-KQLMGFHKIHIATGSEKRVR 414
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK L++VD +P GEH + +G+
Sbjct: 415 IAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 445
>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 762
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 263/790 (33%), Positives = 400/790 (50%), Gaps = 96/790 (12%)
Query: 11 FSLSI-ALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
FSLS A LV+ A+D + P V P ++S CD++L R LV
Sbjct: 16 FSLSCSAALVY---AIDLPFQTYPDCVNGP---------LASLKVCDATLSPPQRAAALV 63
Query: 70 SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF---DDVIPGATSFP 126
+ MT +EK+Q L + G PR+GLP Y WWSEALHGV+ PGT F + +TSFP
Sbjct: 64 AAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFRSGNGTFNSSTSFP 122
Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGE 186
+L A+F++ L +++G+ + E RA N G +G YW+PN+N +DPRWGR +ETPGE
Sbjct: 123 MPLLMAATFDDELIERVGEVIGIEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGE 182
Query: 187 DPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA 246
D + RYA + +RGL+ + R ++ + CKHYAA D ++W G R+ F+A
Sbjct: 183 DILRIKRYAASMIRGLEG--------PVRERERRIVATCKHYAANDFEDWNGSTRHDFNA 234
Query: 247 RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG-- 304
+VT QD+ E +L PF+ C ++ S+MCSYN VNG+P+CA+ L+ +R W+
Sbjct: 235 KVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILRDHWNWTAPG 294
Query: 305 -YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
YI +DC+++ + NH + A + + A +AG+D C ++ A QG +K++
Sbjct: 295 NYITSDCEAVLDISANHHY-AKTNAEGTALAFEAGIDSSCEYEGSSDILGAWTQGLLKQS 353
Query: 364 DIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT 422
+D++L+ LY L+++G+FDG+ +Y SLG + ++ E+A +AA EGIVLLKND+ T
Sbjct: 354 TVDRALRRLYEGLVQVGYFDGNRSEYASLGWNHVNRPKSQEVALQAAVEGIVLLKNDK-T 412
Query: 423 LPL----NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDD 478
LPL N K+K +A++G AN + G Y+G P SP+ G
Sbjct: 413 LPLGVKKNGPKLK-LAMIGFWANDPKTLSGGYSGTPAFEHSPVYATQAMGFKVTTAGGPV 471
Query: 479 VACKSNNSIFAASEAAKTADATIIL--AGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
+ ++ + + A DA IL G D S E+ DR + P Q QLI ++++
Sbjct: 472 LQNSTSKDTWTQAALAAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITDLSKL 531
Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
K P+++V M +D + I +ILWA +P P GRLP+T
Sbjct: 532 GK-PLVVVQM-GDQLDNTPLLASKAINSILWANWP----------------VPAGRLPVT 573
Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
Y+ +Y +P+T M LRP D L PGRTY++Y P + PFG+GL YT FK ++
Sbjct: 574 QYHANYTAAVPMTDMTLRPSDKL--PGRTYRWYPTP-VQPFGFGLHYTTFKTKIV----- 625
Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL--RCDDYF-------EFKVDFQNVGST 707
R P + DL RC + + KV+ N G
Sbjct: 626 -----------------------RLPRFAIKDLLSRCGNAYPDTCGLPPLKVEVTNTGKR 662
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
VV+ + K IK ++ + R+ + K + + D NT+
Sbjct: 663 SSDYVVLAFLKGDVGPKPYPIKTLVSYTRLRDLSPGRKTTAHLDWTLGDIARYDEQGNTV 722
Query: 768 LPAGEHTIFV 777
L G +T+ V
Sbjct: 723 LYPGTYTVIV 732
>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
8052]
gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
beijerinckii NCIMB 8052]
Length = 709
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 252/741 (34%), Positives = 387/741 (52%), Gaps = 100/741 (13%)
Query: 66 KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
K+LVS+MTL EK +QL + + L +P+Y WW+E LHGV+ G AT F
Sbjct: 17 KELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATVF 66
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
P I A F++ K+ ++TE RA YN GLTYWSPNIN+ RDPRW
Sbjct: 67 PQAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNINIFRDPRW 126
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
GR ET GEDP++ R V +++GLQ EG + LK+++C KH+A + +
Sbjct: 127 GRGHETYGEDPYLTSRLGVAFIKGLQG-EG---------KYLKLAACAKHFAVHS--GPE 174
Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
G+ R+ F+A V ++D+ ET+L FE CVKE + SVM +YNR NG P C LL +R
Sbjct: 175 GL-RHEFNAVVNKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILR 233
Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
G+W G++V+DC ++ H + + ++VA ++ G DL+CG Y N A ++
Sbjct: 234 GKWGFKGHVVSDCWAL-ADFHLHHMVTSTATESVALAIENGCDLNCGNMYLNLL-LAYKE 291
Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
G V E I + + L T +LG FD +Y + + S E+ E+A A+R+ +VLLK
Sbjct: 292 GLVTEEQITTAAERLMTTRFKLGMFDEECEYNKIPYEVNDSREHNEVALIASRKSMVLLK 351
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYK 473
N+ TLPL+ + +K++AV+GP+AN+ + + GNY+G +Y + + G V Y
Sbjct: 352 NN-GTLPLDKSNLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHDAVGNDVRVYYS 410
Query: 474 TGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLDR 517
GC +D+A + ++ + A A+ +D ++ GLD ++E E + D+
Sbjct: 411 EGCHLFKDKVEDLA-RPDDRLSEAISVAERSDVVVLCLGLDSTIEGEQGDAGNSYGAGDK 469
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
E+L LPG Q L+ +V EV K PVI+V+ + + + AE AIL A YPG GG
Sbjct: 470 ENLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSALTLNGAEEKC--AAILNAWYPGSHGGT 526
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
A+AD++FGK +P G+LP+T+Y D ++ T ++ GRTY++ +LYPF
Sbjct: 527 AVADILFGKCSPSGKLPVTFYK-DTAKLPDFTDYSMK--------GRTYRYLGHESLYPF 577
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
GYGL+Y+ T + S + P V + F+
Sbjct: 578 GYGLTYS----------------------------TVELSNLQVPSV----KQGFGSFDI 605
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
++ +N G D +VV Y K A + GF+RV ++ G +K + N KS
Sbjct: 606 SIEIKNTGEYDIEEVVQCYVKDIESKYAVLNHSLAGFKRVSLKKGESKIVTIKLNK-KSF 664
Query: 758 NIVDYAANTLLPAGEHTIFVG 778
+V+ LL + + +FVG
Sbjct: 665 EVVNDDGERLLDSKKFKLFVG 685
>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
10762]
Length = 849
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 243/622 (39%), Positives = 346/622 (55%), Gaps = 44/622 (7%)
Query: 49 MSSFLFCDS-SLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
++S L CD+ + PY R +++ M + EK+ L D ++G RLGLP YEWWSEALHGV+
Sbjct: 37 LTSNLVCDTNATPYQ-RASAIINAMNITEKLANLLDVSYGSARLGLPPYEWWSEALHGVA 95
Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
PG +F ATSFP I +++F++ + I +STEARA N R GL Y+
Sbjct: 96 G-SPGVNFTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNAARGGLDYF 154
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE-GHENATDLNSRPLKVSSC 224
+PNIN +DPRWGR +ETPGEDP + Y N + GL+ + G+ N + + K+ +
Sbjct: 155 TPNINPFKDPRWGRGSETPGEDPLRIQGYVKNLLIGLEGTDDGYFNTSHSGYK--KMIAT 212
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
CKH+A YD+++W G RY +DA +T QD+ E +L PF+ C ++ + +S+MCSYN VN +P
Sbjct: 213 CKHFAGYDLEDWDGYIRYGYDAEITTQDLAEYYLPPFQTCARDQNVASIMCSYNSVNSVP 272
Query: 285 SCADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
+CA+ L +R W + YI +DC++I + NH + ++ A +L G+D
Sbjct: 273 ACANSYLQETILREHWGWTIDNNYITSDCNAISDIYYNHNYSVNNAA-AAGLSLSNGMDT 331
Query: 342 DC------------GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQ 387
C G YY G V E I +L Y L+ G+FD S
Sbjct: 332 ACIVANTGVMTDVNGSYY---------GGYVTEATITTALIRQYEALVIAGYFDPASSNP 382
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y S+G + + LA +AA EG LLKN LP VA++G AN T M
Sbjct: 383 YRSIGWSSVNTPAAQTLARQAATEGTTLLKN-TGLLPYKFTSQTKVAMIGMWANGTSQMQ 441
Query: 448 GNYAGIPCRYM-SPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
G Y+G P Y+ SP+ S + Y G + ++N A+ AA+ AD + G
Sbjct: 442 GGYSG-PAPYLHSPLYAASQLGLSYNYANGPINQTTLTSNYSQNATAAAQNADVILFFGG 500
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
+D SVEAE++DR + PG Q LI Q+A + K P+I++ M + +D +N NI A+
Sbjct: 501 IDWSVEAEAMDRYQIAWPGAQQALIAQLAALGK-PMIVLQMGS-MLDATPILSNNNISAL 558
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
+W GYPG++GG A D++ G P GRLP+T Y DYV +P+T+M LRP G PGRT
Sbjct: 559 VWVGYPGQDGGVAAFDILTGAVAPAGRLPVTMYPADYVNQVPMTNMSLRP--GPGNPGRT 616
Query: 626 YKFYNGPTLYPFGYGLSYTQFK 647
YK+YN L PF YGL YT FK
Sbjct: 617 YKWYNNAVL-PFAYGLHYTTFK 637
>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 696
Score = 394 bits (1011), Expect = e-106, Method: Compositional matrix adjust.
Identities = 254/724 (35%), Positives = 376/724 (51%), Gaps = 108/724 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ K LV+ MTL+E+ QL + + RLG+P Y WW+EALHGV+ G AT
Sbjct: 9 KAKALVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAGV----------AT 58
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVARDP 175
SFP I A+F++ L K++ + ++ E RA YN GLT+WSPN+N+ RDP
Sbjct: 59 SFPQAIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPNVNIFRDP 118
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ R V +V+GLQ EG LK ++C KH+A V +
Sbjct: 119 RWGRGHETYGEDPYLTSRLGVAFVKGLQGEEG-----------LKTAACAKHFA---VHS 164
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
DR+HFDARV+++D+ ET+L FE VKE + SVM +YNR NG P C P L+
Sbjct: 165 GPEADRHHFDARVSQKDLWETYLPAFEALVKEAEVESVMGAYNRTNGEPCCGSPTLMKDI 224
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+R +W G+ V+DC +I+ ++H + ++E A A LK+G DL+CG Y + A
Sbjct: 225 LREKWGFQGHYVSDCWAIKDFHEHHMVTSTAQESA-ALALKSGCDLNCGNTYLHIL-MAY 282
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
Q G V E +I + + L+T LG FDGS Y ++ + + S ++ +A EA + IVL
Sbjct: 283 QNGLVTEEEITTAAERLFTTRYLLGLFDGS-TYDAIPYEVVESKPHLSVADEATAKSIVL 341
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG----YANVT 471
LKN+ LPLN +KT+ V+GP+AN+ A+IGNY G +Y++ + G +
Sbjct: 342 LKNN-GLLPLNKESIKTIGVIGPNANSRKALIGNYHGTSSQYITILEGLQKEVGDEVRIL 400
Query: 472 YKTGCDDVACK------SNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
Y G A + + + A AK +D I+ GLD ++E E S D
Sbjct: 401 YSEGSHLYADRVEPLAYQRDRLSEAKIVAKHSDVVIVCVGLDETLEGEEGDTGNAYASGD 460
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+ DL LP Q +L+ +A++ K PVIL + + +D+ +A+ + + A+L A YPG GG
Sbjct: 461 KRDLALPEPQQELVEAMAKMGK-PVILCLSAGSAIDLQYADAHYD--AVLQAWYPGARGG 517
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
+ IA + G+ P G+LP+T+Y L+ +P + GRTY++ LYP
Sbjct: 518 QVIAKALLGEIVPSGKLPVTFYR-------DLSGLP--AFEDYSMQGRTYRYMQEEALYP 568
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FGYGL+Y + CR + D R VLV++
Sbjct: 569 FGYGLTYGK---------------------CRIEEASYDQGSLR---VLVHN-------- 596
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF--NAC 754
+VDF+ +VV +Y K A + GF+RV + AG K I+ NA
Sbjct: 597 -EVDFKL------EEVVQLYIKNLDSEFAVPNHSLCGFKRVSLEAGETKEIQINVSPNAF 649
Query: 755 KSLN 758
K +N
Sbjct: 650 KVVN 653
>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 721
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 245/725 (33%), Positives = 366/725 (50%), Gaps = 91/725 (12%)
Query: 42 FSKLGLQMSSF-----LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
+ LGL ++F ++ + S RV+DL+SR+TL EKV LG + VPRL +P Y
Sbjct: 13 LTSLGLIKTAFCQQIPIYRNPDKKLSTRVQDLISRLTLAEKVSLLGYRSQAVPRLNIPAY 72
Query: 97 EWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN 156
WW+E LHGV+ G AT FP I A+F+++L K++ VSTEARA YN
Sbjct: 73 NWWNEGLHGVARAGE----------ATIFPQAIAMAATFDDNLVKQVANVVSTEARAKYN 122
Query: 157 LGRA--------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
L A GLT+WSPNIN+ RDPRWGR ET GEDPF+ + YV GLQ +
Sbjct: 123 LSTAMGRHLQYMGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSKMGNAYVHGLQGTDPL 182
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
LK S+ KH+ A+ +R +FDA V E+D+ +T+L F+ V +G
Sbjct: 183 H---------LKTSATAKHFVAHSGPEG---ERDYFDALVDEKDLRDTYLYAFKSLV-DG 229
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
S+M +YNRVNG+P+ + L+N V EW G++V DC ++ + HK L + E
Sbjct: 230 GVESIMTAYNRVNGVPNSINKTLVNDIVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRME 289
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SP 386
A A +KAG+DLDC + NA+ + E +D +L + + +LGFFD S
Sbjct: 290 VAAA-AIKAGVDLDCSSIFQTDIINAINNKLLTEKQVDAALAAVLSTQFKLGFFDAPSSS 348
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ S G I +D ++ LA + A++ +VLLKND+ LPL ++ VVGP+A + A+
Sbjct: 349 PFYSFGADSIHNDSHVMLARQMAQKSMVLLKNDKQILPLKMQNYSSIMVVGPNAASLDAL 408
Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
+ +Y G+ + ++ + G + + + D A + + F A AD T+ + GL
Sbjct: 409 VASYHGVSSKAVNFVEGITAAVDKGTRVEYDLGADYRDTTHFGGIWGAGNADVTVAVIGL 468
Query: 507 DLSVEAES---------LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
+E E+ D++DL LP + + + K P+I V+ S VDIA
Sbjct: 469 TPVLEGEAGDAFLSQTGGDKKDLSLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDIAAIA 528
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
+ A++ A YPGE+GG A+AD++FGK +P G LP+T+YN V LP +
Sbjct: 529 PYAD--AVILAWYPGEQGGNALADILFGKISPSGHLPLTFYNS--VNDLP-------AYN 577
Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
+ GRTY+++ G YPFG+GLSYT F Y KT
Sbjct: 578 NYSMKGRTYRYFAGAVQYPFGFGLSYTTFNYQWQQQPKT--------------------- 616
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
D + V +N G+ +VV Y P + +K++ GF+R+
Sbjct: 617 ----------SYSAKDTIQLSVVVKNTGNISADEVVQAYIGYPT-LNRMPLKELKGFKRI 665
Query: 738 FVRAG 742
+ G
Sbjct: 666 TLNKG 670
>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 793
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 234/626 (37%), Positives = 350/626 (55%), Gaps = 38/626 (6%)
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
+G ++S CD SL S R LV+ + + EK+ L A+G R+GLP+Y WWSEALH
Sbjct: 34 VGGLLASNKVCDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALH 93
Query: 105 GVSNVGPGTHFDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
GV+ PGT F PG +TSFP +L ASF++SL +KIG + TE+RA N +
Sbjct: 94 GVA-YAPGTQFRRG-PGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGNGRWS 151
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
GL YW+PN+N +DPRWGR +ETPGED + RYA + ++GL+ + +
Sbjct: 152 GLDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIKGLEGPHPEKER--------R 203
Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
V S CKHYAA D ++W G R+ FDAR++ QD+ E +L PF+ C ++ S+MC+YN V
Sbjct: 204 VVSTCKHYAANDFEDWNGTSRHDFDARISAQDLAEYYLMPFQQCARDSRVGSIMCAYNAV 263
Query: 281 NGIPSCADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
NG+PSCA+ LL+ +R W G Y+ +DC+++ + HK+ A + + A +A
Sbjct: 264 NGVPSCANSYLLDTVLRKHWGWTGHNNYVTSDCEAVLDVSAGHKY-ARTNAEGTAMCFEA 322
Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDI 396
G D C ++ A QG ++E +D++L LY L+R+G+FDG S + + D+
Sbjct: 323 GTDTSCEYTPSSDIRGAYAQGLLREETMDRALLRLYEGLVRVGYFDGNSSAFSDISWADV 382
Query: 397 CSDENIELAAEAAREGIVLLKNDQN-TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
+ +L+ ++A EGIV+LKND LPL + +AMIG +A P
Sbjct: 383 NAPAAQDLSLQSAVEGIVMLKNDGTLPLPLGAKCSSKSKKRSSSGGPKLAMIGFWADAPE 442
Query: 456 RYMSPIAGFSGY----ANVTYKTGCDDVAC-----------KSNNSIFAASEAAKTADAT 500
+ +G + Y A + G D V ++N A AA+ AD
Sbjct: 443 KLRGGYSGTAAYLRTPAYAARQMGLDVVTAGGPVLQGAAAAAADNWTAPALAAAEGADYI 502
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
+ GLD + E+ DR D+ PG Q L+ ++A + K P+++V M +D N
Sbjct: 503 VYFGGLDETAAGENKDRWDVEWPGAQLALVKRLAALGK-PLVVVQM-GDQLDGTPLLANA 560
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
+ A+LWA +PG++GG A+ ++ G +P GRLP+T Y +Y +++P+T M LRP S
Sbjct: 561 GVGAVLWASWPGQDGGPAVMRLLSGAASPAGRLPVTQYPANYTRLVPMTEMALRPSASGS 620
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQF 646
PGRTY++Y+ P L PFG+GL YT F
Sbjct: 621 RPGRTYRWYSTPVL-PFGFGLHYTNF 645
>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
Length = 756
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 256/736 (34%), Positives = 387/736 (52%), Gaps = 44/736 (5%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CD + + R LV+ M EK++ L + GV RLGLP Y WW EALHGV+
Sbjct: 29 LKSNAICDVTASPAKRAAALVAAMQTQEKLENLVSKSKGVARLGLPAYNWWGEALHGVAG 88
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
PG +F ATSFP +L +A+F++ L +I + EARA N G A + +W+P+
Sbjct: 89 A-PGINFTGSYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPVDFWTPD 147
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN RDPRWGR +ETPGED + Y + + GL+ + K+ + CKHY
Sbjct: 148 INPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDKAQR----------KIIATCKHY 197
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
YDV+NW G DR+HFDA++T QD+ E F+ PF+ C ++ S MCSYN VNG+P+CAD
Sbjct: 198 VGYDVENWNGTDRHHFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCAD 257
Query: 289 PKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+L +R W D + YI +DC++++ + HK++A +E A A G+DL C
Sbjct: 258 TYVLEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQE-ATAIAFNNGMDLSCEY 316
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIEL 404
T+ A QG + + ID++L Y L+ G+FDG + Y LG QDI + E +L
Sbjct: 317 SGTSDIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDGAAATYAHLGVQDINTPEAQKL 376
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM-SPI-A 462
+ A EG+ LLKND +TLPL+ VA+VG AN T + G Y+G P Y+ +P+ A
Sbjct: 377 VLQVAAEGLTLLKND-DTLPLSLKSGSKVAMVGFWANTTSKLSGIYSG-PAPYLHTPVYA 434
Query: 463 GFSGYANVTYKTG-CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
G ++ TG + ++N A AAK +D + GLD S AE DR D+
Sbjct: 435 GNKLGLDMAVATGPILQTSGAADNWTTTALNAAKKSDFILYFGGLDPSAAAEGSDRTDIS 494
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
P Q LI ++A A G ++VI VD + +++WA +PG++GG A+
Sbjct: 495 WPSAQIDLITKLA--ALGKPLVVIALGDMVDHTPILKMKGVNSLIWANWPGQDGGTAVMQ 552
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
V+ G+ GRLPIT Y +Y Q L + M +RP + PGRTY++YN ++ PFG+GL
Sbjct: 553 VITGEHAIAGRLPITQYPAEYTQ-LSMLDMNMRPGGN--NPGRTYRWYN-ESVQPFGFGL 608
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
YT+F S + + VN+ + ++ DL C D +V
Sbjct: 609 HYTKFAAKFGS-SSGLTVNIQDIMKSCTKDHP--------------DL-C-DVPPIEVAV 651
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
N G+ + + + K +K ++ + R+ +G ++ + +L+ VD
Sbjct: 652 TNEGNRTSDFIALAFIKGEVGPKPYPLKTLVSYARLRDISGSQTKMASLALTLGALSRVD 711
Query: 762 YAANTLLPAGEHTIFV 777
+ N + GE+T+ +
Sbjct: 712 QSGNLVAYPGEYTLLL 727
>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
Length = 699
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 263/738 (35%), Positives = 382/738 (51%), Gaps = 97/738 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+++ L+S M+L+EK+ + A G+PRLG+P Y WW+EALHGV+N G AT
Sbjct: 11 QIETLISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNGE----------AT 60
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGRA-------GLTYWSPNINVARDP 175
FP I A+F+E L ++ +A+S EARA +N +G+ GLT+W+PNIN+ RDP
Sbjct: 61 VFPQAIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFRDP 120
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDV 233
RWGR ET GEDP + R YVRGLQ S P L+ ++C KH+A +
Sbjct: 121 RWGRGQETYGEDPVLTSRLGTAYVRGLQ-----------GSDPYYLRAAACAKHFAVH-- 167
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
+G+ R+ F+A V+++D+EET+L F+ VK G SVM +YNRVNG P+C LL
Sbjct: 168 SGPEGL-RHTFNAEVSQKDLEETYLPAFKALVKSG-VESVMGAYNRVNGEPACGSTYLLK 225
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
Q +R EW G++V+DC +I NHK D E ++A L++G DL+CG Y N+
Sbjct: 226 QKLREEWQFQGHVVSDCWAICDFHKNHKVTNDILE-SIALALRSGCDLNCGDAY-NYLAE 283
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGI 413
AV +G V E DI++++ L L +LG Y + I ++ LA EAA + I
Sbjct: 284 AVLKGYVTEDDINRAVVRLLITLDKLGLIHDDGPYQGITIHQIDWKKHDSLALEAAEKSI 343
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----N 469
VLLKN+ LPL K+ + V GP+A + A++GNYAG+ R ++ + A
Sbjct: 344 VLLKNN-GVLPLKKDKISYIYVTGPNATNSDALLGNYAGVSSRLLTVLEAIVEEAGPEIT 402
Query: 470 VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR---------EDL 520
VTYK GC +A + N AS K AD TI + G D SVE E D EDL
Sbjct: 403 VTYKKGC-PLAERRVNPNDWASGVTKYADVTIAVMGRDTSVEGEEGDAILSSTYGDFEDL 461
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L Q ++++ E K P+I+V+M GG I E + AIL A YPG+ GG A++
Sbjct: 462 NLNDEQLSYLHKLKESGK-PLIVVLM--GGAPICSPELHEIADAILVAWYPGQAGGTAVS 518
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
++VFGK NP G+LP+T P + L ++ GRTY++ LYPFG+G
Sbjct: 519 NIVFGKTNPSGKLPVT---------FPKSVRQLPEFENYSMQGRTYRYMTEEPLYPFGFG 569
Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
LSYT+ ++ ++ + + P D +
Sbjct: 570 LSYTKMEFKHVT------------------------GRWKSPE--------KDELIVSTE 597
Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
N G+ DG +VV +Y A +I F+RV V AG + +F + L +
Sbjct: 598 LYNQGTIDGEEVVQLYYHWKDAPFAVPNWSLIDFKRVLVAAGASCICEFKI-PLEKLQCI 656
Query: 761 DYAANTLLPAGEHTIFVG 778
D + ++P G +VG
Sbjct: 657 DPSGKGVIPTGTLQFYVG 674
>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
Length = 743
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 258/805 (32%), Positives = 398/805 (49%), Gaps = 107/805 (13%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFV---------CDPGRFSKLGLQMSS 51
+A V++++L L+ A ++ +AN +P V CD G SK
Sbjct: 7 IATVLAAILPSVLAQANTSYADYNTEANPDLTPQSVATIDLSFPDCDNGPLSKT------ 60
Query: 52 FLFCDS-SLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
+ CD+ + PY R L+S TL+E V G+ + GVPRLGLP Y+ W+EALHG+
Sbjct: 61 -IVCDTLTSPYD-RAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLDRA- 117
Query: 111 PGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
+F D +TSFP ILT ++ N +L ++ +ST+ RA N GR GL +SPN
Sbjct: 118 ---YFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPN 174
Query: 169 INVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
IN R P WGR ETPGED + + YA Y+ G+Q ++ + LK+ + KH
Sbjct: 175 INSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG--------GVDPKSLKLVATAKH 226
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
YA YD++NW G R D +T+QD+ E + F + ++ SVMCSYN VNG+PSCA
Sbjct: 227 YAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVNGVPSCA 286
Query: 288 DPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+ L +R + GYI +DCDS + + H++ A+ A A +++AG D+DCG
Sbjct: 287 NSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSS-AAADSIRAGTDIDCGT 345
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y + AV Q + DI++ + LY+ LMRLG+FD
Sbjct: 346 TYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFD---------------------- 383
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF- 464
VGP N + + GNY G +SP+ F
Sbjct: 384 ------------------------------VGPWMNVSTQLQGNYFGPAPYLISPLDAFR 413
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
+ +V Y G + ++ S + A AAK +DA I G+D S+EAE+LDR ++ PG
Sbjct: 414 DSHLDVNYAFGTN-ISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDRMNITWPG 472
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +LI+Q++++ K P+I++ M G VD + ++N N+ +++W GYPG+ GG+A+ D++
Sbjct: 473 KQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQALLDIIT 531
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
GK P GRL +T Y +Y P T M LRP + PG+TY +Y G +Y FG+GL YT
Sbjct: 532 GKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEFGHGLFYT 589
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F+ +S + + K++ N+ D PG + + + F VD N
Sbjct: 590 TFR---VSHARAV-----KIKPTYNIQ---DLLAQPHPGYI--HVEQMPFLNFTVDITNT 636
Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
G ++++ A A K ++GF R+ ++ + S+ D
Sbjct: 637 GKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTIPVTINSMARTDELG 696
Query: 765 NTLLPAGEHTIFVGNG-GVSFPIHL 788
N +L G++ + + N V P+ L
Sbjct: 697 NRVLYPGKYELALNNERSVVLPLSL 721
>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
Maddingley MBC34-26]
Length = 709
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 253/742 (34%), Positives = 383/742 (51%), Gaps = 102/742 (13%)
Query: 66 KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
K+LVS+MTL E+ +QL + + L +P+Y WW+E LHGV+ G AT F
Sbjct: 17 KELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATVF 66
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
P I A F+E +I +STE RA YN GLTYWSPN+N+ RDPRW
Sbjct: 67 PQAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVNIFRDPRW 126
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
GR ET GEDP++ R V +++GLQ EG + LK+++C KH+A + +
Sbjct: 127 GRGHETYGEDPYLTSRLGVAFIKGLQG-EG---------KYLKLAACAKHFAVHS--GPE 174
Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
G+ R+ F+A V ++D+ ET+L FE CVKE + SVM +YNR NG P C LL +R
Sbjct: 175 GL-RHEFNAVVEKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILR 233
Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
G+W G++V+DC ++ H + + ++VA ++ G DL+CG Y N A ++
Sbjct: 234 GKWGFKGHVVSDCWAL-ADFHLHHMITSTATESVALAIENGCDLNCGNMYLNLL-LAYKE 291
Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
G V E I + + L T +LG FD +Y + + E+ E+A A+R+ +VLLK
Sbjct: 292 GLVTEEQITTAAERLMTTRFKLGMFDEDCEYNRIPYEVNDCKEHNEIALIASRKSMVLLK 351
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVTYK 473
ND TLPL+ + +K++AV+GP+AN+ + + GNY+G +Y + + G V Y
Sbjct: 352 ND-GTLPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHNAVGDNIRVYYS 410
Query: 474 TGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLDR 517
GC +D+A ++ + A A+ +D I+ GLD ++E E + D+
Sbjct: 411 EGCHLFKDKVEDLA-GPDDRLSEAISVAERSDVVILCLGLDSTIEGEQGDAGNSYGAGDK 469
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
E L LPG Q L+ +V EV K PVI+V+ G + F AIL A YPG GG
Sbjct: 470 ESLNLPGRQQNLLEKVLEVGK-PVIVVL--GAGSALTFNGAEEKCAAILNAWYPGSHGGT 526
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
A+AD++FGK +P G+LP+T+Y D + T ++ GRTY++ +LYPF
Sbjct: 527 AVADILFGKCSPSGKLPVTFYK-DTANLPEFTDYSMK--------GRTYRYLEHESLYPF 577
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD-DYFE 696
GYGL+Y+ +V L+ LQ V ++ D + F+
Sbjct: 578 GYGLTYS-------------KVELSNLQ--------------------VPFVKADFESFD 604
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+D +N G+ +VV Y K A + GF+RV ++ G +K + + +S
Sbjct: 605 ISIDIRNTGNYGIEEVVQCYVKDLKSKYAVLNHSLAGFKRVSLKKGESKTVTIELSK-RS 663
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
V+ LL + +FVG
Sbjct: 664 FEAVNNDGERLLDSKSFKLFVG 685
>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
Length = 736
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 247/766 (32%), Positives = 386/766 (50%), Gaps = 104/766 (13%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q+ + F ++ LP +RVKDLV+R+TL+EKV + + +PRLG+P Y+WW+EALHGV+
Sbjct: 20 QVENLPFRNADLPLEVRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVA 79
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGRA--- 160
+ T FP I A+F+ +K+G STE RA++N G+
Sbjct: 80 R---------TLEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTR 130
Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
GLTYW+PNIN+ RDPRWGR ET GEDP++ + VRGL+ + H
Sbjct: 131 YRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPHY--------- 181
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
LK +C KHYA + + +R+ FDAR + D+ +T++ F V + VMC+YN
Sbjct: 182 LKSVACAKHYAVHSGPEY---NRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYN 238
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
R+NG P C + LL +R +W GY+ +DC +++ + HK + A++ L AG
Sbjct: 239 RLNGQPCCGNDPLLVDILRNQWHFDGYVTSDCWALKDFAEFHKTHPEHT-IAMSDALLAG 297
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDI 396
DL+CG Y + V++G E DI+ SL L+T+L ++G FD + + Y S+G++ +
Sbjct: 298 TDLECGNLY-HLLAEGVKKGLHSERDINVSLSRLFTILFKIGMFDPAERVPYSSIGREVL 356
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
+ + + A A+E IVLL+N + LPL+++K+K++A++GP+A+ + NY G P
Sbjct: 357 ECEAHKQHAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSE 416
Query: 457 ----YMSPIAGFSGYANVTYKTGCDDV-ACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
YMS + Y G V K S + A +D + ++G+ E
Sbjct: 417 IVTPYMSLKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYE 476
Query: 512 -------------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
S DR + LP Q +L+ ++ + + P+I+V MS G ++F
Sbjct: 477 GEAGDAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMS--GSVMSFEWE 533
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
+ N A+L A Y G+ G AI DV+FG NP GR+P+T Y D +P P ++
Sbjct: 534 SQNADALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSD-------NDLP--PFEN 584
Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
GRTY+++ G YPFGYGLSYT F Y+ + C + +T D ++
Sbjct: 585 YSMLGRTYRYFKGEPRYPFGYGLSYTTFAYSDV--------------QCVDETHTGDTAR 630
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP----AEIAATYIKQVIGF 734
V N G DG +VV +Y P +I +K GF
Sbjct: 631 V------------------TVTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALK---GF 669
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
+R+ ++ G + + F + L + + N + G+ T+FVG G
Sbjct: 670 KRIHLKRGESTSVSFTLTP-EELALTETDGNLVEKNGQVTLFVGGG 714
>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 734
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 256/774 (33%), Positives = 381/774 (49%), Gaps = 102/774 (13%)
Query: 40 GRFSKLGLQM----SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
G F L +Q S F + L + RV DLVSR+TL+EKV+Q+ + A +PRLG+P
Sbjct: 10 GLFFSLAVQAQADKSQLPFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPA 69
Query: 96 YEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
Y+WWSE LHGV+ T T +P I A+++ + + E RA++
Sbjct: 70 YDWWSEVLHGVARTPYHT---------TVYPQAIAMAATWDTVALYTMADQSAREGRAIH 120
Query: 156 NLGR---------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
N GLTYW+PNIN+ RDPRWGR ET GEDPF+ +VRGLQ +
Sbjct: 121 NKATEEGKNGDRYVGLTYWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQGED 180
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
+ LK ++C KHYA + + R+ FD V++ D+ T+L F+ V
Sbjct: 181 ---------PKYLKAAACAKHYA---IHSGPEAVRHSFDVDVSDYDLWNTYLPAFKELVT 228
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
+ VMC+YN P C L+ +R +W GY+ +DC +I + HK ++
Sbjct: 229 HAKVAGVMCAYNAFRKKPCCGSDLLMTDILRRQWGFTGYVTSDCGAIDDFFNYHKTHPNA 288
Query: 327 KEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD-- 383
E A + G D++CG + Y T +AV+ G++ E +ID+S+K L+ + MRLG FD
Sbjct: 289 -EAAAIDAVTNGTDVECGNRAYLTLT-DAVKTGRIAEKEIDRSVKRLFMIRMRLGMFDPV 346
Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
Y + S + A + A+E IVLLKN+ + LPL S +K +AVVGP+A+ +
Sbjct: 347 SMVSYAQTSPAVLESAPHKAQALKMAQESIVLLKNENHLLPL-SKSIKKIAVVGPNADNS 405
Query: 444 VAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCD--DVACKSNNSIFAA-SEAAKT 496
+A++GNY G P + ++ + G +V Y+ + + + FAA + K
Sbjct: 406 IAVLGNYNGTPSKIVTALDGIKAKLGTNGSVVYEKAVNFTNAMLPEGKTDFAALTSRVKD 465
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
ADA I + G+ +E E + DR + LP QT+ + + K PV+ V+M
Sbjct: 466 ADAIIFVGGISPQLEGEEMKVNEPGFNSGDRTTILLPTVQTEAMKALKATGK-PVVFVMM 524
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ + I + + NI AI+ A Y G+ G AIADV+FG +NP GRLP+T+Y D
Sbjct: 525 TGSALAIPWEQ--ENIPAIVNAWYGGQAAGTAIADVLFGDYNPSGRLPVTFYKSD----- 577
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
L D RTY++++G LYPFGYGLSYT F+Y L T++
Sbjct: 578 ----ADLPAFDDYRMENRTYRYFSGQALYPFGYGLSYTTFRYEGLKVPTTVK-------- 625
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
+K R P + N G+ G +VV +Y +
Sbjct: 626 ----------NKVRIP--------------VSIQLTNTGAKGGEEVVQLYISYQGQPIKK 661
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
+K + GFQRV++ G+ K IKF+ +L I L P G+ I VG G
Sbjct: 662 PLKALKGFQRVWLNRGQTKTIKFLLTP-DALAIAGENGKLLNPKGKLRISVGGG 714
>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
Length = 697
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 245/743 (32%), Positives = 383/743 (51%), Gaps = 105/743 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ + LV+RMTL+EK QL A + RLG+P Y WW+E LHGV+ G AT
Sbjct: 9 KAEALVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAGQ----------AT 58
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I A+F+ ++ V+TE RA YN GLT+WSPN+N+ RDP
Sbjct: 59 VFPQAIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPNVNIFRDP 118
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ V++V+ LQ N +K ++C KH+A V +
Sbjct: 119 RWGRGHETYGEDPYLTKELGVSFVKALQG----------NGDTMKAAACAKHFA---VHS 165
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
R+ FDA + +DMEET+L FE VKE +VM +YNR NG P C P L +
Sbjct: 166 GPEALRHEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAYNRTNGEPCCGSP-TLQKK 224
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+RGEW G+ V+DC +I+ ++H + D+ ++ A + G DL+CG Y + A
Sbjct: 225 LRGEWKFQGHFVSDCWAIRDFHEHH-MVTDTAVESAALAINNGCDLNCGNTYLHIM-KAY 282
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
++G V E I ++ L+T LG FDGS +Y +L ++ S +++ A +AA + VL
Sbjct: 283 EKGLVTEETITRAAVRLFTTRYLLGLFDGS-EYDNLSYMEVESPRHLDAAEKAAEKSFVL 341
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKN+ LPL+ K+KT+ ++GP+A++ A+IGNY G RY++ G Y +
Sbjct: 342 LKNN-GILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASRYITIQEGIQDYVGDDVRIL 400
Query: 472 YKTGCDDVACKSNNSIFA------ASEAAKTADATIILAGLDLSVEAE---------SLD 516
GCD ++ + F A A+ +D I+ GLD ++E E S D
Sbjct: 401 TSRGCDLFRDRTEHLAFTRDRIAEAKVVAENSDVVILCMGLDETLEGEEGDTGNSYVSGD 460
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+ED+ LPG Q +L+ +A+ K PV+ +++ +D+ +A + +LW YPG +GG
Sbjct: 461 KEDIELPGVQRELMEAIADTGK-PVVFCLLAGSDLDLKYAAEKFDAVMMLW--YPGCQGG 517
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
+A A V+FG+ +P G+LP+T+Y + ++ LP T ++ GRTY++ +
Sbjct: 518 KAAAKVLFGEISPSGKLPVTFY--ESLEELPDFTDYSMK--------GRTYRYMERKAQF 567
Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
PFGYGL+Y++ + V+ +++ C G +N
Sbjct: 568 PFGYGLTYSK-----------VAVDKAEVKTC---------------GQKIN-------- 593
Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
+V+ QN G+ D DVV +Y K A + GFQR+F++AG ++I+ K
Sbjct: 594 -VEVEVQNNGAYDTEDVVQIYVKNIDSKNAIPNPMLAGFQRIFLKAGECRKIEIPIWE-K 651
Query: 756 SLNIVDYAANTLLPAGEHTIFVG 778
+ +VD + + I+ G
Sbjct: 652 AFTVVDETGKRMEEGKKFEIYAG 674
>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
phytofermentans ISDg]
Length = 717
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 252/752 (33%), Positives = 378/752 (50%), Gaps = 107/752 (14%)
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
+ R +LV +MTL+EKV Q A +PRL + Y +W+EALHGV+ G
Sbjct: 10 FQQRATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAGV--------- 60
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVA 172
AT FP I A+F+E L ++I +STE R +N + GLT+WSPN+N+
Sbjct: 61 -ATVFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWSPNVNIF 119
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY- 231
RDPRWGR ET GEDPF+ G +V G+Q GH+ LK ++C KH+A +
Sbjct: 120 RDPRWGRGHETFGEDPFLSGTLGGRFVDGIQ---GHDETY------LKAAACAKHFAVHS 170
Query: 232 ---DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
D+ R+ F+A V+EQD+ ET+L F+ VKE +VM +YNR NG P C
Sbjct: 171 GPEDI-------RHSFNAEVSEQDLRETYLPAFKKLVKEHKVEAVMGAYNRTNGEPCCGS 223
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
LL +RGEW+ G++ +DC +I+ ++H +++ E +VA + G DL+CG Y
Sbjct: 224 KTLLEDILRGEWEFVGHVTSDCWAIKDFHEHHMVTSNAVE-SVALAMNRGCDLNCGNLYV 282
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAA 406
N AV+ G V+E ID +L L+T M+LG FD S + ++ + + + EL
Sbjct: 283 NLL-QAVRDGLVEEETIDTALIRLFTTRMKLGLFDKEESIPFNTITYDQVDTKSSKELNI 341
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
+A+++ +VLLKN+ N LPLN K+ +V V+GP+AN A++GNY G Y++ + G
Sbjct: 342 KASKKCVVLLKNEDNILPLNPKKITSVGVIGPNANNRNALVGNYEGTASEYITVLEGIKQ 401
Query: 467 Y----ANVTYKTGCDDVACK------SNNSIFAASEAAKTADATIILAGLDLSVEAE--- 513
V + GC K N+ I + +D I GLD +E E
Sbjct: 402 VVPEDVRVYFSEGCHLFKNKLSNLSQENDRIAEVRAVCEHSDVVIACLGLDPGLEGEEGD 461
Query: 514 ------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
S D++ L LPG Q ++ + E K PVIL+++S + + +A + +I AIL
Sbjct: 462 QGNQFASGDKKTLALPGIQEDVLKTIYECGK-PVILILLSGSALAVPWA--DEHIPAILQ 518
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
YPG +GGRAIA+++FG NP G+LP+T+Y T+ L RTY+
Sbjct: 519 GWYPGAQGGRAIAELIFGDGNPEGKLPVTFYR---------TTEELPEFTDYAMKNRTYR 569
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
+ LYPFGYGLSYT F++ LL VN + L N+
Sbjct: 570 YMKNEALYPFGYGLSYTTFEHTLL------YVNTDTLGKGSNV----------------- 606
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
E V +N G +GS Y K E A Q+ G ++V + G K I
Sbjct: 607 --------ECMVRVKNTGDYEGSVTTQAYVKYVGEDAPNC--QLKGLKKVSLLPGEEKDI 656
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ ++ + + +L GE+ +++ +
Sbjct: 657 MIELDD-RAFGLYNEEGEFILNQGEYELYLSD 687
>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
BAA-286]
Length = 738
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 249/754 (33%), Positives = 382/754 (50%), Gaps = 95/754 (12%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ F D+ L RV DLVSR+TL+EKV Q+ + + RL +P Y WW+E LHG+
Sbjct: 24 YPFRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHGIGR--- 80
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLT 163
T + T FP I A+++ L K + A+S E RA+YN A GLT
Sbjct: 81 -TEYK-----VTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYSIYHGLT 134
Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
YW+PN+N+ RDPRWGR ET GEDP++ G ++V GLQ + S+ LK ++
Sbjct: 135 YWTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQGDD---------SQYLKAAA 185
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
C KHYA V + R+ F+ VT D+ +T+L F V + + VMC+YN +G
Sbjct: 186 CAKHYA---VHSGPENTRHTFNTFVTTFDLWDTYLPAFRDLVVDAKVAGVMCAYNAFSGE 242
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
P C + L+ + +R +W GY+ +DC +I +HK D+K A A + +G D+DC
Sbjct: 243 PCCGNNLLMQEILRDKWGFTGYVTSDCGAIDDFYRHHKTHPDAKY-AAADAVYSGTDIDC 301
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
G +AV+ G + E ID SLK L+ + RLG FD + ++ + + S +
Sbjct: 302 GNEAYKALVDAVKTGLITEEQIDISLKRLFEIRFRLGMFDPAEDVKFSKIPLSVLESQPH 361
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
+LA + RE IVLLKN+ N LPL S K+K VAV+GP+A+ V+++GNY G P + ++P
Sbjct: 362 KDLALKITRESIVLLKNENNFLPL-SKKLKKVAVIGPNADNEVSVLGNYNGFPTQIITPY 420
Query: 462 AGFSGY---ANVTYKTGCDDVACKSNN--SIFAASEAAKTADATIILAGLDLSVEAESL- 515
V Y+ G D V N+ I A ++ K D I G+ +E E +
Sbjct: 421 KAIKNKLKNTEVIYEKGIDFVKPSENSKEEIAALAKRLKGMDVVIFAGGISPELEGEEMP 480
Query: 516 ---------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
DR + LP QT+L+ Q + + P + V+M+ G IA + N+ AIL
Sbjct: 481 VKIEGFTGGDRTSIKLPKIQTELM-QALKAERIPTVFVMMT--GSAIAAEWESQNVPAIL 537
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
A Y G++ G AIADV+FG +NP G+LP+T+Y D + +P +S RTY
Sbjct: 538 NAWYGGQDAGTAIADVLFGDYNPSGKLPVTFYTKD-------SDLP--AFNSYEMKNRTY 588
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
++++G LYPFGYGLSYT+F+Y+ + +I+ N
Sbjct: 589 RYFDGQVLYPFGYGLSYTKFEYSPIQMPASIKAGEN------------------------ 624
Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK--QVIGFQRVFVRAGRN 744
E + +N G TDG +VV +Y + + F+R+ ++AG +
Sbjct: 625 --------MEVSITVKNTGKTDGEEVVQLYISHDNNGTNRQLPLYALKSFERISLKAGES 676
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
K + F + + + + D + G+ +++G
Sbjct: 677 KSVTFKLSP-REMALADEDGVLKMTKGKSKLYIG 709
>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
6_1_58FAA_CT1]
Length = 735
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 254/758 (33%), Positives = 392/758 (51%), Gaps = 97/758 (12%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
G ++F F + L + RV DLVSR+TL+EK+ Q+ + A + RLG+P Y+WW+E LHG
Sbjct: 21 GKAQNTFPFQNPDLSFEKRVDDLVSRLTLEEKISQMLNKAPAIERLGIPAYDWWNECLHG 80
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA----- 160
V T + T FP I A+++++L++++ +++ E RA+Y+ +
Sbjct: 81 VGR----TPYK-----VTVFPQAIGMAATWDDALFQQVASSIADEGRAIYHDAISKGVHE 131
Query: 161 ---GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
GLTYW+PNIN+ RDPRWGR ET GEDP++ G +V GLQ + +
Sbjct: 132 IYHGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQGD---------DPK 182
Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
LK S+C KHYA V + + R+ F+ V+ D+ +T+L F V + SSVMC+Y
Sbjct: 183 YLKASACAKHYA---VHSGPEISRHFFNTEVSMYDLWDTYLPAFRDLVVDAKVSSVMCAY 239
Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
N + G P C + L+ +R +W GY+ +DC +I + HK AD+ + L
Sbjct: 240 NALAGQPCCGNDLLMQDILRKQWKFTGYVTSDCGAIDDFL-KHKTHADAAHASADAVLH- 297
Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQD 395
G DL+CGQ +AV+QG + E ID+S+K L+ RLG FD + +Y
Sbjct: 298 GTDLECGQNIYVKLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFDPADRVKYADTPLSV 357
Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
+ DE+ LA + +RE +VLLKND N LPL +K +AV+GP+A+ + ++GNY G P
Sbjct: 358 LECDEHKALALKMSRESVVLLKND-NVLPLRK-NLKKIAVIGPNADDSTVVLGNYNGFPS 415
Query: 456 RYMSPIAGFSG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
+ ++P+ V Y D V ++ A E K D I + G+ +E
Sbjct: 416 KVITPLEAIRSKVGKRTQVIYDRAIDCVKPSDEKTLNALIERLKGVDQVIFVGGISPRLE 475
Query: 512 AESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
E L DR + LP QT+L+ ++ E A PVI V+M+ + I + + N
Sbjct: 476 GEELPISVDGFRGGDRTTIALPEVQTELMKKMKE-AGLPVIFVMMTGSALGIEW--ESQN 532
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
I AIL A Y G+ G+AIADV+FG +NP G+LP+T+Y D + +P P +
Sbjct: 533 IPAILNAWYGGQFAGQAIADVLFGDYNPSGKLPVTFYRSD-------SDLP--PFGAFSM 583
Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
RTY+++ G LYPFG+GLSYT F Y++
Sbjct: 584 ANRTYRYFKGEALYPFGFGLSYTMFDYSV------------------------------- 612
Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAEIAATYIKQVIGFQRVFVR 740
P V V+ + + + V +N+G +G +VV +Y S E A I + GF+RV+++
Sbjct: 613 PQV-VSGGKVGEPIKVSVKVKNIGKKNGDEVVQLYLSHEGVEKAP--ITALKGFKRVYLK 669
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
AG K + F + + +++ D + G+ TI+ G
Sbjct: 670 AGEEKTLSFEISP-RDMSLPDDNGIITVFPGKKTIYAG 706
>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 742
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 262/749 (34%), Positives = 388/749 (51%), Gaps = 92/749 (12%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ F D+S RVKDLVSR+TLDEK Q+ A + RLG+ Y WW+EALHGV+ G
Sbjct: 38 YPFQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGILPYSWWNEALHGVARTG- 96
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLT 163
AT FP + A+F+E L +IGQA+S EA A YN+ + +G+T
Sbjct: 97 ---------RATVFPENVGLAATFDEDLVYRIGQAISDEAWAKYNIAQRLENYGQYSGIT 147
Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
+++PN+N+ RDPRWGR ET GEDPF+ R V YV+G+Q + + LK ++
Sbjct: 148 FYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQGND---------PKYLKTAA 198
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
C KHY V + R+ +DA +D ET++ FE VKEG SVMC+YNR G
Sbjct: 199 CAKHYV---VHSGPEALRHSYDAEPPMKDFMETYVPAFETLVKEGKVESVMCAYNRTFGK 255
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
P C LL+ +R +W GY+ DC +IQ +H DS E A A +K+G++L+C
Sbjct: 256 PCCGSSFLLHDLLREKWGFTGYVTTDCWAIQNFYLHHGAAKDSLE-ACALAIKSGVNLNC 314
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDE 400
G + N+ AV++G V E ++D++L L RLG FD SP Y + ++ I S +
Sbjct: 315 GNEF-NYLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFD-SPNENPYAKIKEEVIGSQQ 372
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--- 457
NI+LA EAA + +VLL+N NTLPL +K++ VVGP+A ++GNY G+ R
Sbjct: 373 NIDLAYEAAAKSLVLLQNKNNTLPLKK-DMKSLYVVGPYAANQDILLGNYNGVNSRLTTI 431
Query: 458 MSPIAG-FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII--LAGLDLSVEAES 514
M I G S +V Y+ G + A N+ ++ EAA + ++G+ E ES
Sbjct: 432 MQAIVGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADADAVVAVFGISGVFEGEEGES 491
Query: 515 L------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
DR DL LP Q + ++ + K P+ILV+ GG I E + AIL+
Sbjct: 492 TASTSRGDRLDLNLPQNQLDYLRELKKKCKKPIILVL--TGGSPICTPELADMVDAILFV 549
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
YPG+EGG A+ADV+FG NP GRL IT+ P + L + GRTY++
Sbjct: 550 WYPGQEGGHAVADVIFGDVNPSGRLCITF---------PKSVSQLPAFEDYSMKGRTYRY 600
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
LYPFG+GLSYT + Y+ I+ + +K++ ++++ T+ S
Sbjct: 601 MTEEPLYPFGFGLSYTNYSYS------NIKTDKDKIKKGQSVHVTATVS----------- 643
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
N G T G +V +Y A T + + G +RV + AG +K +
Sbjct: 644 --------------NTGKTAGEEVAQLYITDVKASAPTPLYALKGTKRVKLAAGESKEVS 689
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFV 777
F + + +V ++ G+ +++
Sbjct: 690 FEVTP-QMMELVTVTGEKVIEPGDFKVYI 717
>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
Length = 658
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 258/700 (36%), Positives = 364/700 (52%), Gaps = 81/700 (11%)
Query: 96 YEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
Y WWSEAL+ S ATSFP I A+F++ L I +STEARA
Sbjct: 1 YNWWSEALNFSS--------------ATSFPAPITMGATFDDGLIHAIATVISTEARAFN 46
Query: 156 NLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
N+ R GL +++PNIN +DPRWGR ETPGEDPF + +Y V GLQ G N
Sbjct: 47 NVNRGGLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQGGVGPTN----- 101
Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
LK+++ CKH+AAYD++N GV R+ FDA+VT QD+ E + F+ C+++ +S+MC
Sbjct: 102 ---LKIAADCKHWAAYDLEN-LGVSRFEFDAKVTMQDLAEFYSPSFQSCIRDAKVASIMC 157
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
SYN VNGIPSCA+ LL R W L +I DC ++ + H + D + A
Sbjct: 158 SYNAVNGIPSCANRYLLQTLARDFWGLGEEQWITGDCGAVGNIFARHHY-TDDPANGTAV 216
Query: 334 TLKAGLDLDC---GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS 390
L AG D+DC Y+ G A+ + V E + ++ Y L+RL +
Sbjct: 217 ALNAGTDIDCDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQYNSLVRLSW--------- 267
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
D+ ++ +LA +AA EGIVLLKND LPL S+ VK VAVVGP ANAT M NY
Sbjct: 268 ---DDVNTEPAQQLAYQAAVEGIVLLKND-GILPLASS-VKKVAVVGPMANATTQMQSNY 322
Query: 451 AGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII-LAGLD 507
GI +SP F +G+ NVT+ G S+ S F+A+ AA + + G+D
Sbjct: 323 NGIAPFLVSPQQAFRNAGF-NVTFANGTG--LNSSDTSGFSAAIAAADDADVVFYVGGID 379
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
++E E DR ++ G Q L+ Q+A + K P+I++ M G VD + NT++ A++W
Sbjct: 380 TTIEREDRDRPEISWTGNQLALVQQLASLGK-PLIVLQMGGGQVDSSSLRDNTSVNALIW 438
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
GYPG+ GG A+ D++ GK P GRLPIT Y YV P+T M LRP S PGRTYK
Sbjct: 439 GGYPGQSGGTALVDLITGKQAPAGRLPITQYPASYVDGFPMTDMTLRPSSS--NPGRTYK 496
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
+Y G ++ FG+GL YT F S + V ++L S + GV
Sbjct: 497 WYTGAPIFEFGFGLHYTTFDAEWASGGDSFSV--------QDL-----VSSAKNSGVAHV 543
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
DL D F V N G+ V +++S+ A + K+++ + RV K I
Sbjct: 544 DLGVLD--TFNVTVTNSGTVASDYVALLFSRTTAGPSPAPNKELVSYTRV-------KGI 594
Query: 748 KFVFNACKSLNI-------VDYAANTLLPAGEHTIFVGNG 780
+ ++ SL + D N +L GE+ + + G
Sbjct: 595 EPGASSAASLKVTLGAVARTDEQGNRVLYPGEYVLLLDTG 634
>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
CL02T12C30]
Length = 738
Score = 384 bits (986), Expect = e-103, Method: Compositional matrix adjust.
Identities = 250/766 (32%), Positives = 380/766 (49%), Gaps = 98/766 (12%)
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
L Q + F + LP +RV+D++SR+TL+EKVQ + A VPRLG+P Y WW+EALH
Sbjct: 19 LTAQTYDYPFRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALH 78
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGRA 160
GV+ T FP I A+F+ +K+G S+E RA++N G+
Sbjct: 79 GVARTK---------EKVTVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKT 129
Query: 161 -----GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
GLTYW+PNIN+ RDPRWGR ET GEDP++ + V GL EG+ N
Sbjct: 130 GEIYRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGL---EGN------N 180
Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
LK +C KHYA V + +R+ +DARV+ D+ +T+L F V + VMC
Sbjct: 181 PEYLKSVACAKHYA---VHSGPEHNRHSYDARVSMYDLWDTYLPAFRELVTKAKVHGVMC 237
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
+YNR G P C +LL +R +W GY+ +DC ++ HK ++ E AVA +
Sbjct: 238 AYNRFEGTPCCGHNELLQDILRNQWKFDGYVTSDCWAVSDFAKYHKTHSNDTE-AVADAV 296
Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGK 393
G DL+CG Y V++G + E DI+ SL L+ + +LG +D + + Y S+G+
Sbjct: 297 LNGTDLECGNLYQKLQ-QGVEKGLISEKDINVSLARLFEIQFKLGMYDPADRVPYASIGR 355
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
+ I D + + A E A++ +VLLKN++N LPLN++K+K +A++GP+ + ++ NY G
Sbjct: 356 EVIECDAHKKHAYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPNMDNGSTLLANYFGT 415
Query: 454 PCRYMSPIAG----FSGYANVTYKTGCDDV-ACKSNNSIFAASEAAKTADATIILAGLDL 508
P ++P F + TG V + S + AK AD I + G+
Sbjct: 416 PSEIITPYKSLQKRFGNSIQIDTLTGVGIVQKLEGAPSFAQVAAQAKKADIIIFVGGISA 475
Query: 509 SVE-------------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAF 555
E S DR + LP QT+L+ ++ + + P+ILV MS G ++F
Sbjct: 476 DYEGEAGDAGAAGYGGFASGDRTTMKLPPVQTELMKELKKTGR-PLILVNMS--GSVMSF 532
Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
+ N AIL A Y G+ G AI DV+FG +NP GR+P+T Y D + LP
Sbjct: 533 DWESRNADAILQAWYGGQAAGDAITDVLFGDYNPAGRMPLTTYMND--EDLP-------D 583
Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
+ RTY+++ G YPFGYGLSYT F Y L T+
Sbjct: 584 FEDYSMANRTYRYFKGDVRYPFGYGLSYTTFGYAPLQNASTV------------------ 625
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAEIAATYIKQVIGF 734
+ + + N G G +VV +Y S P ++ + GF
Sbjct: 626 --------------KTGESIQVTTTVTNTGKRAGDEVVQLYISHPQNGNTRVPLRALKGF 671
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
+R+ + G ++++ F + + L++VD N + G +++G G
Sbjct: 672 KRIHLDTGESRQVTFTLSP-EELSLVDEKGNQVEKEGTVELYIGGG 716
>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
Length = 744
Score = 384 bits (986), Expect = e-103, Method: Compositional matrix adjust.
Identities = 249/748 (33%), Positives = 363/748 (48%), Gaps = 101/748 (13%)
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
L + FCD++L +R D VSRMT+ EK+ L + LGLP Y WWSEA
Sbjct: 30 LNATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEASS 89
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
GV P T F ++P + T SFN +LW+ G A+ EARA+ N G A TY
Sbjct: 90 GVMGSRPTTKF--------AYP--VTTAMSFNRTLWRATGAAIGREARALMNAGAAYSTY 139
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
W+P +N+AR+PRWGR E PGEDP++ G YA +V G Q A + L+ S+C
Sbjct: 140 WAPVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQ-------AAPEDPYHLQASAC 192
Query: 225 CKHYAAYDVDNWKGVD-----RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
CKHY A +++N + D R H D+ VT++D+ ++++ PF+ CV++G SS+MCSYN
Sbjct: 193 CKHYVANELENTRQPDGEQWDRQHVDSNVTQRDLVDSYMVPFQACVEKGKVSSLMCSYNA 252
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
VNG+PSCA+ LL R W GYI +DCD+ + D H + A + E+AVA LKAG
Sbjct: 253 VNGVPSCANDWLLRTVARDAWHFDGYITSDCDADSNVYDAHHYAA-TPEEAVADVLKAGT 311
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS----LGKQD 395
D+DC + +A+ +G + E D+D L L+ V +RLG FD S L + D
Sbjct: 312 DVDCQSFVGQHARSALDKGLITEADMDARLVNLFKVRLRLGHFDLSFDAAKPRGPLDEID 371
Query: 396 ----ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+CSD +++ + E + LLKND LPL + T AVVGP+A + A G Y
Sbjct: 372 ADAVVCSDAHLDASMEGLAQSATLLKND-GALPLKPSG--TAAVVGPNALLSKADAGYYG 428
Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
ADA ++ G DL+
Sbjct: 429 -----------------------------------------PTDAADAVVLAVGTDLTWA 447
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA--FAETNTNIKAILWAG 569
AE D + Q +LI+ VA + PV++V+ SA +D+ A ++ + A++ G
Sbjct: 448 AEGKDATSIVFTAAQLELIDAVATASATPVVVVVFSATPLDLTPLLARSDGKVGAVVHVG 507
Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL---------- 619
P + + D+++G+ + GR T Y Y + + +RP S
Sbjct: 508 QPSVTV-KGLGDLLYGRRSFAGRAVQTVYPAAYADQISIFDFNMRPGPSAFARPDCATNE 566
Query: 620 ------GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
PGRTY+FY + PFG+GLSYT F Y + S T V+L L+
Sbjct: 567 SACPRGTNPGRTYRFYVDEPVVPFGFGLSYTTFAYAVRSAPTT--VDLAPLRAAYAGVAA 624
Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVI 732
+ L +D Y VD N G D DVV+ + PP A + +K++
Sbjct: 625 ARGDGGPAFLSLHDDAAAATY---AVDVTNTGDIDADDVVLGFVTPPGAGVDGVPLKELF 681
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIV 760
GF+RV V+AG K + +++ A V
Sbjct: 682 GFERVHVKAGETKTV-YLYPALSKFKTV 708
>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 839
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 257/664 (38%), Positives = 348/664 (52%), Gaps = 90/664 (13%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD + R LV ++T+DEK+ L D A G R+GLP+Y WWSE LHGV+ PG
Sbjct: 37 CDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSEGLHGVAG-SPGVT 95
Query: 115 FDDV---IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
F+ ATSF I ASF++ L ++G A+STEARA N G GL YW+PN+N
Sbjct: 96 FNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFGFGGLDYWTPNVNP 155
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
+DPRWGR ETPGEDP + Y + GL EG+E KV + CKHYAAY
Sbjct: 156 YKDPRWGRGAETPGEDPLHIKGYVKAMLAGL---EGNETVR-------KVIATCKHYAAY 205
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV----------- 280
D++ W G+ RY F+A VT QD+ E +L PF+ C ++ S+MCSYN +
Sbjct: 206 DLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMAGGSKP 265
Query: 281 -------NGIPSCADPKLLNQTVRGEWDL---HGYIVADCDSI-QVMVDNHKFLADSKED 329
P+CA+ L+ +R W+ + YI +DC++I + DNH F + + +
Sbjct: 266 DEIINLTTAQPACANTYLMT-ILRDHWNWTEHNNYITSDCNAILDFLPDNHNF-SQTPAE 323
Query: 330 AVAQTLKAGLDLDC---GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--- 383
A A KAG D C G T+ G A Q + E ID +L+ LY L+R G+ D
Sbjct: 324 AAAAAYKAGTDTVCEVSGSPLTDVVG-AYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGR 382
Query: 384 -----------GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
SP Y +L D+ + ELA +A EGIVLLKN + LPL+ + K
Sbjct: 383 SAVAGGDGGSFSSPAYDALNWNDVNTPSTQELALRSATEGIVLLKNSGSLLPLDFSG-KK 441
Query: 433 VAVVGPHANATVAMIGNYAGIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIFAAS 491
VA++G ANAT M G Y+GIP Y +P+ A +++Y G A + A
Sbjct: 442 VALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANGPVVNASDPDTWTAPAL 501
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
AA+ AD + G D +V +E LDRE + P Q +L++++A + K PV+ VI V
Sbjct: 502 AAAEGADVVLYFGGTDTTVASEDLDRESIAWPEAQMKLLSELAGLGK-PVV-VIQLGDQV 559
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
D + N N+ +ILW GYPG+ GG A+ DV+ GK P GRLP+T Y YV +PLT M
Sbjct: 560 DDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEM 619
Query: 612 PLRPVD-----------------------------SLGYPGRTYKFYNGPTLYPFGYGLS 642
LRP + +L PGRTYK+Y+ P L PFGYGL
Sbjct: 620 ALRPFNHSSSNLEEEVSVQGGASLTIQARSTPGNKTLSSPGRTYKWYSTPVL-PFGYGLH 678
Query: 643 YTQF 646
YT F
Sbjct: 679 YTTF 682
>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 475
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 199/448 (44%), Positives = 279/448 (62%), Gaps = 17/448 (3%)
Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGK 393
AGLDL+CG + T AVQ GK+ E+D+D+++ LMRLGFFDG P+ + +LG
Sbjct: 31 AGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGP 90
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
D+C+ N ELA EAAR+GIVLLKN LPL++ +K++AV+GP+ANA+ MIGNY G
Sbjct: 91 SDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNYEGT 149
Query: 454 PCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAGLDLSVEA 512
PC+Y +P+ G Y+ GC +V C N+ + AA++AA +AD T+++ G D S+E
Sbjct: 150 PCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIER 209
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
ESLDR L LPG Q QL++ VA + GP ILV+MS G DI+FA+++ I AILW GYPG
Sbjct: 210 ESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPG 269
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
E GG AIADV+FG NP GRLP+TWY + + +P+T M +RP S GYPGRTY+FY G
Sbjct: 270 EAGGAAIADVLFGYHNPSGRLPVTWYPESFTK-VPMTDMRMRPDPSTGYPGRTYRFYTGD 328
Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
T+Y FG GLSYT F ++L+S K + + L + C +CP V C+
Sbjct: 329 TVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLT---------EQCPSVEAEGAHCE 379
Query: 693 DY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
F+ + +N G G V ++S PPA + K ++GF++V + G+ + F
Sbjct: 380 GLAFDVHLRVRNAGERSGGHTVFLFSSPPA-VHNAPAKHLLGFEKVSLEPGQAGVVAFKV 438
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK L++VD N + G HT+ VG+
Sbjct: 439 DVCKDLSVVDELGNRKVALGSHTLHVGD 466
>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 756
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 252/736 (34%), Positives = 385/736 (52%), Gaps = 44/736 (5%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CD + + R LV+ M EK+ L + GV RLGLP Y WW EALHGV+
Sbjct: 29 LKSNAICDVTASPAKRAAALVAAMQTQEKLDNLVSKSKGVARLGLPAYNWWGEALHGVAG 88
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
PG +F ATSFP +L +A+F++ L +I + EARA N G A + +W+P+
Sbjct: 89 A-PGINFTGPYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPVDFWTPD 147
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN RDPRWGR +ETPGED + Y + + GL+ + K+ + CKHY
Sbjct: 148 INPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDKAQR----------KIIATCKHY 197
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
YD+++W G DR+ FDA++T QD+ E F+ PF+ C ++ S MCSYN VNG+P+CAD
Sbjct: 198 VGYDMEDWNGTDRHSFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCAD 257
Query: 289 PKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+L +R W D + YI +DC++++ + HK++A +E A A G+DL C
Sbjct: 258 TYVLEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQE-ATAIAFNNGMDLSCEY 316
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIEL 404
++ A QG + + ID++L Y L+ G+FDG + Y +LG QDI + E +L
Sbjct: 317 SGSSDIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDGAAATYANLGVQDINTPEAQKL 376
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM-SPI-A 462
+ A EG+ LLKND +TLPL+ VA+VG AN + + G Y+G P Y+ +P+ A
Sbjct: 377 VLQVAAEGLTLLKND-DTLPLSLKSGSKVAMVGFWANDSSKLSGIYSG-PAPYLHNPVYA 434
Query: 463 GFSGYANVTYKTG-CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
G ++ TG + ++N A +AAK +D + GLD S AE DR D+
Sbjct: 435 GNKLGLDMAVATGPILQKSGAADNWTTKALDAAKKSDTILYFGGLDPSAAAEGSDRTDIS 494
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
P Q LI ++A A G ++VI VD + +++WA +PG++GG A+
Sbjct: 495 WPSAQIDLITKLA--ALGKPLVVIALGDMVDHMPILNMKGVNSLIWANWPGQDGGTAVMQ 552
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
V+ G+ GRLPIT Y Y Q L + M LRP + PGRTY++YN ++ PFG+GL
Sbjct: 553 VITGEHAIAGRLPITQYPAKYTQ-LSMLDMNLRPGGN--NPGRTYRWYN-ESVQPFGFGL 608
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
YT+F S ++ VN+ + ++ DL C D +V
Sbjct: 609 HYTKFAAKFGS-NSSLTVNIQDIMKSCTKDHP--------------DL-C-DVPPIEVAV 651
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
N G+ + + + K +K ++ + R+ +G + + +L+ VD
Sbjct: 652 TNKGNRTSDFIALAFIKGEVGPKPYPLKTLVSYARLRDISGSQTKTASLALTLGTLSRVD 711
Query: 762 YAANTLLPAGEHTIFV 777
+ N + GE+T+ +
Sbjct: 712 QSGNLVAYPGEYTLLL 727
>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 738
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 257/756 (33%), Positives = 376/756 (49%), Gaps = 99/756 (13%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ F + +L RV DLV RMTL+EKV Q+ + A + RLG+P Y WW+E LHGV+
Sbjct: 31 YPFNNPALSMDERVADLVGRMTLEEKVSQMLNSAPAIERLGVPAYNWWNECLHGVAR--- 87
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLT 163
T F T +P I A+++++ +G + E RA+YN GLT
Sbjct: 88 -TPFK-----VTVYPQAIAMAATWDKTSMHVMGDYTAEEGRAVYNESIKNDKHDIYLGLT 141
Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
YW+PNIN+ RDPRWGR ET GEDPF+ G +V+GLQ + R LK +
Sbjct: 142 YWTPNINIFRDPRWGRGQETYGEDPFLTGEMGSAFVKGLQGDD---------PRYLKAAG 192
Query: 224 CCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
C KHYA + G + R+ F+ +++ D+ +T+L F V + + VMC+YN
Sbjct: 193 CAKHYAVHS-----GPEDLRHKFNTDISDYDLWDTYLPAFRKLVVDAKVTGVMCAYNAFK 247
Query: 282 GIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV--DNHKFLADSKEDAVAQTLKAGL 339
G P C L+N + +W GY+ +DC I + H+ D+ E A A + G
Sbjct: 248 GQPCCGSDLLMNSILHDKWKFTGYVTSDCGGIDDFYRENTHQTQPDA-ESAAADAVLHGT 306
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDIC 397
D++CG AV+ GK+ E ID+SLK L++V +LG FD + +Y +GK +
Sbjct: 307 DVECGNVTYKSLVKAVKDGKLSEKQIDQSLKRLFSVRFKLGMFDPADAVKYNQIGKDALE 366
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
+ + A + A + IVLLKN+ N LPL S +K +AV+GP+A+ V+++GNY G P R
Sbjct: 367 APAHGAQALKMAHQSIVLLKNEGNLLPL-SKNLKKIAVLGPNADNAVSVLGNYNGTPSRI 425
Query: 458 MSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEA-AKTADATIILAGLDLSVEA 512
++ + G V Y D VA + +AA A K ADA I + G+ +E
Sbjct: 426 VTALQGIKNKLPAGTEVIYDKAVDYVADSAARYNYAAMAAKVKDADAIIYIGGISPELEG 485
Query: 513 ESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
E + DR + LPG QT+L+ + K PV+ V+M+ G IA N+
Sbjct: 486 EEMPVSKPGFHGGDRSTILLPGVQTELLKALKATGK-PVVFVMMT--GSAIATPWEAENL 542
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
AI+ A Y G+ G AIADV+FG +NP GRLP+T+Y D L S +D+
Sbjct: 543 PAIVNAWYGGQAAGTAIADVLFGDYNPAGRLPVTFYGSDK----DLPSFTDYSMDN---- 594
Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
RTY+++ G LY FGYGLSY++F+Y L DA T
Sbjct: 595 -RTYRYFKGKPLYAFGYGLSYSKFEYAPL-----------------------DAPLT--- 627
Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
L+ + V N DG +V +Y T I+ + GF+R ++AG
Sbjct: 628 ------LKAGEALTVHVKVTNKSKMDGEEVTELYLSHIGIKQKTAIRALKGFERTLIKAG 681
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
K I F ++ L+I D N + +G+ I VG
Sbjct: 682 ETKDITFKLSSA-DLSITDLNGNLVKASGKIAISVG 716
>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
Length = 702
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 244/751 (32%), Positives = 376/751 (50%), Gaps = 104/751 (13%)
Query: 63 IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
+RVKDLV+R+TL+EKV + + +PRLG+P Y+WW+EALHGV+ +
Sbjct: 1 MRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVART---------LEKV 51
Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGRAG-----LTYWSPNINVAR 173
T FP I A+F+ +K+G STE RA++N G+ G LTYW+PNIN+ R
Sbjct: 52 TVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFR 111
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGR ET GEDP++ + VRGL+ + H LK +C KHYA +
Sbjct: 112 DPRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPHY---------LKSVACAKHYAVHSG 162
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
+ +R+ FDAR + D+ +T++ F V + VMC+YNR+NG P C + LL
Sbjct: 163 PEY---NRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGNDPLLV 219
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
+R +W GY+ +DC +++ + HK + A++ L AG DL+CG Y +
Sbjct: 220 DILRNQWHFDGYVTSDCWALKDFAEFHKTHPEHT-IAMSDALLAGTDLECGNLY-HLLAE 277
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAARE 411
V++G E DI+ SL L+T+L ++G FD + + Y S+G++ + + + + A A+E
Sbjct: 278 GVKKGLHSERDINVSLSRLFTILFKIGMFDPAERVPYSSIGREVLECEAHKQHAERMAKE 337
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR----YMSPIAGFSGY 467
IVLL+N + LPL+++K+K++A++GP+A+ + NY G P YMS
Sbjct: 338 SIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKRRLGDK 397
Query: 468 ANVTYKTGCDDV-ACKSNNSIFAASEAAKTADATIILAGLDLSVE-------------AE 513
+ Y G V K S + A +D + ++G+ E
Sbjct: 398 IKINYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAGAAGYGGFA 457
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
S DR + LP Q +L+ ++ + + P+I+V MS G ++F + N A+L A Y G+
Sbjct: 458 SGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMS--GSVMSFEWESQNADALLQAWYGGQ 514
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
G AI DV+FG NP GR+P+T Y D +P P ++ GRTY+++ G
Sbjct: 515 AAGDAIVDVLFGHCNPAGRMPLTTYKSD-------NDLP--PFENYSMLGRTYRYFKGEP 565
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
YPFGYGLSYT F Y S +C V++ D
Sbjct: 566 RYPFGYGLSYTTFAY----------------------------SDVQC----VDETHTGD 593
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPP----AEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
V N G DG +VV +Y P +I +K GF+R+ ++ G + + F
Sbjct: 594 TARVTVTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALK---GFKRIHLKRGESTSVSF 650
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
+ L + + N + G+ T+FVG G
Sbjct: 651 TLTP-EELALTETDGNLVEKNGQVTLFVGGG 680
>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
CL03T12C07]
gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
CL03T00C08]
Length = 722
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 255/737 (34%), Positives = 389/737 (52%), Gaps = 90/737 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RV+ L+ +MTL EKV QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
RDPRWGR ET GEDP + R V +V+GLQ P LK + KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
+ +N +R+ +++ + + E + +E CVKE +A SVM +YN NG+P
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
A + +VLLKND LPLN K+K+VAVVGP A+ +G Y+G P +S + G
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
VTY G S +SI ++ K AD ++ G D + E+ D ++LP
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +L+ ++ +V P I+++ G + +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKEIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NP G+LP+T Y + + LP +D + GRTY++ G LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F+++ + T+Q SDA L+C V+ N
Sbjct: 603 SFEFDNIQGNDTLQ---------------SDAI-----------LQCS------VELSNS 630
Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G G +VV VY TY +K+++ F++V + +G K++ F A + L++ +
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 764 ANTLLPAGEHTIFVGNG 780
+L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705
>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
Length = 711
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 248/742 (33%), Positives = 378/742 (50%), Gaps = 87/742 (11%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
K+LV +M L EK QL A + RLG+P Y WW+EALHGV+ G AT
Sbjct: 7 EAKELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGV----------AT 56
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVARDP 175
FP I A F+E +I ++ E RA YN G+T+W+PNIN+ RDP
Sbjct: 57 VFPQAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWAPNINIFRDP 116
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ R V +++GLQ D N LK ++C KH+A V +
Sbjct: 117 RWGRGHETYGEDPYLTARLGVAFIKGLQG--------DENEDYLKAAACAKHFA---VHS 165
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
DR+HFDA V+++D+ ET+L FE VKE + VM +YNRVNG P+C LL
Sbjct: 166 GPEEDRHHFDAIVSKKDLYETYLPAFEAAVKEANVIGVMGAYNRVNGEPACGSKTLLVDI 225
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
++ +W GYIV+DC +I+ H + E A A + G +L+CG Y + A
Sbjct: 226 LKKDWGFDGYIVSDCWAIRDFHTEHMVTHTAAESA-ALAINNGCELNCGNTYLHML-EAH 283
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
Q+G VKE I ++ + L + M+LG FD + +Y + + E+A EA+R +V+
Sbjct: 284 QEGLVKEEIITEAAEKLMRIRMQLGLFDKNCKYNEIPYAVNDCKVHREVALEASRRSMVM 343
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKND LPLN K+K++ ++GP AN + GNY G RY + + G Y V
Sbjct: 344 LKND-GILPLNKDKLKSIGIIGPTANNRTVLEGNYNGTASRYTTFVEGIQDYVGDDVRVY 402
Query: 472 YKTGCDDVACKSNNSIFA---ASEA---AKTADATIILAGLDLSVEAES---------LD 516
Y GC A +N + +EA A+ +D ++ GLD ++E E D
Sbjct: 403 YSEGCHLFANGMSNLAWENDREAEALIVAEQSDVVVLCLGLDSTIEGEQGDTGNAFAGGD 462
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+ L L G Q QL+ +V V K PVILV+ + + I +A+ + N AI YPG +GG
Sbjct: 463 KLSLNLIGRQQQLLEKVVAVGK-PVILVLSTGSAMAINYADEHCN--AIFQTWYPGAQGG 519
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
+A+A ++FG+++P G+LP+T+Y T+ L + RTY++ LYP
Sbjct: 520 KALAQLLFGEYSPSGKLPVTFYK---------TTEELPAFEDYSMKDRTYRYMPNEALYP 570
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FGYGLSY K ++++V L+ + N+++ +K ++
Sbjct: 571 FGYGLSYADIK------VQSVKV-LDGAKGEEITNFSAGQTK----------------YK 607
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
KV+ +N + D DVV +Y K A + F+ VF++AG +K + K+
Sbjct: 608 VKVELENKSNVDSYDVVQIYIKDMESQYAVPNFSLCSFKSVFLKAGESKEVTLNVGE-KA 666
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
+++ ++ + + +F+G
Sbjct: 667 FTVINEEGKRIVDSKKFKLFIG 688
>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
616]
Length = 722
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 250/731 (34%), Positives = 384/731 (52%), Gaps = 78/731 (10%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RVK L+ +MTL EK QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ET GEDP++ R V +V+GLQ G A LK + KH+ A +
Sbjct: 160 RDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---GDHPAY------LKTVATIKHFVANN 210
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+N +R+ +++ + + E + +E CVKE D SVM +YN NG+P LL
Sbjct: 211 EEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEADVQSVMTAYNAFNGVPPSGSRWLL 266
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 267 GEVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEKLV 325
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAAR 410
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EAA
Sbjct: 326 QAVKQGLISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEAAV 385
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANV 470
+ +VLLKN+ N LPL+ K K+VAVVGP A+ +G Y+G P ++ + G
Sbjct: 386 KSVVLLKNE-NLLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSITLLKGVKDLMGK 442
Query: 471 TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
K + S +SI A A K D ++ G D + E+ D ++LP Q +L+
Sbjct: 443 RGKVNYLNGIGASRDSIVA---AVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLL 499
Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
+ +V P I+++ +G + +T+I AI+ A YPG+E GRA+A+++FG NP
Sbjct: 500 KAIYQV--NPRIVLVFHSGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLFGNENPS 556
Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
G+LP+T Y + + LP +D + GRTY++ G LY FG+GLSYT F+++
Sbjct: 557 GKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYSFGHGLSYTSFEFD- 607
Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
IQ N + L+ D + V+ N G G
Sbjct: 608 -----NIQGN--------------------------DTLQPDAILQCSVELSNSGQLAGE 636
Query: 711 DVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+VV VY TY +K+++ F++V + +G K++ F A + L++ + +L
Sbjct: 637 EVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDGKWRML- 694
Query: 770 AGEHTIFVGNG 780
+G++T+F+G+G
Sbjct: 695 SGKYTLFIGSG 705
>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
Length = 722
Score = 380 bits (976), Expect = e-102, Method: Compositional matrix adjust.
Identities = 253/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RV+ L+ +MTL EKV QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
RDPRWGR ET GEDP + R V +V+GLQ P LK + KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
+ +N +R+ +++ + + E + +E CVKE +A SVM +YN NG+P
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
A + +VLLKND LPLN K+K+VAVVGP A+ +G Y+G P +S + G
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
VTY G S +SI ++ K AD ++ G D + E+ D ++LP
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +L+ ++ +V P I+++ G + +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKKIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NP G+LP+T Y + + LP +D + GRTY++ G LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F+++ IQ N + L+ D + V+ N
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630
Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G G +VV VY TY +K+++ F++V + +G K++ F A + L++ +
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 764 ANTLLPAGEHTIFVGNG 780
+L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705
>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
Length = 722
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 253/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RV+ L+ +MTL EKV QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
RDPRWGR ET GEDP + R V +V+GLQ P LK + KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
+ +N +R+ +++ + + E + +E CVKE +A SVM +YN NG+P
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EA
Sbjct: 324 LVQAVEQGLISEVAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
A + +VLLKND LPLN K+K+VAVVGP A+ +G Y+G P +S + G
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
VTY G S +SI ++ K AD ++ G D + E+ D ++LP
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +L+ ++ +V P I+++ G + +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKKIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NP G+LP+T Y + + LP +D + GRTY++ G LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F+++ IQ N + L+ D + V+ N
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630
Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G G +VV VY TY +K+++ F++V + +G K++ F A + L++ +
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 764 ANTLLPAGEHTIFVGNG 780
+L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705
>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
CL07T00C01]
gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
CL07T12C05]
Length = 722
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 253/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RV+ L+ +MTL EKV QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
RDPRWGR ET GEDP + R V +V+GLQ P LK + KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
+ +N +R+ +++ + + E + +E CVKE +A SVM +YN NG+P
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
A + +VLLKND LPLN K+K+VAVVGP A+ +G Y+G P +S + G
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
VTY G S +SI ++ K AD ++ G D + E+ D ++LP
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +L+ ++ +V P I+++ G + +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKEIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NP G+LP+T Y + + LP +D + GRTY++ G LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F+++ IQ N + L+ D + V+ N
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630
Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G G +VV VY TY +K+++ F++V + +G K++ F A + L++ +
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 764 ANTLLPAGEHTIFVGNG 780
+L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705
>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
Length = 763
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 247/736 (33%), Positives = 384/736 (52%), Gaps = 44/736 (5%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ CD + P R LV+ M EK+ L + GV RLGLP Y WW EALHGV+
Sbjct: 31 LSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEALHGVAG 90
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
PG F + ATSFP IL +A+F++ L KI + EARA N G A + YW+P+
Sbjct: 91 A-PGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVAPVDYWTPD 149
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN RD RWGR +E+PGED + Y + GL+ + K+ + CKHY
Sbjct: 150 INPVRDIRWGRASESPGEDIRRIKGYTKALLAGLEGDQAQR----------KIIATCKHY 199
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
YD++ W G DR++F A++T QD+ E ++ PF+ C ++ S MCSYN VNGIP+CAD
Sbjct: 200 VGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGIPTCAD 259
Query: 289 PKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+L +R W D + YI +DC+++ + +NHK++ ++ A G+DL C
Sbjct: 260 TYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYV-ETLAQGTALAFAKGMDLSCEY 318
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIEL 404
++ A QG + + IDK+L Y L+ G+FDG+ Y +L +DI + E +L
Sbjct: 319 TGSSDIPGAWAQGLLNISVIDKALTRQYEGLVHAGYFDGAKATYANLSYKDINTPEARQL 378
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AG 463
+ + EG+V+LKND +TLPL K VA++G AN + + G Y+G P SP+ AG
Sbjct: 379 SLQVTSEGLVMLKND-HTLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRHSPVFAG 437
Query: 464 FSGYANVTYKTG-CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
++ G + +N A +AA+ +D + G D +V E DR +
Sbjct: 438 EQMGLDMAIAWGPMIQNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQEGYDRTTISF 497
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGV-DIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
P Q L+ ++A++ K LV+++ G + D + + + +I+WA +PG++GG AI +
Sbjct: 498 PQVQIDLLTKLAKLGKP---LVVITLGDMTDHSPLLSMEGVNSIIWANWPGQDGGPAILN 554
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
VV G P GRLPIT Y DYV+ L + M LRP PGRTY+++N ++ PFG+GL
Sbjct: 555 VVSGAHAPAGRLPITEYPADYVK-LSMLDMNLRPHTE--SPGRTYRWFN-ESVQPFGFGL 610
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
YT F+ + S + + ++ ++ L+ + K C + +V
Sbjct: 611 HYTTFEASFAS-EEGLTYDIEEI-----LDGCTQQYKDLC-----------EVAPLEVTV 653
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
N G+ V + + K +K +I + R+ G K+ + L VD
Sbjct: 654 ANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGGAKKSASLPLTLGELARVD 713
Query: 762 YAANTLLPAGEHTIFV 777
+ NT++ GE+T+ +
Sbjct: 714 QSGNTVIYPGEYTLLL 729
>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
Length = 722
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 253/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RV+ L+ +MTL EKV QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
RDPRWGR ET GEDP + R V +V+GLQ P LK + KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
+ +N +R+ +++ + + E + +E CVKE +A SVM +YN NG+P
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
A + +VLLKND LPLN K+K+VAVVGP A+ +G Y+G P +S + G
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
VTY G S +SI ++ K AD ++ G D + E+ D ++LP
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +L+ ++ +V P I+++ G + +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKKIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NP G+LP+T Y + + LP +D + GRTY++ G LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F+++ IQ N + L+ D + V+ N
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630
Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G G +VV VY TY +K+++ F++V + +G K++ F A + L++ +
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 764 ANTLLPAGEHTIFVGNG 780
+L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705
>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
615]
Length = 722
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 253/737 (34%), Positives = 386/737 (52%), Gaps = 90/737 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RV+ L+ +MTL EKV QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
RDPRWGR ET GEDP + R V +V+GLQ P LK + KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
+ +N +R+ +++ + + E + +E CVKE +A SVM +YN NG+P
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
A + +VLLKND LPLN K+K+VAVVGP A+ +G Y+G P +S + G
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
VTY G S +SI ++ K AD ++ G D + E+ D ++LP
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +L+ ++ +V P I ++ G + +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKEIYQV--NPRIALVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NP G+LP+T Y + + LP +D + GRTY++ G LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F+++ IQ N + L+ D + V+ N
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630
Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G G +VV VY TY +K+++ F++V + +G K++ F A + L++ +
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 764 ANTLLPAGEHTIFVGNG 780
+L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705
>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
C5]
Length = 763
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 249/739 (33%), Positives = 379/739 (51%), Gaps = 50/739 (6%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ CD + P R LV+ M EK+ L + GV RLGLP Y WW EALHGV+
Sbjct: 31 LSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEALHGVAG 90
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
PG F + ATSFP IL +A+F++ L KI + EARA N G A + YW+P+
Sbjct: 91 A-PGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVAPMDYWTPD 149
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
IN RD RWGR +E+PGED + Y + GL+ + K+ + CKHY
Sbjct: 150 INPVRDIRWGRASESPGEDIRRIKGYTKALLAGLEGDQAQR----------KIIATCKHY 199
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
YD++ W G DR++F A++T QD+ E ++ PF+ C ++ S MCSYN VNG+P+CAD
Sbjct: 200 VGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVPTCAD 259
Query: 289 PKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+L +R W D + YI +DC+++ + +NHK++ ++ A G+DL C
Sbjct: 260 TYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYV-ETLAQGTALAFAKGMDLSCEY 318
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIEL 404
++ A QG + + IDK+L Y L+ G+FDG+ Y +L DI + E +L
Sbjct: 319 SGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDGAKATYANLSYNDINTPEARQL 378
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AG 463
+ + EG+V+LKND +TLPL K VA++G AN + + G Y+G P SP+ AG
Sbjct: 379 SLQVTSEGLVMLKND-HTLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRHSPVFAG 437
Query: 464 FSGYANVTYKTG-CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
++ G + +N A +AA+ +D + G D +V E DR +
Sbjct: 438 EQMGLDMAIAWGPMIQNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQEGYDRTTISF 497
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGV-DIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
P Q L+ ++A++ K LV+++ G + D + + I +I+WA +PG++GG AI +
Sbjct: 498 PQVQIDLLAKLAKLGKP---LVVITLGDMTDHSPLLSMEGINSIIWANWPGQDGGPAILN 554
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
V+ G P GRLPIT Y DYV+ L + M LRP PGRTY+++N ++ PFG+GL
Sbjct: 555 VISGVHAPAGRLPITEYPADYVK-LSMLDMNLRP--HAESPGRTYRWFN-ESVQPFGFGL 610
Query: 642 SYTQFKYNLLS---FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
YT F+ S T IQ L+ + K C + +
Sbjct: 611 HYTTFEAGFASEEGLTYDIQ---------ETLDSCTQQYKDLC-----------EVAPLE 650
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
V N G+ V + + K +K +I + R+ G K+ + L
Sbjct: 651 VTVANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGGAKKSASLPLTLGELA 710
Query: 759 IVDYAANTLLPAGEHTIFV 777
VD + NT++ GE+T+ +
Sbjct: 711 RVDQSGNTVIYPGEYTLLL 729
>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 643
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 231/610 (37%), Positives = 336/610 (55%), Gaps = 61/610 (10%)
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
++ R + LV++MTL+EKV Q+ A + RLG+P Y WW+E LHGV G
Sbjct: 4 FAQRARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSGT--------- 54
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVA 172
AT FP I ASF+ESL + + QA+S EARA YN + GLT+WSPNIN+
Sbjct: 55 -ATVFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWSPNINLF 113
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ET GEDP + GR ++RGLQ+ E +S+ K+ + KH+AA+
Sbjct: 114 RDPRWGRGHETYGEDPLLTGRMGTAFIRGLQEGE--------DSQYRKLDATVKHFAAHS 165
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
R+ F+A V+ +DM +++L F C++ ++VM +YNR+NG P+CA L
Sbjct: 166 GPE---AGRHSFNAEVSAEDMADSYLWAFRYCIEHAKPAAVMGAYNRINGEPACASSTYL 222
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
+ EW GY+V+DC +IQ + +NH + KE A A + G L+CG+ Y ++
Sbjct: 223 KGVLYEEWKFDGYVVSDCGAIQDINENHHVTKNEKESA-ALAVNNGCQLNCGKAY-HWVK 280
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREG 412
AV+ G + E + +++ L+ RLG FD Y S+ I ++ EL + A+E
Sbjct: 281 AAVEDGLISEDTVTCAVERLFEARFRLGMFDSDCVYDSIPMNVIECRKHRELNRKMAQES 340
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA--NV 470
IVLLKN+ LPLN KT+AV+GP+A+ ++GNY G P + + + G A V
Sbjct: 341 IVLLKNN-GILPLNPE--KTIAVIGPNADDKTVLLGNYNGTPSHWTTLLRGIQDQARGEV 397
Query: 471 TYKTGCDDVACK----SNNSIFAASEAAKTADATIILAGLDLSVE---------AESLDR 517
Y G V + + + A AK AD ++ GL +E A+S DR
Sbjct: 398 YYARGSVLVEKEALPWAEKPLHEAIYTAKAADVVVLCLGLSPLLEGEEGDAYNGADSGDR 457
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
+D+ LP Q QL+ + + K PV+LV +S G VD+ + + AIL YPG EGG
Sbjct: 458 KDISLPDIQQQLLCAILDTEK-PVVLVNVSGGCVDL--RQADERCAAILQCFYPGAEGGN 514
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
A+AD++FG+ +P GRLP+T+Y V+ LP P GRTY+F++G LYPF
Sbjct: 515 ALADILFGRVSPSGRLPVTFYRT--VEDLP-------PFTDYSMKGRTYRFFDGKPLYPF 565
Query: 638 GYGLSYTQFK 647
G+GL+Y K
Sbjct: 566 GHGLTYADIK 575
>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 723
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 256/756 (33%), Positives = 389/756 (51%), Gaps = 101/756 (13%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D P +RV+DL+S++TL+EKV Q+ D + VPRL LP+Y WW+EALHGV+ G
Sbjct: 24 YLDPFNPTDVRVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSGV-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
AT FP I A+F++ L K+ A+S EARAMYN GLT+W
Sbjct: 82 --------ATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKYGGLTFW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPLKVSSC 224
+PNIN+ RDPRWGR ET GEDPF+ + V +++GLQ D H LKV++C
Sbjct: 134 TPNINIFRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQGDDPEH----------LKVAAC 183
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + R+ F+A + +D+ ET+L F+ V +VMC+YNR N
Sbjct: 184 AKHFA---VHSGPERLRHSFNAIASPKDLRETYLPAFKALVN-ARVEAVMCAYNRTNSEV 239
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
C LL+Q +R EW G++V+DC +I HK + E AVA +K G+DL+CG
Sbjct: 240 CCGSNLLLDQILRDEWHFTGHVVSDCGAIVDFYMGHKVVPGQPE-AVALAVKHGVDLNCG 298
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDICSDEN 401
Y AV++G + E +IDK+L L +LG FD SP Y ++ I S ++
Sbjct: 299 DEYPALI-EAVKRGLITEKEIDKALATLLKTRFKLGLFDPKQNSP-YNNIPVSVINSTDH 356
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
LA E A + IVLLKN++ LPL + + + GP+A + A++GNY G+ + +
Sbjct: 357 RALAKEVALKSIVLLKNEK-CLPLKN-NLSKYYITGPNAASVDALMGNYYGVNPHMSTIL 414
Query: 462 AGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-- 515
G +G + + YK G + +NN I + AK +D T ++ G+ +E E
Sbjct: 415 EGIAGAIQPGSQMQYKPGIL-LDRDNNNPIDWTTGDAKASDVTFVVMGITGLLEGEEGEA 473
Query: 516 -------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
DR D LP Q + ++ + K V+ +I GG + +E + A+L A
Sbjct: 474 IASPNYGDRLDYNLPKNQIDFLRKIRKGNKNKVVAII--TGGSPMNLSEVHELADAVLLA 531
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
YPGEEGG A+AD++FGK +P GRLP+T+ P + L P + GRTY++
Sbjct: 532 WYPGEEGGNAVADILFGKVSPSGRLPVTF---------PKSFAQLPPYEDYSMKGRTYRY 582
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
+Y FGYGLSY+ + Y+ L+ L++ Q +N+ ++ T
Sbjct: 583 MTAEPMYTFGYGLSYSTYTYSSLT--------LSEKQIKKNMTIIAETMVT--------- 625
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
N G +G +VV +Y + P E Y + GF+RV ++AG ++++
Sbjct: 626 --------------NTGKMEGEEVVQLYITVPQTEKNPQY--SLKGFKRVNLKAGESRKV 669
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+F + VD + +L +G + + +G S
Sbjct: 670 QFQITP-DLMKSVDANGSEVLLSGSYVVRIGGASPS 704
>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
Length = 722
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 253/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RV+ L+ +MTL EKV QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
RDPRWGR ET GEDP + R V +V+GLQ P LK + KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
+ +N +R+ +++ + + E + +E CVKE +A SVM +YN NG+P
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
A + +VLLKND LPLN K+K+VAVVGP A+ +G Y+G P +S + G
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
VTY G S +SI ++ K AD ++ G D + E+ D ++LP
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +L+ ++ +V P I+++ G + +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 GQEKLLKEIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NP G+LP+T Y + + LP +D + GRTY++ G LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F+++ IQ N + L+ D + V+ N
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630
Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G G +VV VY TY +K+++ F++V + +G K++ F A + L++ +
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 764 ANTLLPAGEHTIFVGNG 780
+L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705
>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 710
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 245/744 (32%), Positives = 383/744 (51%), Gaps = 101/744 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ K+LVS+MTL E+ +QL A + L + +Y WW+E LHGV+ G AT
Sbjct: 15 KAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAGT----------AT 64
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I A F++ L +KI ++TE RA YN GLT+WSPN+N+ RDP
Sbjct: 65 VFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPNVNIFRDP 124
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ R V +V+GLQ E + LK+++C KH+A +
Sbjct: 125 RWGRGHETYGEDPYLTSRLGVAFVKGLQGDEKY----------LKIAACAKHFAVHS--G 172
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
+G+ R+ F+A V+++D+ ET+L FE CVKE D +VM +YNR N P C LL
Sbjct: 173 PEGL-RHEFNAVVSKKDLYETYLPAFEACVKEADVEAVMGAYNRTNDEPCCGSSLLLKDI 231
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+RG+W G++V+DC +I H + + E A A +K G DL+CG Y A
Sbjct: 232 LRGKWQFKGHVVSDCWAIADFHLYHGVTSTATESA-ALAIKNGCDLNCGNVYLQML-LAY 289
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
++G V E DI ++ + L +RLG FD ++ + E+ E++ A+R+ IV+
Sbjct: 290 KEGLVTEEDITRAAERLMATRIRLGMFDEECEFNKIPYTMNDCKEHHEVSLMASRKSIVM 349
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-----SGYANV 470
L+N+ LPL+ +K+K++ ++GP+A++ + + GNY G +Y++ + G S +
Sbjct: 350 LRNN-GLLPLDKSKLKSIGIIGPNADSELMLKGNYFGTASKYITVLEGIHEAVDSENIRI 408
Query: 471 TYKTGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------S 514
Y GC D+A + ++ + A A+ +D I+ GLD S+E E +
Sbjct: 409 FYSEGCHLYKDRVQDLA-EPDDRMAEAVTVAEHSDVVILCLGLDSSIEGEQGDAGNSDGA 467
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
D+ +L LPG Q +L+ +V +A G ++V++ AG + N AIL A YPG
Sbjct: 468 GDKLNLNLPGKQQELLEKV--IATGKPVIVVLGAGSA-LTLQGQEENCAAILNAWYPGSF 524
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
GGRAIAD++FGK +P G+LP+T+Y T+ L RTY++ +L
Sbjct: 525 GGRAIADLIFGKCSPSGKLPVTFYK---------TTEELPEFTDYSMKNRTYRYMKNESL 575
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFG+GL+Y++ + + LS SD SK GV
Sbjct: 576 YPFGFGLTYSKVQLSDLS--------------------VSDISKD-FEGV---------- 604
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
E + NVG+ D +V+ Y K A + F+RV + G +K +K N
Sbjct: 605 -EVSIKISNVGNFDIEEVLQCYIKDLESKYAVDNHSLSAFKRVALNKGESKVVKMTINK- 662
Query: 755 KSLNIVDYAANTLLPAGEHTIFVG 778
++ +V+ + +L + + +FVG
Sbjct: 663 RAFEVVNDEGDRILDSKKFKLFVG 686
>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
Length = 755
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 252/737 (34%), Positives = 384/737 (52%), Gaps = 57/737 (7%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
CD + + R LV M +EK+ L GV RLGLP+Y WW EALHGV+ PG
Sbjct: 33 ICDVTAAPAERAAALVEAMQTNEKLDNL---MRGVTRLGLPKYNWWGEALHGVAGA-PGI 88
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
+F ATSFP +L +A+F++ L KI + EARA N G A + +W+P+IN R
Sbjct: 89 NFTGAYKTATSFPMPLLMSAAFDDDLIFKIANIIGNEARAFGNGGVAPVDFWTPDINPFR 148
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGR +ETPGED + Y + + GL+ + K+ + CKHY YD+
Sbjct: 149 DPRWGRGSETPGEDIVRIKGYTKHLLAGLEGDKPQR----------KIIATCKHYVGYDM 198
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
+ W G+DR+ F+A++ QD+ E ++ PF+ C ++ S MCSYN VNG+P+CAD +L
Sbjct: 199 EAWGGIDRHSFNAKINMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLQ 258
Query: 294 QTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
+R W+ + YI +DC++++ + HK+ A + + AG+D C ++
Sbjct: 259 TILRDHWNWTESNNYITSDCEAVKDISLKHKY-AKTNAEGTGLAFTAGMDNSCEYTGSSD 317
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAAEAA 409
A Q + ID++LK Y L+R G+FDG + Y +LG +DI + E +L+ + A
Sbjct: 318 IPGAFNQSYLSIPTIDRALKRQYEGLVRAGYFDGAAATYANLGVKDINTPEAQQLSLQVA 377
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM-SPI-AGFSGY 467
EG+VLLKND +TLPL+ VA++G AN T + G Y+G P Y+ SP+ AG
Sbjct: 378 SEGLVLLKND-DTLPLSLTNGSKVAMLGFWANDTSKLSGIYSG-PAPYLRSPVWAGQKLG 435
Query: 468 ANVTYKTGCDDVACKSNNS-----IFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
++ +G + +SN+S A AA+ +D + GLD S AE DR +
Sbjct: 436 LDMAIASG--PILQQSNSSTRDNWTTNALAAAEKSDYILYFGGLDPSAAAEGFDRNSIAW 493
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
P Q LI ++A + K V+LV+ +D + + +++WA +PG++GG A+ V
Sbjct: 494 PTAQVDLIKKLAAIGKPLVVLVLGDL--MDNSPLLELDGVNSVIWANWPGQDGGSAVMQV 551
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
V G GRLPIT Y +Y + L + M +RP S PGRTY+++NG + PFG GL
Sbjct: 552 VTGAVAVAGRLPITQYPANYTE-LSMLDMNMRPSSS--SPGRTYRWFNG-AVQPFGTGLH 607
Query: 643 YTQFKYNLLSFTKTIQVNLNKL-QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
YT F + TI+ +++ + + C N Y S P V
Sbjct: 608 YTTFDAKFAA-NSTIEYDISNITKECTN-QYPDTCSVPSIP----------------VAV 649
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIV 760
N G+ + + + K A +K +I + RV V+ G+ K + +L V
Sbjct: 650 TNSGNRTSDFIALAFIKGENGPAPYPLKTLISYTRVRDVKGGQTKSAEMQL-TLGNLARV 708
Query: 761 DYAANTLLPAGEHTIFV 777
D NT+L GE+T+ +
Sbjct: 709 DQMGNTVLYPGEYTVLL 725
>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
Length = 712
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 245/753 (32%), Positives = 377/753 (50%), Gaps = 99/753 (13%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + P RV DL+SRMTL+EK+ Q+ + +PRLG+P Y++WSEALHGV+ G
Sbjct: 7 LYLNPDAPLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNGK- 65
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-----LGRA----GLT 163
AT FP I A+++ L +++ A+++EARA ++ G+ GLT
Sbjct: 66 ---------ATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDIYQGLT 116
Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
WSPNIN+ RDPRWGR ET GEDP++ G +VRGLQ + H LK ++
Sbjct: 117 MWSPNINIFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQGKDPHY---------LKTAA 167
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
C KHY V + +R+ F+A VT +++ +T+L F+ V E +VM +YNR G
Sbjct: 168 CAKHYT---VHSGPEKERHTFNAIVTRRELFDTYLPAFKKLVTEAKVEAVMGAYNRTLGE 224
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD- 342
P C P LL + +R +W G++V+DC +I +H+ D E A A +K G D+
Sbjct: 225 PCCGSPYLLKEILRNQWGFKGHVVSDCGAINDFHLHHQVTKDGAESA-ALGIKNGCDMAC 283
Query: 343 -CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDIC 397
C Y N T A+ +G + E DID +L+ +LG FD PQ Y + +
Sbjct: 284 ICTYSYENLT-EALNRGLITEEDIDHALRNTLRTRFKLGLFD--PQEKVPYAHISMSVVG 340
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
+ + +LA E A + VLLKN + LP+ VK++ +VGP+A ++GNY G+
Sbjct: 341 CEAHRKLAYETAVKSAVLLKNHNHILPVKP-DVKSILIVGPNAGNVHVLLGNYYGLSDSM 399
Query: 458 MSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
+ + G G + + G K + ++ + AA + D I GL +E E
Sbjct: 400 TTFMEGLVGRLPEGVRMEFMPGSLLTDSKKIKNDWSVASAA-SFDLVIAFMGLSPLLEGE 458
Query: 514 --------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
+ DRED+ LP Q + I +A A G I+++++ GG IA ++AI
Sbjct: 459 EGEAILSDNGDREDIALPKAQQEYIRDLA--ATGAKIVLVLT-GGSAIALNGIEDLVEAI 515
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LW GYPG+EGGRAIAD++FG +P G+LPIT+ P+++ L P RT
Sbjct: 516 LWVGYPGQEGGRAIADLIFGDHSPSGKLPITF---------PVSTDQLPPFREYSMKERT 566
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
Y++ L+PFG+GLSYTQF+Y L +
Sbjct: 567 YRYMTSSPLFPFGFGLSYTQFEYKNLQLEHPV---------------------------- 598
Query: 686 VNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNK 745
L + + NVG +G +VV VY ++++I FQRV ++ G
Sbjct: 599 ---LSAGEALRGTFELANVGEYEGEEVVQVYLSDLEASTIVPLQKLISFQRVRLKPGETV 655
Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++ F +++ ++D N +L G+ + +G
Sbjct: 656 QLSFAIQP-EAMMMIDDEGNQVLEPGKFKLTIG 687
>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 735
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 251/758 (33%), Positives = 387/758 (51%), Gaps = 99/758 (13%)
Query: 47 LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
+ S F F D+ L R+ DL+SR+TL+EK QQ+ + + + RLG+P Y+WW+EALHG+
Sbjct: 27 IDKSEFDFYDTDLSMDERIDDLISRLTLEEKAQQMLNASPAIERLGIPAYDWWNEALHGL 86
Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LG 158
G AT FP I A+F++ L K+ A+S EARA +N
Sbjct: 87 GRSGV----------ATVFPQAIGMGATFDDDLILKVSTAISDEARANFNNAVKHGYHRK 136
Query: 159 RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
GLT+W+PN+N+ RDPRWGR ET GEDP++ + +V+GLQ N +
Sbjct: 137 YGGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSKLGEAFVKGLQGD---------NDKY 187
Query: 219 LKVSSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
LK ++ KHYA + G + R+ F+A V+E+D+ ET+L F+ V + + ++MC+
Sbjct: 188 LKTAAAAKHYAVH-----SGPEKLRHEFNADVSEKDLWETYLPAFKTLV-DANVETIMCA 241
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
YN NG P CA+ +L+N +R +W +G++V+DC ++Q V H + +S E A A ++
Sbjct: 242 YNSTNGEPCCANNRLINDILRDKWGFNGHVVSDCWALQDFVSGHD-IVESPEAAAALAVE 300
Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQ 394
G++L+CG Y NF AV+ G V E +DK L L +LG FD S Y +G +
Sbjct: 301 VGIELNCGDTY-NFLAKAVEDGLVSEELVDKRLHKLLETRFKLGLFDPEESNPYNKIGVE 359
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
+ SDE+ LA E AR+ IVLLKND LPL + K + GP+A ++GNY G+
Sbjct: 360 VMNSDEHRALARETARKSIVLLKND-GVLPLKNNLSKYF-ITGPNATNIEVLLGNYHGVN 417
Query: 455 CRYMSPIAGFSG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATII---LAGLD 507
++ + G + + + Y+ G + + N AS A +DAT + ++GL
Sbjct: 418 PDMVTVLEGIAKAIKPESQLQYRMGT-RLNLPNENPQDWASPNAGNSDATFVVMGISGLL 476
Query: 508 LSVEAESL------DREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNT 560
E ES+ DR D LP Q + +V+E A+ PV+ ++ GG + E +
Sbjct: 477 EGEEGESIASPTFGDRMDYNLPQNQIDYLQKVSEAAEDRPVVAIV--TGGSPMNLTEVHK 534
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
A+L YPGEEGG A+AD++FGK +P GRLPIT+ P+T L +
Sbjct: 535 LADAVLLVWYPGEEGGNAVADIIFGKNSPSGRLPITF---------PMTIEDLPAYEDYT 585
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
GRTYK+ + +YPFGYGLSYT F+Y+ + +K +
Sbjct: 586 MEGRTYKYMDVVPMYPFGYGLSYTDFEYSEIKLSKDKIKKKESV---------------- 629
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
E ++ N G + +VV VY K + +++ F+ + ++
Sbjct: 630 ---------------EARISVTNTGDFEADEVVQVYLKDVKASSRVPNFELVAFKNIHLK 674
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
G +K + F + L+ +D L G I++G
Sbjct: 675 RGESKELTFEITP-EMLSFIDDNGKEKLEKGAFEIYIG 711
>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
Length = 722
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 252/737 (34%), Positives = 386/737 (52%), Gaps = 90/737 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RV+ L+ +MTL EKV QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
RDPRWGR ET GEDP + R V +V+GLQ P LK + KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
+ +N +R+ +++ + + E + +E CVKE +A SVM +YN NG+P
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
A + +VLLKND LPLN K+K+VAVVGP A+ +G Y+G P +S + G
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
VTY G S +SI ++ K AD ++ G D + E+ D ++LP
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q + + ++ +V P I+++ G + +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKFLKKIYQV--NPRIVLVFHTGN-PLTSEWADTHILAIMQAWYPGQEAGRALANLLF 550
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NP G+LP+T Y + + LP +D + GRTY++ G LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F+++ IQ N + L+ D + V+ N
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630
Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G G +VV VY TY +K+++ F++V + +G K++ F A + L++ +
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 764 ANTLLPAGEHTIFVGNG 780
+L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705
>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
CL05T00C42]
gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
CL05T12C13]
Length = 722
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 252/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RV+ L+ +MTL EKV QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
RDPRWGR ET GE+P + R V +V+GLQ P LK + KH+ A
Sbjct: 160 RDPRWGRNEETYGEEPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
+ +N +R+ +++ + + E + +E CVKE +A SVM +YN NG+P
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
LL+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
A + +VLLKND LPLN K+K+VAVVGP A+ +G Y+G P +S + G
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
VTY G S +SI ++ K AD ++ G D + E+ D ++LP
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +L+ ++ +V P I+++ G + +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 GQEKLLKEIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NP G+LP+T Y + + LP +D + GRTY++ G LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F+++ IQ N + L+ D + V+ N
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630
Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G G +VV VY TY +K+++ F++V + +G K++ F A + L++ +
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689
Query: 764 ANTLLPAGEHTIFVGNG 780
+L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705
>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
S85]
Length = 727
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 260/758 (34%), Positives = 385/758 (50%), Gaps = 102/758 (13%)
Query: 51 SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
SFL D S+ R + LVS+MTL EK+ QL + A + RL +P Y+WW+EALHGV+ G
Sbjct: 20 SFLDTDKSI--EERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEALHGVARNG 77
Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGL 162
AT FP I A+F+ L ++ A+STEARA Y + + AGL
Sbjct: 78 K----------ATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMGNHSRYAGL 127
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+W+PN+N+ RDPRWGR ET GEDP+++ + V +V+GLQ D N LK +
Sbjct: 128 TFWTPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQ-------GDDPNY--LKSA 178
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+C KHYA V + R F+A T+QD+ ET+L FE VK+ + VM ++N V G
Sbjct: 179 ACAKHYA---VHSGPESLRLEFNAVPTQQDLYETYLPAFEALVKDANVEGVMPAHNAVFG 235
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
P A+ LL +R W GY+V DC +I+ + HK++ DS+ A A LKAG +L+
Sbjct: 236 APMAANKFLLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYV-DSEVAAAAVALKAGTNLN 294
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDICSD 399
CG Y A+ QG V E + + K L+ RLG FD Y +G + I S
Sbjct: 295 CGATYKELK-KAIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSKNPYSKIGPELIHSK 353
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+IELA EAA++ IV+LKN N LPL + +K V GP AN++ ++G+Y G+ ++
Sbjct: 354 EHIELAREAAQKSIVMLKNKNNLLPLPT-DIKVPYVTGPFANSSDMLMGSYYGVSPGVVT 412
Query: 460 PIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
+AG + ++ Y++G K+ N A A +D TI + GL E E +
Sbjct: 413 ILAGITDAVSLGTSLNYRSGALPFQ-KNINPKNWAPNVAGMSDVTICVVGLTADREGEGV 471
Query: 516 ---------DREDLWLPGYQTQLINQVAEVAK-GPVILVIMSAGGVDIAFAETNTNIKAI 565
DR DL LP Q + Q+A K P++LVI S V + E + + AI
Sbjct: 472 DAIASNHKGDRLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSPVSLEGIEEHCD--AI 529
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
L YPGE+GG A+ADV+FGK +P G LP+T+ P + L GRT
Sbjct: 530 LQIWYPGEQGGNAVADVLFGKVSPTGHLPMTF---------PKSVAQLPDYKDYSMKGRT 580
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
YK+ ++PFG+GL+Y SKT ++
Sbjct: 581 YKYMTEEPMFPFGFGLTY---------------------------------SKTEFKNLV 607
Query: 686 VND--LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI--KQVIGFQRVFVRA 741
V D LR + + V+ NVG D ++V +Y P ++ + + F+RV ++
Sbjct: 608 VEDAKLRKKESLKVSVEVTNVGDFDIDEIVQLYISPKSQKEGEGLPFTTLKAFKRVALKK 667
Query: 742 GRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
G ++++F + +SL +++ + G + + VGN
Sbjct: 668 GETQKVEFTIHP-ESLKVINVKGQKVWRKGAYKVTVGN 704
>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
Length = 805
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 253/743 (34%), Positives = 371/743 (49%), Gaps = 91/743 (12%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAH--------------------GV 88
++ L CD++ R LV + + EK+ L ++ G
Sbjct: 30 LAKTLACDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITPHSMSLGA 89
Query: 89 PRLGLPQYEWWSEALHGVSNVGPGTHFDDV---IPGATSFPTVILTTASFNESLWKKIGQ 145
R+GLP Y WW+EALHGV+ PG F+ ATSF I A+F+ L ++
Sbjct: 90 ERIGLPAYAWWNEALHGVA-ASPGVSFNQAGQEFSHATSFANTITLAAAFDNDLVYEVAD 148
Query: 146 AVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE------------------TPGED 187
+STEARA N AGL YW+PNIN +DPRWGR E TPGED
Sbjct: 149 TISTEARAFSNAELAGLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGED 208
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P + Y + GL EG + KV + CKH+AAYD++ W+G RY F+A
Sbjct: 209 PVHIKGYVQALLEGL---EGRDKIR-------KVIATCKHFAAYDLERWQGALRYRFNAV 258
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL---HG 304
VT QD+ E +L+PF+ C ++ S MCSYN +NG P+CA L++ +R W+ +
Sbjct: 259 VTSQDLSEYYLQPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKHWNWTEHNN 318
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKV 360
YI +DC++IQ + N + + A A AG D C T+ G A Q +
Sbjct: 319 YITSDCNAIQDFLPNFHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLTDVIG-AYNQSLL 377
Query: 361 KETDIDKSLKYLYTVLMRLGFFD-GSPQ-YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
E ID++L+ LY L+R G+ D SP Y + + + + LA ++A +GIVLLKN
Sbjct: 378 SEEIIDRALRRLYEGLIRAGYLDSASPHPYTKISWSQVNTPKAQALALQSATDGIVLLKN 437
Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYK--TGC 476
+ LPL+ KT+A++G ANAT M+G Y+GIP Y +PI + NVT+ G
Sbjct: 438 N-GLLPLDLTN-KTIALIGHWANATRQMLGGYSGIPPYYANPIYAATQL-NVTFHHAPGP 494
Query: 477 DDVACKSNNSIFA--ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVA 534
+ + S N + A AA +D + L G DLS+ AE DR+ + P Q L+ +A
Sbjct: 495 VNQSSPSTNDTWTSPALSAASKSDIILYLGGTDLSIAAEDRDRDSIAWPSAQLSLLTSLA 554
Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
++ K P I+ + VD +N NI +ILW GYPG+ GG A+ +++ G +P RLP
Sbjct: 555 QMGK-PTIVARL-GDQVDDTPLLSNPNISSILWVGYPGQSGGTALLNIITGVSSPAARLP 612
Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
+T Y Y ++PLT+M LRP + PGRTY++Y P L PFG+GL YT F F
Sbjct: 613 VTVYPETYTSLIPLTAMSLRPTSA--RPGRTYRWYPSPVL-PFGHGLHYTTFTAKFGVF- 668
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
+++ +N+ +L N Y L + + V N G V +
Sbjct: 669 ESLTINIAELVSNCNERY----------------LDLCRFPQVSVWVSNTGELKSDYVAL 712
Query: 715 VYSKPPAEIAATYIKQVIGFQRV 737
V+ + IK ++G++R+
Sbjct: 713 VFVRGEYGPEPYPIKTLVGYKRI 735
>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
Length = 2278
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 256/769 (33%), Positives = 397/769 (51%), Gaps = 97/769 (12%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA--HG-VPRLGLPQYEWWSEALHGVSN 108
F FC+SSL +RV+DL+ R+ LDEKV+ L A HG +PRLG+P+Y W + +HGV +
Sbjct: 34 FPFCNSSLSLDLRVEDLLQRLQLDEKVRMLTARASTHGSIPRLGVPEYNWGANCVHGVQS 93
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY------NLGRA-- 160
GTH ATSFP + A F+ + K+ Q + E RA+ N R
Sbjct: 94 TC-GTH------CATSFPNPVNLGAIFDPNEIYKMAQVIGKELRALRLEGARENYARGPH 146
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GL WSPNIN+ RDPRWGR ETP EDP+V +Y V Y +GLQ EG +SR L
Sbjct: 147 IGLDCWSPNININRDPRWGRAMETPSEDPYVNAKYGVAYTKGLQ--EGQ------DSRFL 198
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+ KHY AY +N+ G DR FDA V+ D +T+ FE V +G A +MCSYN
Sbjct: 199 QAVVTLKHYLAYSYENYGGTDRTQFDAIVSAYDFADTYFPAFEASVVDGKAKGIMCSYNS 258
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
+NGIP+CA+ K LNQ +R + + GYI +D +IQ + D HK+ E A +++G+
Sbjct: 259 LNGIPTCAN-KWLNQLLRDDLEFDGYITSDTGAIQGIFDGHKYTKTLCE-ATKIAMESGV 316
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSD 399
D+ G Y N + ID++++ + +LG FD G +D+ +
Sbjct: 317 DICSGNAYWNCLKQLANSTNFSAS-IDEAIRRTLKLRFQLGLFDAIGDQPHFGPEDVRTA 375
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR--- 456
++++L+ + AR+ IVLL+N NTLPL +AV+GPH+ ++GNY G C
Sbjct: 376 KSLQLSLDLARKSIVLLQNHGNTLPLRLG--LRIAVIGPHSMTRRGIMGNYYGQLCHGDY 433
Query: 457 -----YMSP---IAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDL 508
SP I +G N + GC + S A +A +TAD ++ G+D+
Sbjct: 434 DEVRCIQSPLEAIQSVNGRNNTHHVNGC-GINDTSTAEFDDALQAVRTADVAVLFLGIDI 492
Query: 509 SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG--GVD--IAFAETNTNIKA 564
S+E ES DR+++ +P Q +L+ + VA P ++V+ + G G++ I +A++
Sbjct: 493 SIERESKDRDNIDVPHIQLELLKAI-RVAGKPTVVVLFNGGILGIEKLILYADS------ 545
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
+L A YPG G +AIA+++FG NP G+LP+T Y +++ + + SM + YPGR
Sbjct: 546 VLEAFYPGFFGAQAIAEILFGSINPSGKLPVTMYRSNFINDVDMKSMSM-----TLYPGR 600
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
+Y++Y +Y FG+GLSYT F +IQ + D+ TR
Sbjct: 601 SYRYYTEVPVYSFGWGLSYTTF---------SIQ--------------SIDSHDTRA--- 634
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT-----YIKQVIGFQRVFV 739
+N + +++ N G G +V+ + + P +I AT +Q+ + RV +
Sbjct: 635 -MNHVLTAQPKMYRILITNNGKYYGEEVLFAFFR-PLDIHATGPVESLQQQLFNYTRVRL 692
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG---GVSFP 785
G + + ++L + D N + G + + + NG ++FP
Sbjct: 693 DPGDMREVPLHVKD-ENLALHDRNGNLCVFEGFYELIISNGVEEQLTFP 740
>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
[Marvinbryantia formatexigens DSM 14469]
Length = 700
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 251/749 (33%), Positives = 376/749 (50%), Gaps = 104/749 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R + LV++MT++EK QL A + RLG+P Y WW+EALHGV+ G AT
Sbjct: 9 RAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAGQ----------AT 58
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I A+F+E+L +I ++TE RA YN A GLT+WSPN+N+ RDP
Sbjct: 59 VFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPNVNIFRDP 118
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP + R V +V+GLQ + +K ++C KH+A V +
Sbjct: 119 RWGRGHETYGEDPCLTSRLGVAFVKGLQG----------DGETMKAAACAKHFA---VHS 165
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
R+ F+A + +DMEET+L FE VKE D +VM +YNR NG CA P +L +
Sbjct: 166 GPEAVRHEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAYNRTNGEACCASP-VLQKI 224
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+R +W G+ V+DC +I+ ++H A +KE A A + +G DL+CG Y + +A
Sbjct: 225 LREDWGFEGHFVSDCWAIRDFHEHHMLTATAKESA-AMAINSGCDLNCGNTYLHIL-HAY 282
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
+ G V E I ++ L+T LG FDGS +Y + + S E++ LA +AA E VL
Sbjct: 283 RDGLVSEETITEAAVRLFTTRFLLGLFDGS-EYDDIPYTVVESKEHLALAEKAALESAVL 341
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKN+ LPL +++TV V+GP+A++ A+ GNY G RY + G Y V
Sbjct: 342 LKNN-GILPLKKERLRTVGVIGPNADSRAALAGNYHGTASRYETIQQGLQDYLGEDVRVL 400
Query: 472 YKTGC---DDVACK---SNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
GC +D K + + + A A+ +D I+ GLD ++E E S D
Sbjct: 401 TSVGCALSEDRTEKLALAGDRLAEAQIVAENSDVVILCLGLDETLEGEEGDTGNSYASGD 460
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+E L LP Q L+ VA K PV+L +MS +D+++A + + LW YPG +GG
Sbjct: 461 KETLLLPEAQRDLMEAVAATGK-PVVLCMMSGSDLDMSYAAEHFDAILQLW--YPGSQGG 517
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
A A ++FG+ +P G+LP+T+Y + ++ LP + GRTY++ P YP
Sbjct: 518 SAAAKLLFGEVSPSGKLPVTFY--ETLEELP-------AFEDYSMKGRTYRYMGHPAQYP 568
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FG+GL+Y + +DA+ + +
Sbjct: 569 FGFGLTYGDVR-------------------------VTDAN--------IRGASAEGDLT 595
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
V +N G+ +V+ +Y K A + F R+ + AG K I+ A ++
Sbjct: 596 LAVTAENAGNAVTDEVLQIYVKCTDSANAVPNPALAAFGRIHLEAGEKKTIEMTVPA-RA 654
Query: 757 LNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
+VD A + + + FV GVS P
Sbjct: 655 FTVVDEAG---VRSRDGKQFVIYAGVSQP 680
>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
Length = 733
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 244/749 (32%), Positives = 375/749 (50%), Gaps = 86/749 (11%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ LP + R++ L+ MTL EK QL + + RLGLP+Y++W+EALHGV+ G
Sbjct: 26 WFDTQLPTNERIESLIDAMTLKEKASQLVNGNVAIERLGLPEYDFWNEALHGVARNG--- 82
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
AT FP I A+F++ L + +S EARA +N+ +GLT+W
Sbjct: 83 -------RATVFPQAIGMAATFDQDLLLQAATVISDEARAKFNVSSEIGNRSKYSGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ + V GLQ + + LK ++
Sbjct: 136 TPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGD---------HPKYLKTAAAA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDA +E+DM ET+ FE V E D +VM +YNRVNG P+
Sbjct: 187 KHFA---VHSGPEALRHEFDAIASEKDMYETYFPAFEALVTEADVETVMAAYNRVNGHPA 243
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
LLN +R +W G+IV+DC + + HK A++ E A A + G DL+CG
Sbjct: 244 GGSDFLLNTVLRDKWGFSGHIVSDCWGLADFHEYHKVTANAVESA-ALAINTGTDLNCGS 302
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIE 403
YT +AV+ G V E ID L + +LGFFD Y S+ + SD + +
Sbjct: 303 VYTALP-DAVEAGLVDEKTIDTRLHKVLATKFKLGFFDPKDDNPYNSISADVVNSDAHAD 361
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
+A E A + IVLL+N+ LPL+ ++ V V GP A+++ ++GNY G+ + + + G
Sbjct: 362 VAYEMAVKSIVLLQNENQVLPLDK-NIRNVYVTGPFASSSEVLLGNYYGLSGKTTNILDG 420
Query: 464 FSGYANV----TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES----- 514
+ +V YK G N + EA + D I + GL + E E
Sbjct: 421 ITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIA 480
Query: 515 ----LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
DR L LP +Q + + ++ + PVI+V+ + G + E AI++A Y
Sbjct: 481 SPHKGDRLSLDLPEHQIEFLRKLRKDNDKPVIVVLTA--GTPVNVTEIAQLADAIVFAWY 538
Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
PG+EGG+A+AD++FG+ +P GRLPIT+ P + L P D GRTY++
Sbjct: 539 PGQEGGKAVADILFGERSPSGRLPITF---------PKSEAQLPPYDDYSMQGRTYRYMT 589
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
+YPFG+GLSY K++ ++ L + L+ T T
Sbjct: 590 EEPMYPFGFGLSYATVKFDNIT-----------LGNAEALSSTDGQKGT----------- 627
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ V+ N G+ + +VV +Y K P I+ + GFQR+ + G+ ++ F
Sbjct: 628 ----LDVSVNVTNTGTRELEEVVQLYLKTPNAGIDQPIQSLKGFQRIKLAPGQTGQVSFT 683
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ K L ++ +L G++ + VGN
Sbjct: 684 VSK-KQLYSINAKGKPVLLEGDYHVIVGN 711
>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 720
Score = 371 bits (952), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 245/722 (33%), Positives = 364/722 (50%), Gaps = 94/722 (13%)
Query: 47 LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
LQ S F D +L R K L+S +TL EK+ LG V RL +P Y WW+EALHGV
Sbjct: 22 LQGQSTNFRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWNEALHGV 81
Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------ 160
+ G AT FP I A+F+ +L +I A+STEAR+ YN+ R+
Sbjct: 82 ARAGE----------ATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRSKGFQNQ 131
Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
G+T+W+PNIN+ RDPRWGR ET GEDPF+ +V+GLQ E R
Sbjct: 132 YLGITFWTPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQGSE--------PERR 183
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
LK ++ KH+A V + DR+HF+A V E+D+ ET+L F+ V+ G +++MC+YN
Sbjct: 184 LKTAAGAKHFA---VHSGPEADRHHFNAVVDEKDLRETYLPAFKALVENG-VTTIMCAYN 239
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
RVNG P C LL +R EW G +V DC ++ + HK + ++ + A +KAG
Sbjct: 240 RVNGEPCCTGKTLLQDILRDEWGFKGQVVTDCWALDDIWLRHKTIP-TRVEVAAAAVKAG 298
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDI 396
++LDC +A+++ + +D +L ++LGF+D Y G +
Sbjct: 299 VNLDCANILQEDVQDAIEKRLLTLEQVDSALLPTLQTQLKLGFYDDPSHSPYRHYGIDSV 358
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
+ +I LA EAA + +VLLKND LPL + ++ VVG +A + A+ GNY G+
Sbjct: 359 NNSYHISLAKEAAEKSMVLLKND-GILPLKKDTISSIMVVGENAASISALTGNYHGLSGN 417
Query: 457 YMSPIAGFSGYA----NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
++ + G +V Y GC ++ S F AA D TI + GL +E
Sbjct: 418 MVTFVEGLVKAGGPGMSVQYDYGC----SFADTSHFGGIWAAGFTDVTIAVIGLSPLLEG 473
Query: 513 E---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
E D++DL +P + ++ E PVI V+ +DI+ E +
Sbjct: 474 EHGDAFLSNWGGDKKDLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALDISAIEPYAD-- 531
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AI++A YPGE+GG A+AD++FG+ +P GRLPIT+Y ++ LP P
Sbjct: 532 AIIYAWYPGEQGGTALADLIFGEVSPSGRLPITFYKD--IKDLP-------PYHDYNMTN 582
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
RTY+++ G LYPFGYGLSYT F Y LS P
Sbjct: 583 RTYRYFQGDVLYPFGYGLSYTSFHYEWLS----------------------------KPS 614
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
V++ DD + N G+ D +V+ VY P +I ++++ GF R+ ++AG+
Sbjct: 615 TKVSE---DDIISVNIAVTNTGTMDADEVIQVYIVYP-DIERMPLRELKGFSRIHIKAGQ 670
Query: 744 NK 745
+
Sbjct: 671 TQ 672
>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
Length = 805
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 257/765 (33%), Positives = 386/765 (50%), Gaps = 92/765 (12%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEALHGVSN 108
FC++SL + RV+DL+SR+ L EK L A PR +GLP+Y W + +HGV +
Sbjct: 37 FCNTSLSTADRVEDLLSRLPLQEKATLL--TARASPRGNMSSIGLPEYNWGANCVHGVQS 94
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---------R 159
GT+ TSFP + A F+ + + Q + E RA++ G
Sbjct: 95 TC-GTNC------PTSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGPH 147
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GL WSPNIN+ RDPRWGR TETP EDP V +Y V Y RGLQ EG + R L
Sbjct: 148 LGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQ--EGKRQ----DPRFL 201
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+ KHYAAY +N+ GV+R FDA V+ D +T+ F V +G+A VMCSYN
Sbjct: 202 QAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSYNS 261
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
VNGIP CA+ +L+ +RG GY+ +D +++ + D H + ADS+ +A + AG
Sbjct: 262 VNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHY-ADSQCEAARLAILAGT 320
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDIC 397
D++ G+ Y V +++E +D +L++ + LG FD Y ++ ++
Sbjct: 321 DINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFDPIDDQPYWNVTPSEVN 380
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR- 456
+ L+ A R+ +V+L+N+ + LPL K +AV+GPHA + ++GNY G C
Sbjct: 381 TAAAKALSLNATRKSLVMLQNNASVLPLQ--KGVKLAVLGPHAKSKRGLLGNYLGQMCHG 438
Query: 457 ----------YMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
+ I +G +N T+ GC ++ S A AAK ADA ++ G+
Sbjct: 439 DYDEVGCVQTPLDAIRAANGASNTTFAEGC-GISGNSTAGFEKAVAAAKEADAVVLFLGI 497
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
D S+E E DR ++ LP Q QL+ +V V + P ++V+++ GGV I E A++
Sbjct: 498 DKSIEGEVGDRNNIDLPNIQMQLLQRVHAVGR-PTVVVLIN-GGV-IGAEEIIERTDALV 554
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
A YPG G RA+ADV+FG NP G+LP+T Y DYV + + SM D +PGRTY
Sbjct: 555 EAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSM-----DMTAHPGRTY 609
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT----SDASKTRCP 682
+++ G ++PFG+GLSYT F ++ S T N H N ++ SD +
Sbjct: 610 RYFKGEPVFPFGWGLSYTTFSLSVDSGT-------NSSSHSNNAAFSGGEVSDTANVTIS 662
Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP-------PAEIAATYIKQVIGFQ 735
V+ ND G G +VV+ + +P PA + +Q+ +Q
Sbjct: 663 VVVKND----------------GEVAGDEVVLAFFRPVNSNVTGPATLLN---EQLFDYQ 703
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
RV + + + F +L + D N G + + V NG
Sbjct: 704 RVSLGPLDSTEVSFTIER-STLALPDEEGNLASFPGSYEVIVSNG 747
>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
610]
Length = 722
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 245/731 (33%), Positives = 378/731 (51%), Gaps = 78/731 (10%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P ++RVK L+ +MTL EK QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 50 IIGDLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
T F I A+++ TV++ K++ A+STEAR Y GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ET GEDP++ R V +V+GLQ G A LK + KH+ A +
Sbjct: 160 RDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---GDHPAY------LKTVATIKHFVANN 210
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+N +R+ +++ + + E + +E CVKE SVM +YN NG+P LL
Sbjct: 211 EEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLL 266
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
+ +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 267 GEVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEKLV 325
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAAR 410
AV+QG + E ID++L + T +LG FD Y K+ + + ELA EAA
Sbjct: 326 QAVKQGLISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEAAV 385
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANV 470
+ +VLLKN+ N LPL+ K K+VAVVGP A+ +G Y+G P ++ + G
Sbjct: 386 KSVVLLKNE-NLLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSVTLLKGVKDLMGK 442
Query: 471 TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
K + S +SI A A K D ++ G D + E+ D ++LP Q +L+
Sbjct: 443 RGKVNYLNGIGASRDSIVA---AVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLL 499
Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
+ +V P I+++ +G + + +I AI+ A YPG+E GRA+AD++FG NP
Sbjct: 500 KAIYQV--NPRIVLVFHSGN-PLTSEWADVHIPAIMQAWYPGQEAGRALADLLFGNENPS 556
Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
G+LP+T Y + LP +D + GRTY++ LY FG+GLSYT F ++
Sbjct: 557 GKLPMTIYRAE--DQLPDI------LDFDMWKGRTYRYMKEDPLYGFGHGLSYTSFGFDG 608
Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
+ + T++ ++ +C V+ N G G
Sbjct: 609 IQGSDTLK----------------SGARLQC----------------SVELSNTGKWTGE 636
Query: 711 DVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+VV VY TY +K+++ F++V + G KR++F + L++ + N +
Sbjct: 637 EVVQVYVSRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNI-PPRELSVWE-NGNWRML 694
Query: 770 AGEHTIFVGNG 780
G++T+F+G+G
Sbjct: 695 TGKYTLFIGSG 705
>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
Length = 786
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 247/635 (38%), Positives = 331/635 (52%), Gaps = 91/635 (14%)
Query: 85 AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV---IPGATSFPTVILTTASFNESLWK 141
A G RLGLP+Y WWSE LHGV+ PG F+ ATSF I ASF++ L
Sbjct: 8 ALGASRLGLPKYAWWSEGLHGVAG-SPGVKFNTTGYPFSYATSFANAINLGASFDDDLVY 66
Query: 142 KIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRG 201
++G A+STEARA N G GL YW+PN+N +DPRWGR ETPGEDP + Y + G
Sbjct: 67 EVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAILAG 126
Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
L EG+E KV + CKHYAAYD++ W G+ RY F+A VT QD+ E +L PF
Sbjct: 127 L---EGNETVR-------KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPF 176
Query: 262 EMCVKEGDASSVMCSYNRV-----------------NGIPSCADPKLLNQTVRGEWDL-- 302
+ C ++ S+MCSYN + P+CA P L+ +R W+
Sbjct: 177 QQCARDSKVGSIMCSYNALTIRDMASGKPDEEINLTTAQPACAKPYLMT-ILRDHWNWTE 235
Query: 303 -HGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDC---GQYYTNFTGNAVQQ 357
+ YI +DC++I + DNH F + + +A A KAG D C G T+ G A Q
Sbjct: 236 HNNYITSDCNAILDFLPDNHNF-SQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVG-AYNQ 293
Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFD---------------GSPQYVSLGKQDICSDENI 402
+ E ID +L+ LY L+R G+ D SP Y +L +D+ +
Sbjct: 294 SLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQ 353
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI- 461
ELA +A EGIVLLKN + LPL+ + K VA++G ANAT M G Y+GIP Y +P+
Sbjct: 354 ELALRSATEGIVLLKNAGSLLPLDFSG-KKVALIGHWANATGTMRGPYSGIPPFYHNPLY 412
Query: 462 AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
A + +Y G A + A AA+ AD + G D +V +E LDRE +
Sbjct: 413 AAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIA 472
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
P Q QL++++A + K ++VI VD + N N+ +ILW GYPG+ GG A+ D
Sbjct: 473 WPETQMQLLSELAGLGK--PLVVIQLGDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFD 530
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD------------------------ 617
V+ GK P GRLP+T Y YV +PLT M LRP +
Sbjct: 531 VLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNYSSSSNLEQEVSVQGRGSLTIQPR 590
Query: 618 ------SLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
+L PGRTYK+Y+ P L PFGYGL YT F
Sbjct: 591 STPGNKTLSSPGRTYKWYSSPVL-PFGYGLHYTTF 624
>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
FGSC 2508]
Length = 788
Score = 369 bits (948), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 245/639 (38%), Positives = 332/639 (51%), Gaps = 94/639 (14%)
Query: 85 AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV---IPGATSFPTVILTTASFNESLWK 141
A G R+GLP+Y WWSE LHGV+ PG F+ ATSF I ASF++ L
Sbjct: 8 ALGASRIGLPKYAWWSEGLHGVAG-SPGVTFNTTGYPFSYATSFANAINLGASFDDDLVY 66
Query: 142 KIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRG 201
++G A+STEARA N G GL YW+PN+N +DPRWGR ETPGEDP + Y + G
Sbjct: 67 EVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAG 126
Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
L EG+E KV + CKHYAAYD++ W G+ RY F+A VT QD+ E +L PF
Sbjct: 127 L---EGNETVR-------KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPF 176
Query: 262 EMCVKEGDASSVMCSYNRV-----------------NGIPSCADPKLLNQTVRGEWDL-- 302
+ C ++ S+MCSYN + P+CA+ L+ +R W+
Sbjct: 177 QQCARDSKVGSIMCSYNALTIRDMAGGNPDEIINLTTAQPACANTYLMT-ILRDHWNWTE 235
Query: 303 -HGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDC---GQYYTNFTGNAVQQ 357
+ YI +DC++I + DNH F + + +A A KAG D C G T+ G A Q
Sbjct: 236 HNNYITSDCNAILDFLPDNHNF-SQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVG-AYNQ 293
Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFD---------------GSPQYVSLGKQDICSDENI 402
+ E ID +L+ LY L+R G+ D SP Y +L +D+ +
Sbjct: 294 SLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQ 353
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI- 461
ELA +A EGIVLLKN + LPL+ + K VA++G ANAT M G Y+GIP Y +P+
Sbjct: 354 ELALRSATEGIVLLKNSGSLLPLDFSSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLY 413
Query: 462 AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
A + +Y G A + A AA+ AD + G D +V +E LDRE +
Sbjct: 414 AAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIA 473
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
P Q +L++++A + K ++VI VD +F N N+ +ILW GYPG+ GG A+ D
Sbjct: 474 WPKAQMKLLSELAGLGK--PLVVIQLGDQVDDSFLLENGNVSSILWVGYPGQSGGTAVFD 531
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD------------------------ 617
V+ GK P GRLP+T Y YV +PLT M LRP +
Sbjct: 532 VLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNHSSSTSSSSNPEEEVSVQGSGSLT 591
Query: 618 ----------SLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
+L PGRTYK+Y+ P L PFGYGL YT F
Sbjct: 592 IQPRSTPGNKTLSSPGRTYKWYSNPVL-PFGYGLHYTTF 629
>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
Length = 689
Score = 369 bits (946), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 231/637 (36%), Positives = 350/637 (54%), Gaps = 76/637 (11%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D L R L ++ +E+ QQL A + + GLP Y WW+E LHGV+ G
Sbjct: 4 YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I A+F++ + ++G+ VSTEARAMYN GLT W
Sbjct: 62 --------ATVFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIYKGLTLW 113
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ R VN+V+G+Q E + L+ ++C
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEEKY----------LRAAACA 163
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDARV+E+D+EET+L F+ VKEG VM +YNRVNG PS
Sbjct: 164 KHFA---VHSGPESLRHEFDARVSEKDLEETYLPAFKALVKEGRVEGVMGAYNRVNGEPS 220
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CA KL+ + EW GY V+DC +I+ NHK + D+ + A LKAG D++CG
Sbjct: 221 CASEKLMGKLR--EWGFDGYFVSDCGAIRDFHTNHK-ITDTAPQSAAMALKAGCDVNCGN 277
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y + A+++G + + DI + + +RLG D + ++ L I D N L+
Sbjct: 278 TYLHILA-ALEEGLITKQDIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACDGNKALS 335
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
EAA + +VLL ND LPL+ +++ ++AV+GP+A++ A++GNY G P R ++ + G
Sbjct: 336 LEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYEGTPDRSVTFLEGIQ 394
Query: 464 --FSGYANVTYKTGCDDVACKSN------NSIFAASEAAKTADATIILAGLDLSVEAE-- 513
F G V Y GC ++ + A A + AD T++ GLD ++E E
Sbjct: 395 DAFDG--RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDSTLEGEEG 452
Query: 514 -----SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
S D+ DL LP Q L+ ++ + K P+I+V+ + V+ T A++ A
Sbjct: 453 DTENKSGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVN-----TECEGNALINA 506
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPGRTYK 627
YPG+ GG+A+A+++FG+ +P G+LP+T+Y MLP T ++ RTY+
Sbjct: 507 WYPGQYGGKALAEILFGEVSPSGKLPVTFYKS--ADMLPDFTDYSMK--------NRTYR 556
Query: 628 FYNGPT--LYPFGYGLSYTQFKYNLLSFT-KTIQVNL 661
F + + LYPFGYGL+Y+ F+ +S+ T+ VN+
Sbjct: 557 FCDDESNVLYPFGYGLTYSHFECGDISYKDNTLAVNV 593
>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
str. Grapes]
Length = 701
Score = 368 bits (945), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 251/757 (33%), Positives = 370/757 (48%), Gaps = 116/757 (15%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ K LV+ M+L E QL A +PRLGLP+Y WW+EALHG + G AT
Sbjct: 9 QAKQLVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSGT----------AT 58
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I A F++ K+I +STE RA YN A GLT WSPN+N+ RDP
Sbjct: 59 VFPQAIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWSPNVNIFRDP 118
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ + V++++GLQ + LK ++C KH+A V +
Sbjct: 119 RWGRGQETYGEDPYLASQLGVSFIQGLQG----------DGPYLKTAACVKHFA---VHS 165
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
R+ F+A V+ +D+ ET+L FE CVKEG+ ++VM +Y+ VNG P C P L+
Sbjct: 166 GPEPLRHDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAYSAVNGEPCCGSPFLITDI 225
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+R +W G ++DC +I+ NH + ++ D+VA L AG DL+CG Y + A
Sbjct: 226 LRNDWGFEGMYISDCWAIRDFHLNHA-VTKNQVDSVALALNAGCDLNCGCEYLSLE-KAY 283
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
QQG + I ++ + T LG F Y ++G + ++E+ ++A +A+ +VL
Sbjct: 284 QQGLIDRKTITQACIRVMTTRFALGLFSEDCTYSNIGYEQNDTEEHRKVAFKASCNSLVL 343
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKND LPL+S + +A++GP+A++ A+ GNY G Y + + GF V
Sbjct: 344 LKND-GMLPLDSRSLHAIAIIGPNADSREALWGNYHGTSSTYTTVLEGFRKTLGESVKVK 402
Query: 472 YKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
Y G + + N+ I A A +D I+ G D +VE E + D
Sbjct: 403 YSQGSAIQKEKLERLAEPNDRIAEAIAVATVSDTIILCLGYDETVEGEMHDDGNGGWAGD 462
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
++DL LP Q L+ VA K P++LV++S G +D E N+KA+L YPG+EGG
Sbjct: 463 KQDLRLPPCQRALLKAVASTGK-PIVLVLLSGGAIDPEI-ERFPNVKALLQGWYPGQEGG 520
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY--PGRTYKFYNGPTL 634
AIA + G NP G LP+T+Y + V LP D Y GRTY++ L
Sbjct: 521 LAIAHTILGLNNPSGHLPVTFYRSETV--LP---------DFCDYRMEGRTYRYVQEKVL 569
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFG+GLSYT F Y LS K NL
Sbjct: 570 YPFGFGLSYTTFSYGNLSTGKQADGNL--------------------------------- 596
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSK------PPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
E N G+ +G +VV +Y PP + + GF + ++ G +K +
Sbjct: 597 -ELSFIVSNSGNREGREVVQIYCHSDHPFFPPNPV-------LCGFTSLVLQPGEHKTVT 648
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
A ++ + +D + G ++VGN + P
Sbjct: 649 QTILA-EAFSAIDPEGKRIALKGWFDLYVGNHQKALP 684
>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
Length = 411
Score = 368 bits (944), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 188/417 (45%), Positives = 268/417 (64%), Gaps = 11/417 (2%)
Query: 377 MRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
MRLGFFDG+P+ Y LG +D+C+ EN ELA E AR+GIVLLKN +LPL+ + +KT+
Sbjct: 1 MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60
Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASE 492
AV+GP+AN T MIGNY G+ C+Y +P+ G T Y GC +V C + + +A
Sbjct: 61 AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTC-TEADLDSAKT 119
Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
A +ADAT+++ G D ++E E+LDR DL LPG Q +L+ QVA+ A+GPV+LVIMS GG D
Sbjct: 120 LAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFD 179
Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMP 612
I FA+ + I +I+W GYPGE GG AIADV+FG+ NP G+LP+TWY YV+ +P+T+M
Sbjct: 180 ITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSYVEKVPMTNMN 239
Query: 613 LRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
+RP S GY GRTY+FY G T+Y FG GLSYT F + L+ K + +NL++ Q CR+
Sbjct: 240 MRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLNLDESQSCRSPEC 299
Query: 673 TS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
S DA C + R D FE ++ +NVG +G++ V +++ PP E+ + KQ+
Sbjct: 300 QSLDAIGPHCEKAVGE--RSD--FEVQLKVRNVGDREGTETVFLFTTPP-EVHGSPRKQL 354
Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
+GF+++ + ++F + CK L +VD L G H + VG+ SF I +
Sbjct: 355 LGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGSLKHSFNISV 411
>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
Length = 698
Score = 367 bits (943), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 253/752 (33%), Positives = 372/752 (49%), Gaps = 117/752 (15%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R ++LV RM L + + QL A + LG+P Y WW+E LHG + G AT
Sbjct: 6 RAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSGT----------AT 55
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVARDP 175
FP I + F+ + VSTE RA YNL GLT WSPN+N+ RDP
Sbjct: 56 VFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPNVNIFRDP 115
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ R AV ++RGLQ EG LK +SC KH+AA+
Sbjct: 116 RWGRGQETFGEDPYLTARLAVAFIRGLQG-EGP---------VLKTASCVKHFAAHS--- 162
Query: 236 WKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
G + R+ F+A V ++D+EET+L F VKE A +VM +Y+ +N P CA L+
Sbjct: 163 --GPEPLRHGFNAVVGKKDLEETYLPAFASAVKEAKADAVMGAYSALNDEPCCASSFLME 220
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
+T+R W G ++DC +I+ NHK + ++E++ A LK G DL CG Y +
Sbjct: 221 ETLRLRWGFEGMYISDCWAIRDFHLNHK-VTKNEEESAALALKRGCDLACGCEYQSLE-K 278
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGI 413
A Q+G + I K+ + T +LG FD Y +LG + + SDE+ LA EA+ +
Sbjct: 279 AFQKGLITREQIKKAAIRVMTTRFKLGQFDQGTAYDTLGLESLDSDEHAALAFEASCRSL 338
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----AN 469
VLLKND LPL V +AV+GP+A++ A+ GNY G RY++ + G Y
Sbjct: 339 VLLKNDA-LLPLKKEAVSCLAVIGPNADSRQALWGNYHGTSSRYVTILEGLRDYVGSSTR 397
Query: 470 VTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------S 514
+ Y G + + K ++ + A AK +D ++ GL+ +VE E +
Sbjct: 398 ILYSEGSNLTKNKVERLAKDDDRLSEAVFMAKASDVVVLCLGLNETVEGEMHDDGNGGWA 457
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
D++DL LP Q +L+ VAE K P+I+V++S G +D E N+KA++ A YPG+E
Sbjct: 458 GDKDDLRLPLCQRKLLKAVAETGK-PIIVVLLSGGSLDPEI-EQYANVKALIQAWYPGQE 515
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP-T 633
GG+AIA +++G P G+LP+T+Y + ++ P T L RTY++ + P
Sbjct: 516 GGKAIAHLLYGALCPSGKLPVTFYKAE-AKLPPFTDYSL--------IRRTYRYCDDPDV 566
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
LYPFG+GLSY F + L S A +T GV L
Sbjct: 567 LYPFGFGLSYASFSFCL-----------------------SAAQETEQNGVAATVL---- 599
Query: 694 YFEFKVDFQNVGSTDGSDVVIVY------SKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+N + D VV +Y PP + + G + V ++AG +I
Sbjct: 600 -------VRNTSALDARTVVQLYLAMEGKDLPPHPV-------LCGMKSVHLKAGEETQI 645
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
F+ K V N G +T++ G+
Sbjct: 646 TFILEE-KQFTAVQEDGNRYAVRGGYTLYAGS 676
>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
Length = 714
Score = 367 bits (943), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 243/701 (34%), Positives = 356/701 (50%), Gaps = 77/701 (10%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ D S RV+DLVS+MTL+EKV QL A V RLG+P Y WW+EALHGV+ G
Sbjct: 4 VYLDESRTDEERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG-- 61
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-----NLGRA---GLTY 164
AT FP I A F+E+L +KIG + E RA Y N R G+T+
Sbjct: 62 --------AATVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLYKGITF 113
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPNIN+ RDPRWGR ET GEDP + GR Y++G+Q N + LK ++C
Sbjct: 114 WSPNINIFRDPRWGRGHETYGEDPCLTGRMGTAYIKGMQG----------NGKRLKAAAC 163
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+AA+ KG R+ F++ V+++D+ ET+ FE CVKE VM YNR+NG
Sbjct: 164 VKHFAAHSGPE-KG--RHSFNSVVSKKDLTETYFPAFERCVKEAGVEGVMGGYNRLNGEA 220
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+C L+ + +R +W GY V+DC +I+ H L D+ +++ A LK+G DL+CG
Sbjct: 221 ACGSHHLITEILREKWGFDGYYVSDCGAIKDF-HMHHGLTDTPQESAALALKSGCDLNCG 279
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIEL 404
Y + +A QG V DID+++ +L MRLG FD ++ + + E+ L
Sbjct: 280 AVYLHVM-SAYNQGLVSAEDIDRAVTHLMMTRMRLGMFDQHTEFDEIPYEINDCAEHHGL 338
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG- 463
A +AA E +VLLKND LPL+ +KTVAV+GP+ ++ + GNY G + + G
Sbjct: 339 ALKAAEESMVLLKND-GILPLDKTALKTVAVIGPNGDSEEILKGNYNGTATEKYTILEGI 397
Query: 464 ----------FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
F + Y+ +++A ++++ + A A +D + GL+ ++E E
Sbjct: 398 RAVLGKETRIFCSEGSHLYRDNVENLA-EADDRLKEAVSMAVRSDVVFLCLGLNGTLEGE 456
Query: 514 S---------LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
D+ DL LP Q +L+ V PVIL++ + + I +A + + A
Sbjct: 457 EGDANNSYAGADKADLNLPESQMRLLKAVCGTGT-PVILLLAAGSAMAINYAAEHCS--A 513
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
IL YPG+ GG A A ++ G+ P GRLP+T+Y T+ L GR
Sbjct: 514 ILHIWYPGQMGGLAAARLLTGEAVPSGRLPVTFYQ---------TTEELPEFTDYSMKGR 564
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY++ LYPFGYGLSY F+Y S K Q R ++ SK C +
Sbjct: 565 TYRYMEREALYPFGYGLSYGDFEY---SNFKAEQTEAGPDGQVRFSVKITNRSKAECDEI 621
Query: 685 LVNDLRCDDYFEFK------VDFQNVGSTDGSDVVIVYSKP 719
+R D E DF+ + G V + ++ P
Sbjct: 622 AEVYVRIADS-ELAAPGGSLADFRRIHMKAGESVTVPFTLP 661
>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 722
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 244/724 (33%), Positives = 374/724 (51%), Gaps = 78/724 (10%)
Query: 60 PYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVI 119
P ++RVK L+ +MTL EK QL + +PRL LP Y +W+E LHGV+ G T F I
Sbjct: 57 PIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGEVTVFPQAI 116
Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
A+++ TV++ K++ A+STEAR Y GLTYWSP IN+ARDPRWGR
Sbjct: 117 NLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGR 166
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
ET GEDP++ R V +V+GLQ G A LK + KH+ A + +N
Sbjct: 167 NEETYGEDPYLTSRLGVAFVKGLQ---GDHPAY------LKTVATIKHFVANNEEN---- 213
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
+R+ +++ + + E + +E CVKE SVM +YN NG+P LL + +R E
Sbjct: 214 NRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKE 273
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGK 359
W G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y AV+QG
Sbjct: 274 WGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGL 332
Query: 360 VKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAAREGIVLLK 417
+ E ID++L + T +LG FD Y K+ + + ELA EAA + +VLLK
Sbjct: 333 ISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLK 392
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCD 477
N+ N LPL+ K K+VAVVGP A+ +G Y+G P ++ + G K
Sbjct: 393 NE-NLLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSVTLLKGVKDLMGKRGKVNYL 449
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVA 537
+ S +SI A A K D ++ G D + E+ D ++LP Q +L+ + +V
Sbjct: 450 NGIGASRDSIVA---AVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQV- 505
Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
P I+++ +G + + +I AI+ A YPG+E GRA+AD++FG NP G+LP+T
Sbjct: 506 -NPRIVLVFHSGN-PLTSEWADVHIPAIMQAWYPGQEAGRALADLLFGNENPSGKLPMTI 563
Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
Y + LP +D + GRTY++ LY FG+GLSYT F ++ + + T+
Sbjct: 564 YRAE--DQLPDI------LDFDMWKGRTYRYMKEDPLYGFGHGLSYTSFGFDGIQGSDTL 615
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
+ L+C V+ N G G +VV VY
Sbjct: 616 KSG--------------------------TTLQCS------VELSNTGKWTGEEVVQVYV 643
Query: 718 KPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
TY +K+++ F++V + G KR++F + L++ + N + G++T+F
Sbjct: 644 SRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNI-PPRELSVWE-NGNWRMLTGKYTLF 701
Query: 777 VGNG 780
+G+G
Sbjct: 702 IGSG 705
>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
FD-1]
Length = 690
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 243/724 (33%), Positives = 359/724 (49%), Gaps = 110/724 (15%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D SL R +DL +R+TL+E+ QL A V RL +P Y WWSE LHGV+ G
Sbjct: 4 YKDKSLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAGT-- 61
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I A F+E K+G + EARA YN A GL W
Sbjct: 62 --------ATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIYKGLCLW 113
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPN+N+ RDPRWGR ET GEDP++ R V + +GLQ EG LK ++C
Sbjct: 114 SPNVNIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQG-EGE---------VLKTAACA 163
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH A V + R+ FDA + +DMEET+L FE VKE VM +YNRVNG P+
Sbjct: 164 KHLA---VHSGPEAIRHEFDAVASPKDMEETYLPAFEALVKEAKVEGVMGAYNRVNGEPA 220
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CA L+ + EW GY V+DC +I+ NH + E A A LK G DL+CG
Sbjct: 221 CASKFLMGKL--DEWGFDGYFVSDCWAIRDFHTNHMVTKTAPESA-AMALKLGCDLNCGN 277
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y + +A +G + + DI K+ +L +RLG FD +Y L + ++EN A
Sbjct: 278 TYLHLL-HAYNEGLINDEDIKKACTHLMRTRVRLGMFDDETEYDKLDYSIVANEENKAYA 336
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
+ + +V+LKN+ LPL+ +K+KT+ V+GP+A++ A+ GNY G RY++ + G
Sbjct: 337 RKCSERSMVMLKNN-GILPLDPSKIKTIGVIGPNADSRPALEGNYNGRADRYITFLEGIQ 395
Query: 464 --FSGY-----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE--- 513
F G + YK C +A +++ + A + +D ++ GLD ++E E
Sbjct: 396 DAFGGRVLYSEGSHLYKDRCMGLAV-ADDRLSEAEIVTEHSDVVVLCVGLDATIEGEEGD 454
Query: 514 ------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
S D+ DL LP Q +L+ V K PVI+V + +++ + A++
Sbjct: 455 TGNEFSSGDKNDLRLPEAQRKLVETVMRKGK-PVIIVTAAGSAINV-----EADCDALIH 508
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
A YPG+ GG A+AD++FGK +P G+LP+T+Y D ++ T ++ GRTY+
Sbjct: 509 AWYPGQFGGTALADILFGKISPSGKLPVTFYT-DTTKLPEFTDYSMK--------GRTYR 559
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
+ LYPFGYGL+Y++ + + L F + K
Sbjct: 560 YTQDNILYPFGYGLTYSKTEVSDLKF---------------------ENGKASVKVTNTG 598
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
D +D +F + +GSD V YS + GF+RVF++ G + +
Sbjct: 599 DFDTEDVVQFYI------KGEGSDYVPFYS-------------LCGFRRVFLKKGESTVV 639
Query: 748 KFVF 751
+
Sbjct: 640 EVTL 643
>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
Length = 733
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 242/748 (32%), Positives = 372/748 (49%), Gaps = 86/748 (11%)
Query: 56 DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF 115
D+ LP R+ L+ MTL EK QL + + RLGLP+Y++W+EALHGV+ G
Sbjct: 28 DTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNG----- 82
Query: 116 DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSP 167
AT FP I A+F++ L K +S EARA +N+ +GLT+W+P
Sbjct: 83 -----RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTFWTP 137
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
NIN+ RDPRWGR ET GEDP++ + V GLQ + + LK ++ KH
Sbjct: 138 NINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGD---------HPKYLKTAAAAKH 188
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
+A V + R+ FDA + +DM ET+ FE V E + +VM +YNRVNG P+
Sbjct: 189 FA---VHSGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYNRVNGHPAGG 245
Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
LLN +R +W G++V+DC + HK A++ E A A + G DL+CG Y
Sbjct: 246 SDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-ALAINTGTDLNCGAVY 304
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELA 405
N +AV+ G V E IDK L + +LGFFD Y ++ + S+ + ++A
Sbjct: 305 -NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNISADVVNSEAHAQVA 363
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
E A + IVLL+N N LPL+ ++ + V GP A+++ ++GNY G+ + + + G +
Sbjct: 364 YEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYYGLSGKTTNILDGIT 422
Query: 466 GYANV----TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES------- 514
+V YK G N + EA + D I + GL + E E
Sbjct: 423 ANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASP 482
Query: 515 --LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
DR L LP +Q + ++ + PVI+V+ + G + E AI++A YPG
Sbjct: 483 HKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA--GTPVNLTEIAELADAIVFAWYPG 540
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
+EGG+A+AD++FG+ +P GRLPIT+ P + L P D GRTY++
Sbjct: 541 QEGGKAVADILFGERSPSGRLPITF---------PKSEAQLPPYDDYSMQGRTYRYMTQE 591
Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
+YPFG+GLSY Q K++ ++ T Q +K + N+ T
Sbjct: 592 PMYPFGFGLSYAQVKFDNITLGNT-QALASKNELQENMTVT------------------- 631
Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
V+ N G + +VV +Y K P + + + GF R+ + AG+ +++ F
Sbjct: 632 ------VNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLAAGQTEQVLFNI- 684
Query: 753 ACKSLNIVDYAANTLLPAGEHTIFVGNG 780
K L ++ +L G++++ VGN
Sbjct: 685 PKKHLYSINEQGKPVLLKGQYSVIVGNA 712
>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
Length = 748
Score = 366 bits (939), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 254/762 (33%), Positives = 386/762 (50%), Gaps = 98/762 (12%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL---GDFAHG----VPRLGLPQYEWWS 100
+ F F D +LP+ R++DL+ R+T E V Q+ G +G +PRLG+ Y W +
Sbjct: 23 EAQPFPFRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWNT 82
Query: 101 EALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--- 156
E L G D PG AT+FP + A+F+ L ++ A +TE RA +N
Sbjct: 83 ECLRG----------DAEAPGWATAFPQALGLAAAFSPELVYRVANATATEVRAKHNSFV 132
Query: 157 -LGR----AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
GR GL+ +SP +N+ R P WGR ET GEDP++ A ++V+GLQ
Sbjct: 133 AAGRYDDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQGQ------ 186
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
+ R +K S+ CKH++ + V R FDA+V E+D TFL F+ CV+ G +
Sbjct: 187 ---HPRYIKASAGCKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFLPQFQACVRAG-SY 242
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
S MCSYNR+NG+P+CA+ KLL +RGEW GY+V+D ++++++ H++ E A+
Sbjct: 243 SFMCSYNRINGVPACANKKLLTDILRGEWGFEGYVVSDEGAVELILLGHRYTHTFLETAI 302
Query: 332 AQTLKAGLDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
A ++ AGL+L+ N A+ G + + ++ L+ +RLG FD
Sbjct: 303 A-SVNAGLNLELSYGMRNNVFMHIPKALAMGNITLEMLRDRVRPLFYTRLRLGEFDPPAM 361
Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y +L + S E+ L+ EAA + VLLKN ++TLPL K +AVVGP A+
Sbjct: 362 NPYNALELSVVQSSEHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKRLAVVGPFADNPRV 421
Query: 446 MIGNYAGIP-CRYM-SPIAGFSGY-ANVTYKTGCDDVAC--KSNNSIFAASEAAKTADAT 500
+ G+YA +P +Y+ +P G ANV++ GC + C S + + A + AD
Sbjct: 422 LFGDYAPVPEPQYIYTPRRGLQTLPANVSFAAGCREPRCWVYSRDEV---ENAVRGADVV 478
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETN 559
++ G + VE E+ DR+DL LPG+Q QL+ A G PVIL++ +AG +D+++A+ +
Sbjct: 479 LVCLGTGIDVEMEARDRKDLSLPGHQLQLLQDAVRAAAGHPVILLLFNAGPLDVSWAQLH 538
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGK--FNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
+ AIL +P + G AIA V+ GK +P GRLP TW G + +P P++
Sbjct: 539 DGVGAILACFFPAQATGLAIASVLLGKQGASPAGRLPATWPAG--MHQVP-------PME 589
Query: 618 SLGYPGRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
+ GRTY++Y LYPFGYGLSYT F Y L + + L C NL+ +
Sbjct: 590 NYTMEGRTYRYYGQEAPLYPFGYGLSYTTFHYRDLVLSPPV------LPICANLSVS--- 640
Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
V +N G D +VV +Y + Q++ F+R
Sbjct: 641 ----------------------VVLENTGPRDSEEVVQLYLRWEQPSVPVPRWQLVAFRR 678
Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
V V AG ++ F A + + L G T+F G
Sbjct: 679 VAVPAGGATKLSFGVTAAQR---AVWMQQWHLEPGAFTLFAG 717
>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
4H-3-7+YE-5]
Length = 733
Score = 365 bits (938), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 241/750 (32%), Positives = 373/750 (49%), Gaps = 86/750 (11%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ LP R+ L+ MTL EK QL + + RLGLP+Y++W+EALHGV+ G
Sbjct: 26 WFDTQLPTQERIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNG--- 82
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
AT FP I A+F++ L K +S EARA +N+ +GLT+W
Sbjct: 83 -------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ + V GLQ + + LK ++
Sbjct: 136 TPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGD---------HPKYLKTAAAA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDA + +DM ET+ FE + E + +VM +YNRVNG P+
Sbjct: 187 KHFA---VHSGPEALRHEFDAIASPKDMYETYFPAFEALITEANVETVMAAYNRVNGHPA 243
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
LLN +R +W G++V+DC + HK A++ E A A + G DL+CG
Sbjct: 244 GGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-ALAINTGTDLNCGA 302
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIE 403
Y N +AV+ G V E IDK L + +LGFFD Y ++ + S+ + +
Sbjct: 303 VY-NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNISADVVNSEAHAQ 361
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
+A E A + IVLL+N N LPL+ ++ + V GP A+++ ++GNY G+ + + + G
Sbjct: 362 VAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYYGLSGKTTNILDG 420
Query: 464 FSGYANV----TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES----- 514
+ +V YK G N + EA + D I + GL + E E
Sbjct: 421 ITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIA 480
Query: 515 ----LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
DR L LP +Q + ++ + PVI+V+ + G + E AI++A Y
Sbjct: 481 SPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA--GTPVNLTEIAELADAIVFAWY 538
Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
PG+EGG+A+AD++FG+ +P GRLPIT+ P + L P D GRTY++
Sbjct: 539 PGQEGGKAVADILFGERSPSGRLPITF---------PKSEAQLPPYDDYSMQGRTYRYMT 589
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
+YPFG+GLSY Q K++ ++ T Q +K + N+ T
Sbjct: 590 QEPMYPFGFGLSYAQVKFDNITLGNT-QALASKNEPQENMTVT----------------- 631
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
V+ N G + +VV +Y K P + + + GF R+ + AG+ +++ F
Sbjct: 632 --------VNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLAAGQTEQVLFS 683
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
K L ++ +L G++++ VGN
Sbjct: 684 I-PKKHLYSINEQGKPVLLKGQYSVIVGNA 712
>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 723
Score = 365 bits (938), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 254/759 (33%), Positives = 372/759 (49%), Gaps = 115/759 (15%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + SL + R DLVSR+TL+EK+ + + + V RLG+ YEWW+EALHGV+ G
Sbjct: 25 YQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGVARNGL-- 82
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
AT +P I ASFN++L ++ ++S EAR Y R GLT+W
Sbjct: 83 --------ATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKRYTGLTFW 134
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ R ++ V GLQ + N++ K +C
Sbjct: 135 TPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQ--------NTKYNKTHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KHYA + W +R+ F+A + +D+ ET+L F+ V +G+ VMC+YNR G P
Sbjct: 187 KHYAVHSGPEW---NRHSFNAENINPRDLWETYLPAFQDLVIQGNVKEVMCAYNRFEGDP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA-----DSKEDAVAQTLKAGL 339
C +LL +R EW+ G +V+DC +I DN F +K DA A + +G
Sbjct: 244 CCGSDRLLINILRNEWNYKGLVVSDCGAI----DNFYFKGRHETHKNKADASAAAVLSGT 299
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSD 399
DL+CG+ YT +AV++G + E+ ID+SL L LG D + + L +
Sbjct: 300 DLECGRSYTGLI-SAVKEGLINESAIDQSLCRLMKARFELGEMDDTTPWDQLPDSLLSCH 358
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
+ +LA + ARE + LL+N +N LPL+ K TVA++GP+AN +V NY G P ++
Sbjct: 359 AHQQLALQMARESMTLLQNHKNILPLD--KEMTVALIGPNANDSVMQWANYNGFPVHTIT 416
Query: 460 PIAGFSGY---ANVTYKTGCDDVACK------SNNSIFAASEAAKTADATIILAGLDLSV 510
+ G + Y + Y + K N I A A AD I G+ S+
Sbjct: 417 LLEGLTQYLPQERLIYIPQKNIEVQKYPWVNYYPNDIQAVINQAAKADVIIYAGGISASL 476
Query: 511 EAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
E E + DR + LP Q +L+ + K P++ V S G + +
Sbjct: 477 EGEEMDVDAEGFRGGDRTTIELPNVQRKLVKALKATGK-PIVFVNFS--GCAMGLQPESQ 533
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
AIL A YPG+ GG AIA+V+FG +NP GRLPIT+Y D LP +
Sbjct: 534 ICDAILQAWYPGQAGGTAIAEVLFGDYNPAGRLPITFYKKD--NQLP-------DFEDYN 584
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
GRTY++ N LYPFG+GLSYT F Y+ T + KL
Sbjct: 585 MQGRTYRYLNYEPLYPFGHGLSYTTFSYS------TPFIENGKL---------------- 622
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
KV N G+ +G +V+ +Y K + +K + GFQR+ +
Sbjct: 623 -----------------KVKVTNSGNYNGDEVIQLYIKRYDDPDGP-LKTLRGFQRIHIP 664
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
AG+ + F + + D +NT+ P G + I VG
Sbjct: 665 AGQTSEVSFPLTS-DTFTWWDKDSNTVHPLQGRYKILVG 702
>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
Length = 734
Score = 365 bits (936), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 258/754 (34%), Positives = 379/754 (50%), Gaps = 105/754 (13%)
Query: 51 SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
+F + D++ + R K LV+ +TL+EK+ + D + + RL +P+Y WW+E LHGV+ G
Sbjct: 38 NFEWFDTNKSFEKRAKALVASLTLEEKISLMVDQSAPIDRLNIPEYNWWNECLHGVARNG 97
Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGR----AGL 162
AT FP I A+F++ L K+ A+STEARA +N +G AGL
Sbjct: 98 R----------ATVFPQAIGLAATFDQDLIFKVADAISTEARAKFNASIAIGNRGKYAGL 147
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+W+PNIN+ RDPRWGR ET GEDP++ + VN+V+GLQ G+ + + LK +
Sbjct: 148 TFWTPNINIFRDPRWGRGQETYGEDPYLTSQIGVNFVKGLQ---GN------HPKYLKSA 198
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+C KHYA V + R+ FDA +++DM ET+L FE VKE VM +YNRVNG
Sbjct: 199 ACAKHYA---VHSGPEELRHEFDAIASKKDMAETYLPAFEALVKEAKVEGVMGAYNRVNG 255
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF--LADSKEDAVAQTLKAGLD 340
+CA P LL + ++ W GYIV+DC + D HKF + + E++ A L GL+
Sbjct: 256 EGACASPYLLEKLLKDTWGFKGYIVSDC---WALSDLHKFHKVTQTAEESAAAALNVGLN 312
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICS 398
++CG Y G A++QG E +D L++ +LGFFD S Y + + S
Sbjct: 313 VNCGNVYPALDG-AIKQGLTSEKQLDNVLQHQLLTRFKLGFFDPSNNNPYNKITTDVVDS 371
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
+ + +A EAA++ IVLLKN+ N L +K+V V GP+A ++GNY G+ +
Sbjct: 372 EAHRAIALEAAQKSIVLLKNNNNLL-PLKKDLKSVYVAGPNAAREDVLLGNYYGVTSKTQ 430
Query: 459 SPIAGF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
+ + G S ++ YK G N ++ E ++ AD II+ GL + E E
Sbjct: 431 TILDGIVSKVSAGTSINYKQGLLPFQKNVNPIDWSTGEISR-ADVGIIVMGLSGNYEGEE 489
Query: 515 ---------LDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTNIKA 564
DR D+ LP Q I ++ G P++LV+ GG IA E + A
Sbjct: 490 GEAIASESKGDRVDIRLPQNQIDYIKKIKAKNTGNPLVLVL--TGGSPIAMPEVYDLVDA 547
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
I++A YPGEEGG+A+AD++FG P G+LPIT+ P + L P + GR
Sbjct: 548 IVFAWYPGEEGGQAVADILFGDVVPSGKLPITF---------PKSVDDLPPYNDYAMKGR 598
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TYK+ +PFG+GLSYT FKY+ L Y AS
Sbjct: 599 TYKYMTKTPQFPFGFGLSYTSFKYDNLKV------------------YKEKAS------- 633
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
N G+ D +V VY P + ++GF RV ++AG
Sbjct: 634 --------------FSITNNGNVDAEEVAQVYVSSPNAGKGDPLNTLVGFTRVSLKAGAT 679
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
K++ F+ K+ D + G +TI VG
Sbjct: 680 KQVSIPFSK-KAFVQFDSDGKEITRKGTYTIHVG 712
>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
(Beta-D-glucoside glucohydrolase) [Treponema
azotonutricium ZAS-9]
Length = 706
Score = 365 bits (936), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 240/750 (32%), Positives = 386/750 (51%), Gaps = 106/750 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R+K+++S+MTL+EKV QL A V G+P+Y WW+E LHGV+ G AT
Sbjct: 6 RIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAGL----------AT 55
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------GLTYWSPNINVARDP 175
FP I A+F+E+ + + A+S E RA YN + R GLT+W+PN+N+ RDP
Sbjct: 56 VFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNVNIFRDP 115
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ GR + +++GLQ ++ LKV++C KHYA V +
Sbjct: 116 RWGRGQETYGEDPYLTGRIGLAFMKGLQGD---------DTEHLKVAACAKHYA---VHS 163
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
R+ FDA V+++D+ ET+L F++ V+ G +VM +YNR G P LL +
Sbjct: 164 GPEKLRHTFDAVVSKKDLFETYLPAFKLLVENG-VEAVMGAYNRTLGEPCGGSTYLLKEI 222
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+RG W G++ +DC +I+ +NHK + S E++ A L AG DL+CG Y T +
Sbjct: 223 LRGRWGFKGHVTSDCWAIRDFHENHK-VTKSPEESAAMALNAGCDLNCGCTYPYLTVSH- 280
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGI 413
++G V + ID +L L +LG FD Q Y +LG + +++ LA EAA++ I
Sbjct: 281 KKGLVTDETIDTALTRLLRTRFKLGLFDPPEQDPYRNLGNDIVGCEKHRNLALEAAQKSI 340
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN---- 469
VLLKND N LPL+ + + + ++GP A + ++ NY G+ R ++ + G +
Sbjct: 341 VLLKNDSNILPLDDS-ARKILLMGPGAANILTLLANYYGMSSRLVTILEGLAEKIKTKTA 399
Query: 470 VTYKTGCDDVACKSN---NSIFAASEAAKTA--------DATIILAGLDLSVEAE----- 513
++++ + + N N F ++ A D I + GLD S+E E
Sbjct: 400 ISFEYRQGSLMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDLVIAVYGLDGSMEGEEGDSI 459
Query: 514 ----SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
+ DR+ + LP +Q + ++ + K V+++ GG IAF E + A+L+A
Sbjct: 460 ASDANGDRDTIELPSWQLNFLRRIRKAGKKVVLIL---TGGSPIAFPEDLAD--AVLFAW 514
Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
YPGE+GG A+AD++FG +P G+LPIT+ P ++ L P D GRTY++
Sbjct: 515 YPGEQGGNAVADILFGDVSPSGKLPITF---------PQSTAQLPPYDDYALKGRTYRYM 565
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
LYPFG+GLSYT F+++ +++++ +K+ ++
Sbjct: 566 KETPLYPFGFGLSYTSFRFD------SVELSSSKISAGNSV------------------- 600
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
+ KV N G D +VV +Y + GF+R+ + AG++ ++
Sbjct: 601 ------KAKVQVSNTGKRDAEEVVQLYIAKDNRSEDEPASSLRGFRRLKILAGKSASVEI 654
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
A I A+ L+P G +T+ +
Sbjct: 655 ELPASAFETINAEGASVLIP-GSYTVIAAD 683
>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
WAL-18680]
Length = 695
Score = 364 bits (934), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 218/603 (36%), Positives = 326/603 (54%), Gaps = 66/603 (10%)
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
LV +MTL+E+ Q+ A VPRLG+P Y WW E LHGV+ G AT FP
Sbjct: 13 LVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAGT----------ATMFPQ 62
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYWSPNINVARDPRWGR 179
I A F+ L ++I VSTE RA YN GLT+WSPN+N+ RDPRWGR
Sbjct: 63 AIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNVNIFRDPRWGR 122
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
ET GEDP++ R +VRGLQ H LK+++C KH+A V +
Sbjct: 123 GHETYGEDPYLTSRLGTAFVRGLQGDGEH----------LKIAACAKHFA---VHSGPEA 169
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
R+ F A +++D+ ET+L FE CVKE SVM +YN +G P CA+ L+ + +RG+
Sbjct: 170 LRHEFWADTSKKDLWETYLPAFEACVKEAHVESVMGAYNSYHGEPCCANTLLMEEILRGQ 229
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGK 359
W G+ V+DC +I+ N+ + D+ ++ A +K G DL+CG Y A ++G
Sbjct: 230 WGFEGHFVSDCWAIRDFHMNY-MVTDTAMESAALAVKKGCDLNCGNTYLQVL-KACEEGL 287
Query: 360 VKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKND 419
+ + + +++ L+T LG + + +Y + + + E+ ELA EAAR +VLLKND
Sbjct: 288 LDDACVTEAVVRLFTTRYLLGMGEET-EYDDIPYEVVECKEHRELAVEAARRSMVLLKND 346
Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTG 475
LPL++ K+ T+AV+GP+A+ A+IGNY G Y + + G V Y G
Sbjct: 347 -GLLPLHAEKLNTIAVIGPNADNRTALIGNYHGTSSCYTTILEGIQDAVGEDVRVLYAEG 405
Query: 476 CD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLDREDL 520
C + + + + A AK +D ++ GLD ++E E S D++DL
Sbjct: 406 CHLFKDRVEHLAVAGDRLSEARIVAKHSDVVVLCVGLDETLEGEEGDTGNSHASGDKKDL 465
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
LP Q +L+ ++ + K PV++ MS +D++ A+ +W YPG EGGRA+A
Sbjct: 466 LLPESQRRLMEEILNLGK-PVVVCNMSGSAIDLSLAQEKAGAVIQVW--YPGAEGGRALA 522
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
D++FGK +P G+LP+T+Y L ++P P + GRTY++ LYPFG+G
Sbjct: 523 DLLFGKASPSGKLPVTFYKD-------LENLP--PFEDYSMDGRTYRYLTAEPLYPFGFG 573
Query: 641 LSY 643
L+Y
Sbjct: 574 LTY 576
>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
Length = 733
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 241/750 (32%), Positives = 372/750 (49%), Gaps = 86/750 (11%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ LP R+ L+ MTL EK QL + + RLGLP+Y++W+EALHGV+ G
Sbjct: 26 WFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNG--- 82
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
AT FP I A+F++ L K +S EARA +N+ +GLT+W
Sbjct: 83 -------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ + V GLQ + + LK ++
Sbjct: 136 TPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGD---------HPKYLKTAAAA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDA + +DM ET+ FE V E + +VM +YNRVNG P+
Sbjct: 187 KHFA---VHSGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYNRVNGHPA 243
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
LLN +R +W G++V+DC + HK A++ E A A + G DL+CG
Sbjct: 244 GGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-ALAINTGTDLNCGA 302
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIE 403
Y N +AV+ G V E IDK L + +LGFFD Y ++ + S+ + +
Sbjct: 303 VY-NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNISADVVNSEAHAQ 361
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
+A E A + IVLL+N N LPL+ ++ + V GP A+++ ++GNY G+ + + + G
Sbjct: 362 VAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYYGLSGKTTNILDG 420
Query: 464 FSGYANV----TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES----- 514
+ +V YK G N + EA + D I + GL + E E
Sbjct: 421 ITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIA 480
Query: 515 ----LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
DR L LP +Q + ++ + PVI+V+ + G + E AI++A Y
Sbjct: 481 SPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA--GTPVNLTEIAELADAIVFAWY 538
Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
PG+EGG+A+AD++FG+ +P GRLPIT+ P + L P D RTY++
Sbjct: 539 PGQEGGKAVADILFGERSPSGRLPITF---------PKSEAQLPPYDDYSMQERTYRYMT 589
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
+YPFG+GLSY Q K++ ++ T Q +K + N+ T
Sbjct: 590 QEPMYPFGFGLSYAQVKFDNITLGNT-QALASKNEPQENMTVT----------------- 631
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
V+ N G + +VV +Y K P + + + GF R+ + AG+ +++ F
Sbjct: 632 --------VNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLAAGQTEQVLFN 683
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
K L ++ +L G++++ VGN
Sbjct: 684 I-PKKHLYSINAQGKPVLLKGQYSVIVGNA 712
>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
Length = 695
Score = 363 bits (933), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 243/698 (34%), Positives = 362/698 (51%), Gaps = 104/698 (14%)
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
+VS+MTL EK+ Q+ A + RL +P Y +W+E LHGV+ G AT FP
Sbjct: 15 IVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGV----------ATVFPQ 64
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRWGR 179
I A+F+ L K I + +S E RA YN GLT+WSPNIN+ RDPRWGR
Sbjct: 65 AIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPNINLFRDPRWGR 124
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
ET GEDPF+ + V +++GLQ EG + L++++C KH+A V +
Sbjct: 125 GQETYGEDPFLTAQIGVAFIKGLQG-EG---------KYLRLAACTKHFA---VHSGPEA 171
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
DR++FDA V +D+ E +L F+ ++E D S M +YN +NG P+C + +L+ +T+ G+
Sbjct: 172 DRHYFDAVVNPKDLNEFYLPQFKAAIEEADVESFMGAYNAINGQPACVNEELIAKTLLGK 231
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGK 359
W G++V+D +++ + +NH + + E +A +K G +L C ++ AV +G
Sbjct: 232 WGFEGHVVSDYAALEDVHENHHYTQTAAE-TMALAMKIGTNL-CAGKISDALFEAVGKGL 289
Query: 360 VKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKND 419
V ET+I S+ LYT +RLG F Y ++ + S E+ L+ +AA + +VLLKND
Sbjct: 290 VTETEITASVVKLYTTHVRLGMFAEDNDYDTIPYEVNASAEHEMLSLKAAEKSMVLLKND 349
Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYANVTYKTG 475
N LPL+ +++K+VAV+GP A A+ GNYAG Y + ++G S A VTY G
Sbjct: 350 -NFLPLSQSEIKSVAVIGPTARNIGALEGNYAGTANHYETFVSGIQQALSNQARVTYALG 408
Query: 476 CDDVACKSNNSIFAASE-------AAKTADATIILAGLDLSVEAE---------SLDRED 519
C A + +S+ A+E AA+ AD ++ GLD ++E E S D+
Sbjct: 409 CHLYADHAESSLSRANERESEAIIAAEHADIAVLCVGLDPTIEGEQGDAGNVYGSGDKPS 468
Query: 520 LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAI 579
L LPG Q +LI +V E K VILV+ S + + E +T +KAI+ A YPG GG A+
Sbjct: 469 LSLPGQQKRLIEKVLETGK-TVILVLTSGSALSLEGLEKHTGVKAIIQAWYPGAHGGTAL 527
Query: 580 ADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
A+++ GK +P G+LP+T+ Q LP S RTY+ LYPFGY
Sbjct: 528 ANILLGKVSPSGKLPVTFCKD--TQGLPDFS-------DYSMAERTYQNTQLEVLYPFGY 578
Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV 699
GL+Y + KT+Q+ D V
Sbjct: 579 GLTYGHAE------IKTLQL---------------------------------DDLTLSV 599
Query: 700 DFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
+N G D +V+ VY K +E A K +I F+R+
Sbjct: 600 TAENKGDYDIEEVIQVYVKINSEFAPKNHK-LIAFKRI 636
>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
siraeum DSM 15702]
Length = 691
Score = 363 bits (933), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 229/639 (35%), Positives = 349/639 (54%), Gaps = 78/639 (12%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D L R L ++ +E+ QQL A + + GLP Y WW+E LHGV+ G
Sbjct: 4 YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I A+F++ + ++G+ +STEARAMYN GLT W
Sbjct: 62 --------ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLW 113
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ R VN+V+G+Q E + L+ ++C
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEEEY----------LRAAACA 163
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDARV+E+DMEET+L F+ VKEG VM +YNRVNG PS
Sbjct: 164 KHFA---VHSGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPS 220
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CA KL+ + EW GY V+DC +I+ HK + D+ + A LKAG D++CG
Sbjct: 221 CASEKLMGKLR--EWGFDGYFVSDCWAIRDFHTTHK-ITDTAPQSAAMALKAGCDVNCGN 277
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y + A+++G + + +I + + +RLG D + ++ L I D N L+
Sbjct: 278 TYLHILA-ALEEGLITKQNIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACDGNKALS 335
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
EAA + +VLL ND LPL+ +++ ++AV+GP+A++ A++GNY G P R ++ + G
Sbjct: 336 LEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQ 394
Query: 464 --FSGYANVTYKTGCDDVACKSN------NSIFAASEAAKTADATIILAGLDLSVEAE-- 513
F G V Y GC ++ + A A + AD T++ GLD ++E E
Sbjct: 395 DAFDG--RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDATLEGEEG 452
Query: 514 -------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
S D+ DL LP Q L+ ++ + K P+I+V+ + V+ T A++
Sbjct: 453 DTGNEFASGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVN-----TECEGNALI 506
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPGRT 625
A YPG+ GG+A+A+++FG+ +P G+LP+T+Y MLP T ++ RT
Sbjct: 507 NAWYPGQYGGKALAEILFGEVSPSGKLPVTFYKS--ADMLPDFTDYSMK--------NRT 556
Query: 626 YKFYNGPT--LYPFGYGLSYTQFKYNLLSFT-KTIQVNL 661
Y+F + + LYPFGYGL+Y+ F+ +S+ T+ VN+
Sbjct: 557 YRFCDDESNVLYPFGYGLTYSHFECGDISYKDNTLAVNV 595
>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
Length = 691
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 230/639 (35%), Positives = 348/639 (54%), Gaps = 78/639 (12%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D L R L ++ +E+ QQL A + + GLP Y WW+E LHGV+ G
Sbjct: 4 YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I A+F++ + ++G+ +STEARAMYN GLT W
Sbjct: 62 --------ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLW 113
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ R V++V+G+Q E + L+ ++C
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGEEEY----------LRAAACA 163
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDARV+E+DMEET+L F+ VKEG VM +YNRVNG PS
Sbjct: 164 KHFA---VHSGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPS 220
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CA KL+ + EW GY V+DC +I+ HK + D+ + A LKAG D++CG
Sbjct: 221 CASEKLMGKLR--EWGFDGYFVSDCWAIRDFHTTHK-ITDTAPQSAAMALKAGCDVNCGN 277
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y + A+++G + + DI + + +RLG D + ++ L I D N L+
Sbjct: 278 TYLHILA-ALEEGLITKQDIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACDGNKALS 335
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
EAA + +VLL ND LPL+ +++ ++AV+GP+A++ A++GNY G P R ++ + G
Sbjct: 336 LEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQ 394
Query: 464 --FSGYANVTYKTGCDDVACKSN------NSIFAASEAAKTADATIILAGLDLSVEAE-- 513
F G V Y GC ++ + A A + AD T+I GLD ++E E
Sbjct: 395 DAFDG--RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVICVGLDATLEGEEG 452
Query: 514 -------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
S D+ DL LP Q L+ + + K P+I+V+ + V+ T A++
Sbjct: 453 DTGNEFASGDKPDLRLPEVQRVLLQNLKDTGK-PLIIVLAAGSSVN-----TECEGNALI 506
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPGRT 625
A YPG+ GG+A+A+++FG+ +P G+LP+T+Y MLP T ++ RT
Sbjct: 507 NAWYPGQYGGKALAEILFGEVSPSGKLPVTFYKS--ADMLPDFTDYSMK--------NRT 556
Query: 626 YKFYNGPT--LYPFGYGLSYTQFKYNLLSFT-KTIQVNL 661
Y+F + + LYPFGYGL+Y+ F+ +S+ T+ VN+
Sbjct: 557 YRFCDDESNVLYPFGYGLTYSHFECGDVSYKDNTLAVNV 595
>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 769
Score = 362 bits (930), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 238/666 (35%), Positives = 352/666 (52%), Gaps = 65/666 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEALHGVSN 108
FC++SL + RV+DL+SR+ L EK L A PR +GLP+Y W + +HGV +
Sbjct: 36 FCNTSLSTADRVEDLLSRLPLQEKATLL--TARASPRGNMSSIGLPEYNWGANCVHGVQS 93
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---------R 159
GT+ TSFP + A F+ + + Q + E RA++ G
Sbjct: 94 TC-GTNC------PTSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGPH 146
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GL WSPNIN+ RDPRWGR TETP EDP V +Y V Y RGLQ EG + R L
Sbjct: 147 LGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQ--EGKRQ----DPRFL 200
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+ KHYAAY +N+ GV+R FDA V+ D +T+ F V +G+A VMCSYN
Sbjct: 201 QAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSYNS 260
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
VNGIP CA+ +L+ +RG GY+ +D +++ + D H + ADS+ +A + AG
Sbjct: 261 VNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHY-ADSQCEAARLAILAGT 319
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDIC 397
D++ G+ Y V +++E +D +L++ + LG FD Y ++ ++
Sbjct: 320 DINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFDPIDDQPYWNVTPSEVN 379
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR- 456
+ L+ A R+ +V+L+N+ + LPL K +AV+GPHA + ++GNY G C
Sbjct: 380 TAAAKALSLNATRKSLVMLQNNASVLPLQ--KGVKLAVLGPHAKSKRGLLGNYLGQMCHG 437
Query: 457 ----------YMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
+ I +G +N T+ GC ++ S A AAK ADA ++ G+
Sbjct: 438 DYDEVGCVQTPLDAIRAANGASNTTFAEGC-GISGNSTAGFEKAVAAAKEADAVVLFLGI 496
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
D S+E E DR ++ LP Q QL+ +V V + P ++V+++ GGV I E A++
Sbjct: 497 DKSIEGEVGDRNNIDLPNIQMQLLQRVHAVGR-PTVVVLIN-GGV-IGAEEIIERTDALV 553
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
A YPG G RA+ADV+FG NP G+LP+T Y DYV + + SM D +PGRTY
Sbjct: 554 EAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSM-----DMTAHPGRTY 608
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT----SDASKTRCP 682
+++ G ++PFG+GLSYT F ++ S T N H N ++ SD +
Sbjct: 609 RYFKGEPVFPFGWGLSYTTFSLSVDSGT-------NSSSHSNNAAFSGGEVSDTANVTIS 661
Query: 683 GVLVND 688
V+ ND
Sbjct: 662 VVVKND 667
>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
Length = 750
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 246/768 (32%), Positives = 387/768 (50%), Gaps = 96/768 (12%)
Query: 41 RFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL----GDFAHGVPRLGLPQY 96
RF+ + SF F + SLP R+ DL+SR+T+++ + Q G F G+ RLG+
Sbjct: 26 RFAPSSHALDSFPFRNVSLPIETRLNDLISRLTIEDAINQTVARYGKFTPGIERLGIKPI 85
Query: 97 EWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN 156
E+ +E L GV AT FP + ASF+ L +++ AVS E RA YN
Sbjct: 86 EYITECLRGVRR-----------ENATGFPQALGLAASFSRDLMQRVATAVSVEVRAFYN 134
Query: 157 -------LGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
G G+T +SP IN+ R P WGR ET GEDP++ G A YV GLQ +
Sbjct: 135 HDIQRETYGAHGITCFSPVINILRHPLWGRNQETYGEDPYLSGELASQYVSGLQGDD--- 191
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
R L+VS+ CKH+ A+ + V ++ FDA++ E+D++ TFL F+ C+
Sbjct: 192 ------PRYLRVSAGCKHFDAHGGPDTIPVRKFGFDAKIEERDLQMTFLPAFKKCIA-AK 244
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+VMCS+N +NG+PSCA+ +LL +R +W G++V+D +++ + H + S E
Sbjct: 245 PYNVMCSFNSINGVPSCANKRLLTDVLRAQWGYEGFVVSDDAAVEYIFTEHHY-NSSFET 303
Query: 330 AVAQTLKAGLDLD-CGQY---YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
A + +K+G +++ G++ Y T A+ + + + ++ ++++ ++ LG FD
Sbjct: 304 AAVEAIKSGCNMELVGKFDPSYWQLT-KALNEHLITKDELMENVRPVFLTRFLLGEFDPP 362
Query: 386 P--QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
+ + K + S E+ LA EAA + VLLKND+N LPL +KTVAVVGP +N T
Sbjct: 363 ALNPFNQITKDVVLSAEHQRLALEAAVKSFVLLKNDRNFLPLLKNSLKTVAVVGPMSNYT 422
Query: 444 VAMIGNYA--GIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
+IG+Y+ P ++P+ G A NV + +GC + C + A+ A A
Sbjct: 423 DGLIGDYSTDTDPSLILTPLHGIKKLAPNVQFASGCSNSTCTDYRATDVAA-AVDGAQVV 481
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETN 559
+ G VEAE+ DR D+ LPG Q QL+ A G PV+L++ + G +D+ FA+
Sbjct: 482 FVALGTGFIVEAENNDRSDIVLPGAQLQLLKDAVYHANGRPVVLLLFNGGPLDVTFAQLT 541
Query: 560 TNIKAILWAGYPGEEGGRAIADVVF---GKFNPGGRLPITWYNGDYVQMLP-LTSMPLRP 615
+ I +I+ +P G AI ++ G +P GRLP+TW Y+ +P +T ++
Sbjct: 542 SGIVSIVECFFPAMMTGEAIYRMLINNEGISSPAGRLPLTW--PAYLNQVPNITDYTMK- 598
Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
GRTY++Y LYPFGYGLSYTQFKY+ L T + + K Q R
Sbjct: 599 -------GRTYRYYTEDPLYPFGYGLSYTQFKYSDLKVTP---LEVTKGQEIR------- 641
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIV-----YSKPPAEIAATYIKQ 730
KV N+G D +V I+ S P EI Q
Sbjct: 642 ---------------------VKVKVTNIGLYDADEVRIIVVQAYVSWPKTEIPVPRW-Q 679
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++ F R+ + +G+++ ++ A + + GE T+++G
Sbjct: 680 LVAFDRIHIASGKSETVELTIEASLLEVWQNPETGFDILEGEMTLYIG 727
>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
Length = 743
Score = 360 bits (924), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 245/768 (31%), Positives = 379/768 (49%), Gaps = 109/768 (14%)
Query: 46 GLQMSSF----LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
G M+S ++ D +L + R +DLVSRMTL+EK+ Q+ A + RLG+P Y WW+E
Sbjct: 18 GSHMASMTQIPVYRDENLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNE 77
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-- 159
ALHGV+ G +T FP I A+F+ L +K +STE RA Y+ +
Sbjct: 78 ALHGVARAGV----------STMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRK 127
Query: 160 ------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
GLT+WSP IN+ RDPRWGR ET GEDP++ R AV+++RG+Q
Sbjct: 128 GDRDIYKGLTFWSPTINIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQG--------- 178
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
R LK ++C KH+A V + +R+ F+A V+++D+ ET+L FE VKE + V
Sbjct: 179 -RGRYLKAAACAKHFA---VHSGPESERHQFNAEVSQKDLWETYLPAFEASVKEAKVAGV 234
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
M +YNRVNG P C LL +RGEW+ GY+ +DC +I+ + + H + + E++ A
Sbjct: 235 MGAYNRVNGEPCCGSGTLLGDVLRGEWEFGGYVTSDCWAIKDINEGHG-VTKTIEESSAL 293
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+K+G DL+CG Y + A + G + E +ID ++ L MRLG FD + Y S+
Sbjct: 294 AVKSGCDLNCGCAYASLV-KAYRAGLIGEKEIDTAVHRLMLTRMRLGMFDAPEKVPYSSI 352
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ E+ A E A + +VLL+N LPL+ +++++VAV+GP+A++ VA+ GNY
Sbjct: 353 PYEKNDCAEHRAFALEVAEKSLVLLRNRSGFLPLDRSRIRSVAVIGPNADSRVALEGNYN 412
Query: 452 GIPCRYMSPIAGF----SGYANVTYKTGCD------DVACKSNNSIFAASEAAKTADATI 501
G Y++ + G A V Y G + N+ + A+ AA+ AD +
Sbjct: 413 GTASEYVTVLDGIREAVGDRARVYYAEGSHLFRNSMGGLSQKNDRLAEAAAAAERADVAV 472
Query: 502 ILAGLDLSVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
+ GL+ +E E + D+ DL LPG Q +L+ V PV+LV++S +
Sbjct: 473 VCLGLNRDIEGEEGDPSNEYPAGDKRDLRLPGLQEELLETVKATGT-PVVLVLLSGSALA 531
Query: 553 IAFAETNTNIKAILWAGYPGE--EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
+ +A+ N + A++ A YPG EG R +FG P G P + TS
Sbjct: 532 VNWADENAD--AVVQAWYPGAQAEGRRG---ALFGIIRPAGGFP------SRSTVRTRTS 580
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
+ P G LYPFGYGLSYT+F+Y L
Sbjct: 581 RIFGTIHENRLP-----LLQGDPLYPFGYGLSYTKFQYGDLKLA---------------- 619
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
+++ + E V +N G D +VV +Y + Q
Sbjct: 620 ---------------ASEIPAGEDAEVSVTVRNAGERDSDEVVQLYLQDLESSVPVPKWQ 664
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ GF+RV ++ G + ++F A + + ++D +L G ++ G
Sbjct: 665 LAGFRRVHLKPGESAGVRFTV-AARQMALIDEDGRCVLEPGGFRVYAG 711
>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 747
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 244/723 (33%), Positives = 366/723 (50%), Gaps = 91/723 (12%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG-------VPRLGLP 94
FS + + F F ++SLP+S RV DLV R+TL+E V Q+ G + RLG+
Sbjct: 15 FSLISTILGDFPFRNTSLPWSERVDDLVGRLTLEEIVLQMSRGGTGSNGPAPPIDRLGIG 74
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
Y W +E LHG GP ATSFP A+F+ L ++I A + E RA
Sbjct: 75 PYSWNTECLHGDVAAGP----------ATSFPQAFGLAATFDAVLIEQIANATAYEVRAK 124
Query: 155 YNL--------GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
YN GL+ +SP IN+AR P WGRI ET GEDP++ G A +YV GLQ
Sbjct: 125 YNNYAKHKEYGDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASYVNGLQ--- 181
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
G+ + R + ++ CKH+ AY R FDA+V+++D+ TFL F C++
Sbjct: 182 GN------HPRYVTANAGCKHFDAYAGPEDIPSSRSTFDAKVSDRDLRMTFLPAFHECIQ 235
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
G S+MCSYN +NG+P+CA+ KLL +R EW+ GY+++D +++ + D H + D
Sbjct: 236 AG-THSLMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDM 294
Query: 327 KEDAVAQTLKAGLDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
+ A+A + +GL+L+ + T AV+QG V + + L+ MRLG F
Sbjct: 295 LDTAIA-CVNSGLNLELSSNLEDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEF 353
Query: 383 DGSPQYVSLGKQD---ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
D P+ K D I S E+ EL+ +AA + VLLKN+ LPL K+ +AVVGP
Sbjct: 354 D-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKLAVVGPL 411
Query: 440 ANATVAMIGNYAGIPCRY-MSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTA 497
A+ A+ G+Y+ P Y ++P G + A N +Y +GCD+ C+ +S S A A
Sbjct: 412 ADNVDALYGDYSATPNNYTVTPRNGLARLAGNTSYASGCDNPKCRKYDSGQVKS-AVSGA 470
Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
D ++ G +E+E DR +L LPG Q L+ + PVIL++ +AG +D+++A
Sbjct: 471 DMVVVCVGTGTDIESEGNDRHELALPGKQLSLLQDAVKFGTKPVILLLFNAGPLDVSWAV 530
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFG---KFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
N ++ I+ +P + G A+ + + NP GRLP+TW Q+ P+T ++
Sbjct: 531 ENPAVQTIVACFFPAQATGDALYRMFMNTSPESNPAGRLPMTWPR-SMEQVPPMTDYTMK 589
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
GRTY++ + L+PFG+GLSYT FKY Y +
Sbjct: 590 --------GRTYRYSDADPLFPFGFGLSYTLFKY-----------------------YNT 618
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
AS T ++ D + NVG G +V+ VY Q++GF
Sbjct: 619 SASPTV--------IKSCDTVTIPLTVTNVGDFPGDEVMQVYISWSNASVTVPKLQLVGF 670
Query: 735 QRV 737
+RV
Sbjct: 671 RRV 673
>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
Length = 740
Score = 359 bits (921), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 243/705 (34%), Positives = 355/705 (50%), Gaps = 72/705 (10%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
F D L RV+DLVSR+TL EKV Q+ A +PRLG+P Y +W+E LHGV+ G
Sbjct: 23 FRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNG--- 79
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-----------GL 162
AT FP +I A+++ L ++ A+S EARA ++ A GL
Sbjct: 80 -------RATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQTQQYQGL 132
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+W+PNIN+ RDPRWGR ET GEDP + R A +VRGLQ D LK++
Sbjct: 133 TFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQG--------DTPDTHLKLA 184
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+C KHYA V + +R+ F+ARVT D+ +++L FE V+ SVM +YNR
Sbjct: 185 ACAKHYA---VHSGPENERHTFNARVTPHDLWDSYLPAFEHLVRHARVESVMGAYNRTLD 241
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
P CA LL +R W G++V+DC +++ + + H+ D E A A L G DL
Sbjct: 242 EPCCASQFLLLDILRERWGFEGHVVSDCWALRDIHETHRITTDPVESA-ALALTKGCDLA 300
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-----PQYVSLGKQDIC 397
CG + G AVQ+G + E DID++L +LG FD + P + I
Sbjct: 301 CGTTF-ELLGEAVQRGLITEADIDRALSRHLRARFKLGMFDPADDNRNPWSNPPAPEAIV 359
Query: 398 S-DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
+ + LA EAA VLL+N + LPL V+++ + GP A A++GNY G+P R
Sbjct: 360 TCAAHTALACEAAVASCVLLQNHNHILPLRP-DVRSIYITGPLAATQDALLGNYYGLPPR 418
Query: 457 YMSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
++ + G + Y+ G K N +A + A + D TI GL +E
Sbjct: 419 AITLLDGLAAALPEGIRADYRPGALLSTPKQNALEWAEFDCA-SCDVTIACLGLTALLEG 477
Query: 513 E-------SL--DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
E SL DR+D+ LP Q + + + +G ++VI+ GG ++ ++
Sbjct: 478 EEGEAIASSLHGDRDDISLPPPQRLFLESL--IQRGARVIVILF-GGSALSLGPLADKVE 534
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AILWAGYPG+EGGRA+AD++ G+ +P GRLPIT+Y + + LP P + G
Sbjct: 535 AILWAGYPGQEGGRALADILLGRASPSGRLPITFY--ENINDLP-------PYANYSMRG 585
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV-NLNKLQHCRNLNYTSDASKTRCP 682
RT+++++G +PFG+GL+YT+F Y+ L + N + L L T D
Sbjct: 586 RTHRWFDGTPAWPFGFGLTYTRFTYSDLRVSDVYSPGNDSPLCGSVLLTNTGDHEAAEIV 645
Query: 683 GVLVNDLRCDDY----FEFKVDFQNVGSTDGSDVVIVYSKPPAEI 723
+ + D E DF V G + +S PP I
Sbjct: 646 QIYLTDFDAPGNGPVPRENLADFHRVTLAPGQSRRVEFSIPPEHI 690
>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
Length = 691
Score = 358 bits (920), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 223/625 (35%), Positives = 337/625 (53%), Gaps = 68/625 (10%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D +L R + L MT +E+ QL A V RLG+P Y WW+E +HG++ G
Sbjct: 4 YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I A F++ L KK + S EARA YN GLT W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIYKGLTLW 113
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ + + VRGLQ + + +K ++C
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQG----------DGKVIKAAACA 163
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDA+ +DMEET+L FE VKE SVM +YNRVNG P+
Sbjct: 164 KHFA---VHSGPEAIRHSFDAKANAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPA 220
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CA L+++ EW+ GY V+DC +I+ +NH A++ E + A LKAG D++CG
Sbjct: 221 CASNFLMDKL--KEWEFDGYFVSDCWAIRDFHENHMVTANAIE-STAMALKAGCDVNCGC 277
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y N A+++G V + DI + +L +RLG FD +Y + + E+ ++
Sbjct: 278 TYQNLL-VALEKGAVTKEDIRTACVHLMRTRIRLGMFDKKTEYDDIPYDKVACKEHKAIS 336
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
E A + +V+L+N+ LP++++K KT+AV+GP+A++ A+ GNY G+ RY + + G
Sbjct: 337 LECAEKSLVMLENN-GILPVDTSKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQ 395
Query: 466 GY--ANVTYKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE---- 513
V + GC D V+ ++ + A AAK AD TI+ GLD ++E E
Sbjct: 396 DRFDGRVIFAEGCHLYKDRVSNLAQAGDRYAEAVAAAKFADMTILCLGLDATIEGEEGDT 455
Query: 514 -----SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
S D+ L LP Q +L+ ++ V K PV+ V+ + ++ T + A++ A
Sbjct: 456 GNEFSSGDKNGLTLPPPQRELVKKIMAVGK-PVVTVVCAGSAIN-----TESKPDALIHA 509
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
YPG EGG+A+A+V+FG +P G+LP+T+Y D ++ T ++ GRTY++
Sbjct: 510 FYPGAEGGKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK--------GRTYRY 560
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSF 653
LYPFGYGL+Y K + +
Sbjct: 561 TTENVLYPFGYGLTYGSVKVTKVEY 585
>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
18P13]
Length = 697
Score = 358 bits (919), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 226/625 (36%), Positives = 335/625 (53%), Gaps = 68/625 (10%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + SL R +DL R+T++E+ QL A +PRLG+P Y WW+E LHGV+ G
Sbjct: 9 YLNPSLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAGT-- 66
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
AT FP I A+F+ +L +IG+ +TEARA + R GLT W
Sbjct: 67 --------ATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIYKGLTLW 118
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDPF+ R V +V+G+Q EG + LK ++C
Sbjct: 119 APNINLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQG-EG---------KVLKAAACA 168
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDA+V+ +D+EE++L F V E VM +YNRVNG PS
Sbjct: 169 KHFA---VHSGPEALRHSFDAQVSPKDLEESYLPAFHALVAEAKVEGVMGAYNRVNGEPS 225
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CA P L+++ +W GY V+DC +IQ +H + E A A L+ G DL+CG
Sbjct: 226 CASPMLMDKL--HQWGFAGYFVSDCWAIQDFHKHHGVTKNVTESA-ALALRTGCDLNCGN 282
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y + A+++G + DI ++ + +RLG FD P + + I S + ++
Sbjct: 283 TYL-YVLAALEEGLIDAADIRRACIRVLRTRIRLGLFDPEPHFAACTYDTIASPAHKAVS 341
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
A + +VLLKND LPL+ +K+ +AV+GP+A++ A+ GNY G RY++ + G
Sbjct: 342 LSCAEKSMVLLKND-GILPLDLSKLHAIAVIGPNADSRAALEGNYCGTADRYVTFLEGIQ 400
Query: 466 GY--ANVTYKTGCDDVACKSNNSIFA------ASEAAKTADATIILAGLDLSVEAE---- 513
V Y GC +++N A A AA+ +D I+ GLD ++E E
Sbjct: 401 DAFPGRVHYAQGCHLYKDRTSNLAMADDRYAEALAAAEASDVVILCLGLDATLEGEEGDT 460
Query: 514 -----SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
S D+ DL LP Q +L+ ++ V K PVILV+ + ++ E + N A+L A
Sbjct: 461 GNEFSSGDKADLRLPPPQCKLLEKLHAVGK-PVILVLAAGSALN---PEISCN--AVLQA 514
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
YPG+ GG+A+A ++FGK +P G+LP+T+Y T+ L RTY++
Sbjct: 515 WYPGQCGGQALAHILFGKVSPSGKLPVTFYE---------TAEQLPDFTDYSMQNRTYRY 565
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSF 653
LYPFGYGL+Y + LS+
Sbjct: 566 ARNNVLYPFGYGLTYGKIVCTELSY 590
>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 805
Score = 358 bits (918), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 256/779 (32%), Positives = 383/779 (49%), Gaps = 92/779 (11%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEA 102
+ F FCD+SL S RV+DL+ R+ LDEKV L A P+ +GLP+Y W +
Sbjct: 30 EHQKFPFCDASLSTSERVEDLLRRLPLDEKVTLLT--ARASPKGNMSSIGLPEYNWGANC 87
Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---- 158
+HGV + GT+ ATSFP + A F+ + Q + E RA++ G
Sbjct: 88 VHGVQSTC-GTNC------ATSFPNPVNLGAIFDPQAVFDMAQVIGWELRALWLEGAREN 140
Query: 159 -----RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
GL WSPNIN+ RDPRWGR ETP EDP V +Y V Y RGLQ EG +
Sbjct: 141 YAAGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTRGLQ--EGKDK--- 195
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
R L+ KHYAAY +++ G+DR F+A+V+ D +T+L F V EG A V
Sbjct: 196 ---RFLQAVVTLKHYAAYSYEHYDGIDRMAFNAQVSRYDFADTYLPAFHASVVEGKAKGV 252
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MCSYN VNG+P CA+ +L + +R GYI +D +I+ + + S +A
Sbjct: 253 MCSYNSVNGMPMCANEQLNTKLLREALGFDGYITSDSGAIEGIYRQRHY-TKSLCEAGRL 311
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSL 391
+ +G D++ G Y + V G++ E +D +++ + LG FD Y +
Sbjct: 312 AIMSGTDVNSGSVYKKCLADLVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 371
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
++ E+ +L+ E R+ IVLL+N N LPL K K +AV+GPHA A A++GNY
Sbjct: 372 APSEVGKTESKQLSLELTRKSIVLLQNHGNVLPLR--KGKKLAVIGPHAKAKRALLGNYL 429
Query: 452 GIPCR-----------YMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
G C + I +G +N Y G + S AA AA+ ADA
Sbjct: 430 GQMCHGDYLEVGCVQTPLEAITAANGASNTVYAKGS-GINDTSTADFDAAEAAARGADAV 488
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
++ G+D S+E E+ DRE++ +P Q QL+ +V K P ++V+ + GGV + E
Sbjct: 489 VLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFN-GGV-VGAEELIL 545
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
+ + A YPG G +A++D++FG P G+LP+T Y +Y+ + + SM +
Sbjct: 546 HTDGVAEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYINSVDMKSMSM-----TK 600
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
YPGR+Y++Y ++PFG+GLSYT+F L D
Sbjct: 601 YPGRSYRYYKEVPVFPFGWGLSYTKFTLAL------------------------DGEMPD 636
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP----PAEIAATYIKQVIGFQR 736
P V+ DL V N G G +VV + +P AA +Q+ ++R
Sbjct: 637 DPIVITRDLDQ----TVTVIVSNDGDLVGDEVVFAFFRPLNVNATGDAALLNEQLFDYRR 692
Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG---GVSFPIHLNFNY 792
V +R + +++ F +L +VD + N G + + + NG V+F IHL Y
Sbjct: 693 VSLRPTQYRKLTFRIQQ-STLAMVDDSGNKASFPGFYEVIITNGVHERVTFAIHLVGKY 750
>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
Length = 713
Score = 358 bits (918), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 236/743 (31%), Positives = 375/743 (50%), Gaps = 100/743 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ K +V +MT+DEK+ Q+ A + RL +P+Y +W+EALHGV+ G AT
Sbjct: 13 QAKVIVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAGV----------AT 62
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I A+F++ L I + TE RA YN GLT+WSPN+N+ RDP
Sbjct: 63 VFPQAIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNVNIFRDP 122
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDPF+ ++ V +++GLQ ++ LK+++ KH+A +
Sbjct: 123 RWGRGHETYGEDPFLTSKFGVAFIKGLQG----------QAKYLKLAATAKHFAVHS--G 170
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
+G+ R+ FDA V+++D+ ET+L F+ V+E D S+M +YN V+G+P+ LL
Sbjct: 171 PEGL-RHGFDAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLRDI 229
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+ +W G++V+D + + + +NHK+ D+ E + +KAGL+L G + A+
Sbjct: 230 LHDKWSFEGHVVSDYMAPEDVHENHKYTKDAAE-TMGLAIKAGLNLVAGHIEQSLH-EAL 287
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
+G V E +I ++ LY +RLG F +Y ++ + + + L+ AA + VL
Sbjct: 288 NRGLVTEEEITNAVISLYATRVRLGMFATDNEYDAIPYEANDTKAHNNLSEIAAEKSFVL 347
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKND LPL ++ +AVVGP+A++ +A++GNY G P R + + G V
Sbjct: 348 LKND-GVLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVH 406
Query: 472 YKTGC----DDVA---CKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SL 515
Y G D A K++ A AA+ +D + + GLD ++E E +
Sbjct: 407 YSIGSGVFQDHAAEPLAKADERESEAIIAAEHSDVIVAVLGLDSTIEGEEGDAGNSQGAG 466
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
D+ +L LPG Q QL+ ++ V K PV++++ S + + E + N++AI+ YPG G
Sbjct: 467 DKPNLSLPGRQRQLLERLLAVGK-PVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARG 525
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
G A+ADV+FG +P G+LP+T+Y + L + GRTY++ LY
Sbjct: 526 GLAVADVLFGTVSPSGKLPVTFYK---------NTDNLPAFEDYNMAGRTYRYMTEEALY 576
Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
PFGYGL+Y+ V L+ LQ ++ T+ A+
Sbjct: 577 PFGYGLTYS-------------SVELSDLQ-VKSYEETATAT------------------ 604
Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
V QN G+ D +VV VY K A Q+ GF+RVF+ G + I F +
Sbjct: 605 ---VTIQNTGNFDTDEVVQVYVKDLESEFAVPNAQLKGFKRVFLGKGSKQTITFDLR-PQ 660
Query: 756 SLNIVDYAANTLLPAGEHTIFVG 778
+ D + + + I VG
Sbjct: 661 DFEVFDEQGHNFIDSNRFEISVG 683
>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
Length = 721
Score = 358 bits (918), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 250/761 (32%), Positives = 372/761 (48%), Gaps = 111/761 (14%)
Query: 51 SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
++ F D+ L + R DL R+TL+EK + + + VPRLG+ Q++WW EALHG + G
Sbjct: 23 TYPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQWWGEALHGSARTG 82
Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GL 162
AT FP I ASF++ L ++ STEARA YN+ +
Sbjct: 83 L----------ATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNVAAKKGYFDTSWSV 132
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
+ W+PN+N+ RDPRWGR ET GEDP++ R V GLQ +G + K
Sbjct: 133 SLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQGGKGPH-------KYYKAF 185
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
+C KH+A + W +R+ V+ +D ET+L F+ V+ G VMC+YN ++
Sbjct: 186 ACAKHFAVHSGPEW---NRHSISIDDVSPRDFHETYLPAFKHLVQVGGVKEVMCAYNSID 242
Query: 282 GIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLKAGL 339
G P C+D +LL Q +R EW G +V+DC +I + H+ D+ A A+ +K G
Sbjct: 243 GEPCCSDQRLLEQLLRDEWGFKGIVVSDCGAIDDIWRKGFHEVEPDAAH-ASARAVKGGT 301
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDIC 397
D+ CGQ Y + AV+ GKV E IDKSLK L M+LG FD ++ ++ +D+
Sbjct: 302 DMSCGQTYGSLP-EAVRLGKVTEERIDKSLKRLIVGRMQLGEFDPDSITRWNAISMKDVS 360
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
+ + E+A + ARE + LL N + LPL S ++K V V+GP+AN +V M GNY G P
Sbjct: 361 TPASREVALKMARETMTLLHNPMHALPL-SKQLKQVVVMGPNANDSVMMWGNYNGTPHHT 419
Query: 458 MSPIAGFS---GYANVTYKTGCDDVAC--KSNNSIFAASEAAKTAD-ATII--------L 503
++ + G G V + GC V + N ++ + D T+I L
Sbjct: 420 VTILDGIRRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLVEEVGDNKTVIFVGGISPQL 479
Query: 504 AGLDLSVEAESL---DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
G L VEA+ DR + LP Q ++I + K +++++ G I T
Sbjct: 480 EGEQLEVEAKGFKGGDRVTIELPQVQREMIAALHAAGKQ---VIMVNCSGSAIGLVPEVT 536
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
+ AIL A YPGE GG A+ADV+FG +NP G+LP+T+Y D + +P D L
Sbjct: 537 HTDAILQAWYPGERGGEAVADVLFGDYNPAGKLPVTFYRDD-------SQLP----DYLD 585
Query: 621 Y--PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
Y RTY+++ G L+PFG+GLSYT FK RN T
Sbjct: 586 YNMRNRTYRYFKGKPLFPFGHGLSYTSFKIGKAKM--------------RNGKLT----- 626
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
V +N G DG +VV +Y + IK + GF+R+
Sbjct: 627 --------------------VSVKNTGKRDGEEVVQLYISCLDDPNGP-IKSLRGFKRMA 665
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVG 778
++AG + + KS D NT+ + G++ ++ G
Sbjct: 666 LQAGEQRTVTLNLPR-KSFERFDEQTNTIRVVPGKYRVYYG 705
>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
Length = 699
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 240/741 (32%), Positives = 377/741 (50%), Gaps = 101/741 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ + L++ MTLDEK+ Q+ + G+PRLG+ Y+WW+E LHGV G AT
Sbjct: 12 KARRLINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDGR----------AT 61
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVARDP 175
FP I A+FN +L ++IG A++TE RA YN+ + GLT+WSPNIN+ RDP
Sbjct: 62 VFPQPIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPNINIFRDP 121
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDV 233
RWGR ET GEDPF+ G + YV+G+Q + P LKV++C KHYA V
Sbjct: 122 RWGRGMETYGEDPFLTGTLGIAYVQGMQ-----------GNDPFYLKVAACGKHYA---V 167
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
+ R+ + T++D+ ET+L F+M V++G ++M +YNRV G LL
Sbjct: 168 HSGPEATRHEANVSPTKRDLFETYLPAFKMLVQQGHVEAIMGAYNRVYGEACSGSKYLLT 227
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
+R +W G+IV+DCD++ + HK + ++ +A A +KAGL+++CG +
Sbjct: 228 DVLRKQWGFRGHIVSDCDAVADIHAGHK-IVKTEAEACAIAIKAGLNIECGHTFEAMK-Q 285
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGF--FDGSPQYVSLGKQDICSDENIELAAEAARE 411
AV Q + E +ID++L L ++LG +D Y + + +ICS E+I LA +AA E
Sbjct: 286 AVAQKLLTEQEIDRALLPLMMTRLKLGILEYDAECPYNEVKETEICSPEHIALARKAATE 345
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-----SG 466
+VLLKN+ LPL+ + T+ + GP A+ + ++GNY GI RY + + G SG
Sbjct: 346 SMVLLKNN-GILPLDK-NLHTLFIAGPGASDSFWLMGNYFGISNRYCTYLQGIADKVSSG 403
Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES---------LDR 517
A V ++ + + + N+I A + A A+ TI++ G + ++E E DR
Sbjct: 404 TA-VNFRPAFGE-STPTKNTINWALDEAIAAEKTIVVMGNNGNLEGEEGESIASETRGDR 461
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
+ LP Q + + + G +V++ GG I E + A++ A YPG+EGG
Sbjct: 462 VSMRLPASQMKFLRDLKARKNG---IVVVLTGGSPIDVREISRLADAVVMAWYPGQEGGY 518
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
A+AD++FG N GRLP+T+ P ++ L P + GRTYK+ YPF
Sbjct: 519 ALADLLFGDENFSGRLPVTF---------PESTDALPPFEDYAMKGRTYKYQTAHIQYPF 569
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
GYGLSYT Y H + + + G+ V+ +
Sbjct: 570 GYGLSYTTVTY----------------AHAK-----VETMPQKGRGMTVSAV-------- 600
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
+N G+ +V VY P + ++ F+R+ ++ G + ++F + L
Sbjct: 601 ---LKNTGNKAVDEVAQVYLSAPGAGTTAALASLVAFKRIGLQPGEQQLVRFDIPFDRLL 657
Query: 758 NIVDYAANTLLPAGEHTIFVG 778
+ + LL G +TI VG
Sbjct: 658 TVQEDGTAQLL-KGNYTITVG 677
>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
12058]
Length = 732
Score = 357 bits (915), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 249/776 (32%), Positives = 384/776 (49%), Gaps = 99/776 (12%)
Query: 34 VFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGL 93
+F+ +G+Q ++ F + + RV DL+SR+TL++K Q L V G
Sbjct: 14 IFLSTGAAAQSIGIQ-NNPAFLNQEMSMEARVADLMSRLTLEQKAQLLNHRGKTVVVDGF 72
Query: 94 P-QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
+ + W++ LHGV P T+FPT I A+++ L ++ +S EAR
Sbjct: 73 SIRADQWNQCLHGVKWTEP----------TTNFPTSIALGATWDTELIHRVATVISDEAR 122
Query: 153 AMYNLGR---------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
A+YN + GL Y SP IN++R+P WGRI E GEDP+ GR V YV+GLQ
Sbjct: 123 AIYNGWKQDPEFRGEHKGLIYRSPVINISRNPYWGRINEIFGEDPYHTGRMGVAYVKGLQ 182
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
+ H LK++S KHYA +V+ VDR A+V E+ + E +L F+
Sbjct: 183 GDDSHY---------LKLASTLKHYAVNNVE----VDRMKLSAQVPERMLYEYWLPHFKD 229
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
C+ EG A SVM SYN +NG+P+ + LL ++ +W G++V+D ++ MV+ H
Sbjct: 230 CIVEGKAQSVMASYNAINGVPNNINKLLLTDILKNQWGHEGFVVSDLGGVKTMVEGHHQR 289
Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
S E+AV +++ AG D +Y + +A+++G + E ++ +L+ + V RLG FD
Sbjct: 290 QISCEEAVGRSIMAGCDFSDAEY-EKYIPDALRKGYLTEERLNDALRRVLLVRFRLGEFD 348
Query: 384 --GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
S Y + I E+ L+ EAAR+ IVLLKN++ LP++ + +K VAV+GP+A+
Sbjct: 349 DFKSVPYSRISPDVIGCKEHRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVAVIGPYAD 408
Query: 442 ATVAMIGNYAGIPCRYMSPIAGFSGYA----NVTYKTGCDDVACKSNN------------ 485
+ GNY G+P ++P+ G V Y G K
Sbjct: 409 --LFNQGNYGGVPKDPVTPLQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIPPRFDKEA 466
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
+ A E A+ +D + G +E E DR+ L LPG Q +L+ V EV K V++V+
Sbjct: 467 EMKKAVEMARNSDVVFLFVGTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNK-KVVVVL 525
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
MSAG V A E NI A+L A +PG+EGG AIADV+FG +NPGG+LP T Y D +
Sbjct: 526 MSAGPV--AVPEVKKNIPAVLQAWWPGDEGGNAIADVLFGDYNPGGKLPYTMYASD--EQ 581
Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
+P T + G TY + L+ FG+GLSY++F Y+ L +
Sbjct: 582 VPSTD------EYDISKGFTYMYLKKKPLFAFGHGLSYSKFHYSDLQIS----------- 624
Query: 666 HCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAA 725
P V VND + +N+G G +VV +Y +
Sbjct: 625 ---------------SPVVSVNDT-----VSVVLKVKNMGKRTGEEVVQLYVRDVKAKVV 664
Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVGNG 780
K++ GF+R+ ++ + I+ + KSL D + + L+ G I +G+
Sbjct: 665 RPTKELRGFKRIALQPNEEQEIRLML-PVKSLAFYDESIGDFLVEPGSFEILLGSA 719
>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 763
Score = 355 bits (912), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 241/640 (37%), Positives = 345/640 (53%), Gaps = 69/640 (10%)
Query: 47 LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP--RLGLPQYEWWSEALH 104
L +++ F ++SL + RV DLVSR+TLDE V Q+ + P RLG+ Y W SE LH
Sbjct: 21 LISAAYPFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWNSECLH 80
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------- 156
GV V P D + AT+FP I ASF+ L + +A+ E RA +N
Sbjct: 81 GV--VPP----DGL---ATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYVQRGVY 131
Query: 157 LGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
GL+ +SP IN+AR P WGR ET GEDPF++G YVRGLQ +
Sbjct: 132 QEHTGLSCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQGD---------HP 182
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
R + ++ CKH+ + V R+ FDA+V E+D + TFL F CVK G SVMCS
Sbjct: 183 RYVLANAGCKHFDVHGGPEDIPVSRFSFDAKVFERDWQMTFLPAFHECVKAG-VYSVMCS 241
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
YNR+N +P+CA+ +LL +R EW GY+V+D +++ ++ +H + DS D VA +
Sbjct: 242 YNRINEVPACANTRLLTDILRKEWGFDGYVVSDEGAVEFIMTSHHY-TDSIVDTVASAVN 300
Query: 337 AGLDLDC------GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--- 387
AG +LD G Y G+AV GK+KE + + +K L+ MRLG FD P+
Sbjct: 301 AGCNLDLAFPVGDGMYIK--IGDAVTAGKIKEKTVVERVKPLFYTRMRLGEFD-PPELNP 357
Query: 388 YVSLGKQDICSDENIELAAEAAREG-----IVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
Y +L + S+E+ ELA +AA + VLLK + LPL++ V +AV+GP A+
Sbjct: 358 YANLNLSVVQSEEHRELAVKAALQSFVLLNFVLLKREGRVLPLDTL-VNKLAVIGPFADN 416
Query: 443 TVAMIGNYAGIPCR--YMSPIAGFSGYANVTYKT-GCDDVACKSNNSIFAASEAAKTADA 499
+ G+Y+ P + ++P G S A T T GC C + S + A AD
Sbjct: 417 PSYLFGDYSPNPDKEFVVTPCKGLSNAARDTRCTPGCLTAPCTTYFSEMVKA-AVTGADL 475
Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAET 558
++ G + +EAE +DR DL LPG Q QL+ V + A G P+IL++ +AG +DI +A
Sbjct: 476 IVVCLGTGVKIEAEFVDRSDLSLPGKQFQLLQDVVKYANGKPIILLLFNAGPLDIVWAVE 535
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVF-------GKFNPGGRLPITWYNGDYVQMLPLTSM 611
N I+ I+ +P + G A+ + G NPGGRLPITW P +
Sbjct: 536 NPAIQVIVACFFPSQATGDALYRMFMNTHGVDTGNGNPGGRLPITW---------PRSMN 586
Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
+ P+ + GRTY+++NG L+PFGYGLSY F Y+ L
Sbjct: 587 QVPPMTNYTMEGRTYRYFNGDPLFPFGYGLSYGSFSYSSL 626
>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
Length = 721
Score = 354 bits (909), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 219/623 (35%), Positives = 328/623 (52%), Gaps = 50/623 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF-------AHGVPRLGLPQYEWWSEALHGV 106
FCD SL + R DL R+TLDE QQL + A GVPRLGL Y + +E LHG+
Sbjct: 44 FCDLSLDFRDRAWDLAQRLTLDELAQQLNTYSFTPQAYAPGVPRLGLRNYSYHAEGLHGI 103
Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LG 158
+ + V AT +P V A+ N SL ++ + TE RA+ N G
Sbjct: 104 RDA------NVVNYPATLYPQVTAMAATANASLIHEMSTIMGTELRAVNNRAQELGEIFG 157
Query: 159 RAG-LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
R G L+ + P +N+ RD RWGR E+ EDP++ G YAVN+V GL+ +S+
Sbjct: 158 RGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGLEQRN--------SSK 209
Query: 218 PLKVSSCCKHYAAYDVDNWKG-VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
L+ ++ CKH AY + + + R+ F+A + E D+ +T+L F CV+ G +MCS
Sbjct: 210 YLQAATSCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTYLPAFRACVELGHVQQIMCS 269
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
YN VNGIP+CA + N VR W G IV+DCD++ + + H + + EDAV L+
Sbjct: 270 YNSVNGIPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIYNTHNY-TRTPEDAVTVALQ 328
Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQ 394
G DLDCG +Y+ +AVQQ + +S+ + + LG FD S Y LG++
Sbjct: 329 GGCDLDCGDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFLLGEFDPDTSVPYRQLGRE 388
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLN-SAKVKTVAVVGPHANATVAMIGNYAG- 452
I + + + A+RE +VLL+N LP+ SA +K VA++GP+ N T M+G
Sbjct: 389 AIDTPFARDSSLRASRESVVLLENRIKLLPVTLSADIK-VALIGPYVNLTTIMMGGKLDY 447
Query: 453 IPCRYMSPIAGFS--GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV 510
P + GF G ++T GC+ + ++ A + A AD ++ GL +
Sbjct: 448 TPSFITTYFQGFQAIGITHLTSSPGCN-ITAPLPGALDKAVQIATQADLVVLTLGLSSDI 506
Query: 511 EAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG------VDIAFAETNTNIKA 564
E E DRE L LP Q L + ++ ++V++ GG + A T T I+A
Sbjct: 507 EHEGGDRETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGPVSVDRIKYGIARTPTIIEA 566
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
Y G+ G A+A+ +FG+ NP G LP T + + +P T M LRP + G+PGR
Sbjct: 567 F----YGGQSAGTALAETIFGQNNPSGTLPYTVFFSNITAHVPFTDMHLRPDAATGFPGR 622
Query: 625 TYKFYNGPTLYPFGYGLSYTQFK 647
T++F++ P ++PFG+GLSY+ F
Sbjct: 623 THRFFDAPVMWPFGHGLSYSTFS 645
>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
Length = 704
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 219/611 (35%), Positives = 321/611 (52%), Gaps = 63/611 (10%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ LV++M L EK L + + RLG+P Y WWSEALHGV+ G AT
Sbjct: 8 KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGV----------AT 57
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I A F+E +I ++TEARA YN G+T W+PNIN+ RDP
Sbjct: 58 VFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNINIFRDP 117
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ R V ++ GLQ E H K ++C KH+A V +
Sbjct: 118 RWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHHY--------WKAAACAKHFA---VHS 166
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
+R+HFDA V+++D+ ET+L FE V +G + +M +YNRVNG P+C LL
Sbjct: 167 GPEEERHHFDAVVSKKDLYETYLPAFEAAVTKGKVAGMMGAYNRVNGEPACGSKVLLQDI 226
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
++ EW GY+V+DC +I+ H + E A A + G L+CG Y + A
Sbjct: 227 LKEEWGFDGYVVSDCWAIRDFHTEHMVTHTATESA-ALAINNGCQLNCGNTYLHML-QAY 284
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
++G V E I KS + L + M+LG FD + +Y + + + ++A + AR +VL
Sbjct: 285 KEGLVTEETITKSAQKLMAIRMKLGLFDKNCEYNKIPYEVNDCKVHRDIALDVARRSMVL 344
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
LKN+ LPLN + K + V+GP AN+ + GNY G RY + + G Y A V
Sbjct: 345 LKNN-GILPLNLKQTKAIGVIGPTANSRTVLQGNYFGTASRYTTFLEGIQDYVGDAARVY 403
Query: 472 YKTGC----DDVACKS--NNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
Y GC + ++ S N+ + A A+ +D I+ GLD S+E E + D
Sbjct: 404 YAEGCHLFKNSISGLSWENDRLSEALIVAEQSDVVILCLGLDASIEGEQGDTGNAFAAGD 463
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+ DL L G Q L+ +V ++ K P IL++ S + I A+ +AIL YPG+ GG
Sbjct: 464 KSDLNLIGRQQLLLEEVLKIGK-PTILILSSGSAMAIHTAQEYC--EAILETWYPGQSGG 520
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
+A+A ++FG+++P G+LPIT+Y T+ L GRTY++ LYP
Sbjct: 521 KALAQLLFGEYSPSGKLPITFYK---------TTEELPDFRDYSMAGRTYRYMKNEALYP 571
Query: 637 FGYGLSYTQFK 647
FGYGL+Y + +
Sbjct: 572 FGYGLNYAKVE 582
>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
Length = 721
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 242/741 (32%), Positives = 371/741 (50%), Gaps = 96/741 (12%)
Query: 62 SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
S K++++RMT+ EK+ QL + + + LG+ Y+WWSE LHGV G
Sbjct: 31 SRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR---------- 80
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVAR 173
AT FP I A+F+E+L ++IG AV+TE RA +N+ R AGLT+WSPN+N+ R
Sbjct: 81 ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIFR 140
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGR ET GEDP + G YVRGLQ + LK +C KHYA V
Sbjct: 141 DPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFY---------LKTGACAKHYA---V 188
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
+ R+ D + +D+ ET+L F+M V++G +VM +YNRV G P LL
Sbjct: 189 HSGPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLT 248
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
+R W +G+IV+DCD+I H+++ + E+A A +KAGL+++CG + G
Sbjct: 249 DILRKSWGFNGHIVSDCDAINDFYGGHRYV-KTPEEACAAAIKAGLNVECGHTFKAMQG- 306
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFF--DGSPQYVSLGKQDICSDENIELAAEAARE 411
A+ QG + E D+D++L L ++LG D + Y S + +ICS + LA AA E
Sbjct: 307 ALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALALRAADE 366
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGY 467
+VLLKN+ LPL+ ++T+ V GP A+ ++GNY G+ RY + + G S
Sbjct: 367 AMVLLKNN-GILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSG 424
Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES---------LDRE 518
+V ++ + + N+ +A +EA A+ I++ G + ++E E DR
Sbjct: 425 TSVNFRPAFMQITEELNDMNWAVNEAC-AAEVAIVVMGNNGNMEGEEGEAIASASRGDRV 483
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
+ LP Q + +V + KG I+V+++ GG I E + A++ A YPG+EGG A
Sbjct: 484 GIGLPASQMNYLRRV-KARKGGRIVVVLT-GGSPIDLREISKLADAVVMAWYPGQEGGEA 541
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+ D++FG N GRLPIT+ P L D GRTYK+ +G +YPFG
Sbjct: 542 LGDLLFGDKNFSGRLPITF---------PADVDSLPAFDDYSMNGRTYKYMSGNVMYPFG 592
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
YGLSY + Y +DA +V ++ + +
Sbjct: 593 YGLSYGRVTY-------------------------TDAR-------VVGRIKKGEPLAVE 620
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC-KSL 757
V N G +V Y P + + ++GF+RV + +K VF + L
Sbjct: 621 VVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPP--KSSVKAVFKIVPERL 678
Query: 758 NIVDYAANTLLPAGEHTIFVG 778
+ ++ L G +T+ +G
Sbjct: 679 MTIQSDGSSKLLKGNYTLTIG 699
>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 252/775 (32%), Positives = 386/775 (49%), Gaps = 88/775 (11%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEA 102
+ F FC++SL + RV+DL+ R+ LDEKV L A P+ +GLP+Y W +
Sbjct: 31 EHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLLT--ARASPKGNMSSIGLPEYNWGANC 88
Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---- 158
+HGV + GT+ ATSFP + A F+ + Q V E RA++ G
Sbjct: 89 VHGVQSTC-GTNC------ATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAREN 141
Query: 159 -----RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
GL WSPNIN+ RDPRWGR ETP EDP V +Y V Y +GLQ EG +
Sbjct: 142 YATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQ--EGKDK--- 196
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
R L+ KHYAAY +++ G+DR F+A V+ D +T+L FE V G A V
Sbjct: 197 ---RFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVHGKAKGV 253
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MCSYN VNG+P CA+ +L ++ +R GYI +D +I + + E
Sbjct: 254 MCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLCEAGRLA 313
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSL 391
L +G D++ G Y V G++ E +D +++ + LG FD Y +
Sbjct: 314 IL-SGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 372
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
++ + E+ +L+ + +R+ IVLL+N N LPL AK K +AV+GPHA A A++GNY
Sbjct: 373 APNEVNTAESKQLSLDLSRKSIVLLQNHGNILPL--AKGKKLAVIGPHAAAKRALLGNYL 430
Query: 452 GIPCR--------YMSPIAGFS---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
G C +P+ + G +N Y G + S A AA+ A+
Sbjct: 431 GQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKG-SGINDTSTGGFDEAEAAARKAETV 489
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
++ G+D S+E E+ DRE++ +P Q QL+ +V K P ++V+ + GGV + E
Sbjct: 490 VLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFN-GGV-VGAEELIL 546
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
+ ++ A YPG G +A++D++FG P G+LP+T Y +YV TS+ ++ +
Sbjct: 547 HTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYV-----TSVDMKSMSMTK 601
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
YPGR+Y++Y ++PFG+GLSYT+F L S +S +
Sbjct: 602 YPGRSYRYYKEVPVFPFGWGLSYTRFTMALDS--------------------SSGVTDPS 641
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP----PAEIAATYIKQVIGFQR 736
P V+ L V N G+ G +VV + +P AA +Q+ ++R
Sbjct: 642 EPIVVTRQLDQ----TVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQLFDYRR 697
Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG---GVSFPIHL 788
V +R + +++KF +L +VD + N G + + + NG V+F IHL
Sbjct: 698 VSLRPTQYRKLKFRIQQ-STLAMVDDSGNQASFPGFYEVIITNGVHERVTFAIHL 751
>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
Length = 809
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 251/777 (32%), Positives = 392/777 (50%), Gaps = 92/777 (11%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEA 102
+ F FC++SL + RV+DL+ R+ LDEKV L A P+ +GLP+Y W +
Sbjct: 31 EHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLLT--ARASPKGNMSSIGLPEYNWGANC 88
Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---- 158
+HGV + GT+ ATSFP + A F+ + Q V E RA++ G
Sbjct: 89 VHGVQSTC-GTNC------ATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAREN 141
Query: 159 -----RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
GL WSPNIN+ RDPRWGR ETP EDP V +Y V Y +GLQ EG +
Sbjct: 142 YATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQ--EGKDK--- 196
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
R L+ KHYAAY +++ G+DR F+A V+ D +T+L FE V G A V
Sbjct: 197 ---RFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVHGKAKGV 253
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MCSYN VNG+P CA+ +L ++ +R GYI +D +I + + + + +A
Sbjct: 254 MCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAI-AGIYHQRHYTKTLCEAGRL 312
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSL 391
+ +G D++ G Y V G++ E +D +++ + LG FD Y +
Sbjct: 313 AILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 372
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
++ + E+ +L+ + +R+ IVLL+N N LPL AK K +AV+GPHA A A++GNY
Sbjct: 373 APNEVNTAESKQLSLDLSRKSIVLLQNHGNILPL--AKGKKLAVIGPHAAAKRALLGNYL 430
Query: 452 GIPCR--------YMSPIAGFS---GYANVTYK--TGCDDVACKSNNSIFAASEAAKTAD 498
G C +P+ + G +N Y +G +D + + AA+ A+T
Sbjct: 431 GQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTAGFDEAEAAARKAET-- 488
Query: 499 ATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
++ G+D S+E E+ DRE++ +P Q QL+ +V K P ++V+ + GGV + E
Sbjct: 489 -VVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFN-GGV-VGAEEL 544
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
+ ++ A YPG G +A++D++FG P G+LP+T Y +YV TS+ ++ +
Sbjct: 545 ILHTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYV-----TSVDMKSMSM 599
Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
YPGR+Y++Y ++PFG+GLSYT+F L S +S +
Sbjct: 600 TKYPGRSYRYYKEVPVFPFGWGLSYTRFTMALDS--------------------SSGVTD 639
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP----PAEIAATYIKQVIGF 734
P V+ L V N G+ G +VV + +P AA +Q+ +
Sbjct: 640 PSEPIVVTRQLDQ----TVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQLFDY 695
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG---GVSFPIHL 788
+RV +R + +++KF +L +VD + N G + + + NG V+F IHL
Sbjct: 696 RRVSLRPTQYRKLKFRIQQ-STLAMVDDSGNQASFPGFYEVIITNGVHERVTFAIHL 751
>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 833
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 259/760 (34%), Positives = 361/760 (47%), Gaps = 121/760 (15%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CDS+ R LV ++T+DEK+ L D + G PRLGLP Y WWSE LHGV+ PG
Sbjct: 37 CDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHGVAG-SPGVV 95
Query: 115 FDDV---IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
F+ ATSF VI A+ ++ L ++G A+STEARA G GL YW+PNIN
Sbjct: 96 FNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFGGLDYWTPNINP 155
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
+DPRWGR ETPGEDP + Y V GL+ N KV + CKH+AAY
Sbjct: 156 YKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEG----------NGTVRKVIATCKHFAAY 205
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC---------------- 275
D++ W+G+ RY FDA V+ QD+ E +L PF+ C ++ S+MC
Sbjct: 206 DLERWRGLTRYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSIMCRYVSFFLPPFPSFPRL 265
Query: 276 ----------------SYNRVNGIPSCADPKLLNQTVRGEWDL---HGYIVADCDSIQ-V 315
SYN +NG P+CA L+ +R W+ + YI +DC++IQ
Sbjct: 266 VTRQSGNQVDIVDNFRSYNALNGTPACASTYLMTNILRDHWNWTNHNNYITSDCNAIQDF 325
Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKY 371
+ DNH F + + +A A AG D C YT+ G A Q + E+ ID +L+
Sbjct: 326 LPDNHNF-SQTPAEAAAAAYIAGTDTVCEVSGWPPYTDVVG-AYNQSLLSESVIDTALRR 383
Query: 372 LYTVLMRLGFFD-GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKV 430
LY L+R G+ D G P S K S + LPL+
Sbjct: 384 LYEGLIRAGYLDHGRPASSSPDKAPFSS---------------------PDFLPLDLTG- 421
Query: 431 KTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFA 489
KTVA++G ANAT + G Y+G+P Y +P+ + Y G + ++ A
Sbjct: 422 KTVALIGHWANATRTIRGPYSGLPPFYHNPMYAVRQLKLSFYYANGPVVNSTDADTWTAA 481
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A AA++AD + G D +V +E LDRE + P Q LI ++A+V K ++VI
Sbjct: 482 AMLAAESADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIEKLAQVGK--PMVVIQLGD 539
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
VD N NI +ILW GYPG+ GG A+ DV+ GK GRLP+T Y YV +PLT
Sbjct: 540 QVDDTPLLNNKNISSILWVGYPGQSGGTAVFDVLTGKKASAGRLPVTQYPAGYVDEVPLT 599
Query: 610 SMPLRPVD--------------------------------SLGYPGRTYKFYNGPTLYPF 637
M LRP + +L PGRTYK+Y P L PF
Sbjct: 600 EMGLRPFNHSSSTTSSDVSQSGVEEGNGLTIQTRSTRGNKTLSSPGRTYKWYPRPVL-PF 658
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
GYGL YT F +S + + N + +++ S + C + L + F
Sbjct: 659 GYGLHYTPFN---ISLSLSTSSNASSTTDNTSISIRSLLTSQTCTAI---HLDLCPFSPF 712
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
V N GS V +++ +K ++G++RV
Sbjct: 713 SVSITNTGSHTSDYVALLFLSGKFGPKPDPLKTLVGYKRV 752
>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 449
Score = 352 bits (902), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 183/457 (40%), Positives = 279/457 (61%), Gaps = 13/457 (2%)
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQD 395
+D++CG Y N+T +AV++ KV E++ID++L L+++ MRLG F+G+P Y +
Sbjct: 1 MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60
Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
+CS E+ +A EAAR+GIVLLKN LPL+ +K ++A++GP+A+ + ++GNYAG PC
Sbjct: 61 VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120
Query: 456 RYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
+ ++P G Y T Y GC VAC S+ +I A + AK AD +++ GLD + E E
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVAC-SSAAIDQAIKIAKEADQVVLVMGLDQTQEREE 179
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
DR DL LPG Q +LI VA AK PV+LV++ G VDI+FA+ + NI ILWAGYPGE
Sbjct: 180 HDRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEA 239
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
GG A+A+++FG NPGGRLP+TWY D+ + +P+T M +RP S GYPGRTY+FY G +
Sbjct: 240 GGIALAEIIFGNHNPGGRLPVTWYPQDFTK-VPMTDMRMRPQPSSGYPGRTYRFYKGKKV 298
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD-D 693
+ FGYGLSY+ + Y L+S T+ +++L + N + KT + + C+
Sbjct: 299 FEFGYGLSYSNYSYELVSVTQN-KISLRSSIDQKAENSSPIGYKTISE---IEEELCERS 354
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
F V +N G G V+++++ + IK++I FQ V + AG N I++ N
Sbjct: 355 KFSVTVRVKNQGEMTGKHPVLLFARQDKPGSGGPIKKLIAFQSVKLNAGENAEIEYKVNP 414
Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
C+ L+ + ++ G + VG+ +PI++
Sbjct: 415 CEHLSRANEDGLMVMEEGSQYLLVGDK--EYPINITI 449
>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 721
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 240/741 (32%), Positives = 371/741 (50%), Gaps = 96/741 (12%)
Query: 62 SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
S K++++RMT+ EK+ QL + + + LG+ Y+WWSE LHGV G
Sbjct: 31 SRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR---------- 80
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVAR 173
AT FP I A+F+E+L ++IG AV+TE RA +N+ + AGLT+WSPN+N+ R
Sbjct: 81 ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPNVNIFR 140
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGR ET GEDP + G YVRGLQ + LK +C KHYA V
Sbjct: 141 DPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFY---------LKTGACAKHYA---V 188
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
+ R+ D + +D+ ET+L F+M V++G +VM +YNRV G P LL
Sbjct: 189 HSGPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLT 248
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
+R W +G+IV+DCD+I H+++ + E+A A +KAGL+++CG + G
Sbjct: 249 DILRKSWGFNGHIVSDCDAINDFYGGHRYV-KTPEEACAAAIKAGLNVECGHTFKAMQG- 306
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFF--DGSPQYVSLGKQDICSDENIELAAEAARE 411
A+ QG + E D+D++L L ++LG D + Y S + +ICS + LA AA E
Sbjct: 307 ALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALALRAADE 366
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGY 467
+VLLKN+ LPL+ ++T+ V GP A+ ++GNY G+ RY + + G S
Sbjct: 367 AMVLLKNN-GILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSG 424
Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES---------LDRE 518
+V ++ + + N+ +A +EA A+ I++ G + ++E E DR
Sbjct: 425 TSVNFRPAFMQITEELNDMNWAVNEAC-AAEVAIVVMGNNGNMEGEEGEAIASASRGDRV 483
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
+ LP Q + +V + KG I+V+++ GG I + + A++ A YPG+EGG A
Sbjct: 484 GIGLPASQLNYLRRV-KARKGGRIVVVLT-GGSPIDLRKISKLADAVVMAWYPGQEGGEA 541
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+ D++FG N GRLPIT+ P L D GRTYK+ +G +YPFG
Sbjct: 542 LGDLLFGDKNFSGRLPITF---------PADVDSLPAFDDYSMNGRTYKYMSGNVMYPFG 592
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
YGLSY + Y +DA +V ++ + +
Sbjct: 593 YGLSYGRVTY-------------------------TDAR-------VVGRIKKGEPLAVE 620
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC-KSL 757
V N G +V Y P + + ++GF+RV + +K VF + L
Sbjct: 621 VVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPP--KSSVKAVFKIVPERL 678
Query: 758 NIVDYAANTLLPAGEHTIFVG 778
+ ++ L G +T+ +G
Sbjct: 679 MTIQSDGSSKLLKGNYTLTIG 699
>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 721
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 241/741 (32%), Positives = 369/741 (49%), Gaps = 96/741 (12%)
Query: 62 SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
S K++++RMT+ EK+ QL + + + LG+ Y+WWSE LHGV G
Sbjct: 31 SRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR---------- 80
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVAR 173
AT FP I A+F+E+L ++IG AV+TE RA +N+ R AGLT+WSPN+N+ R
Sbjct: 81 ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIFR 140
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
D RWGR ET GEDP + G YVRGLQ + LK +C KHYA +
Sbjct: 141 DLRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFY---------LKTGACAKHYAVHSG 191
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
R+ D + +D+ ET+L F+M V++G +VM +YNRV G P LL
Sbjct: 192 PEGT---RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLT 248
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
+R W +G+IV+DCD+I H+++ + E+A A +KAGL+++CG + G
Sbjct: 249 DILRKSWGFNGHIVSDCDAINDFYGGHRYV-KTPEEACAAAIKAGLNVECGHTFKAMQG- 306
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFF--DGSPQYVSLGKQDICSDENIELAAEAARE 411
A+ QG + E D+D++L L ++LG D + Y S + +ICS + LA AA E
Sbjct: 307 ALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALALRAADE 366
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGY 467
+VLLKN+ LPL+ ++T+ V GP A+ ++GNY G+ RY + + G S
Sbjct: 367 AMVLLKNN-GILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSG 424
Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES---------LDRE 518
+V ++ + + N+ +A +EA A+ I++ G + ++E E DR
Sbjct: 425 TSVNFRPAFMQITEELNDMNWAVNEAC-AAEVAIVVMGNNGNMEGEEGEAIASASRGDRV 483
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
+ LP Q + +V + KG I+V+++ GG I E + A++ A YPG+EGG A
Sbjct: 484 GIGLPASQLNYLRRV-KARKGGRIVVVLT-GGSPIDLREISKLADAVVMAWYPGQEGGEA 541
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+ D++FG N GRLPIT+ P L D GRTYK+ +G +YPFG
Sbjct: 542 LGDLLFGDKNFSGRLPITF---------PADVDSLPAFDDYSMNGRTYKYMSGNVMYPFG 592
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
YGLSY + Y +DA +V ++ + +
Sbjct: 593 YGLSYGRVTY-------------------------TDAR-------VVGRIKKGEPLAVE 620
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC-KSL 757
V N G +V Y P + + ++GF+RV + +K VF + L
Sbjct: 621 VVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPP--KSSVKAVFKIVPERL 678
Query: 758 NIVDYAANTLLPAGEHTIFVG 778
V ++ L G +T+ +G
Sbjct: 679 MTVQSDGSSKLLKGNYTLTIG 699
>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 258/763 (33%), Positives = 370/763 (48%), Gaps = 91/763 (11%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRM-TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
SS+ FCD+ L R+KDLVSR+ T D Q + + +GLP Y W + A+HG+ N
Sbjct: 105 SSYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNAIHGMQN 164
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG-RAGLTYWSP 167
D P TSFP +A+FN SL K +G+ + E RA YN GL WSP
Sbjct: 165 TA--CLADGQCP--TSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNGLDTWSP 220
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
IN +RDPRWGR E+PGE PFV G+Y Y GLQ N D + V+ KH
Sbjct: 221 TINPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQ------NGDDKDYTQAVVT--LKH 272
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
+ AY V+++ V RY ++A V+E D+ +T+ +E VK VMCSYN +NG+P+C
Sbjct: 273 WVAYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKPLGVMCSYNSLNGVPTCG 332
Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
+P L +R +W GYI +D DSI + +H + +++ A L G D+D G Y
Sbjct: 333 NPA-LTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVL-ATRDGLLGGCDIDSGDTY 390
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELA 405
+ AV Q V + +D +L Y + LG FD + Y + ++ + E +
Sbjct: 391 ADNLEAAVNQSLVNRSAVDAALTNSYRMRFNLGLFDPNVTNAYDRISADEVGMSSSQETS 450
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC---------- 455
AAR+ + LLKND TLP A K VAV+G +N+ ++GNY G C
Sbjct: 451 LLAARKSMTLLKNDGQTLPF--ATGKKVAVIGKSSNSAEDILGNYVGPICPSGAFDCVQT 508
Query: 456 RYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
Y A G A T DDVA I A + A AD ++L + E
Sbjct: 509 LYQGVAAANQGGAT----TLSDDVA-----DINTAIQLAMDAD-QVVLTISNYGQAGEGK 558
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
DR + L Q +L+ V +V K P +V+++ G + + + + +AIL A PG G
Sbjct: 559 DRTYIGLDTDQQELVAAVLKVGK-PTAIVMLNGGLISLDWIKDEA--QAILVAFAPGVHG 615
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY-----------PGR 624
G+A+A+ +FG NPGG+LP+T Y DYV + +M ++ V L PGR
Sbjct: 616 GQAVAETIFGANNPGGKLPVTMYASDYVNDVDFLNMSMQAVAVLHLMNVNGERDDTGPGR 675
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
+YK+Y G LYPF YGLSYT F NL+++ T
Sbjct: 676 SYKYYTGEPLYPFAYGLSYTTF----------------------NLSWSPAPPMT----T 709
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-------IKQVIGFQRV 737
+ LR + N GS G +VV + KP +E T IK++ GFQRV
Sbjct: 710 FTSTLRS---TTYTATVTNTGSVGGDEVVFAFYKPKSESLKTLPVGNPVPIKEIFGFQRV 766
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
+ G++ ++ F NA ++L V + L +GE I + G
Sbjct: 767 ALGPGQSTQVTFELNA-ETLAQVTLDGHRELHSGEFEIELTRG 808
>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
gi|224035251|gb|ACN36701.1| unknown [Zea mays]
gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 405
Score = 348 bits (893), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 182/408 (44%), Positives = 254/408 (62%), Gaps = 17/408 (4%)
Query: 377 MRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
MRLGFFDG P+ + +LG D+C+ N ELA EAAR+GIVLLKN LPL++ +K++
Sbjct: 1 MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSM 59
Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASE 492
AV+GP+ANA+ MIGNY G PC+Y +P+ G Y+ GC +V C N+ + AA++
Sbjct: 60 AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATK 119
Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
AA +AD T+++ G D S+E ESLDR L LPG Q QL++ VA + GP ILV+MS G D
Sbjct: 120 AAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFD 179
Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMP 612
I+FA+++ I AILW GYPGE GG AIADV+FG NP GRLP+TWY + + +P+T M
Sbjct: 180 ISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTK-VPMTDMR 238
Query: 613 LRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
+RP S GYPGRTY+FY G T+Y FG GLSYT F ++L+S K + + L + C
Sbjct: 239 MRPDPSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLT--- 295
Query: 673 TSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
+CP V C+ F+ + +N G G V ++S PPA + K +
Sbjct: 296 ------EQCPSVEAEGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPA-VHNAPAKHL 348
Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+GF++V + G+ + F + CK L++VD N + G HT+ VG+
Sbjct: 349 LGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 396
>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
Length = 454
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 178/449 (39%), Positives = 272/449 (60%), Gaps = 13/449 (2%)
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SPQYVSLGKQD 395
+D++CG Y +AV +GK++E DID++L L++V +RLG FDG + + LG +D
Sbjct: 1 MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60
Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
+C++E+ +LA EAAR+GIVLLKN++ LPLN V ++A++GP AN ++ G+Y G C
Sbjct: 61 VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120
Query: 456 RYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
S G Y T Y GC +V+C S++ A AKTAD I++AG+DLS E E
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
DR L LPG Q L++ VA +K PVILV+ G VD++FA+ ++ I +ILW GYPGE
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
G +A+AD++FG++NPGGRLP+TWY + +P+ M +R + GYPGRTY+FY G +
Sbjct: 241 GAKALADIIFGEYNPGGRLPMTWYPESFTN-VPMNDMNMRANPNRGYPGRTYRFYTGERV 299
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQV--NLNKLQHCRNLNYTSDASKTRCPGVLVNDL-RC 691
Y FG GLSYT + Y LS + + +L R L+ D R + ++++ C
Sbjct: 300 YGFGEGLSYTNYAYKFLSAPSKLSLSGSLTATSRKRILHQRGD----RLDYIFIDEISSC 355
Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ F ++ NVG DGS VV+++S+ P T KQ++GF+R+ + ++ +
Sbjct: 356 NSLRFTVQISVMNVGDMDGSHVVMLFSRVPQVSEGTPEKQLVGFERINTVSHKSTETSIL 415
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ CK L+I + ++P G H + +G+
Sbjct: 416 LDPCKHLSIANGQGKRIMPVGSHVLLLGD 444
>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
Length = 691
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 223/628 (35%), Positives = 331/628 (52%), Gaps = 74/628 (11%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D SL R + L MT +E+ QL A + RLG+P Y WW+E +HG++ G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I A F++ L K+ + S EARA YN GLT W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ + VRGLQ + + +K ++C
Sbjct: 114 APNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQG----------DGKVMKAAACA 163
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDA+ +DMEET+L FE VKE SVM +YNRVNG P+
Sbjct: 164 KHFA---VHSGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPA 220
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CA L+ + EW+ GY V+DC +I+ ++H A++ E A A LKAG D++CG
Sbjct: 221 CASDYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESA-AMALKAGCDVNCGC 277
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y N A+ +G + + I + +L +RLG FD + + + E+ ++
Sbjct: 278 TYQNLLA-ALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSKVACAEHKAVS 336
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
E A + +VLLKN+ LPL+ K KT+AV+GP+A++ A+ GNY G+ RY + + G
Sbjct: 337 LECAEKSLVLLKNN-GILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQ 395
Query: 464 --FSGYANVTYKTGCDDVACKSNNSIFAASE-------AAKTADATIILAGLDLSVEAE- 513
F G V + GC + KS + + A + AAK AD I+ GLD ++E E
Sbjct: 396 DRFEG--RVIFAEGC-HLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDATIEGEE 452
Query: 514 --------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
S D+ L LP Q L+ ++ V K PV+ V+ + ++ T + A+
Sbjct: 453 GDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAIN-----TESQPDAL 506
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
+ A YPG EGG+A+A+V+FG +P G+LP+T+Y D ++ T ++ GRT
Sbjct: 507 IHAFYPGAEGGKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK--------GRT 557
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
Y++ L+PFGYGL+Y K N + +
Sbjct: 558 YRYTTDNILFPFGYGLTYGGVKVNAVEY 585
>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 745
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 230/658 (34%), Positives = 351/658 (53%), Gaps = 66/658 (10%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL---GDFAHG----VPRLGLP 94
FS + +S F F ++SLP++ RV+DLV R+ L+E V Q+ G +++G + RL +
Sbjct: 15 FSLISTILSDFPFRNTSLPWNKRVEDLVGRLKLEEIVLQMSRGGRYSNGPAPPIDRLNIG 74
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
Y W +E L G + GP ATSFP A+F+ L K+I A + E RA
Sbjct: 75 PYSWNTECLRGDLSAGP----------ATSFPQAFGLAATFDAVLIKQIANATAYEVRAK 124
Query: 155 YNL--------GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
YN GL+ +SP IN+AR P WGRI ET GEDP++ G A ++V GLQ
Sbjct: 125 YNNYTKHKEYGDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASFVTGLQ--- 181
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
G+ + R + ++ CKH+ AY R FDA+V+++D+ TFL F C++
Sbjct: 182 GN------HPRYVTANAGCKHFDAYAGPENIPSSRSTFDAKVSDRDLRMTFLPAFHECIQ 235
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
G S+MCSYN +NG+P+CA+ KLL +R EW+ GY+++D +++ + D H + D
Sbjct: 236 AG-TYSLMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDM 294
Query: 327 KEDAVAQTLKAGLDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
+ A+A + +GL+L+ T+ T AV+QG V + + L+ MRLG F
Sbjct: 295 LDTAIA-CVNSGLNLELSSNLTDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEF 353
Query: 383 DGSPQYVSLGKQD---ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
D P+ K D I S E+ EL+ +AA + VLLKN+ LPL K+ +AVVGP
Sbjct: 354 D-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKLAVVGPF 411
Query: 440 ANATVAMIGNYA-GIPCRYMSPIAGFSGYANV--TYKTGCDDVACKSNNSIFAASEAAKT 496
+ + + G+ + + ++P G S A + T+ +GC AC + + +A
Sbjct: 412 GDNPIEIYGSKSPDVSNLTVTPRYGLSKIARLATTFASGCLSPACTEYDPK-STKQAIDR 470
Query: 497 ADATIILAGLDLSVEAESLDREDLWLPGYQTQLI-NQVAEVAKGPVILVIMSAGGVDIAF 555
D ++ G VE E+ DR +L LPG Q +L+ + V A PVIL++ +AG +DI +
Sbjct: 471 VDMVVVCLGTGNEVENEAHDRSELTLPGQQLRLLQDAVTFAADKPVILLLFNAGPLDITW 530
Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGK--FNPGGRLPITWYNGDYVQMLPLTSMPL 613
A +N I I+ +P + G A+ + NPGGRLPITW P + +
Sbjct: 531 AVSNPAIPVIVECFFPAQTTGTALYHLFVNSPGSNPGGRLPITW---------PKSMSQV 581
Query: 614 RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
P++ GRTY+++NG L+PFGYGLSYT F Y+ L T + + + C ++N
Sbjct: 582 PPMEDYTMEGRTYRYFNGDPLFPFGYGLSYTTFHYSDLLITPSTPI-----KPCSSIN 634
>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
Length = 511
Score = 346 bits (888), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 195/486 (40%), Positives = 277/486 (56%), Gaps = 16/486 (3%)
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
Y+ +DCD++ + D H + S ED VA ++KAG+D++CG Y AVQ+G + E D
Sbjct: 16 YVASDCDAVATIRDAHHYTL-SPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEKD 74
Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID++L L+ V MRLG FDG P+ Y LG D+CS + LA EAA++GIVLLKND
Sbjct: 75 IDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDA 134
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA--NVTYKTGCDD 478
LPL + V ++AV+GP+A+ A+ GNY G PC +P+ G GY + GCD
Sbjct: 135 GALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDS 194
Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
AC + AA+ A+ ++D ++ GL E E LDR L LPG Q LI VA A+
Sbjct: 195 PACAVAATNEAAALAS-SSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAAR 253
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
PVILV+++ G VD+ FA+ N I AIL AGYPG+ GG AIA V+FG NP GRLP+TWY
Sbjct: 254 RPVILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWY 313
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL-SFTKTI 657
++ + +P+T M +R + GYPGR+Y+FY G T+Y FGYGLSY++F + SF+ +
Sbjct: 314 PEEFTK-VPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSN 372
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL---RCDDY-FEFKVDFQNVGSTDGSDVV 713
NL+ L D LV ++ RC F V+ QN G DG V
Sbjct: 373 AGNLSLLAGVMARRAGDDGGGMSS--YLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSV 430
Query: 714 IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
++Y + P + +Q+IGF+ V+ G + F + C+ + V ++ G H
Sbjct: 431 LMYLRWPTKSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAH 490
Query: 774 TIFVGN 779
+ VG+
Sbjct: 491 FLMVGD 496
>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 748
Score = 346 bits (887), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 253/767 (32%), Positives = 370/767 (48%), Gaps = 106/767 (13%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF-------AHGVPRLGLPQYEWWSEALH 104
F F D SLP RVKD+V +++LD+ V+Q+ A G+P+ + Y+W +E L
Sbjct: 27 FPFRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECLS 86
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG------ 158
G N G ATSFP I ASFN L K++ A + E RA
Sbjct: 87 GDVNAG----------DATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTAAVLNGSY 136
Query: 159 --RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
GL+ WSP +N+ RDPRWGR ET GEDP++ G +V GLQ
Sbjct: 137 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQ-----------GD 185
Query: 217 RPLKV--SSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
P V ++ CKH+ + + R FDA VT D TFL F+ CV+ G A S+M
Sbjct: 186 DPTYVIANAGCKHFDVHGGPEDTPLPRASFDANVTMIDWRMTFLPQFKACVEAG-ALSLM 244
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-----SKED 329
CSYNR+NG+P+CA+ KLL +R EW+ GY+V+D +++ +V H + D +
Sbjct: 245 CSYNRINGVPACANKKLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAPDFVTAAADAA 304
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF---DGSP 386
L+ G G + +AV++G V + ++ L+ V +LG F D +
Sbjct: 305 NAGTCLEDGNSEGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDPPDNNN 364
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT---LPLNSAKVKTVAVVGPHANAT 443
Y ++ I SDE+I+L+ +AA E IVL+KND + LPL + K VVGP
Sbjct: 365 PYANIPLSIIQSDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAADDFKKACVVGPFIENA 424
Query: 444 VAMIGNYAG--IPCRYMSPIAGFS----GYANVTYKTGCDD-VACKSNNSIFAASEAAKT 496
M G+Y+ + ++P+AG G + Y+ GC D AC+ + + A +
Sbjct: 425 DTMFGDYSPTMMTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPACEIYDG-YKVRTACEG 483
Query: 497 ADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG--PVILVIMSAGGVDIA 554
D I+ AGL +E E D D++LPG+Q L+ AE A G P+IL++ +A +DI+
Sbjct: 484 VDLVIVTAGLSRYLEHEGHDISDIYLPGHQMSLLTD-AESASGSAPIILLLFNANPLDIS 542
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A++N AIL A YPG+E G AIA+V+ G +NP GRLP TW L +P
Sbjct: 543 YAKSNPRFAAILEAYYPGQEAGVAIANVLTGSYNPAGRLPNTW-------PASLDQVP-- 593
Query: 615 PVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
D + Y RTY+++ LYPFGYGLS+T F Y+ L+ T N
Sbjct: 594 --DMIDYTMKERTYRYFTQEPLYPFGYGLSFTTFNYSDLNVASTANTN------------ 639
Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
+ V N G+ DG +V Y K A I Q++
Sbjct: 640 ------------------GEGSIAVSVTVMNTGTMDGDEVTQAYVKWDNVAEAPNI-QLV 680
Query: 733 GFQRVFVRAGRNKRIKFVFNACK-SLNIVDYAANTLLPAGEHTIFVG 778
G R F+ G++ + F + + I +P G +++FVG
Sbjct: 681 GVSRKFISKGQSITVSFTIKPEQLQVWINGDDGKWSIPGGTYSLFVG 727
>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
queenslandica]
Length = 742
Score = 345 bits (886), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 247/732 (33%), Positives = 369/732 (50%), Gaps = 97/732 (13%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQ-------LGDFAHGVPRLGLPQYEWWSEALH 104
F F ++SL RVKD+V +TL+E V+Q L A G+PRL + Y+W +E L
Sbjct: 24 FPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIPRLHINPYQWGTECLS 83
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG------ 158
G NV G ATSFP I ASFN L K++ A + E RA +
Sbjct: 84 G--NVSAGD--------ATSFPMPIGMAASFNYDLLKRVTNATAYEVRAKHAAAVKDGSY 133
Query: 159 --RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
GL+ WSP +N+ RDPRWGR ET GEDP++ G YV GLQ G+ NS
Sbjct: 134 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGLQ---GN------NS 184
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
R + ++ CKH+ + R+ FDA+V+ +D TFL F+ CV+ G A S+MCS
Sbjct: 185 RYIIANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFLPQFKACVEAG-ALSLMCS 243
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
YNR+NG+P+CA+ LL +R EWD GY+V+D +++ +V H + D + A
Sbjct: 244 YNRINGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVIEHHYAPDFMKAAADAANA 303
Query: 337 AGL--DLDCGQYYTNFTG---NAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYV 389
D + G+ + N +AV+ V + ++ L+ V M+LG FD + Y
Sbjct: 304 GTCLEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLFYVRMKLGEFDPPDNNPYA 363
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQN----TLPLNSAKVKTVAVVGPHANATVA 445
++ I SD +I L+ +AA E IVL+KND LP+ + +VK +VGP ++
Sbjct: 364 NIPLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPITN-EVKKACMVGPFSDDPEV 422
Query: 446 MIGNYAGIPCR--YMSPIAGFS----GYANVTYKTGCDD-VACKSNNSIFAASEAAKTAD 498
+ G+Y+ R ++ +AG G + Y GC+D AC++ +S S A +
Sbjct: 423 LFGDYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGPACRNYDSAKVRS-ACDGVE 481
Query: 499 ATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK-GPVILVIMSAGGVDIAFAE 557
I+ AGL +E+E D D+ LPG+Q L+ +K VIL++ +A +DI +A+
Sbjct: 482 LIIVTAGLSKHLESEGKDLSDINLPGHQLDLMQDAEAASKNASVILILFNASPLDIRYAK 541
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
T+ I IL A YPG+ G+AIA+V+ G++NP GRLP TW P + + +
Sbjct: 542 TDPRIVGILEAYYPGQTAGKAIANVLTGEYNPSGRLPNTW---------PASLDQVPGIT 592
Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
+ RTY+++ LYPFGYGLSYT F Y+ NLN +S A+
Sbjct: 593 NYTMKERTYRYFTQEPLYPFGYGLSYTTFHYS-------------------NLNISSTAT 633
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
+ + V+ L N GS DG++V VY I+ Q++G +
Sbjct: 634 ASGAGMIAVSVL-----------VTNTGSMDGTEVTQVYVW--CNISYAPKLQLVGVNKD 680
Query: 738 FVRAGRNKRIKF 749
F+ G+ + F
Sbjct: 681 FISKGKTLEVSF 692
>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 742
Score = 345 bits (885), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 254/768 (33%), Positives = 367/768 (47%), Gaps = 112/768 (14%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
+ S+ + ++ L V S + A GS++P + ++ D S P R
Sbjct: 3 LRSVALSTAAVLLSVASCVSASAQGSNAPASGGE--------------VYRDMSRPIEDR 48
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
+ DL+ R TL EK QL GVPRLGLP + W++ LHGV + P T
Sbjct: 49 ITDLIKRFTLQEKAMQLNHTNRGVPRLGLPMWGGWNQTLHGVWSKQP----------TTL 98
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG------LTYWSPNINVARDPRWG 178
FP A+++ L + A+S EARA+YN G L Y SP IN++RDPRWG
Sbjct: 99 FPIPTAMGATWDPELVHTVADAMSDEARALYNAHAEGPRTPHGLVYRSPVINISRDPRWG 158
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
RI E EDP + GR V YVRGLQ DL LK+++ KH+A +V++
Sbjct: 159 RIQEVFSEDPLLTGRMGVAYVRGLQ-------GDDLQH--LKLAATVKHFAVNNVES--- 206
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
R H +A V E+++ E +L + + E A SVM SYN +NG+P + LL +R
Sbjct: 207 -GRQHLNADVDERNLFEFWLPHWRAAIMEAHAQSVMSSYNAINGMPDAVNHWLLTDVLRK 265
Query: 299 EWDLHGYIVADCDSIQVMVDNH--------KFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
+W G++ D ++ ++ + ++ A A ++AG D D ++ TN
Sbjct: 266 KWGFDGFVTDDLGAVALLSGTRATNTSEPGQHFSEDPVVAAAAAIRAGNDSDDVEFETNL 325
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAE 407
AVQ+G + E D+D +L+ + V RLG +D PQ Y +G + S + +L+
Sbjct: 326 P-LAVQRGLLTEKDVDGALRNVLRVGFRLGAYD-PPQASKYSRIGMDVVRSQAHRDLSQR 383
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY 467
A E + LL N + LPL +VK+VAV+GP A GNY G P S G
Sbjct: 384 VAEESMTLLLNRRQFLPLQRDQVKSVAVIGP-AGGEAYETGNYYGTPAVKTSVTEGLRAL 442
Query: 468 ----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
V Y+ G V + I A+ A+ +D ++ G +L VEAE DR DL LP
Sbjct: 443 LGSGVKVEYEKGAGYVDLADDKEIERAANLARKSDVVVLCLGTNLQVEAEGRDRRDLNLP 502
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q +L+ V A V LV+M+AG + + +A + ++ AIL A YPGE GG AIA +
Sbjct: 503 GAQQRLLEAVY-AANPKVALVLMNAGPLGVTWA--HDHVPAILSAWYPGELGGAAIARTL 559
Query: 584 FGKFNPGGRLPITWY-NGDYVQMLPLTSMPLRPVD-SLGYPGRTYKFYNGPTLYPFGYGL 641
FG NPGG LP T Y N D V P D S GY TY+++ G LYPFG+GL
Sbjct: 560 FGLNNPGGHLPYTVYANLDGVP-------PQNEYDVSRGY---TYQYFKGVPLYPFGHGL 609
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
SYT F Y+ L T+T D+ V F
Sbjct: 610 SYTHFDYSKLKVTQT----------------------------------SGDHANVTVSF 635
Query: 702 --QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
N G + G++V +YS ++ + GF+RV ++ G +K +
Sbjct: 636 TLTNTGQSAGAEVTQLYSHQVKSSEVQPLRTLRGFERVTLQPGESKAV 683
>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
Length = 690
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 237/752 (31%), Positives = 363/752 (48%), Gaps = 113/752 (15%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D +L R +D+ R++ +EK +Q A RLG Y WWSE LHGV+ G
Sbjct: 6 YLDEALSDLERAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAGT-- 63
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I A F++ + G+ S EARA YN A GLT W
Sbjct: 64 --------ATMFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIYKGLTLW 115
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPN+N+ RDPRWGR ET GEDP++ V Y +GLQ + + L+ ++C
Sbjct: 116 SPNVNIFRDPRWGRGQETYGEDPYLTSCLGVAYAKGLQG----------DGKVLRTAACA 165
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDA+ +DM ET++ FE VK+ SVM +YNRVNG P+
Sbjct: 166 KHFA---VHSGPEATRHEFDAKANMKDMTETYIAAFEALVKDAKVESVMGAYNRVNGEPA 222
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CA ++N+ EW G+ V+DC +I+ NH + E A A LK G DL+CG
Sbjct: 223 CASDFVMNKLE--EWGFDGHFVSDCWAIRDFHTNHGVTKTAPESA-ALALKKGCDLNCGN 279
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y + A +G + E D+ +S L +RLG FD S +Y L + DE+ E +
Sbjct: 280 TYLHLLA-AFNEGLINEEDLRRSCIKLMRTRVRLGMFDKSTEYDGLDYDIVACDEHKEFS 338
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
+ +VLLKN+ LPL+ +K KT+ V+GP+A++ A+ GNY G Y++ ++G
Sbjct: 339 LRCSERSMVLLKNN-GILPLDGSKYKTIGVIGPNADSVPALEGNYNGKADEYITFLSGIR 397
Query: 466 ---------GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE--- 513
+ YK C +A ++ + A +T + L LD ++E E
Sbjct: 398 EAHDGRVLYTEGSHLYKDRCMGLAL-PDDRLSEAEIITRTLRCSGSLCWLDATIEGEEGD 456
Query: 514 ------SLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTNIKAIL 566
S D+ DL LP Q +L+ V +AKG PVI+V + +++ + A++
Sbjct: 457 TGNEFSSGDKNDLRLPESQRKLVKTV--MAKGKPVIIVTAAGSAINV-----EADCDALI 509
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
A YPG+ GGRA+A+++FGK +P G+LP+T+Y D ++ + ++ RTY
Sbjct: 510 QAWYPGQLGGRALANILFGKVSPSGKLPVTFYE-DASKLPDFSDYSMK--------NRTY 560
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
++ G L+PFGYGL+Y++ + + LSF +
Sbjct: 561 RYSEGNILFPFGYGLTYSETECSELSFENGVAT--------------------------- 593
Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
V N GS DVV +Y K +E A + GF+RV + AG ++
Sbjct: 594 ------------VKVTNTGSRFTEDVVQIYIKGYSENAVPN-HSLCGFKRVALDAGESRI 640
Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++ ++ V+ + E T++ G
Sbjct: 641 VQITLPE-RAFMAVNEKGEWIKEGSEFTLYAG 671
>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
albus 8]
Length = 691
Score = 344 bits (883), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 222/628 (35%), Positives = 330/628 (52%), Gaps = 74/628 (11%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D SL R + L MT +E+ QL A + RLG+P Y WW+E +HG++ G
Sbjct: 4 YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I A F++ L K+ + S EARA YN GLT W
Sbjct: 62 --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ + VRGLQ + + +K ++C
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQG----------DGKVMKAAACA 163
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FDA+ +DMEET+L FE VKE SVM +YNRVNG P+
Sbjct: 164 KHFA---VHSGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPA 220
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
CA L+ + EW+ GY V+DC +I+ ++H A++ E A A LKAG D++CG
Sbjct: 221 CASDYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESA-AMALKAGCDVNCGC 277
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
Y N A+ +G + + I + +L +RLG FD + + + E+ ++
Sbjct: 278 TYQNLLA-ALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSKVACAEHKAVS 336
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
E A + +VLLKN+ LPL+ K KT+AV+GP+A++ A+ GNY G+ RY + + G
Sbjct: 337 LECAEKSLVLLKNN-GILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQ 395
Query: 464 --FSGYANVTYKTGCDDVACKSNNSIFAASE-------AAKTADATIILAGLDLSVEAE- 513
F G V + GC + KS + + A + AAK AD I+ GLD ++E E
Sbjct: 396 DRFEG--RVIFAEGC-HLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDATIEGEE 452
Query: 514 --------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
S D+ L LP Q L+ ++ V K PV+ V+ + ++ T + A+
Sbjct: 453 GDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAIN-----TESQPDAL 506
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
+ A YPG EG +A+A+V+FG +P G+LP+T+Y D ++ T ++ GRT
Sbjct: 507 IHAFYPGAEGSKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK--------GRT 557
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
Y++ L+PFGYGL+Y K N + +
Sbjct: 558 YRYTTDNILFPFGYGLTYGGVKVNAVEY 585
>gi|359473580|ref|XP_003631325.1| PREDICTED: protein BRASSINOSTEROID INSENSITIVE 1-like [Vitis
vinifera]
Length = 785
Score = 344 bits (883), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 166/285 (58%), Positives = 212/285 (74%), Gaps = 2/285 (0%)
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
YIV+DC ++V+VDN +L +SK DAVA+TL+AGLDL+CG YYT+ +V GKV + +
Sbjct: 10 YIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYE 69
Query: 365 IDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
+D++LK +Y +LMR+G+FDG P Y SLG +DIC+ ++IELA EAAR+GIVLLKND LP
Sbjct: 70 LDRALKNIYVLLMRVGYFDGIPAYESLGLKDICAADHIELAREAARQGIVLLKNDYEVLP 129
Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSN 484
L K + +VGPHANAT MIGNYAG+P +Y+SP+ FS NVTY TGC D +C ++
Sbjct: 130 LKPGK--KLVLVGPHANATEVMIGNYAGLPYKYVSPLEAFSAIGNVTYATGCLDASCSND 187
Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
A EAAK A+ TII G DLS+EAE +DR D LPG QT+LI QVAEV+ GPVILV
Sbjct: 188 TYFSEAKEAAKFAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILV 247
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
++S +DI FA+ N I AILW G+PGE+GG AIADVVFGK+NP
Sbjct: 248 VLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNP 292
>gi|147826476|emb|CAN72807.1| hypothetical protein VITISV_033721 [Vitis vinifera]
Length = 236
Score = 343 bits (881), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 157/216 (72%), Positives = 177/216 (81%)
Query: 34 VFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGL 93
+VCD R++ LGL M SF FCD SL Y R KDLVSRMTL EKV Q A GV RLGL
Sbjct: 16 TYVCDESRYALLGLDMKSFAFCDKSLSYEERAKDLVSRMTLQEKVMQSVHTASGVRRLGL 75
Query: 94 PQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
P+Y WWSEALHG+SN+GPG FD+ IPGATSFPTVIL+TA+FN++LWK +G+ VSTE RA
Sbjct: 76 PEYSWWSEALHGISNLGPGVFFDETIPGATSFPTVILSTAAFNQTLWKTLGRVVSTEGRA 135
Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
MYNLG AGLT+WSPNINV RD RWGR ET GEDPF+VG +AVNYVRGLQDVEG EN TD
Sbjct: 136 MYNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTD 195
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
LNSRPLKVSSCCKHYAAYD+D+W VDR+ FDARV+
Sbjct: 196 LNSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVS 231
>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
Length = 273
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 163/261 (62%), Positives = 191/261 (73%), Gaps = 14/261 (5%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F CDP + +F FC LP RV DL+ RMTL EKV L + A VPRLG+
Sbjct: 27 FACDPEDGTS-----RNFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIK 81
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
YEWWSEALHGVSNVGPGT F PGATSFP VI T ASFN +LW+ IG+ VS EARAM
Sbjct: 82 GYEWWSEALHGVSNVGPGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAM 141
Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
+N G AGLTYWSPN+N+ RDPRWGR ETPGEDP V G+YA +YVRGLQ +G
Sbjct: 142 FNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNDGDR----- 196
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
LKV++CCKH+ AYD+DNW GVDR+HF+A+V++QDME+TF PF MCVKEG +SVM
Sbjct: 197 ----LKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPFRMCVKEGKVASVM 252
Query: 275 CSYNRVNGIPSCADPKLLNQT 295
CSYN+VNGIP+CADPKLL +T
Sbjct: 253 CSYNQVNGIPTCADPKLLKKT 273
>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
Length = 696
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 197/493 (39%), Positives = 292/493 (59%), Gaps = 30/493 (6%)
Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
DCD++ ++ D + A S EDAVA LKAG+D++CG Y T +A+QQ KV ETDID++
Sbjct: 221 DCDAVSIIYDAQGY-AKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRA 279
Query: 369 LKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
L L++V +RLG F+G P Y ++ ++CS + LA +AAR GIVLLKN+ LP
Sbjct: 280 LLNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPF 339
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSN 484
+ V ++AV+GP+A+ ++GNYAG PC+ ++P+ Y N Y GCD VAC SN
Sbjct: 340 SKRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVAC-SN 398
Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
+I A AK AD +++ GLD + E E DR DL LPG Q +LI VA AK PV+LV
Sbjct: 399 AAIDQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLV 458
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
++ G VDI+FA N I +I+WAGYPGE GG AI++++FG NPGGRLP+TWY +V
Sbjct: 459 LICGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVN 518
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNK 663
+ +T M +R + GYPGRTYKFY GP +Y FG+GLSY+ + Y + +T + +N +K
Sbjct: 519 -IQMTDMRMR--SATGYPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQSK 575
Query: 664 LQ-HCRNLNYT--SDASKTRCPGVLVNDLRCDDYFEFK--VDFQNVGSTDGSDVVIVYSK 718
Q + ++ YT S+ K C D + K V+ +N G G V+++++
Sbjct: 576 AQTNSDSVRYTLVSEMGKEGC-----------DVAKTKVTVEVENQGEMAGKHPVLMFAR 624
Query: 719 PP--AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
E KQ++GF+ + + G ++F C+ L+ + +L G++ +
Sbjct: 625 HERGGEDGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLT 684
Query: 777 VGNGGVSFPIHLN 789
VG+ P+ +N
Sbjct: 685 VGDS--ELPLIVN 695
Score = 233 bits (593), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 113/215 (52%), Positives = 143/215 (66%), Gaps = 14/215 (6%)
Query: 30 SSSPVFVCDPGR-FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
S+ P CDP +KL + FC + LP R +DLVSR+T+DEK+ QL + A G+
Sbjct: 19 SAPPPHSCDPSNPTTKL------YQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGI 72
Query: 89 PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
PRLG+P YEWWSEALHGV+ GPG F+ + ATSFP VILT ASF+ W +I Q +
Sbjct: 73 PRLGVPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIG 132
Query: 149 TEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DV 205
EAR +YN G+A G+T+W+PNIN+ RDPRWGR ETPGEDP + G YAV YVRGLQ
Sbjct: 133 KEARGVYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSF 192
Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
+G + S L+ S+CCKH+ AYD+D WK D
Sbjct: 193 DGRKTL----SNHLQASACCKHFTAYDLDRWKDCD 223
>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 722
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 249/766 (32%), Positives = 378/766 (49%), Gaps = 106/766 (13%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSI--RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWW 99
F + L +F F S + + K ++S++TLDEK+ QL A G+ RLG+ Y W
Sbjct: 10 FISVALVSVTFTFAQSKKEKEMIQKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWL 69
Query: 100 SEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR 159
+EALHGV G AT FP I A+F+ + ++IG A++TE RA + + +
Sbjct: 70 NEALHGVGRDGR----------ATVFPQPISLGATFDPEIVQQIGDAIATEGRAKFIVAQ 119
Query: 160 --------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
AGLT+W+PN+N+ RDPRWGR ET GEDPF+ G +V+G+Q
Sbjct: 120 RQKNYSMYAGLTFWAPNVNIFRDPRWGRGMETYGEDPFLTGVLGTAFVKGMQ-------- 171
Query: 212 TDLNSRP--LKVSSCCKHYAAYDVDNWKGVDRYHFDARV--TEQDMEETFLRPFEMCVKE 267
+ P LK ++C KH+A + G +R A V T+ D+ ET+L F+M V++
Sbjct: 172 ---GNDPFYLKAAACGKHFAVHS-----GPERTRHTANVEPTKHDLYETYLPAFKMLVQQ 223
Query: 268 GDASSVMCSYNRVNGIPSCADPK-LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
G S+M +Y R+ G SC+ K LL +R +W G++V+DC ++ M + HK L S
Sbjct: 224 GKVESIMGAYQRLYG-ESCSGSKYLLTDILRKDWGFKGHVVSDCGAVTDMYEGHK-LVKS 281
Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF--DG 384
+ +AVA +KAGL+L+CG +A++Q + E D+DK+L L ++LG D
Sbjct: 282 EAEAVAFAIKAGLNLECGNSMRTMK-DALKQKLITEKDLDKALLPLMMTRLKLGILQPDV 340
Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
+ Y + I S +N +A AA E +VLLKND LP+ + ++T+ V GP A
Sbjct: 341 ACPYNEFPESVIGSIDNRNIAQRAAEESMVLLKND-GVLPI-AKDIRTLFVTGPGATDAY 398
Query: 445 AMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
++GNY G+ RY + + G G +V YK G V N+ ++ SE ++ A+ +
Sbjct: 399 YLMGNYFGLSDRYSTYLEGIVGKVSNGTSVNYKQGFMQVFKNLNDVNWSVSE-SRGAEVS 457
Query: 501 IILAGLDLSVE---------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
II+ G + E +E DR DL LP Q Q + +V++ +++V+ GG
Sbjct: 458 IIIMGNSGNTEGEEGDAIASSERGDRVDLRLPEPQMQYLREVSKDRTNKLVVVL--TGGS 515
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
I E A++ A YPG+EGG A+A+++FG N GRLP+T+ P T+
Sbjct: 516 PIDVKEITELADAVVMAWYPGQEGGVALANLLFGDANFSGRLPVTF---------PETTD 566
Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
L D GRTYK+ LYPFGYGLSY + Y + TK
Sbjct: 567 KLPSFDDYSMKGRTYKYMTDNILYPFGYGLSYGKVAYGNATVTKLP-------------- 612
Query: 672 YTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
T +S T VD N G+ +VV VY P+ + I+ +
Sbjct: 613 -TKHSSMT-----------------VSVDLSNDGNMPVDEVVQVYLSTPSAGVTSPIESL 654
Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
+ F+RV + F + L V + L GE+ + +
Sbjct: 655 VAFKRVKIAPHATVTTDFEI-PVERLETVQEDGTSKLLKGEYRVMI 699
>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
Length = 748
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 241/732 (32%), Positives = 367/732 (50%), Gaps = 95/732 (12%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--------GDFAHGVPRLGL 93
F+ L S+F F + SL +S RV DLV R+TLD+ VQQL G A + LG+
Sbjct: 14 FALTPLASSNFPFQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENLGI 73
Query: 94 PQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
Y+W +E L G G ATSFP I A+F++ L + +A +TE RA
Sbjct: 74 GPYQWNTECLRGDVEAG----------NATSFPQAIGLAAAFSKDLIFNVSKAAATEVRA 123
Query: 154 MYN--------LGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
+N GL+ +SP +N+ R P WGR ET GEDP++ G YA +V+GLQ
Sbjct: 124 KHNDFVKRGIFTDHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQG- 182
Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
H+ R ++ ++ CKH+ A+ R FDA+V+ +D+ TFL F+ CV
Sbjct: 183 -DHD-------RYIQANAGCKHFDAHGGPEDIPESRMGFDAKVSMRDLRLTFLPAFQKCV 234
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
+ G A S+MCSYN +NG+P+C++ L+ +RGEW+ GY+V+D +I+ + H + +
Sbjct: 235 QAG-AYSLMCSYNSINGVPACSNKLLMMDILRGEWNFTGYVVSDEGAIENQISFHHYYNN 293
Query: 326 SKEDAVAQTLKAGLDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
S EDA A ++ AG +L+ T G+AV+ GK++E+ + +K L+ MRLG
Sbjct: 294 S-EDAAAGSVNAGCNLELSGNLTEPVFMKIGDAVKSGKLEESVVRNRVKPLFYTRMRLGE 352
Query: 382 FDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP---LNSAKVKTVAV 435
FD P+ Y S+ I S+E+ L+ AA + +VLLK + + +AV
Sbjct: 353 FD-PPEMNPYSSVNLSVIQSEEHRNLSLTAAAKSLVLLKRPSKFSKRHLIGGFPSERMAV 411
Query: 436 VGPHANATVAMIGNYAGI--PCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASE 492
+GP AN T + G+Y+ P +P+ G + ++ Y GC D N S
Sbjct: 412 IGPMANNTDQIFGDYSPTTDPRFVKTPLKGLTELNFSMNYAAGCVDGTRCLNYSQDDVKT 471
Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
A AD ++ G +E+E++DR+D+ LPG Q QL+ V + V L++ SAG V+
Sbjct: 472 ALVGADLVVVCLGTGKDLESENVDRKDMMLPGKQLQLLQDVVSMTNKAVYLLVFSAGPVN 531
Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVF---GKFNPGGRLPITWYNGDYVQMLPLT 609
I +A+ + + IL YP + G AI + G+FNP GRLP TWY Y + +P
Sbjct: 532 ITWAQESERVLIILQCFYPAQSAGDAITQALIMRDGRFNPAGRLPYTWYR--YTEQIP-- 587
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+ +TY+++ G LYPFGYGLSY+ F ++ L F +
Sbjct: 588 -----EMTDYSMARKTYRYFTGVPLYPFGYGLSYSTFVFSKLYFLPKVNAG--------- 633
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
P V+ +V N G DG +V+ VY K +
Sbjct: 634 -----------DPNVV------------QVRVFNEGPFDGDEVLQVYIKWMSTKERMPRV 670
Query: 730 QVIGFQRVFVRA 741
Q++ F+RVF+R+
Sbjct: 671 QLVAFERVFIRS 682
>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
F0055]
Length = 722
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 235/702 (33%), Positives = 360/702 (51%), Gaps = 99/702 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
+ K ++S++TLDEK+ QL A G+ RLG+ Y W +EALHGV G AT
Sbjct: 34 KAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------AT 83
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVARDP 175
FP I A+F+ + +IG A++TE RA + + + AGLT+W+PN+N+ RDP
Sbjct: 84 VFPQPINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRDP 143
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDPF+ G +V+G+Q + LK ++C KH+A +
Sbjct: 144 RWGRGMETYGEDPFLTGTLGTAFVKGMQGDDPFY---------LKAAACGKHFAVHS--- 191
Query: 236 WKGVDRYHFDARV--TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK-LL 292
G +R A V T++D+ ET+L F+M V++G S+M +Y R+ G SC+ K LL
Sbjct: 192 --GPERTRHTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAYQRLYG-ESCSGSKYLL 248
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
+R +W G++V+DC ++ M + HK L S+ +AVA +KAGL+L+CG
Sbjct: 249 TDILRKDWGFKGHVVSDCGAVTDMYEGHK-LVKSEAEAVAFAIKAGLNLECGNSMRTMK- 306
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFF--DGSPQYVSLGKQDICSDENIELAAEAAR 410
+A+QQ + E D+DK+L L ++LG D + Y + I S+ N ++A +AA
Sbjct: 307 DAIQQKLITEKDLDKALLPLMMTRLKLGILQPDAACPYNEFPESVIGSEANRKIAEQAAE 366
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY--- 467
E +VLLKN+ LP+ + ++T+ V GP A ++GNY G+ RY + + G G
Sbjct: 367 ESMVLLKNN-GVLPI-AKDIRTLFVTGPGATDAYYLMGNYFGLSNRYSTYLEGIVGKVSN 424
Query: 468 -ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE---------AESLDR 517
+V YK G V N+ ++ SE ++ A+ +I++ G + E AE DR
Sbjct: 425 GTSVNYKQGFMQVFKNLNDVNWSVSE-SRGAEVSILIMGNSGNTEGEEGDAIASAERGDR 483
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
+L LP Q + + +V++ +++V+ GG I E A++ A YPG+EGG
Sbjct: 484 VNLRLPDSQMEYLREVSKDRTNKLVVVL--TGGSPIDVKEITELADAVVMAWYPGQEGGV 541
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
A+A+++FG N GRLP+T+ P ++ L D GRTYK+ LYPF
Sbjct: 542 ALANLLFGDANFSGRLPVTF---------PESADRLPAFDDYSMKGRTYKYMTDNILYPF 592
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
GYGLSY++ Y S+A+ T+ P
Sbjct: 593 GYGLSYSKVTY-------------------------SNAAVTKMPTKTTP-------MTV 620
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
VD N G +VV VY P + I+ +IGF+RV +
Sbjct: 621 YVDVTNNGDMPVDEVVQVYLSTPGAGNTSPIESLIGFKRVKI 662
>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
Length = 674
Score = 338 bits (867), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 221/687 (32%), Positives = 350/687 (50%), Gaps = 99/687 (14%)
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
+ +P+Y +W+EALHGV+ G AT FP I A+F++ L +I + TE
Sbjct: 1 MNIPEYNYWNEALHGVARAGV----------ATVFPQAIGLAATFDDHLINEIADVIGTE 50
Query: 151 ARAMYNLGRA--------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGL 202
RA YN GLT+WSPN+N+ RDPRWGR ET GEDPF+ ++ V +++GL
Sbjct: 51 GRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGL 110
Query: 203 QDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
Q ++ LK+++ KH+A + +G+ R+ FDA V+++D+ ET+L F+
Sbjct: 111 QG----------QAKYLKLAATAKHFAVHS--GPEGL-RHGFDAVVSDKDLYETYLPAFK 157
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
V+E D S+M +YN V+G+P+ LL + +W G++V+D + + + +NHK+
Sbjct: 158 AAVEEADVESIMTAYNAVDGVPASVSEMLLKDILHDKWSFEGHVVSDYMAPEDVHENHKY 217
Query: 323 LADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
D+ E + +KAGL+L G + A+ +G V E +I ++ LY +RLG F
Sbjct: 218 TKDAAE-TMGLAIKAGLNLVAGHIEQSLH-EALDRGLVTEEEITNAVISLYATRVRLGMF 275
Query: 383 DGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
+Y ++ + + + L+ AA + VLLKND LPL ++ +AVVGP+A++
Sbjct: 276 ATDNEYDAIPYEANDTKAHNNLSEIAAEKSFVLLKND-GVLPLRKETMEAIAVVGPNAHS 334
Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGC----DDVA---CKSNNSIFAAS 491
+A++GNY G P R + + G V Y G D A K++ A
Sbjct: 335 EIALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGLFQDHAAEPLAKADERESEAV 394
Query: 492 EAAKTADATIILAGLDLSVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVI 542
AA+ +D + + GLD ++E E + D+ +L LPG Q QL+ ++ V K PV+
Sbjct: 395 IAAEHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERLLAVGK-PVV 453
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
+++ S + + E + N++AI+ YPG GG A+ADV+FG +P G+LP+T+Y
Sbjct: 454 VLLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVLFGAVSPSGKLPVTFYKN-- 511
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
V LP + GRTY++ LYPFGYGL+Y+ V L+
Sbjct: 512 VDNLP-------AFEDYNMAGRTYRYMTDEALYPFGYGLTYS-------------SVELS 551
Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
LQ ++ T+ + T QN G+ D +VV VY K
Sbjct: 552 DLQ-VKSYEDTATVTAT---------------------IQNTGNFDTDEVVQVYVKDLGS 589
Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKF 749
A Q+ GF+RV++ G + I F
Sbjct: 590 EFAVPNAQLKGFKRVYLGKGAKQTITF 616
>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 323
Score = 338 bits (866), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 161/305 (52%), Positives = 209/305 (68%), Gaps = 14/305 (4%)
Query: 33 PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
P F C P FCD +L + R DLVSR+T EK+ QLGD A GVPRLG
Sbjct: 29 PPFSCGPSSAEA----SEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLG 84
Query: 93 LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
+P Y+WW+EALHG++ G G HFD + ATSFP V+LT A+F++ LW +IGQA+ EAR
Sbjct: 85 VPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREAR 144
Query: 153 AMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
A++N+G+A GLT WSPN+N+ RDPRWGR ETPGEDP V RYAV +VRG+Q
Sbjct: 145 ALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG------- 197
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
+ +S L+ S+CCKH AYD+++W GV RY F ARVTEQD+E+TF PF CV E AS
Sbjct: 198 -NSSSSLLQTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKAS 256
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
VMC+Y +NG+P+CA+ LL TVRG+W L GY+ +DCD++ +M D ++ A + EDAV
Sbjct: 257 CVMCAYTAINGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRY-APTPEDAV 315
Query: 332 AQTLK 336
A +LK
Sbjct: 316 AVSLK 320
>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
Length = 859
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 241/751 (32%), Positives = 371/751 (49%), Gaps = 94/751 (12%)
Query: 52 FLFCDSSLPYSI---------RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEA 102
FLF SS+ + RV DL++ MTL+EK+ G + RLG+P +EW+ EA
Sbjct: 14 FLFSFSSIAQTWKNPNASIEDRVNDLLANMTLEEKISYCGSRIPEIKRLGIPYFEWYGEA 73
Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGL 162
LHG+ + T FP I A++N L + A+S EARA+ N G+ +
Sbjct: 74 LHGI-----------ISWNCTQFPQNIAMGATWNPDLMFDVATAISNEARALKNAGKKEV 122
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
+SP +N+ARDPRWGR E EDP ++ A YVRG+Q G++ + +K
Sbjct: 123 MMFSPTVNMARDPRWGRNGECYAEDPHLMSEMARMYVRGMQ---GND------PKYVKTV 173
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+ KHY A +V+ R + + ++D+ E + ++ C+ + +A+ +M + N +NG
Sbjct: 174 TTVKHYVANNVE----TKREWIHSNIGKKDLYEYYFPAYKTCIVDEEATGIMTALNGLNG 229
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
IP A L+N +R EW GY++AD ++Q + K+ + S+ A A +KAG+D +
Sbjct: 230 IPCSAHDWLVNGVLRNEWGFKGYVIADWAAVQGLEKRMKYAS-SQAQAAAMAIKAGVDQE 288
Query: 343 CGQY------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQ 394
C + +A+QQG + E ++D ++K L + G FD Y ++
Sbjct: 289 CFRNKVRQAPMVQALPDALQQGLITEKELDVTVKRLLRLRFMTGDFDDPSLNPYSAIPTS 348
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
+ D + +LA +AA + IVLLKND LPL +K++A++GP A+ +G Y+G P
Sbjct: 349 VLECDAHKQLALKAAEQSIVLLKNDA-VLPLKK-DLKSIAMIGPFADR--CWMGIYSGHP 404
Query: 455 CRYMSPIAGFSGYAN--VTYKTGCDDVACKSNNSIFAASEA-AKTADATIILAGLDLSVE 511
+SP+ G Y N V++ GC+ A + + A + A AK ++ I++ G D +
Sbjct: 405 KSKVSPLDGIKAYTNAKVSFAQGCEVTAKEDDEQKIAEAVALAKKSEQVILVVGNDETTS 464
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ DR+ + LPG Q QLI V V K VILV++ +G + + + NI I+ A
Sbjct: 465 TENTDRKSIKLPGNQHQLIKAVQAVNKN-VILVLVPSGPTAVTWEQ--KNIPGIVCAWPN 521
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+E G A+A V+FG NPGG+L TWY D +P + RTY ++ G
Sbjct: 522 GQEQGTALAKVLFGDVNPGGKLNATWYQSD-------KDLPNFHDYKMAGGNRTYMYFKG 574
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
LYPFGYGLSYT F + +S K L+
Sbjct: 575 KPLYPFGYGLSYTNFTISDVSINK-------------------------------KTLQA 603
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNK--RIKF 749
++Y K N G+ G +VV VY + T +K + GFQR+ V AG +K IK
Sbjct: 604 NEYVTVKAKVNNTGAVAGDEVVQVYIRDVKSKEKTPLKALKGFQRISVAAGASKWVEIKI 663
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
+ A N A ++ GE I VGN
Sbjct: 664 PYEAFSHYNTKKEA--LMVAKGEFEILVGNA 692
>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
Length = 757
Score = 337 bits (864), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 256/764 (33%), Positives = 365/764 (47%), Gaps = 103/764 (13%)
Query: 47 LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL-----GDFAHGVPRLGLPQYEWWSE 101
+Q F F D SL + R DLV+R+TL+E Q G + RLG+ Y W +E
Sbjct: 15 VQSYDFPFQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITE 74
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA- 160
L G N AT++P I ASF+E L + + +S E RA +N RA
Sbjct: 75 CLAGQVNTN-----------ATAYPQPIGMAASFSEELLFNVSRDISYEVRAHWNANRAV 123
Query: 161 -------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
GL+ +SP IN+ R P WGR ET GEDP + G A ++VRGLQ +
Sbjct: 124 GKYSTKVGLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQGDD------- 176
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
R L+ ++ CKH+ + V R+ FDA+V +D TFL F+MCV G + S+
Sbjct: 177 --PRYLRANAGCKHFDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQFKMCVDAG-SYSL 233
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
MCSYNR+NGIP+CA+ +LL R EW HGYIV+D +I + + H + +S V
Sbjct: 234 MCSYNRINGIPACANKQLLTDITRDEWGFHGYIVSDSGAISNIKEQHHY-TNSTVATVVA 292
Query: 334 TLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
+KAG +L+ G YY +A++QG + E +I +++ L +RLG FD
Sbjct: 293 AIKAGTNLELGGGSNMYYPKQL-DAMKQGLLTEKEIRDNVRPLLYTRLRLGEFDPEAMVD 351
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y +G I S E+ E A +AA G VLLKN N LP+ K +A+VGP NAT +
Sbjct: 352 YNKIGVDVIQSPEHREQAVKAAYMGFVLLKNHNNLLPIKKQYSK-LAIVGPFTNATSELF 410
Query: 448 GNYAG-IPCRY-------MSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
G Y+ + ++ +SP+ G + AN GC + AC S A AD
Sbjct: 411 GTYSSEVNLKFTSTIFEGLSPLGGSTRSAN-----GCTNSAC-SGYVRDDVETAVAGADL 464
Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAET 558
I+ G E+E DR L L G+Q ++ + G PVILV+++AG +DI +A+
Sbjct: 465 VIVALGSGQRFESEGNDRAYLDLHGHQLDILKDAVFFSNGAPVILVLINAGPLDITWAKL 524
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVF---GKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
+ + AIL GYP + G A+ + + P GRL TW PL +
Sbjct: 525 DPGVTAILSCGYPAQSTGEALRRSLTMSEPQAAPAGRLQATW---------PLNLDQVPK 575
Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
+ GRTY++Y G LYPFG+GLSYT F Y LS +
Sbjct: 576 ITDYTMQGRTYRYYVGEPLYPFGFGLSYTSFSYTRLSIS--------------------- 614
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
P V+ D +V +N GS D +VV VY P + F
Sbjct: 615 ------PSVITQ----GDNVTVEVCLKNTGSYDSDEVVQVYMSWPQTPFPLPKWTLAAFA 664
Query: 736 RVFVRAGRNKRIKFVFNACK-SLNIVDYAANTLLPAGEHTIFVG 778
R F+ AG+ +K V A + ++ + D A +P G T++ G
Sbjct: 665 RPFISAGQTICVKSVIRADQMAVWLSDDAGFGFVP-GVMTVYAG 707
>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
Length = 412
Score = 335 bits (858), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 171/310 (55%), Positives = 208/310 (67%), Gaps = 14/310 (4%)
Query: 33 PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
P F C G LGL FC++ LP + R DLVSRMT EK QLGD A+GVPRLG
Sbjct: 84 PPFSCGGG--PSLGLP-----FCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLG 136
Query: 93 LPQYEWWSEALHGVSNVGPGTHFD-DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEA 151
+P Y+WW+EALHGV+ G G H D + ATSFP V+LT ASFN++LW +IGQA EA
Sbjct: 137 VPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEA 196
Query: 152 RAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
RA YN+G+A GLT WSPN+N+ RDPRWGR ETPGEDP V RYA +VRGLQ G +
Sbjct: 197 RAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQ---GSSS 253
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
T L S+CCKH AYD+++WKGV RY F A VT QD+ +TF PF CV +G A
Sbjct: 254 NTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKA 313
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG-YIVADCDSIQVMVDNHKFLADSKED 329
S VMC+Y VNG+PSCA+ LL +T RG W L G Y+ ADCD++ +M N +F + ED
Sbjct: 314 SCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIM-RNSQFYRPTAED 372
Query: 330 AVAQTLKAGL 339
VA TLKAG+
Sbjct: 373 TVATTLKAGM 382
>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
Length = 761
Score = 335 bits (858), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 244/768 (31%), Positives = 365/768 (47%), Gaps = 147/768 (19%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D S P +R K L+ +++L+EK + + V RLG+ Y WWSEALHGV+ G
Sbjct: 31 YTDKSQPAELRAKALLPKLSLEEKAGLVQYNSPAVERLGIKAYNWWSEALHGVARNG--- 87
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL----GR----AGLTYW 165
AT FP I ASF+ + + AVS EAR + GR AGL++W
Sbjct: 88 -------SATVFPQPIGMAASFDVEKIETVFTAVSDEARVKNRIAAEDGRVYQYAGLSFW 140
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP+++G+ + VRGLQ D ++ LK +C
Sbjct: 141 TPNINIFRDPRWGRGMETYGEDPYLMGQLGMAVVRGLQ--------GDPDADVLKTHACA 192
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA V + +R+ FDA+V+E+D+ ET+L F+ V + VM +YNR G P
Sbjct: 193 KHYA---VHSGLESNRHRFDAQVSERDLRETYLPAFKDLVTKAGVKEVMTAYNRFRGYPC 249
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLDC 343
A L+ + +R EW G +V+DC +I + H F+A + E+A A + GLD++C
Sbjct: 250 AASEYLVQKILREEWGYKGLVVSDCWAIPDFFEPGRHGFVA-TGEEAAALAVANGLDVEC 308
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
G ++ A+ QG +KE D+D++L + T RLG DG + L + E+
Sbjct: 309 GSTFSKIPA-AIDQGLLKEEDLDRNLLRVLTERFRLGEMDGESPWDDLDPAIVEGPEHRA 367
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
L+ + ARE +VLL+N+ LPL + + +A++GP+A+ GNY +P ++ +
Sbjct: 368 LSLDIARETMVLLRNN-GVLPLKAG--EKIALIGPNADDAQMQWGNYNPVPKSTITLLQA 424
Query: 464 F------------------------SGYANVT-------------YKTGCDDVAC----- 481
S YAN+ Y +D+
Sbjct: 425 MQARVPGLVYDRACGILDAEYAPQGSAYANLIGASEAQLEAAARRYAVSVNDIKNYIRRD 484
Query: 482 --KSNNSIFAASEAA-----KTADATIILAGLDLSVEAESL----------DREDLWLPG 524
+ + + A EAA + D + G+ +E E + DR D+ LPG
Sbjct: 485 EEQRRSFMPALDEAAVLKKLEGVDVVVFAGGISPRLEGEEMRVQVPGFSGGDRTDIELPG 544
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +L+ + + K V+LV S G I + AIL A YPG+EGG AIADV+F
Sbjct: 545 VQRRLLKALHDAGK-KVVLVNFS--GCAIGLVPETESCDAILQAWYPGQEGGTAIADVLF 601
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
G NP G+LP+T+Y V LP V+ G TY+++ G LYPFGYGLSYT
Sbjct: 602 GDVNPSGKLPVTFYKN--VDQLP-------DVEDYNMEGHTYRYFRGEPLYPFGYGLSYT 652
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
F + P V +L ++D N
Sbjct: 653 SFAFGE-------------------------------PKVKGKNL--------EIDVTNT 673
Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
GS G++VV +Y + P + A +K + F+RV V AG+ ++ +
Sbjct: 674 GSVAGTEVVQLYVRKPDDTAGP-VKTLRAFRRVSVPAGQTVKVSIPLD 720
>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
Length = 701
Score = 331 bits (849), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 241/753 (32%), Positives = 370/753 (49%), Gaps = 133/753 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEALHGVSN 108
FC++SLP S RV+DL++R+ LDEK L A PR +GLP+Y W + +HGV +
Sbjct: 34 FCNTSLPVSARVEDLLARLPLDEKAILLT--ARASPRGNMSSIGLPEYNWGANCVHGVRS 91
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
GT+ TSFP + N S+ ++
Sbjct: 92 TC-GTNC------PTSFPNPV------NLSIHRR-------------------------- 112
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
RDPRWGR TETP EDP V +Y V Y +GLQ+ + HE+ R L+ KHY
Sbjct: 113 ----RDPRWGRNTETPSEDPLVNSKYGVAYTKGLQEGK-HED-----PRYLQAVVTLKHY 162
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
AY +N+ G +R F+A V+ D +T+ F + +G+A VMCSYN VNG+P+CA+
Sbjct: 163 VAYSYENYGGGNRKTFNAIVSPYDFADTYFPAFRSSIVDGNAKGVMCSYNSVNGVPACAN 222
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ--Y 346
+L N+ +RG GYI +D +I+ + D ++ ++ +A + AG D++ G+
Sbjct: 223 NELENKLLRGMLGFDGYITSDSGAIEAISDWLHYVP-TRCEAARLAILAGTDVNSGRGFG 281
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIEL 404
Y V+ ++ +D L++ + LG FD Y + D+ +D +L
Sbjct: 282 YMACLKELVESNQLDVKVVDDVLRHTLKLRFELGLFDPIEDQPYWKVTPNDVNTDAAKKL 341
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR-------- 456
+ + AR+ IVLL+N+Q LPL VK +AVVGPHA A A++GNY G C
Sbjct: 342 SLDLARKSIVLLQNNQPVLPLRRG-VK-LAVVGPHAQAKRALLGNYLGQMCHGDYNEVGC 399
Query: 457 YMSPIAGFS---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
+P S G ++ TY GC +V S A +A + A+A ++ G+D SVEAE
Sbjct: 400 IKTPFEAVSASNGDSSTTYALGC-NVTGNSTAGFVEAVKAVQGAEAVVLFLGIDKSVEAE 458
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR ++ LP Q QL+ +V V K P ++V+M+ GGV A + A++ A YPG
Sbjct: 459 VRDRNNIDLPAIQVQLLQRVRAVGK-PTVVVLMN-GGVLTA-EDIIGQTDALVEAFYPGF 515
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
G +A+ D++FG NPGG+LP+T Y DYV + + SM + YPGR+Y+++ G
Sbjct: 516 FGAQAMTDILFGDANPGGKLPVTMYRSDYVNTVDMKSM-----NVTAYPGRSYRYFKGEP 570
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
++PFG+GLSYT F DA+ T
Sbjct: 571 VFPFGWGLSYTSFSLK-----------------------ADDATATTA------------ 595
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYS-----KPPAEIAATYI-KQVIGFQRVFVRAGRNKRI 747
++V +T + + +V++ K A AT + KQ+ ++RV ++ + R+
Sbjct: 596 --------KSVSATMNTTISVVFAYFRPIKTDASGPATLLNKQLFDYRRVTLKPSESTRL 647
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
F +L +VD N + G + I + NG
Sbjct: 648 SFEVQR-STLALVDEEGNLVSFPGSYDIIITNG 679
>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
[Taeniopygia guttata]
Length = 685
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 241/718 (33%), Positives = 365/718 (50%), Gaps = 99/718 (13%)
Query: 88 VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQA 146
+PRLG+ Y W +E L G D PG AT+FP + A+F+ L ++ A
Sbjct: 9 IPRLGIAPYNWNTECLRG----------DGEAPGWATAFPQALGLAAAFSPELIYRVANA 58
Query: 147 VSTEARAMYN----LGR----AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNY 198
+TE RA +N GR GL+ +SP +N+ R P WGR ET GEDPF+ G A ++
Sbjct: 59 TATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSGELARSF 118
Query: 199 VRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFL 258
V+GLQ + R +K S+ CKH++ + + + Y V E+D TFL
Sbjct: 119 VQGLQGP---------HPRYVKASAGCKHFSVHG--GHENILLYLLT--VLERDWRMTFL 165
Query: 259 RPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD 318
F+ CV+ G + S MCSYNR+NG+P+CA+ KLL +RGEW GY+V+D ++++++
Sbjct: 166 PQFQACVRAG-SYSFMCSYNRINGVPACANKKLLTDILRGEWGFDGYVVSDEGAVELIML 224
Query: 319 NHKFLADSKEDAVAQTLKAG--LDLDCGQYYTNFT--GNAVQQGKVKETDIDKSLKYLYT 374
H + E AVA ++ AG L+L G F A+ G + + ++ L+
Sbjct: 225 GHHYTRSFLETAVA-SVNAGCNLELSYGMRNNVFMRIPEALAMGNITLQMLRDRVRPLFY 283
Query: 375 VLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
MRLG FD Y SL + S E+ L+ EAA + VLLKN + TLPL + + +
Sbjct: 284 TRMRLGEFDPPAMNPYSSLDLSVVQSPEHRNLSLEAAVKSFVLLKNVRGTLPLKAQDLSS 343
Query: 433 --VAVVGPHANATVAMIGNYAGIP-CRYM-SPIAGFSGY-ANVTYKTGCDDVACKSNNSI 487
+AVVGP A+ + G+YA +P RY+ +P G ANV++ GC + C+
Sbjct: 344 QHLAVVGPFADNPRVLFGDYAPVPEPRYIYTPRRGLEMLGANVSFAAGCSEPRCQR---- 399
Query: 488 FAASEAAK---TADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVIL 543
++ +E K AD ++ G + VE E+ DR DL LPG+Q +L+ + A G PVIL
Sbjct: 400 YSRAELVKVVGAADVVLVCLGTGVDVETEAKDRSDLSLPGHQLELLQDAVQAAAGRPVIL 459
Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK--FNPGGRLPITWYNGD 601
++ +AG +D+++A+ + + AIL +P + G AIA V+ G+ +P GRLP TW G
Sbjct: 460 LLFNAGPLDVSWAQAHDGVGAILACFFPAQATGLAIARVLLGEAGASPAGRLPATWPAG- 518
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVN 660
+ +P P+++ GRTY++Y LYPFGYGLSYT F+Y L + +
Sbjct: 519 -MHQVP-------PMENYTMEGRTYRYYGQEAPLYPFGYGLSYTTFRYRDLVLSPPV--- 567
Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
L C NL+ + V +N G D +VV +Y +
Sbjct: 568 ---LPLCANLSVS-------------------------VVLENTGLRDSEEVVQLYLRWE 599
Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
Q++ F+RV V AGR ++ F A + +A + L G T+F G
Sbjct: 600 HSSVPVPRWQLVAFRRVAVPAGREAKLSFQVLAEQR---AVWAQHWHLEPGTFTLFAG 654
>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
Length = 704
Score = 329 bits (844), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 184/483 (38%), Positives = 287/483 (59%), Gaps = 28/483 (5%)
Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
DCD++ V+ K+ A + EDAVA LK+G+ Y N+T +AV++ KV ++ID++
Sbjct: 229 DCDAVNVLHVEQKY-AKTPEDAVADALKSGIS-----YLRNYTKSAVEKKKVTVSEIDRA 282
Query: 369 LKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
L L++ MRLG F+G P Y +G +CS E+ LA EAA +GIVLLKN LPL
Sbjct: 283 LHNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVLLKNADRLLPL 342
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSN 484
+ + + ++AV+GP+A+ + ++GNY G C+ ++ + G Y ++ +Y+ GC++V+C S
Sbjct: 343 SKSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEKGCNNVSCTSA 402
Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
+ E A+T D I++ GLD S E E LDR DL LPG Q LI VA+ AK P++LV
Sbjct: 403 -AKKKPVEMAQTEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAKAAKRPIVLV 461
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP---GGRLPITWYNGD 601
++ +D+ FA+ N I +ILWAGYPG+ G A+A ++FG+ NP GGRLP+TWY D
Sbjct: 462 LLGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPGNAGGRLPMTWYPQD 521
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
+ + +P+T M +RP S G PGRTY+FY G ++ FGYGLSY+ + Y S V
Sbjct: 522 FTK-VPMTDMRMRPQPSTGNPGRTYRFYEGEKVFEFGYGLSYSDYSYTFAS------VAQ 574
Query: 662 NKLQHCRNLNYTSDASKTRCPGV-LVNDL---RCDDY-FEFKVDFQNVGSTDGSDVVIVY 716
N+L + N + S+T PG LV+D+ +C++ F+ V +N G G V+++
Sbjct: 575 NQLNVKDSSNQQPENSET--PGYKLVSDIGEEQCENIKFKVTVSVKNEGQMAGKHPVLLF 632
Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
++ IK+++GFQ V + AG I++ + C+ L+ + ++ G +
Sbjct: 633 ARHAKPGKGRPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILL 692
Query: 777 VGN 779
VG+
Sbjct: 693 VGD 695
Score = 215 bits (548), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 111/232 (47%), Positives = 147/232 (63%), Gaps = 22/232 (9%)
Query: 17 LLVFSTNAVDANG-----SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSR 71
+L+F+ NG +S P + CD S ++ FC ++LP S R +DLVSR
Sbjct: 7 VLLFARQTKQGNGRPRKQASQPPYSCDSSDPS-----TKTYDFCKTTLPISRRAEDLVSR 61
Query: 72 MTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV---SNVGPG-THFDDVIPGATSFPT 127
+T +EK QL D + +PRLG+P YEWWSE LHG+ + V G + F+ I ATSFP
Sbjct: 62 LTFEEKATQLVDTSPAIPRLGIPAYEWWSEGLHGIGFLTRVQQGISFFNRTIQHATSFPQ 121
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGR-AGLTYWSPNINVARDPRWGRITETPGE 186
VILT ASF+ +W +IGQ V EARA+YN G+ GL +W+PN+N+ RDPRWGR ETPGE
Sbjct: 122 VILTAASFDAHIWYRIGQ-VGKEARALYNAGQVTGLGFWAPNVNIFRDPRWGRGQETPGE 180
Query: 187 DPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
DP VVG+Y ++VRG+Q EG D L+ S+CCKHY A+D+DNW
Sbjct: 181 DPLVVGKYGASFVRGVQGDSFEGESTLGDH----LQASACCKHYTAHDLDNW 228
>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
Length = 745
Score = 329 bits (843), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 233/726 (32%), Positives = 359/726 (49%), Gaps = 96/726 (13%)
Query: 47 LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------VPRLGLPQYEW 98
L + + F ++SLP+ RVKDLV R+T++E V Q+ G VPRLG+ + W
Sbjct: 21 LHVQDYPFRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSW 80
Query: 99 WSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
+E L G DV G ATSFP + A+F+ + + A S E RA +N
Sbjct: 81 NTECLRG-----------DVYAGNATSFPQALGLAATFSTEVICDVASATSIEVRAKFND 129
Query: 158 --------GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
G++ +SP IN+ R P WGR ET GEDPF+ G A +V+ LQ G +
Sbjct: 130 YQRRKIYGDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQ---GDD 186
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
++ ++ CKH+ + V R+ FDA+V+E+D TFL F+ CV+ G
Sbjct: 187 PTY------IRANAGCKHFDVHGGPENIPVSRFSFDAKVSERDWRLTFLPAFKRCVQAG- 239
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
+ S+MCS+NR+NG+P+C + +LL +R EW GY+V+D ++I+ ++ H + +S D
Sbjct: 240 SYSLMCSFNRINGVPACGNKRLLTDILRTEWGFTGYVVSDQEAIENIMTYHHYTNNSV-D 298
Query: 330 AVAQTLKAGLDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
A +KAG +L+ + +A++ GK+ + D+ KS+ L+ MRLG FD
Sbjct: 299 TAALCVKAGCNLELSTNEVKPTYFYIIDALKAGKLDKEDLVKSVSPLFYTRMRLGEFDPP 358
Query: 386 PQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
Y + I S+E+ ++ AA + VLLKN LP+ + T++V+GP A+
Sbjct: 359 DHNPYNFIDLSVIQSEEHRAISLNAAMKSFVLLKNKGGFLPI-TKLFDTISVLGPMADNK 417
Query: 444 VAMIGNYAG--IPCRYMSPIAGFSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
IG+YA +P +P+ G S + V Y GC+D AC N A ++D
Sbjct: 418 YQQIGSYAPDVMPSYTTTPLQGLSKLSKRVQYAAGCNDNACSKYNRT-EIQRAVNSSDIF 476
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLI-NQVAEVAKG-PVILVIMSAGGVDIAFAET 558
+ G +E E DR + LPG Q QL+ + + AKG P++L++ + G V+I +A+
Sbjct: 477 FVCLGTGPMIENEDHDRASMELPGQQAQLLKDAIMFSAKGVPIVLLLFNGGPVNITWADR 536
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVF---GKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
+ + AI+ +P +E G A+ VV NP GRLP TW Y +P
Sbjct: 537 SDRVVAIMECFFPAQETGEAVLRVVTNTGNSSNPAGRLPYTW--PKYQDQIP-------S 587
Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
+ + GRTY++++G LYPFGYGLSY+ F + I +
Sbjct: 588 MVNYSMEGRTYRYFHGDPLYPFGYGLSYSTFNFTNAWMNPIISQGQD------------- 634
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
+V+ N G TDG +V+ VY K I Q++GF+
Sbjct: 635 -------------------LTVRVEVCNEGPTDGDEVIQVYLKWLDTNETMPIHQLVGFE 675
Query: 736 RVFVRA 741
RV +RA
Sbjct: 676 RVSLRA 681
>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 914
Score = 328 bits (842), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 192/455 (42%), Positives = 265/455 (58%), Gaps = 33/455 (7%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ DS ++ R DLV+RMTL+EKV Q+ + A +PRLG+P Y+WW+E LHGV+ G
Sbjct: 34 YLDSQRTFAQRADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG--- 90
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------LGR-AGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GR GLT+W
Sbjct: 91 -------GATVFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRYQGLTFW 143
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD--VEGHENATDLNSRPLKVSS 223
SPNIN+ RDPRWGR ET GEDPF+ R V +V+G+Q + +NA R K+ +
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGMQGEGADAPKNAQGETYR--KLDA 201
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
KH+A V + +R+HFDAR +++D+ ET+L FE VKEG +VM +YNR+ G
Sbjct: 202 TAKHFA---VHSGPESERHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRLFGE 258
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
+ A LL +R W HGY+V+DC +I + NHK +A ++E A A +K G L+C
Sbjct: 259 SASASKFLLRDVLRERWGFHGYVVSDCWAIVDIWKNHKIVA-TREQAAALAVKNGTQLEC 317
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
GQ Y AVQQG + ETDID +L+ L T MRLG FD G ++ L S E+
Sbjct: 318 GQEYATLPA-AVQQGLIGETDIDAALRTLMTARMRLGMFDPPGQLRWAQLPISVNQSPEH 376
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
LA ARE +VLLKND LPL+ AK K +AV+GP A+ T+A++GNY G P ++ +
Sbjct: 377 DALARRTARESLVLLKND-GLLPLSRAKHKRIAVIGPTADDTMALLGNYYGTPATPVTIL 435
Query: 462 AGFSGY---ANVTYKTGCDDVACKSNNSIFAASEA 493
G A+V Y G D V +S+ + EA
Sbjct: 436 QGIRAAAPDADVLYARGADLVEGRSDPAATPLIEA 470
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 146/313 (46%), Gaps = 54/313 (17%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + A+ AD + + GL VE E + DR DL LP Q +L+ ++ K
Sbjct: 628 ALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALSATGK- 686
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+ADV+FG NPGGRLP+T+Y
Sbjct: 687 PVVAVLTTGSALAIDWAQEH--VPAILLAWYPGQRGGSAVADVLFGDTNPGGRLPVTFYK 744
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYTQF Y+ L
Sbjct: 745 A---------SETLPAFDDYAMRGRTYRYFAGTPLYPFGHGLSYTQFAYSDLRL------ 789
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
D K G L L+ N G+ G +VV +Y P
Sbjct: 790 ---------------DRRKVAADGQLSATLKV----------TNTGTRAGDEVVQLYLHP 824
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVG 778
A A IK++ GFQR+ + G ++ + F + L I D A + ++ G++ + VG
Sbjct: 825 LAPTRARAIKELRGFQRIALAPGESRDVHFTISPQTDLRIYDEAQKHYVVDPGDYELQVG 884
Query: 779 NGGVSFPIHLNFN 791
+ F+
Sbjct: 885 ASSADVRVRERFS 897
>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
Length = 452
Score = 328 bits (841), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 179/451 (39%), Positives = 262/451 (58%), Gaps = 20/451 (4%)
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQD 395
+D++CG Y + +A+QQGK+ E DI+++L L+ V MRLG F+G P+ Y +G
Sbjct: 1 MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60
Query: 396 ICSDENIELAAEAAREGIVLLKND--QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
+C+ E+ +LA EAA++GIVLLKND LPL+ V ++AV+G +AN + + GNY G
Sbjct: 61 VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120
Query: 454 PCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
PC ++P+ GY + ++ GC+ AC +I A +AA +AD+ ++ GLD E
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAACNVT-TIPEAVQAASSADSVVLFMGLDQDQER 179
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E +DR DL LPG Q LI VA AK PVILV++ G VD++FA+TN I AILWAGYPG
Sbjct: 180 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 239
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
E GG AIA V+FG+ NPGGRLP+TWY D+ + +P+T M +R + GYPGRTY+FY GP
Sbjct: 240 EAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTR-VPMTDMRMRADPATGYPGRTYRFYRGP 298
Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQ--VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
T++ FGYGLSY+++ + + L ++ + D +
Sbjct: 299 TVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKAVEATAGGMASYDVEA-------IGSET 351
Query: 691 CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRVFVRAGRNKRI 747
CD F V QN G DG V+V+ + P + + Q+IGFQ + +RA + +
Sbjct: 352 CDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHV 411
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+F + CK + ++ G H + VG
Sbjct: 412 EFEVSPCKHFSRATEDGRKVIDQGSHFVMVG 442
>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
Length = 1202
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 234/665 (35%), Positives = 340/665 (51%), Gaps = 88/665 (13%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV-SN 108
+++ +CD +LP RV DL +R T++E + Q+G A VPRLGLP + EALHGV S
Sbjct: 339 AAYPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALNYGGEALHGVWST 398
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL----------- 157
G T FP ASF+ LW+ +G A EARA++
Sbjct: 399 CAAGRC-------PTQFPAPHAMGASFDRDLWRAVGAASGLEARALFRWNQRHNASDCAR 451
Query: 158 ---GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
G GLT+++PN+N+ARDPRWGRI E P EDP + G Y +VRG Q + A
Sbjct: 452 SLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQGDGAYRVA--- 508
Query: 215 NSRPLKVSSCCKHYAAYDVD---------NWKGV-------DRYHFDARVTEQDMEETFL 258
++ KH+A Y+++ +W G DR+ FDARV+ +D EET++
Sbjct: 509 -------NAVVKHFAVYNLEVDVEDTPPADWCGSAACAPPNDRHSFDARVSPRDFEETYV 561
Query: 259 RPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD 318
PF A++ MCSYN VNG P+C D LL +RG + G + DC +++ V
Sbjct: 562 GPFVA-PVAAGAAAAMCSYNAVNGEPACTDGALLRGALRGALNFTGVLATDCGALEDAVA 620
Query: 319 NHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMR 378
HK A ++ +A A + AG+D +CG+ T+ A+ G V+ + L+ L +R
Sbjct: 621 RHKRYA-TEAEAAAAAIAAGVDSNCGKVLTSALPEALAAGLVRPDALRPPLERLLEARLR 679
Query: 379 LGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
LG D + + D + S + LA AAREG+VLL+N LPL+ T+AV
Sbjct: 680 LGLLDDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLLQNPNQILPLDGR--GTLAV 737
Query: 436 VGPHANATVAMIGNYAGIPCRYM--SPIAGFSGY---ANVTYKTGCDDVACKSNNSIFAA 490
+GP+ANA++ ++ Y G P + SP+ V Y GC + + + ++ A
Sbjct: 738 IGPNANASMNLLSGYHGTPPPDLLRSPLQELEARWRGGKVVYAVGC-NASGAATAALDEA 796
Query: 491 SEAAKTADATIILAGL------------DLSV----EAESLDREDLWLPGYQTQLINQVA 534
+ AKTAD ++ GL D + EAES+DR L LPG Q L +++
Sbjct: 797 VDLAKTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAESVDRTSLKLPGAQEALFSKIW 856
Query: 535 EVAKGPVILV-IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
+ K + V ++SAG VD +FA+ + A+L AGY GE GG A+AD + G +NPGG L
Sbjct: 857 ALGKPVAVAVFLVSAGAVDASFAK---DKAALLLAGYGGEFGGVAVADALLGAYNPGGAL 913
Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP---FGYGLSYTQFKYNL 650
T + P M +RP S PGRTY+F + + P FG+GLSYT F +L
Sbjct: 914 TATMLPD--AGLPPFRDMAMRP--SAASPGRTYRFLDERRVAPLWRFGFGLSYTAFAVSL 969
Query: 651 LSFTK 655
T+
Sbjct: 970 AGPTR 974
>gi|389636381|ref|XP_003715843.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|351648176|gb|EHA56036.1| beta-xylosidase [Magnaporthe oryzae 70-15]
gi|440480767|gb|ELQ61414.1| beta-xylosidase [Magnaporthe oryzae P131]
Length = 517
Score = 324 bits (830), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 192/496 (38%), Positives = 276/496 (55%), Gaps = 24/496 (4%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ CD +L R LV ++++EK+Q L + G PR+GLP Y WWSEALHGV+
Sbjct: 35 LSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEALHGVA- 93
Query: 109 VGPGTHFDD---VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
PGT+F +TS+P +L A F+++L +KIG A+ EARA N G AG YW
Sbjct: 94 YAPGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGWAGFDYW 153
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N +DPRWGR +ETPGED + RYA RGL +E ++ S C
Sbjct: 154 TPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQR--------RIISTC 205
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA D ++W G R+ F+A++T QD+ E +L+PF+ C ++ S+MC+YN VNG+PS
Sbjct: 206 KHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNAVNGVPS 265
Query: 286 CADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
CA+ LL +R W + + Y+ +DC+++ + NH + A + A +AG+D
Sbjct: 266 CANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHY-APTNAAGTAICFEAGMDTS 324
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDEN 401
C ++ A QG +KE +D++L LY L+R G+FDG Y L Q + S E
Sbjct: 325 CEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAMYADLDWQHVNSAEA 384
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP- 460
LA +AA EG+VLLKN+ TLPL+ +A++G A+A + G Y+G SP
Sbjct: 385 QSLALQAAVEGMVLLKNN-GTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAHHLYSPA 443
Query: 461 IAGFSGYANVTYKTG---CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
A ++T +G D+ A S+N A EAA AD + GLD S E+LDR
Sbjct: 444 FAARQLGLDITVASGPVLQDNNA--SDNWTTNALEAASGADYILYFGGLDTSAAGETLDR 501
Query: 518 EDLWLPGYQTQLINQV 533
DL P Q L+ V
Sbjct: 502 TDLDWPEAQLTLVKVV 517
>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1283
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 238/748 (31%), Positives = 373/748 (49%), Gaps = 108/748 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF--AHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ + ++P R+ DL+ R+TL+EKV QL D + G+ RL +P +E LHG S
Sbjct: 72 YLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGSKGIARLKIPAM-LKTEGLHGQS---- 126
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
G+T FP I ++F+ L +++G+A + EA+A NL WSP ++V
Sbjct: 127 ------YATGSTIFPHGINMGSTFDTELIQEVGKATAIEAKAA-NL----RVSWSPVLDV 175
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
ARD RWGR+ ET GEDP++VGR V +++G Q + +C KH+A +
Sbjct: 176 ARDARWGRVEETYGEDPYLVGRIGVAWIKGFQGEH--------------MFACPKHFAGH 221
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++++ M L PF +KE +A VM +Y NG+P +L
Sbjct: 222 G-QPVGGRDSH--DYGLSDRVMRNIHLAPFRDVIKEANAFGVMAAYGLWNGVPDNGSKEL 278
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
L + +R EW G++V+DC + + + + + E+A A ++AG+D++CG Y
Sbjct: 279 LQKILREEWGFEGFVVSDCSGPE-NIQRKQSVVGTMEEAAAMAVRAGVDIECGSAYKKAL 337
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC---SDENIELAAEA 408
+AV++G +KE+++D +L+ ++ MRLG FD P ++ + + E+ LA +
Sbjct: 338 ASAVKKGIIKESELDANLRRVFRAKMRLGLFD-RPSIENMVWNKLPEYDTPEHRALARKV 396
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGFSG 466
A + VLLKN+ N LPL+ +KT+AV+GP NA G+Y+ P + +S + G
Sbjct: 397 AVKSTVLLKNENNLLPLDK-NIKTIAVIGP--NADQGQTGDYSAKYAPGQIISVLEGVKN 453
Query: 467 YAN----VTYKTGCDDVACKSNNSIFA-ASEAAKTADATIILAGLD---------LSVEA 512
+ + V Y GC + + FA A AK ADA I++ G + S
Sbjct: 454 HVSPSTKVLYAQGCTQLDMDTTG--FAEAVNIAKQADAVILVVGDNSNRHENGNKKSTTG 511
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E++D L +PG Q QLI V K PV+LV+++ G + NI++IL YPG
Sbjct: 512 ENVDGATLEIPGVQRQLIKAVEATGK-PVVLVLVN--GKPFTLTWEDENIESILETWYPG 568
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
EEGG A AD++FG NP GRLPI++ + LPL + GR Y +Y+ P
Sbjct: 569 EEGGNATADIIFGDENPSGRLPISFPR--HPGQLPLWY-------NYETSGRNYDYYDMP 619
Query: 633 --TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
LY FG+GLSYT F+Y+ L T +K+ PG
Sbjct: 620 FTPLYRFGHGLSYTTFRYSNLKAT----------------------TKSGDPG------- 650
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ VD +N G G +V +Y T + + GF+RVF++ G K + F
Sbjct: 651 ---FVTVSVDIENTGKRPGEEVAQLYITDLVASVNTAVIDLKGFKRVFLKPGEKKTVTFE 707
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
N L++++ +L AG+ + VG
Sbjct: 708 LNPY-LLSLLNPDMKRVLEAGKFRMHVG 734
>gi|440476402|gb|ELQ45004.1| beta-xylosidase, partial [Magnaporthe oryzae Y34]
Length = 515
Score = 323 bits (828), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 191/493 (38%), Positives = 275/493 (55%), Gaps = 24/493 (4%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+S+ CD +L R LV ++++EK+Q L + G PR+GLP Y WWSEALHGV+
Sbjct: 35 LSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEALHGVA- 93
Query: 109 VGPGTHFDD---VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
PGT+F +TS+P +L A F+++L +KIG A+ EARA N G AG YW
Sbjct: 94 YAPGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGWAGFDYW 153
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N +DPRWGR +ETPGED + RYA RGL +E ++ S C
Sbjct: 154 TPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQR--------RIISTC 205
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA D ++W G R+ F+A++T QD+ E +L+PF+ C ++ S+MC+YN VNG+PS
Sbjct: 206 KHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNAVNGVPS 265
Query: 286 CADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
CA+ LL +R W + + Y+ +DC+++ + NH + A + A +AG+D
Sbjct: 266 CANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHY-APTNAAGTAICFEAGMDTS 324
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDEN 401
C ++ A QG +KE +D++L LY L+R G+FDG Y L Q + S E
Sbjct: 325 CEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAMYADLDWQHVNSAEA 384
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP- 460
LA +AA EG+VLLKN+ TLPL+ +A++G A+A + G Y+G SP
Sbjct: 385 QSLALQAAVEGMVLLKNN-GTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAHHLYSPA 443
Query: 461 IAGFSGYANVTYKTG---CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
A ++T +G D+ A S+N A EAA AD + GLD S E+LDR
Sbjct: 444 FAARQLGLDITVASGPVLQDNNA--SDNWTTNALEAASGADYILYFGGLDTSAAGETLDR 501
Query: 518 EDLWLPGYQTQLI 530
DL P Q L+
Sbjct: 502 TDLDWPEAQLTLV 514
>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
Length = 569
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 193/551 (35%), Positives = 284/551 (51%), Gaps = 57/551 (10%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+ S CD++ P + R LV M EK+Q + + GV RLGLP Y WWSEALHGV+
Sbjct: 59 LKSNKVCDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSEALHGVAG 118
Query: 109 VGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
PG F P ATS P IL A+F++ L +K+G + TEARA N +G+ +W+
Sbjct: 119 A-PGIQFSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGNGNHSGIDFWT 177
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
PNIN +DPRWGR +ETPGED + Y +RGL+ N ++ + CK
Sbjct: 178 PNINPFKDPRWGRGSETPGEDTLRLKGYVAALLRGLEG----------NKAQRRIIATCK 227
Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
HYAA D+++W GV R+ FDA+++ QD+ E +L+PF+ C ++ S MCSYN VNG+P+C
Sbjct: 228 HYAANDLESWNGVTRHDFDAKISMQDLAEYYLQPFQQCARDSKVGSFMCSYNSVNGVPAC 287
Query: 287 ADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
A+ LL +R W+ + Y+ +DC+++Q + NH + A + A AG D C
Sbjct: 288 ANKYLLQTILRDHWNWTSENQYVTSDCEAVQDISLNHHY-ASTNAAGTALAFNAGTDSSC 346
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENI 402
G+FDGS Y SLG D+ + +
Sbjct: 347 ----------------------------------EAGYFDGSKALYSSLGWSDVNTPQAQ 372
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI- 461
+LA +A +GIV+LKND TLPL VA++G A+ + + G Y+G +P+
Sbjct: 373 QLALQATVDGIVMLKND-GTLPLKLDSKSKVAMIGFWASDSSKLQGGYSGKAPYLRTPVY 431
Query: 462 -AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
A G+ A ++N A AA +D + GLD S AE +DR L
Sbjct: 432 AAQQLGFTPNVATGPVQQSASATDNWTTNALAAASKSDYILYFGGLDTSAAAEGVDRTSL 491
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
P Q LI +++ + K P+I +I +D TN + +ILWA +PG++GG A+
Sbjct: 492 EWPSAQLALIKKLSALGK-PLI-IIQEGDQMDNTPLLTNKGVSSILWASWPGQDGGPAVM 549
Query: 581 DVVFGKFNPGG 591
++ G +P G
Sbjct: 550 QIISGAKSPAG 560
>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
bacterium]
Length = 780
Score = 321 bits (822), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 241/773 (31%), Positives = 351/773 (45%), Gaps = 149/773 (19%)
Query: 47 LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
L +S+ + D SLP R KDLVSR+TL+EK + V LG+ Y WWSEALHGV
Sbjct: 39 LSLSAQPYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEALHGV 98
Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------ 160
+ G AT FP I ASF+E L ++ AVS EAR Y + +
Sbjct: 99 ARNG----------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKIAKESGHIGQ 148
Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
G+T+W+PNIN+ RDPRWGR ET GEDP++ G+ + VRGLQ +S
Sbjct: 149 YQGVTFWTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQGPS--------DSPV 200
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
LK +C KHYA + W +R+ +DA V+E+D+ ET+L F+ V + + VM +YN
Sbjct: 201 LKAHACAKHYAVHSGPEW---NRHSYDAEVSERDLRETYLPAFKDLVTKANVQEVMTAYN 257
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLK 336
R G P A L+N +RGEW G I +DC +++ + H + D A A
Sbjct: 258 RFRGEPCGASDYLINTILRGEWGYKGLITSDCWAVEDFYVQGRHGYSPDVASAAAAAVHA 317
Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDI 396
+D +CGQ Y + AV++G + E D+D++L L+T +LG D + L +
Sbjct: 318 G-VDTECGQAYRHIP-EAVERGLLDEKDLDRNLIRLFTARYQLGEMDDISLWDDLPASIL 375
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
E++ L+ + A+E +VLL+N LPL A VA+VGP+ + GNY +P R
Sbjct: 376 EGPEHLALSRKMAQESMVLLQNKGGILPL--APDVRVALVGPNGDDREMQWGNYNPVPGR 433
Query: 457 YMSPIAGF-SGYANVTYKTGCDDVACK------SNNSIFAASEAAKTA------------ 497
++ + + Y GC V + NN + A ++
Sbjct: 434 TVTLYDALKERFPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKSREEMEAIARQYAIGV 493
Query: 498 ---------------------------------DATIILAGLDLSVEAESL--------- 515
D I G+ E E +
Sbjct: 494 QDILNYVRRQERMQASFLPELDVQSVLKELEGIDVVIFAGGISPRFEGEEMPVNLPGFKG 553
Query: 516 -DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
DR D+ LP Q L+ + + K VILV S G I + AIL A YPGEE
Sbjct: 554 GDRTDIQLPQVQRDLMKALHDAGKK-VILVNFS--GCAIGLVPETESCDAILQAWYPGEE 610
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
GG AI DV+FG NP G+LP+T+Y V+ LP ++ G TY+++ G L
Sbjct: 611 GGLAITDVLFGDVNPSGKLPVTFYRS--VEDLP-------DFENYDMKGHTYRYFKGKPL 661
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
+PFGYGLSY+ F+Y K +V N L
Sbjct: 662 FPFGYGLSYSTFRY------KRAKVRNNSL------------------------------ 685
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+ +N G + ++VV VY + + +K + F+RV + AG+ ++
Sbjct: 686 ---IIPVKNTGKREATEVVQVYVRRKGDPDGP-VKTLRAFRRVTIPAGKTVKV 734
>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 754
Score = 320 bits (819), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 245/743 (32%), Positives = 360/743 (48%), Gaps = 104/743 (13%)
Query: 41 RFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF-------AHGVPRLGL 93
F+ + F F + SLP R++DLV+R+T++E + QL A + RLG+
Sbjct: 14 HFASSKVTSEEFPFRNFSLPIEERLEDLVNRLTIEEVILQLSRGGVRDNGPAPAITRLGI 73
Query: 94 PQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
Y+W +E L G + G AT FP I A+F++ L K+ + V+ EARA
Sbjct: 74 GPYQWNTECLRGYAMNG----------DATCFPQPIGLAATFDQGLIYKMAKTVALEARA 123
Query: 154 MYN-------LG-RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
+N G GL+ +SP IN+ R P WGR ET GEDP + A YV GLQ
Sbjct: 124 KHNNFTKNGNFGDHTGLSCFSPVINILRHPLWGRNQETYGEDPVLTSLMARAYVTGLQGD 183
Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
E + L ++ CKH+ AY R+ F A V++ D+ TF F CV
Sbjct: 184 EIY----------LPATAVCKHFVAYGGPENIPTTRFSFSANVSDHDIGTTFYPAFRECV 233
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
G A VMCSYN +NG+PSCA+P +L T+R ++ GY+V+D ++++ +D +
Sbjct: 234 HAG-AQGVMCSYNAINGVPSCANP-MLETTLRKKFHFDGYVVSDENALE-NIDLYFNFTK 290
Query: 326 SKEDAVAQTLKAGLDLDCGQY-YTN---FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
SK + A L AG+DL+ + TN AV+QG V E + +S K L+ M LG
Sbjct: 291 SKLETAAVALNAGVDLELTGFGKTNRYSLLNQAVEQGLVTEAALRRSAKRLFRTRMALGE 350
Query: 382 FDGSPQY---VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGP 438
FD P++ +++ + S + + A E A + VLLKND LPL K V++VGP
Sbjct: 351 FD-PPEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKND-GILPLKQLYDK-VSIVGP 407
Query: 439 HANATVAMIGNY-AGIPCRYMS-PIAG---FSGYANVTYKTGC------DDVACKSNNSI 487
N + A+ G+Y A +Y S P+ S + TGC + C + NS
Sbjct: 408 FINNSEALTGDYPAEFNLKYFSSPLFAANSLSSSGVARFTTGCVGTNNQNLPICATYNST 467
Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
E +D ++ G VEAES DR D+ LPG Q QLI V + A GPVI+V+ +
Sbjct: 468 -NVKEVVTGSDIVLVTLGTGRGVEAESNDRRDINLPGKQLQLIQDVVKYANGPVIVVLFN 526
Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
AG +D+++ NT A++ + + G A+ +V+ G NP GRLP TW P
Sbjct: 527 AGPLDVSWVMGNT--AAVIACHFSAQMTGEAMLEVLTGVVNPAGRLPNTW---------P 575
Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKY-NLLSFTKTIQVNLNKLQH 666
+ + P+ RTY++ L+PFGYGLSYT+F Y + + TIQ
Sbjct: 576 ASMEQVPPMTDYSMHERTYRYSTSSPLFPFGYGLSYTKFWYLDAVVEPTTIQ-------- 627
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
RC +V +V QN G DG +VV +Y +
Sbjct: 628 -------------RCQIPVV-----------RVLIQNTGHLDGEEVVQIYMTSKKKRDRE 663
Query: 727 YIKQVIGFQRVFVRAGRNKRIKF 749
++Q++ FQRV ++AG I
Sbjct: 664 LLRQLVAFQRVPIKAGEEVSISL 686
>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
Length = 834
Score = 318 bits (815), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 233/734 (31%), Positives = 358/734 (48%), Gaps = 102/734 (13%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVP-----RLGLPQYEWWSEALH 104
+ F + LP++ R+ DLV R+TL+EK+QQL G A P RLG+ + W SE +
Sbjct: 34 YPFRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVWGSECVT 93
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA---- 160
G+ GT +D P T+FP + A+F+ +L K+ ++ E RA N R
Sbjct: 94 GL-----GTDGND--PHGTAFPQPLGMAATFDPALLKRAAGTIALELRAQRNFDRENGVV 146
Query: 161 ----GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
GL+ WSP +N+ R P WGR ET GE P + A ++V G+Q ++
Sbjct: 147 KFHHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQGN---------HT 197
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
R ++ CKH +D + G D RY FDA V++ D+ TFL FE C G M
Sbjct: 198 RYYAAAAACKH-----LDVYGGPDNLRYVFDADVSQADLTGTFLMAFEECAAAG-VMGYM 251
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYN + G+P+CA+ + + R +W GY+V+D ++ + ++H + A+ AVA
Sbjct: 252 CSYNSIRGVPACANYRTMTFFAREQWGFEGYVVSDQGAVFRITESHNYTANQTLGAVA-A 310
Query: 335 LKAGLDL---DCGQYYTNFTGNAVQQGKVKE-TDIDKSLKYLYTVLMRLGFFDGSPQ--- 387
L AG D+ D Q+ + + K+ + ID S+ L+ V MRLG FD P+
Sbjct: 311 LNAGCDMEDSDDAQHVAYYNLSLALDLKLTDMATIDASVSRLFYVRMRLGEFD-PPENDP 369
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANATVAM 446
+ SL + S ++E+A + A IVLLKN TLPL++A K + ++GP A+ M
Sbjct: 370 WRSLNMSIVSSPAHVEMARDVATASIVLLKNQNETLPLSAAAKNASYCLLGPFADNADLM 429
Query: 447 IGNYA-----GIPCRYMSPIAGF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTA 497
+G Y+ + Y + +A S A+ Y GC C ++ + +
Sbjct: 430 MGKYSPHGSTNVTVTYRAGLAAALQNASQTASFQYLEGCTGPFCDGLDTAAVTTFIQQGC 489
Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV--AKGPVILVIMSAGGVDIAF 555
D ++ G VE+ESLDR ++ PG Q L+ V E K ++L++ +AG VD+A
Sbjct: 490 DTVLLAVGTSYHVESESLDRSNMSFPGAQPTLVQTVLEALGTKQRLVLLVSTAGPVDLAA 549
Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
E +T + AIL Y G+ G A+AD++ G+ +P GRLP +W N ++ +P P
Sbjct: 550 LEQDTRVAAILDLIYLGQTAGTALADILLGETSPSGRLPFSWPN-------KVSDVP--P 600
Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
+D GRTY+F L+PFGYGLSYTQF + L+ + V C+ L
Sbjct: 601 IDDYTMQGRTYRFAQADVLFPFGYGLSYTQFNLSHLAAPYILPV-------CQAL----- 648
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
V+ N G G+ + VY + P + I+Q+
Sbjct: 649 --------------------RLSVNVTNTGRLSGAIPLQVYVEWPNAVGGP-IRQLATTT 687
Query: 736 RVFVRAGRNKRIKF 749
RVFV A +K ++
Sbjct: 688 RVFVDAASSKTVQL 701
>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 902
Score = 318 bits (815), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 182/445 (40%), Positives = 256/445 (57%), Gaps = 31/445 (6%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ + R DLVSRMTL+EK Q+ + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSC 224
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ EG + + P K+ +
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGEPYRKLDAT 203
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + DR+HFDAR +++D+ ET+L FE VK+G +VM +YNRV G
Sbjct: 204 AKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+ A LL +R +W GY+V+DC +I + +HK +A ++E A A +K G +L+CG
Sbjct: 261 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECG 319
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
+ Y+ AV QG + E ID +LK L T MRLG FD G + ++ S +
Sbjct: 320 EEYSTLPA-AVHQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHD 378
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLLKND LPL+ AK+K +AV+GP A+ T+A++GNY G P ++ +
Sbjct: 379 ALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 437
Query: 463 GFSGY---ANVTYKTGCDDVACKSN 484
G A V Y G D V + +
Sbjct: 438 GIRAAAPNAQVLYARGADLVEGRDD 462
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 129/282 (45%), Gaps = 53/282 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + A++AD + + GL VE E + DR DL LP Q L+ + K
Sbjct: 629 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+AD +FG NPGGRLP+T+Y
Sbjct: 688 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYTQF Y+ L +T
Sbjct: 746 ---------ESETLPAFDDYAMRGRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT-- 794
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+ D V +N G G +VV +Y P
Sbjct: 795 -----------------------------IAADGSLTATVTVKNTGQRAGDEVVQLYLHP 825
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
K++ GFQR+ ++ G + + F +A +L I D
Sbjct: 826 LTPQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRIYD 867
>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 913
Score = 317 bits (812), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 182/447 (40%), Positives = 262/447 (58%), Gaps = 35/447 (7%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ + R DLV+RMTL+EK Q+ + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKVSS 223
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ DV+ +NA R K+ +
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEDVDVPKNAQGEAYR--KLDA 204
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
KH+A V + DR+HFDA +++D+ ET+L FE VKEG +VM +YNRV G
Sbjct: 205 TAKHFA---VHSGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
+ A LL +R W GY+V+DC +I + NHK +A ++E+A A +K G +L+C
Sbjct: 262 SASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVA-TREEAAALAVKHGTELEC 320
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
G Y+ AV++G + E D+D +L+ L MRLG FD P+ ++ + + ++++ E
Sbjct: 321 GAEYSTLP-TAVRKGLISEADVDNALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPE 378
Query: 404 ---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
LA ARE +VLLKND LPL+ AK+K +AVVGP A+ T+A++GNY G P ++
Sbjct: 379 HDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTV 437
Query: 461 IAGFSGY---ANVTYKTGCDDVACKSN 484
+ G A V Y G D V + +
Sbjct: 438 LQGIREAAPDAEVLYARGADLVEGRDD 464
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 141/300 (47%), Gaps = 54/300 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A +AA+ AD + + GL VE E + DR DL LP Q L+ + K
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+ADV+FG NPGGRLP+T+Y
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYTQF Y+ L
Sbjct: 748 ---------ESETLPAFDDYAMRGRTYRYFAGTALYPFGHGLSYTQFAYSDLRL------ 792
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
D SK G L L+ +N G G +VV +Y +P
Sbjct: 793 ---------------DRSKLAADGRLHATLKV----------KNTGQRAGDEVVQLYLQP 827
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
+ K + GFQR+ ++ G + ++F + L + D A ++ G++ + VG
Sbjct: 828 LSPQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKAYVVDPGDYELQVG 887
>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 901
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 183/451 (40%), Positives = 257/451 (56%), Gaps = 31/451 (6%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q ++ + D+ + R DLVSRMTL+EK Q+ + A +PRL +P Y+WW+EALHGV+
Sbjct: 28 QAATPPYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVA 87
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGR 159
G GAT FP I A+F+ L ++ A+S EARA ++
Sbjct: 88 RAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 137
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+WSPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ EG + P
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEPY 196
Query: 220 -KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K+ + KH+A V + DR+HFDAR +++D+ ET+L FE VKEG +VM +YN
Sbjct: 197 RKLDATAKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYN 253
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
RV G + A LL +R +W GY+V+DC +I + +HK +A ++E A A +K G
Sbjct: 254 RVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 312
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDI 396
+L+CG+ Y AV+QG + E ID +LK L T MRLG FD G + ++
Sbjct: 313 TELECGEEYATLPA-AVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVN 371
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
S + LA ARE +VLLKND LPL+ AK+K +AV+GP A+ T+A++GNY G P
Sbjct: 372 QSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAA 430
Query: 457 YMSPIAGFSGY---ANVTYKTGCDDVACKSN 484
++ + G A V Y G D V + +
Sbjct: 431 PVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 130/282 (46%), Gaps = 53/282 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + A++AD + + GL VE E + DR DL LP Q L+ + +
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGR- 686
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+AD +FG NPGGRLP+T+Y
Sbjct: 687 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYTQF Y+ L +T
Sbjct: 745 ---------ESETLPAFDDYAMRGRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT-- 793
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+ D V +N G G +VV +Y P
Sbjct: 794 -----------------------------IATDGSLTATVTVKNTGQRAGDEVVQLYLHP 824
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
A K++ GFQR+ ++ G + + F NA +L + D
Sbjct: 825 LAPQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYD 866
>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 913
Score = 316 bits (810), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 181/447 (40%), Positives = 262/447 (58%), Gaps = 35/447 (7%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ + R DLV+RMTL+EK Q+ + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 94 -------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD--VEGHENATDLNSRPLKVSS 223
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ + +NA R K+ +
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEAYR--KLDA 204
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
KH+A V + DR+HFDA +++D+ ET+L FE VKEG +VM +YNRV G
Sbjct: 205 TAKHFA---VHSGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
+ A LL +R W GY+V+DC +I + NHK +A ++E+A A +K G +L+C
Sbjct: 262 SASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVA-TREEAAALAVKHGTELEC 320
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
G Y+ +AV++G + E D+DK+L+ L MRLG FD P+ ++ + + ++++ E
Sbjct: 321 GAEYSTLP-SAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPE 378
Query: 404 ---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
LA ARE +VLLKND LPL+ AK+K +AVVGP A+ T+A++GNY G P ++
Sbjct: 379 HDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTV 437
Query: 461 IAGFSGY---ANVTYKTGCDDVACKSN 484
+ G A V Y G D V + +
Sbjct: 438 LQGIREAAPDAEVLYARGADLVEGRDD 464
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 144/313 (46%), Gaps = 54/313 (17%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A +AA+ AD + + GL VE E + DR DL LP Q L+ + K
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+ADV+FG NPGGRLP+T+Y
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYTQF Y+ L
Sbjct: 748 ---------ESETLPAFDDYAMRGRTYRYFAGTPLYPFGHGLSYTQFAYSDLRL------ 792
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
D SK G L L+ +N G G +VV +Y +P
Sbjct: 793 ---------------DRSKLAADGRLHATLKV----------KNTGQRAGDEVVQLYLQP 827
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
+ K + GFQR+ ++ G + ++F + L + D A ++ G++ + VG
Sbjct: 828 LSPQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKGYVVDPGDYELQVG 887
Query: 779 NGGVSFPIHLNFN 791
+ F+
Sbjct: 888 ASSSDVRVRQRFS 900
>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
Length = 731
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 211/638 (33%), Positives = 329/638 (51%), Gaps = 61/638 (9%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDE----KVQQLGDFAHGVPRLGLPQYEWWSE 101
G+ + F F D +L + RV DLV R+T++E V Q G V RLG+ Y++ +E
Sbjct: 14 GVANAKFPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINE 73
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL---- 157
+ GV +T+FP I ASF+ L + QA++ E R YN
Sbjct: 74 CITGVR-----------WENSTAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEVKS 122
Query: 158 ---GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
G G+ ++P IN+ R P WGR ET GEDP++ G+ +V +V+GLQ
Sbjct: 123 QIYGHRGVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQGD--------- 173
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
+ R ++ S CKH+ ++ V R+ FDA+V+E+D TFL F+ CV+ G + ++M
Sbjct: 174 HPRYIQASGGCKHFDVHNGPENIPVSRFGFDAKVSERDWRMTFLPQFKTCVEAG-SINIM 232
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
CSYNR+NG+P+CA+ KLL +R EW +GY+++D +I+ +V +HK+ E A A +
Sbjct: 233 CSYNRINGVPACANKKLLTDILRKEWGFNGYVISDSGAIENIVYHHKYTKTLAE-AAADS 291
Query: 335 LKAGLDLD------CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
+KAG +++ G Y N NAV+Q + E ++ ++LK MR G FD
Sbjct: 292 VKAGCNVELTGATGSGVAYFNLL-NAVKQNLISEEELRENLKKPMYSRMRQGEFDPVDMN 350
Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ + + S E+ +LA +A+ VL+KN LPL + +A++GP A+ +
Sbjct: 351 PFTKIDMSVVLSQEHQDLAVKASAMSFVLMKNLNRVLPLKK-RFDRLAIIGPFADNAETL 409
Query: 447 IGNYAG--IPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
G+Y P +P G +V Y +GCDD +C +N A +A K A +
Sbjct: 410 FGDYIPNWDPKFVSTPYEGLKSLGDDVRYASGCDDPSC-TNYDPKAIEKAVKGAQFVFVC 468
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK-GPVILVIMSAGGVDIAFAETNTNI 562
G+ ++E E DR DL LPGYQ Q++ ++ P++LV+ +AG VD+ + + + +
Sbjct: 469 LGVGSNLEREGHDRADLDLPGYQLQILKDAEFFSREAPLVLVLFNAGPVDLTWPKLSPEV 528
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFN---PGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
I+ YP G+A+ VV + P RLP TW P + +
Sbjct: 529 DGIIECFYPAMGTGKALYQVVTATGDDGVPAARLPSTW---------PAQLHQVPSITDY 579
Query: 620 GYPGRTYKFYN-GPTLYPFGYGLSYTQFKYNLLSFTKT 656
G TY++++ G LYPFGYGLSYT F Y +S + T
Sbjct: 580 NMTGHTYRYFDGGDPLYPFGYGLSYTSFHYQTVSVSPT 617
>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 901
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 180/450 (40%), Positives = 254/450 (56%), Gaps = 29/450 (6%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q ++ + D+ + R DLVSRMTL+EK Q+ + A +PRL +P Y+WW+EALHGV+
Sbjct: 28 QTATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVA 87
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGR 159
G GAT FP I A+F+ L ++ A+S EARA ++
Sbjct: 88 RAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 137
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+WSPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ R
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGERYR 197
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K+ + KH+A V + DR+HFDAR +++D+ ET+L FE VK+G +VM +YNR
Sbjct: 198 KLDATAKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNR 254
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
V G + A LL +R +W GY+V+DC +I + +HK +A ++E A A +K G
Sbjct: 255 VYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGT 313
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDIC 397
+L+CG+ Y AV+QG + E ID +LK L T MRLG FD G + ++
Sbjct: 314 ELECGEEYATLPA-AVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVNQ 372
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
S + LA ARE +VLLKND LPL+ AK+K +AV+GP A+ T+A++GNY G P
Sbjct: 373 SPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAP 431
Query: 458 MSPIAGFSGY---ANVTYKTGCDDVACKSN 484
++ + G A V Y G D V + +
Sbjct: 432 VTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 141/317 (44%), Gaps = 60/317 (18%)
Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
AG + + Y G D A + + + A + A++AD + + GL VE E
Sbjct: 593 AGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEGEE 652
Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
+ DR DL LP Q L+ + K PV+ V+ + + I +A+ + + A
Sbjct: 653 MKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK-PVVAVLTAGSALAIDWAQQH--LPA 709
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
IL A YPG+ GG A+AD +FG NPGGRLP+T+Y S L D GR
Sbjct: 710 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMRGR 760
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+++ G LYPFG+GLSYTQF Y+ L +T
Sbjct: 761 TYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT--------------------------- 793
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
+ D V +N G G +VV +Y P A K++ GFQR+ ++ G
Sbjct: 794 ----IATDGSLTATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGEQ 849
Query: 745 KRIKFVFNACKSLNIVD 761
+ + F NA +L + D
Sbjct: 850 RELGFTINAKDALRLYD 866
>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
Length = 549
Score = 315 bits (807), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 201/512 (39%), Positives = 283/512 (55%), Gaps = 40/512 (7%)
Query: 55 CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
CD R LV + ++EK+Q L D + G RLGLP Y WWSEALHGV+ PG
Sbjct: 39 CDPKATPPERAAALVKALNIEEKLQNLVDMSKGAERLGLPAYAWWSEALHGVA-ASPGVR 97
Query: 115 FDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
F+ G ATSF I +A+F++ L K+ +STEARA N G AGL YW+PNIN
Sbjct: 98 FNRTAGGRFSSATSFANSITLSAAFDDELVYKVADTISTEARAFANAGLAGLDYWTPNIN 157
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
+DPRWGR ETPGEDP + Y + GL+ D + R KV + CKHYAA
Sbjct: 158 PYKDPRWGRGHETPGEDPVRIKGYVKALLAGLE-------GDDPSIR--KVVATCKHYAA 208
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
YD++ W+G R+ FDA V+ QD+ E +L PF+ C ++ S MCSYN +NG P+CA
Sbjct: 209 YDLERWQGTTRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNALNGTPACASTY 268
Query: 291 LLNQTVRGEW---DLHGYIVADCDSIQVMVDN---HKFLADSKE-DAVAQTLKAGLDLDC 343
L++ +R W + + YI +DC++IQ + H F + E +A A +AG D C
Sbjct: 269 LMDDILRKHWGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAAVAYQAGTDTVC 328
Query: 344 G----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDI 396
YT+ G A Q + E ID +LK LY L+R+G+FD GSP Y S+G +D+
Sbjct: 329 EVPGWPPYTDVIG-AYNQTLLSEEVIDTALKRLYEGLVRVGYFDPASGSP-YRSIGWEDV 386
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA--MIGNYAGIP 454
+ E ELA ++ +G+VLLKND TLPLN + KTVA++G AN+T ++G Y+G P
Sbjct: 387 NTPEAQELALQSGTDGLVLLKND-GTLPLN-LEDKTVALIGFWANSTNGGRILGGYSGFP 444
Query: 455 CRYMSPIAGFSGYANVTYKTGCDDVA-----CKSNNSIFAASEAAKTADATIILAGLDLS 509
SP+ N+TY +A ++ + A E AK ++ + G D S
Sbjct: 445 PYIHSPVDAAEKL-NLTYHYASGPLAENITQAAIDDWVAKALEPAKKSNVILYFGGTDTS 503
Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPV 541
+ AE LDR+ + P Q +I ++ + + P
Sbjct: 504 IAAEDLDRDSIAWPEIQLAVIEALSALRQAPA 535
>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 343
Score = 315 bits (807), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 150/335 (44%), Positives = 216/335 (64%), Gaps = 9/335 (2%)
Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
MIGNYAG+ C Y +P+ G YA + +GC+DV C N AA AA+ ADATI++ G
Sbjct: 1 MIGNYAGVACGYTTPLQGIRRYAKTVHLSGCNDVFCNGNQQFNAAEVAARHADATILVMG 60
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
LD S+EAE DR+ L LPGYQ +L+++VA ++GP ILV+MS G +D++FA+ + I AI
Sbjct: 61 LDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAI 120
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LW GYPG+ GG AIADV+FG NPGG+LP+TWY DY+ +P+T+M +R S GYPGRT
Sbjct: 121 LWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHDYLAKVPMTNMGMRADPSRGYPGRT 180
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
Y+FY GP ++PFG+G+SYT F ++L+ + + V L L RN S+A +
Sbjct: 181 YRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASLHVSRNTTGASNA-------IR 233
Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
V+ C+ +D +N G DG+ ++V+S PP +T KQ+IGF++V + G
Sbjct: 234 VSHANCEALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWSTQ-KQLIGFEKVHLVTGSQ 292
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
KR+K + CK L++VD +P GEH +++G+
Sbjct: 293 KRVKIDIHVCKHLSVVDRFGIRRIPNGEHYLYIGD 327
>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 901
Score = 315 bits (807), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 181/445 (40%), Positives = 254/445 (57%), Gaps = 31/445 (6%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ + R DLVSRMTL+EK Q+ + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 34 YLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 90
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 91 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 143
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSC 224
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ EG + P K+ +
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEPYRKLDAT 202
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + DR+HFDAR +++D+ ET+L FE VK+G +VM +YNRV G
Sbjct: 203 AKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 259
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+ A LL +R +W GY+V+DC +I + +HK +A ++E A A +K G +L+CG
Sbjct: 260 ASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECG 318
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
+ Y AV+QG + E ID +LK L T MRLG FD G + ++ S +
Sbjct: 319 EEYATLPA-AVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHD 377
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLLKND LPL+ AK+K +AV+GP A+ T+A++GNY G P ++ +
Sbjct: 378 ALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 436
Query: 463 GFSG---YANVTYKTGCDDVACKSN 484
G A V Y G D V + +
Sbjct: 437 GIRAAAPKAQVLYARGADLVEGRDD 461
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 95/317 (29%), Positives = 141/317 (44%), Gaps = 60/317 (18%)
Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
AG + + Y G D A + + + A + A++AD + + GL VE E
Sbjct: 593 AGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEGEE 652
Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
+ DR DL LP Q L+ + + PV+ V+ + + I +A+ + + A
Sbjct: 653 MKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGR-PVVAVLTTGSALAIDWAQQH--LPA 709
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
IL A YPG+ GG A+AD +FG NPGGRLP+T+Y S L D GR
Sbjct: 710 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMRGR 760
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+++ G LYPFG+GLSYTQF Y+ L +T
Sbjct: 761 TYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT--------------------------- 793
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
+ D V +N G G +VV +Y P A K++ GFQR+ ++ G
Sbjct: 794 ----IATDGSLTATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGEQ 849
Query: 745 KRIKFVFNACKSLNIVD 761
+ + F NA +L + D
Sbjct: 850 RELGFTINAKDALRLYD 866
>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 902
Score = 315 bits (806), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 180/445 (40%), Positives = 256/445 (57%), Gaps = 31/445 (6%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ + R DLVSRMTL+EK Q+ + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSC 224
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GL+ EG + + P K+ +
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLRG-EGADAPKNAQGEPYRKLDAT 203
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + DR+HFDAR +++D+ ET+L FE VK+G +VM +YNRV G
Sbjct: 204 AKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+ A LL +R +W GY+V+DC +I + +HK +A ++E A A +K G +L+CG
Sbjct: 261 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECG 319
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
+ Y+ AV+QG + E ID +L L T MRLG FD G + ++ S +
Sbjct: 320 EEYSTLPA-AVRQGLIDEAQIDTALTTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHD 378
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLLKND LPL+ AK+K +AV+GP A+ T+A++GNY G P ++ +
Sbjct: 379 ALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 437
Query: 463 GFSGY---ANVTYKTGCDDVACKSN 484
G A V Y G D V + +
Sbjct: 438 GIRAAAPNAQVLYARGADLVEGRDD 462
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 90/282 (31%), Positives = 130/282 (46%), Gaps = 53/282 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + A +AD + + GL VE E + DR DL LP Q L+ + K
Sbjct: 629 ALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+AD +FG NPGGRLP+T+Y
Sbjct: 688 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYTQF Y+ L +T
Sbjct: 746 ---------ESETLPAFDDYAMRGRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT-- 794
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+ D V +N G G +VV +Y P
Sbjct: 795 -----------------------------IAADGSLTATVTVKNTGQRAGDEVVQLYLHP 825
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
K++ GFQR+ ++AG + + F+ +A +L I D
Sbjct: 826 LTPQRERAGKELHGFQRITLQAGEQRALHFILDAKNALRIYD 867
>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 901
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 179/450 (39%), Positives = 254/450 (56%), Gaps = 29/450 (6%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q ++ + D+ + R DLVSRMTL+EK Q+ + A +PRL +P Y+WW+EALHGV+
Sbjct: 28 QAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVA 87
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--------GR 159
G GAT FP I A+F+ L ++ A+S EARA ++
Sbjct: 88 RAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERY 137
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+WSPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ G R
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPKNAQGERYR 197
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K+ + KH+A V + DR+HFDA +++D+ ET+L FE VK+G +VM +YNR
Sbjct: 198 KLDATAKHFA---VHSGPEADRHHFDAHPSQRDLYETYLPAFEALVKDGKVDAVMGAYNR 254
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
V G + A LL +R +W GY+V+DC +I + +HK +A ++E A A +K G
Sbjct: 255 VYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGT 313
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDIC 397
+L+CG+ Y+ AV+QG + E ID +LK L T MRLG FD G + +
Sbjct: 314 ELECGEEYSTLPA-AVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSQIPASVNQ 372
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
S + LA ARE +VLLKND LPL+ A++K +AV+GP A+ T+A++GNY G P
Sbjct: 373 SPAHDALARRTARESLVLLKND-GLLPLSRARLKRIAVIGPTADDTMALLGNYYGTPAAP 431
Query: 458 MSPIAGFSGY---ANVTYKTGCDDVACKSN 484
++ + G A V Y G D V + +
Sbjct: 432 VTVLQGIRAAAPNAQVLYARGADLVEGRDD 461
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 94/317 (29%), Positives = 139/317 (43%), Gaps = 60/317 (18%)
Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
AG + + Y G D A + + + A + A++A+ + + GL VE E
Sbjct: 593 AGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSAEVVVFVGGLTGDVEGEE 652
Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
+ DR DL LP Q L+ + K PV+ V+ + + I +A+ + + A
Sbjct: 653 MKVNYPGFAGGDRTDLRLPKPQRDLLEALHATGK-PVVAVLTTGSALAIDWAQQH--LPA 709
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
IL A YPG+ GG A+AD +FG NPGGRLP+T+Y S L D GR
Sbjct: 710 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMRGR 760
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+++ G LYPFG+GLSYTQF Y+ L +T
Sbjct: 761 TYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT--------------------------- 793
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
+ D V +N G G +VV +Y P K++ GFQR+ + G
Sbjct: 794 ----IATDGSLTATVTVKNTGQRAGDEVVQLYLHPLTPQRERAGKELHGFQRIALTPGEQ 849
Query: 745 KRIKFVFNACKSLNIVD 761
+ + F NA +L + D
Sbjct: 850 RELGFTINAKDALRLYD 866
>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 901
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 182/445 (40%), Positives = 252/445 (56%), Gaps = 31/445 (6%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ + R DLVSRMTL+EK Q+ + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 34 YLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 90
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 91 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 143
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSC 224
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ EG + P K+ +
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEPYRKLDAT 202
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH A V + DR+HFDAR +++D+ ET+L FE VKEG +VM +YNRV G
Sbjct: 203 AKHLA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGES 259
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+ A LL +R +W GY+V+DC +I + +HK +A ++E A A +K G +L+CG
Sbjct: 260 ASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECG 318
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
+ Y AV+QG + E ID +LK L T MRLG FD G + ++ S +
Sbjct: 319 EEYATLPA-AVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHD 377
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLLKND LPL+ AK K +AV+GP A+ T+A++GNY G P ++ +
Sbjct: 378 ALARRTARESLVLLKND-GLLPLSRAKFKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 436
Query: 463 GFSGY---ANVTYKTGCDDVACKSN 484
G A V Y G D V + +
Sbjct: 437 GIRAAAPNAQVLYARGADLVEGRDD 461
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 95/317 (29%), Positives = 141/317 (44%), Gaps = 60/317 (18%)
Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
AG + + Y G D A + + + A + A++AD + + GL VE E
Sbjct: 593 AGRAYEVRLEYFEGERDAAVRLAWRQPGARPPLQEALDVARSADVVVFVGGLTGDVEGEE 652
Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
+ DR DL LP Q L+ + + PV+ V+ + + I +A+ + + A
Sbjct: 653 MKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGR-PVVAVLTTGSALAIDWAQQH--LPA 709
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
IL A YPG+ GG A+AD +FG NPGGRLP+T+Y S L D GR
Sbjct: 710 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMRGR 760
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+++ G LYPFG+GLSYTQF Y+ L +T
Sbjct: 761 TYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT--------------------------- 793
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
+ D V +N G G +VV +Y P A K++ GFQR+ ++ G
Sbjct: 794 ----IATDGSLAATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGEQ 849
Query: 745 KRIKFVFNACKSLNIVD 761
+ + F NA +L + D
Sbjct: 850 RELGFTINAKDALRLYD 866
>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
usitatus Ellin6076]
gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 850
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 183/464 (39%), Positives = 264/464 (56%), Gaps = 42/464 (9%)
Query: 32 SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
S +F+ + +G S F D L R DLV+RMTLDEKV Q+ + A +PRL
Sbjct: 4 SGIFLALAASPALIGQTTSQLPFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRL 63
Query: 92 GLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEA 151
G+P Y+WW+EALHGV+ G AT FP I A+++ +L +I + +STEA
Sbjct: 64 GIPAYDWWNEALHGVARAG----------LATVFPQAIGLAATWDATLMHRIAETISTEA 113
Query: 152 RAMYNLG--------RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
RA YN GLT+WSPNIN+ RDPRWGR ET GEDPF+ R AV +++G+Q
Sbjct: 114 RAKYNEAIRNDDHSRYRGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQ 173
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
+ H KV + KHYA V + R+ FD + + +D+ +T+L F
Sbjct: 174 GEDPHY---------YKVIATAKHYA---VHSGPESSRHQFDVKPSPRDLADTYLPAFRA 221
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
+ E A S+MC+YNRV+GIP+CA LL + +RGEW G++V+DC ++ + H +
Sbjct: 222 SIVEARADSLMCAYNRVDGIPACASTDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQ 281
Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
D+ A A +KAG DL CG Y +AV+ G + E +I++SL+ L+ +LG FD
Sbjct: 282 PDAAS-ASAVAVKAGTDLTCGNEYRALV-DAVKTGLITEPEINRSLERLFVARFKLGMFD 339
Query: 384 GSPQYVSLGK---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
P+ V ++ S + ++A EAAR+ IVLLKND TLPL S+ +K +AV+GP A
Sbjct: 340 -PPERVPFSNIPYSEVDSAGHRKIALEAARKSIVLLKND-GTLPLKSS-IKKIAVIGPAA 396
Query: 441 NATVAMIGNYAGIPCRYMSPIAG----FSGYANVTYKTGCDDVA 480
+ A++GNY G ++P+AG ++G A V Y G + A
Sbjct: 397 DDAEALLGNYNGFSSLQVTPLAGIEHQWAGKAEVRYALGANYTA 440
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/274 (32%), Positives = 131/274 (47%), Gaps = 60/274 (21%)
Query: 487 IFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEV 536
+ AA EA AD T+ GL+ S+E E + DR +L LP Q +LI A +
Sbjct: 594 LAAAIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLELPEPQEKLIE--AAI 651
Query: 537 AKG-PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPI 595
A G PV++V+ S V + FA + + A+L Y GEE G AIAD + G NP GRLP+
Sbjct: 652 ATGKPVVVVLASGSAVAMNFAAQHAS--ALLETWYNGEETGTAIADTLAGINNPSGRLPV 709
Query: 596 TWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTK 655
T+Y V LP P + GRTY+++NG LY FG+GLSY++F+Y
Sbjct: 710 TFYRS--VDQLP-------PFEEYAMKGRTYRYFNGDALYSFGFGLSYSKFQY------- 753
Query: 656 TIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIV 715
+ L+ R + T AS+ R N S +G +VV +
Sbjct: 754 ------SALKTRRAGSGTIVASRVR----------------------NASSIEGDEVVQL 785
Query: 716 YSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
Y + I+ + GFQR+ +R G ++ + F
Sbjct: 786 YVN-GSGADGDPIRSLRGFQRIHLRPGESREVHF 818
>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 902
Score = 313 bits (801), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 182/446 (40%), Positives = 255/446 (57%), Gaps = 33/446 (7%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ + R DLVSRMTL+EK Q+ + A +PRLG+ Y+WW+EALHGV+ G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG--- 91
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--------GRAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTFW 144
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH--ENATDLNSRPLKVSS 223
SPNIN+ RDPRWGR ET GEDPF+ R V +VRGLQ G +NA + R K+ +
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYR--KLDA 202
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
KH+A V + DR+HFDAR +++D+ ET+L FE VK+G +VM +YNRV G
Sbjct: 203 TAKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGE 259
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
+ A LL +R +W GY+V+DC +I + +HK +A ++E A A +K G +L+C
Sbjct: 260 SASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELEC 318
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
G+ Y+ AV QG ++E ID SL+ L T MRLG FD G + + S +
Sbjct: 319 GEEYSTLPA-AVHQGLIEEAQIDTSLQTLMTARMRLGMFDPPGQLPWSKIPASVNQSPAH 377
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
LA ARE +VLLKND LPL+ K+K +AV+GP A+ T+A++GNY G P ++ +
Sbjct: 378 DALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVL 436
Query: 462 AGFSGY---ANVTYKTGCDDVACKSN 484
G A V Y G D V + +
Sbjct: 437 QGIRAAAPNAQVLYARGADLVEGRDD 462
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 132/287 (45%), Gaps = 53/287 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + A++A+ + + GL VE E + DR DL LP Q +L+ + K
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+AD +FG NPGGRLP+T+Y
Sbjct: 688 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYTQF Y+ L +
Sbjct: 746 ---------ESEALPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYSDLRLDR---- 792
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
N + D F V +N G G +V +Y P
Sbjct: 793 ---------------------------NTVAADGSFTATVTVKNTGQRAGDEVAQLYLHP 825
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
K++ GFQRV + G + ++F NA ++L I D T
Sbjct: 826 LTPQRERAGKELRGFQRVALHPGEQRELRFPINAKEALRIYDEQRKT 872
>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 902
Score = 313 bits (801), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 182/446 (40%), Positives = 255/446 (57%), Gaps = 33/446 (7%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ + R DLVSRMTL+EK Q+ + A +PRLG+ Y+WW+EALHGV+ G
Sbjct: 35 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG--- 91
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--------GRAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 92 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTFW 144
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH--ENATDLNSRPLKVSS 223
SPNIN+ RDPRWGR ET GEDPF+ R V +VRGLQ G +NA + R K+ +
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYR--KLDA 202
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
KH+A V + DR+HFDAR +++D+ ET+L FE VK+G +VM +YNRV G
Sbjct: 203 TAKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGE 259
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
+ A LL +R +W GY+V+DC +I + +HK +A ++E A A +K G +L+C
Sbjct: 260 SASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELEC 318
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
G+ Y+ AV QG ++E ID SL+ L T MRLG FD G + + S +
Sbjct: 319 GEEYSTLPA-AVHQGLIEEAQIDTSLQTLMTARMRLGMFDPPGQLPWSKIPASVNQSPAH 377
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
LA ARE +VLLKND LPL+ K+K +AV+GP A+ T+A++GNY G P ++ +
Sbjct: 378 DALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVL 436
Query: 462 AGFSGY---ANVTYKTGCDDVACKSN 484
G A V Y G D V + +
Sbjct: 437 QGIRAAAPNAQVLYARGADLVEGRDD 462
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 131/287 (45%), Gaps = 53/287 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + A++A+ + + GL VE E + DR DL LP Q +L+ + K
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+AD +FG NPGGRLP+T+Y
Sbjct: 688 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYTQF Y+ L +
Sbjct: 746 ---------ESEALPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYSDLRLDR---- 792
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
N + D F V +N G G +V +Y P
Sbjct: 793 ---------------------------NTVAADGSFTATVTVKNTGQRAGDEVAQLYLHP 825
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
K++ GFQRV + G + + F NA ++L I D T
Sbjct: 826 LTPQRERAGKELRGFQRVALHPGEQRELSFPINAKEALRIYDEQRKT 872
>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 913
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 179/447 (40%), Positives = 257/447 (57%), Gaps = 35/447 (7%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ + R DLVSRMTL+EK Q+ + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD--VEGHENATDLNSRPLKVSS 223
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ + +NA R K+ +
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGDAYR--KLDA 204
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
KH+A V + DR+HFDA +++D+ ET+L FE VKEG +VM +YNRV G
Sbjct: 205 TAKHFA---VHSGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
+ A LL +R W GY+V+DC +I + NHK +A ++E A A + G +L+C
Sbjct: 262 SASASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVA-TREQAAALAVNNGTELEC 320
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
G+ Y+ AV++G + E D+DK+L+ L MRLG FD P + + + ++++ E
Sbjct: 321 GEEYSTLPA-AVRKGLISEADVDKALQKLMYSRMRLGMFD-PPDTLRWAQIPLSANQSPE 378
Query: 404 ---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
LA ARE +VLLKND LPL+ K+K +AV+GP A+ T+A++GNY G P ++
Sbjct: 379 HDALARRTARESLVLLKND-GVLPLSRGKIKRIAVIGPTADDTMALLGNYYGTPAAPVTV 437
Query: 461 IAGFSGY---ANVTYKTGCDDVACKSN 484
+ G A V Y G D V + +
Sbjct: 438 LQGIREAAPDAEVLYARGADLVEGRDD 464
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 141/300 (47%), Gaps = 54/300 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A +AA+ AD + + GL VE E + DR DL LP Q +L+ + K
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQGTGK- 689
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+ADV+FG NPGGRLP+T+Y
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYTQF Y+ L
Sbjct: 748 ---------ESEKLPAFDDYAMRGRTYRYFAGTALYPFGHGLSYTQFAYSDLRL------ 792
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
D SK G L L+ +N G G +VV +Y P
Sbjct: 793 ---------------DRSKLATDGSLHATLKV----------KNTGQRAGDEVVQLYLHP 827
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
+ K++ GFQR+ ++ G + + F + L + D A ++ G++ + VG
Sbjct: 828 LSPQRERARKELRGFQRIALQPGETREVSFAISPQTDLRLYDEARKAYVVDPGDYELQVG 887
>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
purpuratus]
Length = 623
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 210/610 (34%), Positives = 321/610 (52%), Gaps = 61/610 (10%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF-------AHGVPRLGLPQYEWWS 100
Q S F + SLP+ R+ DL+SR+ +D+ QL A + RL + +Y W +
Sbjct: 26 QKSQLPFWNQSLPWDQRLDDLLSRLKVDDMTYQLARGGADPNGPAPAIGRLQIGKYVWNT 85
Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--- 157
E L G + G AT+FP + +A+F+ L ++ A E RA YN
Sbjct: 86 ECLRGDAQAG----------NATAFPQALGLSAAFSRDLLFEVANATGYEVRAKYNYYLQ 135
Query: 158 -----GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
GL +SP IN+ R P WGR ET GEDP++ G A ++V GLQ
Sbjct: 136 KGDFNNHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQGN------- 188
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
+ R L ++ CKH+AAY R+ FDA+V+++D++ TF F+ C+K G S
Sbjct: 189 --HPRYLLTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFPAFKECIKAG-TYS 245
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
VMCSYN VNGIP+CA+ LLN +R EW GY+V+D ++++ H + S D
Sbjct: 246 VMCSYNSVNGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELAHNY-TTSYLDTAI 304
Query: 333 QTLKAGLDLDCGQYYT---NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
++LKAG +LD G ++ AV+ G + D+ S+ L+ +RLG FD
Sbjct: 305 KSLKAGCNLDLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRLRLGEFDPPDHNP 364
Query: 388 YVSLG-KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
YV L Q + S E+ E+A +AA + VL+KND +TLP+ + T+AVVGP AN + +
Sbjct: 365 YVKLNVDQVVESPEHQEIALKAALKSFVLVKNDGSTLPI-EGTIHTLAVVGPFANNSKLL 423
Query: 447 IGNYAGIP-CRYMSPI-AGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
G+YA P R+++ + G S A T + +GC C + + A AD ++
Sbjct: 424 FGDYAPNPDPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQQ-GVLNAVTGADVVVVC 482
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTNI 562
G + +E+E DR D+ LPG Q QL+ A A G PVIL++ +AG ++I +A ++ ++
Sbjct: 483 LGTGIELESEGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLFNAGPLNITWALSSPSV 542
Query: 563 KAILWAGYPGEEGGRAIADVVFGK---FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
+AI+ +P + G A+ ++F NPGGRLP TW P T + P+++
Sbjct: 543 QAIVECFFPAQATGVAL-RMMFQNAPGANPGGRLPSTW---------PATVAQIPPMENY 592
Query: 620 GYPGRTYKFY 629
GRTY+++
Sbjct: 593 SMDGRTYRYF 602
>gi|389794400|ref|ZP_10197553.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
gi|388432423|gb|EIL89432.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
Length = 902
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 188/498 (37%), Positives = 272/498 (54%), Gaps = 54/498 (10%)
Query: 9 LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
+ +++ L VF ++A A+ ++P +P ++ D S + R DL
Sbjct: 17 VALGMALVLPVFPSHAEGAD--AAPSAASEP-------------VYRDLSRSFHDRAADL 61
Query: 69 VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
V+ MTL+EK Q+ + A +PRLG+ Y+WW+E LHGV+ G AT FP
Sbjct: 62 VAHMTLEEKAAQMQNTAPAIPRLGVAAYDWWNEGLHGVARAGQ----------ATVFPQA 111
Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYN-------LGR-AGLTYWSPNINVARDPRWGRI 180
I A+F+ L ++ A+S EARA YN GR GLTYWSPNIN+ RDPRWGR
Sbjct: 112 IGLAATFDVPLMHEVATAISDEARAKYNEFQRKGSHGRYEGLTYWSPNINIFRDPRWGRG 171
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
ET GEDP++ R V +V GLQ N K+ + KH+A V + D
Sbjct: 172 QETYGEDPYLTERMGVAFVTGLQGD---------NPTYRKLDATAKHFA---VHSGPEAD 219
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
R+HFD +E+D+ ET+L F+ V+E D +VM +YNRVNG P+ P+LL Q +R +W
Sbjct: 220 RHHFDVHPSERDLYETYLPAFQTLVQEADVDAVMSAYNRVNGEPATGSPRLLGQILRKDW 279
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
GY+V+DC +++ + +HK + D+ E A A +K G+DLDCG Y AV G +
Sbjct: 280 GFKGYVVSDCGAVEDIYKHHKVV-DTVEAASALAVKNGVDLDCGTEYAALV-KAVHDGLI 337
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
KE++ID +L L MRLG FD + + + + S ++ LA AARE +VLLKN
Sbjct: 338 KESEIDAALTRLMQARMRLGMFDPASKVPWSDVPYSVNQSPQHDALARRAARESMVLLKN 397
Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF---SGYANVTYKTG 475
D LPL S +K +AV+GP A+ +A++GNY G P ++ + G + A V Y G
Sbjct: 398 D-GVLPL-SKDIKHIAVIGPTADDVMALVGNYHGTPADPVTILRGIREAAPQAKVVYARG 455
Query: 476 CDDVACKSNNSIFAASEA 493
D V +S+ + EA
Sbjct: 456 VDLVEGRSDPTGMPLVEA 473
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 129/289 (44%), Gaps = 54/289 (18%)
Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
+ GL VE E + DR DL LP Q +L+ + K PV+LV+ S
Sbjct: 642 VFAGGLTSDVEGEEMKVNYPGFAGGDRTDLRLPATQRKLLEALQATGK-PVVLVLTSGSA 700
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
+ + +A N ++ A+L A YPG+ GG A+ADV+FGK +P GRLP+T+Y S
Sbjct: 701 LAVDWA--NQHLPAVLLAWYPGQRGGNAVADVLFGKADPAGRLPVTFYK---------AS 749
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
L D GRTY+++ G LYPFGYGLSYT+F Y L KL H
Sbjct: 750 EKLPAFDDYRMDGRTYRYFKGEPLYPFGYGLSYTKFTYADL-----------KLDH---- 794
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
N + +D V N G G +VV +Y + K
Sbjct: 795 ----------------NKIGKNDKLHVTVKVHNAGKRAGDEVVQLYLRGVGTPHERSNKD 838
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVD-YAANTLLPAGEHTIFVG 778
+ G QR+ ++ G+ + + F + L D A + AG + + +G
Sbjct: 839 LRGIQRITLQPGQTRDVSFDVSPATDLRYYDTKKAAYAVDAGRYEVQIG 887
>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 896
Score = 308 bits (788), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 185/454 (40%), Positives = 255/454 (56%), Gaps = 42/454 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ LP+ R DLVSRMTL+EK Q+ + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 40 YLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGVARAG--- 96
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGR------AGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ L R GLT+W
Sbjct: 97 -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKRYQGLTFW 149
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ +G K+ +
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYR---------KLDATA 200
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA V + DR+HFD +E+D+ ET+L F+ V+EG ++VM +YNRVNG +
Sbjct: 201 KHYA---VHSGPEADRHHFDVHPSERDLHETYLPAFQALVQEGHVAAVMGAYNRVNGESA 257
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A + L +R +W GYIV+DC +I+ + NHK + + E A A +K G DLDCG
Sbjct: 258 SASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHGTDLDCGD 315
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC---SDENI 402
Y AV+ G + E ID SLK L T MRLG FD P V+ + S ++
Sbjct: 316 TYAALP-KAVRAGLIDEATIDTSLKRLMTTRMRLGMFD-PPAKVAWAQIPASVNQSPQHD 373
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLLKND LPL +K +AVVGP A+ ++++GNY G P ++ +
Sbjct: 374 ALARRTARESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQ 431
Query: 463 GF---SGYANVTYKTGCDDVACKSNNSIFAASEA 493
G + A V Y G D V + + + A +A
Sbjct: 432 GIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 131/287 (45%), Gaps = 53/287 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
A +AA+ A+ + + GL VE E +D R D LP Q +L+ Q +
Sbjct: 623 AVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQATGT 681
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + + +A+ + + AIL A YPG+ GG A+ DV+FG+ +PGGRLPIT+Y
Sbjct: 682 PVVAVLTTGSALAVDWAQQH--VPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPITFYK 739
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
+ LP D GRTY+++ G LYPFG+GLSYTQF Y+ L +T
Sbjct: 740 --EAERLPA-------FDDYAMRGRTYRYFTGTALYPFGHGLSYTQFAYSDLRLDRTT-- 788
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
L D + +N G G +VV +Y P
Sbjct: 789 -----------------------------LGADGTLRATLKVRNTGKRAGDEVVQLYLHP 819
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
K++ GFQR+ ++ G + + F A +L I D T
Sbjct: 820 LDPKRERAGKELRGFQRMTLQPGEQREVAFTLKAADALRIYDEQRKT 866
>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 904
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 176/444 (39%), Positives = 249/444 (56%), Gaps = 29/444 (6%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + + R DLVSRMTL+EK Q+ + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTFW 146
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ R K+ +
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDATA 206
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + DR+HFDAR +++D+ ET+L FE VK+G +VM +YNRV G +
Sbjct: 207 KHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 263
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A LL +R +W GY+V+DC +I + +HK +A ++E A A + G +L+CG+
Sbjct: 264 SASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTHGTELECGE 322
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIE 403
Y+ AV QG + E ID +L+ L T MRLG FD G + + S +
Sbjct: 323 EYSTLPA-AVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASVNQSPAHDA 381
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
LA ARE +VLLKND LPL+ A +K +AV+GP A+ T+A++GNY G P ++ + G
Sbjct: 382 LARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQG 440
Query: 464 FSGY---ANVTYKTGCDDVACKSN 484
A V Y G D V +++
Sbjct: 441 IRAAAPNAQVLYARGADLVEGRND 464
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 97/317 (30%), Positives = 143/317 (45%), Gaps = 60/317 (18%)
Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
AG S + Y G D A + + + A + A++AD + + GL VE E
Sbjct: 596 AGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEGEE 655
Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
+ DR DL LP Q +L+ + K PV+ V+ + + I +A+ + + A
Sbjct: 656 MKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK-PVVAVLTAGSALAIDWAQQH--VPA 712
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
IL A YPG+ GG A+AD +FG NPGGRLP+T+Y S L D GR
Sbjct: 713 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMHGR 763
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+++ G LYPFG+GLSYTQF Y+ L ++
Sbjct: 764 TYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRST--------------------------- 796
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
L D V +N G G +VV +Y P K++ GFQR+ ++ G+
Sbjct: 797 ----LTADGALTATVAVKNTGQRAGDEVVQLYLHPLKPQRERAGKELRGFQRLALQPGQQ 852
Query: 745 KRIKFVFNACKSLNIVD 761
+ ++F NA +L I D
Sbjct: 853 RELRFTINAKDALRIYD 869
>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 904
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 176/444 (39%), Positives = 249/444 (56%), Gaps = 29/444 (6%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + + R DLVSRMTL+EK Q+ + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 37 YLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTFW 146
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ R K+ +
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDATA 206
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + DR+HFDAR +++D+ ET+L FE VK+G +VM +YNRV G +
Sbjct: 207 KHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 263
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A LL +R +W GY+V+DC +I + +HK +A ++E A A + G +L+CG+
Sbjct: 264 SASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTHGTELECGE 322
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIE 403
Y+ AV QG + E ID +L+ L T MRLG FD G + + S +
Sbjct: 323 EYSTLPA-AVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASVNQSPAHDA 381
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
LA ARE +VLLKND LPL+ A +K +AV+GP A+ T+A++GNY G P ++ + G
Sbjct: 382 LARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQG 440
Query: 464 FSGY---ANVTYKTGCDDVACKSN 484
A V Y G D V +++
Sbjct: 441 IRAAAPNAQVLYARGADLVEGRND 464
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 143/317 (45%), Gaps = 60/317 (18%)
Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
AG S + Y G D A + + + A + A++AD + + GL VE E
Sbjct: 596 AGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEGEE 655
Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
+ DR DL LP Q +L+ + K PV+ V+ + + + +A+ + + A
Sbjct: 656 MKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK-PVVAVLTAGSALAVDWAQQH--VPA 712
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
IL A YPG+ GG A+AD +FG NPGGRLP+T+Y S L D GR
Sbjct: 713 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMHGR 763
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+++ G LYPFG+GLSYTQF Y+ L ++
Sbjct: 764 TYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRST--------------------------- 796
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
L D V +N G G +VV +Y P K++ GFQR+ ++ G+
Sbjct: 797 ----LTADGALTATVAVKNTGQRAGDEVVQLYLHPLKPQRERAGKELRGFQRLALQPGQQ 852
Query: 745 KRIKFVFNACKSLNIVD 761
+ ++F NA +L I D
Sbjct: 853 RELRFTINAKDALRIYD 869
>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
Length = 534
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 198/599 (33%), Positives = 313/599 (52%), Gaps = 101/599 (16%)
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCV---------- 265
RP ++++ CKH AAY ++ DR++F A + D E T+L F+ CV
Sbjct: 7 RP-RIAATCKHLAAYSLE----TDRFNFSADGIDRTDWEGTYLPAFDACVHAERFLLEHY 61
Query: 266 ---------KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
++ A VMCSYN ++G+P+CADP LL +R +W+ G +V+DC ++ +
Sbjct: 62 NASGGGGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWAVDNI 121
Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVL 376
NH+F+A S E+AV L++G+DLDCG + +F A + + E DID++L L+ VL
Sbjct: 122 HSNHRFVA-SYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLFRVL 180
Query: 377 MRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT-----LPLNSAKVK 431
M LG+FD + + + D E+ +LA EAA + IVLLKN N LPL+ AK K
Sbjct: 181 MDLGYFDETDEPDAKSSDDEM--EHDQLALEAALQSIVLLKNGINEDEPGPLPLSLAKHK 238
Query: 432 TVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAAS 491
+A+ GP A+ ++GNY G+P ++P+ G + K G + VA + S+
Sbjct: 239 EIALFGPLADNQTVLLGNYHGLPSTIVTPLMGLA-------KMGVE-VAFRQRASVCDFH 290
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG---PVILVIMSA 548
+ ATI++ GLD S+EAE DR L LP Q LI ++ +K PV+LV++S
Sbjct: 291 GES----ATILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLPVVLVVVSG 346
Query: 549 GGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL 608
G VD++ + +++I A++ YPG+ GG A+A V++G +NP G+L T Y Y+ + L
Sbjct: 347 GMVDLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNPSGKLVGTMYPESYLNEVSL 406
Query: 609 TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
M +RP +PGRT+++Y G +YPFGYGLSYT F+Y + T++V ++
Sbjct: 407 HDMRMRPDGK--FPGRTHRYYRGDVIYPFGYGLSYTSFRYAMEFLGGTVKVTVS------ 458
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS-DVVIVYSKPPAEIAATY 727
N GS DGS V++ +S P A
Sbjct: 459 ----------------------------------NSGSMDGSVAVLLFHSAPQAGNEQEP 484
Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
+ +IGF++++V G ++ + F+ K +N + AG HT + N + +
Sbjct: 485 FRSLIGFEKIYVSVGDSQLVS--FDVSKRMNPGE--------AGSHTFRIENESIDVEV 533
>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 904
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 177/445 (39%), Positives = 253/445 (56%), Gaps = 31/445 (6%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D++ + R DLVSRMTL+EK Q+ + A +PRL +P Y+WW+EALHGV+ G
Sbjct: 37 YLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 93
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ GLT+W
Sbjct: 94 -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 146
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSC 224
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ EG + P K+ +
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEPYRKLDAT 205
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + +R+HFDAR +++D+ ET+L FE VK+G +VM +YNRV G
Sbjct: 206 AKHFA---VHSGPEAERHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 262
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+ A LL +R +W GY+V+DC +I + +HK +A ++E A A + G +L+CG
Sbjct: 263 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTHGTELECG 321
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
+ Y+ AV QG + E ID +L+ L T MRLG FD G + + S +
Sbjct: 322 EEYSTLPA-AVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASVNQSPAHD 380
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLLKND LPL+ A +K +AV+GP A+ T+A++GNY G P ++ +
Sbjct: 381 ALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 439
Query: 463 GFSGY---ANVTYKTGCDDVACKSN 484
G A V Y G D V +++
Sbjct: 440 GIRAAAPNAQVLYARGADLVEGRND 464
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 97/317 (30%), Positives = 142/317 (44%), Gaps = 60/317 (18%)
Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
AG S + Y G D A + + + A + A++AD + + GL VE E
Sbjct: 596 AGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEGEE 655
Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
+ DR DL LP Q +L+ + K PV+ V+ + + I +A+ + + A
Sbjct: 656 MKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK-PVVAVLTAGSALAIDWAQQH--VPA 712
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
IL A YPG+ GG A+AD +FG NPGGRLP+T+Y S L D GR
Sbjct: 713 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYTMHGR 763
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+++ G LYPFG+GLSYTQF Y+ L ++
Sbjct: 764 TYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRST--------------------------- 796
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
L D V +N G G +VV +Y P K++ GFQR+ ++ G
Sbjct: 797 ----LTADGALTATVAVKNTGQRAGDEVVQLYLHPLKPQRERAGKELRGFQRLALQPGEQ 852
Query: 745 KRIKFVFNACKSLNIVD 761
+ ++F NA +L I D
Sbjct: 853 RELRFTINATDALRIYD 869
>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
Length = 349
Score = 305 bits (782), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 198/308 (64%), Gaps = 20/308 (6%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
+S+ FC+ SL R L+S +TL+EK++QL D A G+PR G+P YEWWSE+LHG++
Sbjct: 38 NSYTFCNQSLSVPTRAHSLISLLTLEEKIKQLSDNASGIPRFGIPPYEWWSESLHGIAIN 97
Query: 110 GPGTHFD-DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
GPG F + AT FP VI++ A+FN +LW IG A++ EARAM+N+G++GLT+W+PN
Sbjct: 98 GPGVSFTIGPVSAATGFPQVIISAAAFNRTLWFLIGSAIAIEARAMHNVGQSGLTFWAPN 157
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ-----------------DVEGHENA 211
+N+ RDPRWGR ETPGEDP + YA+ +V+G Q E
Sbjct: 158 VNIFRDPRWGRGQETPGEDPMLTSAYAIEFVKGFQGGNWKSGVSGSGSGRYGFGEKRMLR 217
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
D L +S+CCKH AYD++ W RY F+A VTEQD+E+T+ PF C++EG AS
Sbjct: 218 DDDGDDGLMLSACCKHLTAYDLEKWGNFSRYSFNAVVTEQDLEDTYQPPFRSCIEEGKAS 277
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
+MCSYN VNG+P+CA LL Q R EW GYIV+DCD++ + + + + S EDAV
Sbjct: 278 CLMCSYNEVNGVPACAREDLL-QKAREEWGFEGYIVSDCDAVATIFEYQNY-SKSAEDAV 335
Query: 332 AQTLKAGL 339
A LKAG+
Sbjct: 336 AIALKAGM 343
>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
Length = 890
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 180/438 (41%), Positives = 247/438 (56%), Gaps = 41/438 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D +LP+ R DLVS+MTL+EKV Q+ A + RLG+P+Y WW+E LHGV G
Sbjct: 40 YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGVGRAGI-- 97
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA-MYNLGR-------AGLTYW 165
AT FP I A +++ +I AVS EARA ++ R GLT+W
Sbjct: 98 --------ATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIYQGLTFW 149
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDPF+ G AV+Y++GLQ + R LK+ +
Sbjct: 150 TPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQGDD---------DRYLKLVATS 200
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+ V + DR+HFDAR + +D T+ F+ ++E SVMC+YNR NG+P
Sbjct: 201 KHFL---VHSGPEPDRHHFDARTSARDSLMTYTPHFKKTIQEAGVYSVMCAYNRYNGLPC 257
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
C K + +R EW GYIV+DC ++ H + + E+A A +KAG DL+CG
Sbjct: 258 CGS-KPVENLLRNEWGFKGYIVSDCWAVADFYKKGHHEVVPTVEEAAAMAVKAGTDLNCG 316
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDEN 401
Y +AV+QG V E +ID +K L +RLG FD P+ Y ++ + S E+
Sbjct: 317 NSYPALV-DAVKQGLVSEEEIDVLVKRLMEARLRLGMFD-PPEMVPYTNIPYSVVDSKEH 374
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
ELA AAR+ +VLLKND NTLPL+ VK VAV+GP+AN ++ NY G P ++P+
Sbjct: 375 RELALIAARKSMVLLKNDNNTLPLDK-NVKNVAVIGPNANNLDVLLANYNGYPSNPVTPL 433
Query: 462 AGFSGY---ANVTYKTGC 476
G ANV Y GC
Sbjct: 434 DGIRQKLPNANVQYALGC 451
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 94/299 (31%), Positives = 144/299 (48%), Gaps = 56/299 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + A +D ++ GL ++E E + DR D+ LP QT L+ + + K
Sbjct: 610 AIQIAAASDVVLMFMGLSPNLEGEEMPVNVPGFSGGDRVDIKLPQIQTDLVKAIMSLGK- 668
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+LV+++ + I + N + AIL A YPG+ GG AIADV+FG +NP GRLP+T+Y
Sbjct: 669 PVVLVLLNGSALAINWEAEN--VPAILEAWYPGQAGGTAIADVLFGDYNPAGRLPVTFYK 726
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
+T +P P + GRTY+++ G L+PFGYGLSYT FKY+ L
Sbjct: 727 S-------VTQLP--PFEDYSMDGRTYQYFKGEALFPFGYGLSYTSFKYDNL-------- 769
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
V+ + L VD N G+ DG +VV +Y
Sbjct: 770 ------------------------VVPDKLEAGKEVTVHVDVTNTGNRDGDEVVQLYVSH 805
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
P ++ + I+ + GF R+ ++AG K + F + L + ++PAG + VG
Sbjct: 806 P-DVESAPIRSLQGFDRIALKAGETKTVSFTLKP-EQLAVYQPQNGLVVPAGNLKLSVG 862
>gi|374313710|ref|YP_005060140.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358755720|gb|AEU39110.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 883
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 169/424 (39%), Positives = 245/424 (57%), Gaps = 37/424 (8%)
Query: 43 SKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEA 102
S G + + D++LP R DLV R+TLDEK QL A G+PRLG+P Y++WSE
Sbjct: 26 SPAGTRTPLLPYQDTTLPAEQRAADLVGRLTLDEKAAQLVTSAPGIPRLGVPAYDFWSEG 85
Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-- 160
LHG++ G AT FP + A+F+E L +IG+ +STEARA YN A
Sbjct: 86 LHGIARSG----------YATLFPQAVGMAATFDEPLLHQIGEVISTEARAKYNDAVAHD 135
Query: 161 ------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
GLT WSPNIN+ RDPRWGR ET GEDPF+ R +V GLQ D
Sbjct: 136 LRSIFYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARLGTAFVEGLQ-------GDDP 188
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
N + KH+A V + +R+ F+A + D+ +T+L F + EG A S+M
Sbjct: 189 NY--YRAIGTPKHFA---VHSGPESERHRFNADPSPHDLWDTYLPAFRATIVEGKAGSIM 243
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV--DNHKFLADSKEDAVA 332
C+YN + G P+CA LL++ +R +W G++ +DC +I D H + D+ E A
Sbjct: 244 CAYNAIEGKPACASDLLLDEVLRKDWAFKGFVTSDCGAIDNFFEKDGHHYSKDA-EQASV 302
Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVS 390
++AG D +CG Y N +AV++G ++E+++D L+ L+ +LG FD Q Y S
Sbjct: 303 DGIRAGTDTNCGGTYRNL-ASAVRKGMIQESELDVPLRRLFLARFKLGLFDPPSQVKYAS 361
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
+ + S + ELA +AARE +VLLKN+ +TLPL+ A+VKT+AV+GP+A++ +++ GNY
Sbjct: 362 MPITENMSSSHTELALQAAREAVVLLKNEHHTLPLD-ARVKTIAVIGPNASSLISLEGNY 420
Query: 451 AGIP 454
IP
Sbjct: 421 NAIP 424
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 90/301 (29%), Positives = 142/301 (47%), Gaps = 55/301 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
A EA K ADA + GL +E E +D R DL LP Q QL+ + A+ +
Sbjct: 606 AMEAVKQADAVVAFVGLSPELEGEEMDVHIPGFSGGDRTDLVLPAAQQQLL-EAAKASGK 664
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
P+++V+++ + + +A+ + + AIL A YPG+ G +AIA+ + GK NP GRLP+T+Y
Sbjct: 665 PLVVVLLNGSALAVNWAQEHAD--AILEAWYPGQAGAQAIAETLSGKNNPSGRLPVTFYR 722
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
V LP P RTY+++ G LY FGYGLSY+ F Y+
Sbjct: 723 S--VNDLP-------PFTDYAMANRTYRYFKGKPLYEFGYGLSYSTFSYS---------- 763
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+ SK R L D + D +N + G +V +Y P
Sbjct: 764 -------------NAHLSKER--------LDAGDTLRVEADVKNTSTLAGDEVAELYLTP 802
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
P + ++ + GF+ V + G++K + F + + L+ VD + AG +++ VG
Sbjct: 803 P-QNGVYPLRSLEGFEHVHLLPGQSKHVSFTLDP-RQLSEVDEKGIRAVRAGVYSVTVGG 860
Query: 780 G 780
G
Sbjct: 861 G 861
>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 876
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 177/445 (39%), Positives = 252/445 (56%), Gaps = 42/445 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ P+ R DLV+RMTL+EK Q+ + A +PRL +P+Y+WW+EALHGV+ G
Sbjct: 20 YLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 76
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------GLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ L R GLT+W
Sbjct: 77 -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKRYQGLTFW 129
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ +G K+ +
Sbjct: 130 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYR---------KLDATA 180
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + DR+HFD +E+D+ ET+L F+ V+EG ++VM +YNRVNG +
Sbjct: 181 KHFA---VHSGPEADRHHFDVHPSERDLHETYLPAFQALVQEGKVAAVMGAYNRVNGESA 237
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A + L +R +W GYIV+DC +I+ + NHK + + E A A +K G DLDCG
Sbjct: 238 SASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHGTDLDCGD 295
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE-- 403
Y AV+ G + E ID +LK L T MRLG FD P V + ++++ +
Sbjct: 296 TYAALPA-AVRAGLIDEATIDTALKRLMTTRMRLGMFD-PPAKVPWAQIPASANQSPQHD 353
Query: 404 -LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLLKND LPL +K +AV+GP A+ ++++GNY G P ++ +
Sbjct: 354 ALARRTARESLVLLKND-GVLPLKPT-LKRIAVIGPTADDPMSLLGNYYGTPAAPVTILQ 411
Query: 463 GF---SGYANVTYKTGCDDVACKSN 484
G + A V Y G D V + +
Sbjct: 412 GIRDAAPQAQVIYARGSDLVEGRED 436
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 132/282 (46%), Gaps = 53/282 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
A +AA+ A+ + + GL VE E +D R D LP Q +L+ Q +
Sbjct: 603 AVDAARDAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQATGT 661
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+ DV+FG+ +PGGRLP+T+Y
Sbjct: 662 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPVTFYK 719
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
+ LP D GRTY+++ G LYPFG+GLSYTQF Y+ L +T
Sbjct: 720 --EAERLPA-------FDDYAMRGRTYRYFQGKPLYPFGHGLSYTQFAYSDLRLDRTT-- 768
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+ D V +N G G +VV +Y P
Sbjct: 769 -----------------------------VAADGTLTATVTLKNTGQRAGDEVVQLYLHP 799
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
+K++ G QR+ ++ G ++++F A +L I D
Sbjct: 800 LKPQRERALKELHGLQRITLQPGEQRQLRFTIKAQDALRIYD 841
>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.G.57.14]
gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
Y.N.15.51]
gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.G.57.14]
gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
Y.N.15.51]
Length = 754
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 221/700 (31%), Positives = 347/700 (49%), Gaps = 120/700 (17%)
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T+FP I +++N L I + ++ R + G+ SP ++V +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
ET GEDP++V + Y+ GLQ +N ++ + KH+AA+ + + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG----DN---------QLVATAKHFAAHGFPEGGRNI 201
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
+ H V +++ ETFL PFE+ VK G S+M +Y+ ++GIP +P+LL +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
W G +V+D D I+ + H+ +A +K +A L++G+D+ DC Y NA
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHR-VASNKMEAAILALESGVDIEFPTIDC---YGEPLVNA 313
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
+++G V E+ ID++++ + + RLG D + + + ++ ELA + ARE IV
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIV 373
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
LLKN+ N LPL S V +AV+GP+AN M+G+Y +GI + I
Sbjct: 374 LLKNENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKK 432
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
G + V Y GCD +A +S A E A+ AD I + +GL LS
Sbjct: 433 VGESKVLYAKGCD-IASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFK 491
Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
V E DR L LPG Q +L+ ++ + K P+ILV+++ G + + +KA+
Sbjct: 492 KYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN--GRPLVLSSIINYVKAV 548
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
+ A +PGEEGG AIADV+FG +NPGGRLPIT+ P+ + +PL R S
Sbjct: 549 IEAWFPGEEGGNAIADVIFGDYNPGGRLPITF---------PMDTGQIPLYYNRKPSSF- 598
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKT 679
R Y L+ FGYGLSYTQF+Y+ L T K I N N
Sbjct: 599 ---RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSN----------------- 638
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+D +NVG +G DVV +Y A +K++ GF ++ +
Sbjct: 639 ---------------IAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHL 683
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ G +R+KF+ ++L D ++ GE+ + +GN
Sbjct: 684 KPGEKRRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722
>gi|297736784|emb|CBI25985.3| unnamed protein product [Vitis vinifera]
Length = 241
Score = 302 bits (773), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 142/214 (66%), Positives = 168/214 (78%)
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDPF V YAV+YVRGLQDVEG EN TDLNSRPLKVSS KH+AAYD+DNW VDR HF
Sbjct: 9 GEDPFTVSVYAVSYVRGLQDVEGTENTTDLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHF 68
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
+ARV+EQDM ETFLRPFE CV+EGD S VMCS+N +NGIP CADP+L T+R EW+LHG
Sbjct: 69 NARVSEQDMAETFLRPFEACVREGDVSGVMCSFNNINGIPPCADPRLFKGTIRDEWNLHG 128
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
YIV+DC SI+ +V++ KFL + E+AVA LKAGLDL+CG YY + +AV G+V + D
Sbjct: 129 YIVSDCWSIETIVEDQKFLDVTGEEAVALNLKAGLDLECGHYYNDSPASAVMAGRVGQHD 188
Query: 365 IDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICS 398
+D+SL LY VLMRLGFFDG P SLGK DI +
Sbjct: 189 LDQSLSNLYVVLMRLGFFDGIPALASLGKDDISA 222
>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 850
Score = 301 bits (772), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 174/427 (40%), Positives = 246/427 (57%), Gaps = 31/427 (7%)
Query: 72 MTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILT 131
MTL+EK Q+ + A +PRLG+P Y+WW+EALHGV+ G GAT FP I
Sbjct: 1 MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GATVFPQAIGM 50
Query: 132 TASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYWSPNINVARDPRWGRITET 183
A+F+ L ++ A+S EARA ++ GLT+WSPNIN+ RDPRWGR ET
Sbjct: 51 AATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRWGRGQET 110
Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSCCKHYAAYDVDNWKGVDRY 242
GEDPF+ R V +V+GLQ EG + + P K+ + KH+A V + DR+
Sbjct: 111 YGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGEPYRKLDATAKHFA---VHSGPEADRH 166
Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
HFDAR +++D+ ET+L FE VK+G +VM +YNRV G + A LL +R +W
Sbjct: 167 HFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGF 226
Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
GY+V+DC +I + +HK +A ++E A A +K G +L+CG+ Y+ AV+QG + E
Sbjct: 227 KGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECGEEYSTLPA-AVRQGLIDE 284
Query: 363 TDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID +L L T MRLG FD G + ++ S + LA ARE +VLLKND
Sbjct: 285 AQIDTALTTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHDALARRTARESLVLLKND- 343
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGCD 477
LPL+ AK+K +AV+GP A+ T+A++GNY G P ++ + G A V Y G D
Sbjct: 344 GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGAD 403
Query: 478 DVACKSN 484
V + +
Sbjct: 404 LVEGRDD 410
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 129/282 (45%), Gaps = 53/282 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + A++AD + + GL VE E + DR DL LP Q L+ + K
Sbjct: 577 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 635
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+AD +FG NPGGRLP+T+Y
Sbjct: 636 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 693
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYTQF Y+ L +T
Sbjct: 694 ---------ESETLPAFDDYAMRGRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT-- 742
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+ D V +N G G +VV +Y P
Sbjct: 743 -----------------------------IAADGSLTATVTVKNTGQRAGDEVVQLYLHP 773
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
K++ GFQR+ ++ G + + F +A +L I D
Sbjct: 774 LTPQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRIYD 815
>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
756C]
Length = 896
Score = 301 bits (771), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 180/454 (39%), Positives = 254/454 (55%), Gaps = 42/454 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D + P R DLVSRMTL+EK Q+ + A +PRL +P+Y+WW+EALHGV+ G
Sbjct: 40 YLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 96
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------GLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ L R GLT+W
Sbjct: 97 -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEHKRYQGLTFW 149
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ +G K+ +
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYR---------KLDATA 200
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA V + DR+HFD +E+D+ ET+L F+ V+EG ++VM +YNRVNG +
Sbjct: 201 KHYA---VHSGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESA 257
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A + L +R +W GYIV+DC +I+ + NHK + + E A A +K G DLDCG
Sbjct: 258 SASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHGTDLDCGD 315
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE-- 403
Y AV+ G + E ID+SL L +RLG FD P V + ++++ +
Sbjct: 316 TYAALPA-AVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHD 373
Query: 404 -LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLLKND LPL +K +AVVGP A+ ++++GNY G P ++ +
Sbjct: 374 ALARRTARESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQ 431
Query: 463 GF---SGYANVTYKTGCDDVACKSNNSIFAASEA 493
G + A V Y G D V + + + A +A
Sbjct: 432 GIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 133/282 (47%), Gaps = 53/282 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
A +AA+ AD + + GL VE E +D R D LP Q +L+ Q +
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQATGT 681
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+ DV+FG+ +PGGRLPIT+Y
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
D + LP D GRTY++++G LYPFG+GL+YTQF Y+ L +T
Sbjct: 740 ED--ERLPA-------FDDYAMRGRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTT-- 788
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+ D V +N G G +VV +Y P
Sbjct: 789 -----------------------------VAADGTLRATVWVKNTGQRAGDEVVQLYLHP 819
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
K++ GFQR+ ++ G ++ + F ++L I D
Sbjct: 820 LNPQRERARKELRGFQRITLQPGEHREVSFTITPREALRIYD 861
>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
306]
Length = 886
Score = 301 bits (770), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 186/462 (40%), Positives = 251/462 (54%), Gaps = 43/462 (9%)
Query: 32 SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSI---RVKDLVSRMTLDEKVQQLGDFAHGV 88
S VFV LGL + + P R LV++M+ +EKV Q + A +
Sbjct: 2 SSVFVSRLAMAVGLGLTLPCLALATPAKPAGSPEQRAAALVAQMSREEKVAQAMNDAPAI 61
Query: 89 PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
PRLG+P YEWWSE LHG++ G AT FP I AS+N SL +++G VS
Sbjct: 62 PRLGIPAYEWWSEGLHGIARNG----------YATVFPQSIGLAASWNTSLMQQVGTVVS 111
Query: 149 TEARAMYNLGR---------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYV 199
TEARA +N AGLT WSPNIN+ RDPRWGR ET GEDPF+ G+ AV ++
Sbjct: 112 TEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFI 171
Query: 200 RGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLR 259
RGLQ DLN P +++ KH A V + R+ FD V+ D+E T+
Sbjct: 172 RGLQ-------GEDLN-HPRTIATP-KHIA---VHSGPEPGRHGFDVDVSPHDVEATYTP 219
Query: 260 PFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN 319
F + EG A SVMC+YN ++G P CA LLN VRG+W G++V+DCD++ M
Sbjct: 220 AFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQF 279
Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
H F D+ + A LKAG DL+CG Y G A+ +G+V E +D+SL L+ RL
Sbjct: 280 HYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTAIARGEVDEALLDQSLVRLFAARYRL 337
Query: 380 GFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG 437
G + + Y LG +D+ + + LA +AA E IVLLKND NTLPL + +AV+G
Sbjct: 338 GELEAPRKDPYARLGAKDVDNAAHRALALQAAAESIVLLKNDANTLPLRAG--TRLAVIG 395
Query: 438 PHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYKTGC 476
P+A+A A+ NY G ++P+ G G V+Y G
Sbjct: 396 PNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 143/286 (50%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 681
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+ + + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 682 WAKMHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 730
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S GRTY+++ G L+PFGYGLSYT+F Y+
Sbjct: 731 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 765
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
P + L+ + + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 766 ------APQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG++T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 863
>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
NCPPB 4381]
Length = 886
Score = 301 bits (770), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 177/427 (41%), Positives = 242/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------HAT 86
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N +L +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 87 VFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 146
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DLN P +++ KH A V
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHLA---VH 194
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ D+E T+ F + +G A SVMC+YN ++G P+CA LLN
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACAADWLLNG 254
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 312
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+++G V E +D+SL L+ RLG + + Y LG +D+ + + LA +AA E
Sbjct: 313 IERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPLN+ +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 373 IVLLKNDANTLPLNAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430
Query: 470 VTYKTGC 476
V Y G
Sbjct: 431 VRYAQGA 437
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 91/294 (30%), Positives = 146/294 (49%), Gaps = 55/294 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
+DA + GL VE E L DR D+ LP Q L+ + A+ + P+++V+M
Sbjct: 615 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 673
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
S V + +A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y
Sbjct: 674 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR------- 724
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
++ L S GRTY+++ G L+PFGYGLSYT+F Y+ + T
Sbjct: 725 --STKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSSTT--------- 773
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
L+ + + +N G+ G +V VY + P + +
Sbjct: 774 ----------------------LQAGNPLQVTTTVRNTGTHAGDEVAQVYLQYP-DRPQS 810
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
++ ++GFQRV + AG + + F +A ++L+ VD + + AG +T+FVG G
Sbjct: 811 PLRSLVGFQRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 863
>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
NCPPB 702]
Length = 886
Score = 300 bits (769), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 177/427 (41%), Positives = 241/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV+ M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------HAT 86
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N +L +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 87 VFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 146
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DLN P +++ KH A V
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHLA---VH 194
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ D+E T+ F + +G A SVMC+YN ++G P+CA LLN
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACAADWLLNG 254
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 312
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+++G V E +D+SL L+ RLG + + Y LG +D+ + + LA +AA E
Sbjct: 313 IERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPLN+ +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 373 IVLLKNDANTLPLNAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430
Query: 470 VTYKTGC 476
V Y G
Sbjct: 431 VRYAQGA 437
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 91/294 (30%), Positives = 146/294 (49%), Gaps = 55/294 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
+DA + GL VE E L DR D+ LP Q L+ + A+ + P+++V+M
Sbjct: 615 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 673
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
S V + +A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y
Sbjct: 674 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR------- 724
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
++ L S GRTY+++ G L+PFGYGLSYT+F Y+
Sbjct: 725 --STKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD----------------- 765
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
P + L+ + + +N G+ G +V VY + P + +
Sbjct: 766 --------------APQLSTTALQAGNPLQVTTTVRNTGTRAGDEVAQVYLQYP-DRPQS 810
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
++ ++GFQRV + AG + + F +A ++L+ VD + + AG +T+FVG G
Sbjct: 811 PLRSLVGFQRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 863
>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
HVE10/4]
Length = 754
Score = 300 bits (769), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 220/700 (31%), Positives = 347/700 (49%), Gaps = 120/700 (17%)
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T+FP I +++N L I + ++ R + G+ SP ++V +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
ET GEDP++V + Y+ GLQ +N ++ + KH+AA+ + + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG----DN---------QLVATAKHFAAHGFPEGGRNI 201
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
+ H V +++ ETFL PFE+ VK G S+M +Y+ ++GIP +P+LL +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
W G +V+D D I+ + H+ +A +K +A L++G+D+ DC Y+ NA
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHR-VASNKMEAAILALESGVDIEFPTIDC---YSEPLVNA 313
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
+ +G V E+ ID++++ + + RLG D + + + ++ ELA + ARE IV
Sbjct: 314 LTEGLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIV 373
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
LLKN+ N LPL S V +AV+GP+AN M+G+Y +GI + +
Sbjct: 374 LLKNENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKK 432
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
G + V Y GCD +A +S A E A+ AD I + +GL LS
Sbjct: 433 VGESKVLYAKGCD-IASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFK 491
Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
V E DR L LPG Q +L+ ++ + K P+ILV+++ G + + +KA+
Sbjct: 492 KYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN--GRPLVLSPIINYVKAV 548
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
+ A +PGEEGG AIADV+FG +NPGGRLPIT+ P+ + +PL R S
Sbjct: 549 IEAWFPGEEGGNAIADVIFGDYNPGGRLPITF---------PMDTGQIPLYYNRKPSSF- 598
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKT 679
R Y L+ FGYGLSYTQF+Y+ L T K I N N
Sbjct: 599 ---RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSN----------------- 638
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+D +NVG +G DVV +Y A +K++ GF ++ +
Sbjct: 639 ---------------IAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHL 683
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ G +R+KF+ ++L D ++ GE+ + +GN
Sbjct: 684 KPGEKRRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722
>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 896
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 179/454 (39%), Positives = 253/454 (55%), Gaps = 42/454 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D + P R DLVSRMTL+EK Q+ + A +PRL +P+Y+WW+EALHGV+ G
Sbjct: 40 YLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 96
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ A GLT+W
Sbjct: 97 -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKRYQGLTFW 149
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ +G K+ +
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYR---------KLDATA 200
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA V + DR+HFD +E+D+ ET+L F+ V+EG ++VM +YNRVNG +
Sbjct: 201 KHYA---VHSGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESA 257
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A + L +R +W GYIV+DC +I+ + NHK + + E A A +K G DLDCG
Sbjct: 258 SASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHGTDLDCGD 315
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE-- 403
Y AV+ G + E ID+SL L +RLG FD P V + ++++ +
Sbjct: 316 TYAALPA-AVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHD 373
Query: 404 -LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLLKND LPL +K +AVVGP A+ ++++GNY G P ++ +
Sbjct: 374 ALARRTARESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQ 431
Query: 463 GF---SGYANVTYKTGCDDVACKSNNSIFAASEA 493
G + A V Y G D V + + + A +A
Sbjct: 432 GIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 133/282 (47%), Gaps = 53/282 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
A +AA+ AD + + GL VE E +D R D LP Q +L+ Q +
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQATGT 681
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+ DV+FG+ +PGGRLPIT+Y
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
D + LP D GRTY++++G LYPFG+GL+YTQF Y+ L +T
Sbjct: 740 ED--ERLPA-------FDDYAMRGRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTT-- 788
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+ D V +N G G +VV +Y P
Sbjct: 789 -----------------------------VAADGTLRATVSVKNTGQRAGDEVVQLYLHP 819
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
K++ GFQR+ ++ G ++ + F ++L I D
Sbjct: 820 LNPQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYD 861
>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
str. 85-10]
Length = 889
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 177/427 (41%), Positives = 242/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 40 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 90 VFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 149
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DLN P +++ KH A V
Sbjct: 150 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 197
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + EG A SVMC+YN ++G P+CA LLN
Sbjct: 198 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNG 257
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 315
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+ +G+V E +D+SL L+ RLG + + Y LG +D+ + + LA +AA E
Sbjct: 316 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 375
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPL + +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 376 IVLLKNDANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 433
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 434 VSYAQGA 440
Score = 129 bits (324), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 144/286 (50%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+ FGYGLSYT+F Y+ Q++ LQ +L T+
Sbjct: 734 AYVSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAYD------APQLSTTTLQAGSSLQVTT 787
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+N G+ G +V VY + P + + ++ ++GF
Sbjct: 788 -------------------------TVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 821
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG +T+FVG G
Sbjct: 822 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866
>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
91-118]
Length = 874
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 177/427 (41%), Positives = 242/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 25 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 74
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 75 VFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 134
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DLN P +++ KH A V
Sbjct: 135 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 182
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + EG A SVMC+YN ++G P+CA LLN
Sbjct: 183 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNG 242
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 243 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 300
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+ +G+V E +D+SL L+ RLG + + Y LG +D+ + + LA +AA E
Sbjct: 301 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 360
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPL + +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 361 IVLLKNDANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 418
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 419 VSYAQGA 425
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 144/286 (50%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 611 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 669
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 670 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 718
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+ FGYGLSYT+F Y+ Q++ LQ +L T+
Sbjct: 719 AYVSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAYD------APQLSTTTLQAGSSLQVTT 772
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+N G+ G +V VY + P + + ++ ++GF
Sbjct: 773 -------------------------TVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 806
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG +T+FVG G
Sbjct: 807 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 851
>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB1386]
Length = 886
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 180/427 (42%), Positives = 242/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N SL +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 87 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DLN P +++ KH A V
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 194
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ D+E T+ F + EG A SVMC+YN ++G P CA LLN
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNG 254
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD+I M H F D+ +VA LKAG DL+CG Y G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAIDDMTQFHYFRPDNAGSSVA-ALKAGHDLNCGHAYREL-GTA 312
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+ +G+V E +D+SL L+ RLG + + Y LG +D+ + + LA +AA E
Sbjct: 313 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNVAHRALALQAAAES 372
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPL + +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 373 IVLLKNDANTLPLRAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 431 VSYAQGA 437
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 92/286 (32%), Positives = 145/286 (50%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 681
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 682 WAKTHAD--AIMAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 730
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+PFGYGLSYT+F Y+ + T N LQ
Sbjct: 731 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSTTTLQAGNPLQ--------- 781
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
++ +R N G+ G +V VY + P + + ++ ++GF
Sbjct: 782 ----------VIATVR------------NTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG++T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 863
>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
L.D.8.5]
Length = 754
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 220/700 (31%), Positives = 346/700 (49%), Gaps = 120/700 (17%)
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T+FP I +++N L I + ++ R + G+ SP ++V +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
ET GEDP++V + Y+ GLQ +N ++ + KH+AA+ + + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG----DN---------QLVATAKHFAAHGFPEGGRNI 201
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
+ H V +++ ETFL PFE+ VK G S+M +Y+ ++GIP +P+LL +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
W G +V+D D I+ + H+ +A +K +A L++G+D+ DC Y NA
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHR-VASNKMEAAILALESGVDIEFPTIDC---YGEPLVNA 313
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
+++G V E+ ID++++ + + RLG D + + + ++ ELA + ARE IV
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIV 373
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
LLKN+ N LPL S V +AV+GP+AN M+G+Y +GI + I
Sbjct: 374 LLKNENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKK 432
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
G + V Y GCD +A +S A E A+ AD I + +GL LS
Sbjct: 433 VGESKVLYAKGCD-IASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFK 491
Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
V E DR L LPG Q +L+ ++ + K P+ILV+++ G + + +KA+
Sbjct: 492 KYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN--GRPLVLSSIINYVKAV 548
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
+ A +PGEEGG AIADV+FG +NP GRLPIT+ P+ + +PL R S
Sbjct: 549 IEAWFPGEEGGNAIADVIFGDYNPSGRLPITF---------PMDTGQIPLYYNRKPSSF- 598
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKT 679
R Y L+ FGYGLSYTQF+Y+ L T K I N N
Sbjct: 599 ---RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSN----------------- 638
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+D +NVG +G DVV +Y A +K++ GF ++ +
Sbjct: 639 ---------------IAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHL 683
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ G +R+KF+ ++L D ++ GE+ + +GN
Sbjct: 684 KPGEKRRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722
>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 889
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 176/427 (41%), Positives = 245/427 (57%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R DLV+ M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 40 RAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGR--------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N GR AGLT WSPNIN+ RD
Sbjct: 90 VFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGNDHKRYAGLTIWSPNINIFRD 149
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++ GLQ DL+ P +++ KH A V
Sbjct: 150 PRWGRGMETYGEDPFLTGQMAVGFIHGLQ-------GEDLD-HPRTIATP-KHLA---VH 197
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + EG A +VMC+YN ++G P+CA L+N
Sbjct: 198 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLING 257
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 315
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+ +G+V E +D+SL L+ RLG + + Y LG +D+ + ++ LA +AA E
Sbjct: 316 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALALQAAAES 375
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKN+ NTLPLN+ +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 376 IVLLKNNANTLPLNAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 433
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 434 VSYAQGA 440
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 141/286 (49%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+PFGYGLSYT+F Y+ + T
Sbjct: 734 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSSTA----------------- 776
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
++ + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 777 --------------VQAGSTLQVTTTVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 821
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG +T+FVG G
Sbjct: 822 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866
>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 889
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 177/427 (41%), Positives = 242/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 40 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 90 VFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 149
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DLN P +++ KH A V
Sbjct: 150 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 197
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + EG A SVMC+YN ++G P+CA LLN
Sbjct: 198 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNG 257
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 315
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+ +G+V E +D+SL L+ RLG + + Y LG +D+ + + LA +AA E
Sbjct: 316 IARGEVDEALLDQSLVRLFATRYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 375
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPL + +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 376 IVLLKNDANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 433
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 434 VSYAQGA 440
Score = 129 bits (323), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 144/286 (50%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+ FGYGLSYT+F Y+ Q++ LQ +L T+
Sbjct: 734 AYVSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAYD------APQLSTTTLQAGSSLQVTT 787
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+N G+ G +V VY + P + + ++ ++GF
Sbjct: 788 -------------------------TVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 821
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG +T+FVG G
Sbjct: 822 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866
>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 896
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 179/454 (39%), Positives = 253/454 (55%), Gaps = 42/454 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D + P R DLVSRMTL+EK Q+ + A +PRL +P+Y+WW+EALHGV+ G
Sbjct: 40 YLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 96
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
GAT FP I A+F+ L ++ A+S EARA ++ A GLT+W
Sbjct: 97 -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKRYQGLTFW 149
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDPF+ R V +V+GLQ +G K+ +
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYR---------KLDATA 200
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA V + DR+HFD +E+D+ ET+L F+ V+EG ++VM +YNRVNG +
Sbjct: 201 KHYA---VHSGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESA 257
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A + L +R +W GYIV+DC +I+ + NHK + + E A A +K G DLDCG
Sbjct: 258 SASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHGTDLDCGD 315
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE-- 403
Y AV+ G + E ID+SL L +RLG FD P V + ++++ +
Sbjct: 316 TYAALPA-AVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQIPASANQSPQHD 373
Query: 404 -LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLLKND LPL +K +AVVGP A+ ++++GNY G P ++ +
Sbjct: 374 ALARRTARESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQ 431
Query: 463 GF---SGYANVTYKTGCDDVACKSNNSIFAASEA 493
G + A V Y G D V + + + A +A
Sbjct: 432 GIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 133/282 (47%), Gaps = 53/282 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
A +AA+ AD + + GL VE E +D R D LP Q +L+ Q +
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQATGT 681
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+ V+ + + I +A+ + + AIL A YPG+ GG A+ DV+FG+ +PGGRLPIT+Y
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
D + LP D GRTY++++G LYPFG+GL+YTQF Y+ L +T
Sbjct: 740 ED--ERLPA-------FDDYAMRGRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTT-- 788
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+ D V +N G G +VV +Y P
Sbjct: 789 -----------------------------VAADGTLRATVSVKNTGQRAGDEVVQLYLHP 819
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
K++ GFQR+ ++ G ++ + F ++L I D
Sbjct: 820 LNPQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYD 861
>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
L.S.2.15]
Length = 754
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 220/700 (31%), Positives = 346/700 (49%), Gaps = 120/700 (17%)
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T+FP I +++N L I + ++ R + G+ SP ++V +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
ET GEDP++V + Y+ GLQ +N ++ + KH+AA+ + + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG----DN---------QLVATAKHFAAHGFPEGGRNI 201
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
+ H V +++ ETFL PFE+ VK G S+M +Y+ ++GIP +P+LL +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
W G +V+D D I+ + H+ +A +K +A L++G+D+ DC Y NA
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHR-VASNKMEAAILALESGVDIEFPTIDC---YGEPLVNA 313
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
+++G V E+ ID++++ + + RLG D + + + ++ ELA + ARE IV
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIV 373
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
LLKN+ N LPL S V +AV+GP+AN M+G+Y +GI + I
Sbjct: 374 LLKNENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKK 432
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
G + V Y GCD +A +S A E A+ AD I + +GL LS
Sbjct: 433 VGESKVLYAKGCD-IASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSKEEFK 491
Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
V E DR L LPG Q +L+ ++ + K P+ILV+++ G + + +KA+
Sbjct: 492 KYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN--GRPLVLSSIINYVKAV 548
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
+ A +PGEEGG AIADV+FG +NP GRLPIT+ P+ + +PL R S
Sbjct: 549 IEAWFPGEEGGNAIADVIFGDYNPSGRLPITF---------PMDTGQIPLYYNRKPSSF- 598
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKT 679
R Y L+ FGYGLSYTQF+Y+ L T K I N N
Sbjct: 599 ---RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSN----------------- 638
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+D +NVG +G DVV +Y A +K++ GF ++ +
Sbjct: 639 ---------------IAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHL 683
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ G +R+KF+ ++L D ++ GE+ + +GN
Sbjct: 684 KPGEKRRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722
>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 889
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 176/427 (41%), Positives = 245/427 (57%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R DLV+ M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 40 RAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGR--------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N GR AGLT WSPNIN+ RD
Sbjct: 90 VFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFRD 149
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++ GLQ DL+ P +++ KH A V
Sbjct: 150 PRWGRGMETYGEDPFLTGQMAVGFIHGLQ-------GDDLD-HPRTIATP-KHLA---VH 197
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + EG A +VMC+YN ++G P+CA L+N
Sbjct: 198 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLING 257
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 315
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+ +G+V E +D+SL L+ RLG + + Y LG +D+ + ++ LA +AA E
Sbjct: 316 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALALQAAAES 375
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKN+ NTLPLN+ +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 376 IVLLKNNANTLPLNAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 433
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 434 VSYAQGA 440
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 141/286 (49%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+PFGYGLSYT+F Y+ + T
Sbjct: 734 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSSTA----------------- 776
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
++ + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 777 --------------VQAGSTLQVTTTVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 821
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG +T+FVG G
Sbjct: 822 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866
>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
REY15A]
Length = 754
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 220/700 (31%), Positives = 347/700 (49%), Gaps = 120/700 (17%)
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T+FP I +++N L I + ++AR + G+ SP ++V +DPRWGR
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQARLV------GVNQCLSPVLDVCKDPRWGRC 154
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
ET GEDP++V + Y+ GLQ +N ++ + KH+AA+ + + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG----DN---------QLVATAKHFAAHGFPEGGRNI 201
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
+ H V +++ ETFL PFE+ VK G S+M +Y+ ++GIP +P+LL +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
W G +V+D D I+ + H+ +A +K +A L++G+D+ DC Y+ NA
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHR-VASNKMEAAILALESGVDIEFPTIDC---YSEPLVNA 313
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
+ +G V E+ ID++++ + + RLG D + + + ++ ELA + ARE IV
Sbjct: 314 LTEGLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIV 373
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
LLKN+ N LPL S V +AV+GP+AN M+G+Y +GI + +
Sbjct: 374 LLKNENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKK 432
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
G + V Y GCD +A +S A E A+ AD I + +GL LS
Sbjct: 433 VGESKVLYAKGCD-IASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFK 491
Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
V E DR L LPG Q +L+ ++ + K P+ILV+++ G + + +KA+
Sbjct: 492 KYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN--GRPLVLSPIINYVKAV 548
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
+ A +PGEEGG AIADV+FG +NP GRLPIT+ P+ + +PL R S
Sbjct: 549 IEAWFPGEEGGNAIADVIFGDYNPSGRLPITF---------PMDTGQIPLYYNRKPSSF- 598
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKT 679
R Y L+ FGYGLSYTQF+Y+ L T K I N N
Sbjct: 599 ---RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSN----------------- 638
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+D +NVG +G DVV +Y A +K++ GF ++ +
Sbjct: 639 ---------------IAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHL 683
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ G +R+KF+ ++L D ++ GE+ + +GN
Sbjct: 684 KPGEKRRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722
>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
citri pv. mangiferaeindicae LMG 941]
Length = 874
Score = 299 bits (765), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 182/449 (40%), Positives = 247/449 (55%), Gaps = 43/449 (9%)
Query: 45 LGLQMSSFLFCDSSLPYSI---RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
LGL + + P R LV++M+ +EKV Q + A +PRLG+P YEWWSE
Sbjct: 3 LGLTLPCLALAPPAKPAGSPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSE 62
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-- 159
LHG++ G AT FP I AS+N SL +++G VSTEARA +N
Sbjct: 63 GLHGIARNG----------YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGP 112
Query: 160 -------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
AGLT WSPNIN+ RDPRWGR ET GEDPF+ G+ AV ++RGLQ
Sbjct: 113 GKDHQRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GE 165
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
DLN P +++ KH A V + R+ FD V+ D+E T+ F + EG A S
Sbjct: 166 DLN-HPRTIATP-KHIA---VHSGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 220
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
VMC+YN ++G P CA LLN VRG+W G++V+DCD++ M H F D+ + A
Sbjct: 221 VMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 280
Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVS 390
LKAG DL+CG Y G A+ +G+V E +D+SL L+ RLG + + Y
Sbjct: 281 -ALKAGHDLNCGHAYREL-GTAIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYAR 338
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
LG +D+ + + LA +AA E IVLLKND NTLPL + +AV+GP+A+A A+ NY
Sbjct: 339 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPLRAG--TRLAVIGPNADALAALEANY 396
Query: 451 AGIPCRYMSPIAGFS---GYANVTYKTGC 476
G ++P+ G G V+Y G
Sbjct: 397 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 425
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 143/286 (50%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 611 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 669
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+ + + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 670 WAKMHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 718
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S GRTY+++ G L+PFGYGLSYT+F Y+
Sbjct: 719 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 753
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
P + L+ + + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 754 ------APQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 806
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG++T+FVG G
Sbjct: 807 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 851
>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 886
Score = 299 bits (765), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 177/427 (41%), Positives = 243/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N SL +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 87 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DL+ P +++ KH A V
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLD-HPRTIATP-KHIA---VH 194
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ D+E T+ F + EG A SVMC+YN ++G P+CA LLN
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 254
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y + G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYRDL-GTA 312
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+++G V E +D+SL L+ RLG + + Y LG +D+ + + LA +AA E
Sbjct: 313 IERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPL + +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 373 IVLLKNDANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 431 VSYAQGA 437
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 143/286 (50%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 681
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG A+A ++ G NPGGRLP+T+Y ++ L
Sbjct: 682 WAKTHAD--AIVAAWYPGQSGGTAMARMLAGDDNPGGRLPVTFYR---------STKDLP 730
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+PFGYGLSYT+F Y+
Sbjct: 731 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 765
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
P + L+ + + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 766 ------APQLSTTTLQAGNPLQVTTTVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG++T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 863
>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 886
Score = 298 bits (764), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 177/427 (41%), Positives = 243/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N SL +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 87 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DL+ P +++ KH A V
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLD-HPRTIATP-KHIA---VH 194
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ D+E T+ F + EG A SVMC+YN ++G P+CA LLN
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 254
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y + G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYRDL-GTA 312
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+++G V E +D+SL L+ RLG + + Y LG +D+ + + LA +AA E
Sbjct: 313 IERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPL + +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 373 IVLLKNDANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 431 VSYAQGA 437
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 143/286 (50%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 681
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 682 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 730
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+PFGYGLSYT+F Y+
Sbjct: 731 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 765
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
P + L+ + + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 766 ------APQLSTTTLQAGNPLQVTTTVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG++T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 863
>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
axonopodis pv. punicae str. LMG 859]
Length = 886
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 178/427 (41%), Positives = 241/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------AT 86
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N SL +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 87 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DLN P +++ KH A V
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 194
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ D+E T+ F + EG A SVMC+YN ++G P CA LLN
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNG 254
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 312
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+ +G+V E +D+SL L+ RLG + + Y LG +D+ + + LA +AA E
Sbjct: 313 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPL + +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 373 IVLLKNDANTLPLRAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 431 VSYAQGA 437
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 142/286 (49%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQTLLER-AKASGKPLVVVLMSGSAVALN 681
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 682 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 730
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+PFGYGLSYT+F Y+
Sbjct: 731 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 765
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
P + L+ + + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 766 ------APQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG +T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 863
>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
str. GSPB2388]
Length = 886
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 178/427 (41%), Positives = 241/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 37 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N SL +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 87 VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DLN P +++ KH A V
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 194
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ D+E T+ F + EG A SVMC+YN ++G P CA LLN
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNG 254
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 312
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+ +G+V E +D+SL L+ RLG + + Y LG +D+ + + LA +AA E
Sbjct: 313 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPL + +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 373 IVLLKNDANTLPLRAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 431 VSYAQGA 437
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 143/286 (50%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQTLLER-AKASGKPLVVVLMSGSAVALN 681
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 682 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 730
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+PFGYGLSYT+F Y+
Sbjct: 731 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 765
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
P + L+ + + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 766 ------APQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG++T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 863
>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
19865]
Length = 889
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 176/427 (41%), Positives = 245/427 (57%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 40 RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 90 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 149
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DL+ P +++ KH A V
Sbjct: 150 PRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GDDLD-HPRTIATP-KHIA---VH 197
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + +G A SVMC+YN ++G P+CA LLN
Sbjct: 198 SGPEPGRHSFDVDVSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACAADWLLNG 257
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A +LKAG DL+CG Y G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-SLKAGHDLNCGYAYRAL-GTA 315
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+++G+V E +D+SL L+ RLG + + Y +LG +DI + N LA +AA +
Sbjct: 316 IERGEVDEALLDQSLVRLFAARYRLGELEAPHKDPYATLGAKDIDNTANRALALKAAAQS 375
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKND NTLPL + +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 376 IVLLKNDANTLPLKAG--ARLAVIGPNADALAALEANYQGTSSTPVTPLLGLRQRFGVHQ 433
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 434 VSYAQGA 440
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 140/286 (48%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S GRTY+++ G L+PFGYGLSYT F Y + T
Sbjct: 734 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTSFAYGAPQLSSTT----------------- 776
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
L+ + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 777 --------------LQAGSTLQVTTTVRNTGTRAGDEVAQVYLQYP-DRPQSPLRSLVGF 821
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV ++ G + + F +A ++L+ VD + AG++T+FVG G
Sbjct: 822 QRVHLKPGEQRTLTFTLDA-RALSDVDRTGQRAVEAGDYTLFVGGG 866
>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
Length = 754
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 218/699 (31%), Positives = 342/699 (48%), Gaps = 118/699 (16%)
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T+FP I +++N L + + ++ R + G+ SP ++V RDPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI------GVNQCLSPVLDVCRDPRWGRC 154
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
ET GEDP++V + Y+ GLQ ++ + KH+AA+ + + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG-------------ETQLVATAKHFAAHGFPEGGRNI 201
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
+ H V +++ ETFL PFE+ VK G S+M +Y+ ++G+P +P+LL +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQE 257
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD-----LDCGQYYTNFTGNA 354
W G +V+D D I+ + HK +A +K +A L++G+D +DC Y A
Sbjct: 258 WGFDGIVVSDYDGIRQLEAIHK-VASNKMEAAILALESGVDIEFPTIDC---YGEPLVTA 313
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
+++G V E ID++++ + + RLG D S + + ++ ELA +AARE IV
Sbjct: 314 IKEGLVSEAIIDRAVERVLRIKERLGLLDNPFVDESAVPERLDDRKSRELALKAARESIV 373
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
LLKN+ N LPL S + +AV+GP+AN M+G+Y +GI + IA
Sbjct: 374 LLKNENNMLPL-SKNINKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKK 432
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
G V Y GC D+A +S A E AK AD I + +GL LS
Sbjct: 433 VGEGKVLYAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFK 491
Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
V E DR L L G Q +L+ ++ + K P+ILV+++ G + + +KAI
Sbjct: 492 KYQAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLIN--GRPLVLSPIINYVKAI 548
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
+ A +PGEEGG AIAD++FG +NP GRLPIT+ P+ + +PL R S
Sbjct: 549 IEAWFPGEEGGNAIADIIFGDYNPSGRLPITF---------PMDTGQIPLYYSRKPSSF- 598
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
R Y + L+ FGYGLSYTQF+Y+ L T
Sbjct: 599 ---RPYVMLHSSPLFTFGYGLSYTQFEYSNLEVTP------------------------- 630
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
++ Y +D +NVG+ +G +VV +Y A +K++ GF +V ++
Sbjct: 631 ------KEVGPLSYITILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLK 684
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
G +R+KF ++L D ++ GE+ I +GN
Sbjct: 685 PGEKRRVKFAL-PMEALAFYDNFMRLVVEKGEYQILIGN 722
>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
Length = 754
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 218/699 (31%), Positives = 341/699 (48%), Gaps = 118/699 (16%)
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T+FP I +++N L + + ++ R + G+ SP ++V RDPRWGR
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI------GVNQCLSPVLDVCRDPRWGRC 154
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
ET GEDP++V + Y+ GLQ ++ + KH+AA+ + + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG-------------ETQLVATAKHFAAHGFPEGGRNI 201
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
+ H R ++ ETFL PFE+ VK G S+M +Y+ ++G+P +P+LL +R E
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQE 257
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD-----LDCGQYYTNFTGNA 354
W G +V+D D I+ + HK +A +K +A L++G+D +DC Y A
Sbjct: 258 WGFDGIVVSDYDGIRQLEAIHK-VASNKMEAAILALESGVDIEFPTIDC---YGEPLVTA 313
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
+++G V E ID++++ + + RLG D S + + ++ ELA +AARE IV
Sbjct: 314 IKEGLVSEAIIDRAVERVLRIKERLGLLDNPFVDESAVPERLDDRKSRELALKAARESIV 373
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
LLKN+ N LPL S + +AV+GP+AN M+G+Y +GI + IA
Sbjct: 374 LLKNENNMLPL-SKNINKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKK 432
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
G V Y GC D+A +S A E AK AD I + +GL LS
Sbjct: 433 VGEGKVLYAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFK 491
Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
V E DR L L G Q +L+ ++ + K P+ILV+++ G + + +KAI
Sbjct: 492 KYQAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLIN--GRPLVLSPIINYVKAI 548
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
+ A +PGEEGG AIAD++FG +NP GRLPIT+ P+ + +PL R S
Sbjct: 549 IEAWFPGEEGGNAIADIIFGDYNPSGRLPITF---------PMDTGQIPLYYSRKPSSF- 598
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
R Y + L+ FGYGLSYTQF+Y+ L T
Sbjct: 599 ---RPYVMLHSSPLFTFGYGLSYTQFEYSNLEVTP------------------------- 630
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
++ Y +D +NVG+ +G +VV +Y A +K++ GF +V ++
Sbjct: 631 ------KEVGPLSYITILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLK 684
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
G +R+KF ++L D ++ GE+ I +GN
Sbjct: 685 PGEKRRVKFAL-PMEALAFYDNFMRLVVEKGEYQILIGN 722
>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
[Verrucomicrobiae bacterium DG1235]
Length = 730
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 233/720 (32%), Positives = 341/720 (47%), Gaps = 95/720 (13%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ F D LP R+ DL++ MTL+EKV +G F G+PRL + +Y SE HGV+ GP
Sbjct: 27 YPFQDPDLPNEERIDDLITCMTLEEKVDLMG-FVPGIPRLDV-KYTRISEGYHGVAQGGP 84
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN---LGRAGLTYWSPN 168
T FP A+++ +L ++ +TE R +Y R+GL +PN
Sbjct: 85 SNWGKRNPTPTTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPKYQRSGLVVMAPN 144
Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
++ARDPRWGR E GEDPF+ G A + GL A D + R LK +S KH+
Sbjct: 145 ADLARDPRWGRTEEVYGEDPFLTGTLAAAFASGL--------AGD-HPRYLKATSLLKHF 195
Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
A N DR+ + E+ E + +PFEM +++G A S+M +YN +NG P+
Sbjct: 196 LA----NSNEDDRFFSSSDFDERLWREYYAKPFEMAIRDGGARSMMAAYNAINGTPAHVH 251
Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
P +L V GEW L G I D + +V+ HK D A A +KAG++L + T
Sbjct: 252 P-MLRDIVMGEWGLDGTICTDGGGLAHLVNQHKTYPDLPT-ATAACIKAGINLFLDNH-T 308
Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSD----EN 401
+AV+Q V E +ID ++ + + LG D P+ Y ++G + E
Sbjct: 309 QAALDAVEQSLVTEAEIDDVIRGRIRLFLDLGLLD-PPELVPYSNIGHEPGLEPWELPET 367
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
E R+ IVLLKN+ N LPL+ +K+ +VA+VGP AN T ++ Y+G P + P
Sbjct: 368 HAFVREVTRKSIVLLKNENNILPLDPSKINSVAIVGPLANTT--LLDWYSGTPPYAIPPR 425
Query: 462 AGFSGYANV-----TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---- 512
G GYAN K G + VA S+ ++ E A + D I++ G A
Sbjct: 426 DGIEGYANSGPFPSPAKFGSNWVADMSDTAL----EVAASRDVAIVVVGNHPESNAGWGV 481
Query: 513 --------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
E++DR+++ L Q + I +V A P +V++ A N A
Sbjct: 482 VTSPSEGKEAVDRQEIILQPDQEEFIQKV--YAANPNTIVVL-VSNFPYAMPWAAENAPA 538
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
I+ + +E G A+ADV+FG +NPGG+ TW Q+ P+ +R GR
Sbjct: 539 IVHITHASQEQGNALADVLFGDYNPGGKTVQTWPKS-LDQLPPMMDYDIRR-------GR 590
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY + YPFGYGLSYT F+ + L K + +DA+ T
Sbjct: 591 TYMYSQHEPQYPFGYGLSYTTFELSKLKAPKKL---------------KADATAT----- 630
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
KV N G DG +VV +Y + P KQ+ GFQRV V AG++
Sbjct: 631 ------------IKVRVANTGERDGDEVVQLYVRYPNSKVERPSKQLKGFQRVTVPAGKS 678
>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
Length = 349
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 149/341 (43%), Positives = 212/341 (62%), Gaps = 10/341 (2%)
Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
+ TV MIGNYAG+ C Y +P+ G Y ++ GC DV C N AA AA+ ADAT
Sbjct: 1 DVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADAT 60
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
+++ GLD S+EAE +DR L LPG+Q +L+++VA ++GP ILV+MS G +D+ FA+ +
Sbjct: 61 VLVMGLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDP 120
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
I AI+W GYPG+ GG AIADV+FG NPGG+LP+TWY +YV LP+T M +R + G
Sbjct: 121 RISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARG 180
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
YPGRTY+FY GP ++PFG GLSYT F +NL + V L L+ N S A
Sbjct: 181 YPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKA---- 236
Query: 681 CPGVLVNDLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
V V+ C+ + VD +N GS DG+ ++V++ PP A+ KQ++GF ++
Sbjct: 237 ---VRVSHADCNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWASS-KQLMGFHKIH 292
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ AG KR++ + CK L++VD +P GEH + +G+
Sbjct: 293 IAAGSEKRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 333
>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 889
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 175/427 (40%), Positives = 244/427 (57%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R DLV+ M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 40 RAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGR--------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N GR AGLT WSPNIN+ RD
Sbjct: 90 VFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFRD 149
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++ GLQ DL+ P +++ KH A V
Sbjct: 150 PRWGRGMETYGEDPFLTGQMAVGFIHGLQ-------GDDLD-HPRTIATP-KHLA---VH 197
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + EG A +VMC+YN ++G P+CA L+N
Sbjct: 198 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLING 257
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 315
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
+ +G+V E +D+SL L+ RLG + + Y LG +D+ + ++ LA +AA E
Sbjct: 316 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALALQAAAES 375
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKN+ NTLPL + +AV+GP+A+A A+ NY G ++P+ G G
Sbjct: 376 IVLLKNNANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 433
Query: 470 VTYKTGC 476
V+Y G
Sbjct: 434 VSYAQGA 440
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 140/286 (48%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA ++ G NPGGRLP+T+Y ++ L
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+PFGYGLSYT F Y+ + T
Sbjct: 734 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTCFAYDAPQLSSTA----------------- 776
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
++ + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 777 --------------VQAGSTLQVTTTVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 821
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG + + F +A ++L+ VD + + AG +T+FVG G
Sbjct: 822 QRVHLAAGEQRTLTFNLDA-RALSDVDPSGQRAVEAGNYTLFVGGG 866
>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
Length = 870
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 172/443 (38%), Positives = 254/443 (57%), Gaps = 46/443 (10%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + S RV DLV+R+TL+EKV QL D + + RL +P+Y WW+EALHGV+ G
Sbjct: 33 LYLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWNEALHGVARAGK- 91
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTY 164
AT FP I A+F+E L ++G A+S E RA ++ A GLTY
Sbjct: 92 ---------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLAENNRSMYTGLTY 142
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPNIN+ RDPRWGR ET GEDP++ R AVN++ GLQ N+ LK +
Sbjct: 143 WSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGD---------NTEYLKSVAT 193
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KHYA V + V R+ D +++D+ ET+L F+ + + +SVMC+YN VNG P
Sbjct: 194 LKHYA---VHSGPEVSRHSDDYTASKKDLAETYLPAFKDVIAQTKVASVMCAYNSVNGTP 250
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
+C + +L+ +R E++ GYIV+DC +I D +H + +++ A A LK G DL+
Sbjct: 251 ACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHN-IVNTEAKAAAMALKTGTDLN 309
Query: 343 CGQYYTN---FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDI--- 396
CG ++ N + AV++G V+E D+DK+LK L +LG FD +P+ V I
Sbjct: 310 CGDHHGNTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFD-NPENVPYSDTSIDIV 368
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
S++++ L EAA++ +VLLKN+Q LPL + VA++GP+A+ ++GNY G+P
Sbjct: 369 GSNKHLALTQEAAKKSLVLLKNEQ-VLPLKGN--EKVALIGPNADNEAILLGNYNGMPIV 425
Query: 457 YMSPIAGFS---GYANVTYKTGC 476
++P G N+TY G
Sbjct: 426 PITPKLALEQRLGKNNLTYTAGS 448
Score = 113 bits (283), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 90/303 (29%), Positives = 132/303 (43%), Gaps = 57/303 (18%)
Query: 494 AKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVIL 543
A AD + + G+ ++E E + DR ++ LP Q L+ ++ + K P++L
Sbjct: 603 ANEADVIVFVGGISANLEGEEMPLQIDGFSHGDRTNINLPKSQLNLLKKLKQTGK-PIVL 661
Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
V MS G +A N NI AI+ YPGE G A+ +++G+++P G+LPIT+Y V
Sbjct: 662 VNMS--GSAMALNWENENIDAIIQGFYPGEAAGSALVSLLYGEYSPSGKLPITFYKS--V 717
Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
LP RTYK+Y G LYPFG+GLSY FKY
Sbjct: 718 SDLP-------DFKDYSMKNRTYKYYEGEVLYPFGFGLSYADFKY--------------- 755
Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEI 723
+N ++ DA DL N S DVV VY P
Sbjct: 756 ----KNTRHSIDAGS--------GDLN------LTTTITNQSSFSADDVVQVYVSMPDAP 797
Query: 724 AATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG-GV 782
T KQ++GF+ + ++ IKF K L+ ++ + G I VG+G G+
Sbjct: 798 IKTPNKQLVGFKHITLKNESKNDIKFTIPKNK-LSYINEQGIAVAYKGRLIITVGSGQGI 856
Query: 783 SFP 785
P
Sbjct: 857 KIP 859
>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
Length = 893
Score = 295 bits (756), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 173/449 (38%), Positives = 259/449 (57%), Gaps = 42/449 (9%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
+++ F D+SL RV DLVSR+T EK+ Q+ + + RLG+P Y WW+E+LHGV+
Sbjct: 41 ATYPFRDASLSVDARVDDLVSRLTTTEKIAQMFNDTPAIERLGIPAYNWWNESLHGVARA 100
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGR------AG 161
G AT +P I ++F+E L ++ ++S E RA Y+ L + G
Sbjct: 101 GK----------ATVYPQAIGLASTFDEDLMLRVATSISDEGRAKYHDFLSKDVRTIYGG 150
Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
LT+WSPNIN+ RDPRWGR ET GEDPF+ GR A+N+V+G+Q + NS LK
Sbjct: 151 LTFWSPNINIFRDPRWGRGQETYGEDPFLTGRMAINFVKGIQ-------GENDNSDYLKA 203
Query: 222 SSCCKHYAAYD-VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
+ KHYA + + + D YH T +D+ ET+L F M + E + S+MC+YNRV
Sbjct: 204 VATIKHYAVHSGPEKTRHSDDYH----PTRKDLFETYLPAFRMAIAETNVQSLMCAYNRV 259
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNH-KFLADSKEDAVAQTLKAGL 339
+G P+C + +L+ + +RG+ +GY+V+DC +I ++ + DS +A A +K+G
Sbjct: 260 DGAPACGNNELMQEILRGDMGFNGYVVSDCGAIADFYESRSHHVVDSPAEAAAWAVKSGT 319
Query: 340 DLDCGQYYTNFTGN---AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQ 394
DL+CG + N N A+QQG + E ID ++K L+ ++LG FD + Y +G
Sbjct: 320 DLNCGDSHGNTYTNLHYALQQGLITEDYIDIAVKRLFKARIKLGMFDEQDRVPYSEIGMD 379
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
+ S +++ L EAA + IVLLKN+ LPL A VK VAV+GP+A ++GNY G+P
Sbjct: 380 VVGSPKHLALTQEAAEKSIVLLKNN-GVLPL-KAGVK-VAVIGPNAVDEDVLVGNYHGVP 436
Query: 455 CRYMSPIAGF---SGYANVTYKTGCDDVA 480
+ + P+ G G ANV Y G +A
Sbjct: 437 VKPVLPLEGIVNRVGEANVFYAPGSAQIA 465
Score = 112 bits (280), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 91/301 (30%), Positives = 126/301 (41%), Gaps = 55/301 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A AA+ AD I + G+D +E E + DR + LP QT L+ Q+ K
Sbjct: 620 ALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQLKATGK- 678
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV++V S G +A + + AIL A YPGE G A+A++++G +P GRLP+T+Y
Sbjct: 679 PVVMVNFS--GSAMALNWESEKLDAILQAFYPGEATGTALANILWGDVSPSGRLPVTFYK 736
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
G V LP + RTYKFY G LY FG+GL Y F YN L
Sbjct: 737 G--VDDLP-------AFNDYHMENRTYKFYRGEPLYAFGHGLGYVDFAYNNL-------- 779
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
V+ N V N G DV VY
Sbjct: 780 ------------------------VVANTAEAGKALPIAVSVTNTGKMQAEDVAQVYISL 815
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
A T I+ + F+R + AG + ++F A + L +D T G + VG+
Sbjct: 816 LDAPANTPIRDLKAFKRTKLAAGESTELEFNLPA-RVLTYIDDNGKTQTYTGRVEVTVGS 874
Query: 780 G 780
G
Sbjct: 875 G 875
>gi|182415162|ref|YP_001820228.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
gi|177842376|gb|ACB76628.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
PB90-1]
Length = 747
Score = 295 bits (756), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 223/726 (30%), Positives = 347/726 (47%), Gaps = 76/726 (10%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
F D LP R+ DL+ RMTL+EK+ + A VPRLG+ + E HGV+ GP
Sbjct: 34 FQDPELPAEQRIDDLIGRMTLEEKIDCMAMRA-AVPRLGV-KGSRHIEGYHGVAQGGPSN 91
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN---LGRAGLTYWSPNIN 170
T FP A+++ L +++ + EAR ++ RAGL +PN +
Sbjct: 92 WGRRNPTATTQFPQAYGLGATWDPELIRQVAAQEAEEARYLFQSPRYDRAGLIVRAPNAD 151
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
+ARDPRWGR E GEDPF G A +VRGLQ + R K S KH+ A
Sbjct: 152 LARDPRWGRTEEVYGEDPFHAGTLATAFVRGLQGD---------DPRYFKAVSLVKHFLA 202
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
++ + +F +E+ E + +PFEM + +G A ++M +YN VNG P+ P
Sbjct: 203 NSNEDGRESSSSNF----SERQWREYYAKPFEMAIVDGGAPALMAAYNAVNGTPAHVHP- 257
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
+L V EW L+G + D ++++V+ H D A A +KAG++ ++
Sbjct: 258 MLRDIVMAEWKLNGILCTDGGGLRLLVEKHHAFPDLP-SAAAACVKAGINHFLDRHKDAV 316
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGK----QDICSDENIEL 404
T AV +G + E D+D +L+ L+ V ++LG D + Y ++G+ + + L
Sbjct: 317 T-EAVARGSITERDLDAALRGLFRVSLKLGLLDPDERVPYAAIGRNGEAEPWLRPDTQAL 375
Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
+ + IVLLKN LPL+ KVKTVA+VGP N + Y G P + P G
Sbjct: 376 VRKVTQRSIVLLKNSGALLPLDRTKVKTVALVGPLVNTV--LPDWYGGTPPYTVPPSIGV 433
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD------------LSVEA 512
A K G +A + AA E A+T++ I+ G D S
Sbjct: 434 EKVAGEGVKVGW--LADMGD----AAVELARTSEIAIVCVGNDPISAGGWELVRTPSEGK 487
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E++DR+DL LP Q + I +V +A P +V++ + A ++ AI+ +
Sbjct: 488 EAVDRKDLALPRDQEKFIRRV--LAANPRTIVVLIS-NFPYAMPWVVKHVPAIVHLTHAS 544
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
+E G A+ DV++G+ NP G+L TW Q+ P+ L GRTY+++ G
Sbjct: 545 QELGHALGDVLWGEVNPDGKLAQTWPK-SLKQLPPMMDYDL-------THGRTYQYFKGE 596
Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT---SDASKTRCPGVLVNDL 689
+PFG+GLSYT F + ++V L+ +H T S A +T P +++
Sbjct: 597 PQFPFGFGLSYTTFNLS------NLRVGLDVARHVGAGAETPAESPAPRTFAPNAILS-- 648
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
V+ N G+ G +VV VY++ P + +KQ+ GFQR+ V AG ++
Sbjct: 649 -------IAVEVTNTGTRAGDEVVQVYARYPHSKVSRPLKQLCGFQRISVAAGETAHVRL 701
Query: 750 VFNACK 755
A +
Sbjct: 702 QLPASR 707
>gi|295135996|ref|YP_003586672.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294984011|gb|ADF54476.1| putative beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 796
Score = 295 bits (755), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 224/738 (30%), Positives = 352/738 (47%), Gaps = 114/738 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P EA+HG VG T FPT I +++N L KK+ ++
Sbjct: 126 RLGIPLL-LEEEAMHGHMAVG-----------TTVFPTAIGQASTWNPDLIKKMAHVIAK 173
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E RA + T + P I++AR+PRW R+ ET GEDP+++ + V G Q HE
Sbjct: 174 EIRA-----QGSNTAYGPIIDIAREPRWSRVEETFGEDPYLIAEMGKSMVTGFQG--SHE 226
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA--RVTEQDMEETFLRPFEMCVKE 267
+DL S V++ KH+AAY V + H A + ++D+ + ++ P + V
Sbjct: 227 --SDLKSNE-HVAATLKHFAAYGVS-----EGGHNGAAVHIGQRDLFQNYMYPVKEAVDN 278
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
G SVM +Y+ ++G+PS A LL ++ +W G++++D SI+ ++ +H + D++
Sbjct: 279 G-VMSVMTAYSSIDGVPSTAHKNLLTNILKEKWGFKGFVISDLASIEGLLGDH-HIVDTE 336
Query: 328 EDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
EDA A + AG+D+D G Y + +AV GKV E ID++++ + TV +LG F+
Sbjct: 337 EDAAAMAMNAGVDVDLGGNGYDDALIDAVNAGKVAEERIDEAVRRILTVKFKLGLFENPY 396
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
++ + + E+IELA E AR+ I +LKN+ N LPLN +++ +AV+G +A+
Sbjct: 397 ANEKQAEKIVRNSEHIELAREVARQSITMLKNEDNILPLNK-ELQNIAVIGSNADMQYNQ 455
Query: 447 IGNYAGIPCRYMSPIAGFSGY------ANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
+G+Y P + I G AN+ Y G V + +I AA EAAK A+
Sbjct: 456 LGDYTA-PQSEENIITVLEGIQHKMPNANIEYVKGT-AVRDTTQTNIPAAVEAAKNAEVA 513
Query: 501 IILAG----LDLSVE----------------------AESLDREDLWLPGYQTQLINQVA 534
I++ G D E E DR L L G Q +L+ V
Sbjct: 514 IVVLGGSSARDFKTEYLETGAATISSKEDQVLSDMESGEGYDRSTLNLMGKQLELLQAV- 572
Query: 535 EVAKG-PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
VA G P +LV++ G + N+ IL A YPG+EGG AIADV+FG FNP GRL
Sbjct: 573 -VATGTPTVLVLIK--GRPLLLNWPAENVPVILDAWYPGQEGGSAIADVIFGDFNPAGRL 629
Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT-YKFYNGPTLYPFGYGLSYTQFKYNLLS 652
P++ +P + + + +P R Y + LYPFGYGLSY++FKY+ L
Sbjct: 630 PVS---------VPKSLGQIPVYYNYWFPNRRDYVETDAKPLYPFGYGLSYSEFKYSDLK 680
Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
+ + K R + E + N DG +V
Sbjct: 681 --------------------VATSGKGR-----------NTKIEISLKISNTSKVDGDEV 709
Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
+ +Y + + +KQ+ F+RV ++AG K ++F K L++ D + AGE
Sbjct: 710 IQLYIRDMVSTVLSPVKQLRAFERVSIKAGETKTVQFEL-LPKELSLFDTEMKQKVQAGE 768
Query: 773 HTIFVGNGGVSFPIHLNF 790
+ +G + F
Sbjct: 769 FKLMIGASSEDIRLETTF 786
>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
CL02T12C04]
Length = 861
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 174/449 (38%), Positives = 247/449 (55%), Gaps = 39/449 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E ++R K+ +C
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +V+DC +I H+ D KE A A ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASADAVRAGTDLE 303
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
CG Y + +AV+ G + E +ID SLK L T LG D P + + + S E+
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWAEIPTSVLNSKEHQ 362
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++ +
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ Y+ GCD V K+ S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 102 bits (255), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 127/297 (42%), Gaps = 56/297 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
AD + G+ S+E E + DR D+ LP Q + + + K +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKKVVFI 654
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y V L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
P + GRTY++ L+PFG+GLSYT F Y +K N +
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSK------NTIAK 759
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
N+ T + NVG DG +VV VY + P +
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+ F+RV + AG+ + + ++ D +NT+ P E T + GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDAESNTMRPL-EGTYELLYGGTS 848
>gi|319788503|ref|YP_004147978.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
gi|317467015|gb|ADV28747.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
suwonensis 11-1]
Length = 916
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 175/445 (39%), Positives = 258/445 (57%), Gaps = 32/445 (7%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL + R LVSRMTL+EK Q+ + + + RLGLP Y+WW+EALHGV+ G
Sbjct: 50 WLDTSLSFEERAAALVSRMTLEEKAAQMQNDSPAIERLGLPAYDWWNEALHGVARAG--- 106
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------LGR-AGLTYW 165
GAT FP I ASF+ L ++ A+S EARA ++ GR GLT+W
Sbjct: 107 -------GATVFPQAIGMAASFDVPLMDQVSAAISDEARAKHHDFLRKGEHGRYQGLTFW 159
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDPF+ R V++VRGLQ ++ + L+ + K+ +
Sbjct: 160 SPNINIFRDPRWGRGQETYGEDPFLTTRMGVSFVRGLQGMD-PQTGQPLDPKYRKLDATA 218
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + DR+ FD ++QD+ +T+L FE VKE D +VM +YNRV G +
Sbjct: 219 KHFA---VHSGPEADRHTFDVHPSKQDLYDTYLPAFESLVKEADVYAVMGAYNRVYGESA 275
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
LL T+R +W GY+++DC +I + NHK + ++ E+A A +K G +L+CG
Sbjct: 276 SGSKFLLLDTLRRDWGFDGYVMSDCWAIVDIWKNHKIV-ETPEEAAALAVKNGTELNCGS 334
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE-- 403
Y + AV++G + E ++D +L L+ M LG FD P+ V + +++ E
Sbjct: 335 TYADHLPVAVKKGLISEAELDDALTRLFVARMELGMFD-PPEQVRWAQVPYSVNQSAEHD 393
Query: 404 -LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA + A+E +VLLKND LPL S ++ +AVVGP A+ T+A++GNY G P ++ +
Sbjct: 394 ALARKMAQESLVLLKND-GVLPL-SKDIRRLAVVGPTADDTMALLGNYYGTPADPVTILR 451
Query: 463 GFSGYA---NVTYKTGCDDVACKSN 484
G A +V Y G D V + +
Sbjct: 452 GIREAAPGVDVVYARGVDLVEGRDD 476
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 142/301 (47%), Gaps = 56/301 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A EAA +ADA + + GL VE E + DR D+ LP Q +L+ V K
Sbjct: 643 ALEAANSADAVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDIRLPATQQKLLEAVHATGK- 701
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV++V+ + + I +A N+ IL A YPG+ GG A+ + +FG +NPGGRLP+T+Y+
Sbjct: 702 PVVMVLTTGSALGIDWA--RRNVPGILVAWYPGQRGGTAVGEALFGDYNPGGRLPVTFYS 759
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
D + LP P D RTY+++ G L+PFG+GLSYT F Y+ L
Sbjct: 760 AD--EKLP-------PFDDYAMKERTYRYFTGQPLFPFGHGLSYTSFGYSGL-------- 802
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
KL R D V +N G G +VV +Y P
Sbjct: 803 ---KLDRKR--------------------AGAGDEVTVSVTVKNQGKRAGDEVVQLYLAP 839
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN--TLLPAGEHTIFV 777
+K++ GFQRV ++ G ++ + F + L + D AA T+ P G + + V
Sbjct: 840 VKPQRERALKELRGFQRVHLQPGESRTVTFSIVPERDLRVYDEAAGRYTVDP-GRYEVQV 898
Query: 778 G 778
G
Sbjct: 899 G 899
>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 888
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 176/427 (41%), Positives = 240/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 39 RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 89 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DL P +++ KH A V
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GDDLE-HPRTIATP-KHIA---VH 196
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + EG A SVMC+YN ++G P+CA LLN
Sbjct: 197 SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 256
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A +LKAG DL+CG Y G A
Sbjct: 257 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-SLKAGHDLNCGTAYRAL-GTA 314
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAAEAAREG 412
+++G+V E +D+SL L+ RLG +Y LG +DI + N LA +AA E
Sbjct: 315 IERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALALQAAAES 374
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKN TLPL + +AV+GP+A+A A+ NY G + ++P+ G G
Sbjct: 375 IVLLKNANATLPLKAG--TRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQ 432
Query: 470 VTYKTGC 476
V Y G
Sbjct: 433 VRYAQGA 439
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 139/286 (48%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA + G NPGGRLP+T+Y ++ L
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYR---------STKDLP 732
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S GRTY+++ G L+PFGYGLSYT F Y+ + T
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTSFAYDAPQLSSTT----------------- 775
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
L+ + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 776 --------------LQAGSPLQVTTTVRNTGTRAGDEVAQVYLQYP-DRPQSPLRSLVGF 820
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV ++ G + + F +A ++L+ VD + AG++ +FVG G
Sbjct: 821 QRVHLQPGEQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGG 865
>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 874
Score = 294 bits (752), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 181/449 (40%), Positives = 247/449 (55%), Gaps = 43/449 (9%)
Query: 45 LGLQMSSFLFC---DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
LGL + F D S R LV++M+ DEKV Q + A +PRL +P YEWWSE
Sbjct: 3 LGLCLPCIAFAAPADRSGTPEQRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSE 62
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-- 159
LHG++ G AT FP I AS+N +L +++G VSTEARA +N
Sbjct: 63 GLHGIARNG----------YATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGP 112
Query: 160 -------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
AGLT WSPNIN+ RDPRWGR ET GEDPF+ G+ AV ++RGLQ
Sbjct: 113 GKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GD 165
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
DLN P +++ KH A V + R+ FD V+ +DME T+ F + +G A S
Sbjct: 166 DLN-HPRTIATP-KHIA---VHSGPEPGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWS 220
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
VMC+YN ++G P+CA LLN VRG+W G++V+DCD++ M H F D+ + A
Sbjct: 221 VMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 280
Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVS 390
LKAG DL+CG Y G A+++G+V E +D+SL L+ RLG + + Y
Sbjct: 281 -ALKAGHDLNCGHAYREL-GTAIERGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYAR 338
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
LG +D+ + + LA +AA E IVLLKN TLPL + +AV+GP+A+A A+ NY
Sbjct: 339 LGAKDVDNAAHRALALQAAAESIVLLKNTATTLPLKAG--TRLAVIGPNADALAALEANY 396
Query: 451 AGIPCRYMSPIAGFS---GYANVTYKTGC 476
G ++P+ G G V Y G
Sbjct: 397 QGTSATPITPLLGLRQHFGAQQVRYAQGA 425
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 91/294 (30%), Positives = 143/294 (48%), Gaps = 55/294 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
+DA + GL VE E L DR D+ LP Q L+ + A+ + P+++V+M
Sbjct: 603 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 661
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
S V + +A+ N + AI+ A YPG+ GG AIA + G NPGGRLP+T+Y
Sbjct: 662 SGSAVALNWAKANAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYR------- 712
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
++ L S GRTY+++ G L+PFGYGLSYT F Y+
Sbjct: 713 --STKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTSFAYD----------------- 753
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
P + L+ + + +N GS G +V VY + P + +
Sbjct: 754 --------------APRLSTRTLQAGNPLQVTTTVRNTGSRAGDEVAQVYLQYP-DRPQS 798
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
++ ++GFQRV ++ G + + F +A ++L+ VD + + AGE+ +FVG G
Sbjct: 799 PLRSLVGFQRVHLKPGEQRELTFTLDA-RALSDVDRSGQRAVEAGEYRVFVGGG 851
>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
str. 8004]
Length = 888
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 176/427 (41%), Positives = 240/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 39 RAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 89 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DL P +++ KH A V
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GDDLE-HPRTIATP-KHIA---VH 196
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + EG A SVMC+YN ++G P+CA LLN
Sbjct: 197 SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 256
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A +LKAG DL+CG Y G A
Sbjct: 257 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-SLKAGHDLNCGTAYRAL-GTA 314
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAAEAAREG 412
+++G+V E +D+SL L+ RLG +Y LG +DI + N LA +AA E
Sbjct: 315 IERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALALQAAAES 374
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKN TLPL + +AV+GP+A+A A+ NY G + ++P+ G G
Sbjct: 375 IVLLKNANATLPLKAG--TRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQ 432
Query: 470 VTYKTGC 476
V Y G
Sbjct: 433 VRYAQGA 439
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 139/286 (48%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA + G NPGGRLP+T+Y ++ L
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYR---------STKDLP 732
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S GRTY+++ G L+PFGYGLSYT F Y+ + T
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTSFAYDAPQLSSTT----------------- 775
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
L+ + +N G+ G +V VY + P + + ++ ++GF
Sbjct: 776 --------------LQAGSPLQVTTTVRNTGTRAGDEVAQVYLQYP-DRPQSPLRSLVGF 820
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV ++ G + + F +A ++L+ VD + AG++ +FVG G
Sbjct: 821 QRVHLQPGEQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGG 865
>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
Length = 923
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 168/425 (39%), Positives = 249/425 (58%), Gaps = 37/425 (8%)
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
Y R+ DL+S MT +EK++QL + A +PRLGL Y +W+E+LHGV +
Sbjct: 113 YKERLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGV-----------LAE 161
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
GATSFP I A+++ L ++ AVS EARA+ L GLTYWSP IN+ARDPRWGR
Sbjct: 162 GATSFPQAIALGATWDPRLVNRVATAVSDEARALNRLYGKGLTYWSPTINIARDPRWGRN 221
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E+ EDP+++ R V +++G+Q + LK + KH+ A N +
Sbjct: 222 EESYSEDPYLLSRMGVAFIKGMQGDHPYY---------LKTVATPKHFIA----NNEEER 268
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
R+ + V +++ E +L F+ + E A S+M +YN +N +PS A+ L+ +R +W
Sbjct: 269 RHTGSSDVDMRNLYEYYLPAFKSAIVEARAYSIMGAYNELNHVPSNANMFLMTDLLRRQW 328
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
GY+V+DC +I M+ HKF E AVA+++ AG DL+CGQ Y F +A+ +G +
Sbjct: 329 GFEGYVVSDCGAIHDMLYGHKFFKTGAE-AVARSILAGCDLNCGQAYREFIKDALDEGLL 387
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLK 417
+E DID +L + + RLG FD P+ Y S+GK + S EN LA +AAR+ IVLLK
Sbjct: 388 REKDIDSALFRVLSARFRLGEFD-PPELVPYSSIGKDKLDSKENRRLALDAARKSIVLLK 446
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN-----VTY 472
N+ + LP++ +K+K++AV+GP NA A +G Y+G P +SP+ G A+ V Y
Sbjct: 447 NN-DILPIDKSKIKSIAVIGP--NAREAQLGIYSGFPNVLISPLEGIKNKADSLDIRVGY 503
Query: 473 KTGCD 477
GCD
Sbjct: 504 VKGCD 508
Score = 125 bits (314), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 139/290 (47%), Gaps = 41/290 (14%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A + A D I++ G+ + E LDR+++ LP Q +L+ Q AEV P I++++ G
Sbjct: 661 AKKIAAENDLVILVLGITPGISQEELDRKEIELPSVQRELVKQTAEV--NPNIVIVLVNG 718
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
G +A A KAI+ Y GE GG+A+ADV+FG +NPGG+LP T+Y + LP
Sbjct: 719 G-PVALAGAEKYAKAIVENWYNGEFGGQALADVLFGDYNPGGKLPQTFYAS--TEQLP-- 773
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
P+ D + P RTY + N L+PFG+GLSYT FKY+ L
Sbjct: 774 --PMSDYDIINNP-RTYMYLNEQALFPFGHGLSYTTFKYDSLK----------------- 813
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
++ N L D + NVG+ +G +VV +Y+ K
Sbjct: 814 --------------IVSNTLNETDTLSLQFRLTNVGNRNGDEVVQIYASCKDAKFKVPRK 859
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
Q+ F+R+ ++ G +K ++F + Y + ++ G I +G+
Sbjct: 860 QLKRFRRLTLQTGESKVLEFKIPVDELAFYSTYENDFVVEKGAWEILIGS 909
>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
Length = 888
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 176/427 (41%), Positives = 240/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWWSE LHG++ G AT
Sbjct: 39 RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 89 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DL P +++ KH A V
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GDDLE-HPRTIATP-KHIA---VH 196
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + EG A SVMC+YN ++G P+CA LLN
Sbjct: 197 SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 256
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A +LKAG DL+CG Y G A
Sbjct: 257 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-SLKAGHDLNCGTAYRAL-GTA 314
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAAEAAREG 412
+++G+V E +D+SL L+ RLG +Y LG +DI + N LA +AA E
Sbjct: 315 IERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALALQAAAES 374
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKN TLPL + +AV+GP+A+A A+ NY G + ++P+ G G
Sbjct: 375 IVLLKNANATLPLKAG--TRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQ 432
Query: 470 VTYKTGC 476
V Y G
Sbjct: 433 VRYAQGA 439
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 140/286 (48%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR D+ LP Q L+ + A+ + P+++V+MS V +
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+T+ + AI+ A YPG+ GG AIA + G NPGGRLP+T+Y ++ L
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYR---------STKDLP 732
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S GRTY+++ G L+PFGYGLSYT+F Y
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTRFAYE------------------------- 767
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
P + V L+ + +N G G +V VY + P + + ++ ++GF
Sbjct: 768 ------TPRLSVTTLQAGSPLQVTTTVRNTGERAGDEVAQVYLQYP-DRPQSPLRSLVGF 820
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV ++ G + + F +A ++L+ VD ++ AG++ +FVG G
Sbjct: 821 QRVHLQPGEQRTLTFTLDA-RALSDVDRTGTRVVEAGDYRLFVGGG 865
>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
Length = 468
Score = 293 bits (749), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 171/441 (38%), Positives = 252/441 (57%), Gaps = 44/441 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + S RV DLV+R+TL+EKV QL D + + RL +P+Y WW+EALHGV+ G
Sbjct: 33 LYLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWNEALHGVARAGK- 91
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-----GRA---GLTY 164
AT FP I A+F+E L ++G A+S E RA ++ R+ GLTY
Sbjct: 92 ---------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLEENNRSMYTGLTY 142
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPNIN+ RDPRWGR ET GEDP++ R AVN++ GLQ N+ LK +
Sbjct: 143 WSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGD---------NAEYLKSVAT 193
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KHYA V + V R+ D +E+D+ ET+L F+ + + +SVMC+YN VNG P
Sbjct: 194 LKHYA---VHSGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQTKVASVMCAYNSVNGTP 250
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD-NHKFLADSKEDAVAQTLKAGLDLDC 343
+C + +L+ +R E++ GYIV+DC +I D + ++ A A LK G DL+C
Sbjct: 251 ACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTGAKAAAMALKTGTDLNC 310
Query: 344 GQYYTN---FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDI---C 397
G ++ N + AV++G V+E D+DK+LK L +LG FD +P+ V I
Sbjct: 311 GDHHGNTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFD-NPENVPYSDTSIDVVG 369
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
S++++ L EAA++ +VLLKN+Q LPL + +A++GP+A+ ++GNY G+P
Sbjct: 370 SNKHLALTQEAAQKSLVLLKNEQ-VLPLKGN--EKIALIGPNADNEAILLGNYNGMPIVP 426
Query: 458 MSPIAGFS---GYANVTYKTG 475
++P G N+TY G
Sbjct: 427 ITPKLALEQRLGKNNLTYTAG 447
>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 861
Score = 292 bits (748), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 174/449 (38%), Positives = 246/449 (54%), Gaps = 39/449 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E ++R K+ +C
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +V+DC +I H D KE A A ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADAVRAGTDLE 303
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
CG Y + +AV+ G + E +ID SLK L T LG D P + + + S E+
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTSVLNSKEHQ 362
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++ +
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ Y+ GCD V K+ S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 126/297 (42%), Gaps = 56/297 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
AD + G+ S+E E + DR D+ LP Q N + + K +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKAGKKVVFI 654
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y V L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKN--VNQL 712
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
P + GRTY++ L+PFG+GLSYT F Y +K N +
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSK------NTIAK 759
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
N+ T + NVG DG +VV VY + P +
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+ F+RV + AG+ + + + D +NT+ P E T + GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTGV-NFEWFDAESNTMRPL-EGTYELLYGGTS 848
>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
CL03T12C04]
Length = 861
Score = 292 bits (748), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 174/449 (38%), Positives = 246/449 (54%), Gaps = 39/449 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E ++R K+ +C
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +V+DC +I H D KE A A ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADAVRAGTDLE 303
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
CG Y + +AV+ G + E +ID SLK L T LG D P + + + S E+
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTSVLNSKEHQ 362
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++ +
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ Y+ GCD V K+ S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 126/297 (42%), Gaps = 56/297 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
AD + G+ S+E E + DR D+ LP Q + + + K +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKKVVFI 654
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y V L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
P + GRTY++ L+PFG+GLSYT F Y +K N +
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------NTIAK 759
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
N+ T + NVG DG +VV VY + P +
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+ F+RV + AG+ + + + D +NT+ P E T + GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTGV-NFEWFDVESNTMRPL-EGTYELLYGGTS 848
>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
Length = 888
Score = 292 bits (748), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 175/427 (40%), Positives = 240/427 (56%), Gaps = 40/427 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++M+ +EKV Q + A +PRLG+P YEWW+E LHG++ G AT
Sbjct: 39 RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNG----------YAT 88
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
FP I AS+N L +++G VSTEARA +N AGLT WSPNIN+ RD
Sbjct: 89 VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDPF+ G+ AV ++RGLQ DL P +++ KH A V
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GDDLE-HPRTIATP-KHIA---VH 196
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+ R+ FD V+ +D+E T+ F + EG A SVMC+YN ++G P+CA LLN
Sbjct: 197 SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 256
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VRG+W G++V+DCD++ M H F D+ + A LKAG DL+CG Y G A
Sbjct: 257 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGTAYRAL-GTA 314
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAAEAAREG 412
+++G+V E +D+SL L+ RLG +Y LG +DI + N LA +AA E
Sbjct: 315 IERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALALQAAAES 374
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
IVLLKN TLPL ++ +AV+GP+A+A A+ NY G + ++P+ G G
Sbjct: 375 IVLLKNANATLPLKAS--TRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQ 432
Query: 470 VTYKTGC 476
V Y G
Sbjct: 433 VRYAQGA 439
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 94/294 (31%), Positives = 147/294 (50%), Gaps = 55/294 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
+DA + GL VE E L DR D+ LP Q L+ + A+ + P+++V+M
Sbjct: 617 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLM 675
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
S V + +A+T+ + AI+ A YPG+ GG AIA + G NPGGRLP+T+Y
Sbjct: 676 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYR------- 726
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
++ L P S GRTY+++ G L+PFGYGLSYT+F Y +T +++ LQ
Sbjct: 727 --STKDLPPYVSYDMKGRTYRYFKGEALFPFGYGLSYTRFAY------ETPRLSATTLQA 778
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
L T+ +N G G +V VY + P E +
Sbjct: 779 GSPLQVTT-------------------------TVRNTGERAGDEVAQVYLQYP-ERPQS 812
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
++ ++GFQRV ++ G + + F +A ++L+ VD + AG++ +FVG G
Sbjct: 813 PLRSLVGFQRVHLQPGEQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGG 865
>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
Length = 861
Score = 292 bits (747), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 174/449 (38%), Positives = 246/449 (54%), Gaps = 39/449 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E ++R K+ +C
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +V+DC +I H D KE A A ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADAVRAGTDLE 303
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
CG Y + +AV+ G + E +ID SLK L T LG D P + + + S E+
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTSVLNSKEHQ 362
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++ +
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ Y+ GCD V K+ S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 127/297 (42%), Gaps = 56/297 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
AD + G+ S+E E + DR D+ LP Q N + + K +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKAGKKVVFI 654
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y V L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
P + GRTY++ L+PFG+GLSYT F Y +K N +
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------NTIAK 759
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
N+ T + NVG DG +VV VY + P +
Sbjct: 760 GENVVLT-------------------------IPVSNVGQCDGEEVVQVYLRRPGDKEGP 794
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+ F+RV + AG+ + + ++ D +NT+ P E T + GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPL-EGTYELLYGGTS 848
>gi|304406707|ref|ZP_07388362.1| glycoside hydrolase family 3 domain protein [Paenibacillus
curdlanolyticus YK9]
gi|304344240|gb|EFM10079.1| glycoside hydrolase family 3 domain protein [Paenibacillus
curdlanolyticus YK9]
Length = 733
Score = 292 bits (747), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 230/760 (30%), Positives = 369/760 (48%), Gaps = 110/760 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGV----PRLGLPQYEWWSEA-----------LHGVSN 108
+ + L+S+MTL++KV Q+ F G P G +++ E L G +
Sbjct: 23 QAEQLLSKMTLEDKVGQMTQFDWGYNPINPETGESEHDLIIELIRQGKVGSIFNLSGAAE 82
Query: 109 VG--------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
P DVI G T FP + A++N + ++ A STEA
Sbjct: 83 ANELQGLIEQHTELKIPMVIGRDVIHGYRTVFPIPLAMAAAWNPEVARQTSAAASTEALT 142
Query: 154 MYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
G+T+ ++P I+V+RDPRWGRI E+ GEDP++ Y +V G Q G AT
Sbjct: 143 ------DGVTWVFAPMIDVSRDPRWGRIAESIGEDPYLTAAYGRAWVEGSQIDNGPGRAT 196
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
+SC KH+A Y + G D D ++++++ + L PF+ V+ G A S
Sbjct: 197 ---------ASCPKHFAGYGMAE-AGRDYNTVD--LSDRELRDIILPPFQDAVEAG-ALS 243
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
+M S+N +NGIP+CA+ LL +R EW G + +D +++ ++ + +A ++E+A
Sbjct: 244 IMASFNEINGIPACANEYLLKTILRDEWGFEGVVASDYNALVELIVHG--VAANEEEACE 301
Query: 333 QTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV-- 389
T+ AG D+D +T V+ G+V E+ +D S++ + + ++LG + S V
Sbjct: 302 MTVLAGCDMDMHSGIFTRQLPKLVRAGRVPESVVDDSVRRILAMKIKLGLLEQSKSDVSQ 361
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
S Q + S E +ELA EAAR+ IVLL+N + LPL+ A ++AV+GP A+ +G
Sbjct: 362 SAATQPLKS-EYVELAREAARQSIVLLQNKEQVLPLSKAGA-SIAVIGPLADNATDPLGC 419
Query: 450 YA--GIPCRYMSPIAGFSGYA----NVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
+A G ++ + G A ++ Y GC D+ S AA EAA+++D ++L
Sbjct: 420 WALDGRSDEVVTALEGIRQAAAEGTSIRYAQGC-DIDSDSEEGFEAALEAARSSDVVVML 478
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
G ++ ES R L LPG Q L+ VA++ K P++ VI+S G + FA
Sbjct: 479 LGESATMSGESRSRAALDLPGKQRALVEAVAKLGK-PIVAVILS--GRPLTFAWLPEQAS 535
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW-YNGDYVQMLPLTSMPLRPVDSLGYP 622
AI+ A + G + G AIADV+FG FNP GRLP+T+ N + + RP P
Sbjct: 536 AIVQAWHLGVQSGNAIADVLFGDFNPSGRLPVTFPQNVGQIPIYHYRKKTGRP------P 589
Query: 623 GRTYKFY----NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
Y Y LYPFGYGL+YT+F+Y + +K+
Sbjct: 590 AGAYSSYYIDSTTEPLYPFGYGLTYTEFEYGAIQTSKS---------------------- 627
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
+ D+ + V +NVG+ G +VV Y + +K+++ F++V
Sbjct: 628 ---------SIGADEQLDVTVSIRNVGNLAGEEVVQCYVRDEVASVTQPLKRLVAFRKVK 678
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
V AG + + F A + L I+D + G+ T+++G
Sbjct: 679 VAAGESVDVTFTIGAAE-LAILDKHMKRTVEPGDFTLWIG 717
>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
Length = 860
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 161/438 (36%), Positives = 249/438 (56%), Gaps = 40/438 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + +LP+ R +DL+ R++LDEK+ + + + RLG+P+Y WW+EALHGV+ G
Sbjct: 23 YLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWNEALHGVARNG--- 79
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I A+++ L +I +S EARA YN G++ W
Sbjct: 80 -------RATVFPMPIGLAATWDRDLIYRIADVISNEARAKYNSALKKNQRGIYQGISLW 132
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ G AV++++GLQ + + LK +
Sbjct: 133 APNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQGQD---------KKYLKTIATP 183
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH A V + +R+HF+A V+ D+ ET+L F+ + +G A SVMC+YNR+ G
Sbjct: 184 KHLA---VHSGPEPERHHFNALVSNYDLNETYLPHFKKSIMKGKAYSVMCAYNRLRGKAC 240
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
C LL +R +W G +V+DC ++ + ++HK + DS E A A + +G DL+CG
Sbjct: 241 CGHDTLLTDILRNKWGFEGIVVSDCWAVYDIFNSHK-IVDSPEKAAALAVSSGTDLECGN 299
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENI 402
+ + NA + G + E +ID +L+ + +LG FD P+ VS + D + + N
Sbjct: 300 TFLSLK-NAYRDGLITEKEIDSALRRVLLARFKLGMFD-PPEIVSYSQIDESYLDNSYNR 357
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
E+A EAAR+ IVLLKND LPL+S+ + +AV+GP+A+ +++GNY G P Y++P+
Sbjct: 358 EIALEAARKSIVLLKNDNKLLPLDSS-INKIAVIGPNADNLESLLGNYHGFPSEYITPLQ 416
Query: 463 GFSGY---ANVTYKTGCD 477
V Y+ GCD
Sbjct: 417 AIRRVLKNGEVFYEKGCD 434
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 140/299 (46%), Gaps = 56/299 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + A +DA I+ GL +E E+L DR L LP Q +LI ++ K
Sbjct: 591 AYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKIHSTGK- 649
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PVILV+++ G + + + NI AIL A YPG+ GGRAI DV++GK+NP G+LP+T Y
Sbjct: 650 PVILVLLNGGPISTVWE--SENIPAILEAWYPGQAGGRAITDVIWGKYNPSGKLPVTIYK 707
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
+ +P P ++ GRTY+++ G LYPFG+GL+YT + + +
Sbjct: 708 SE-------NDLP--PFENYDMEGRTYRYFKGEVLYPFGWGLNYTDITISNIELS----- 753
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
N+++ +D V +N G+ G + V +Y+K
Sbjct: 754 --------------------------ANEIKDNDTIRVVVKLKNNGNLAGEETVQLYTK- 786
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
A IK + GF+++ + G ++F + VD +P G + I VG
Sbjct: 787 -ALKDNRTIKTLRGFEKIKLEPGTEGMVEFYLSKSDLAVWVDGLGFETMP-GVYEIIVG 843
>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
CL09T03C04]
Length = 788
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 238/817 (29%), Positives = 369/817 (45%), Gaps = 151/817 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
L+ + P RV+DL+S+MTL+EK Q+ +G R+ LPQ W +E G+ N
Sbjct: 42 LYENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQNNWKTEVWKDGIGN 100
Query: 109 VG---------------------------------------PGTHFDDVIPG-----ATS 124
+ P ++ I G AT
Sbjct: 101 IDEEHNGLGAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATY 160
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP A++N+ L +IG+ + EA A LG + +SP +++A+DPRWGR ET
Sbjct: 161 FPAQCGQGATWNKKLIARIGEVEAKEAVA---LGYTNI--YSPILDIAQDPRWGRCVETY 215
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP++VG + LQ +L + P KH+A Y + +
Sbjct: 216 GEDPYLVGELGKQMITSLQKY-------NLVATP-------KHFAVYSIPIGGRDGKTRT 261
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
D V ++M ++ PF M +E A VM SYN +G P L + +R EW G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
Y+V+D ++++ + + HK +AD+ ED +AQ + AGL++ T+FT AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREG 412
GK+ + +DK + + + RLG FD Y GKQ + S E+ ++ EAAR+
Sbjct: 376 DNGKISQETLDKRVAEILRIKFRLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYAN 469
+VLLKN+ N LPL S ++++AV+GP+AN +I Y A P + + I +A
Sbjct: 434 LVLLKNETNLLPL-SKSIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHAE 492
Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAES 514
V YK GCD + + + A AAK A+ + +L G +L+V E
Sbjct: 493 VIYKKGCDIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVR-ED 551
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q +L+ V K PVILV++ I +A ++ AIL A +PGE
Sbjct: 552 RSRTSLNLPGRQEELLKAVCATGK-PVILVMLDGRASSINYAA--AHVPAILHAWFPGEF 608
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
G+A+A+ +FG +NPGGRL +T+ V +P + P +P Y L
Sbjct: 609 CGQAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSSTSVYG-----AL 660
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFG+GLSYT F Y+ L + + Q ++ D +
Sbjct: 661 YPFGHGLSYTTFTYSDLHISPSHQ-----------------------------GVQGDIH 691
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
K+ +N G G +VV +Y + TY K + GF+R+ ++AG + + F
Sbjct: 692 VSCKI--KNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP- 748
Query: 755 KSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
+ L + D N + G + +G +H F
Sbjct: 749 QDLGLWDKNMNFRVEPGSFKVMLGASSTDIRLHGQFE 785
>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
Length = 861
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 180/467 (38%), Positives = 252/467 (53%), Gaps = 47/467 (10%)
Query: 45 LGLQMSSFLF-CDSSLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
LG+ S LF C LPY R +DL+ R+TL+EKV + + + +PRLG+ +Y
Sbjct: 9 LGVCSLSLLFSCAQKLPYQDTSLTAEQRAEDLLPRLTLEEKVALMQNASPAIPRLGIKEY 68
Query: 97 EWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN 156
+WW+EALHGV G AT FP I ASFN+SL ++ AVS EAR
Sbjct: 69 DWWNEALHGVGRAGL----------ATVFPQSIGMGASFNDSLLYEVFDAVSDEARVKSR 118
Query: 157 -------LGR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
L R GLT+W+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E
Sbjct: 119 IFSENGVLKRYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQLGMAVVRGLQGPE-- 176
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKE 267
N + K+ +C KH+A + W +R+ FDA +T +D+ ET+L F+ V++
Sbjct: 177 ------NGKYDKLHACAKHFAVHSGPEW---NRHSFDAENITPRDLWETYLPAFKDLVQK 227
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV--DNHKFLAD 325
D VMC+YNR G P C +LL Q +R EW G +V+DC +I H D
Sbjct: 228 ADVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD 287
Query: 326 SKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
KE A A + +G DL+CG Y + +AV+ G + E ID SLK L T LG D
Sbjct: 288 -KEHASAGAVLSGTDLECGGEYGSL-ADAVKAGLIDEKQIDVSLKRLLTARFELGEMDEQ 345
Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
P + + + S E+ +LA ARE +VLL+N + LPLN+ VAV+GP+AN +V
Sbjct: 346 PAWAEIPASTLNSKEHQDLALRMARESLVLLQNKNDILPLNTD--LKVAVMGPNANDSVM 403
Query: 446 MIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIFA 489
GNY GIP ++ + V Y+ GCD + ++ S+F+
Sbjct: 404 QWGNYNGIPGHTVTLLEAVRSKLPEGQVMYEPGCDRTSREALQSLFS 450
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 120/290 (41%), Gaps = 55/290 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A E K AD + G+ S+E E + DR D+ LP Q + + + K
Sbjct: 591 AVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQR---DLLKALKKA 647
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
+V ++ G I + +AIL YPG+ GG AI DV+FG +NP GRLP+T+Y
Sbjct: 648 GKKVVFINYSGSAIGLVPESNTCEAILQGWYPGQAGGTAIVDVLFGDYNPAGRLPVTFYK 707
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
D Q+ ++ GRTY++ L+PFG+GLSYT F Y +K
Sbjct: 708 -DAGQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEADLSK---- 754
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
N D + N G DG +VV VY +
Sbjct: 755 -----------NTIGDGGTVT----------------LTIPVSNAGQRDGDEVVQVYLRC 787
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
A+ + + F+RV + AG K++ +S D A NT+ P
Sbjct: 788 MADKEGPHYT-LRAFKRVHIPAGETKQVTIPLT-YESFEWFDTATNTVHP 835
>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
Length = 868
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 171/426 (40%), Positives = 242/426 (56%), Gaps = 39/426 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R LV++MTL EKV Q+ + A +PRLG+P Y+WWSE LHG++ G AT
Sbjct: 32 RAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGLHGIARNG----------YAT 81
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNL---GRA-----GLTYWSPNINVARDP 175
FP I AS++ SL +G +STEARA +N GRA GLT WSPNIN+ RDP
Sbjct: 82 VFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRAHGLFQGLTLWSPNINIFRDP 141
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGR ET GEDP++ G+ AV +VRG+Q D P +++ KH+ A+ +
Sbjct: 142 RWGRGQETYGEDPYLTGQLAVAFVRGIQG--------DDPQHPRAIATP-KHFVAH---S 189
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
R FD V+ D+E+T+L F V +G A SVMC+YN ++G P+CA+ LL+
Sbjct: 190 GPEAGRDSFDVDVSPHDLEDTYLPAFRTAVVDGHAGSVMCAYNALHGTPACANAGLLDTR 249
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
+R +W GY+V+DCD++ + H F D + +VA ++AG DLDCG Y + AV
Sbjct: 250 LRKDWGFAGYVVSDCDAVGDIASYHYFKPDDVQASVA-AVQAGTDLDCGHTYASLA-QAV 307
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREGI 413
+QG + E+ +D SL L+T RLG G+ Y +G I S + +LA +AA E +
Sbjct: 308 RQGDIAESALDASLVRLFTARYRLGELGSRGNDPYARIGADQIDSPAHRKLALQAALESL 367
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANV 470
VLLKN +TLPL++ +AV+GP A+A + NY G ++P+ G G +V
Sbjct: 368 VLLKNAHSTLPLHAG--MRLAVIGPDADALETLEANYHGTARHPVTPLQGLRARFGADHV 425
Query: 471 TYKTGC 476
Y G
Sbjct: 426 AYAQGA 431
Score = 142 bits (358), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 144/295 (48%), Gaps = 57/295 (19%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
ADA + GL VE E L DR D+ LP Q L+ + A + P+I+V++
Sbjct: 598 ADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKPLIVVLL 656
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
S V + +A+ + + AIL A YPG+ GG AIA V+ G +NPGGRLP+T+Y
Sbjct: 657 SGSAVALDWAQQHAD--AILAAWYPGQAGGTAIAQVLAGDYNPGGRLPVTFYR------- 707
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
++ L P S GRTY++++G LYPFGYGLSYT+F Y
Sbjct: 708 --STRDLPPYVSYAMQGRTYRYFDGRPLYPFGYGLSYTRFTYA----------------- 748
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAEIAA 725
P + L+ + + +N G G +VV VY PP+ +A
Sbjct: 749 --------------APTLSAATLKAGGTLQVSAEVRNAGQRAGDEVVQVYLDTPPSPLAP 794
Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
+ ++GF+R+ + AG + ++F A + L+ VD A + G++ +F+G G
Sbjct: 795 RH--ALVGFRRIHLAAGEQRLVRFTL-APRQLSSVDAAGARAVEPGQYRVFIGAG 846
>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 788
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 237/817 (29%), Positives = 368/817 (45%), Gaps = 151/817 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
L+ + P RV+DL+S+MTL+EK Q+ +G R+ LPQ W +E G+ N
Sbjct: 42 LYENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQNNWKTEVWKDGIGN 100
Query: 109 VG---------------------------------------PGTHFDDVIPG-----ATS 124
+ P ++ I G AT
Sbjct: 101 IDEEHNGLGAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATY 160
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP A++N+ L +IG+ + EA A LG + +SP +++A+DPRWGR ET
Sbjct: 161 FPAQCGQGATWNKKLIARIGEVEAKEAVA---LGYTNI--YSPILDIAQDPRWGRCVETY 215
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP++VG + LQ +L + P KH+A Y + +
Sbjct: 216 GEDPYLVGELGKQMITSLQKY-------NLVATP-------KHFAVYSIPIGGRDGKTRT 261
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
D V ++M ++ PF M +E A VM SYN +G P L + +R EW G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
Y+V+D ++++ + + HK +AD+ ED +AQ + AGL++ T+FT AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREG 412
GK+ + +DK + + + RLG FD Y GKQ + S E+ ++ EAAR+
Sbjct: 376 DNGKISQETLDKRVAEILRIKFRLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYAN 469
+VLLKN+ N LPL S ++++AV+GP+AN +I Y A P + + I +
Sbjct: 434 LVLLKNETNLLPL-SKSIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHTE 492
Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAES 514
V YK GCD + + + A AAK A+ + +L G +L+V E
Sbjct: 493 VIYKKGCDIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVR-ED 551
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q +L+ V K P+ILV++ I +A +I AIL A +PGE
Sbjct: 552 RSRTSLNLPGRQEELLKAVCATGK-PIILVMLDGRASSINYAA--AHIPAILHAWFPGEF 608
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
G+A+A+ +FG +NPGGRL +T+ V +P + P +P Y L
Sbjct: 609 CGQAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSSTSVYG-----AL 660
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFG+GLSYT F Y+ L + + Q ++ D +
Sbjct: 661 YPFGHGLSYTTFTYSDLHISPSHQ-----------------------------GVQGDIH 691
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
K+ +N G G +VV +Y + TY K + GF+R+ ++AG + + F
Sbjct: 692 VSCKI--KNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP- 748
Query: 755 KSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
+ L + D N + G + +G +H F
Sbjct: 749 QDLGLWDKNMNFRVELGSFKVMLGASSTDIRLHGQFE 785
>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 861
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 173/449 (38%), Positives = 246/449 (54%), Gaps = 39/449 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E ++R K+ +C
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +V+DC +I H+ D KE A A ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETYPD-KEHASAGAVRAGTDLE 303
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
CG Y + +AV+ G + E +ID SLK L T LG D + + + S E+
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKEHQ 362
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++ +
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ Y+ GCD V K+ S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 130/301 (43%), Gaps = 58/301 (19%)
Query: 495 KTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVI 542
K DA +IL G+ S+E E + DR D+ LP Q + + + K
Sbjct: 594 KVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKK 650
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
+V ++ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD-- 708
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
V LP + GRTY++ L+PFG+GLSYT F Y +K N
Sbjct: 709 VNQLP-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------N 755
Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
+ N+ T + NVG DG +VV VY + P +
Sbjct: 756 TIAKGENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGD 790
Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGV 782
+ F+RV + AG+ + + ++ D +NT+ P E T + GG
Sbjct: 791 KEGPRYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPL-EGTYELLYGGT 847
Query: 783 S 783
S
Sbjct: 848 S 848
>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
CL03T12C18]
Length = 861
Score = 290 bits (741), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 172/449 (38%), Positives = 245/449 (54%), Gaps = 39/449 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E +++ K+ +C
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DTKYDKLHACA 187
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +V+DC +I H D KE A A ++ G DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAAAVRTGTDLE 303
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
CG Y + +AV+ G + E +ID SLK L T LG D P + + + S E+
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPASVLNSKEHQ 362
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++ +
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ Y+ GCD V K+ S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 127/297 (42%), Gaps = 56/297 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
AD + G+ S+E E + DR D+ LP Q N + + K +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKAGKKVVFI 654
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y V L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
P + GRTY++ L+PFG+GLSYT F Y +K N +
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSK------NTIAK 759
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
N+ T + NVG DG +VV VY + P +
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+ F+RV + AG+ + + ++ D +NT+ P E T + GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPL-EGTYELLYGGTS 848
>gi|380509734|ref|ZP_09853141.1| beta-glucosidase-related glycosidase [Xanthomonas sacchari NCPPB
4393]
Length = 883
Score = 290 bits (741), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 178/445 (40%), Positives = 253/445 (56%), Gaps = 41/445 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+S + R LV++MTL+EK Q+ + A + RLG+P Y+WW+EALHGV+ G
Sbjct: 24 WQDTSASFEARAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEALHGVARAGQ-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-------GR-AGLTYW 165
AT FP I A+F+ L ++ +S EARA ++ GR GLT+W
Sbjct: 82 --------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLREGAHGRYQGLTFW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDP++ R V +V+GLQ D R K+ +
Sbjct: 134 SPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVQGLQ-------GDDPVYR--KLDATA 184
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + DR+HFDAR +++D+ +T+L FE VKEG +VM +YNRV G +
Sbjct: 185 KHFA---VHSGPEADRHHFDARPSKRDLYDTYLPAFEALVKEGKVDAVMGAYNRVYGESA 241
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A LL +R +W GY+V+DC +I V + H LA S+E A A +K G +L+CGQ
Sbjct: 242 SASQFLLRDVLRRDWGFTGYVVSDCWAI-VDIWKHHHLAPSREAAAALAVKNGTELECGQ 300
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE---NI 402
Y AV+QG + E +ID ++ L+T MRLG FD P+ V + ++ +
Sbjct: 301 EYATLPA-AVRQGLIGEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQVPAHD 358
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA +AA+E +VLLKND LPL S +K +AVVGP A+ T+A++GNY G P ++ +
Sbjct: 359 ALALQAAQESLVLLKND-GVLPL-SRTLKRIAVVGPTADDTMALLGNYFGTPAAPVTILQ 416
Query: 463 GFSGYAN---VTYKTGCDDVACKSN 484
G A V Y G D V + +
Sbjct: 417 GIRDAAKGIEVRYARGVDLVEGRDD 441
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 94/313 (30%), Positives = 142/313 (45%), Gaps = 54/313 (17%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A +AA+ AD + + GL VE E + DR DL LP Q L+ + K
Sbjct: 608 ALDAARNADVVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDLRLPAPQRALLEALHATGK- 666
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV++V+ GG +A ++ AIL + YPG+ GG A+ +FG+ NP GRLP+T+Y
Sbjct: 667 PVVMVLT--GGSALAVDWAQAHLPAILMSWYPGQRGGTAVGQALFGEVNPAGRLPVTFYR 724
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
D Q LP D GRTY+++ G LYPFG+GLSYT+F Y L
Sbjct: 725 AD--QALPA-------FDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGKLHL------ 769
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
DA + + +D R + +V+ N G G +V +Y +
Sbjct: 770 ---------------DAPR------IADDGR----LKLQVEVANTGKRAGDEVAQLYVRR 804
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
A + + GFQRV + G + + F +A ++L D A ++PAG + + +G
Sbjct: 805 LAAAPGDAQQTLRGFQRVHLAPGERRTLTFELDAQQALRQYDDARGAYVVPAGRYEVRIG 864
Query: 779 NGGVSFPIHLNFN 791
+ F
Sbjct: 865 GSSADARVRAGFT 877
>gi|94969405|ref|YP_591453.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
gi|94551455|gb|ABF41379.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
Length = 902
Score = 289 bits (740), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 171/439 (38%), Positives = 244/439 (55%), Gaps = 41/439 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ D++ P + R DLV RMTLDEK QL D+A +PRLG+P Y+ WSEALHGV+ G
Sbjct: 37 VYRDATRPANERAHDLVQRMTLDEKAAQLEDWATAIPRLGVPDYQTWSEALHGVARAG-- 94
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTY 164
AT FP I A+++ + K++G +STEAR YN + GLT+
Sbjct: 95 --------HATVFPQAIGMAATWDTEMVKQMGDVISTEARGKYNEAQREGNHRIFWGLTF 146
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPNIN+ RDPRWGR ET GEDPF+ G+ + ++ G+Q + + P K +
Sbjct: 147 WSPNINIFRDPRWGRGQETYGEDPFLTGKMGIAFIDGVQGPDA--------AHP-KAVAT 197
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + R+ FD +V+ +D+EET+L F V +G SVMC+YN V+G+
Sbjct: 198 SKHFA---VHSGPESLRHGFDVKVSPRDLEETYLAAFRATVTDGHVKSVMCAYNAVDGMG 254
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+CA+ LL + ++ W G++V+DC +I + HK D A A +L AG DL C
Sbjct: 255 ACANKMLLEEHLKQAWGFKGFVVSDCGAIMDVTQGHKNAPDIVH-AAAISLAAGTDLSCS 313
Query: 345 QYYTNFT--GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
+ F +AV++G V E + ++ + LY LG FD GS + + S+E
Sbjct: 314 IWEPGFNTLADAVRKGLVTEDMVTRAAERLYAARFELGMFDEPGSNPNDKIDMSQVASEE 373
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ A +AA E IVLLKND LPL +A KT+AV+GP A ++ GNY G P R ++P
Sbjct: 374 HRAEALKAAEESIVLLKND-GLLPLKNA--KTIAVIGPTAELLASLEGNYNGQPVRPVTP 430
Query: 461 IAGFS---GYANVTYKTGC 476
+ G G NV Y G
Sbjct: 431 LDGIVKQFGAENVRYAQGS 449
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 133/281 (47%), Gaps = 51/281 (18%)
Query: 503 LAGLDLSVEAESL---DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
L G ++ ++ E DR + LP Q +L+ + K PV++V +S V + +A N
Sbjct: 645 LEGEEMPIKIEGFSGGDRTSIDLPATQEKLLEALGAAGK-PVVVVNLSGSAVALNWA--N 701
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDS 618
+ AIL A YPG EGG AIA + G+ NP GRLP+T+Y VQ LP T ++
Sbjct: 702 QHAGAILQAWYPGVEGGTAIAKTLAGESNPAGRLPVTFYAS--VQDLPAFTEYAMK---- 755
Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
RTY++Y G L+ FG+GLSY+ FKY + T + DA K
Sbjct: 756 ----NRTYRYYAGKPLWGFGFGLSYSTFKYGEVKLAST----------------SVDAGK 795
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
+ V V N G +VV Y K P + ++ ++GFQRV
Sbjct: 796 SLTATVTVT---------------NTSQVAGDEVVEAYLKTPQKGGPSH--SLVGFQRVP 838
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ G ++ + + +SL+ VD + + AGE+ + +G+
Sbjct: 839 LNPGESREVAIEVSP-RSLSAVDDSGKRSILAGEYRLSIGS 878
>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
XB1A]
Length = 861
Score = 289 bits (740), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 173/449 (38%), Positives = 246/449 (54%), Gaps = 39/449 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E ++R K+ +C
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +V+DC +I H+ D KE A A ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASAGAVRAGTDLE 303
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
CG Y + +AV+ G + E +ID SLK L T LG D + + + S E+
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKEHQ 362
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++ +
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ Y+ GCD V K+ S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 130/301 (43%), Gaps = 58/301 (19%)
Query: 495 KTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVI 542
K DA +IL G+ S+E E + DR D+ LP Q + + + K
Sbjct: 594 KVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKK 650
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
+V ++ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD-- 708
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
V LP + GRTY++ L+PFG+GLSYT F Y +K N
Sbjct: 709 VNQLP-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------N 755
Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
+ N+ T + NVG DG +VV VY + P +
Sbjct: 756 TIAKGENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGD 790
Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGV 782
+ F+RV + AG+ + + ++ D +NT+ P E T + GG
Sbjct: 791 KEGPRYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPL-EGTYELLYGGT 847
Query: 783 S 783
S
Sbjct: 848 S 848
>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
Length = 861
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 172/451 (38%), Positives = 246/451 (54%), Gaps = 43/451 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE--GHENATDLNSRPLKVSS 223
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E G++ K+ +
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYD----------KLHA 185
Query: 224 CCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
C KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIAPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLD 340
P C +LL Q +R EW G +V+DC +I H+ D KE A A ++ G D
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASAAAVRTGTD 301
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
L+CG Y + +AV+ G + E +ID SLK L T LG D P + + + S E
Sbjct: 302 LECGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWAEIPTSVLNSKE 360
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ LA ARE +VLL+N N LPLN+ +AV+GP+AN +V GNY GIP ++
Sbjct: 361 HQALALRMARESLVLLQNKNNILPLNTN--LKIAVMGPNANDSVMQWGNYNGIPAHTVTL 418
Query: 461 IAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ + Y+ GCD V K+ S+F
Sbjct: 419 LEAVRAKLPEGQIIYEPGCDRVDRKTLQSLF 449
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 130/301 (43%), Gaps = 58/301 (19%)
Query: 495 KTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVI 542
K DA +IL G+ S+E E + DR D+ LP Q + + + K
Sbjct: 594 KVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKK 650
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
+V ++ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD-- 708
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
V LP + GRTY++ L+PFG+GLSYT F Y +K N
Sbjct: 709 VNQLP-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------N 755
Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
+ N+ T + NVG DG +VV VY + P +
Sbjct: 756 TIAKGENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGD 790
Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGV 782
+ F+RV + AG+ + + ++ D +NT+ P E T + GG
Sbjct: 791 KEGPRYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDAESNTMRPL-EGTYELLYGGT 847
Query: 783 S 783
S
Sbjct: 848 S 848
>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
Length = 614
Score = 289 bits (740), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 201/630 (31%), Positives = 302/630 (47%), Gaps = 65/630 (10%)
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR E P EDP + G + Y GLQ E +SR KV
Sbjct: 11 SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQGE--------DSRYTKVVVTL 62
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+ AY +++ G R++FDA+V+ + +T+ F V EG+A VMCSYN +NG P+
Sbjct: 63 KHWDAYSLEDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVMEGNAKGVMCSYNALNGRPT 122
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
C P LL + +R W GY+ +D +I+ + H + A++ A D+D G
Sbjct: 123 CTHP-LLTKVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASAAVAAALRDGRCDMDSGA 181
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIE 403
Y + +AV G+ D+D++L + LG FD Y + I + +
Sbjct: 182 VYHDALLDAVNSGECSMDDVDRALYNTLKLRFELGLFDPIEDQPYWRINASSINTTYAQD 241
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR------Y 457
L + E ++LL+N N LP K + VAV+GPH NA A++GNY G C
Sbjct: 242 LNMKITLESMILLQNHNNALPFK--KGRKVAVIGPHINAQEALVGNYLGQLCPDDSFDCI 299
Query: 458 MSPIA---GFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
SP+A +G +N G +AC ++ SI A AK AD ++L G++ ++EAES
Sbjct: 300 TSPLAAIEAINGMSNTVSAMGSGVLAC-TDASIQEAVNVAKDADYVVLLIGINDTIEAES 358
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
DR + LP Q +L +A + K ++I GG+ +A + + AI+ AGYPG
Sbjct: 359 NDRTSIDLPQCQHKLTAAIAHLNKTTAAVLI--NGGM-LAIEQEKKQLPAIIEAGYPGFY 415
Query: 575 GGRAIADVVFGKFNP-GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG AIA +FG N GG+LP T Y DY+ + ++ M + PGR+Y++Y G
Sbjct: 416 GGAAIAKTIFGDNNHLGGKLPYTVYPADYIHKINMSDMEMT-----NSPGRSYRYYTGQP 470
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFG+GL+YT F + ++ P +
Sbjct: 471 LWPFGFGLAYTTF-----------------------------SVQSPGPSASTFATGSNT 501
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPA--EIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
F V N G G VV VY P + + + KQ+I F+RV + + +
Sbjct: 502 SFSLPVHVVNTGKRTGDTVVQVYMAPVSLPHRSFSLKKQLIAFERVHLTPNQRLGVTIPL 561
Query: 752 NACKSLNIVD-YAANTLLPAGEHTIFVGNG 780
+A N+VD N + G + + V +G
Sbjct: 562 SA-DVFNMVDPVTGNVVSTPGSYRLVVSDG 590
>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
3_8_47FAA]
Length = 861
Score = 289 bits (739), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 173/449 (38%), Positives = 245/449 (54%), Gaps = 39/449 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E ++R K+ +C
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +V+DC +I H D KE A A ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGAVRAGTDLE 303
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
CG Y + +AV+ G + E +ID SLK L T LG D + + + S E+
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKEHQ 362
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++ +
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ Y+ GCD V K+ S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/297 (29%), Positives = 129/297 (43%), Gaps = 56/297 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
AD + G+ S+E E + DR D+ LP Q L+ + +V K +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGKK---VVFI 654
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y V L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
P + GRTY++ L+PFG+GLSYT F Y +K N +
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------NTIAK 759
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
N+ T + NVG DG +VV VY + P +
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+ F+RV + AG+ + + ++ D +NT+ P E T + GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAISLTG-ENFEWFDVESNTMRPL-EGTYELLYGGTS 848
>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
Length = 861
Score = 289 bits (739), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 173/449 (38%), Positives = 245/449 (54%), Gaps = 39/449 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E ++R K+ +C
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +V+DC +I H D KE A A ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGAVRAGTDLE 303
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
CG Y + +AV+ G + E +ID SLK L T LG D + + + S E+
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKEHQ 362
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++ +
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ Y+ GCD V K+ S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 57/290 (19%)
Query: 495 KTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVI 542
K DA +IL G+ S+E E + DR D+ LP Q + + + K
Sbjct: 594 KVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKK 650
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
+V ++ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y
Sbjct: 651 VVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD-- 708
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
V LP + GRTY++ L+PFG+GLSYT F Y +K N
Sbjct: 709 VNQLP-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSK------N 755
Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
+ N+ T + NVG DG +VV VY + P +
Sbjct: 756 TIAKGENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGD 790
Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
+ F+RV + AG+ + + +S D A NT+ P +
Sbjct: 791 KEGPRYT-LRAFKRVHIPAGKTESVAISLTH-ESFEWFDEATNTMHPVAD 838
>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 861
Score = 288 bits (738), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 174/449 (38%), Positives = 244/449 (54%), Gaps = 39/449 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------LGR-AGLTYW 165
AT FP I ASFN+SL ++ A S EAR L R GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E ++R K+ +C
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +V+DC +I H D KE A A ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGAVRAGTDLE 303
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
CG Y + +AV+ G + E +ID SLK L T LG D + + + S E+
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKEHQ 362
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++ +
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420
Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ Y+ GCD V K+ S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 127/297 (42%), Gaps = 56/297 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
AD + G+ S+E E + DR D+ LP Q + + + K +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKKVVFI 654
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y V L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
P + GRTY++ L+PFG+GLSYT F Y +K N +
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------NTIAK 759
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
N+ T + NVG DG +VV VY + P +
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+ F+RV + AG+ + + ++ D +NT+ P E T + GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPL-EGTYELLYGGTS 848
>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 851
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 169/430 (39%), Positives = 252/430 (58%), Gaps = 41/430 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + + P RV DL+SR+T++EK+ L + G+PRLG+ +Y +EALHGV V PG
Sbjct: 27 LYKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 84
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN---LGRAG-------L 162
T FP I A++N L +++ +S EARA +N GRA L
Sbjct: 85 RF--------TVFPQAIGLAATWNPELQRRVATVISDEARARWNELDQGRAQKEQFSDVL 136
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G +V+GLQ + H LK+
Sbjct: 137 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPHY---------LKIV 187
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E + FEMCVKEG A+S+M +YN +N
Sbjct: 188 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALND 243
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK+L +KE A +LKAGLDL+
Sbjct: 244 VPCTLNAWLLQKVLRKDWGFQGYVVSDCGGPALLVNAHKYLK-TKEAAATLSLKAGLDLE 302
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y NA +Q V + DID + ++ T M+LG FDG + Y + I S
Sbjct: 303 CGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDGVERNPYTKISPSVIGSK 362
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ ++A +AAR+ IVLLKN +N LPLN++K+K++AVVG NA G+Y+G P +
Sbjct: 363 EHQQIALDAARQCIVLLKNQKNMLPLNASKLKSIAVVG--INAGKCEFGDYSGAPV--VE 418
Query: 460 PIAGFSGYAN 469
P++ G N
Sbjct: 419 PVSILQGIRN 428
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 90/294 (30%), Positives = 144/294 (48%), Gaps = 56/294 (19%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + I + G++ S+E E DR D+ LP Q + + ++ +V +++++
Sbjct: 595 AGKAVRECETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSNMIVILV---A 651
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
G +A + ++ AI+ A YPGE+GG A+A+V+FG +NP GRLP+T+Y L
Sbjct: 652 GSSLAINWMDEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-------LD 704
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTYK++ G LYPFGYGLSY+ FKY
Sbjct: 705 ELP--PFDDYDITKGRTYKYFKGEVLYPFGYGLSYSSFKY-------------------- 742
Query: 669 NLNYTSDASKTRCPGVLVNDLRC-DDYFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAA 725
+DLR D+ E V F +N G +G +V VY + P
Sbjct: 743 ------------------SDLRVKDEADEVAVSFRLKNTGKRNGDEVTQVYVRIPETGGI 784
Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
+K++ GF+RV +++G ++R++ N + L D ++P G I VG
Sbjct: 785 VPVKELKGFRRVPLKSGESRRVEIRLNK-EQLRYWDVGKGQFVVPKGTFDIMVG 837
>gi|431798021|ref|YP_007224925.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
gi|430788786|gb|AGA78915.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
DSM 17526]
Length = 906
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 167/423 (39%), Positives = 243/423 (57%), Gaps = 34/423 (8%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
F F D + RV LV +M+L+EKV Q+ + + +PRL +P+Y WW+E LHGV+ G
Sbjct: 50 FSFLDMEKNFEERVDILVDQMSLEEKVSQMMNASPAIPRLKVPEYNWWNECLHGVARAGY 109
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-----NLGRA---GLT 163
AT FP I ASF+++L K IG +S EARA + N R GL
Sbjct: 110 ----------ATVFPQSISVAASFDKNLMKDIGSVISDEARAKHHEFIRNGKRGIYTGLD 159
Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
+WSPNIN+ RDPRWGR ET GEDP++ G A ++ GLQD +G + LK +
Sbjct: 160 FWSPNINIFRDPRWGRGHETYGEDPYLTGELASQFIEGLQDSDG---------KYLKTIA 210
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
KH+A V + R+ FD V+++D+ ET+L F VKE S+M +YNR G
Sbjct: 211 TSKHFA---VHSGPEPLRHTFDVDVSDRDLYETYLPAFRKTVKEAKVYSIMGAYNRFRGE 267
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
LLNQ +R +W GY+V+DC +IQ + HK +A + +A A + G DL+C
Sbjct: 268 SCSGHDFLLNQLLREQWGFEGYVVSDCGAIQDIHTGHK-IASTAAEAAAIGVSGGCDLNC 326
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
G YYT+ T AV +G + E +ID ++K L+ RLG FD Y + +CS+ +
Sbjct: 327 GNYYTHLT-EAVAEGLISEEEIDIAVKRLFLARFRLGMFDPEEAVSYAQIPFGIVCSEAH 385
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
LA +AA++ +VLLKN +N LPL+ K+K +AV+GP+A+ +++GNY GIP + ++ +
Sbjct: 386 NTLARQAAQKSMVLLKNQKNLLPLSVDKIKRIAVIGPNADNVESLLGNYHGIPKKPVTFL 445
Query: 462 AGF 464
G
Sbjct: 446 DGI 448
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/304 (33%), Positives = 149/304 (49%), Gaps = 55/304 (18%)
Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVA 534
+ I A AK+AD +++ GL +E ES+D R + LP Q L+ V
Sbjct: 615 SKIDEAVAMAKSADLAVVVLGLSQRLEGESMDVVTPGFDRGDRTAITLPAQQEALLKAVK 674
Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
E K PVILV+ + + I +A+ N + AI+ AGYPGEEGG A+ADVVFG +NP GRLP
Sbjct: 675 ETGK-PVILVLNAGSAMAINWAKEN--VDAIISAGYPGEEGGNALADVVFGDYNPAGRLP 731
Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
IT+Y V+ LP P + GRTY+++ G LYPFGYGLSYT+F Y L
Sbjct: 732 ITYYQS--VEDLP-------PFEDYDMKGRTYRYFEGKPLYPFGYGLSYTRFSYKDLEVP 782
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
+ D + V N+GS G +VV
Sbjct: 783 AKVNAG--------------------------------DPVQISVTVTNIGSRAGDEVVQ 810
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
+Y I+Q+ GFQR+ ++ G +K + F +A + L++++ + ++ G +
Sbjct: 811 LYLNDKEASTMRPIRQLEGFQRIHLKPGESKVVNFTLSA-RQLSMINGESKRVIEEGVFS 869
Query: 775 IFVG 778
I VG
Sbjct: 870 IHVG 873
>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
CL03T00C23]
gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
CL03T12C37]
Length = 801
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 234/803 (29%), Positives = 369/803 (45%), Gaps = 147/803 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
++ DS P +RV++L+S+MTL+EK Q+ +G R+ LP W +E G+ N
Sbjct: 55 IYEDSCAPLEVRVQNLLSQMTLEEKSCQMATL-YGSGRVLNDALPSDNWKNEVWKDGIGN 113
Query: 109 VG---------------PGTHF------------------------DDVIPG-----ATS 124
+ P H ++ I G AT
Sbjct: 114 IDEEHNGLGSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATY 173
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP A++N+ L +IG+A EAR LG + +SP +++A+DPRWGR ET
Sbjct: 174 FPAQCGQGATWNKELIAQIGEA---EAREASVLGYTNI--YSPILDIAQDPRWGRCVETY 228
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP+ G+ + LQ K+ S KH+A Y + +
Sbjct: 229 GEDPYHAGQMGKQMILSLQKN--------------KLVSTPKHFAVYSIPVGGRDGKTRT 274
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
D V ++M +L PF + E A VM SYN +G P L + +R EW G
Sbjct: 275 DPHVAPREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 334
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
Y+V+D ++++ + H+ +A+ EDAVAQ + AGL++ T+FT +AV
Sbjct: 335 YVVSDSEAVEFISTKHQ-VANGYEDAVAQAVNAGLNIR-----THFTPPADFILPLRSAV 388
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY-VSLGKQDICSDENIELAAEAAREGIV 414
++GK+ + +++ + + V LG FD + Q + S E+ +LA EAAR+ +V
Sbjct: 389 KKGKISQETLNQRVAEILRVKFWLGLFDNPYRGDEKRAGQIVHSPEHQQLALEAARQSLV 448
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVT 471
LLKN+ TLPL S +++VAV+GP+A+ +I Y + G A+V
Sbjct: 449 LLKNEHQTLPL-SKSIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVV 507
Query: 472 YKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAESLD 516
YK GCD + A + + A EAAK A+ T+ +L G +L+V E
Sbjct: 508 YKKGCDIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVR-EDRS 566
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
R L LPG Q +L+ ++ ++ K PV+LV++ I FA T++ AI+ A +PGE GG
Sbjct: 567 RTSLDLPGRQEELLKKICQLGK-PVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGG 623
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
+AIA+ +FG +NPGGRL +T+ V +P + P +P Y LYP
Sbjct: 624 QAIAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSETSVYG-----ALYP 675
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FG+GLSYT F+Y+ L + + Q GV N
Sbjct: 676 FGHGLSYTTFQYSDLVISPSKQ------------------------GVQGN-------IS 704
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N+G +G +VV +Y + TY + + GF+R+ ++ + + F +
Sbjct: 705 ISCTIKNIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QE 763
Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
L I D N + G + +G+
Sbjct: 764 LGIWDKQMNFTVEPGMFKVMIGS 786
>gi|391417909|gb|AFM44649.1| Xyl3A [Caldanaerobius polysaccharolyticus]
Length = 789
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 232/825 (28%), Positives = 368/825 (44%), Gaps = 162/825 (19%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLG---------------DFAHGVPR 90
G S L+ D++ P RV+DL+SRMTLDEK+ QL D A + +
Sbjct: 3 GNSKESALYLDATQPVEKRVEDLLSRMTLDEKIAQLSSVWVYELLDNMEFSVDKAKDLLK 62
Query: 91 LGLPQYEWWSEALHGVSNVGPG------------------------THFDD----VIPGA 122
G+ Q + G SN+GP H + + GA
Sbjct: 63 DGIGQIT----RIGGASNLGPKESAQLANEIQRYLIENTRLGIPALVHEESCSGYMAKGA 118
Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
T FP I +++N L K++G + + +A+ +P ++VARD RWGR+ E
Sbjct: 119 TCFPQTIGVASTWNTELVKQMGSVIREQMKAV-----GAHQALAPLMDVARDARWGRVEE 173
Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD----NWKG 238
T GEDP+++ V+Y+ GLQ N D + + KH+ Y NW
Sbjct: 174 TFGEDPYLISEMGVSYIEGLQG----GNIKD------GIMATVKHFVGYGFSEGGMNWA- 222
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
A + E+++ E FL PFE VK+ +SVM +Y+ ++GIP KLL Q +R
Sbjct: 223 ------PAHIPERELREVFLLPFEAAVKKAKTASVMAAYHELDGIPCHGSKKLLTQILRN 276
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGN 353
EW G +V+D + ++ + H +A K +A L+AG+D+ DC Y
Sbjct: 277 EWGFDGLVVSDYFGVNMLYEYH-HVARDKGEAAKIALQAGVDIELPSRDC---YGQPLKE 332
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGI 413
AVQ+G V+E ID+ ++ + + G F+ V + + + +LA + A++ I
Sbjct: 333 AVQKGLVEEALIDEVVRRILRMKFLSGVFENPYVDVEKAAEVFDTPDQRKLAYKLAQQSI 392
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS-------------- 459
VLLKN + LPL +K++AV+GP+A++ +IG+YA PC S
Sbjct: 393 VLLKNQGDLLPLKK-DIKSIAVIGPNADSVRNIIGDYA-YPCHIESLVETKEQSNVFNTP 450
Query: 460 ------------PIAGF--------SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
PI S + Y GC+ V A EAAK +D
Sbjct: 451 VPDKVSLVDNFVPIKSILEGIKGKISPETELHYAKGCE-VTGDDKGGFAEAIEAAKKSDV 509
Query: 500 TIILAG-----LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
I++ G D ES DR DL LPG Q +L+ + P ++V+++ + I
Sbjct: 510 AIVVVGDKAGLTDDCTSGESRDRADLNLPGVQQELVEAIYNTGT-PTVVVLVNGRPLSIN 568
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+ + +I AI+ A PGEEG A+ADV+FG +NPGG+LP+++ V +P+ +
Sbjct: 569 W--ISRHIPAIIEAWLPGEEGAAAVADVLFGDYNPGGKLPVSFPRS--VGQVPVY-YNHK 623
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P + Y + LYPFGYGLSYT+F+++ L +
Sbjct: 624 PSGGRSHWKGDYVEMSTKPLYPFGYGLSYTKFEFSNLEIAPS------------------ 665
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
++ D VD QN G +G +VV +Y + +K++ GF
Sbjct: 666 -------------EVYDDGKVRISVDVQNAGKLEGDEVVQLYVRNEVSNVTRPVKELKGF 712
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+RV +R G K++ F + + L D ++ G + +G+
Sbjct: 713 KRVSLRPGEKKKVVFELSVSQ-LGFYDEDMRYVVQPGTVKVMIGS 756
>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
Length = 898
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 173/440 (39%), Positives = 243/440 (55%), Gaps = 39/440 (8%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ D++ + R DLVSRMTL EKV Q+ + A +PRLG+P Y+WW+EALHGV+ G
Sbjct: 42 LYLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEALHGVARAGE- 100
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------LGR-AGLTY 164
AT FP I A+F+ +L A+S EARA YN GR GLT+
Sbjct: 101 ---------ATVFPQAIGLAATFDPALLHHEATAISDEARAKYNDFQRRGMRGRYEGLTF 151
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPN N+ RDPRWGR ET GEDP++ R V +VRGL EG + K+ +
Sbjct: 152 WSPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGL---EGDDPTYQ------KLDAT 202
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + +R+ FD +E+D+ ET+L F+ V++G +VM +YNRV+G+P
Sbjct: 203 AKHFA---VHSGPESERHRFDVHPSERDLHETYLPAFQALVQQGGVDAVMGAYNRVDGVP 259
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+ A +LL +R +W GY+V+DCD++ + HK + + E A A + G DL+CG
Sbjct: 260 ATASHRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVVP-TAEQAAALAVNNGDDLNCG 318
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
Y AV G V E ID ++ L RLG FD G + +L + S ++
Sbjct: 319 TTYATLV-KAVHDGLVNEHTIDTAVTRLMLARFRLGMFDPPGRVPWSTLPMSVVQSPQHD 377
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA A+E +VLLKND LPL S V+ +AV+GP A+ A++GNY G P ++ +
Sbjct: 378 ALALRTAQESMVLLKND-GLLPL-SHNVRRIAVIGPTADNVTALLGNYHGTPKAPVTILQ 435
Query: 463 GFSGY---ANVTYKTGCDDV 479
G A VTY G + V
Sbjct: 436 GIREAVPNAQVTYVQGTELV 455
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/314 (29%), Positives = 143/314 (45%), Gaps = 54/314 (17%)
Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
AA +AA+ AD I GL +E E + DR L LP Q +L+ Q +V
Sbjct: 625 AALDAARHADVVIFAGGLSSDLEGEEMPVDYPGFAGGDRTTLALPATQRKLL-QALQVTG 683
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
PV+LV+ + + I +A+ + + AIL A YPG++GG A+AD +FG +P GRLP+T+Y
Sbjct: 684 KPVVLVLTTGSALAIDWAKQH--LPAILLAWYPGQDGGHAVADALFGNVDPAGRLPVTFY 741
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
++ L P D GRTY+++ G L+PFG+GLSYT+F Y+ L +
Sbjct: 742 K---------SARQLPPFDDYAMKGRTYRYFTGQPLFPFGFGLSYTRFAYSDLQLDR--- 789
Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
+ L D + +N G G +VV +Y +
Sbjct: 790 ----------------------------DTLGPSDRMRISLRVKNTGQRAGDEVVQLYLR 821
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA-GEHTIFV 777
P A IK + GFQR+ ++ G + + F + L D A + A G + + V
Sbjct: 822 PLRAPHARAIKSLRGFQRISLKPGEERSVSFDISPQTDLKYYDVAHHAYAVAPGRYQVQV 881
Query: 778 GNGGVSFPIHLNFN 791
G + +F
Sbjct: 882 GASSADIRLTRDFT 895
>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 853
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 168/430 (39%), Positives = 247/430 (57%), Gaps = 41/430 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ +++ P RV DL+SR+T++EK+ L + G+PRLG+ +Y +EALHGV V PG
Sbjct: 28 LYKNANAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 85
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L K+I +S EARA +N G L
Sbjct: 86 RF--------TVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQGRNQKEQFSDVL 137
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G +V+GLQ + H LK+
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPHY---------LKIV 188
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E + FEMCVKEG A+S+M +YN +N
Sbjct: 189 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNN 244
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK++ +KE A ++KAGLDL+
Sbjct: 245 VPCTLNSWLLQKVLRRDWGFQGYVVSDCGGPSLLVNAHKYV-KTKEAAATLSIKAGLDLE 303
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y + NA +Q E DID + ++ T M+LG FDG + Y + I S
Sbjct: 304 CGDDVYDEYLLNAYKQYMASEADIDSAAYHVLTARMKLGLFDGVERNPYAKISPSVIGSK 363
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ +A AARE IVLLKN +N LPLN K+K++AVVG NA G+Y+G P +
Sbjct: 364 EHQTVALNAARECIVLLKNQKNMLPLNVKKLKSIAVVG--INAGKCEFGDYSGAPV--VE 419
Query: 460 PIAGFSGYAN 469
P++ G N
Sbjct: 420 PVSILQGIKN 429
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/291 (31%), Positives = 147/291 (50%), Gaps = 49/291 (16%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I++++ AG
Sbjct: 596 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIILVLVAG 653
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A N ++ AI+ A YPGE+GG A+A+V+FG +NP GRLP+T+Y ++ LP
Sbjct: 654 S-SLAVNWENEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LEQLPAF 710
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
D GRTY+++ LYPFGYGLSYT FKY+ L
Sbjct: 711 D------DYDITKGRTYQYFKKDVLYPFGYGLSYTTFKYSNLK----------------- 747
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-- 727
DA KT VN +N G G +V VY + P EIA +
Sbjct: 748 ---VDDAGKT------VN---------VSFTLKNTGKRAGDEVAQVYVRLP-EIAGSTQA 788
Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
I+Q+ GF+RV ++AG +++++ + + + A ++P G T VG
Sbjct: 789 IRQLKGFRRVALKAGESRKVEITLDKEQLRYWDEKQACFVVPQGSFTFMVG 839
>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
Length = 798
Score = 286 bits (731), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 234/803 (29%), Positives = 369/803 (45%), Gaps = 147/803 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
++ DS P RV++L+S+MTL+EK Q+ +G R+ LP W +E G+ N
Sbjct: 52 IYEDSYAPLEARVQNLLSQMTLEEKSCQMATL-YGSGRVLNDALPSDNWKNEVWKDGIGN 110
Query: 109 VG---------------PGTHF------------------------DDVIPG-----ATS 124
+ P H ++ I G AT
Sbjct: 111 IDEEHNGLGSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATY 170
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP A++N+ L +IG+A EAR LG + +SP +++A+DPRWGR ET
Sbjct: 171 FPAQCGQGATWNKELIAQIGEA---EAREASVLGYTNI--YSPILDIAQDPRWGRCVETY 225
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP+ G+ + LQ K+ S KH+A Y + +
Sbjct: 226 GEDPYHAGQMGKQMILSLQKN--------------KLVSTPKHFAVYSIPVGGRDGKTRT 271
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
D V ++M +L PF + E A VM SYN +G P L + +R EW G
Sbjct: 272 DPHVAPREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 331
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
Y+V+D ++++ + H+ +A+ EDAVAQ + AGL++ T+FT +AV
Sbjct: 332 YVVSDSEAVEFISTKHQ-VANGYEDAVAQAVNAGLNIR-----THFTPPADFILPLRSAV 385
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY-VSLGKQDICSDENIELAAEAAREGIV 414
++GK+ + +++ + + V LG FD + Q + S E+ +LA EAAR+ +V
Sbjct: 386 KKGKISQETLNQRVAEILRVKFWLGLFDNPYRGDEKRAGQIVHSPEHQQLALEAARQSLV 445
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVT 471
LLKN+ TLPL S +++VAV+GP+A+ +I Y + G A+V
Sbjct: 446 LLKNEHQTLPL-SKSIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVV 504
Query: 472 YKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAESLD 516
YK GCD + A + + A EAAK A+ T+ +L G +L+V E
Sbjct: 505 YKKGCDIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVR-EDRS 563
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
R L LPG Q +L+ ++ ++ K PV+LV++ I FA T++ AI+ A +PGE GG
Sbjct: 564 RTSLDLPGRQKELLKKICQLGK-PVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGG 620
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
+AIA+ +FG +NPGGRL +T+ V +P + P +P Y LYP
Sbjct: 621 QAIAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSETSVYG-----ALYP 672
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FG+GLSYT F+Y+ L+ + + Q GV N
Sbjct: 673 FGHGLSYTTFQYSDLAISPSKQ------------------------GVQGN-------IS 701
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N+G +G +VV +Y + TY + + GF+R+ ++ + + F +
Sbjct: 702 ISCTIKNIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QE 760
Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
L I D N + G + +G+
Sbjct: 761 LGIWDKQMNFTVEPGMFKVMIGS 783
>gi|386718620|ref|YP_006184946.1| glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
gi|384078182|emb|CCH12773.1| Glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
Length = 897
Score = 285 bits (729), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 174/445 (39%), Positives = 249/445 (55%), Gaps = 41/445 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D S + R LV++MTLDEK Q+ + A + RLG+P Y+WW+E LHGV+ G
Sbjct: 38 WLDVSASFEQRAASLVAQMTLDEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ-- 95
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-------GR-AGLTYW 165
AT FP I A+F+ L ++ +S EARA ++ GR GLT+W
Sbjct: 96 --------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLRQGAHGRYQGLTFW 147
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPN+N+ RDPRWGR ET GEDP++ R V +VRGLQ D R K+ +
Sbjct: 148 SPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQ-------GDDPVYR--KLDATA 198
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH A V + DR+HFDAR + +D+ +T+L FE VKEGD +VM +YNRV G +
Sbjct: 199 KHLA---VHSGPEADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESA 255
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A LL +R +W GY+V+DC +I V + H + ++E A A ++ G +L+CGQ
Sbjct: 256 SASRFLLRDVLRRDWGFKGYVVSDCWAI-VDIWKHHHIVTTREAAAALAVRNGTELECGQ 314
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE---NI 402
Y +AV+QG + E +ID ++ L+T MRLG FD P+ V + ++ +
Sbjct: 315 EYATLP-SAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPSHD 372
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA +AA+ +VLLKND LPL S +K +AVVGP A+ T+A++GNY G P ++ +
Sbjct: 373 ALALKAAQASLVLLKND-GILPL-SRDIKRIAVVGPTADDTMALLGNYFGTPAAPVTILQ 430
Query: 463 GFSGYAN---VTYKTGCDDVACKSN 484
G A V Y G D V + +
Sbjct: 431 GIREAAKGVEVRYARGVDLVEGRDD 455
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 127/284 (44%), Gaps = 53/284 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A +AA+ AD + + GL VE E + DR DL LP Q L+ + K
Sbjct: 622 ALDAAREADVVVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHATGK- 680
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV++V+ GG IA +++ AIL + YPG+ GG A+ +FG NP GRLP+T+Y
Sbjct: 681 PVVMVLT--GGSAIAVDWAQSHLPAILMSWYPGQRGGTAVGQALFGDVNPAGRLPVTFYK 738
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
S L D GRTY+++ G LYPFG+GLSYT+F Y L
Sbjct: 739 ---------ASEALPAFDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGTLRLD----- 784
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
LR D VD N G+ G +VV +Y +
Sbjct: 785 --------------------------AGSLRADGRLGVAVDVTNAGTRSGDEVVQLYVRR 818
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
+ ++++ GFQR+ + G ++ + F A ++L D A
Sbjct: 819 EHAGSGDAVQELRGFQRIHLAPGEHRTVTFTLEAAQALRHYDEA 862
>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
Length = 614
Score = 285 bits (729), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 190/574 (33%), Positives = 284/574 (49%), Gaps = 53/574 (9%)
Query: 88 VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAV 147
V R+GLP+Y+W A+HGV + D + TSFP + ++N S + ++G+ +
Sbjct: 72 VSRIGLPEYDWGMNAIHGVQSSCIKDD-DGTVYCPTSFPNPVNYGFTWNYSAYLELGRII 130
Query: 148 STEARAMYNLG-----------RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAV 196
E RA++ G GL WSPNIN+AR P WGR E PGEDPF+ G++
Sbjct: 131 GVETRALWLAGAVEASTWSGRPHIGLDTWSPNINIARSPLWGRNQEVPGEDPFMNGQFGK 190
Query: 197 NYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEET 256
Y GLQ G ++ L+ KH+ AY +++ G R++F+A V+ + +T
Sbjct: 191 AYTLGLQ---GDDDTY------LQAIVTLKHWDAYSLEDSDGATRHNFNAIVSNFSLMDT 241
Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
+ F + V EG A VMCSYN VNGIP+CA P LL +R W GY+ +D +++ +
Sbjct: 242 YWPAFRVAVTEGKAKGVMCSYNAVNGIPTCAHP-LLRTVLRDLWKFDGYVSSDTGAVEDI 300
Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVL 376
DNHK+ A A D+D G Y V +G + D+D +L+ +
Sbjct: 301 SDNHKYTPSWATAACAAIRDGQTDIDSGAVYMKSLLQGVSEGHCRMEDVDNALRNTLRLR 360
Query: 377 MRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
LG FD + Y + + ++ + E +VLL+N N LPL A VA
Sbjct: 361 FELGLFDPVENQSYWHVPLAAVNTNASRATNMLHTLESMVLLQNKNNVLPL--ASNTKVA 418
Query: 435 VVGPHANATVAMIGNYAGIPCR------YMSP---IAGFSGYANVTYKTGCDDVACKSNN 485
++GPHA A M+GNY G C +SP + G VTY G + C S +
Sbjct: 419 LIGPHAKAQEDMVGNYLGQLCPDNNFDCVVSPHDALVSILGTDAVTYAPGTNVTTC-SQS 477
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
I A A AD +++ G+D S+EAES DR+ + LP Q QL + + V K P ++V+
Sbjct: 478 HIDEAVSVATAADVAVLMLGIDESIEAESNDRKSIDLPECQHQLASAIFAVGK-PTVIVL 536
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
++ G +A AI+ AGYPG GG AIA + G+ + GDY+
Sbjct: 537 LNGGM--LAIENEKQQADAIIEAGYPGFYGGTAIAQTLTGQNE---------HLGDYINW 585
Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
+ ++ M + PGRTY++Y TL+ F +
Sbjct: 586 INMSDMEMT-----SGPGRTYRYYKNETLWAFHF 614
>gi|410097219|ref|ZP_11292201.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
CL02T12C30]
gi|409224537|gb|EKN17469.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
CL02T12C30]
Length = 805
Score = 285 bits (728), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 249/863 (28%), Positives = 400/863 (46%), Gaps = 152/863 (17%)
Query: 8 LLCFSLSI-ALLVFSTNAVDANGSSSP-----VFVCDPGRFSKLGLQMSSFLFCDSSLPY 61
L C S + LL +T+++ A+ +P ++ F+K G + ++ D S P
Sbjct: 12 LFCLSFGLLPLLNANTSSLQASKPDAPKNQKKIYQKGWIDFNKNGKKD---IYEDLSQPI 68
Query: 62 SIRVKDLVSRMTLDEKVQQLGD-FAHG-VPRLGLPQYEW-------------------W- 99
RV+DL+ +MT++EK QLG + +G V + LP EW W
Sbjct: 69 DKRVEDLLKQMTVEEKTCQLGTIYGYGAVLKDTLPTDEWKTRIWKDGIGNIDEHLNGEWK 128
Query: 100 -----------SEALHGV-------SNVG-PGTHFDDVIPG-----ATSFPTVILTTASF 135
+EA++ V + +G P ++ I G +T FP I ++
Sbjct: 129 RTSLDFPYSNHAEAMNKVQAFFVEETRLGIPADLTNEGIRGLKHEKSTFFPAQIGQGCTW 188
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
++ L +IG+ EA+A LG + +SP ++++RDPRWGR E+ GED ++ G
Sbjct: 189 DKELIYEIGRITGEEAKA---LGYTNI--YSPILDLSRDPRWGRTVESYGEDSYLAGELG 243
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY-HFDARVTEQDME 254
V G+Q +V S KH+A Y + G D Y D + Q++
Sbjct: 244 RQQVLGIQSN--------------RVVSTPKHFAIYGIPG-GGRDCYSRTDPHASPQEVH 288
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
E L PF + +E A MCS+N NG P A L+ + +R +W GY+V+D +I
Sbjct: 289 ELHLEPFRIAFQEAGALGTMCSHNDYNGTPVSASHYLMTELLRNQWGFKGYVVSDSWAID 348
Query: 315 VMVDNHKF--LADSKEDAVAQTLKAGLDL----DCGQYYTNFTGNAVQQGKVKETDIDKS 368
N KF + D++E+AVA L AGL++ + + + A+Q+G V+E+ +D+
Sbjct: 349 ---KNVKFYHIVDTEEEAVASELNAGLNVRTFFEQSEVFIEALRRALQKGLVEESTLDQR 405
Query: 369 LKYLYTVLMRLGFFDGSPQYV---SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
++ + V LG FD YV L + + SD+N E++ AARE IVLLKN+ NTLPL
Sbjct: 406 VREVLYVKFWLGLFDDP--YVKDTKLADKIVNSDKNREVSLRAARESIVLLKNENNTLPL 463
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCD---- 477
S +K +AV+GP A+ ++ Y ++ + G N+ Y GC+
Sbjct: 464 -SKTLKNIAVIGPQADEVKSLTSRYGSHNPNVITGLQGLKNLLGENVNLMYAKGCNVRDK 522
Query: 478 ----------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
+++ K I A E AK A+ II G D ES R +L L G Q
Sbjct: 523 NFPQSDVMYFELSDKEKEEIDEAVEIAKKAEVAIIYVGDDFRTIGESRSRVNLDLSGRQK 582
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
+L+ V + PV+LV+ + G + + N+ AI+ A YPGE G+A+A+V+FG +
Sbjct: 583 ELVRAV-QATGTPVVLVLFN--GRPVTLNWEDANLPAIVEAWYPGEFSGQAVAEVLFGDY 639
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
NPGG+L T+ V +P + P +P + G+ + +G LYPFGYGLSYT F+
Sbjct: 640 NPGGKLSTTFPKS--VGQIPW-AFPFKPNAT----GKGFARVDG-ELYPFGYGLSYTTFE 691
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
+ +Q + K+ L T +N GS
Sbjct: 692 IS------NLQPSATKIADGDTLTVTCKV-------------------------KNTGSV 720
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
G +VV +Y + + K++ GF+RV + G K + F N ++ + + +
Sbjct: 721 KGDEVVQLYLNDETSSISRFEKELCGFERVALEPGEEKTVTFKVNR-RAYGMYNDKNEFV 779
Query: 768 LPAGEHTIFVGNGGVSFPIHLNF 790
+ G+ +F GN S P++ F
Sbjct: 780 VEPGKFFLFAGNSSKSTPLNAEF 802
>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 861
Score = 285 bits (728), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 172/451 (38%), Positives = 244/451 (54%), Gaps = 43/451 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R +DL+ R+TL+EKV + + + +PRLG+ +YEWW+EALHGV G
Sbjct: 26 YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
AT FP I ASFN+SL ++ A S EAR + G + GLT+W
Sbjct: 84 --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE--GHENATDLNSRPLKVSS 223
+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ E G++ K+ +
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYD----------KLHA 185
Query: 224 CCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
C KH+A + W +R+ FDA + +D+ ET+L F+ V++ VMC+YNR G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLD 340
P C +LL Q +R EW G +V+DC +I H D KE A A ++ G D
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAAAVRTGTD 301
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
L+CG Y + +AV+ G + E +ID SLK L T LG D + + + S E
Sbjct: 302 LECGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKE 360
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ LA ARE +VLL+N N LPLN+ VAV+GP+AN +V GNY GIP ++
Sbjct: 361 HQALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTL 418
Query: 461 IAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ + Y+ GCD V K+ S+F
Sbjct: 419 LEAVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 127/297 (42%), Gaps = 56/297 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
AD + G+ S+E E + DR D+ LP Q + + + K +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKKVVFI 654
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ G I T +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y V L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
P + GRTY++ L+PFG+GLSYT F Y +K N +
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------NTIAK 759
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
N+ T + NVG DG +VV VY + P +
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+ F+RV + AG+ + + ++ D +NT+ P E T + GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPL-EGTYELLYGGTS 848
>gi|383643328|ref|ZP_09955734.1| glycoside hydrolase family 3 [Sphingomonas elodea ATCC 31461]
Length = 799
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 219/687 (31%), Positives = 321/687 (46%), Gaps = 101/687 (14%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P EALHG+ V PGATSFP I +SF+ L + I +
Sbjct: 145 RLGIPML-MHEEALHGL-----------VAPGATSFPQSIALASSFDPKLVENIFSMAAK 192
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARA R +P ++VARDPRWGRI ET GEDP++V + + +RG Q
Sbjct: 193 EARA-----RGANLVLAPVVDVARDPRWGRIEETYGEDPYLVTQMGLAAIRGFQ------ 241
Query: 210 NATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
T + + KV KH + +N V A + E+ + E F PFE VK
Sbjct: 242 -GTTMPLKSDKVFITLKHMTGHGQPENGTNVG----PASLGERTLREDFFPPFEAAVKTL 296
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
SVM SYN ++GIPS A+ LL +RGEW G +V+D +I+ ++ H D K
Sbjct: 297 PVMSVMASYNEIDGIPSHANKWLLTDVLRGEWGFQGAVVSDYFAIRELITRHHLFKDPK- 355
Query: 329 DAVAQTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
DA + L AG+D++ G+ YT+ V+QG+V + +ID +++ + + G F+
Sbjct: 356 DAAQRALDAGVDVETPDGEAYTHLV-QLVKQGRVSQGEIDNAVRRVLRMKFEGGLFENPY 414
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
V L + E I L+ +AARE IVLLKN Q LPL++ +K +AV+G HA T
Sbjct: 415 PEVKLAAARTNTPEAIALSRQAARESIVLLKNAQGLLPLDARGIKRMAVIGTHAKDTP-- 472
Query: 447 IGNYAGIPCRYMSPIAGFS----GYANVTYKTGCD-------------DVACKSNNSIFA 489
IG Y+ +P +S + G G V Y G V N+ + A
Sbjct: 473 IGGYSDLPNHVVSVLEGMQAEGKGKFAVDYAEGIRITNHREWSKDAVAQVPASVNDQLRA 532
Query: 490 -ASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLINQVAEVAKGPVI 542
A E AK AD +++ G + +V E+ D E L LPG Q QL ++ + K PV+
Sbjct: 533 QALETAKNADVVVLVLGGNEAVSREAWADNHLGDSETLDLPGPQDQLAKELIALGK-PVV 591
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
+++++ G A A++ Y GE+ G AIADVVFG++NPGG+LP++
Sbjct: 592 VILLN--GRPYAVNYLAEKAPALIEGWYLGEQTGNAIADVVFGRYNPGGKLPVSVARS-- 647
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
V LP+ + R Y F + LYPFGYGLSYT F +
Sbjct: 648 VGQLPIY------YNKKPSARRGYLFGDTSPLYPFGYGLSYTTFDIS------------- 688
Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
P + + D +VD N G G +VV ++
Sbjct: 689 ------------------APRLGTPTIGIADKASVEVDVTNTGKVAGDEVVQLFVHDDEA 730
Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKF 749
+ ++ F+RV ++ G K ++F
Sbjct: 731 SVTRPVIELKRFERVTLKPGEKKTVRF 757
>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
CL02T12C05]
Length = 863
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 166/439 (37%), Positives = 233/439 (53%), Gaps = 38/439 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
F + LP RV+DLV R+TL EKV + D++ VPRLG+ QY WW+EALHGV G
Sbjct: 24 FNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIKQYNWWNEALHGVGRAGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
AT FP I A+F++ K++ + VS EARA Y+ GLT+W
Sbjct: 82 --------ATVFPQAIGMAATFDDCAVKQVFECVSDEARAKYHHSENKDGSERYRGLTFW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ R + VRGLQ S+ K+ +C
Sbjct: 134 TPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPS--------ESKYDKLHACA 185
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KHYA + W +R+ FD ++ +D+ ET+L F+ V++G VMC+YNR G P
Sbjct: 186 KHYALHSGPEW---NRHRFDVENISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEP 242
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
C +LL +R EW G +V+DC +I + H +KE AVA +KAG DLDC
Sbjct: 243 CCGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHSTKESAVAAAVKAGTDLDC 302
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
G Y + AV++G + E ID SL L LG D + + + S+++
Sbjct: 303 GVDYQSLE-KAVEKGIITEKQIDVSLSRLLKARFELGLMDEEHLVSWSDIPYTVVDSEKH 361
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
A E AR+ + LLKN TLPL S + V+GP+AN ++ M GNY G P ++ +
Sbjct: 362 RAKALEVARKSMTLLKNKNGTLPL-SKHCGKIVVIGPNANDSIMMWGNYNGFPSHTVTIL 420
Query: 462 AGFSGY---ANVTYKTGCD 477
G + V Y GC+
Sbjct: 421 EGITHKLDAGQVIYDKGCE 439
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 143/305 (46%), Gaps = 59/305 (19%)
Query: 488 FAASEAAKT---ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVA 534
F +E A T A+A + + G+ VE E L DR + LP Q L+ ++
Sbjct: 588 FNPNEIAATVSDAEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELY 647
Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
+ K P+IL++ S + ++ AE + AI+ A YPG+ GG A+ADV+FG +NP GRLP
Sbjct: 648 KTGK-PIILILCSGSAIGLS-AEVDL-ADAIIQAWYPGQAGGTAVADVLFGDYNPAGRLP 704
Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
+T+Y T+ L + GRTY+++ G L+PFGYGLSYT F+
Sbjct: 705 VTFYK---------TTEQLPDFEDYNMQGRTYRYFKGEALFPFGYGLSYTSFE------- 748
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
+ K Q SK R + ++ + +N G DG +V+
Sbjct: 749 ------IGKAQ----------LSKKR--------IHANESVNLDLWIKNTGERDGEEVIQ 784
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEH 773
VY + + +K + F+RV V++G K+I + S D N + + AGE+
Sbjct: 785 VYIRKLKDKEGP-LKTLRAFKRVHVKSGEKKQIS-IHLPNDSFEFFDPEFNVMRVMAGEY 842
Query: 774 TIFVG 778
+ G
Sbjct: 843 EVLYG 847
>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 762
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 235/792 (29%), Positives = 365/792 (46%), Gaps = 146/792 (18%)
Query: 64 RVKDLVSRMTLDEKVQQL------------GDFAH---------GV-------------P 89
+V+DL+S+MTL+EK+ QL G+F+ G+ P
Sbjct: 9 KVRDLISKMTLEEKIAQLQSVFGKELVDESGNFSEEKAEKLLKNGIGQISRVAGEKGMDP 68
Query: 90 RLGLPQYEWWSEALHGVSNVG-PGTHFDDVI-----PGATSFPTVILTTASFNESLWKKI 143
+ + L + +G P ++ + GAT FP I ++F L +++
Sbjct: 69 ERAVELANKIQKFLKEKTRLGIPAIIHEECLSGFMAKGATVFPQAIGMASTFEPELIRRV 128
Query: 144 GQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ RA N+ + SP +++ RDPRWGR ET GEDP++V R A YV+GLQ
Sbjct: 129 SDVIRQHMRAA-NVHQG----LSPVLDIPRDPRWGRTEETFGEDPYLVSRMAAEYVKGLQ 183
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
+ E + + KH+ AY + R A+V E+++ E FL PFE+
Sbjct: 184 GEDWREG----------IIATVKHFTAYGISEGA---RNLGPAKVGERELREVFLFPFEV 230
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
+KEG A S+M +Y+ ++G+P + LL + +R EW GY+V+D +I+++ + H+
Sbjct: 231 AIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEWGFKGYVVSDYIAIRMLENFHRVA 290
Query: 324 ADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMR 378
D+KE AV L+AG+D+ DC Y AV++G + E I+ S++ +
Sbjct: 291 KDAKEAAVL-ALEAGIDIELPSVDC---YGEPLIQAVKEGLISEEVINASVERVLRAKFM 346
Query: 379 LGFFDGSPQYVSLGKQDICSD-ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG 437
LG FDG + DI E EL+ E AR IVLLKND LPL S ++TVAV+G
Sbjct: 347 LGLFDGDLEKDPKKVYDIFDKPEFRELSREVARRSIVLLKND-GILPL-SKNIRTVAVIG 404
Query: 438 PHANATVAMIGNY---AGIP----------------CRYMSPIAGF----SGYANVTYKT 474
P+A+ + G+Y A IP R +S + G S V Y
Sbjct: 405 PNADNPRNLHGDYSYTAHIPSVSETLEGVKIPEECAVRTVSILEGIKNKVSAETQVLYAK 464
Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG-----LDLSVEAESLDREDLWLPGYQTQL 529
GC+ + S A E AK AD I + G + E DR L L G Q L
Sbjct: 465 GCE-ILSDSKEGFDEAIEIAKRADVIIAVMGEESGLFHRGISGEGNDRTTLELFGIQRDL 523
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ ++ ++ K P++LV+++ G A + N+ AIL A YPGEEGG A+ADV+FG +NP
Sbjct: 524 LRELHKLGK-PIVLVLVN--GRPQALKWEHENLNAILEAWYPGEEGGDAVADVIFGDYNP 580
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFK 647
G+LPI++ P + + PV P Y + LYPFG+GLSYT F+
Sbjct: 581 SGKLPISF---------PAVTGQV-PVYYNRKPSAFTDYVEESAKPLYPFGHGLSYTTFE 630
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
Y+ L H +N + E +N G
Sbjct: 631 YSNLKI------------HPEKVNAL-------------------EKVEISFTIKNTGVR 659
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
+G +VV +Y +K++ GF+++ ++ G +KR+ F+ + L D +
Sbjct: 660 EGEEVVQLYVHDQVASLERPVKELKGFKKIHLKPGESKRVTFILYP-EQLAFYDEFMRFV 718
Query: 768 LPAGEHTIFVGN 779
+ G I +G+
Sbjct: 719 VEKGIFEIMIGS 730
>gi|182413194|ref|YP_001818260.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
gi|177840408|gb|ACB74660.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
PB90-1]
Length = 859
Score = 284 bits (726), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 246/884 (27%), Positives = 386/884 (43%), Gaps = 140/884 (15%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFL------- 53
M + V +C ++A LVFS + + A S+ +FV + ++
Sbjct: 1 MLRFVHPTVC---TLAALVFSASPLLAAAPSADLFVPSATPPLAAAVYHDGWIDLNKNGA 57
Query: 54 ---FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WSEAL 103
+ DSS P R++DL++RM+L+EK QL +G PR+ P W W + +
Sbjct: 58 RDPYEDSSRPIDARIEDLLARMSLEEKTAQLTTL-YGFPRVLKDERPTSAWREAMWKDGI 116
Query: 104 -----HGVSNVGPGTHFDDVI--------------------------------------- 119
H N G + D +
Sbjct: 117 GNIDEHLNGNTGWTNNLADPVHDLPWSLHARALNEVQRWFIEQTRLGIPVDFTNEGIRGL 176
Query: 120 --PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPR 176
ATSFP + ++++ +L ++IG+ EARA+ G T +SP +++ARDPR
Sbjct: 177 LHSKATSFPAELAVASTWDPALVREIGRITGREARAL------GYTNIYSPVLDLARDPR 230
Query: 177 WGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
WGR ET GEDPF+VG V VRGLQ V S KH+A Y +
Sbjct: 231 WGRTIETYGEDPFLVGTLGVEQVRGLQAEH--------------VVSTLKHFAVYSIPKG 276
Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
D + T ++++ FL PF ++E A VM SYN +G+P L++ +
Sbjct: 277 GRDGEARTDPQATWREVQTIFLEPFRRAIREAGALGVMASYNDYDGVPVEGSALFLSEIL 336
Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA-- 354
RG+W GY+V+D +++ + H+ +A + DA+ Q ++AGL++ TNFT A
Sbjct: 337 RGQWGFRGYVVSDSAAVEFIHSKHR-VAPTPADAIRQAVEAGLNI-----RTNFTPPAAY 390
Query: 355 -------VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELA 405
V+ GK+ ID ++ + V +LG FD P D + + E++ +A
Sbjct: 391 AEPLRQLVRDGKLAMATIDARVRDVLRVKFQLGLFD-RPYVADPAAADRVVRAPEHLVVA 449
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
A RE IVLLKN+ LPL+ AK++ V V GP A+ A Y +++P+ G
Sbjct: 450 QRAGREAIVLLKNEPALLPLDRAKLQRVLVAGPLADDAHAWWSRYGAQRLDFVTPLPGLR 509
Query: 466 GY----ANVTYKTGC--------------DDVACKSNNSIFAASEAAKTADATIILAGLD 507
V Y G D + + I AA AA+ D I + G
Sbjct: 510 AKLGAAVEVRYAKGVEAKDAAWPASDVLKDPPSAEVRAGIEAAVAAAQNVDVIIAVLGET 569
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
+ ES R L LPGYQ +L+ + K P++LV+ + + + +A + LW
Sbjct: 570 DELCRESSSRISLALPGYQQELLEALHATGK-PLVLVLSNGRPLSVVWAARHVPAIVELW 628
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
+PGE+GG A+A V+ G NP GRLPIT+ V LP + P P G R +
Sbjct: 629 --FPGEDGGAALAAVLLGDANPSGRLPITFPQS--VGQLPY-NFPAHP----GSQARDFG 679
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
G +L+PFG+GLSYT F+Y+ L T ++ ++ + S +R V+
Sbjct: 680 QVEG-SLFPFGHGLSYTTFRYSDLRITPE-RIPVDGFGAAGGGDPGLRGSASRATPYSVS 737
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+ F D N G+ G +VV +Y + TY + GF RV + G K +
Sbjct: 738 TV---PEFTITCDVTNTGTRAGDEVVQLYLRDDYSSVTTYDIALRGFARVTLAPGETKPV 794
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
F + L + + + ++ G T+ +G + F
Sbjct: 795 TFTLHRAH-LELYNRDGDWVVEPGRFTVMLGASSADIRLRGTFT 837
>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
CL02T12C01]
Length = 596
Score = 284 bits (726), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 198/633 (31%), Positives = 313/633 (49%), Gaps = 78/633 (12%)
Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--L 219
+TYWSPN+N+ RDPRWGR ET GEDP++ YVRGLQ + P L
Sbjct: 1 MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQ-----------GNDPFFL 49
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K ++C KHYA V + R+ F+A +++D+ ET+L FE VKE +VM +YNR
Sbjct: 50 KAAACAKHYA---VHSGPEALRHEFNASPSKRDLFETYLPAFEALVKEAKVEAVMGAYNR 106
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
V G + LL +R +W G++V+DC ++ + HK D E A A LK+GL
Sbjct: 107 VYGESASGSFFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHKIAKDVAE-ASAIALKSGL 165
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF---DGSPQYVSLGKQDI 396
+L+CG + A+++ + E D+D +L L ++LG D SP Y ++ I
Sbjct: 166 NLNCGGSFHALK-EALERKLITEVDLDNALMPLMMTRLKLGNLTDDDESP-YKNISDSVI 223
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
S + +A E A++ +VLLKN+ +TLPL VKT+ V GP+A T M+GNY G+ R
Sbjct: 224 ASYTHAMVAREVAQKSMVLLKNNNHTLPLKK-DVKTIFVTGPYAADTYVMMGNYYGVSPR 282
Query: 457 YMSPIAGF----SGYANVTYKTGCDDVACKSNNSIFAASE--AAKTADATIILAGLDLSV 510
+ + G SG ++ YK G N + + E AA+ A I L+G+D
Sbjct: 283 SNTFLQGIAAKVSGGTSINYKIGILPTTPNMNPADWTVGEVRAAEVAIVVIGLSGIDEGE 342
Query: 511 EAESL------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
E +++ D+++L LP +Q + + ++ ++ VI GG I E + A
Sbjct: 343 EGDAIASSHRGDKQNLKLPEHQLKFLRDISRNRWNKLVTVI--TGGSPIDLEEVSELSDA 400
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
++ A YPG+EGG A+ D++FG + GR+P+T+ P+ S L + GR
Sbjct: 401 VIMAWYPGQEGGMALGDLLFGDVSFSGRMPVTF---------PINSDWLPAFEDYNMQGR 451
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TYK+ +YPFGYGL+Y Y+ + LN D +
Sbjct: 452 TYKYMTDNIMYPFGYGLTYGDVSYS----------------DVKILNPKYDGKQE----- 490
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
+ +N G+ + +VV +Y P T I +IGF+RV + + +
Sbjct: 491 ----------IHVQATLRNNGNNEVEEVVQLYLSAPGAGVITPISSLIGFKRVTLESHLS 540
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
+ ++F+ + +++ + LL G++TI V
Sbjct: 541 QTVEFIIKPDQLKMVMEDGSKNLL-KGKYTIIV 572
>gi|409197445|ref|ZP_11226108.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 737
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 232/751 (30%), Positives = 357/751 (47%), Gaps = 112/751 (14%)
Query: 33 PVFVCDPGRFSKLGLQ-MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
P + G FS L +S+ F ++ L RV DL+SRMTL+EKV L VPRL
Sbjct: 20 PYLLILIGIFSLLNASAQTSYPFQNADLDMETRVDDLLSRMTLEEKVSALST-DPSVPRL 78
Query: 92 GL---PQYEWWSEALHGVSNVGPGT---HFDDVIPGATSFPTVILTTASFNESLWKKIGQ 145
G+ P E HGV+ GP D+ +P T FP A++N L +K G+
Sbjct: 79 GIKGAPHIE----GYHGVAMGGPANWAPKGDERVP-TTQFPQAYGMGATWNPELIRKAGE 133
Query: 146 AVSTEARAMYN---LGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGL 202
S EAR ++ + + GL +PN ++ RDPRWGR E GEDPF+VG + + +GL
Sbjct: 134 IESIEARYIFQNPEISKGGLVVRAPNADLGRDPRWGRTEEVLGEDPFLVGTLSTAFTKGL 193
Query: 203 QDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
Q + + + +S KH+ A +N + +FD ++ + TF R
Sbjct: 194 QGD---------DEKYWRTASLLKHFLANSNENTRDSSSSNFDTQLFYEYYGATFRR--- 241
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
+ EG +++ M +YN VNG+P+ P + + W ++G I D ++V HK
Sbjct: 242 -AILEGGSNAYMTAYNAVNGVPAHIHP-MHKEISMARWGVNGIICTDGGGYTLLVRAHKA 299
Query: 323 LADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
D A +KAGL+ Y G A+ G + E D+D+ LK +Y V+++LG
Sbjct: 300 YDDYYR-AAEGVIKAGLNQFLDNYREGVWG-ALAHGYLAEEDLDEVLKGVYRVMIKLGQL 357
Query: 383 DGSPQ----YVSLGKQD----ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
D PQ Y S+G+ S E+ E A + ARE +VLLKN++ TLPL ++ VA
Sbjct: 358 D--PQDKVPYASIGRDGKPAPWTSPEHQEAALQMARESVVLLKNEKQTLPLAGDELGKVA 415
Query: 435 VVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAA 494
V+G H T+ ++ Y+G+P +P+ G + K G D V +N AA EAA
Sbjct: 416 VIG-HLADTI-LLDWYSGMPPFMSTPLDG------IKEKMGADKVLFAPDNDYNAAVEAA 467
Query: 495 KTADATIILAG-------------LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPV 541
AD I++ G D + E++DR+ L L + + Q A
Sbjct: 468 SQADVAIVVLGNHPYCDSERWGDCPDPGMGREAVDRKTLRL---TDEWLAQRVFEANPNT 524
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
ILV+ S+ I +++ N + AI+ + G+ G A+ADV+FG +NPGG+L TW +
Sbjct: 525 ILVLQSSFPYGINWSQEN--LPAIVHITHNGQSTGTALADVLFGDYNPGGKLTQTWPKSE 582
Query: 602 YVQMLPLTSMPLRPVDSLGY---PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
+ LP D + Y G TY ++NG LYPFG+GLSYT F++
Sbjct: 583 --EQLP---------DMMEYDIRKGHTYMYFNGEPLYPFGFGLSYTSFEW---------- 621
Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
++ T + K+ V+V V +NVG G +V+ +Y+
Sbjct: 622 ---------VDMEITGSSVKSNEEEVIVT-----------VKLKNVGQVKGDEVIQLYAS 661
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
P + K + GF+RV + G +K ++
Sbjct: 662 FPETSSRRPDKALKGFKRVTLEPGESKNVQI 692
>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
Length = 853
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 166/430 (38%), Positives = 248/430 (57%), Gaps = 41/430 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + + P RV DL+SR+T++EK+ L + G+PRLG+ +Y +EALHGV V PG
Sbjct: 29 LYKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 86
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L K++ +S EARA +N G L
Sbjct: 87 RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 138
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G +V GLQ + H LK+
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHY---------LKIV 189
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E + FEMCVKEG A+S+M +YN +N
Sbjct: 190 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALND 245
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P +P LL + +R +W GY+V+DC ++V+ HK++ +KE A ++KAGLDL+
Sbjct: 246 VPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKAGLDLE 304
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y NA +Q V + DID + ++ T M+LG FD + Y + I S
Sbjct: 305 CGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSGERNPYTKISPSVIGSK 364
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ ++A +AAR+ IVLLKN +N LPLN+ K+K++AVVG NA G+Y+G P +
Sbjct: 365 EHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAPV--VE 420
Query: 460 PIAGFSGYAN 469
P++ G N
Sbjct: 421 PVSILQGIRN 430
Score = 132 bits (332), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 90/293 (30%), Positives = 142/293 (48%), Gaps = 54/293 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 597 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 654
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + +I AI+ A YPGE+GG A+A+V+FG +NP GRLP+T+Y L
Sbjct: 655 S-SLAINWMDEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-------LD 706
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTYK++ G LYPFGYGLSY+ F Y+ L
Sbjct: 707 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYSDLQVK-------------- 750
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAAT 726
D E V F +N G +G +V VY + P
Sbjct: 751 -----------------------DGVGEVTVSFRLKNTGKRNGDEVAQVYVRIPETGGIV 787
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
+K++ GF+RV +++G ++R++ N + L D ++P G + VG
Sbjct: 788 PLKELKGFRRVPLKSGESRRVEIKLNK-EQLRYWDVEKGQFVVPKGAFDVMVG 839
>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
3_8_47FAA]
Length = 850
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 166/427 (38%), Positives = 248/427 (58%), Gaps = 41/427 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + + P RV DL+SR+T++EK+ L + G+PRLG+ +Y +EALHGV V PG
Sbjct: 26 LYKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 83
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L K++ +S EARA +N G L
Sbjct: 84 RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 135
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G +V+GLQ + R LK+
Sbjct: 136 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD---------DPRYLKIV 186
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E + FEMCVKEG A+S+M +YN +N
Sbjct: 187 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALND 242
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK++ +KE A +++AGLDL+
Sbjct: 243 VPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQAGLDLE 301
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y + NA +Q V + DID + ++ T M+LG FDG+ + Y + I S
Sbjct: 302 CGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPSVIGSK 361
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ ++A +AARE IVLLKN N LPLN KVK++AVVG NA G+Y+G P +
Sbjct: 362 EHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAPV--VD 417
Query: 460 PIAGFSG 466
P++ G
Sbjct: 418 PVSILQG 424
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 139/290 (47%), Gaps = 48/290 (16%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 594 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 651
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + +I AI+ A YPGE+GG A+ADV+FG +NP GRLP+T+Y + LP
Sbjct: 652 S-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDELPAF 708
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
D GRTYK++ G LYPFGYGLSY+ FKY+ L
Sbjct: 709 D------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYSDLK----------------- 745
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
D + T +N G G +V VY + P IK
Sbjct: 746 ---VKDGANT---------------VSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIK 787
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVG 778
++ GF+R+ +++G ++ ++ + + L D ++P G I VG
Sbjct: 788 ELKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGRFIVPQGAFDIMVG 836
>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 762
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 230/766 (30%), Positives = 363/766 (47%), Gaps = 145/766 (18%)
Query: 62 SIRVKDLVSRMTLDEKVQQL------------GDFAH---------------------GV 88
S +VKDL+++MTL+EK+ QL G+F+ GV
Sbjct: 7 SKKVKDLIAKMTLEEKIAQLQAVYGKDLVDENGNFSEEKAEKLLKNGIGQISRVAGERGV 66
Query: 89 -PRLGLPQYEWWSEALHGVSNVG-PGTHFDDVIPG-----ATSFPTVILTTASFNESLWK 141
P + + L + +G P ++ + G AT FP I ++F L +
Sbjct: 67 SPEKAVELANKIQKFLKEKTRLGIPAIIHEECLSGFMAQGATVFPQAIGMASTFEPELIR 126
Query: 142 KIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRG 201
++ + +A N+ + SP +++ RDPRWGR ET GEDP++V R A YV+G
Sbjct: 127 RVSDVIRQHMKAA-NVHQG----LSPVLDIPRDPRWGRTEETFGEDPYLVSRMATEYVKG 181
Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
LQ + E + + KH+ AY + R A+V E+++ E FL PF
Sbjct: 182 LQGEDWREG----------IVATVKHFTAYGISEGA---RNLGPAKVGERELREVFLFPF 228
Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
E+ +KEG A S+M +Y+ ++G+P + LL + +R EW GY+V+D +++++ + HK
Sbjct: 229 EVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEWGFKGYVVSDYIAVRMLENFHK 288
Query: 322 FLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVL 376
D+KE AV L+AG+D+ DC Y AV++G + E I+ S++ +
Sbjct: 289 VARDAKEAAVL-ALEAGIDIELPSVDC---YGEPLIQAVKEGLISEEVINASVERVLRAK 344
Query: 377 MRLGFFDGSPQYVSLGKQDICSD-ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
LG FD + + ++ E +L+ E AR IVLLKND TLPL S +K VAV
Sbjct: 345 FMLGLFDDNLEKDPKKVYEVFDKPEFRDLSREVARRSIVLLKND-GTLPL-SKNLKKVAV 402
Query: 436 VGPHANATVAMIGNY---AGIP----------------CRYMSPIAGF----SGYANVTY 472
+GP+A+ + G+Y A IP R +S + G S V Y
Sbjct: 403 IGPNADNPRNLHGDYSYTAHIPSIAEGLEGVKVEEKCVVRTVSILEGIRNKVSPETEVLY 462
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAG-----LDLSVEAESLDREDLWLPGYQT 527
GCD + S + A E AK AD I + G + E DR L L G Q
Sbjct: 463 AKGCD-IISDSKDGFAEAIEMAKEADVIIAVMGEESGLFHRGISGEGNDRTTLELFGVQR 521
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
L+ ++ ++ K P++LV+++ G A + N+ AIL A YPGEEGG A+ADV+FG +
Sbjct: 522 DLLKELHKLGK-PIVLVLIN--GRPQALKWEHENLNAILEAWYPGEEGGNAVADVIFGDY 578
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQ 645
NP G+LPI++ P + + PV P Y + LYPFG+GLSYT
Sbjct: 579 NPSGKLPISF---------PAVTGQI-PVYYNRKPSAFSDYIDESAKPLYPFGHGLSYTT 628
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F+Y+ L + +L K++ +++T +N G
Sbjct: 629 FEYSDLKISPEKVNSLEKVE----ISFT---------------------------IKNTG 657
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
+ DG +VV +Y +K++ GF++++++ G +KR+ F
Sbjct: 658 NRDGEEVVQLYIHDQVASLERPVKELKGFKKIYLKPGESKRVTFTL 703
>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 763
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 213/690 (30%), Positives = 337/690 (48%), Gaps = 96/690 (13%)
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
GAT FP + +++N L++ I +AV+ E RA + G +SP ++V RDPRWGR
Sbjct: 123 GATVFPVPLTIGSTWNTELFRSISRAVAAETRA-----QGGSATYSPVLDVVRDPRWGRT 177
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
ET GEDP +V +AV V+GLQ L+S + + KH+A Y + +
Sbjct: 178 EETFGEDPHLVTEFAVAAVQGLQ-------GERLDSH-TSLLATLKHFAGYGASEGGRNG 229
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
H R ++ E L PF V+ G A SVM +YN ++G+P + LL +R
Sbjct: 230 APVHMGLR----ELHEVDLLPFRKAVEAG-ALSVMTAYNEIDGVPCTSSGYLLQDVLREA 284
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQG 358
W G+++ DC +I ++ H A S +A AQ+LKAG+D++ G + A++QG
Sbjct: 285 WGFDGFVITDCGAIHMLACGHN-TAGSGVEAAAQSLKAGVDMEMSGTMFRAHLHQALEQG 343
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKN 418
+ E D++++ + + RLG FD + +Q I E+I LA +AA EGIVLLKN
Sbjct: 344 LITEEDLNRAAGRVLELKFRLGLFDRPYVDPAWAEQVIGCKEHIALAYQAAAEGIVLLKN 403
Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGFS---GYANVTYK 473
+ N LPL+S+ T+AV+GP+A+A +G+Y P + ++ + G G + V Y
Sbjct: 404 EGNLLPLDSSS-GTIAVIGPNAHAPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVLYA 462
Query: 474 TGCDDVACKSNNSIFAASEAAKTADATIILAG-----------LDLSVEA---------- 512
GC + S A A+ AD +++ G +DL A
Sbjct: 463 PGC-RIQGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGHAESD 521
Query: 513 ----ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
E +DR L L G Q +L+ ++ ++ K PVI+V ++ G I + +I +I+ A
Sbjct: 522 MECGEGIDRSTLTLMGVQLELLQELHKLGK-PVIVVYIN--GRPITEPWIDEHIPSIVEA 578
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
YPG+EGG AIAD++FG NP GRLP++ V LP + R G+ Y
Sbjct: 579 WYPGQEGGSAIADMLFGDINPSGRLPLSIPK--EVGQLPNSYNARR------TRGKRYLE 630
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
+ YPFG+GLSYT+F+Y L+ + + +A+
Sbjct: 631 TDLAPRYPFGFGLSYTEFRYGRLTVEPAV------------VPIGGEAT----------- 667
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
++D N G+ DG++VV +Y A K + GF++VF++AG + +
Sbjct: 668 --------VRIDVTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVT 719
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
F + + L ++ ++ GE I VG
Sbjct: 720 FTIGS-EQLELIGLDLKPVVEPGEFRIQVG 748
>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 941
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 240/862 (27%), Positives = 379/862 (43%), Gaps = 149/862 (17%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPG--RFSKLGLQMSSFLFCDSS 58
M K+++++L S S L T V A + + G F+K G+ ++ D +
Sbjct: 1 MRKLIAAVLLLSNSALLTAQKTMKVPATYKPTKSEMYHKGWIDFNKNGVMD---VYEDPA 57
Query: 59 LPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS-------EALH 104
RV+DL+ +MTLDEK Q+ +G R+ LP EW W E L+
Sbjct: 58 ATVDARVEDLLKQMTLDEKTCQMVTL-YGYKRVLKDALPTPEWKQMLWKDGIGAIDEHLN 116
Query: 105 GVSNVG-PGTHFDDVIPG--------------------------------------ATSF 125
G G P + ++V P AT+F
Sbjct: 117 GFQQWGLPPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESYKATNF 176
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRITETP 184
PT + ++N +L K+G EAR + G T ++P ++V RD RWGR E
Sbjct: 177 PTQLGLGHTWNRALIHKVGLITGREARML------GYTNVYAPILDVGRDQRWGRYEEVY 230
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GE P++V + VRGLQ V++ KH+AAY +
Sbjct: 231 GESPYLVAELGIEMVRGLQQ---------------HVAATGKHFAAYSNNKGAREGMARV 275
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
D + + ++E + PF +KE VM SYN +GIP L +R E G
Sbjct: 276 DPQTSPHEVENIHIYPFRRVIKEAGLLGVMSSYNDYDGIPIQGSYYWLTTRLRDEMGFRG 335
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKV 360
Y+V+D D+++ + H D KE AV Q+++AGL++ C + V++G +
Sbjct: 336 YVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGL 394
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGIVLLKND 419
E ++ ++ + V +G FD Q G +++ +EN +A +A+RE +VLLKN+
Sbjct: 395 DEETVNDRVRDILRVKFLIGLFDAPYQTDLAGADKEVEKEENEAVALQASRESVVLLKNE 454
Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYANVTYKTG 475
+TLPLN VK +AV GP+A+ + +Y + + + G +G A V Y G
Sbjct: 455 NSTLPLNINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEVLYTKG 514
Query: 476 CDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
CD V S I A E A+ AD +++ G E+ R L
Sbjct: 515 CDLVDANWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGENKSRSSLD 574
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
LPG Q QL+ V K PV+L++++ + + +A+ + AIL A YPG +GG A+AD
Sbjct: 575 LPGRQLQLLQAVQATGK-PVVLILINGRPLSVNWAD--KYVPAILEAWYPGSKGGVALAD 631
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL-----GYPGRTYKFYNGPTLYP 636
++FG +NPGG+L +T+ V +P + P +P + P NG LYP
Sbjct: 632 ILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPASQIDGGKNAGPDGNMSRING-ALYP 687
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FGYGLSYT F+Y+ L T P V+ + +
Sbjct: 688 FGYGLSYTTFEYSNLEIT---------------------------PKVITPNEKA----T 716
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
++ N G G +VV +Y++ TY K + GF+R+ + G K + F+ + K
Sbjct: 717 VRLKVTNTGKYAGDEVVQLYTRDVLSSVTTYEKNLAGFERIHLEPGETKEVTFILDR-KH 775
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L ++D ++ G+ I G
Sbjct: 776 LELLDADMKRVVEPGDFAIMAG 797
>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
Length = 850
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 166/427 (38%), Positives = 248/427 (58%), Gaps = 41/427 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + + P RV DL+SR+T++EK+ L + G+PRLG+ +Y +EALHGV V PG
Sbjct: 26 LYKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 83
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L K++ +S EARA +N G L
Sbjct: 84 RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 135
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G +V+GLQ + R LK+
Sbjct: 136 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD---------DPRYLKIV 186
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E + FEMCVKEG A+S+M +YN +N
Sbjct: 187 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALND 242
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK++ +KE A +++AGLDL+
Sbjct: 243 VPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQAGLDLE 301
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y + NA +Q V + DID + ++ T M+LG FDG+ + Y + I S
Sbjct: 302 CGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPSVIGSK 361
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ ++A +AARE IVLLKN N LPLN KVK++AVVG NA G+Y+G P +
Sbjct: 362 EHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAPV--VD 417
Query: 460 PIAGFSG 466
P++ G
Sbjct: 418 PVSILQG 424
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 139/290 (47%), Gaps = 48/290 (16%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 594 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 651
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + +I AI+ A YPGE+GG A+ADV+FG +NP GRLP+T+Y + LP
Sbjct: 652 S-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDELPAF 708
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
D GRTYK++ G LYPFGYGLSY+ FKY+ L
Sbjct: 709 D------DYDITQGRTYKYFKGDVLYPFGYGLSYSSFKYSDLK----------------- 745
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
D + T +N G G +V VY + P IK
Sbjct: 746 ---VKDGANT---------------VSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIK 787
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVG 778
++ GF+R+ +++G ++ ++ + + L D ++P G I +G
Sbjct: 788 ELKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGQFIVPQGAFDIMIG 836
>gi|46127231|ref|XP_388169.1| hypothetical protein FG07993.1 [Gibberella zeae PH-1]
Length = 712
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 204/621 (32%), Positives = 301/621 (48%), Gaps = 85/621 (13%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
CD++ + R LVS +T EKV L A G PR+GLP+Y WW+EALHGV+ PG
Sbjct: 41 ICDTTASPAERAAALVSALTPREKVNNLVSNATGAPRIGLPRYNWWNEALHGVAGA-PGN 99
Query: 114 HFDDVIP--GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG-LTYWSPNIN 170
++D P ATSFP +L ++F++ L IG+ + TEARA N G G + YW
Sbjct: 100 DYNDKPPYDSATSFPMPLLMGSTFDDDLIHDIGEVIGTEARAWNNGGWGGGVDYW----- 154
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
TP +PF R+ G E
Sbjct: 155 ------------TPNVNPFKDPRWG----------RGSETP------------------- 173
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
G D H + R E C ++ S+MCSYN VNGIP+CA+
Sbjct: 174 -------GEDALHV----------SRYARAME-CTRDAKVGSIMCSYNAVNGIPACANSY 215
Query: 291 LLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
L +R W+ + +I +DC ++Q + +H + E A A + G D C
Sbjct: 216 LQETLLRKHWNWTHTNNWITSDCGAMQDIWQHHNYTKTGAEAAKA-AFENGQDSSCEYTT 274
Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAA 406
T ++ +QG + E +D++LK L+ L+ GFFDG ++ SL D+ + +LA
Sbjct: 275 TKDISDSYEQGLLTEKVMDRALKRLFEGLVHTGFFDGDKSEWSSLDFDDVNTRHAQDLAL 334
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGF 464
++A G VLLKND NTLPLN K ++VA++G A+ + G Y+G +P A
Sbjct: 335 QSAVRGAVLLKND-NTLPLNIKKKESVALIGFWADDKTKLQGGYSGPAPHVRTPAYAAKM 393
Query: 465 SGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
G NV + + + N + A EAAK +D + L GLD + E DR DL P
Sbjct: 394 LGLNTNVAWGPTLQNSSVPDNWTTNAL-EAAKKSDYIVYLGGLDATAAGEERDRTDLDWP 452
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
Q L+ +++ + K ++V+ VD N + +ILW YPG+EGG A+ +++
Sbjct: 453 STQLTLLKKLSNLGK--PLVVVQLGDQVDDTPLLKNKGVNSILWVNYPGQEGGTAVMELI 510
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
G+ P GRLP+T Y Y + + + M LRP S PGRTY++Y+ L PFG+G Y
Sbjct: 511 TGRKGPAGRLPLTQYPSKYTEQVGMLEMELRPTKS--SPGRTYRWYSDSVL-PFGFGKHY 567
Query: 644 TQFKYNLLSFTKTIQVNLNKL 664
T FK S + I++N+ K+
Sbjct: 568 TTFKAMFKS--QKIEMNIQKI 586
>gi|423302093|ref|ZP_17280116.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
CL09T03C10]
gi|408471184|gb|EKJ89716.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
CL09T03C10]
Length = 1039
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 247/868 (28%), Positives = 392/868 (45%), Gaps = 155/868 (17%)
Query: 13 LSIALLVFSTNAVDANGSSSPVFVCDPGR----------FSKLGLQMSSFLFCDSSLPYS 62
L I+L + S + A +S V P R F+K G++ ++ D S P
Sbjct: 97 LLISLFLGSCATLPAQKTSKIPTVYKPVRTEMYQKGWIDFNKNGIKD---VYEDPSAPID 153
Query: 63 IRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS-------EALHGVSN 108
R++DL+S+MTL+EK Q+ +G R+ LP EW W E L+G
Sbjct: 154 ARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKNQLWKDGIGAIDEHLNGFQQ 212
Query: 109 VG-PGTHFDDVIPG--------------------------------------ATSFPTVI 129
G P + + V P AT+FPT +
Sbjct: 213 WGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNFPTQL 272
Query: 130 LTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF 189
++N L ++IG EAR LG + ++P ++V RD RWGR E GE P+
Sbjct: 273 GLGHTWNRQLLRQIGLITGREARM---LGYTNV--YAPILDVGRDQRWGRYEEVYGESPY 327
Query: 190 VVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
+V + V+G+Q H + +V++ KH+ AY + D +++
Sbjct: 328 LVAELGIEMVKGMQ----HNH---------QVAATGKHFIAYSNNKGAREGMARVDPQMS 374
Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
+++E + PF+ ++E VM SYN +G P + L +RG+ GY+V+D
Sbjct: 375 PREVEMIHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGDMGFRGYVVSD 434
Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDI 365
D+++ + H D KE AV Q+++AGL++ C Y V++G++ E I
Sbjct: 435 SDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNIRCTFRSPDSYVLPLRELVKEGELSEEII 493
Query: 366 DKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
+ ++ + V +G FD Q G +++ N E+A +A+RE IVLLKND+N LP
Sbjct: 494 NDRVRDILRVKFLVGLFDHPYQTDLKGADEEVEKASNEEIALQASRESIVLLKNDKNVLP 553
Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYANVTYKTGCDDV- 479
LN++ +K +AV GP+A+ + +Y + S + G G A V Y GC+ V
Sbjct: 554 LNASTIKKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKLGGKAEVLYTKGCELVD 613
Query: 480 -------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
+ I A K AD +++ G E+ R L LPG Q
Sbjct: 614 ANWPESELMEYPLSENEQEEIEKAVSQTKQADVAVVVLGGGQRTCGENKSRSSLALPGRQ 673
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
L+ V K PV+LV+++ + I +A+ + AIL A YPG +GG+A+ADV+FG
Sbjct: 674 LDLLKAVVATGK-PVVLVLINGRPLSINWAD--KFVPAILEAWYPGSKGGKAVADVLFGD 730
Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--YNGPTLYPFGYGL 641
+NPGG+L +T+ V +P + P +P +D PG NG LYPFG+GL
Sbjct: 731 YNPGGKLTVTF--PKTVGQIPF-NFPCKPSSQIDGGKNPGLNGNMSRVNG-ALYPFGFGL 786
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
SYT F+Y+ L + P ++ + + Y KV
Sbjct: 787 SYTTFEYSDLKIS---------------------------PAIITPNQKT--YVTCKV-- 815
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
N G G +VV +Y + TY K + GF+RV ++ G K I F + K+L +++
Sbjct: 816 TNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFPIDR-KALELLN 874
Query: 762 YAANTLLPAGEHTIFVGNGGVSFPIHLN 789
+ ++ GE T+ + G S I LN
Sbjct: 875 ADMHWVVEPGEFTLMI--GASSTDIRLN 900
>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 853
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 166/427 (38%), Positives = 248/427 (58%), Gaps = 41/427 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + + P RV DL+SR+T++EK+ L + G+PRLG+ +Y +EALHGV V PG
Sbjct: 29 LYKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 86
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L K++ +S EARA +N G L
Sbjct: 87 RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 138
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G +V+GLQ + R LK+
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD---------DPRYLKIV 189
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E + FEMCVKEG A+S+M +YN +N
Sbjct: 190 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALND 245
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK++ +KE A +++AGLDL+
Sbjct: 246 VPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQAGLDLE 304
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y + NA +Q V + DID + ++ T M+LG FDG+ + Y + I S
Sbjct: 305 CGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPSVIGSK 364
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ ++A +AARE IVLLKN N LPLN KVK++AVVG NA G+Y+G P +
Sbjct: 365 EHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAPV--VD 420
Query: 460 PIAGFSG 466
P++ G
Sbjct: 421 PVSILQG 427
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 138/290 (47%), Gaps = 48/290 (16%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 597 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 654
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + +I AI+ A YPGE+GG A+ADV+FG +NP GRLP+T+Y + LP
Sbjct: 655 S-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDELPAF 711
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
D GRTYK++ G LYPFGYGLSY+ FKY+ L
Sbjct: 712 D------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYSDLK----------------- 748
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
D + T +N G G +V VY + P IK
Sbjct: 749 ---VKDGANT---------------ISVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIK 790
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVG 778
++ GF+R+ +++G ++ + + + L D ++P G I VG
Sbjct: 791 ELKGFRRIPLKSGESRVVDIELDK-EQLRYWDAGLGQFIVPQGAFDIMVG 839
>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Bacteroides faecis MAJ27]
Length = 804
Score = 283 bits (723), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 222/719 (30%), Positives = 340/719 (47%), Gaps = 110/719 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G T FPT I +A+++ +L +++G+A++
Sbjct: 142 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGMSATWSPTLIEEVGKAIAK 189
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R+ + + P ++++RDPRW R+ ET GEDP + GR V GL
Sbjct: 190 EIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVTGL------- 237
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ DL SR + KH+ AY V Y A V +D+ E FL PF ++ G
Sbjct: 238 GSGDL-SREHATIATLKHFLAYAVPEGGQNGNY---ASVGARDLHENFLPPFREAIEAG- 292
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++GIP A+ LL Q +R EW G++V+D SI+ + ++H F+A + E+
Sbjct: 293 ALSVMTSYNSIDGIPCTANHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVASTMEE 351
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A Q L AG+D+D G + N AV+ GK+ ET I+ ++ + + +G F+
Sbjct: 352 AAVQALSAGVDIDLGGDAFMNLL-QAVRSGKLDETQINAAVDRILRMKFEMGLFEHPYVN 410
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+ + + E+++LA + A+ +VLL+N + LPL S K+K VAVVGP+A+ M+G
Sbjct: 411 PKTTTKMVRNKEHVKLARKVAQSSVVLLENKNSILPL-SKKIKRVAVVGPNADNRYNMLG 469
Query: 449 NYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
+Y I I+ S + V Y GC + + N I A EAA ++ I
Sbjct: 470 DYTAPQEDKDIRTVLDGVISKLSP-SRVEYVRGCA-IRDTTVNEIAEAVEAAHRSEVIIA 527
Query: 503 LAGLDLSVE-----------------------AESLDREDLWLPGYQTQLINQVAEVAKG 539
+ G + + E DR L L G Q L+N + K
Sbjct: 528 VVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLNALKTTGK- 586
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
P+I+V + +D +A + A+L A YPG+ GG AIADV+FG +NP GRLP++
Sbjct: 587 PLIVVYIEGRPLDKVWASECAD--ALLTASYPGQAGGDAIADVLFGDYNPAGRLPVSVPR 644
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
V +P+ P + Y LY FGYGLSYT F+Y+ L T+
Sbjct: 645 S--VGQIPVYYNKKAPRN------HDYVEMAASPLYGFGYGLSYTTFEYSDLQITQ---- 692
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
K+ C +FE +N G+ DG +V +Y K
Sbjct: 693 ------------------KSPC------------HFEVSFKVKNTGNYDGEEVAQLYLKD 722
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+KQ+ F+R F+R G K I F K L+I+D + ++ G+ I +G
Sbjct: 723 EYASVVQPLKQLKHFERFFLRKGEEKEILFTLTE-KDLSIIDRSMKRVVETGDFRIMIG 780
>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 788
Score = 283 bits (723), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 239/817 (29%), Positives = 371/817 (45%), Gaps = 151/817 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
L+ + P RV+DL+S+MTL+EK Q+ +G R+ LPQ W +E G+ N
Sbjct: 42 LYENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGN 100
Query: 109 V-----GPGT-----------HFD-----------------------DVIPG-----ATS 124
+ G G H D + I G AT
Sbjct: 101 IDEEHNGLGAFKSEYSFPYAKHVDAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATY 160
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP A++N+ L +IG+ + EA A LG + +SP +++A+DPRWGR ET
Sbjct: 161 FPAQCGQGATWNKKLIARIGEVEAKEAVA---LGYTNI--YSPILDIAQDPRWGRCVETY 215
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP++VG + LQ +L + P KH+A Y + +
Sbjct: 216 GEDPYLVGELGKQMITSLQKY-------NLVATP-------KHFAVYSIPIGGRDGKTRT 261
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
D V ++M ++ PF M +E A VM SYN +G P L + +R EW G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
Y+V+D ++++ + + HK +AD+ ED +AQ + AGL++ T+FT AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREG 412
GK+ + +DK + + + LG FD Y GKQ + S E+ ++ EAAR+
Sbjct: 376 DDGKISQETLDKRVAEILRIKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYAN 469
+VLLKN+ + LPL S ++++AV+GP+A+ +I Y A P + + I +A
Sbjct: 434 LVLLKNETHLLPL-SKSIRSIAVIGPNADEQTQLICRYGPANAPIKTVYQGIKELLPHAE 492
Query: 470 VTYKTGCDDVA-----------CKSNNSIFAASE---AAKTADATI-ILAGLDLSVEAES 514
V YK GCD + K+ + E AAK A+ + +L G +L+V E
Sbjct: 493 VIYKKGCDIIDPHFPESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGGNELTVR-ED 551
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q +L+ V K PVILV++ I +A ++ AIL A +PGE
Sbjct: 552 RSRTSLNLPGRQEELLKAVCATGK-PVILVMLDGRASSINYAA--AHVPAILHAWFPGEF 608
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
G+A+A+ +FG +NPGGRL +T+ V +P + P +P Y L
Sbjct: 609 CGQAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSSTSVYG-----AL 660
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFG+GLSYT F Y+ L + + Q ++ D +
Sbjct: 661 YPFGHGLSYTTFTYSDLHISPSHQ-----------------------------GVQGDIH 691
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
K+ +N G G +VV +Y + TY K + GF+R+ ++AG + + F
Sbjct: 692 VSCKI--KNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP- 748
Query: 755 KSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
+ L + D N + G + +G +H F
Sbjct: 749 QDLGLWDKNMNFRVEPGSFKVMLGASSTDIRLHGQFE 785
>gi|410634080|ref|ZP_11344720.1| beta-glucosidase [Glaciecola arctica BSs20135]
gi|410146740|dbj|GAC21587.1| beta-glucosidase [Glaciecola arctica BSs20135]
Length = 772
Score = 283 bits (723), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 205/632 (32%), Positives = 318/632 (50%), Gaps = 71/632 (11%)
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
++P ++VARDPRWGRI+E GED ++ A V+G Q DL S+P + +
Sbjct: 176 FAPMVDVARDPRWGRISEGSGEDVYLTTAIARARVQGFQ-------GDDL-SQPHTILAT 227
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+AAY G D + D ++++++ +T+L PF+ V G +S M S+N +NG+P
Sbjct: 228 AKHFAAYG-QGQAGRDYHTTD--MSDRELRDTYLPPFKAAVDAG-VTSFMTSFNELNGVP 283
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC- 343
+ A+ LL +R EW G++V D SI MV H F D+ + A +KAG+D+D
Sbjct: 284 ASANKYLLTDILRDEWSFEGFVVTDYTSINEMV-KHGFARDN-DHAGELAVKAGVDMDMQ 341
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDICSDEN 401
G Y ++ N V QGKV ID + + + + RLG F+ +Y + + Q+I + N
Sbjct: 342 GSVYFDYLANQVTQGKVSPQQIDNAARRILEMKYRLGLFEDPYRYSNEEREAQEIYKEYN 401
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
++ A + AR+ +VLLKN+ LPL+ + + T+AV+GP A++ +IG+++ RY PI
Sbjct: 402 LQAAQDVARKSMVLLKNENQQLPLSKSDL-TIAVIGPLADSKEDLIGSWSAAGDRYEKPI 460
Query: 462 AGFSGY-------ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA-GLDLSVEAE 513
+G + V Y G +NS F A+ A I+LA G + E
Sbjct: 461 TLLTGIKAKVADPSKVLYAKGASYEFSHQDNSGFEAAIAIAKKADVIVLAMGEKWDMTGE 520
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
+ R L PG Q L+ Q+ ++AK P++LV+M+ + I +A + N+ AIL A YPG
Sbjct: 521 ATSRTSLDFPGNQLALMQQLKKLAK-PMVLVLMNGRPMTIEWA--DQNVDAILEAWYPGT 577
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDS-LGYPGRTY 626
GG AIADV+FG +NP G+LP+T+ V +PL T P ++ Y R
Sbjct: 578 MGGPAIADVLFGDYNPSGKLPVTFPRN--VGQIPLYYNMKNTGRPYSKDNAEQKYVSRYI 635
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
N P LY FG+GLSYT F Y+ +S K + KL
Sbjct: 636 DSLNTP-LYHFGHGLSYTTFDYSKISLNKAVITAKEKLTAS------------------- 675
Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
+D N G+ DG +VV +Y + +KQ+ GF+++F+ G K
Sbjct: 676 ------------IDVTNSGNYDGEEVVQLYIRDRIGSVTRPVKQLKGFKKIFLHKGETKT 723
Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ F + + L + AGE +F+G
Sbjct: 724 VSFSI-STEDLAFHRQDMSFGAEAGEFDLFIG 754
>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
VPI-5482]
Length = 853
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 165/430 (38%), Positives = 248/430 (57%), Gaps = 41/430 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + + P RV DL+SR+T++EK+ L + G+PRLG+ +Y +EALHGV V PG
Sbjct: 29 LYKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 86
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L K++ +S EARA +N G L
Sbjct: 87 RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 138
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G +V GLQ + H LK+
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHY---------LKIV 189
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E + FEMCVKEG A+S+M +YN +N
Sbjct: 190 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALND 245
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P +P LL + +R +W GY+V+DC ++V+ HK++ +KE A ++KAGLDL+
Sbjct: 246 VPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKAGLDLE 304
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y NA +Q V + DID + ++ T M+LG FD + Y + I S
Sbjct: 305 CGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSGERNPYTKISPSVIGSK 364
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ ++A +AAR+ +VLLKN +N LPLN+ K+K++AVVG NA G+Y+G P +
Sbjct: 365 EHQQIALDAARQCVVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAPV--VE 420
Query: 460 PIAGFSGYAN 469
P++ G N
Sbjct: 421 PVSILQGIRN 430
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 88/291 (30%), Positives = 144/291 (49%), Gaps = 50/291 (17%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 597 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 654
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + +I AI+ A YPGE+GG A+A+V+FG +NP GRLP+T+Y L
Sbjct: 655 S-SLAINWMDEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-------LD 706
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTYK++ G LYPFGYGLSY+ F Y+ L
Sbjct: 707 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYSDLQ---------------- 748
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
V D + F++ +N G +G +V VY + P +
Sbjct: 749 -----------------VKDGGGEVTVSFRL--KNTGKRNGDEVAQVYVRIPETGGIVPL 789
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
K++ GF+RV +++G ++R++ + + L D ++P G + VG
Sbjct: 790 KELKGFRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVG 839
>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 865
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 175/452 (38%), Positives = 241/452 (53%), Gaps = 39/452 (8%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
F + ++SL R DL+ R+TL+EKV + + + +PRLG+ Y+WW+EALHGV G
Sbjct: 25 FPYQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGVGRAGI 84
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-------NLGR-AGLT 163
AT FP I ASF++ L K+ AVS EARA Y NL R GLT
Sbjct: 85 ----------ATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLKRYQGLT 134
Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
+W+PNIN+ RDPRWGR ET GEDP++ R V VRGLQ + N + K+ +
Sbjct: 135 FWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPD--------NMKYDKLHA 186
Query: 224 CCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
C KHYA + W +R+ F+A + +D+ ET+L F+ V+E D VMC+YNR G
Sbjct: 187 CAKHYAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKALVQEADVKEVMCAYNRFEG 243
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM--VDNHKFLADSKEDAVAQTLKAGLD 340
P C +LL Q +R EW G IV+DC +I +H+ D KE A A + +G D
Sbjct: 244 EPCCGSNRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDHETHPD-KETASAGAVLSGTD 302
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
L+CG Y + AVQ+G + E ID S+K L T LG D + S+ + S
Sbjct: 303 LECGNNYKSLP-EAVQKGLIDEKQIDISVKRLLTARFELGEMDEHVCWDSIPYSVVDSKA 361
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ +LA E AR+ IVLL+N N LPL +A++GP+AN +V GNY G P +
Sbjct: 362 HKDLALEIARKSIVLLQNRNNILPLKED--MKIALIGPNANDSVMQWGNYNGFPSHTSTL 419
Query: 461 IAGFSGYA---NVTYKTGCDDVACKSNNSIFA 489
+ Y GCD + S S+F+
Sbjct: 420 YEALKERIPANQLIYDFGCDRTSGISLESVFS 451
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 132/302 (43%), Gaps = 58/302 (19%)
Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
A+ + K AD + G+ S+E E + DR + LP Q +LI+++ ++ K
Sbjct: 593 ASIDKVKAADVIVFAGGISPSLEGEEMPVNAEGFKGGDRTTIELPAIQRRLISELKKLGK 652
Query: 539 GPVILVIMSAGGVDIAFAETNTNI-KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
P+I V S V + E + I AIL A YPG+ GG A+ADV+FG +NP G+LP+T+
Sbjct: 653 -PIIFVNYSGSAVGL---EPESKICDAILQAWYPGQAGGTAVADVLFGDYNPSGKLPVTF 708
Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
Y + LP GRTY++ LY FG+GLSYT F Y + ++
Sbjct: 709 YK--HTDQLP-------DFQDYSMKGRTYRYMTESPLYSFGHGLSYTNFTYGPATLSQ-- 757
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
+T G V + QN G+ DG +VV VY
Sbjct: 758 --------------------QTISQGKEVT---------LTIPVQNTGNYDGEEVVQVYL 788
Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIF 776
+ + F+RV + G+ + F ++ ++ D NT+ + G + +
Sbjct: 789 SCSGDKEGPS-HTLRAFKRVHIAKGQRANVSFTLDS-ETFQWFDTNTNTMRMVEGNYELL 846
Query: 777 VG 778
G
Sbjct: 847 YG 848
>gi|424792251|ref|ZP_18218496.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422797157|gb|EKU25539.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 909
Score = 282 bits (721), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 170/422 (40%), Positives = 236/422 (55%), Gaps = 44/422 (10%)
Query: 71 RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
+MT +EKV Q + A +PRLG+P YEWW+E LHG++ G AT FP I
Sbjct: 67 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 116
Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARDPRWGRIT 181
A++N +L +++G STEARA +NL AGLT WSPNIN+ RDPRWGR
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ET GEDP++ G+ AV ++ GLQ DL + P +++ KH A V + R
Sbjct: 177 ETYGEDPYLTGQLAVGFIHGLQ-------GDDL-THPRTIATP-KHLA---VHSGPEPGR 224
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
+ FD V+ D+E T+ F + +G A SVMC+YN ++G P+CA LLN +RG+W
Sbjct: 225 HGFDVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWG 284
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
G++V+DCD++ M H F AD+ + A LKAG DL+CG Y + G A+ +G
Sbjct: 285 FTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDL-GKAIARGDAD 342
Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLK 417
E +DKSL L+ RLG PQ Y LG +D+ S + LA +AA++ IVLL+
Sbjct: 343 EALLDKSLVRLFAARYRLGEL--QPQRKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQ 400
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYKT 474
N TLPL +AV+GP+A+A A+ NY G ++P+ G G ANV Y
Sbjct: 401 NRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANVRYAQ 458
Query: 475 GC 476
G
Sbjct: 459 GA 460
Score = 126 bits (316), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 138/286 (48%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR DL LP Q L+ + A+ + P+++V+MS V +
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 704
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+ + + AI+ A YPG+ GG AIA V+ G NPGGRLP+T+Y ++ L
Sbjct: 705 WAKQHAD--AIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFYR---------STKDLP 753
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+ FG GLSYT+F Y
Sbjct: 754 AYVSYDMKGRTYRYFKGEPLFAFGSGLSYTRFTYA------------------------- 788
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
P + L+ + + + +N G+ G +VV VY + P + A + ++ ++GF
Sbjct: 789 ------APQLSATTLQAGAHLQVRTQVRNSGTRAGDEVVQVYLEFP-QRAQSPLRTLVGF 841
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV ++ G + + F A + L+ VD A + G++ +FVG G
Sbjct: 842 QRVTLQPGEARDVSFEL-APRQLSDVDRAGQRAVQPGDYRVFVGGG 886
>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 871
Score = 281 bits (720), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 177/446 (39%), Positives = 243/446 (54%), Gaps = 48/446 (10%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
+F F + +L RV DLVSRM++DEK+ QL D + + RLG+P+Y WW+E+LHGV+
Sbjct: 22 ENFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPEYNWWNESLHGVARA 81
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------G 161
G AT FP I +S++ L + +S EARA ++ L R G
Sbjct: 82 G----------YATVFPQSISIASSWDRQLIFDVANVISDEARAKHHEYLRRGQHGMYQG 131
Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
LT+WSPN+N+ RDPRWGR ET GEDPF+ G+ + YV GLQ N + LKV
Sbjct: 132 LTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQGT---------NEKYLKV 182
Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
+ KHYA V + R+ F+A ++ D+ ET+L F VKEG SVM +YNR
Sbjct: 183 IATAKHYA---VHSGPEPSRHLFNAETSDIDLYETYLPAFRTLVKEGHVYSVMGAYNRFR 239
Query: 282 GIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
G A P L N +R W GYIV+DC ++ + HK D+ A A LK GLDL
Sbjct: 240 GESCSASPFLFN-ILRNVWGFDGYIVSDCGAVTDIWKYHKITGDAA-TASALALKDGLDL 297
Query: 342 DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDEN 401
+CG + + A+ + + E DID ++K L+T +LG FD + VS + + N
Sbjct: 298 ECGSSFKSLK-EAIDRKLISEADIDIAVKRLFTARFKLGMFD-PEEIVSYAQIPYSVNNN 355
Query: 402 IE---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
LA A+++ IVLLKN NTLPL S +KTVAV+GP+AN ++ GNY+G+P
Sbjct: 356 SAHDWLARVASQKSIVLLKNQNNTLPL-SRDIKTVAVIGPNANDVQSLWGNYSGVPS--- 411
Query: 459 SPIAGFSGYAN-------VTYKTGCD 477
+PI G N V Y G D
Sbjct: 412 NPITVLKGIQNKLEPNTKVLYAKGTD 437
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 99/309 (32%), Positives = 152/309 (49%), Gaps = 55/309 (17%)
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
A N + A + A ADA +++ GL+ +E E + DR L LP Q +L
Sbjct: 582 AEPQENVLQEAVQVAGQADAIVLVLGLNERLEGEEMKVEADGFEGGDRTSLDLPSNQEEL 641
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ + K PVILV+++ + I +A N ++ AIL AGYPG++GG AIADV+FG +NP
Sbjct: 642 MKAMTATGK-PVILVLINGSALSINWA--NDHVPAILTAGYPGQQGGNAIADVLFGDYNP 698
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
GRLP+T+Y + LP ++ GRTY+++ LYPFG+GLSYT+FKY+
Sbjct: 699 AGRLPVTYYKS--TEQLP-------AFENYDMKGRTYRYFQKKPLYPFGFGLSYTKFKYS 749
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
L L ++ + FE VD N+G DG
Sbjct: 750 NLK--------------------------------LPTNVTPEKDFEILVDVTNIGERDG 777
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+V+ +Y K I Q+ GF+RV ++ G K ++F + L++++ ++
Sbjct: 778 DEVIELYLKDEKASTPRPILQLEGFERVNLKKGETKTVRFTITP-RQLSLINKKGQRVIE 836
Query: 770 AGEHTIFVG 778
G TI VG
Sbjct: 837 PGWFTISVG 845
>gi|340616359|ref|YP_004734812.1| xylosidase/arabinosidase [Zobellia galactanivorans]
gi|339731156|emb|CAZ94420.1| Xylosidase/arabinosidase, family GH3 [Zobellia galactanivorans]
Length = 801
Score = 281 bits (720), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 245/864 (28%), Positives = 383/864 (44%), Gaps = 154/864 (17%)
Query: 12 SLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSR 71
+L I +L+FS + S ++ + F+K G ++ D + P +R++DL+S+
Sbjct: 5 ALIIGILLFSFPKELHSQSKQKIYHKNWVDFNKNG---KKDVYEDPTRPVDLRIEDLLSQ 61
Query: 72 MTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNVG----------------- 110
MTL+EK Q+ +G R+ LP +W ++ G+ N+
Sbjct: 62 MTLEEKSCQMATL-YGFGRVLKDELPTPDWKNQIWKDGIGNIDEQLNNLAYHPSAVTDKA 120
Query: 111 --PGTHFDDV--------------IP--------------GATSFPTVILTTASFNESLW 140
P H + IP ATSFP+ + A++N++L
Sbjct: 121 WPPSNHIKALNTIQEFFVEDTRLGIPVDFTNEGIRGLCHEKATSFPSQLGVGATWNKNLV 180
Query: 141 KKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYV 199
KIG EAR + G T +SP +++ARDPRWGR+ E GEDP++VG V
Sbjct: 181 GKIGHITGKEARLL------GYTNVYSPILDIARDPRWGRVVECYGEDPYLVGELGYQMV 234
Query: 200 RGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLR 259
+G+Q KV S KH+A Y DA +TE+++ +L
Sbjct: 235 KGIQQE--------------KVVSTPKHFAIYSAPKGGRDGDARTDAHITERELFSLYLH 280
Query: 260 PFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN 319
PF+ +K+ A VM SYN NG+P + LN +R +W GY+V+D +++ + D
Sbjct: 281 PFKRAIKDAGAMGVMSSYNDYNGVPVSSSKYFLNDILREDWGFKGYVVSDSRAVEFIADK 340
Query: 320 HKFLADSKEDAVAQTLKAGL----DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTV 375
H D K DAV Q + AGL D + + V++G + ID ++ + V
Sbjct: 341 HHVAKDRK-DAVRQAVLAGLNVRTDFTMPEDFILPVRELVKEGGLDMATIDDRVRDILRV 399
Query: 376 LMRLGFFDGSPQYVSLGKQDICSDENI------ELAAEAAREGIVLLKNDQNTLPLNSAK 429
G FD GKQ +D+ + E+A +A+ E IVLLKN++N LPL+ +K
Sbjct: 400 KFWQGLFDA-----PYGKQMKEADKTVGKPEYQEVAYQASLESIVLLKNEENILPLDFSK 454
Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYANVTYKTGC---DDVACK 482
K+V V GP+A A + Y +S G F + Y GC D+
Sbjct: 455 YKSVLVTGPNAKAINHSVSRYGPSHIDVVSVFDGIKEKFPKDVEIKYTKGCVFFDENWPD 514
Query: 483 S-----------NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLIN 531
S + I A AKT I++ G D ES R L LPG Q +L+
Sbjct: 515 SELMNTPPTEAEQSEIDKAVAMAKTVGLAIVVLGDDEETVGESRSRTSLDLPGNQQKLVE 574
Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
++ + PVI+V+++ + I + + + I+ + G+ GG AIADV+ G +NPGG
Sbjct: 575 EIYKTGT-PVIVVLINGRPMTINWV--DKYVPGIVEGWFQGKFGGSAIADVLVGSYNPGG 631
Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR----TYKFYNGPTLYPFGYGLSYTQFK 647
+LP+++ V LP+ + P +P P + + K G LYPFGYGLSYT F+
Sbjct: 632 KLPVSFPK--TVGQLPM-NFPSKPGAQADQPAKGPNGSGKTRVGGFLYPFGYGLSYTTFE 688
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
Y L I+ L V+V+ VD N G
Sbjct: 689 YTNLKIRSNIKNGLGD--------------------VVVS-----------VDITNSGKR 717
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
G ++V +Y Y KQ+ GF+R+ + AG K + F + + L++ + +
Sbjct: 718 KGDEIVQLYFSDETSSVTVYEKQLRGFERISLEAGETKTVNFTL-SPEDLSLYNRQMEFV 776
Query: 768 LPAGEHTIFVGNGGVSFPIHLNFN 791
L G TI +G+ IH++ N
Sbjct: 777 LEPGSFTIMIGSSAED--IHVSGN 798
>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
12058]
Length = 939
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 231/808 (28%), Positives = 367/808 (45%), Gaps = 142/808 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D + R++DL+S+MTLDEK Q+ +G R+ LP EW W
Sbjct: 49 VYEDPNATLDARIEDLLSQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGA 107
Query: 101 --EALHGVSNVG-PGTHFDDVIPG------------------------------------ 121
E L+G G P + +V P
Sbjct: 108 IDEHLNGFQQWGLPPSDNPNVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVES 167
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L ++G EAR + G T ++P ++V RD RWG
Sbjct: 168 YRATNFPTQLGLGHTWNRKLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWG 221
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + VRG+Q H + +V++ KH+ AY +
Sbjct: 222 RYEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKHFVAYSNNKGAR 268
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + PF+ +KE VM SYN +GIP L + +RG
Sbjct: 269 EGMARVDPQMSPREVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIPIQGSYYWLTKRLRG 328
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E GY+V+D D+++ + H D KE AV Q+++AGL++ C Y
Sbjct: 329 EMGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 387
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGI 413
V++G + E I+ ++ + V +G FD Q G +++ EN +A +A+RE +
Sbjct: 388 VKEGGLSEDIINDRVRDILRVKFLIGLFDAPYQTDLAGADKEVEKAENEAVALQASRESL 447
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYAN 469
+LLKN+ N LPL+ +KT+AV GP+AN + +Y + ++ + G G A
Sbjct: 448 ILLKNENNVLPLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAE 507
Query: 470 VTYKTGCDDVAC--------------KSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
V Y GCD V + I A E A+ AD +++ G E+
Sbjct: 508 VLYAKGCDLVDANWPESELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENK 567
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
R L LPG Q +L+ V K PV+LV+++ + I +A+ + AIL YPG +G
Sbjct: 568 SRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWAD--KFVPAILETWYPGSKG 624
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFYN 630
G A+ADV+FG +NPGG+L +T+ V +P + P +P +D PG N
Sbjct: 625 GTAVADVLFGDYNPGGKLTVTFPKS--VGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRVN 681
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
G +LYPFGYGLSYT F+Y+ + + + T++ T +R
Sbjct: 682 G-SLYPFGYGLSYTTFEYSNIEISPKMM--------------TANQKAT---------VR 717
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
C N G G +VV +Y + TY K + GF+RV ++ G K + F+
Sbjct: 718 C--------KVTNTGKRAGDEVVQLYIRDMLSSVTTYEKNLAGFERVHLQPGETKEVTFI 769
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ K L ++D ++ G+ +I VG
Sbjct: 770 LDR-KHLELLDKHMEWVVEPGDFSIMVG 796
>gi|333379224|ref|ZP_08470948.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
22836]
gi|332885492|gb|EGK05741.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
22836]
Length = 745
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 221/758 (29%), Positives = 358/758 (47%), Gaps = 105/758 (13%)
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY-----EWWSEALHG--VSNVG------- 110
V DL+ RMTL+EK+ Q + G + P E+ + + G + VG
Sbjct: 35 VDDLLRRMTLEEKIGQTVLYTSGYDVITGPTVDPNYKEYLKKGMVGGIFNAVGADYTRSL 94
Query: 111 ------------PGTHFDDVIPGA-TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
P DVI G T FP + + S++ ++ + ++EA A
Sbjct: 95 QKIAVEETRLGIPLIFGYDVIHGQRTIFPIPLAESCSWDLEAMERSARIAASEATA---- 150
Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
G+ + ++P ++++RDPRWGR+ E GED ++ A V+G Q +N + +N+
Sbjct: 151 --EGINWIYAPMVDISRDPRWGRVAEGAGEDVYLGSLIAAARVKGFQ----GDNLSAVNT 204
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
V +C KHYAAY G D D + E + T+L PF+ + G ++M S
Sbjct: 205 ----VVACVKHYAAYGA-TMAGRDYNTVDMSLNE--LWNTYLPPFKAALDAG-CGTIMTS 256
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
+N +NGIP+ + LL +R +W+ +G++V D SI M+ H + D K A +
Sbjct: 257 FNDLNGIPATGNKYLLKDILRDKWNFNGFVVTDYTSINEMIP-HGYANDEKHSA-EIAMN 314
Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ- 394
AG+D+D G Y N +++GKV E D+ ++ + + + +LG F+ +Y ++
Sbjct: 315 AGVDMDMQGGVYMNHLKTLIEEGKVSEKDVTEAARAILKIKYKLGLFEDPYRYCDANREK 374
Query: 395 -DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
DI + N E A + AR+ +VLLKND+ TLPL K VA++GP ++G ++ +
Sbjct: 375 TDILTPANKEAARDMARKSMVLLKNDKQTLPLKENK--RVALIGPLVKDKYEILGCWSAM 432
Query: 454 PCRYMSPIAGFSGYA------NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
R P++ + G ++Y GCD + + A A +D +++ G
Sbjct: 433 GNRDTIPVSVYDGLVEAIGKDKISYAKGCD-IQSEDTKGFAEAVRVASASDVVVMVMGEF 491
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
++ E+ R +L LPG Q L+ + + K PV+LV+M+ + I + + N + AIL
Sbjct: 492 HNMSGENNSRTNLSLPGVQVDLLKAIKKTGK-PVVLVLMNGRPLTINWEKDN--LDAILE 548
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRP-VDSLG 620
A +PG GG AIADV+ GK+NP G+L +T+ V +PL T P P V
Sbjct: 549 AWFPGTMGGAAIADVLTGKYNPSGKLTMTFPQN--VGQIPLFYNHKNTGRPYDPNVPQFA 606
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
Y R + N P LYPFGYGLSYT F Y+ L+ + N L+
Sbjct: 607 YGSRYWDVSNEP-LYPFGYGLSYTTFTYSDLTLSSKEITKENPLK--------------- 650
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
V N G DG +VV +Y++ +K++ GF++VF++
Sbjct: 651 ----------------VSVKLTNSGEYDGEEVVQLYTRDLVGSVTRPVKELKGFKKVFLK 694
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
AG +K I F + L + + G+ +FVG
Sbjct: 695 AGESKVIDFTL-SVNDLRFYNSQLEYVYEPGDFHLFVG 731
>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.4]
Length = 755
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 210/705 (29%), Positives = 349/705 (49%), Gaps = 122/705 (17%)
Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
++ AT+FP I ++++ L +++ + +A+ + SP ++V RDPRW
Sbjct: 97 MVKTATAFPQAIGLASTWDPDLIREVSSTIRYQAKLI-----GTNQCLSPVLDVCRDPRW 151
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNW 236
GR ET GED ++V + YV+GLQ EN ++ + KH+AA+ +
Sbjct: 152 GRCEETYGEDQYLVASIGLAYVKGLQG----EN---------ELIATVKHFAAHGFPEGG 198
Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
+ + H V +++ E FL PFE+ +K G A SVM +Y+ ++GIP ++ +LL + +
Sbjct: 199 RNIAPVH----VGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKIL 254
Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD-----LDCGQYYTNFT 351
R EW G +V+D D+I+ + HK + KE A+ L+AG+D +DC +
Sbjct: 255 RQEWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAIL-ALEAGVDTEFPNIDC---FGEPL 310
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAA 409
AV++G + E+ ID++++ + + +LG F+ Y++ + + + ++ ELA + A
Sbjct: 311 LEAVKEGLISESIIDRAVERVLRIKEKLGLFND--HYINENNVPEKLDNSKSRELALDVA 368
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY-------AGIPCRYMSPIA 462
R+ IVLLKND N LPLN + T+AV+GP+AN ++G+Y A + ++ +
Sbjct: 369 RKSIVLLKND-NILPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLE 426
Query: 463 GF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----- 509
G S NV Y GCD +A +S A E AK D I + +GL LS
Sbjct: 427 GIMRKVSNNTNVLYAKGCD-IAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVP 485
Query: 510 ----------VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
V E DR L LPG Q +L+ ++ + K P+ILV+++ G +A +
Sbjct: 486 GKDEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVN--GRPLALSSIF 542
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---R 614
+ AI+ A +PGEEGG AIADV+FG +NP GRLPI++ P+ + +P+ R
Sbjct: 543 NEVNAIIDAWFPGEEGGNAIADVIFGDYNPSGRLPISF---------PIDTGQIPIYYNR 593
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
SL R Y L+PFGYGLSYT+FKY+ L T
Sbjct: 594 KPSSL----RPYVMMKSKPLFPFGYGLSYTEFKYSNLEVTP------------------- 630
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
++ + ++ +NVG +G + V +Y + IK++ GF
Sbjct: 631 ------------KEVNSSGKIKISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGF 678
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+V+++ ++I F ++L D ++ G++ I +G
Sbjct: 679 AKVYLKPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGK 722
>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 909
Score = 281 bits (719), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 168/422 (39%), Positives = 237/422 (56%), Gaps = 44/422 (10%)
Query: 71 RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
+MT +EKV Q + A +PRLG+P YEWW+E LHG++ G AT FP I
Sbjct: 67 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 116
Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARDPRWGRIT 181
A++N +L +++G STEARA +NL AGLT WSPNIN+ RDPRWGR
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ET GEDP++ G+ AV ++RGLQ DL + P +++ KH A V + R
Sbjct: 177 ETYGEDPYLTGQLAVGFIRGLQ-------GDDL-THPRTIATP-KHLA---VHSGPEPGR 224
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
+ FD V+ D+E T+ F + +G A +VMC+YN ++G P+CA LLN +RG+W
Sbjct: 225 HGFDVDVSPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTPACAADWLLNGRLRGDWG 284
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
G++V+DCD++ M H F AD+ + A LKAG DL+CG Y + G A+ +G
Sbjct: 285 FTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDL-GKAIARGDAD 342
Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLK 417
E +D+SL L+ RLG PQ Y LG +D+ S + LA +AA++ IVLL+
Sbjct: 343 EAVLDQSLVRLFAARYRLGEL--QPQRKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQ 400
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYKT 474
N TLPL +AV+GP+A+A A+ NY G ++P+ G G AN+ Y
Sbjct: 401 NRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANLRYAQ 458
Query: 475 GC 476
G
Sbjct: 459 GA 460
Score = 129 bits (324), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 140/286 (48%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR DL LP Q L+ + A+ + P+++V+MS V +
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 704
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+ + + AI+ A YPG+ GG AIA V+ G NPGGRLP+T+Y ++ L
Sbjct: 705 WAKQHAD--AIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFYR---------STKDLP 753
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+ FG GLSYT+F Y Q++ LQ NL
Sbjct: 754 AYVSYDMKGRTYRYFKGEPLFAFGSGLSYTRFTY------AAPQLSATTLQAGANL---- 803
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+ + +N G+ G +VV VY +PP + A + ++ ++GF
Sbjct: 804 ---------------------QVRTQVRNSGTRAGDEVVQVYLQPP-QGAQSPLRTLVGF 841
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV ++ G + + F + L+ VD A + G++ +FVG G
Sbjct: 842 QRVTLQPGEAREVGFELTP-RQLSDVDRAGQRAVQPGDYRVFVGGG 886
>gi|440733337|ref|ZP_20913088.1| beta-glucosidase [Xanthomonas translucens DAR61454]
gi|440362904|gb|ELQ00083.1| beta-glucosidase [Xanthomonas translucens DAR61454]
Length = 895
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 169/422 (40%), Positives = 236/422 (55%), Gaps = 44/422 (10%)
Query: 71 RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
+MT +EKV Q + A +PRLG+P YEWW+E LHG++ G AT FP I
Sbjct: 53 KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 102
Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARDPRWGRIT 181
A++N +L +++G STEARA +NL AGLT WSPNIN+ RDPRWGR
Sbjct: 103 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 162
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ET GEDP++ G+ AV ++ GLQ DL + P +++ KH A V + R
Sbjct: 163 ETYGEDPYLTGQLAVGFIHGLQ-------GDDL-THPRTIATP-KHLA---VHSGPEPGR 210
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
+ FD V+ D+E T+ F + +G A SVMC+YN ++G P+CA LLN +RG+W
Sbjct: 211 HGFDVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWG 270
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
G++V+DCD++ M H F AD+ + A LKAG DL+CG Y + G A+ +G
Sbjct: 271 FTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDL-GKAIARGDAD 328
Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLK 417
E +D+SL L+ RLG PQ Y LG +D+ S + LA +AA++ IVLL+
Sbjct: 329 EALLDQSLVRLFAARYRLGEL--QPQRKDPYAQLGAKDVDSAAHRALALQAAQQSIVLLQ 386
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYKT 474
N TLPL +AV+GP+A+A A+ NY G ++P+ G G ANV Y
Sbjct: 387 NRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANVRYAQ 444
Query: 475 GC 476
G
Sbjct: 445 GA 446
Score = 129 bits (323), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 139/286 (48%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR DL LP Q L+ + A+ + P+++V+MS V +
Sbjct: 632 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 690
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+ + + AI+ A YPG+ GG AIA V+ G NPGGRLP+T+Y ++ L
Sbjct: 691 WAKQHAD--AIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFYR---------STKDLP 739
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
S GRTY+++ G L+ FG GLSYT+F Y Q++ LQ NL
Sbjct: 740 AYVSYDMKGRTYRYFKGEPLFAFGSGLSYTRFTY------AAPQLSATTLQAGANL---- 789
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+ + N G+ G +VV VY +PP + A + ++ ++GF
Sbjct: 790 ---------------------QVRTQVSNSGTRAGDEVVQVYLQPP-QGAQSPLRTLVGF 827
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV ++ G + + F + L+ VD A + G++ +FVG G
Sbjct: 828 QRVTLQPGEAREVGFELTP-RQLSDVDRAGQRAVQPGDYRVFVGGG 872
>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
Length = 756
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 220/697 (31%), Positives = 338/697 (48%), Gaps = 110/697 (15%)
Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
EALHG + G+T FP I +++N L ++ A+ E R+ R
Sbjct: 138 EALHGC-----------MAKGSTIFPQAIGMASTWNPELIYQVATAIGKETRS-----RG 181
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
SP IN+ARDPR GR ET GEDP++ R AV Y++G+Q+ +G
Sbjct: 182 IHQVLSPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQE-QG------------- 227
Query: 221 VSSCCKHYAAYDVDNWKGVDRY--HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
V + KH+ A V + G D Y HF R+ + E + F ++E A S+M +YN
Sbjct: 228 VIATPKHFVANFVGD-GGRDSYPIHFSERL----LREIYFPAFRASIEEAGALSLMAAYN 282
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
++GIP ++ LL + +R EW GY+V+D S+ ++ HK +A+SK +A +L+AG
Sbjct: 283 SLDGIPCSSNKWLLTRILRKEWGFKGYVVSDYFSVLHLMTKHK-VAESKAEAAKLSLEAG 341
Query: 339 LDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SPQYVS 390
LD+ DC + G +++ K+ + +D++++ + V +G FD P Y
Sbjct: 342 LDMELPDSDC---FEEIPG-LIRESKLSQDTLDEAVRRVLRVKFWIGLFDNPFVDPDYAE 397
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
+ + CS E+ ELA ARE IVLLKN + LPLN ++++AV+GP NA V +G Y
Sbjct: 398 --RINDCS-EHRELALRVARESIVLLKN-EGILPLNK-DIRSIAVIGP--NAAVPRLGGY 450
Query: 451 AGIPCRYMSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
+G + ++P+ G V + GC + S + A + A+ +D I+ G
Sbjct: 451 SGYGVKVVTPLEGIKNKLGDKVKVYFAEGCG-LNDTSKSGFDEAIKIAQKSDVAILFMGN 509
Query: 507 DL-SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
+ E E DR +L LPG Q LI ++ PVI+V+++ G I ++A+
Sbjct: 510 SVPETEGEQRDRHNLNLPGVQEDLIKEICN-TNTPVIVVLIN--GSAITMMNWIDKVQAV 566
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL--TSMPLRPVDS-LGYP 622
+ A YPGEEGG AIADV+FG +NPGG+LPI++ Y LPL P VD +
Sbjct: 567 IEAWYPGEEGGNAIADVLFGDYNPGGKLPISF--PKYSSQLPLYYNHKPSGRVDDYVDLR 624
Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
G Y L+PFGYGLSYT FKY+ NL T +
Sbjct: 625 GNQY-------LFPFGYGLSYTDFKYS-------------------NLRITPE------- 651
Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
++ D D +N+G G +VV +Y A IK++ F+RV + G
Sbjct: 652 -----EIPRDGEVVITFDIENIGKYKGDEVVQLYLHDEFASVARPIKELKRFERVTLDVG 706
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
K + F N + L + ++ G + +G+
Sbjct: 707 ERKTVSFKLNR-RDLEFLSMDMELVVEPGRFEVLIGS 742
>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
Length = 777
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 238/805 (29%), Positives = 367/805 (45%), Gaps = 153/805 (19%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSE-------- 101
++ D S P RVKDL+S+M +DEK Q+ +G R+ LP +W SE
Sbjct: 31 IYEDPSAPIEERVKDLLSQMNMDEKTCQMATL-YGSGRVLADALPTEKWKSEIWKDGIGN 89
Query: 102 ------------------------ALHGV-------SNVGPGTHF-DDVIPG-----ATS 124
A+H + + +G F ++ I G AT
Sbjct: 90 IDEEHNGLGKFGSEYAFPYAKHVKAIHDIQRWFVEETRLGIPVDFTNEGIRGVCHEKATF 149
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP +++N+ L +IG+ + EA A LG + +SP +++A+DPRWGR E
Sbjct: 150 FPAQCGQGSTWNKELIARIGEVEAKEAVA---LGYTNI--YSPILDIAQDPRWGRAVECY 204
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP++VG+ ++ LQ K+ + KH+A Y +
Sbjct: 205 GEDPYLVGQLGKQMIQSLQK--------------HKLVATPKHFAVYSIPVGGRDGGTRT 250
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
D V ++M +L PF + +E A VM SYN +G P + L Q +R EW G
Sbjct: 251 DPHVAPREMRTLYLEPFRVAFQEAGALGVMSSYNDYDGEPITGSYRFLTQILRQEWGFKG 310
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG---------NAV 355
Y+V+D D+++ + HK +AD+ E+AV Q++ AGL++ TNF+ +A+
Sbjct: 311 YVVSDSDAVEFISSKHK-VADNNEEAVVQSVNAGLNV-----RTNFSSPAGFIKPLRSAI 364
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSDENIELAAEAAREG 412
+GKV + ID+ + + V LG FD Y GK + + E+ +A EAAR+
Sbjct: 365 AKGKVSQATIDQRVSEILYVKFWLGLFDNP--YRGDGKLADKIVHCKEHQAVALEAARQS 422
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY----AGIPCRYMSPIAGFSGYA 468
IVLLKN N LPL +K+VAV+GP+A+ +I Y A I Y G A
Sbjct: 423 IVLLKNQDNLLPLQKT-LKSVAVIGPNADEQKELICRYGPSNAPIKTVYKGIKEALPG-A 480
Query: 469 NVTYKTGCD--------------DVACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAE 513
V YK GC+ D+ K + A EAAK+A+ I +L G +++V E
Sbjct: 481 KVVYKKGCEIVDPHFPESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGGSEVTVREE 540
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
R L LPG Q +L+ V ++ K P ILV++ I +A+ + AIL A +PGE
Sbjct: 541 R-SRTSLDLPGRQEELLKAVCKLGK-PTILVMIDGRASSINYAK--KYVPAILHAWFPGE 596
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
G+A+A+ +FG NPGG+L +T+ V +P + P +P G
Sbjct: 597 FCGQAVAETIFGDNNPGGKLAVTFPKS--VGQIPF-AFPFKPGSDSGCGTSVTG-----A 648
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFG+GLSYT F+YN L + Q L ++ K C
Sbjct: 649 LFPFGHGLSYTTFEYNNLKISPEQQGVLGEV-------------KVSC------------ 683
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
+N G G +VV +Y + TY+K + GF+R+ ++ K++ F +
Sbjct: 684 ------TVKNTGKRPGDEVVQLYLRDEISSVTTYVKILRGFERITLQPNEEKKVTFTLSP 737
Query: 754 CKSLNIVDYAANTLLPAGEHTIFVG 778
+ L I D + G + +G
Sbjct: 738 -QDLAIWDKNMKFQVEPGTFKVMIG 761
>gi|160884133|ref|ZP_02065136.1| hypothetical protein BACOVA_02110 [Bacteroides ovatus ATCC 8483]
gi|423291392|ref|ZP_17270240.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
CL02T12C04]
gi|156110475|gb|EDO12220.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392663392|gb|EIY56942.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
CL02T12C04]
Length = 735
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 222/768 (28%), Positives = 358/768 (46%), Gaps = 115/768 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ D+ P R+ DL+SRMTL+EK+ QL + G E E S +G
Sbjct: 29 LYKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85
Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
+FD D I G T +P + S+N L ++
Sbjct: 86 IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G +A VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199
Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
D EN ++++C KHY Y R + ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
M VK G A+++M S+N ++G+P A+P ++ + ++ W G+IV+D +++ + ++
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304
Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
LA +K+DA AGL++D + Y + V++GKV +D+S++ + V RLG
Sbjct: 305 LAATKKDAARYAFNAGLEMDMMSHAYDRYLKELVEEGKVTMAQVDESVRRVLRVKFRLGL 364
Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
F+ V+ K +++ +AA+ A E +VLLKND LPL + K +AVVGP A
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KKIAVVGPMAK 422
Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
++G++ G + Y A F G A + Y GC ++ S FA A + A
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKPQG--NDRSGFAGALDVA 480
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
+ +D I+ G L+ E+ R + LP Q +L+ ++ E K P+ILV+ + G +
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVL--SNGRPLE 537
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MP 612
AIL PG G R++A ++ G+ NP G+L +T P ++ +P
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAMT---------FPYSTGQIP 588
Query: 613 L---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+ R G+ G YK LYPFG+GLSYT+FKY +
Sbjct: 589 IYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV------------------ 629
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
T A+K ++ D +V N G+ DG++ V + P +K
Sbjct: 630 ---TPSATK----------VKRGDKLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVK 676
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
++ F++ F++AG K +F + + V+ L AGE+ I V
Sbjct: 677 ELKHFEKQFIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
12058]
Length = 863
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 183/461 (39%), Positives = 246/461 (53%), Gaps = 51/461 (11%)
Query: 52 FLFCDSSLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
FL C S PY R DLV R+TL+EK + + + +PRLG+ Y+WW+EALH
Sbjct: 18 FLSC-SQPPYKNPALTPEERAADLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALH 76
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-------NL 157
GV G AT FP I ASFN L + AVS EARA L
Sbjct: 77 GVGRAGL----------ATVFPQAIGMGASFNNDLLYDVFTAVSDEARAKTAEFSKEGGL 126
Query: 158 GR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
R GLT W+PN+N+ RDPRWGR ET GEDP++ G+ + VRGLQ EG
Sbjct: 127 KRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEG--------G 178
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMC 275
+ K+ +C KH+A + W +R+ FDA V +D+ ET+L F+ V++ VMC
Sbjct: 179 KYDKLHACAKHFAVHSGPEW---NRHSFDAENVDPRDLWETYLPAFKDLVQKAHVKEVMC 235
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQ 333
+YNR G P C +LL Q +R EW G IV+DC +I + H+ D KE A A+
Sbjct: 236 AYNRFEGEPCCGSNRLLVQILRDEWAYDGIIVSDCWAINDFFNKGAHETEPD-KEHASAK 294
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
+ G D++CG+ Y + AV+ G + E ID SLK L LG D +P+ VS +
Sbjct: 295 AVLTGTDVECGESYASLP-QAVKAGLIDEKKIDISLKRLMKARFELGEMD-NPELVSWAQ 352
Query: 394 ---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
+ S E+ ELA ARE +VLL+N+QN LPLN K VAVVGP+AN +V GNY
Sbjct: 353 IPYSVVDSKEHRELALRMARESLVLLQNNQNVLPLN--KSLKVAVVGPNANDSVMQWGNY 410
Query: 451 AGIPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
G P ++ + G Y A + Y+ GCD + + S+F
Sbjct: 411 NGFPGHTVTLLEGIRQYLPEAQLIYEPGCDLTSDVTLQSVF 451
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 128/298 (42%), Gaps = 56/298 (18%)
Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
+ K AD + G+ +VE E + DRE + LP Q++L+ AE+ K
Sbjct: 595 QRVKDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLL---AELKKAGK 651
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
+V ++ G IA AIL A YPG+ GG AIA+V+FG +NP GRLP+T+Y
Sbjct: 652 KVVFVNFSGSAIALTPETKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYK-- 709
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
++ L + GRTY++ L+PFG+GLSYT F+Y
Sbjct: 710 -------STSQLPDFEDYSMKGRTYRYMAEAPLFPFGHGLSYTTFRY------------- 749
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
DAS + +++ + + N G DG +VV VY + P
Sbjct: 750 ------------GDAS------LSTQEVKEGEQAILTIPVSNTGERDGEEVVQVYLRRPG 791
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
+ + F+RV + G + + + D NT+ P G++ I G
Sbjct: 792 DKEGPS-HALRAFKRVNIAKGTTGNVTISLSK-EDFEWFDTETNTMRPIEGDYEILYG 847
>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
DSM 18315]
Length = 868
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 165/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ + F + LP R+ DL+ R+T +EKV Q+ + + RLG+PQY+WW+EALHGV+
Sbjct: 22 RQEDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
G AT FP I A+F++ + VS EARA Y+ +
Sbjct: 82 RAGK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRY 131
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+W+PNIN+ RDPRWGR ET GEDP++ R V V+GLQ + +
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGD---------DPKYF 182
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K +C KHYA + W +R+ FD VT +D+ +T+L FE VKEG+ VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNR 239
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD------NHKFLADSKEDAVAQ 333
G P C+ KLL +R W I++DC +I + H+ D+ E A A
Sbjct: 240 YQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERTPRHETHPDA-ESASAD 298
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+ G DL+CG Y A++ GK+ E D+D SL+ L LG FD Q Y +
Sbjct: 299 AVLNGTDLECGNSYRALV-KALKDGKISENDLDVSLRRLLKGRFELGMFDPDEQVPYAQI 357
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S E++ A E A + +VLLKN NTLPL S ++ +AVVGP+A + + NY
Sbjct: 358 PYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAADSTMLWANYN 416
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
G P ++ + G V Y+ GC+ A
Sbjct: 417 GFPTHTVTILEGIRNKVPDTEVIYELGCNHAA 448
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 85/270 (31%), Positives = 124/270 (45%), Gaps = 56/270 (20%)
Query: 490 ASEAAKTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVA 537
A+ AAK DA +I+ G+ +E E + DR ++ LP Q +++ +
Sbjct: 596 AATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMVKALKATG 655
Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
K PV+ V+ + + + + E N I AIL A Y G+E G A+AD++FG +NP GRLP+T+
Sbjct: 656 K-PVVYVLCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTF 712
Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
Y + LP + GRTY++ LYPFGYGLSYT F Y
Sbjct: 713 YKS--IDQLP-------DFEDYSMKGRTYRYMTETPLYPFGYGLSYTNFAY--------- 754
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
RN +S G + D F D N G DG +V +Y
Sbjct: 755 ----------RNAKLSS--------GKIAKDQSVTLTF----DIANTGKMDGDEVAQIYI 792
Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
K P + IK + F RV V+AG ++ +
Sbjct: 793 KNPNDPEGP-IKALKAFLRVHVKAGDSQEV 821
>gi|408824590|ref|ZP_11209480.1| Glucan 1,4-beta-glucosidase [Pseudomonas geniculata N1]
Length = 897
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 172/445 (38%), Positives = 248/445 (55%), Gaps = 41/445 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D S + R LV++MTL+EK Q+ + A + RLG+P Y+WW+E LHGV+ G
Sbjct: 38 WLDVSASFEQRAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ-- 95
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-------GR-AGLTYW 165
AT FP I A+F+ L ++ +S EARA ++ GR GLT+W
Sbjct: 96 --------ATVFPQAIGLAATFDVPLMGQVAATISDEARAKHHQFLREGAHGRYQGLTFW 147
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPN+N+ RDPRWGR ET GEDP++ R V +VRGLQ D R K+ +
Sbjct: 148 SPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQ-------GDDPVYR--KLDATA 198
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH A V + DR+HFDAR + +D+ +T+L FE VKEGD +VM +YNRV G +
Sbjct: 199 KHLA---VHSGPEADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESA 255
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A LL +R +W GY+V+DC +I V + H + ++E A A ++ G +L+CGQ
Sbjct: 256 SASRFLLRDVLRRDWGFKGYVVSDCWAI-VDIWKHHRIVTTREAAAALAVRNGTELECGQ 314
Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE---NI 402
Y +AV+QG + E +ID ++ L+T MRLG FD P+ V + ++ +
Sbjct: 315 EYATLP-SAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPAHD 372
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
LA +AA+ +VLLKND LPL S + +AVVGP A+ T+A++GNY G P ++ +
Sbjct: 373 ALALKAAQASLVLLKND-GILPL-SRNTRRIAVVGPTADDTMALLGNYFGTPAAPVTILQ 430
Query: 463 GFSGYAN---VTYKTGCDDVACKSN 484
G A V Y G D V + +
Sbjct: 431 GIREAAKGVEVRYARGVDLVEGRDD 455
Score = 126 bits (316), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 87/289 (30%), Positives = 127/289 (43%), Gaps = 54/289 (18%)
Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
+ + GL VE E + DR DL LP Q L+ + K PV++V+ GG
Sbjct: 633 VFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHGTGK-PVVMVLT--GG 689
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
IA ++ AIL + YPG+ GG A+ +FG NP GRLP+T+Y +
Sbjct: 690 SAIAVDWAQAHLPAILMSWYPGQRGGTAVGQALFGDVNPSGRLPVTFYKAG-------EA 742
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
MP D GRTY+++ G LYPFG+GLSYT+F Y L
Sbjct: 743 MPA--FDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGTLRLD---------------- 784
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
+ LR D VD N G+ G +VV +Y + + +++
Sbjct: 785 ---------------ADSLRADGRLGVAVDVANTGTRSGDEVVQLYVRREHAGSGDAVQE 829
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVG 778
+ GFQRV + G + + F A ++L D A A + G + + VG
Sbjct: 830 LRGFQRVQLAPGERRTVTFTLEAAQALRHYDEARAAYAVQPGAYEVRVG 878
>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
CL03T12C01]
gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
CL03T12C01]
Length = 788
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 237/820 (28%), Positives = 369/820 (45%), Gaps = 157/820 (19%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
L+ + P RV+DL+S+MTL+EK Q+ +G R+ LPQ W +E G+ N
Sbjct: 42 LYENPKAPLEERVQDLLSQMTLEEKSCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGN 100
Query: 109 V-----GPGT-----------HFD-----------------------DVIPG-----ATS 124
+ G GT H D + I G AT
Sbjct: 101 IDEEHNGLGTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATY 160
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP A++N+ L +IG+ + EA A LG + +SP +++A+DPRWGR ET
Sbjct: 161 FPAQCGQGATWNKELIARIGEVEAKEAVA---LGYTNI--YSPILDIAQDPRWGRCVETY 215
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP++VG + LQ +L + P KH+A Y + +
Sbjct: 216 GEDPYLVGELGKQMITSLQK-------HNLVATP-------KHFAVYSIPVGGRDGKTRT 261
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
D V ++M ++ PF M +E A VM SYN +G P L + +R EW G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
Y+V+D ++++ + HK +A++ ED +AQ + AGL++ T+FT AV
Sbjct: 322 YVVSDSEAVEFISSKHK-VANTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREG 412
GK+ + +DK + + V LG FD Y GKQ + S E+ ++ EAAR+
Sbjct: 376 ADGKISQETLDKRVAEILRVKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYAN 469
+VLLKN+ N LPL S ++++AV+GP+A+ +I Y A P + + I +
Sbjct: 434 LVLLKNEMNLLPL-SKSLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTE 492
Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAES 514
V Y+ GCD + + + A AAK A+ + +L G +L+V E
Sbjct: 493 VIYRKGCDIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVR-ED 551
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q +L+ V K PV+LV++ I +A ++ AIL A +PGE
Sbjct: 552 RSRTSLNLPGRQEELLKAVCATGK-PVVLVLLDGRASSINYAA--AHVPAILHAWFPGEF 608
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
G+A+A+ +FG +NPGGRL +T+ V +P + P +P Y L
Sbjct: 609 CGQAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSSTSVYG-----VL 660
Query: 635 YPFGYGLSYTQFKYNLLSFT---KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
YPFG+GLSYT F Y L + + +Q ++N + C
Sbjct: 661 YPFGHGLSYTTFSYGDLKISPLRQGVQGDIN--------------------------ISC 694
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
+N G G +VV +Y + TY K + GF+R+ + AG + + F
Sbjct: 695 --------KIKNTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRL 746
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
+ L + D N + G+ + +G+ +H F
Sbjct: 747 RP-QDLGLWDKNMNFRVEPGKFKVMIGSSSTDIRLHGRFE 785
>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 846
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 165/442 (37%), Positives = 255/442 (57%), Gaps = 42/442 (9%)
Query: 42 FSKLGL-QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWS 100
F +LG+ M+ L+ + + P R++DL+S++T++EK+ L + G+ R+G+ +Y +
Sbjct: 10 FMQLGVVSMAQDLYKNMNAPIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGN 69
Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
EALHG+ + PG T FP I + +N L I +S EARA +N
Sbjct: 70 EALHGI--IRPGKF--------TVFPQAIGLASMWNPELHHIIASVISDEARARWNELER 119
Query: 161 G----------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
G LT+WSP +N+ARDPRWGR ET GEDP++ G +V+GLQ
Sbjct: 120 GKKQKDQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGD----- 174
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
+ R LK S KH+AA + ++ +R++ DA +TE DM E +L FE C++EG A
Sbjct: 175 ----HPRYLKSVSTPKHFAANNEEH----NRFYCDAAITETDMREYYLPAFEKCIREGKA 226
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
S+M +YN +NG+P A+ LLN+ ++ +W +GYIV+DC + +++ +H+++ + E A
Sbjct: 227 ESIMTAYNAINGVPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVK-TPEAA 285
Query: 331 VAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
+KAGLDL+CG Y + NA +Q V +ID + ++ MRLG FD +
Sbjct: 286 AMIAIKAGLDLECGDYVFGAPLLNAYKQYMVSTAEIDSAAYHVLRARMRLGMFDDPEKNP 345
Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
Y L + + +++ ELA EAAR+ IVLLKN +NTLPLN+ K+K++AVVG NA
Sbjct: 346 YNHLSPEIVGCEKHKELALEAARQSIVLLKNQKNTLPLNAKKIKSIAVVG--INAANCEF 403
Query: 448 GNYAGIPCRYMSPIAGFSGYAN 469
G+Y+G P +P++ G N
Sbjct: 404 GDYSGTPVN--APVSVLDGIRN 423
Score = 126 bits (316), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 92/289 (31%), Positives = 135/289 (46%), Gaps = 46/289 (15%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
AS+ + +D I + G++ S+E E DR + LP Q I + + P +V++ AG
Sbjct: 590 ASKVIRESDVVIAVMGINQSIEREGQDRSSIELPKDQQIFIREAYKA--NPNTIVVLVAG 647
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + NI AI+ A YPGE+GG AIA+V+FG +NP GRLP+T+YN ++ LP
Sbjct: 648 S-SMAVGWMDQNIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS--IEDLPAF 704
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+ D RTY ++ G LY FGYGLSYT+F Y RN
Sbjct: 705 N------DYNVKNNRTYMYFEGKPLYAFGYGLSYTKFDY-------------------RN 739
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
LN D+ F V +N G +G +V VY + P T +K
Sbjct: 740 LNIKQDSQNIT--------------LNFSV--KNSGKYNGDEVAQVYVQFPDLGIKTPLK 783
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
Q+ GF+RV ++ G ++I + D P+G + VG
Sbjct: 784 QLKGFKRVHIKKGATEQISIEIPKEELRLWDDQKKQFYTPSGTYNFMVG 832
>gi|320105647|ref|YP_004181237.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319924168|gb|ADV81243.1| glycoside hydrolase family 3 domain protein [Terriglobus saanensis
SP1PR4]
Length = 885
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 178/450 (39%), Positives = 244/450 (54%), Gaps = 42/450 (9%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
F + GL + D +L R +DLV RMTL+EK Q+ + A + RLG+P Y++WSE
Sbjct: 19 FGQSGLAQKP-AYLDPTLSPPARARDLVHRMTLEEKTAQMINTAPAIDRLGVPAYDFWSE 77
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA- 160
LHGV+ G AT FP I A+++E L +IG VSTEARA YN
Sbjct: 78 GLHGVARSG----------YATLFPQAIGMAATWDEPLMHEIGTVVSTEARAKYNDAVQH 127
Query: 161 -------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
GLT WSPNIN+ RDPRWGR ET GEDPF+ R +VRG+Q D
Sbjct: 128 GVHSIYFGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARMGTAFVRGIQ-------GDD 180
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
N + + KH+A V + R+ F+ V++ D+ +T+L F + EG A S+
Sbjct: 181 PNY--FRTIATPKHFA---VHSGPESTRHTFNVDVSQHDLWDTYLPAFRSTIIEGKADSI 235
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAV 331
MC+YNR++G P+CA LL Q +RG+W G++ +DC +I H F + KEDA
Sbjct: 236 MCAYNRIDGQPACASDLLLKQILRGDWGFRGFVTSDCGAIDDFYTKIGHHF-SKEKEDAS 294
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
A +KAG D CG+ Y T +AV+ G + E ++D SL+ L+ +RLG FD + Y
Sbjct: 295 AAGVKAGTDTACGKTYLGLT-SAVKSGLITEHEMDISLERLFEARIRLGLFDDPARMPYA 353
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
L ++ S + LA AARE IVLLKN N LPL+ VK +AV+GP+A + A+ GN
Sbjct: 354 RLTMAEVNSPAHRALALRAARESIVLLKNANNLLPLHG--VKNIAVIGPNAASLDALEGN 411
Query: 450 YAGIPCRYMSPIAGFSGY---ANVTYKTGC 476
Y I P+ G + A V Y G
Sbjct: 412 YNAIARDPAMPVDGIAAAFPGAKVVYAQGA 441
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 132/292 (45%), Gaps = 59/292 (20%)
Query: 498 DATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
D + GL +E E + DR D+ LP Q +L+ V K P+I+V+M+
Sbjct: 620 DVVVAFVGLSPELEGEEMPIKVKGFAGGDRTDIELPQTQLELLRAVKATGK-PLIVVLMN 678
Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
+ + +ET+ A+L A YPGE G +AIA+ + GK NP GRLP+T+Y+ + LP
Sbjct: 679 GSAIALKDSETD----ALLEAWYPGEAGAQAIAETLAGKNNPSGRLPLTFYSN--IDQLP 732
Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
D RTY+++ G LY FG GLSYT F+Y +S + T
Sbjct: 733 A-------FDDYSMANRTYRYFKGQPLYAFGGGLSYTTFRYGKVSLSAT----------- 774
Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAAT 726
L + + + N G G +V VY PP IA
Sbjct: 775 --------------------HLHAGEDLTVEAEVTNTGKVAGDEVAQVYLTPPQTSIAPR 814
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ ++G+QRV + G++K ++F + + L+ VD AG + I VG
Sbjct: 815 F--ALVGYQRVHLLPGQSKPMRFTLHP-RELSQVDAQGVRAASAGHYEIKVG 863
>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.14.25]
gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
M.16.27]
Length = 755
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 210/705 (29%), Positives = 348/705 (49%), Gaps = 122/705 (17%)
Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
++ AT+FP I ++++ L +++ + +A+ + SP ++V RDPRW
Sbjct: 97 MVKTATAFPQAIGLASTWDPDLIREVSSTIRYQAKLI-----GTNQCLSPVLDVCRDPRW 151
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNW 236
GR ET GED ++V + YV+GLQ EN ++ + KH+AA+ +
Sbjct: 152 GRCEETYGEDQYLVASIGLAYVKGLQG----EN---------ELIATVKHFAAHGFPEGG 198
Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
+ + H V +++ E FL PFE+ +K G A SVM +Y+ ++GIP ++ +LL + +
Sbjct: 199 RNIAPVH----VGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKIL 254
Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD-----LDCGQYYTNFT 351
R EW G +V+D D+I+ + HK + KE A+ L+AG+D +DC +
Sbjct: 255 RQEWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAIL-ALEAGVDTEFPNIDC---FGEPL 310
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAA 409
AV++G + E+ ID++++ + + +LG F+ Y++ + + + ++ ELA + A
Sbjct: 311 LEAVKEGLISESIIDRAVERVLRIKEKLGLFNN--HYINENNVPEKLDNSKSRELALDVA 368
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY-------AGIPCRYMSPIA 462
R+ IVLLKND N LPLN + T+AV+GP+AN ++G+Y A ++ +
Sbjct: 369 RKSIVLLKND-NILPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVVTVLE 426
Query: 463 GF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----- 509
G S NV Y GCD +A +S A E AK D I + +GL LS
Sbjct: 427 GIMRKVSNNTNVLYAKGCD-IAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVP 485
Query: 510 ----------VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
V E DR L LPG Q +L+ ++ + K P+ILV+++ G +A +
Sbjct: 486 GKDEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVN--GRPLALSSIF 542
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---R 614
+ AI+ A +PGEEGG AIADV+FG +NP GRLPI++ P+ + +P+ R
Sbjct: 543 NEVNAIIDAWFPGEEGGNAIADVIFGDYNPSGRLPISF---------PIDTGQIPIYYNR 593
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
SL R Y L+PFGYGLSYT+FKY+ L T
Sbjct: 594 KPSSL----RPYVMMKSKPLFPFGYGLSYTEFKYSNLEVTP------------------- 630
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
++ + ++ +NVG +G + V +Y + IK++ GF
Sbjct: 631 ------------KEVNSSGKIKISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGF 678
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+V+++ ++I F ++L D ++ G++ I +G
Sbjct: 679 AKVYLKPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGK 722
>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 853
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 165/430 (38%), Positives = 247/430 (57%), Gaps = 41/430 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + + P RV DL+SR+T++EK+ L + G+PRLG+ +Y +EALHGV V PG
Sbjct: 29 LYKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 86
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L K++ +S EARA +N G L
Sbjct: 87 RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 138
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G +V GLQ + H LK+
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHY---------LKIV 189
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E + FEMCVKEG A+S+M +YN +N
Sbjct: 190 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALND 245
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK++ +KE A ++KAGLDL+
Sbjct: 246 VPCTLNSWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKAGLDLE 304
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y NA +Q V + DID + ++ T M+LG FD + Y + I S
Sbjct: 305 CGDDVYDGPLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDSGERNPYTKISPSVIGSK 364
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ ++A +AAR+ IVLLKN +N LPLN+ K+K++AVVG NA G+Y+G P +
Sbjct: 365 EHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAPV--VE 420
Query: 460 PIAGFSGYAN 469
P++ G N
Sbjct: 421 PVSILQGIRN 430
Score = 129 bits (323), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 88/293 (30%), Positives = 142/293 (48%), Gaps = 54/293 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 597 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 654
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + ++ AI+ A YPGE+GG A+A+V+FG +NP GRLP+T+Y L
Sbjct: 655 S-SLAVNWMDEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-------LD 706
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTYK++ G LYPFGYGLSY+ F Y+ L
Sbjct: 707 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYSDLQVK-------------- 750
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAAT 726
D E V F +N G +G +V VY + P
Sbjct: 751 -----------------------DGGDEVTVSFRLKNTGKRNGDEVAQVYVRIPETGGIV 787
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
+K++ GF+RV +++G ++R++ + + L D ++P G + VG
Sbjct: 788 PLKELKGFRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVG 839
>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
Length = 868
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 167/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ + F + LP R+ DL+SR+T +EK+ Q+ + + RLG+P Y+WW+EALHGV+
Sbjct: 22 KQQDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
G AT FP I A+F+++ + VS EARA Y+ +
Sbjct: 82 RAG----------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRY 131
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+W+PNIN+ RDPRWGR ET GEDP++ + V RGLQ D N
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K +C KHYA + W +R+ F+A T +D+ ET+L FE VKEGD VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
G P C+ KLL +R W I++DC +I H+ D+ E A A
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+ G DL+CG Y A+ GK+ E D+D SL+ L LG FD + Y +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S E+I A + AR+ IVLLKN N LPL+ +K +AVVGP+A + + NY
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
G P + ++ + G A V Y+ GC+ A
Sbjct: 417 GFPSKTVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 114/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)
Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
F+G+P Y L K+ + N+ L AR +G V K ND
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPVEFKLSGNDAFR 536
Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
L +++AKV V G T+ A G I YM +G A++ + G
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
++ A + K AD + + G+ +E E + DR ++ +P Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ + K PV+ V+ + G +A N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
GRLPIT+Y V LP GRTY++ LYPFGYGLSYT F Y
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
+K + + ++ D N G DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+V +Y K P + A +K + F+RV V+AG + + + D +
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVR 843
Query: 770 AGEHTIFVG 778
G++ I G
Sbjct: 844 PGKYQILYG 852
>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
2_1_33B]
Length = 868
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 167/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ + F + LP R+ DL+SR+T +EK+ Q+ + + RLG+P Y+WW+EALHGV+
Sbjct: 22 KQQDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
G AT FP I A+F+++ + VS EARA Y+ +
Sbjct: 82 RAG----------RATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQYQKDKEYDRY 131
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+W+PNIN+ RDPRWGR ET GEDP++ + V RGLQ D N
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K +C KHYA + W +R+ F+A T +D+ ET+L FE VKEGD VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
G P C+ KLL +R W I++DC +I H+ D+ E A A
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+ G DL+CG Y A+ GK+ E D+D SL+ L LG FD + Y +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S E+I A + AR+ IVLLKN N LPL+ +K +AVVGP+A + + NY
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
G P + ++ + G A V Y+ GC+ A
Sbjct: 417 GFPSKTVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 113/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)
Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
F+G+P Y L K+ + N+ L AR +G + K ND
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPIEFKLSGNDAFR 536
Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
L +++AKV V G T+ A G I YM +G A++ + G
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
++ A + K AD + + G+ +E E + DR ++ +P Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ + K PV+ V+ + G +A N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
GRLPIT+Y V LP GRTY++ LYPFGYGLSYT F Y
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
+K + + ++ D N G DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+V +Y K P + A +K + F+RV V+AG + + + D +
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVR 843
Query: 770 AGEHTIFVG 778
G++ I G
Sbjct: 844 PGKYQILYG 852
>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
CL02T00C15]
gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
CL02T12C06]
gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
CL02T00C15]
gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
CL02T12C06]
Length = 788
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 235/820 (28%), Positives = 367/820 (44%), Gaps = 157/820 (19%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
L+ + P RV+DL+S+MTL+EK Q+ +G R+ LPQ W +E G+ N
Sbjct: 42 LYENPKAPLEERVQDLLSQMTLEEKSCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGN 100
Query: 109 V-----GPGT-----------HFD-----------------------DVIPG-----ATS 124
+ G GT H D + I G AT
Sbjct: 101 IDEEHNGLGTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATY 160
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP A++N+ L +IG+ + EA A+ +SP +++A+DPRWGR ET
Sbjct: 161 FPAQCGQGATWNKELIARIGEVEAKEAVAL-----EYTNIYSPILDIAQDPRWGRCVETY 215
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDP++VG + LQ +L + P KH+A Y + +
Sbjct: 216 GEDPYLVGELGKQMITSLQK-------HNLVATP-------KHFAVYSIPVGGRDGKTRT 261
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
D V ++M ++ PF M +E A VM SYN +G P L + +R EW G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
Y+V+D ++++ + HK +A++ ED +AQ + AGL++ T+FT AV
Sbjct: 322 YVVSDSEAVEFISSKHK-VANTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREG 412
GK+ + +DK + + V LG FD Y GKQ + S E+ ++ EAAR+
Sbjct: 376 ADGKISQETLDKRVAEILRVKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYAN 469
+VLLKN+ N LPL S ++++AV+GP+A+ +I Y A P + + I +
Sbjct: 434 LVLLKNEMNLLPL-SKSLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTE 492
Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAES 514
V Y+ GCD + + + A AAK A+ + +L G +L+V E
Sbjct: 493 VIYRKGCDIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVR-ED 551
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q +L+ V K PV+LV++ I +A ++ AIL A +PGE
Sbjct: 552 RSRTSLNLPGRQEELLKAVCATGK-PVVLVLLDGRASSINYAA--AHVPAILHAWFPGEF 608
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
G+A+A+ +FG +NPGGRL +T+ V +P + P +P Y L
Sbjct: 609 CGQAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSSTSVYG-----VL 660
Query: 635 YPFGYGLSYTQFKYNLLSFT---KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
YPFG+GLSYT F Y L + + +Q ++N + C
Sbjct: 661 YPFGHGLSYTTFSYGDLKISPLRQGVQGDIN--------------------------ISC 694
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
+N G G +VV +Y + TY K + GF+R+ + AG + + F
Sbjct: 695 --------KIKNTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRL 746
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
+ L + D N + G+ + +G+ +H F
Sbjct: 747 RP-QDLGLWDKNMNFRVEPGKFKVMIGSSSTDIRLHGRFE 785
>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
CL02T12C29]
Length = 868
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 164/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ + F + LP R+ DL+ R+T +EKV Q+ + + RLG+PQY+WW+EALHGV+
Sbjct: 22 RQEDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
G AT FP I A+F++ + VS EARA Y+ +
Sbjct: 82 RAGK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRY 131
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+W+PNIN+ RDPRWGR ET GEDP++ R V V+GLQ + +
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGD---------DPKYF 182
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K +C KHYA + W +R+ FD VT +D+ +T+L FE VKEG+ VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNR 239
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD------NHKFLADSKEDAVAQ 333
G P C+ KLL +R W I++DC +I + H+ D+ E A A
Sbjct: 240 YQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERTPRHETHPDA-ESASAD 298
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+ G DL+CG Y A++ GK+ E D+D SL+ L LG FD + Y +
Sbjct: 299 AVLNGTDLECGNSYRALV-KALKDGKISENDLDVSLRRLLKGRFELGMFDPDERVPYAQI 357
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S E++ A E A + +VLLKN NTLPL S ++ +AVVGP+A + + NY
Sbjct: 358 PYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAADSTMLWANYN 416
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
G P ++ + G V Y+ GC+ A
Sbjct: 417 GFPTHTVTILEGIRNKVPDTEVIYELGCNHAA 448
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/270 (31%), Positives = 124/270 (45%), Gaps = 56/270 (20%)
Query: 490 ASEAAKTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVA 537
A+ AAK DA +I+ G+ +E E + DR ++ LP Q +++ +
Sbjct: 596 AATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMVKALKATG 655
Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
K PV+ V+ + + + + E N I AIL A Y G+E G A+AD++FG +NP GRLP+T+
Sbjct: 656 K-PVVYVLCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTF 712
Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
Y + LP + GRTY++ LYPFGYGLSYT F Y
Sbjct: 713 YKS--IDQLP-------DFEDYSMKGRTYRYMTETPLYPFGYGLSYTNFAY--------- 754
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
RN +S G + D F D N G DG ++ +Y
Sbjct: 755 ----------RNAKLSS--------GKIAKDQSVTLTF----DIANTGKMDGDEIAQIYI 792
Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
K P + IK + F RV V+AG ++ +
Sbjct: 793 KNPNDPEGP-IKALKAFLRVHVKAGDSQEV 821
>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
CL03T12C09]
Length = 868
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 167/452 (36%), Positives = 234/452 (51%), Gaps = 44/452 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ + F + LP R+ DL+SR+T +EK+ Q+ + + RLG+P Y+WW+EALHGV+
Sbjct: 22 KQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
G AT FP I A+F+++ + VS EARA Y+ +
Sbjct: 82 RAG----------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRY 131
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+W+PNIN+ RDPRWGR ET GEDP++ + V RGLQ D N
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K +C KHYA + W +R+ FD T +D+ ET+L FE VKEGD VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
G P C+ KLL +R W I++DC +I H+ D+ E A A
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+ G DL+CG Y A+ GK+ E D+D SL+ L LG FD + Y +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S E+I A + AR+ IVLLKN N LPL+ +K +AVVGP+A + + NY
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
G P + ++ + G A V Y+ GC+ A
Sbjct: 417 GFPTKTVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 114/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)
Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
F+G+P Y L K+ + N+ L AR +G V K ND
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPVEFKLSGNDAFR 536
Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
L +++AKV V G T+ A G I YM +G A++ + G
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
++ A + K AD + + G+ +E E + DR ++ +P Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ + K PV+ V+ + G +A N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
GRLPIT+Y V LP GRTY++ LYPFGYGLSYT F Y
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
+K + + ++ D N G DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+V +Y K P + A +K + F+RV V+AG + + + D +
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSAQPVSIQLEPKAFQSFNDNTQTMEVR 843
Query: 770 AGEHTIFVG 778
G++ I G
Sbjct: 844 PGKYQILYG 852
>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
8503]
gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
CL09T03C24]
Length = 868
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 167/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ + F + LP R+ DL+SR+T +EK+ Q+ + + RLG+P Y+WW+EALHGV+
Sbjct: 22 KQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
G AT FP I A+F+++ + VS EARA Y+ +
Sbjct: 82 RAG----------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRY 131
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+W+PNIN+ RDPRWGR ET GEDP++ + V RGLQ D N
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K +C KHYA + W +R+ F+A T +D+ ET+L FE VKEGD VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
G P C+ KLL +R W I++DC +I H+ D+ E A A
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+ G DL+CG Y A+ GK+ E D+D SL+ L LG FD + Y +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S E+I A + AR+ IVLLKN N LPL+ +K +AVVGP+A + + NY
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
G P + ++ + G A V Y+ GC+ A
Sbjct: 417 GFPTKTVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 114/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)
Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
F+G+P Y L K+ + N+ L AR +G V K ND
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPVEFKLSGNDAFR 536
Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
L +++AKV V G T+ A G I YM +G A++ + G
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
++ A + K AD + + G+ +E E + DR ++ +P Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ + K PV+ V+ + G +A N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
GRLPIT+Y V LP GRTY++ LYPFGYGLSYT F Y
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
+K + + ++ D N G DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+V +Y K P + A +K + F+RV V+AG + + + D +
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVR 843
Query: 770 AGEHTIFVG 778
G++ I G
Sbjct: 844 PGKYQILYG 852
>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
Length = 868
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 167/452 (36%), Positives = 234/452 (51%), Gaps = 44/452 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ + F + LP R+ DL+SR+T +EK+ Q+ + + RLG+P Y+WW+EALHGV+
Sbjct: 22 KQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
G AT FP I A+F+++ + VS EARA Y+ +
Sbjct: 82 RAG----------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRY 131
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+W+PNIN+ RDPRWGR ET GEDP++ + V RGLQ D N
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K +C KHYA + W +R+ FD T +D+ ET+L FE VKEGD VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
G P C+ KLL +R W I++DC +I H+ D+ E A A
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+ G DL+CG Y A+ GK+ E D+D SL+ L LG FD + Y +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S E+I A + AR+ IVLLKN N LPL+ +K +AVVGP+A + + NY
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
G P + ++ + G A V Y+ GC+ A
Sbjct: 417 GFPTKTVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 114/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)
Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
F+G+P Y L K+ + N+ L AR +G V K ND
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPVEFKLSGNDAFR 536
Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
L +++AKV V G T+ A G I YM +G A++ + G
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
++ A + K AD + + G+ +E E + DR ++ +P Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ + K PV+ V+ + G +A N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
GRLPIT+Y V LP GRTY++ LYPFGYGLSYT F Y
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
+K + + ++ D N G DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+V +Y K P + A +K + F+RV V+AG + + + D +
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVR 843
Query: 770 AGEHTIFVG 778
G++ I G
Sbjct: 844 PGKYQILYG 852
>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
D13]
Length = 868
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 167/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ + F + LP R+ DL+SR+T +EK+ Q+ + + RLG+P Y+WW+EALHGV+
Sbjct: 22 KQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
G AT FP I A+F+++ + VS EARA Y+ +
Sbjct: 82 RAG----------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRY 131
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+W+PNIN+ RDPRWGR ET GEDP++ + V RGLQ D N
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K +C KHYA + W +R+ F+A T +D+ ET+L FE VKEGD VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
G P C+ KLL +R W I++DC +I H+ D+ E A A
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+ G DL+CG Y A+ GK+ E D+D SL+ L LG FD + Y +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S E+I A + AR+ IVLLKN N LPL+ +K +AVVGP+A + + NY
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
G P + ++ + G A V Y+ GC+ A
Sbjct: 417 GFPSKTVTIVEGIRNKVPNAEVIYELGCNHTA 448
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 114/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)
Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
F+G+P Y L K+ + N+ L AR +G V K ND
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPVEFKLSGNDAFR 536
Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
L +++AKV V G T+ A G I YM +G A++ + G
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
++ A + K AD + + G+ +E E + DR ++ +P Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ + K PV+ V+ + G +A N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
GRLPIT+Y V LP GRTY++ LYPFGYGLSYT F Y
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
+K + + ++ D N G DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+V +Y K P + A +K + F+RV V+AG + + + D +
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEIR 843
Query: 770 AGEHTIFVG 778
G++ I G
Sbjct: 844 PGKYQILYG 852
>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
Length = 876
Score = 278 bits (712), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 162/461 (35%), Positives = 245/461 (53%), Gaps = 38/461 (8%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ FLF + L + RV DLV+R+TL+EKV Q+ + + +PRL +P Y+WW+E LHGV+
Sbjct: 25 KQKEFLFQNPDLSFEKRVDDLVNRLTLEEKVSQMLNSSPAIPRLDIPAYDWWNETLHGVA 84
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL----GRA--- 160
T F T +P I A+F+++ K+ + E RA+YN GR
Sbjct: 85 R----TPFK-----VTVYPQAIAMAATFDKNSLYKMADFSALEGRAIYNKAVESGRTNER 135
Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
GLTYW+PNIN+ RDPRWGR ET GEDP++ G ++V+GLQ + +
Sbjct: 136 YLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGVLGDSFVKGLQGDD---------PKY 186
Query: 219 LKVSSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
LK ++C KHYA + G + R+ FD VT ++ +T+L F+ V E + VMC+
Sbjct: 187 LKAAACAKHYAVHS-----GPEPLRHTFDVDVTPYELWDTYLPAFQKLVTESKVAGVMCA 241
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
YN P CA L+ +R +W GY+ +DC +I NHK D+ E A A +
Sbjct: 242 YNAFRTQPCCASDILMTDILRNQWKFEGYVTSDCWAIDDFFKNHKTHPDA-ESASADAVF 300
Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQ 394
G D+DCG AV+ GK+ E ID S+K L+ + RLG FD +Y
Sbjct: 301 HGTDIDCGTDAYKALVQAVKDGKISEKQIDISVKRLFMIRFRLGMFDPVEMVKYAQTPTS 360
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
+ +DE+ A + AR+ IVLL+N+ TLPL S K+K + V+GP+ + +A++GNY G P
Sbjct: 361 VLENDEHKAHALKMARQSIVLLRNENKTLPL-SKKLKKIVVLGPNVDNAIAILGNYNGTP 419
Query: 455 CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
+ + + G + + +N+++ S+ K
Sbjct: 420 SKLTTVLEGIKEKVGSNTEVVYEKAVNFTNDTLLVYSDVKK 460
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/268 (29%), Positives = 119/268 (44%), Gaps = 54/268 (20%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
K ADA + + G+ +E E + DR + LP QT L+ + K P++ V
Sbjct: 606 KDADAFVFVGGISPQLEGEEMKVNFPGFKGGDRTSILLPKIQTDLMKALKTTGK-PIVFV 664
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
+M+ + I + N I AI A Y G+ G A+ADV+FG +NP GRLP+T+Y D
Sbjct: 665 MMTGSAIAIPWEAEN--IPAIANAWYGGQAAGTAVADVLFGNYNPAGRLPVTFYKSD--- 719
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
L P RTY+++ G LY FGYGLSYT FKY+ L ++
Sbjct: 720 ------ADLSPFVDYKMDNRTYRYFKGKPLYGFGYGLSYTTFKYDNLKIAPSV------- 766
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
+N+ T V N G G +VV +Y
Sbjct: 767 IKGKNVPIT-------------------------VKVTNTGKVSGEEVVQLYVINQNTAI 801
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
+K + GF+R+ ++AG++K I F +
Sbjct: 802 KAPLKTLKGFERISLKAGKSKTITFTLS 829
>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 766
Score = 278 bits (712), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 221/738 (29%), Positives = 339/738 (45%), Gaps = 111/738 (15%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
E V + +A RLG+P + E HG +G AT FP + +++
Sbjct: 90 EAVNVIQRYAIEHSRLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTW 137
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
N L++ + +AV+ E R+ + G +SP ++V RDPRWGR ET GEDP +V +A
Sbjct: 138 NPELFRSMCRAVAAETRS-----QGGAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFA 192
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGVDRYHFDARVTEQDME 254
V V+GLQ +A D + + KH+A Y + + H R ++
Sbjct: 193 VAAVQGLQG--DRLDAED------SLLATLKHFAGYGASEGGRNGAPVHMGLR----ELH 240
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
E L PF V+ G A SVM +YN ++G+P + LL+ +R W G+++ DC +I
Sbjct: 241 EIDLLPFRKAVEAG-AQSVMTAYNEIDGVPCTSSRYLLHDVLREAWGFDGFVITDCGAID 299
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
++ H A S E+A AQ L AG+D++ G + + A++QG + E D++ ++ +
Sbjct: 300 MLKSGHNTAA-SGEEAAAQALTAGVDMEMSGSMFRVYLRQALEQGHITEDDLNTAVGRVL 358
Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
+ RLG FD ++ I +E+IELA A EGIVLLKN+ N LPLN K +
Sbjct: 359 AMKFRLGLFDRPYTDPERAEKVIGCEEHIELARRVAAEGIVLLKNEGNVLPLNP-KTGKI 417
Query: 434 AVVGPHANATVAMIGNYAG--IPCRYMSPIAGFSGY------ANVTYKTGCDDVACKSNN 485
AV+GP+ANA +G+Y P + ++ + G + V Y GC + S
Sbjct: 418 AVIGPNANAPYNQLGDYTSPQPPGQIITVLEGIRRHIGEDADTRVLYAPGC-RIQGDSRE 476
Query: 486 SIFAASEAAKTADATIILAG-----------LDLSVEA--------------ESLDREDL 520
+ A A AD ++ G +DL A E +DR L
Sbjct: 477 GLSHALACAAEADVIVMAIGGSSARDFGEGTIDLRTGASVVTGLAQSDMECGEGIDRSTL 536
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L G Q +L+ ++ ++ K PV++V ++ G I + +I AIL A YPG+EGG AIA
Sbjct: 537 HLMGVQLELLQEIHKLGK-PVVVVYIN--GRPITEPWIDEHIPAILEAWYPGQEGGSAIA 593
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
D++FG NP GRL +T V LP+ R G+ Y + YPFGYG
Sbjct: 594 DILFGDVNPSGRLTLTIPK--EVGQLPINYNAKR------TRGKRYLETDLEPRYPFGYG 645
Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
LSYT F Y LS + + D ++
Sbjct: 646 LSYTDFHYGNLSVEPAV-------------------------------IPADGSAAVRIV 674
Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
N G DG++VV +Y A K + F +VF++AG ++ + F + L ++
Sbjct: 675 VTNTGPRDGAEVVQLYVSDLAASVTRPEKALKAFSKVFLKAGESREVTFTVGP-EQLELI 733
Query: 761 DYAANTLLPAGEHTIFVG 778
++ GE I VG
Sbjct: 734 GPDMKAVVEPGEFRIRVG 751
>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
6_1_58FAA_CT1]
Length = 853
Score = 278 bits (712), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 165/440 (37%), Positives = 251/440 (57%), Gaps = 41/440 (9%)
Query: 43 SKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEA 102
+ + + + L+ D + P R+ DL+SR+T++EK+ L + G+PRLG+ +Y +EA
Sbjct: 19 TSVAVAQTKELYKDMNAPQHERIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEA 78
Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG- 161
LHGV V PG T FP I + +N L +I A+S EAR +N G
Sbjct: 79 LHGV--VRPGNF--------TVFPQAIGLASMWNPELLYEISTAISDEARGRWNELNRGK 128
Query: 162 ---------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
LT+WSP +N+ARDPRWGR ET GEDPF+ G+ V +V+GLQ G++
Sbjct: 129 DQKGFFSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQ---GND--- 182
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
R LK+ S KH+AA + ++ +R+ + ++E+++ E +L FE C+KEG A S
Sbjct: 183 ---PRYLKIVSTPKHFAANNEEH----NRFECNPHISERNLREYYLPAFESCIKEGKAQS 235
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
+M +YN +N +P +P LL Q +R EW +GY+V+DC +V +HK++ + E A
Sbjct: 236 IMSAYNAINDVPCTLNPWLLTQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVK-TPEAAAT 294
Query: 333 QTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
++KAGLDL+CG Y NA +Q V + DID + + M LG FD + Y
Sbjct: 295 LSIKAGLDLECGDNVYIEPLMNAYKQCMVTDADIDTAAYRILRARMMLGLFDDPEKNPYN 354
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
++ + +++ +LA EAAR+ +VLLKN++N LPLN KVK++AVVG NA G+
Sbjct: 355 AISPSIVGCEKHRQLALEAARQSLVLLKNEKNFLPLNPKKVKSIAVVG--INAGNCEFGD 412
Query: 450 YAGIPCRYMSPIAGFSGYAN 469
Y+G P +P++ G N
Sbjct: 413 YSGTPVN--APVSVLEGIKN 430
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 90/291 (30%), Positives = 132/291 (45%), Gaps = 48/291 (16%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A A + D T+ + G++ S+E E DR + LP Q I + + V++++
Sbjct: 597 AGRAIRECDVTVAVLGINKSIEREGQDRYTIELPADQQLFIKEAYKANPNTVVVLV---A 653
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
G +A + NI AIL A YPGE+GG A+A+ +FG +NPGGRLP+T+Y + LP
Sbjct: 654 GSSLAINWIDENIPAILNAWYPGEQGGTAVAEALFGDYNPGGRLPLTYYRS--LDELPAF 711
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
D GRTY ++ LYPFGYGLSYT+F Y
Sbjct: 712 D------DYDIQKGRTYMYFENKPLYPFGYGLSYTRFDYK-------------------- 745
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
N S+ S DD K +N G G +V VY + P +K
Sbjct: 746 -NLKSEVS--------------DDAVNLKFTVKNTGKYAGDEVAQVYVRFPESGIKVPLK 790
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVGN 779
Q+ GF+RV + G++ ++ V K L + D P+G + VG+
Sbjct: 791 QLKGFERVHIGKGKSAQVS-VSIPKKELRLWDEKDGKFYTPSGNYIFMVGS 840
>gi|262405981|ref|ZP_06082531.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|345510488|ref|ZP_08790055.1| beta-glucosidase [Bacteroides sp. D1]
gi|262356856|gb|EEZ05946.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|345454434|gb|EEO48987.2| beta-glucosidase [Bacteroides sp. D1]
Length = 735
Score = 278 bits (712), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 224/768 (29%), Positives = 356/768 (46%), Gaps = 115/768 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ D+ P R+ DL+SRMTL+EKV QL + G E E S +G
Sbjct: 29 LYKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85
Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
+FD D I G T +P + S+N L ++
Sbjct: 86 IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G +A VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199
Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
D EN ++++C KHY Y R + ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
M VK G A ++M S+N ++G+P A+P ++ + ++ W G+IV+D +++ + ++
Sbjct: 248 MGVKAG-APTLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304
Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
LA +K+DA AGL++D + Y V++GKV +D+S++ + V RLG
Sbjct: 305 LAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGL 364
Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
F+ V+ K +++ +AA+ A E +VLLKND LPL + K +AVVGP A
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KKIAVVGPMAK 422
Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
++G++ G + Y A F G A + Y GC ++ S FA A + A
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKPQG--NDRSGFAGALDVA 480
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
+ +D I+ G L+ E+ R + LP Q +L+ ++ E K PVILV+ + G +
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVL--SNGRPLE 537
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MP 612
AIL PG G R++A ++ G+ NP G+L +T+ P ++ +P
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF---------PYSTGQIP 588
Query: 613 L---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+ R G+ G YK LYPFG+GLSYT+FKY +
Sbjct: 589 IYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV------------------ 629
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
T A+K ++ D +V N GS DG++ V + P +K
Sbjct: 630 ---TPSATK----------VKRGDKLSAEVTVTNTGSRDGAETVHWFISDPYCSITRPVK 676
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
++ F++ ++AG K +F + + V+ L AGE+ I V
Sbjct: 677 ELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 863
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 166/464 (35%), Positives = 254/464 (54%), Gaps = 35/464 (7%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
+ D S P S+R+++++ +MTL+EKV QL + + +PRL LP Y +W+E LHGV+ G
Sbjct: 48 IIGDLSQPISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAGE- 106
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-NLGRAGLTYWSPNINV 171
T FP I ++++ L K+I A+STEAR Y ++G+ GLTYW+P IN+
Sbjct: 107 ---------VTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIGK-GLTYWAPTINM 156
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
ARDPRWGR ET GEDP++ R V +V+GLQ H N LK + KH+ A
Sbjct: 157 ARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQG--DHPNY-------LKTVATVKHFVAN 207
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
+ +N DR+ +++ + + E + +E CVKE + S+M +YN NGIP L
Sbjct: 208 NQEN----DRFSSSSQIPTKQLYEYYFPAYEACVKEANVQSIMTAYNAFNGIPPSGSTWL 263
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
L +R EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y
Sbjct: 264 LEDVLRKEWGFDGFVVSDCGAIGVMNWQHR-IVNSLEEAAALGINSGCDLECGGTYRENL 322
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAA 409
AVQ+G V E ID++L + T+ +LG FD Y K+ + ++ LA EAA
Sbjct: 323 VAAVQRGLVSEYAIDRALTRVLTMRFKLGEFDPIELVPYNHYDKKLLAGEQFRRLAYEAA 382
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-- 467
+ I+LLKN+ N LP++ V+++A+VGP A+ +G Y+G P +S + G
Sbjct: 383 VKSIILLKNEDNFLPIDKKDVRSIAIVGPFADNN--YLGGYSGKPVHNISLLQGVKKMVG 440
Query: 468 --ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
++Y G V ++S AS+ + G DL+
Sbjct: 441 EEVEISYIEGT-SVVSPVDSSYLLASDGVNNGLTADYIDGHDLN 483
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 141/309 (45%), Gaps = 46/309 (14%)
Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
N I + AD ++ G D + E+ D ++LP Q L+ ++ +V P I +
Sbjct: 597 NQIDKVKKIVSRADLVLVALGNDGKLARENRDLPSIYLPMTQELLLKEIYKV--NPRIAL 654
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
I+ G + ++ +IL A YPG+EGG A+A ++FG NP G+LP+T Y + Q
Sbjct: 655 ILQTGN-PLTSQWAAEHVPSILQAWYPGQEGGAALAGILFGLENPSGKLPMTIYESE--Q 711
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
LP +D + GRTY++ + LY FG+GLSY+ F+Y L + V+
Sbjct: 712 QLP------NILDYDIWKGRTYQYLSSKPLYGFGHGLSYSNFEYADLQCNDVVHVD---- 761
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAEI 723
L+C + +N+ G +V+ VY S+ +
Sbjct: 762 ----------------------GTLQC------SIKVKNISDVVGEEVIQVYVSREKTPV 793
Query: 724 AATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+K++I F RV ++ +K + F + L++ +L +G++++FVG G
Sbjct: 794 YTFPLKKLIAFARVNLKPNESKTVTFTITP-RQLSVWQDGEWKML-SGKYSLFVGGGQKE 851
Query: 784 FPIHLNFNY 792
+N ++
Sbjct: 852 LSKGMNKDF 860
>gi|336412679|ref|ZP_08593032.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
3_8_47FAA]
gi|335942725|gb|EGN04567.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
3_8_47FAA]
Length = 735
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 221/776 (28%), Positives = 357/776 (46%), Gaps = 113/776 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ D+ P R+ DL+SRMTL+EKV QL + G E E S +G
Sbjct: 29 LYKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85
Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
+FD D I G T +P + S+N L ++
Sbjct: 86 IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G +A VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199
Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
D EN ++++C KHY Y R + ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
M VK G A+++M S+N ++G+P A+P ++ + ++ W G+IV+D +++ + ++
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304
Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
LA +K+DA AGL++D + Y V++GKV +D+S++ + V RLG
Sbjct: 305 LAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGL 364
Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
F+ V+ K +++ +AA+ A E +VLLKND LPL + K +AVVGP A
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KRIAVVGPMAK 422
Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
++G++ G + Y A F G A + Y GC ++ S FA A +
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGEAELRYAMGCKPQG--NDRSGFAGALDVV 480
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
+ +D I+ G L+ E+ R + LP Q +L+ ++ E K P+ILV+ + G +
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVL--SNGRPLE 537
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
AIL PG G R++A ++ G+ NP G+L IT P ++ +
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAIT---------FPYSTGQIP 588
Query: 615 PVDSLGYPGRTYK-FYNGPT---LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
+ GR ++ FY T Y FGYGLSYT+F+Y +++ + T KL
Sbjct: 589 IYYNRRKSGRWHQGFYKDITSDPFYSFGYGLSYTEFQYGVVTPSSTTVKRGEKLS----- 643
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
+V NVG DG++ V + P +K+
Sbjct: 644 --------------------------VEVTVTNVGKRDGAETVHWFISDPYCSITRPVKE 677
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
+ F++ F++ G + +F + + L VD L AGE+ I+V + V +
Sbjct: 678 LKHFEKQFIKVGETRTFRFDVDLERDLGFVDGNGKRFLEAGEYNIWVQDQKVKIEL 733
>gi|336404202|ref|ZP_08584900.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
gi|335943530|gb|EGN05369.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
Length = 735
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 222/768 (28%), Positives = 357/768 (46%), Gaps = 115/768 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ D+ P R+ DL+SRMTL+EK+ QL + G E E S +G
Sbjct: 29 LYKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85
Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
+FD D I G T +P + S+N L ++
Sbjct: 86 IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G +A VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199
Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
D EN ++++C KHY Y R + ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
M VK G A+++M S+N ++G+P A+P ++ + ++ W G+IV+D +++ + ++
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304
Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
LA +K+DA AGL++D + Y V++GKV +D+S++ + V RLG
Sbjct: 305 LAATKKDAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGL 364
Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
F+ V+ K +++ +AA+ A E +VLLKND LPL + K +AVVGP A
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KKIAVVGPMAK 422
Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
++G++ G + Y A F G A + Y GC ++ S FA A + A
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKPQG--NDRSGFAGALDVA 480
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
+ +D I+ G L+ E+ R + LP Q +L+ ++ E K PVILV+ + G +
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVL--SNGRPLE 537
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MP 612
AIL PG G R++A ++ G+ NP G+L +T+ P ++ +P
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF---------PYSTGQIP 588
Query: 613 L---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+ R G+ G YK LYPFG+GLSYT+FKY +
Sbjct: 589 IYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV------------------ 629
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
T A+K ++ D +V N G+ DG++ V + P +K
Sbjct: 630 ---TPSATK----------VKRGDKLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVK 676
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
++ F++ ++AG K +F + + V+ L AGE+ I V
Sbjct: 677 ELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|15837447|ref|NP_298135.1| family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
gi|9105751|gb|AAF83655.1|AE003924_1 family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
Length = 882
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 163/423 (38%), Positives = 235/423 (55%), Gaps = 40/423 (9%)
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
LV++MTL EK+ Q + A +PRLG+P Y+WWSE LHG++ G AT FP
Sbjct: 37 LVAKMTLQEKITQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLG---------RAGLTYWSPNINVARDPRWG 178
I AS+N L + +G STEARA +NL AGLT WSPNIN+ RDPRWG
Sbjct: 87 AIGLAASWNTDLLQHVGTVTSTEARAKFNLAGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R ET GEDP++ G+ AV+++RGLQ ++ P +++ KH+A V +
Sbjct: 147 RGMETYGEDPYLTGQLAVSFIRGLQG--------NIPDHPRTIATP-KHFA---VHSGPE 194
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
R+ FD V+ D+E T+ F + +G A SVMC+YN ++G P+CA LLN +R
Sbjct: 195 PGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRN 254
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
+W +G++V+DCD+I M H F D+ A A LK+G DL+CG Y + A+ +G
Sbjct: 255 DWGFNGFVVSDCDAIDDMTRFHFFRQDNAS-ASAAALKSGNDLNCGNTYRDLN-QAIARG 312
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLL 416
+ E +D++L L+ RLG Y ++G + I + + LA +AA + +VLL
Sbjct: 313 DIDEALLDQALIRLFAARQRLGTLQPREHDPYATIGIKHIDTPAHRALALQAAVQSLVLL 372
Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYK 473
KN NTLPL T+AV+GP A++ A+ NY G ++P+ G G A + Y
Sbjct: 373 KNSGNTLPLTPG--TTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLRTRFGAAKIHYA 430
Query: 474 TGC 476
G
Sbjct: 431 QGA 433
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 100/301 (33%), Positives = 139/301 (46%), Gaps = 55/301 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A A ADA + GL VE E L DR + LP Q L+ V K
Sbjct: 604 AERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK- 662
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
P+I+V+MS V + +A+ + N AIL A YPG+ GG AIA + G NPGGRLP+T+Y
Sbjct: 663 PLIVVLMSGSAVALNWAQHHAN--AILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYR 720
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
Q LP P S GRTY+++ G LYPFGYGLSYTQF Y
Sbjct: 721 S--TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFTYE---------- 761
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
P + L+ D +N G+ G +VV +Y +P
Sbjct: 762 ---------------------APQLSTATLKAGDTLTVTAHVRNTGTRAGDEVVQLYLEP 800
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
P A ++ ++GF+RV +R G ++ + F + + L+ V + AG + +FVG
Sbjct: 801 PHSPQAP-LRNLVGFKRVTLRPGESRLLTFTLD-TRQLSSVQQTGQRSVEAGHYHLFVGG 858
Query: 780 G 780
G
Sbjct: 859 G 859
>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
17393]
gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 161/408 (39%), Positives = 231/408 (56%), Gaps = 28/408 (6%)
Query: 58 SLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDD 117
S P S RV++L+S+MTL+EKV QL + +PRL LP Y +W+E LHGV+ G
Sbjct: 53 SQPISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAGE------ 106
Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
T FP I ++++ L KK+ A+STEAR Y GLTYWSP IN+ARDPRW
Sbjct: 107 ----VTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGKGLTYWSPTINMARDPRW 162
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
GR ET GEDP++ R V +V+GLQ H + LK + KH+ A + +N
Sbjct: 163 GRNEETYGEDPYLTSRLGVAFVKGLQG--DHPDY-------LKTVATIKHFVANNQEN-- 211
Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
DR+ +++ + + E + +E CVKE DA SVM +YN NG+ LL +R
Sbjct: 212 --DRFSSSSQIPTKQLYEYYFPAYEACVKEADAQSVMTAYNAFNGVAPSGSTWLLGDVLR 269
Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
EW G++V+DC +I VM H+ + +S E+A A + +G DL+CG Y AV+
Sbjct: 270 KEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGINSGCDLECGGTYREKLVAAVKM 328
Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAAREGIVL 415
G V E IDK+L + T +LG FD Y K+ + ++ +LA EAA + IVL
Sbjct: 329 GLVSEQAIDKALTRVLTARFKLGEFDPIELVPYNHYDKKLLAGEKFGKLAYEAAVKSIVL 388
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
LKND + LP++ K+++VA+VGP A+ +G Y+G P +S + G
Sbjct: 389 LKNDNDFLPVDKKKIRSVAIVGPFADNN--YLGGYSGKPVHNVSLLQG 434
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 153/331 (46%), Gaps = 55/331 (16%)
Query: 467 YANVTYKTGCDDVACKSN-NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
Y N T C V+ N + I E AD ++ G D + E+ D ++LP
Sbjct: 578 YINKTGAAACMLVSDFGNSDQIDKVKEFVSGADLVLVALGNDEKLARENRDLPSIYLPMT 637
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q L+ ++ +V P +I+ G + N+ AIL A YPG+EGG+A+A ++FG
Sbjct: 638 QELLLKEIYKV--NPRTALILHTGN-PLTSKWAAENVPAILQAWYPGQEGGKALAGILFG 694
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY---PGRTYKFYNGPTLYPFGYGLS 642
NP G+LP+T Y + + LP D L Y GRTY++ + LY FG+GLS
Sbjct: 695 SENPSGKLPMTIYESE--EQLP---------DILDYDIWKGRTYQYLSSKPLYGFGHGLS 743
Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
Y+ F+Y L + R D + ++ +
Sbjct: 744 YSNFEYTHLQSDDVV--------------------------------RPDGTLQCSIEIK 771
Query: 703 NVGSTDGSDVVIVY-SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
N+ G +VV VY S+ + +K+++ F RV ++ G +K + F A + L+I
Sbjct: 772 NISDVAGEEVVQVYISRENTPVYTFPLKKLVAFARVDLKPGESKTVTFTI-APRQLSIWQ 830
Query: 762 YAANTLLPAGEHTIFVGNG--GVSFPIHLNF 790
+LP G++++FVG+G G+S I+ NF
Sbjct: 831 EGIWKMLP-GKYSLFVGSGQEGLSKGINRNF 860
>gi|94970273|ref|YP_592321.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
gi|94552323|gb|ABF42247.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
Length = 881
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 167/446 (37%), Positives = 245/446 (54%), Gaps = 52/446 (11%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + SL R DLV RMT++EKV QL + + VPRL +P Y+WWSEALHGV+
Sbjct: 30 YLNPSLAPEKRAADLVHRMTVEEKVSQLTNDSRAVPRLNVPDYDWWSEALHGVAQ----- 84
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
PG T +P + A+F+ +++ + + E R + G GL +W
Sbjct: 85 ------PGVTEYPQPVALAATFDNDKVQRMARFIGIEGRIKHEEGMKDGHSDIFQGLDFW 138
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDPF+ R V YV+GLQ D L +S+
Sbjct: 139 APNINIFRDPRWGRGQETYGEDPFLTARMGVAYVKGLQG--------DDPKYYLAISTP- 189
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KHYA V + R+ D +V++ D +T+L F V E A SVMC+YN +NG P+
Sbjct: 190 KHYA---VHSGPETTRHFADVKVSKHDELDTYLPAFRATVTEAKAGSVMCAYNSINGQPA 246
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-- 343
C + LL +RG+W+ GY+V+DC++I + +HKF ++ +A A ++ G+D +C
Sbjct: 247 CVNEFLLQDQLRGKWNFQGYVVSDCEAIINIYRDHKF-TKTQAEASALAVQRGMDNECVD 305
Query: 344 -------GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--- 393
Y F +A +QG +KE++ID +L L+T M+LG FD P+ V K
Sbjct: 306 FGKQKDDHDYRPYF--DAYKQGILKESEIDTALVRLFTARMKLGMFD-PPEMVPYSKIDP 362
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
+++ S E+ ELA A E +VLLKND TLPL + +K +AV+GP A T ++GNY G
Sbjct: 363 KELESAEHRELARTLANESMVLLKND-GTLPLKKSGLK-IAVIGPLAEQTRYLLGNYNGT 420
Query: 454 PCRYMSPIAGFSGY---ANVTYKTGC 476
P +S + G A +T++ G
Sbjct: 421 PSHTVSVLEGLRAEFPDAQITFERGT 446
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 99/302 (32%), Positives = 147/302 (48%), Gaps = 55/302 (18%)
Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
AA AAK AD I + G+ +E E + DR L LP + QL+ ++ K
Sbjct: 602 AAVTAAKNADVVIAVLGITSDLEGEEMPVSEEGFNGGDRTSLDLPKPEQQLLESISAAGK 661
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
PV+LV+ + + + +A+ + N AIL YPGEEGG AIA + GK NP GRLP+T+Y
Sbjct: 662 -PVVLVLSNGSALSVNWAQQHAN--AILEGWYPGEEGGTAIAQTLSGKNNPAGRLPVTFY 718
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
G + LP P + GRTY+++ G LYPFGYGLSYT F Y L+ K
Sbjct: 719 TG--TEQLP-------PFEDYAMKGRTYRYFEGKPLYPFGYGLSYTTFSYRDLALPKA-- 767
Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
L D +V N G +G +V +Y
Sbjct: 768 -----------------------------PLNAGDPVTAQVTVTNTGKVEGDEVAQLYLS 798
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
P IA ++ + GF+R+ ++AG ++ IKF + L++V+ A + ++ GE+++ VG
Sbjct: 799 FP-NIAGAPLRALRGFRRIHLKAGESQTIKFELKD-RDLSMVNEAGDPIIAEGEYSVSVG 856
Query: 779 NG 780
G
Sbjct: 857 GG 858
>gi|225873995|ref|YP_002755454.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225792796|gb|ACO32886.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 896
Score = 277 bits (709), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 171/432 (39%), Positives = 232/432 (53%), Gaps = 42/432 (9%)
Query: 60 PYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVI 119
P RV +LVS+MTL E+ Q+ + A +PRLG+P Y WWSE LHG++ G
Sbjct: 45 PIQKRVHELVSQMTLQEEAAQMMNTAPAIPRLGVPAYNWWSEGLHGIARSG--------- 95
Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINV 171
AT FP I +A+F+ + ++G VSTEARA YN GLT W+PNIN+
Sbjct: 96 -YATVFPQAIGMSATFDPAAIHQMGTTVSTEARAKYNWAIRHDIHSIYFGLTLWAPNINI 154
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
RDPRWGR ET GEDPF+ G A YV GLQ N + LK + KH++ Y
Sbjct: 155 VRDPRWGRGQETYGEDPFLTGTMAAEYVSGLQGN---------NPKYLKTVATPKHFSVY 205
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
N R+ +A + DM++T+L F M + +G A S+MCSYN V G+PSCA+ KL
Sbjct: 206 ---NGPESMRHKINANPSAHDMQDTYLAAFRMAITKGHADSMMCSYNAVYGVPSCAN-KL 261
Query: 292 LNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
L VRG+W GYI +DC +I H + D+ A + L AG D DCG Y
Sbjct: 262 LADVVRGKWGFDGYITSDCGAISDFYRPGAHGYSPDAVHAAASAVL-AGTDTDCGTGYKV 320
Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
+VQQG + + ID++++ L+T RLG FD Y S+ + S + A E
Sbjct: 321 LP-QSVQQGLISKAAIDRAVERLFTARFRLGMFDPKADVPYNSIPYSVVDSAAHRAQALE 379
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG- 466
A + +VLLKN+ LPL +A +T+AVVGP+A ++ GNY IP P+ G
Sbjct: 380 DASKSMVLLKNEGGILPLRNA--RTIAVVGPNAANLNSIEGNYNAIPSHPSLPVDGIEAA 437
Query: 467 --YANVTYKTGC 476
A+V Y G
Sbjct: 438 FPQAHVVYAQGS 449
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 84/263 (31%), Positives = 133/263 (50%), Gaps = 45/263 (17%)
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
DR L LP Q L++ + K PV+LV+++ + I +A+ + ++ IL A YPGE G
Sbjct: 655 DRTRLSLPQTQQDLLHALVATGK-PVVLVLLNGSALSIDWAKQH--VQGILEAWYPGEAG 711
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
G AI + + G+ +PGG+LPIT+Y V+ LP P GRTY++Y G L+
Sbjct: 712 GEAIGETLSGQNDPGGKLPITFYTS--VKDLP-------PFTDYSMKGRTYRYYTGKPLF 762
Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
PFGYGLSYT F+Y+ H R + ++L+ +
Sbjct: 763 PFGYGLSYTTFEYS----------------HVR---------------LSTSNLKAGEPL 791
Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
+ + +N G G V VY PP + +K++ GF RV + G+++++ F N +
Sbjct: 792 TVEAEVKNTGHVAGDAVTEVYVTPP-QNGVNPLKELKGFDRVHLAPGQSRQLTFTLNP-R 849
Query: 756 SLNIVDYAANTLLPAGEHTIFVG 778
L++VD A + G ++IFVG
Sbjct: 850 DLSLVDEAGKRSVQPGVYSIFVG 872
>gi|110737298|dbj|BAF00595.1| xylosidase [Arabidopsis thaliana]
Length = 303
Score = 277 bits (709), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 135/242 (55%), Positives = 171/242 (70%), Gaps = 13/242 (5%)
Query: 7 SLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVK 66
+LL + + +LVF V ++ S P+F CDP GL + FC +++P +RV+
Sbjct: 7 ALLIGNKVVVILVFLLCLVHSSESLRPLFACDPAN----GLT-RTLRFCRANVPIHVRVQ 61
Query: 67 DLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFP 126
DL+ R+TL EK++ L + A VPRLG+ YEWWSEALHG+S+VGPG F PGATSFP
Sbjct: 62 DLLGRLTLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFP 121
Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGE 186
VI T ASFN+SLW++IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR ETPGE
Sbjct: 122 QVITTAASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGE 181
Query: 187 DPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA 246
DP V +YA +YVRGLQ T +R LKV++CCKHY AYD+DNW GVDR+HF+A
Sbjct: 182 DPIVAAKYAASYVRGLQ-------GTAAGNR-LKVAACCKHYTAYDLDNWNGVDRFHFNA 233
Query: 247 RV 248
+V
Sbjct: 234 KV 235
>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
Length = 885
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 167/423 (39%), Positives = 230/423 (54%), Gaps = 40/423 (9%)
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
LV++MT EK+ Q + A +PRLG+P YEWWSE LHG++ G AT FP
Sbjct: 40 LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNGE----------ATVFPQ 89
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARDPRWG 178
I A++N L +G STEARA +NL AGLT WSPNIN+ RDPRWG
Sbjct: 90 AIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWG 149
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R ET GEDP++ GR AV ++ GLQ D + P +++ KH A V +
Sbjct: 150 RGMETYGEDPYLTGRLAVGFIHGLQ--------GDDPAHPRTIATP-KHLA---VHSGPE 197
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
R+ FD V+ D E T+ F + +G A SVMC+YN ++G P+CA L++ VRG
Sbjct: 198 PGRHGFDVDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSLHGTPACAADWLIDGRVRG 257
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
+W G++V+DCD+I M H + D+ + A LKAG DL+CG Y G A +G
Sbjct: 258 DWGFKGFVVSDCDAIDDMTQFHYYRPDNAGSSAA-ALKAGHDLNCGTAYREL-GIAFDRG 315
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAAEAAREGIVLL 416
+ E +D+SL L+ RLG + Y LG +DI S + LA +AA++ +VLL
Sbjct: 316 EADEALLDRSLVRLFAARYRLGELQPRRNDPYARLGARDIDSAAHRALALQAAQQSLVLL 375
Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYK 473
KN TLPL +AV+GP+A+A A+ NY G + ++P+ G G A V Y
Sbjct: 376 KNANATLPLRPG--LRLAVLGPNADALAALEANYQGTSVQPVTPLQGLRTRFGAAQVAYA 433
Query: 474 TGC 476
G
Sbjct: 434 QGA 436
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 92/286 (32%), Positives = 141/286 (49%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR DL LP Q L+ + A+ + P+++V+MS V +
Sbjct: 622 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 680
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+AE + + AI+ A YPG+ GG AIA + G NPGGRLP+T+Y ++ L
Sbjct: 681 WAEQHAD--AIIAAWYPGQSGGTAIAQALAGDINPGGRLPVTFYR---------STKDLP 729
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S GRTY+++ G L+PFGYGLSYTQF Y+
Sbjct: 730 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTQFAYD------------------------- 764
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
P + L+ + +N G+ G +VV VY + P + A + ++ ++GF
Sbjct: 765 ------APQLSTTTLQAGQPLQVSTTVRNTGARAGDEVVQVYLQYP-QRAQSPLRSLVGF 817
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV ++ G + + F +A + L+ VD + + AG++ +FVG G
Sbjct: 818 QRVHLQPGEARTLSFALDA-RQLSDVDRSGQRAVEAGDYRLFVGGG 862
>gi|393784338|ref|ZP_10372503.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
CL02T12C01]
gi|392666114|gb|EIY59631.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
CL02T12C01]
Length = 857
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 228/795 (28%), Positives = 355/795 (44%), Gaps = 146/795 (18%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQ-------------------LGDFAHGVP----- 89
+ SSLP S RV DL+ RMTL+EK+ Q LG F GV
Sbjct: 28 YRQSSLPISERVDDLLGRMTLEEKIAQIRHIHSWNVFNGQDLDMEKLGKFTGGVSWGFVE 87
Query: 90 ------------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
RLG+P + +E+LHG V G+T +
Sbjct: 88 GFPLTGVNCKKNMQLIQKFMVENTRLGIPVFTV-AESLHG-----------SVHEGSTIY 135
Query: 126 PTVILTTASFNESLWKKIGQAVSTE--ARAMYNLGRAGLTYWSPNINVARDPRWGRITET 183
P I ++F L + ++ + A+ M+ + +P I+V RD RWGR+ E+
Sbjct: 136 PQNIAMGSTFRPELAYRKAAMITKDLHAQGMHQV-------LAPCIDVVRDLRWGRVEES 188
Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYH 243
GEDP + G + + V+G D N +S KHY + + G++
Sbjct: 189 FGEDPVLCGLFGIAEVKGYMD-----NG---------ISPMLKHYGPHG-NPLSGLNLAS 233
Query: 244 FDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLH 303
+ + +D+ E +L+PFEM ++ +VM +YN N +P+ A LL + +RG++
Sbjct: 234 VECGL--RDLHEVYLKPFEMVIRNTPVLAVMSTYNSWNHVPNSASHYLLTEVLRGQFGFK 291
Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
GY+ +D +I+++ H+ +A + E+A Q AGLD++ +Q+GK+ E
Sbjct: 292 GYVYSDWGAIEMLKTLHR-VAHNSEEAAMQAFTAGLDVEASSNCYPLLAGLIQKGKLDEE 350
Query: 364 DIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTL 423
+++S++ + ++G F+ P ++ E+I L+ E A E +VLLKN+ L
Sbjct: 351 VLNESVRRVLYAKFKMGLFE-DPYGEQYSHSEMHGAESIRLSKEIADESVVLLKNENGLL 409
Query: 424 PLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--MSPIAG----FSGYANVTYKTGCD 477
PLN+ K+K+VAV+GP NA G+Y ++P+ G G A V Y GCD
Sbjct: 410 PLNADKLKSVAVIGP--NADQVQFGDYTWSRNNKDGVTPLEGIRRLLGGKATVRYAKGCD 467
Query: 478 DVACKSNNSIFAASEAAKTADATIILAG---------LDLSVEAESLDREDLWLPGYQTQ 528
V+ + I A EAA+ ++ I+ G S E D DL L G Q Q
Sbjct: 468 LVSLNAG-GIKEAVEAARKSEVAILFCGSASAALARDYKSSTCGEGFDLNDLNLTGVQGQ 526
Query: 529 LINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFN 588
LI +V E PV+LV+++ G A + +I AIL Y GE+ G +IAD++FG +
Sbjct: 527 LIKEVYETGT-PVVLVLVT--GKPFAISWEKKHIPAILTQWYAGEQAGNSIADILFGSIS 583
Query: 589 PGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
P GRL ++ Y LP + S PGR Y F + L+ FG+GL+Y
Sbjct: 584 PSGRLTFSYPQTTGHLPVYYNYLPSDKGFYKNPGSYESPGRDYVFSSPDALWAFGHGLTY 643
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T F Y NL + LN D VD +N
Sbjct: 644 TSFVYK----------NLRTDKEHYGLN---------------------DTIYIDVDIKN 672
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G +G +VV +Y T +KQ+ F++V V AG+ + +K A L IV+
Sbjct: 673 TGKREGKEVVQLYVNDKVSTVVTPVKQLRDFKKVDVEAGKTETVKLKV-AVNDLYIVNAG 731
Query: 764 ANTLLPAGEHTIFVG 778
++ GE + VG
Sbjct: 732 NKRVVEPGEFELQVG 746
>gi|380694149|ref|ZP_09859008.1| glycoside hydrolase 3 [Bacteroides faecis MAJ27]
Length = 946
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 231/821 (28%), Positives = 372/821 (45%), Gaps = 148/821 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D + R++DL+S+MTL+EK Q+ +G R+ LP EW W
Sbjct: 52 VYEDPTATIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKNQLWKDGIGA 110
Query: 101 --EALHGVSNVG-PGTHFDDVIPG------------------------------------ 121
E L+G G P + +++ P
Sbjct: 111 IDEHLNGFQQWGLPPSDNENIWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 170
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L ++G EAR + G T ++P ++V RD RWG
Sbjct: 171 YKATNFPTQLGLGHTWNRRLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWG 224
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + VRG+Q H + ++++ KH+ AY +
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGMQ----HNH---------QIAATGKHFIAYSNNKGAR 271
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E T + PF+ ++E VM SYN +G P + L +RG
Sbjct: 272 EGMARVDPQMSPREVEMTHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRG 331
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E GY+V+D D+++ + H D KE AV Q+++AGL++ C Y
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 390
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSL--GKQDICSDENIELAAEAAREG 412
V++G + E I+ ++ + V +G FD P + L +++ N E+A +A+RE
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLVGLFD-HPYQIDLKGADEEVEKAANEEIALQASRES 449
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYA 468
IVLLKND+N LPL+++ ++ +AV GP+A+ + +Y + S + G G A
Sbjct: 450 IVLLKNDKNILPLDASGIQKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKGKA 509
Query: 469 NVTYKTGCDDVAC--------------KSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
V Y GCD V + I A + K AD +++ G E+
Sbjct: 510 EVLYTKGCDLVDANWPESELIDYPLTDEEQKEIEKAVDQTKQADVAVVVLGGGQRTCGEN 569
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q L+ VA K PV+LV+++ + I +A + + AI+ A YPG +
Sbjct: 570 KSRSSLDLPGRQLDLLKAVAATGK-PVVLVLINGRPLSINWA--DKFVPAIVEAWYPGSK 626
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKFY-- 629
GG+A+ADV+FG++NPGG+L +T+ V +P + P +P +D PG
Sbjct: 627 GGKAVADVLFGEYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGMEGNMSRA 683
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQV-NLNKLQHCRNLNYTSDASKTRCPGVLVND 688
NG LYPFGYGLSYT F+Y+ L + I N C+
Sbjct: 684 NG-ALYPFGYGLSYTTFEYSDLKISPAIITPNQQTFVTCK-------------------- 722
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
N G G +VV +Y + TY K + GF+RV ++ G K +
Sbjct: 723 ------------VTNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLQPGETKEVT 770
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
F + K+L +++ + ++ G+ T+ V G S I LN
Sbjct: 771 FPIDR-KALELLNADMHWVVEPGDFTLMV--GASSTDIRLN 808
>gi|217968103|ref|YP_002353609.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
gi|217337202|gb|ACK42995.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
DSM 6724]
Length = 756
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 220/697 (31%), Positives = 339/697 (48%), Gaps = 110/697 (15%)
Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
EALHG + G+T FP I +++N L ++ A+ E R+ R
Sbjct: 138 EALHGC-----------MAKGSTIFPQAIGMASTWNPELIYQVATAIGKETRS-----RG 181
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
SP IN+ARDPR GR ET GEDP++ R AV Y++G+Q+ +G
Sbjct: 182 IHQVLSPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQE-QG------------- 227
Query: 221 VSSCCKHYAAYDVDNWKGVDRY--HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
V + KH+AA V + G D Y HF R+ + E + F+ +KE A S+M +YN
Sbjct: 228 VIATPKHFAANFVGD-GGRDSYPIHFSERL----LREVYFPAFKASIKEAGALSLMAAYN 282
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
++GIP ++ LL +R EW GY+V+D S+ ++ HK +A+SK +A L+AG
Sbjct: 283 SLDGIPCSSNKWLLTDVLRKEWGFKGYVVSDYFSVLHLMTKHK-VAESKAEAARLALEAG 341
Query: 339 LDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SPQYVS 390
LD+ DC + N V+ GK+ E I+++++ + V G FD P Y
Sbjct: 342 LDMELPDSDCFEEMINL----VKGGKLSEETINEAVRRILGVKFWAGLFDNPFVDPDYAE 397
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
+ + C+ E+ ELA ARE IVLLKN + LPL S + ++AV+GP NA V +G Y
Sbjct: 398 --RVNDCA-EHRELALRVARESIVLLKN-EGILPL-SKDIGSIAVIGP--NAAVPRLGGY 450
Query: 451 AGIPCRYMSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
+G + ++P+ G A + + GC + S + A + A+ +D I+ G
Sbjct: 451 SGYGVKIVTPLEGIKNKMENKAKIYFAEGCG-LNDTSKSGFDEAIKIAQKSDVAILFVGN 509
Query: 507 DL-SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
+ E E DR +L LPG Q +LI ++ PVI+V+++ G I ++A+
Sbjct: 510 SVPETEGEQRDRHNLNLPGVQEELIKEICN-TNTPVIVVLIN--GSAITMMNWIDKVQAV 566
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL--TSMPLRPVDSLGYPG 623
+ A YPGEEGG AIADV+FG +NPGG+LPIT+ Y LPL P VD
Sbjct: 567 IEAWYPGEEGGNAIADVLFGDYNPGGKLPITF--PKYSSQLPLYYNHKPSGRVDD----- 619
Query: 624 RTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
Y P L+PFGYGLSYT+F+Y+ NL T +
Sbjct: 620 --YVDLRSPQYLFPFGYGLSYTEFRYS-------------------NLRITPE------- 651
Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
++ D + +N+G G +VV +Y +K++ F+R+ + G
Sbjct: 652 -----EIPMDGEITITFEVENIGKYKGDEVVQLYLHDEFASVVRPVKELKRFKRITLAVG 706
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
K + F + + L ++ ++ G +F+G+
Sbjct: 707 EKKTVSFKLDR-RDLEFLNIDMEPIVEPGRFEVFIGS 742
>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 782
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 229/820 (27%), Positives = 374/820 (45%), Gaps = 154/820 (18%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEK----VQQLG--DFAH--------------- 86
++ L+ DSS P RV+ L+ MTL+EK VQ G + H
Sbjct: 14 EVEMLLYKDSSKPIPERVEHLLGLMTLEEKAGQLVQPFGWQTYEHKDGEIKLTEAFKAQV 73
Query: 87 ---GVPRL-GLPQYEWWS--------------EALHGVSN-------------VGPGTHF 115
GV L G+ + + W+ EA++ + +G
Sbjct: 74 KNGGVGSLYGVLRADPWTGVTLETGLSPREGTEAVNAIQRYAIENSRLGIPILIGEECSH 133
Query: 116 DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDP 175
+ GAT FP + +++N L++++ +AV+ E RA + G +SP ++V RDP
Sbjct: 134 GHMAIGATVFPVPLSLGSTWNVELYREMCRAVARETRA-----QGGAVTYSPVLDVVRDP 188
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKVSSCCKHYAAY-D 232
RWGR E GED +++ AV V GLQ ++G ++ V++ KH+ Y
Sbjct: 189 RWGRTEECFGEDAYLISEMAVASVEGLQGESLDGEDS----------VAATLKHFVGYGS 238
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
+ + H R ++ E L PF V+ G A+S+M +YN ++G+P + +LL
Sbjct: 239 SEGGRNAGPVHMGRR----ELLEVDLLPFRKAVEAG-AASIMPAYNEIDGVPCTTNEELL 293
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFT 351
+ +RGEW G ++ DC +I ++ H D + DA Q ++AG+D++ G +
Sbjct: 294 DGVLRGEWGFDGMVITDCGAIDMLASGHDVAEDGR-DAAIQAIRAGIDMEMSGVMFGKHL 352
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAARE 411
AV+ G+++E +D++++ + T+ RLG F+ ++ I S E++ELA + A E
Sbjct: 353 VEAVRSGQLEEEVLDRAVRRVLTLKFRLGLFERPYADPERAERVIGSAEHVELARQLASE 412
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR--YMSPIAGFSGY-- 467
G+VLLKN LPL SA T+AV+GP+A+A +G+Y R + + G
Sbjct: 413 GVVLLKNKDGVLPL-SADAGTIAVIGPNADAGYNQLGDYTSPQPRSKVTTVLGGIRSKLA 471
Query: 468 ---ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG-----------LDLSVEA- 512
V Y GC + S A A+ AD +++ G +DL A
Sbjct: 472 ETPERVLYAPGC-RINGNSREGFDVALSCAEKADTVVMVVGGSSARDFGEGTIDLRTGAS 530
Query: 513 -------------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
E +DR +L L G Q +LI ++ ++ K P+++V ++ G IA +
Sbjct: 531 KVTDNAESDMDCGEGIDRMNLSLSGVQLELIQEIHKLGK-PLVVVYIN--GRPIAEPWID 587
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
+ AIL A YPG+EGG AIAD++FG NP GRL I+ +V +P+ R
Sbjct: 588 EHADAILEAWYPGQEGGHAIADILFGDVNPSGRLTISIPK--HVGQVPVYYHGKRS---- 641
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
G+ Y + YPFGYGLSYT+F YN NL SD
Sbjct: 642 --RGKRYLEGDSQPRYPFGYGLSYTEFTYN-------------------NLKLESDT--- 677
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+ D + V+ NVG G++V+ +Y A K++ GF+++F+
Sbjct: 678 ---------INKDGSTKVTVEVTNVGERAGAEVIQLYITDVASKVTRPAKELKGFRKIFL 728
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ G + ++F + L + ++ GE + VG
Sbjct: 729 QPGETQTVEFTVGP-EQLQYIGQNYKPVVEPGEFRVHVGK 767
>gi|332982620|ref|YP_004464061.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332700298|gb|AEE97239.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
50-1 BON]
Length = 753
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 219/678 (32%), Positives = 337/678 (49%), Gaps = 84/678 (12%)
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
GAT FP I ++++ + + + + +A GL SP ++VARDPRWGR+
Sbjct: 108 GATVFPQAIGLASTWDAEAIEAMAGVIRQQMKAAG--AHQGL---SPVLDVARDPRWGRV 162
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
ET GEDP++V AV+YVRGLQ DL + + KH+A + ++
Sbjct: 163 EETFGEDPYLVASMAVSYVRGLQ-------GQDLTK---GIFATLKHFAGH---SFSEGG 209
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
R V E+++ + FL PFE V+E +A SVM +Y+ ++G+P A +LL +RG +
Sbjct: 210 RNCAPVHVGERELWDIFLFPFEAAVREANAKSVMNAYHDIDGVPCAASRELLTDILRGHF 269
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY--YTNFTGNAVQQG 358
G +V+D D+I + H F A +K++A Q L+AG+D++ + Y +AV++G
Sbjct: 270 GFDGIVVSDYDAIDRLRKAH-FTAGNKKEAAVQALEAGIDIELPKMDCYGQPLMDAVKEG 328
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKN 418
+ E I++S++ + T LG FDG V + E E++ + AR+ IVLLKN
Sbjct: 329 MISEATINESVERVLTAKFELGLFDGVYVDVDSVPGLFETPEQREMSRDIARKSIVLLKN 388
Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY-----MSPIAGFSGYAN---- 469
D N LPL S +K++AV+GP+A+ M+G+YA + R + + G N
Sbjct: 389 D-NVLPL-SKDIKSIAVIGPNADNARNMLGDYAFMAHRSYDKTSVHIVTVLEGIKNKVLD 446
Query: 470 ---VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV-----EAESLDREDLW 521
+TY GCD + S + A AA+ ADA I++ G + + E+ DR D+
Sbjct: 447 SCRITYAKGCD-IIDPSTDGFVEAVNAARAADAAIVVVGDNSGIFGKGTSGENDDRTDIT 505
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
LPG Q QL+ + + K PVI+V+++ G A E N A++ A YPGEEGG A+AD
Sbjct: 506 LPGVQMQLVKAIKDTGK-PVIVVLIN--GRAFAAKELADNASALMEAWYPGEEGGNAVAD 562
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
V+FG +NP GRLPI+ V +P+ + L+P + Y K + FGYG+
Sbjct: 563 VLFGDYNPAGRLPISLPC--EVGQIPI-NYNLKPASYINYLSTETK-----PAFAFGYGM 614
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
SYT F Y+ LS T + + K+ FKV
Sbjct: 615 SYTTFGYSDLSITPAVAPSAGKVD-----------------------------ISFKV-- 643
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
N G G +VV +Y + +K++ GF+RV ++ G K I F A L D
Sbjct: 644 TNAGQLAGDEVVQLYIRDEVSSIVRPVKELKGFKRVNLQPGETKEITFTLYA-DQLAFHD 702
Query: 762 YAANTLLPAGEHTIFVGN 779
++ G I VG+
Sbjct: 703 KDMRLVVEPGTFKIMVGS 720
>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 814
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 224/724 (30%), Positives = 335/724 (46%), Gaps = 109/724 (15%)
Query: 86 HGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQ 145
HG RLG+P + E HG +G T FPT I +++N L +++G+
Sbjct: 147 HG--RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGR 192
Query: 146 AVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
A++TEA A + + P +++ARDPRW R+ ET GED ++ G V+G Q
Sbjct: 193 AIATEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQG- 246
Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
+ KV + KH+AAY W A V ++MEE PF V
Sbjct: 247 -------EFPRTKGKVIATLKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFREAV 296
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
G A SVM SYN ++GIP A+ LL ++ W G++V+D +I + ++ +AD
Sbjct: 297 AAG-ALSVMSSYNEIDGIPCTANSNLLTGLLKKRWQFKGFVVSDLYAIGGLREHG--VAD 353
Query: 326 SKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
+ +A + + AG+D D G Y NAV++G V+E I+K++ + + +G FD
Sbjct: 354 TDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLFDH 413
Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
+Q + S E++ELA E AR+ I+LLKN LPLN K+KT+AV+GP+A+
Sbjct: 414 PFVDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KMKTIAVIGPNADNIY 472
Query: 445 AMIGNYAGIPCRYMSPIAGFSGY-------ANVTYKTGCDDVACKSNNSIFAASEAAKTA 497
M+G+Y P S + G ++ Y GC V S + A EAA+ +
Sbjct: 473 NMLGDYTA-PQSESSVVTVLDGIRQKVSNDTHIIYAKGCA-VRDSSKSGFQEAIEAARQS 530
Query: 498 DATIILAG----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVA 534
D +++ G D S + E DR L L G Q +LI +V
Sbjct: 531 DVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIREVG 590
Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
++ K P++LV++ G + + AI+ A YPG +GG A+ADV+FG +NP GRL
Sbjct: 591 KLNK-PIVLVLIK--GRPLLLEGIEAEVDAIVDAWYPGMQGGNAVADVLFGDYNPAGRLT 647
Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
I+ V LP+ R + Y G YPFGYGLSYT F Y+
Sbjct: 648 ISVPRS--VGQLPVYYNTKRKGNR-----SKYIEEEGTPRYPFGYGLSYTSFNYS----- 695
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
+L ++ C LVN V +N GS DG +VV
Sbjct: 696 --------------DLKAEVVEAEDSC---LVN---------ISVKVRNEGSRDGDEVVQ 729
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
+Y + T KQ+ GFQR+ ++ G K I F + KSL + + G T
Sbjct: 730 LYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-KSLALYMQNEEWAVEPGRFT 788
Query: 775 IFVG 778
+ +G
Sbjct: 789 LMLG 792
>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
43184]
gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
CL09T00C40]
Length = 868
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 161/452 (35%), Positives = 235/452 (51%), Gaps = 44/452 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ + F + LP R+ DL+ R+T +EK+ Q+ + + RLG+P+Y+WW+EALHGV+
Sbjct: 22 RQEDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
G AT FP I A+F++ + VS EARA Y+ +
Sbjct: 82 RAGK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRY 131
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+W+PNIN+ RDPRWGR ET GEDP++ R V V+GLQ + +
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDD---------PKYF 182
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K +C KHYA + W +R+ FD VT +D+ +T+L FE VK+G+ VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNR 239
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ------VMVDNHKFLADSKEDAVAQ 333
G P C+ KLL +R W I++DC +I H+ D+ E A A
Sbjct: 240 YQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERTPRHETHPDA-ESASAD 298
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+ G DL+CG Y A+++GK+ E D+D SL+ L LG FD + Y +
Sbjct: 299 AVLNGTDLECGNSYKALI-KALKEGKISENDLDVSLRRLLKGRFELGMFDPDERVPYAQI 357
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S E++ A E A + +VLLKN NTLPL S ++ +AVVGP+A + + NY
Sbjct: 358 PYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAADSTMLWANYN 416
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
G P ++ + G V Y+ GC+ A
Sbjct: 417 GFPTHTVTILEGIRNKVPDTEVIYELGCNHAA 448
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 92/320 (28%), Positives = 140/320 (43%), Gaps = 58/320 (18%)
Query: 473 KTGCDDVACK--SNNSIFAASEAAKTADATIIL--AGLDLSVEAESL----------DRE 518
+TG D+ + + + A+ AAK DA +I+ G+ +E E + DR
Sbjct: 577 RTGSADLNFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRT 636
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
++ +P Q +++ + K PV+ V+ + G +A + NI AIL A Y G+E G A
Sbjct: 637 NIEIPKVQQEMVKALKATGK-PVVYVLCT--GSALALNWEDANIDAILNAWYGGQEAGTA 693
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
+AD++FG +NP GRLP+T+Y + LP + GRTY++ LYPFG
Sbjct: 694 VADILFGDYNPSGRLPVTFYKS--IDQLP-------DFEDYSMKGRTYRYMTETPLYPFG 744
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
YGLSYT F Y RN +S G + D F
Sbjct: 745 YGLSYTNFAY-------------------RNAKLSS--------GKITKDQSVTLTF--- 774
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
D N G DG +V +Y K P + IK + F RV V+AG ++ + +
Sbjct: 775 -DIANTGKMDGDEVAQIYIKNPNDPEGP-IKALKAFLRVHVKAGDSQEVNIELTPEAFHS 832
Query: 759 IVDYAANTLLPAGEHTIFVG 778
D + G++ I G
Sbjct: 833 FNDNTQTMEVRPGKYQILYG 852
>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
17393]
gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 814
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 224/724 (30%), Positives = 334/724 (46%), Gaps = 109/724 (15%)
Query: 86 HGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQ 145
HG RLG+P + E HG +G T FPT I +++N L +++G+
Sbjct: 147 HG--RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGR 192
Query: 146 AVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
A++TEA A + + P +++ARDPRW R+ ET GED ++ G V+G Q
Sbjct: 193 AIATEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQG- 246
Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
+ KV + KH+AAY W A V ++MEE PF V
Sbjct: 247 -------EFPRTKGKVIATLKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFREAV 296
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
G A SVM SYN ++GIP A+ LL ++ W G++V+D +I + ++ +AD
Sbjct: 297 AAG-ALSVMSSYNEIDGIPCTANSNLLTGLLKERWQFKGFVVSDLYAIGGLREHG--VAD 353
Query: 326 SKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
+ +A + + AG+D D G Y NAV++G V+E I+K++ + + +G FD
Sbjct: 354 TDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLFDH 413
Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
+Q + S E++ELA E AR+ I+LLKN LPLN K KT+AV+GP+A+
Sbjct: 414 PFVDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KTKTIAVIGPNADNIY 472
Query: 445 AMIGNYAGIPCRYMSPIAGFSGY-------ANVTYKTGCDDVACKSNNSIFAASEAAKTA 497
M+G+Y P S + G ++ Y GC V S + A EAA+ +
Sbjct: 473 NMLGDYTA-PQSESSVVTVLDGIRQKVSNDTHIIYAKGCA-VRDSSKSGFQEAIEAARQS 530
Query: 498 DATIILAG----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVA 534
D +++ G D S + E DR L L G Q +LI +V
Sbjct: 531 DVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIREVG 590
Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
++ K P++LV++ G + + AI+ A YPG +GG A+ADV+FG +NP GRL
Sbjct: 591 KLNK-PIVLVLIK--GRPLLLEGIEAEVDAIVDAWYPGMQGGNAVADVLFGDYNPAGRLT 647
Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
I+ V LP+ R + Y G YPFGYGLSYT F Y+
Sbjct: 648 ISVPRS--VGQLPVYYNTKRKGNR-----SKYIEEEGTPRYPFGYGLSYTSFNYS----- 695
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
+L ++ C LVN V +N GS DG +VV
Sbjct: 696 --------------DLKAEVVEAEDSC---LVN---------ISVKVRNEGSRDGDEVVQ 729
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
+Y + T KQ+ GFQR+ ++ G K I F + KSL + + G T
Sbjct: 730 LYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-KSLALYMQNEEWAVEPGRFT 788
Query: 775 IFVG 778
+ +G
Sbjct: 789 LMLG 792
>gi|322437617|ref|YP_004219707.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165510|gb|ADW71213.1| glycoside hydrolase family 3 domain protein [Granulicella
tundricola MP5ACTX9]
Length = 892
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 172/455 (37%), Positives = 245/455 (53%), Gaps = 45/455 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D +L RV DLVSRMTL+EKV Q + A + RL +P+Y++WSE LHG++ G
Sbjct: 34 YMDPALTTQQRVDDLVSRMTLEEKVSQTINSAPAISRLNVPEYDYWSEGLHGIARSG--- 90
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYW 165
AT FP I A+++ L ++IG +S EARA +N GLT W
Sbjct: 91 -------YATMFPQAIGMAATWDAPLLQQIGDVISIEARAKFNEAIRHNIHSIYYGLTIW 143
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDPF+ GR V +V+G+Q D N + +
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTGRLGVAFVKGIQ-------GPDPNY--FRAIATP 194
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ + T D+ +T+L F + E A S+MC+YN V G P+
Sbjct: 195 KHFA---VHSGPESTRHSANIEPTPHDLHDTYLPAFRATITEAHADSIMCAYNAVEGSPA 251
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQ----VMVDNHKFLADSKEDAVAQTLKAGLDL 341
CA LL T+R +W G++ +DC +I +H D KE A A +KAG D
Sbjct: 252 CASKLLLQDTLRRDWGFKGFVTSDCGAIDDFYATDYPSHHTSPD-KEAAAAAGIKAGTDS 310
Query: 342 DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
+CGQ Y G+AV++G V E +ID +LK+L+T +LG FD + + + ++ ++ S
Sbjct: 311 NCGQTYLTL-GSAVKKGLVTEAEIDTALKHLFTARFQLGLFDPAAKVAFNAIPFSEVNSP 369
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
+ LA +AA E IVLLKND +TLP + V+T+AV+GP A + GNY IP +
Sbjct: 370 AHQALALKAAEESIVLLKNDAHTLPFKPS-VRTIAVIGPSAATLNNLEGNYNAIPLHPVL 428
Query: 460 PIAGFSGY---ANVTYKTG---CDDVACKSNNSIF 488
P+ G + V Y G D VA ++F
Sbjct: 429 PLDGILTQFKSSKVLYAQGSSFADGVAIAVPRTVF 463
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 124/278 (44%), Gaps = 48/278 (17%)
Query: 503 LAGLDLSVEAESL---DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
L G ++ + E DR D+ LP Q Q++ VA K P+++V+++ + + +A N
Sbjct: 636 LEGEEMPIHIEGFAGGDRTDIKLPAAQQQMLEAVAATGK-PLVVVLLNGSALAVNWA--N 692
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
+ AIL A YPG+ GG AIA+ + GK NP GRLP+T+Y+ + +P D
Sbjct: 693 DHAAAILEAWYPGQAGGTAIAETLAGKNNPAGRLPVTFYSS--IDQIPA-------FDDY 743
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
RTY++ L+ FGYGLSYT F Y+ + +
Sbjct: 744 SMANRTYRYSKAKPLFEFGYGLSYTTFTYSNIKLS------------------------- 778
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
L D + D +N G G +V +Y PP A + + + F RV +
Sbjct: 779 ------TQTLHAGDPLTVEADVRNTGRVAGDEVAELYLTPP-HTAVSPQRALSAFTRVHL 831
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
G + + F + ++L+ VD + G +T+ V
Sbjct: 832 APGELRHVTFTLDP-RTLSQVDEKGARAVTPGNYTLSV 868
>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
CL03T12C18]
gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
CL02T12C04]
Length = 850
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 162/430 (37%), Positives = 246/430 (57%), Gaps = 41/430 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + + P RV DL+SR+T++EK+ L + G+PRLG+ +Y +EALHGV V PG
Sbjct: 26 LYKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 83
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L +K+ +S EARA +N G L
Sbjct: 84 RF--------TVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQFSDVL 135
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G +V+GLQ + R LK+
Sbjct: 136 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGE---------DPRYLKIV 186
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+ A + ++ +R+ + +++E+ + E + FEMCVK+G A+S+M +YN +N
Sbjct: 187 STPKHFVANNEEH----NRFICNPQISEKQLREYYFPAFEMCVKKGKAASIMTAYNALND 242
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK++ +KE A ++KAGLDL+
Sbjct: 243 VPCTLNAWLLQKVLRQDWGFRGYVVSDCGGPSLLVNAHKYVK-TKETAATLSIKAGLDLE 301
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y + NA +Q V + DID + ++ M+LG FD + Y + I S
Sbjct: 302 CGDDVYDEYLLNAYKQYMVSDADIDSAACHVLAARMKLGMFDSKERNPYARISPSVIGSK 361
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
++ ++A +AARE IVLLKN +N LPLN K+K++AVVG NA G+Y+G P +
Sbjct: 362 DHQQVALDAARECIVLLKNQKNMLPLNVDKLKSIAVVG--INAGTCEFGDYSGAPV--IE 417
Query: 460 PIAGFSGYAN 469
P++ G N
Sbjct: 418 PVSVLQGIKN 427
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 89/290 (30%), Positives = 134/290 (46%), Gaps = 48/290 (16%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 594 AGKAVSECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 651
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + +I AI+ A YPGE+GG A+ADV+FG +NP GRLP+T+Y L
Sbjct: 652 S-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS-------LD 703
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTYK++ G LYPFGYGLSY+ FKY+ L +
Sbjct: 704 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYSDLKVKDST----------- 750
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
D +N G G +V VY + P I
Sbjct: 751 ------------------------DKVTVSFRLKNTGRRKGDEVAQVYVRIPETGGIVPI 786
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
K++ GF+RV + G ++ I + + +LPAG + VG
Sbjct: 787 KELKGFRRVPLEPGESRAIDIELDKEQLRYWDTTKEQFILPAGTFDVMVG 836
>gi|423293434|ref|ZP_17271561.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
CL03T12C18]
gi|392678377|gb|EIY71785.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
CL03T12C18]
Length = 735
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 220/768 (28%), Positives = 357/768 (46%), Gaps = 115/768 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ D+ P R+ DL+SRMTL+EK+ QL + G E E S +G
Sbjct: 29 LYKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85
Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
+FD D I G T +P + S+N L ++
Sbjct: 86 IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G +A VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199
Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
D EN ++++C KHY Y R + ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
M VK G A+++M S+N ++G+P A+P ++ + ++ W G+IV+D +++ + ++
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304
Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
LA +K+DA AGL++D + Y V++GKV +D+S++ + V RLG
Sbjct: 305 LAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGL 364
Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
F+ V+ K +++ +AA+ A E +VLLKN+ LPL + K +AVVGP A
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNNNQILPLTNK--KKIAVVGPMAK 422
Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
++G++ G + Y A F G A + Y GC ++ S FA A + A
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKPQG--NDRSGFAGALDVA 480
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
+ +D I+ G L+ E+ R + LP Q +L+ ++ E K P+ILV+ + G +
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVL--SNGRPLE 537
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MP 612
AIL PG G R++A ++ G+ NP G+L +T+ P ++ +P
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF---------PYSTGQIP 588
Query: 613 L---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+ R G+ G YK LYPFG+GLSYT+FKY +
Sbjct: 589 IYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV------------------ 629
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
T A+K ++ D +V N G+ DG++ V + P +K
Sbjct: 630 ---TPSATK----------VKRGDKLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVK 676
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
++ F++ F++ G K +F + + V+ L AGE+ I V
Sbjct: 677 ELKHFEKQFIKVGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|298479985|ref|ZP_06998184.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
gi|298273794|gb|EFI15356.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
Length = 735
Score = 276 bits (705), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 223/768 (29%), Positives = 356/768 (46%), Gaps = 115/768 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ D+ P R+ DL+SRMTL+EKV QL + G E E S +G
Sbjct: 29 LYKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85
Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
+FD D I G T +P + S+N L ++
Sbjct: 86 IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G +A VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199
Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
D EN ++++C KHY Y R + ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
M VK G A+++M S+N ++G+P A+P ++ + ++ W G+IV+D +++ + ++
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304
Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
LA +K+DA AGL++D + Y V++GKV +D+S++ + V LG
Sbjct: 305 LAATKKDAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFCLGL 364
Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
F+ V+ K +++ +AA+ A E +VLLKND LPL + K +AVVGP A
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KKIAVVGPMAK 422
Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
++G++ G + Y A F G A + Y GC ++ S FA A + A
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKPQG--NDRSGFAGALDVA 480
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
+ +D I+ G L+ E+ R + LP Q +L+ ++ E K PVILV+ + G +
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVL--SNGRPLE 537
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MP 612
AIL PG G R++A ++ G+ NP G+L +T+ P ++ +P
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF---------PYSTGQIP 588
Query: 613 L---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+ R G+ G YK LYPFG+GLSYT+FKY +
Sbjct: 589 IYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV------------------ 629
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
T A+K ++ D +V N GS DG++ V + P +K
Sbjct: 630 ---TPSATK----------VKRGDKLSAEVTVTNTGSRDGAETVHWFISDPYCSITRPVK 676
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
++ F++ ++AG K +F + + V+ L AGE+ I V
Sbjct: 677 ELRHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|393781221|ref|ZP_10369422.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
CL02T12C01]
gi|392677556|gb|EIY70973.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
CL02T12C01]
Length = 946
Score = 275 bits (704), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 233/819 (28%), Positives = 371/819 (45%), Gaps = 145/819 (17%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW 98
F+K G++ ++ D + P R++DL+S+M L+EK Q+ +G R+ LP EW
Sbjct: 44 FNKNGIKD---IYEDPTAPIDARIEDLLSQMNLNEKTCQMVTL-YGYKRVLKDDLPTPEW 99
Query: 99 ----WS-------EALHGVSNVG-PGTHFDDVIPG------------------------- 121
W E L+G G P + + V P
Sbjct: 100 KQMLWKDGMGAIDEHLNGFQQWGLPPSDNEYVWPASRHAWALNEVQRFFVEETRLGIPVD 159
Query: 122 -------------ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSP 167
AT+FPT + ++N L +IG EAR + G T ++P
Sbjct: 160 FTNEGIRGVESYKATNFPTQLGLGHTWNRKLIHQIGLITGREARML------GYTNVYAP 213
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
++V RD RWGR E GE P++V + VRG+Q H + +V++ KH
Sbjct: 214 ILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKH 260
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
+ AY + D +++ +++E + PF+ ++E VM SYN +G P +
Sbjct: 261 FIAYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGFPIQS 320
Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG--- 344
L +RG+ GY+V+D D+++ + H D KE AV Q+++AGL++ C
Sbjct: 321 SYYWLTTRLRGQMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRS 379
Query: 345 -QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD-ICSDENI 402
Y VQ+G + E I+ ++ + V +G FD Q G D + +EN
Sbjct: 380 PDSYVLPLRELVQEGGLSEEVINDRVRDILRVKFLVGLFDAPYQTDLKGADDEVEKEENE 439
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
+A +A+RE IVLLKN+ NTLPL+ VK +AV GP+A + +Y + + +
Sbjct: 440 AVALQASRESIVLLKNENNTLPLDITSVKKIAVCGPNAAEKAYALTHYGPLAVEVTTVVD 499
Query: 463 G----FSGYANVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILA 504
G +G A V Y GCD V + + I A A+ AD +++
Sbjct: 500 GLREKLNGKAEVLYTKGCDLVDAHWPESEIIDYPLSKDEQSEIDKAVAQAQEADVAVVVL 559
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
G E+ R L LPG Q L+ V K PVILV+++ + + +A + + A
Sbjct: 560 GGGQRTCGENKSRSSLDLPGRQLDLLKAVQATGK-PVILVLINGRPLSVNWA--DKFVPA 616
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGY 621
IL A YPG +GG AIADV+FG +NPGG+L +T+ V +P + P +P +D
Sbjct: 617 ILEAWYPGSKGGTAIADVLFGDYNPGGKLTVTFPKS--VGQIPF-NFPHKPSSQIDGGKN 673
Query: 622 PGRT--YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
PG NG LYPFGYGLSYT F+Y+ ++ + + K+Q
Sbjct: 674 PGTKGDMSRVNG-ALYPFGYGLSYTTFEYSDINISPKVITPNQKVQ-------------- 718
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+RC N G G +VV +Y + TY K + GF+R+ +
Sbjct: 719 ---------VRC--------KVTNTGKHAGDEVVQLYVRDLISSVTTYEKNLEGFERIHL 761
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ G K + F + K+L +++ + ++ G+ +I +G
Sbjct: 762 QPGETKEVSFTLDR-KALELLNAKNDWVVEPGDFSIMLG 799
>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
plebeius DSM 17135]
Length = 912
Score = 275 bits (704), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 205/689 (29%), Positives = 329/689 (47%), Gaps = 99/689 (14%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P ++ +E + GV N AT+FPT + ++N L ++IG
Sbjct: 118 RLGIPA-DFTNEGIRGVENYI-----------ATNFPTQLALGHTWNRELIRQIGYITGR 165
Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EAR + G T ++P ++V RD RWGR E GE P++V + +GLQ
Sbjct: 166 EARLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIAMGKGLQ----- 214
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
TD+ +V+S KH+ AY + D +++ +++E PF ++E
Sbjct: 215 ---TDM-----QVASTAKHFIAYSNNKGAREGFARVDPQMSWREVENIHAYPFTRVIQEA 266
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
VM SYN +G P + L Q +RG GY+V+D D+++ + HK D KE
Sbjct: 267 GILGVMSSYNDYDGFPIQSSYYWLTQRLRGTMGFRGYVVSDSDAVEYLYSKHKTAKDMKE 326
Query: 329 DAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
AV Q+++AGL++ C + Y +Q+G + ID ++ + V G FD
Sbjct: 327 -AVRQSVEAGLNVRCTFRSPESYVLPLRELIQEGGLSMETIDNRVRDILRVKFLTGLFDT 385
Query: 385 SPQY-VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
Q ++L +++ S+ + ++A +A+REG+VLLKN N LPL+ +++K +AV GP+A+
Sbjct: 386 PYQTDLALADKEVNSEAHQQVALQASREGLVLLKNANNLLPLDKSQIKRIAVCGPNADEA 445
Query: 444 VAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDV--------------ACKSNN 485
+ +Y + + + G VTY GCD V +
Sbjct: 446 SFALTHYGPVAVEVTTVLEGIKQQVKEGTKVTYTKGCDLVDANWPESEIISYPLTAEEKT 505
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
I A + K +D +++ G + E+ R L LPG+Q QL+ + K PV+LV+
Sbjct: 506 EIQKAVDNVKESDVAVVVLGGGIRTCGENKSRTSLDLPGHQQQLLEAIVATGK-PVVLVL 564
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
++ + I +A + + AIL A YPG +GG AIA+ +FG +NPGG+L +T+ V
Sbjct: 565 INGRPLSINWA--DKFVPAILEAWYPGSQGGTAIAEALFGDYNPGGKLTVTF--PKTVGQ 620
Query: 606 LPLTSMPLRP---VDSLGYPGR--TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
+P + P +P VD PG NGP LYPFGYGLSYT F+Y+
Sbjct: 621 IPF-NFPAKPASQVDGGQTPGMKGNQSRINGP-LYPFGYGLSYTTFEYS----------- 667
Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
NL +S + P + ++ N G+ G +VV +Y++
Sbjct: 668 --------NLQLSSPVITDKEPVTVTCKIK------------NTGTRSGDEVVQLYTRDV 707
Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
TY K + GF+RV + G K++ F
Sbjct: 708 ISSVTTYEKNLRGFERVHLEPGETKKVSF 736
>gi|323344052|ref|ZP_08084278.1| beta-glucosidase [Prevotella oralis ATCC 33269]
gi|323094781|gb|EFZ37356.1| beta-glucosidase [Prevotella oralis ATCC 33269]
Length = 779
Score = 275 bits (703), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 211/726 (29%), Positives = 335/726 (46%), Gaps = 123/726 (16%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT FPT + A+++ + ++ G ++
Sbjct: 128 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGLGMAATWSTDVIEQAGVIIAK 175
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R + G + P +++A +PRW R+ ET GEDP + G AV V+GL
Sbjct: 176 EIRL-----QGGHISYGPVLDLAHEPRWSRVEETMGEDPVLSGTIAVAQVKGL------- 223
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
A D+ ++P + KH+ AY + + + + +D+ + FL PF + G
Sbjct: 224 GAGDI-TKPFATIATLKHFIAYGIPE---SGQNGAPSIIGTRDLLDNFLPPFRRAIDAG- 278
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++GIP ++ LL + +R +W G++V+D SI + H ++ +E
Sbjct: 279 ALSVMTSYNSMDGIPCTSNGHLLTEILRNQWGFKGFVVSDLYSIDGIYGTHHTVSSLQEA 338
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
+ + L+AG+D+D G +AV+QG+V E ID+++ + + + +G F+
Sbjct: 339 GI-EALRAGVDVDLGANAFALLCDAVRQGRVSEAAIDEAVLRILRMKIEMGLFEHPYVNP 397
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
K + + ENI++A A E I LLKN LPL S +K +AV+GP+A+ M+G+
Sbjct: 398 KTAKTGVRTAENIQVAKRVAEESITLLKNSNKLLPL-SKNIK-IAVIGPNADNRYNMLGD 455
Query: 450 YA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKT 496
Y GI + +SP + +TY GC + N I A AA+
Sbjct: 456 YTAPQQDSNVKTILDGIRSK-LSP-------SQITYVKGCS-IRDTVFNEIGEAVRAARE 506
Query: 497 ADATIILAGLDLSVE-----------------------AESLDREDLWLPGYQTQLINQV 533
AD ++ G + + E DR L L G Q++L+ +
Sbjct: 507 ADVIVVAVGGSSARDFKTSYQETGAAITSSKVVSDMESGEGFDRASLSLMGIQSRLLQSL 566
Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
E K P++++ + +D +A + A+L A YPG+EGG AIA+V+FG +NP GRL
Sbjct: 567 KETGK-PMVVIYIEGRPLDKTWASEQAD--ALLTAYYPGQEGGNAIANVLFGDYNPAGRL 623
Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
PIT V LP+ RPV Y LYPFGYGLSYT F Y+ L+
Sbjct: 624 PITVPRS--VGQLPVYYNKKRPVV------HNYVEMASTPLYPFGYGLSYTSFDYSHLNI 675
Query: 654 TKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVV 713
TK + ++ +E D +N G DG +V
Sbjct: 676 TK----------------------------------KSEEEYEVSFDIRNSGERDGDEVA 701
Query: 714 IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
+Y +KQ+ GF R+ ++ G KRI + L+I D ++ AG+
Sbjct: 702 QLYISDKVASVVQPVKQLKGFARIHLKKGETKRITLILKK-DDLSITDRNMERVVEAGDF 760
Query: 774 TIFVGN 779
I +G+
Sbjct: 761 EIQIGS 766
>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
CL03T12C32]
Length = 866
Score = 275 bits (703), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 160/452 (35%), Positives = 235/452 (51%), Gaps = 44/452 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
+ + F + LP R+ DL+ R+T +EK+ Q+ + + RLG+P+Y+WW+EALHGV+
Sbjct: 20 RQEDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVA 79
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
G AT FP I A+F++ + VS EARA Y+ +
Sbjct: 80 RAGK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRY 129
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+W+PNIN+ RDPRWGR ET GEDP++ R + V+GLQ + +
Sbjct: 130 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQGDD---------PKYF 180
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K +C KHYA + W +R+ FD VT +D+ +T+L FE VK+G+ VMC+YNR
Sbjct: 181 KTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNR 237
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ------VMVDNHKFLADSKEDAVAQ 333
G P C+ KLL +R W I++DC +I H+ D+ E A A
Sbjct: 238 YQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERTPRHETHPDA-ESASAD 296
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
+ G DL+CG Y A+++GK+ E D+D SL+ L LG FD + Y +
Sbjct: 297 AVLNGTDLECGNSYKALI-KALKEGKISENDLDVSLRRLLKGRFELGMFDPDERVPYAQI 355
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ S E++ A E A + +VLLKN NTLPL S ++ +AVVGP+A + + NY
Sbjct: 356 PYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAADSTMLWANYN 414
Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
G P ++ + G V Y+ GC+ A
Sbjct: 415 GFPTHTVTILEGIRNKVPDTEVIYELGCNHAA 446
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 132/301 (43%), Gaps = 56/301 (18%)
Query: 490 ASEAAKTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVA 537
A+ AAK DA +I+ G+ +E E + DR ++ +P Q +++ +
Sbjct: 594 AATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMVKALKATG 653
Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
K PV+ V+ + G +A + NI AIL A Y G+E G A+AD++FG +NP GRLP+T+
Sbjct: 654 K-PVVYVLCT--GSALALNWEDANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTF 710
Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
Y + LP + GRTY++ LYPFGYGLSYT F Y
Sbjct: 711 YKS--IDQLP-------DFEDYSMKGRTYRYMTETPLYPFGYGLSYTNFAY--------- 752
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
RN +S G + D F D N G DG +V +Y
Sbjct: 753 ----------RNAKLSS--------GKITKDQSVTLTF----DIANTGKMDGDEVAQIYI 790
Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
K P + IK + F RV V+AG ++ + + D + G++ I
Sbjct: 791 KNPNDPEGP-IKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNTQTMEVRPGKYQILY 849
Query: 778 G 778
G
Sbjct: 850 G 850
>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 275 bits (702), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 175/459 (38%), Positives = 241/459 (52%), Gaps = 47/459 (10%)
Query: 52 FLFCDSSLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
FL C S PY R DLV R+TL+EK + + + +PRLG+ Y+WW+EALH
Sbjct: 18 FLSC-SQPPYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALH 76
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------L 157
GV G AT FP I ASFN L + AVS EARA L
Sbjct: 77 GVGRAGL----------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGL 126
Query: 158 GR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
R GLT W+PNIN+ RDPRWGR ET GEDP++ G+ + VRGLQ EG +
Sbjct: 127 KRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYD----- 181
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMC 275
K+ +C KHYA + W +R+ F+A + +D+ ET+L F+ V++ VMC
Sbjct: 182 ---KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMC 235
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-SKEDAVAQT 334
+YNR G P C +LL Q +R EW +V+DC +I + D K+ A A+
Sbjct: 236 AYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKDAHETDPDKQHASAKA 295
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG 392
+ +G D++CG Y + AV++G + E ID SLK L LG D Q + +
Sbjct: 296 VLSGTDVECGDSYASLP-EAVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIP 354
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
+ S E+ ELA ARE +VLL+N+Q+ LPLN K VAVVGP+AN +V GNY G
Sbjct: 355 YSVVDSKEHRELALRMARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNG 412
Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
P ++ + G Y + + Y+ GCD + + S+F
Sbjct: 413 FPSHTITLLEGIREYLPESQIIYEPGCDLTSDVTLQSVF 451
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/298 (28%), Positives = 132/298 (44%), Gaps = 56/298 (18%)
Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
+ K AD I G+ +VE E + DRE + LP Q++L+ AE+ K
Sbjct: 595 DKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLL---AELKKAGK 651
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
+V ++ G IA + AIL A YPG+ GG AIA+V+FG +NP GRLP+T+Y
Sbjct: 652 KIVFVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYK-- 709
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
++ L + GRTY++ L+PFG+GLSYT F+Y S +N
Sbjct: 710 -------STKQLPDFEDYSMKGRTYRYMTENPLFPFGHGLSYTTFQYGNAS------LNT 756
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
++++ + T + N G DG +VV VY + P
Sbjct: 757 SEIKDGEQVTLT-------------------------IPVSNTGKYDGEEVVQVYLRHPG 791
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
+ + F+RV + G + + ++ D + NT+ P G++ I G
Sbjct: 792 DKEGPS-HALRAFKRVAIAKGATNNVTIPLSK-ENFEWFDTSTNTMRPIEGDYEILYG 847
>gi|325105296|ref|YP_004274950.1| beta-glucosidase [Pedobacter saltans DSM 12145]
gi|324974144|gb|ADY53128.1| Beta-glucosidase [Pedobacter saltans DSM 12145]
Length = 884
Score = 275 bits (702), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 192/564 (34%), Positives = 279/564 (49%), Gaps = 75/564 (13%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
F C G S L Q + F + LP + R+++L+ +TL+EKV + + + V RLG+P
Sbjct: 14 FYCLLGN-SNLKSQEIPYKFRNPDLPVNERIENLLGLLTLEEKVGLMMNSSKPVGRLGIP 72
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
Y+WW+EALHGV+ G AT FP I A++NES K+ +S EARA
Sbjct: 73 AYDWWNEALHGVARSGK----------ATVFPQAIGMAATWNESGHKQTFDLISDEARAK 122
Query: 155 YN-------LGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
YN GR GL++W+PNIN+ RDPRWGR ET GEDP++ R V VRGLQ +
Sbjct: 123 YNEAIRNGERGRYYGLSFWTPNINIFRDPRWGRGQETYGEDPYLTARLGVAAVRGLQGDD 182
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
+ K +C KH+A + W +R+ +DA + +D+ ET+L F+ VK
Sbjct: 183 ---------PKYFKTHACAKHFAVHSGPEW---NRHSYDATASGRDLWETYLPAFKALVK 230
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV-----DNHK 321
E + VMC+YN G P C +LL +R W+ G +V+DC +I + HK
Sbjct: 231 EANVQEVMCAYNAYEGQPCCGSDRLLTDILRNRWEYKGIVVSDCWAIDDFFRKGHHETHK 290
Query: 322 FLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
A + DAV + DL+CG YTN AV+QG + + ID SL+ + LG
Sbjct: 291 DAAAAAADAVIHS----TDLECGSAYTNLL-EAVRQGLISQQQIDISLRRVLRGWFELGM 345
Query: 382 FDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
D + + + L Q + S E+++ A + ARE + LLKN+ + LPL S +K +AV+GP+
Sbjct: 346 LDPAERLPWSQLPYQIVASKEHVQQALKVARESMTLLKNNGSILPL-SKSIKKIAVIGPN 404
Query: 440 ANATVAMIGNYAGIPCRYMSPIAGFSG---YANVTYKTGCDDVACKSNNSI---FAASEA 493
A +V + GNY G P ++ + G +A + Y GCD V S+ F +S
Sbjct: 405 AADSVMLWGNYNGTPNSTVTILQGIKNKLPHAEIIYDKGCDWVDPWVRTSLFEGFTSSPK 464
Query: 494 AKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDI 553
+ LS E T LIN +A + +AGG +
Sbjct: 465 GQKGMKVEFFNNTQLSGSPE-------------TTLINTLA--------IKYNNAGGTAL 503
Query: 554 A----FAETNTNIKAILWAGYPGE 573
A T+T I + A Y GE
Sbjct: 504 AQGVNLQNTSTRISGVFTAPYTGE 527
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 131/298 (43%), Gaps = 50/298 (16%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
K DA + GL +E E + D+ + LP Q +L++ + K PV+ V
Sbjct: 606 KEVDAIVYAGGLSPQLEGEEMPVNADGFRGGDKISIDLPKIQRELLSSLKSTGK-PVVFV 664
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV- 603
+ + G +A + N A+L A Y G+E G A+ADV+FG +NP GRLPIT+Y
Sbjct: 665 LCT--GSSLALEQDEKNYNALLCAWYGGQEAGTAVADVLFGDYNPAGRLPITFYKSLSQL 722
Query: 604 --QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
+L + + ++ GRTY++ LY FG+GLSY++F Y T
Sbjct: 723 DNALLKTSDTSRQDFENYSMQGRTYRYMTEKPLYAFGHGLSYSKFNYGEAKLTS------ 776
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
++ + + N+ + G +VV VY K
Sbjct: 777 -------------------------GTVKIGNTLNISIPLTNISNNKGEEVVQVYVKRNG 811
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
+ A +K + GF+RV + AG K + F A ++ D + + L P AG +TI G
Sbjct: 812 DPDAP-VKSLKGFKRVAIAAGETKHLDFQLTA-EAFEFYDPSKDELGPKAGNYTIMYG 867
>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 175/459 (38%), Positives = 241/459 (52%), Gaps = 47/459 (10%)
Query: 52 FLFCDSSLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
FL C S PY R DLV R+TL+EK + + + +PRLG+ Y+WW+EALH
Sbjct: 18 FLSC-SQPPYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALH 76
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------L 157
GV G AT FP I ASFN L + AVS EARA L
Sbjct: 77 GVGRAGL----------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGL 126
Query: 158 GR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
R GLT W+PNIN+ RDPRWGR ET GEDP++ G+ + VRGLQ EG +
Sbjct: 127 KRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYD----- 181
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMC 275
K+ +C KHYA + W +R+ F+A + +D+ ET+L F+ V++ VMC
Sbjct: 182 ---KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKNLVQKAHVKEVMC 235
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-SKEDAVAQT 334
+YNR G P C +LL Q +R EW +V+DC +I + D K+ A A+
Sbjct: 236 AYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKA 295
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG 392
+ +G D++CG Y + AV++G + E ID SLK L LG D Q + +
Sbjct: 296 VLSGTDVECGDSYASLP-EAVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIP 354
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
+ S E+ ELA ARE +VLL+N+Q+ LPLN K VAVVGP+AN +V GNY G
Sbjct: 355 YSVVDSKEHRELALRMARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNG 412
Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
P ++ + G Y + + Y+ GCD + + S+F
Sbjct: 413 FPSHTITLLEGIREYLPESQIIYEPGCDLTSDVTLQSVF 451
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/298 (28%), Positives = 132/298 (44%), Gaps = 56/298 (18%)
Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
+ K AD I G+ +VE E + DRE + LP Q++L+ AE+ K
Sbjct: 595 DKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLL---AELKKAGK 651
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
+V ++ G IA + AIL A YPG+ GG AIA+V+FG +NP GRLP+T+Y
Sbjct: 652 KIVFVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYK-- 709
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
++ L + GRTY++ L+PFG+GLSYT F+Y S +N
Sbjct: 710 -------STKQLPDFEDYSMKGRTYRYMTENPLFPFGHGLSYTTFQYGNAS------LNT 756
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
++++ + T + N G DG +VV VY + P
Sbjct: 757 SEIKDGEQVTLT-------------------------IPVSNTGKYDGEEVVQVYLRHPG 791
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
+ + F+RV + G + + ++ D + NT+ P G++ I G
Sbjct: 792 DKEGPS-HALRAFKRVAIAKGATNNVTIPLSK-ENFEWFDTSTNTMRPIEGDYEILYG 847
>gi|103486503|ref|YP_616064.1| glycoside hydrolase [Sphingopyxis alaskensis RB2256]
gi|98976580|gb|ABF52731.1| glycoside hydrolase, family 3-like protein [Sphingopyxis alaskensis
RB2256]
Length = 772
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 229/737 (31%), Positives = 344/737 (46%), Gaps = 110/737 (14%)
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP------------- 111
+ DL+ +MTLDEK QL G + + E + VG
Sbjct: 57 IADLMVKMTLDEKTGQLTLLTSNWESTGPTMRDSYKEDIR-AGRVGAIFNAYTAKYTREL 115
Query: 112 ------GTHFD-------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
GT DVI G T FP + AS++ +K + + EA A
Sbjct: 116 QALAVEGTRLKIPLLFGYDVIHGHRTIFPISLGEAASWDLQAIEKAARISAIEASA---- 171
Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
G+ + +SP +++ARDPRWGRI+E GED ++ A VRG Q DL S
Sbjct: 172 --EGIHWTFSPMVDIARDPRWGRISEGAGEDVYLGSLIAKARVRGYQ-------GGDL-S 221
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
RP + + KH+AAY G D + D ++E+ M + +L PF+ A++ M +
Sbjct: 222 RPDTILATAKHFAAYGAAQ-AGRDYHTVD--ISERTMRDVYLPPFKAAADA-GAATFMTA 277
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
+N +G+P+ LL +R +W G++V D SI MV H + D K+ A Q ++
Sbjct: 278 FNEYDGVPASGSHYLLTDVLRKKWGFKGFVVTDYTSINEMVP-HGYAKDLKQ-AGEQAMR 335
Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD 395
AG+D+D G + +V +GKV ID ++K + + RLG FD +Y ++
Sbjct: 336 AGVDMDMQGAVFMENLAKSVAEGKVDTARIDAAVKAILEMKYRLGLFDDPYRYADAAREK 395
Query: 396 --ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
I +E A + AR+ IVLLKN N LPL +A K++AV+GP N+ MIG+++
Sbjct: 396 ATIYKPAFLEAARDVARKSIVLLKNKDNVLPL-AASAKSIAVIGPLGNSKEDMIGSWSAA 454
Query: 454 PCRYMSPI-------AGFSGYANVTYKTGC----DDVACKSNNSIFAASEAAKTADATII 502
R P+ AG + Y G DDV + A A+ +D I
Sbjct: 455 GDRRTRPVTLLEGLQAGAPKGTTIAYAKGASYHFDDVG--KTDGFAEALALAEKSDVIIA 512
Query: 503 LAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
G ++ E+ R L LPG Q L+ + + K PVILV+MS I +A + N+
Sbjct: 513 AMGEHWNMTGEAASRTSLDLPGNQQALLEALEKTGK-PVILVLMSGRPNSIEWA--DANV 569
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL---TSMPLRPVDSL 619
AIL A YPG GG AIAD+++G++NP G+LP+T+ V +P+ RP++ L
Sbjct: 570 DAILEAWYPGTMGGHAIADILYGRYNPSGKLPVTFPR--TVGQVPIHYDMKNTGRPIE-L 626
Query: 620 GYPGRTY--KFYNGPT--LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
G PG Y ++ N P LYPFGYGLSYT F Y+ ++ D
Sbjct: 627 GAPGAKYVSRYLNTPNTPLYPFGYGLSYTSFTYSPVTL---------------------D 665
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
SK R PG + V N G DG +VV +Y + +K++ GFQ
Sbjct: 666 RSKIR-PG---------EPLTASVTVTNSGPRDGEEVVQLYVRDLVGSVTRPVKELKGFQ 715
Query: 736 RVFVRAGRNKRIKFVFN 752
++ ++ G + ++F
Sbjct: 716 KIGLKKGETRTVRFTLT 732
>gi|296081549|emb|CBI20072.3| unnamed protein product [Vitis vinifera]
Length = 333
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 143/335 (42%), Positives = 204/335 (60%), Gaps = 12/335 (3%)
Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
MIGNY G P +Y +P+ G + TY GC +VAC + I A + A ADAT+++ G
Sbjct: 1 MIGNYEGTPGKYTTPLQGLTALVATTYLPGCSNVACGTAQ-IDEAKKIAAAADATVLIVG 59
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
+D S+EAE DR ++ LPG Q LI +VA+ +KG VILV+MS GG DI+FA+ + I +I
Sbjct: 60 IDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSI 119
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
LW GYPGE GG AIADV+FG +NP GRLP TWY YV +P+T+M +RP + GYPGRT
Sbjct: 120 LWVGYPGEAGGAAIADVIFGFYNPSGRLPTTWYPQSYVDKVPMTNMNMRPDPASGYPGRT 179
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
Y+FY G T+Y FG GLSYTQF ++L+ K++ + + + C + ++C V
Sbjct: 180 YRFYTGETIYTFGDGLSYTQFNHHLIQAPKSVSIPIEEGHSCHS---------SKCKSVD 230
Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
C + F+ + N G+ GS V ++S PP+ + + K ++GF++VFV A
Sbjct: 231 AVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPS-VHNSPQKHLLGFEKVFVTAKAE 289
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++F + CK L+IVD + G H + VGN
Sbjct: 290 ALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGN 324
>gi|319901412|ref|YP_004161140.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
gi|319416443|gb|ADV43554.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
P 36-108]
Length = 944
Score = 275 bits (702), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 228/807 (28%), Positives = 359/807 (44%), Gaps = 144/807 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D + R++DL+ +M+L+EK Q+ +G R+ LP EW W
Sbjct: 52 IYEDPTAAIDARIEDLLKQMSLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGA 110
Query: 101 --EALHGVSNVG-PGTHFDDVIPG------------------------------------ 121
E L+G G P + ++V P
Sbjct: 111 IDEHLNGFRQWGLPPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVES 170
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L KIG EAR + G T ++P ++V RD RWG
Sbjct: 171 YKATNFPTQLGLGHTWNRELIHKIGFITGREARML------GYTNVYAPILDVGRDQRWG 224
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + VRG+Q H+ V++ KH+AAY +
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGMQ--YNHQ-----------VAATGKHFAAYSNNKGAR 271
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + PF ++E VM SYN +GIP L +RG
Sbjct: 272 EGMSRVDPQISPREVENIHIYPFRRVIREAGLLGVMSSYNDYDGIPIQGSHYWLTTRLRG 331
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E GY+V+D D+++ + H D KE A+ Q+++AGL++ C +
Sbjct: 332 EIGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AIRQSVEAGLNIRCTFRSPDSFVLPLREL 390
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
V++G + E I+ ++ + V G FD +P L D + +EN +A +A+RE
Sbjct: 391 VKEGGLSEEIINDRVRDILRVKFLTGLFD-TPYQSDLAGADREVEKEENGSIALQASRES 449
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYA 468
IVLLKN+ N LPL+ + VK +AV GP+A+ + +Y + ++ + G SG A
Sbjct: 450 IVLLKNENNMLPLDLSTVKRIAVCGPNADEKNYALTHYGPLAVEVITVLKGIQDKVSGKA 509
Query: 469 NVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
V Y GCD V I A+E A+ +D +++ G E+
Sbjct: 510 EVLYTKGCDLVDANWPESEIINHPLTADEQAEINKAAENARQSDVAVVVLGGGQRTCGEN 569
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q QL+ + K PVILV+++ + + +A + + AIL A YPG +
Sbjct: 570 KSRSSLDLPGRQLQLLQAIQATGK-PVILVLINGRPLSVNWA--DKYVPAILEAWYPGAK 626
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--Y 629
GG A+ADV+FG +NPGG+L +T+ V +P + P +P +D PG
Sbjct: 627 GGIALADVLFGDYNPGGKLTVTFPK--TVGQIPF-NFPYKPASQIDGGKNPGPEGNMSRI 683
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
NG LYPFGYGLSYT F+Y+ L T + +
Sbjct: 684 NG-ALYPFGYGLSYTTFEYSDLEITPKV-------------------------------I 711
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
++ ++ N G G +VV +Y + TY K + GF+RV + G K + F
Sbjct: 712 TPNEEATVRLKVTNTGKRAGDEVVQLYIRDVVSSVITYEKNLAGFERVHLEPGETKEVVF 771
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIF 776
K L ++D ++ G+ TI
Sbjct: 772 TL-GRKHLELLDANMQWVVEPGDFTIM 797
>gi|333377782|ref|ZP_08469515.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
gi|332883802|gb|EGK04082.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
22836]
Length = 727
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 220/740 (29%), Positives = 341/740 (46%), Gaps = 114/740 (15%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ F ++SL R+ +L+S MT+DEK+ L GVPRLG+ + SE LHG++ GP
Sbjct: 24 YPFQNTSLSDEKRLDNLLSIMTIDEKINALST-NLGVPRLGI-RNTGHSEGLHGMALGGP 81
Query: 112 GT----------HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-- 159
G DV P T+FP +++ L KK+ +TE R R
Sbjct: 82 GNWGGFKMVNYQRVPDVYP-TTTFPQAYGLGETWDTELIKKVADIEATEIRYYTQNERYT 140
Query: 160 -AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
GL +PN ++ARDPRWGR E+ GEDPF+V AV +++GLQ N R
Sbjct: 141 KGGLVMRAPNADLARDPRWGRTEESFGEDPFLVSEMAVAFIKGLQGE---------NPRY 191
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K +S KH+ A ++ + +FD R+ E + PF +++G + + M +YN
Sbjct: 192 WKSASLMKHFLANSNEDGRDSTSSNFDNRL----FHEYYSYPFRKGIEKGGSQAFMAAYN 247
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
N IP P L + +R +W+ G I D ++ +++ HK E + A +KAG
Sbjct: 248 SWNEIPMTIHPIL--KKIRKDWNFKGIICTDGGALDLLIKAHKTFPTHTEGSAA-IVKAG 304
Query: 339 LDLDCGQYYTNFTG---NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLG 392
+ GQ+ NF A+++G + E +IDK+++ + + ++LG DG Y +G
Sbjct: 305 V----GQFLDNFRPYIYQALEKGMLTEAEIDKAIRGNFYIALKLGLLDGDQTKLPYAHIG 360
Query: 393 KQDICS----DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
D S E + + +VLLKN++ LPLN +K +AV+GP AN ++
Sbjct: 361 VTDTVSVWRNKEIQDFVRLVTAKSVVLLKNEKKLLPLNKGNIKRIAVIGPRANEV--LLD 418
Query: 449 NYAGIPCRYMSPIAGFSGYA--NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
Y+G P +S + G NV +V +S+N I A AA+ AD I+ G
Sbjct: 419 WYSGTPPYTVSILQGIKNAVGNNV-------EVIYESSNEIDKAYLAAQKADIAIVCVGN 471
Query: 507 DL-------------SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDI 553
+ S E++DR+ L L Q L+ V + V++++ S
Sbjct: 472 HVYGTDPKWKYSPVPSDGREAVDRKALSLE--QEDLVKIVHKANPNTVMVLVSS---FPF 526
Query: 554 AFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPL 613
A + NI AIL +E G +ADV+FG +NP GR TW + P+ +
Sbjct: 527 AINWSQENIPAILHITNNSQELGNGLADVIFGNYNPAGRTNQTWVKS-IADLPPMMDYDI 585
Query: 614 RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
R GRTY + LYPFGYGLSYT F Y+ ++ + + L +NL
Sbjct: 586 R-------NGRTYMYAKEKPLYPFGYGLSYTNFTYSDMALSSS------ALSKGKNL--- 629
Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
+ V+ +N G DG +V +Y P IKQ+ G
Sbjct: 630 ----------------------KVSVNVKNTGDMDGEEVAQLYVSFPQSKVVRPIKQLKG 667
Query: 734 FQRVFVRAGRNKRIKFVFNA 753
F R+ ++ G +K +F +A
Sbjct: 668 FDRISIKKGESKTFEFTLSA 687
>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
[Bacteroides sp. 4_1_36]
Length = 942
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 232/809 (28%), Positives = 360/809 (44%), Gaps = 144/809 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D S P R+++L+ +MTLDEK Q+ +G R+ LP EW W
Sbjct: 52 VYEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKDGIGA 110
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 111 IDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVES 170
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWG 224
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + VRGLQ H + +V++ KH+AAY +
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGLQ----HNH---------QVAATGKHFAAYSNNKGAR 271
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + PF+ ++E VM SYN +GIP L +RG
Sbjct: 272 EGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRG 331
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E GY+V+D D+++ + H D KE AV Q+++AGL++ C +
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLREL 390
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
V++G + E I+ ++ + V +G FD +P L D + +EN +A +A+RE
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLIGLFD-APYQTDLADADREVEKEENEAIALQASRES 449
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----GYA 468
IVLLKN LPL+ K +AV GP+AN + +Y + + + G G A
Sbjct: 450 IVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKA 509
Query: 469 NVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAES 514
V Y GCD V S I A E A+ AD I++ G E+
Sbjct: 510 EVLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGEN 569
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q QL+ + K PV+L++++ + I +A + + AIL A YPG +
Sbjct: 570 KSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSK 626
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--Y 629
GG A+AD++FG +NPGG+L +T+ V +P + P +P +D PG T
Sbjct: 627 GGTALADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSRI 683
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
NG LYPFGYGLSYT F+Y+ L T + T + S T
Sbjct: 684 NG-ALYPFGYGLSYTTFEYSDLDITPRV--------------ITPNESAT---------- 718
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
++ N G G +VV +Y + TY K + GFQR+ + G + + F
Sbjct: 719 -------VRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSF 771
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ K L ++D ++ G+ + G
Sbjct: 772 TIDR-KHLELLDADMKWVVEPGDFVLMAG 799
>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
BAA-286]
Length = 854
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 163/427 (38%), Positives = 240/427 (56%), Gaps = 41/427 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ D P R+ DL+SR+T++EK+ L + G+PRL +P+Y +E+LHGV V PG
Sbjct: 29 VYLDEKAPTHDRIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGV--VRPG 86
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I + +N L KI A+S EAR +N G L
Sbjct: 87 RF--------TVFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQRFTDLL 138
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDP++ G +VRGLQ + R LK+
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPYLSGILGTAFVRGLQGDD---------PRYLKIV 189
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E + FEMCVK+G ++S+M +YN +N
Sbjct: 190 STPKHFAANNEEH----NRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMSAYNAIND 245
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P A+P LL + +R +W +GY+V+DC ++V K++ +KE A ++KAGLDL+
Sbjct: 246 VPCTANPWLLTKVLRHDWGFNGYVVSDCGGPSLLVSAMKYVK-TKEAAATLSIKAGLDLE 304
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSD 399
CG Y NA Q V DID + + M LG FD Y + + S
Sbjct: 305 CGDDVYMQPLLNAYNQYMVSRADIDTAAYRVLRARMHLGLFDDPDLNPYNKISPSVVGSA 364
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ +LA EAAR+ IVLLKN+ TLPLN KVK++AVVG NA + G+Y+GIP +
Sbjct: 365 EHKQLALEAARQSIVLLKNNNRTLPLNPKKVKSIAVVG--INAGNSEFGDYSGIPAN--A 420
Query: 460 PIAGFSG 466
P++ G
Sbjct: 421 PVSILQG 427
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 158/316 (50%), Gaps = 51/316 (16%)
Query: 478 DVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
DV K ++ A +A + + I + G++ ++E E DR D+ LP Q + I ++ +V
Sbjct: 584 DVGSKQRLDMYGEAGKAVRECEQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKV 643
Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
P I+V++ AG +A + ++ AI+ A YPGE+GG A+A+V+FG++NPGGRLP+T
Sbjct: 644 --NPNIVVVLVAGS-SLAINWMDEHVPAIVNAWYPGEQGGTAVAEVLFGEYNPGGRLPVT 700
Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
+YN ++ +P D GRTY+++ G LYPFGYGLSYT F Y K
Sbjct: 701 YYNS--LEEIPSFD------DYDITKGRTYQYFKGKPLYPFGYGLSYTTFAY------KN 746
Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
+Q+N N N+++ FE K N G DG +V VY
Sbjct: 747 LQINDNG-----------------------NNIKVS--FELK----NTGRMDGDEVSQVY 777
Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS-LNIVDYAANTLL-PAGEHT 774
K P+ IK++ GFQR ++ G K ++ N K L D A T + P GE+
Sbjct: 778 VKIPSSGIFMPIKELKGFQRSTLKKGATKNVE--INIRKDLLRYWDDATETFITPKGEYE 835
Query: 775 IFVGNGGVSFPIHLNF 790
+G + +F
Sbjct: 836 FMIGTSSQDIQLTKSF 851
>gi|167765093|ref|ZP_02437206.1| hypothetical protein BACSTE_03479 [Bacteroides stercoris ATCC
43183]
gi|167696721|gb|EDS13300.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
stercoris ATCC 43183]
Length = 944
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 231/819 (28%), Positives = 365/819 (44%), Gaps = 145/819 (17%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW 98
F+K G++ ++ D + R+++L+ +MTL+EK Q+ +G R+ LP EW
Sbjct: 44 FNKNGIKD---IYEDPAATLDARIENLLQQMTLEEKTCQMVTL-YGYKRVLKDALPTPEW 99
Query: 99 ----WS-------EALHGVSNVG-PGTHFDDVIPG------------------------- 121
W E L+G G P + ++V P
Sbjct: 100 KQMLWKDGIGAIDEHLNGFQQWGLPPSDNENVWPASRHAWALNEIQRFFVEDTRLGIPVD 159
Query: 122 -------------ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSP 167
AT+FPT + ++N L +++G EAR + G T ++P
Sbjct: 160 FTNEGIRGVESYKATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAP 213
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
++V RD RWGR E GE P++V + VRGLQ H + +V++ KH
Sbjct: 214 ILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQ----HNH---------QVAATAKH 260
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
+AAY + D ++ +++E + PF+ ++E VM SYN +GIP
Sbjct: 261 FAAYSNNKGAREGMARVDPQMPPREVENIHIYPFKRVIREAGLLGVMSSYNDYDGIPIQG 320
Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG--- 344
L +R E GY+V+D D+++ + H D KE AV Q+++AGL++ C
Sbjct: 321 SYYWLTTRLRKEMGFRGYVVSDSDAVEYLYTKHNTAKDMKE-AVRQSVEAGLNVRCTFRS 379
Query: 345 -QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE-NI 402
+ V++G + E I+ ++ + V +G FD Q G D E N
Sbjct: 380 PDSFVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADDEVEKEANE 439
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
+A +A+RE IVLLKN NTLPLN K+K +AV GP+A+ + +Y + + +
Sbjct: 440 AVALQASRESIVLLKNTDNTLPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLE 499
Query: 463 GF----SGYANVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILA 504
G G A V Y GCD V S I A A+ AD +++
Sbjct: 500 GIREKAQGKAEVLYTKGCDLVDAHWPESEIMEYPLTPDEQAEIDRAVANARQADVAVVVL 559
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
G E+ R L LPG+Q +L+ V K PVIL++++ + + +A + + A
Sbjct: 560 GGGQRTCGENKSRTSLELPGHQLKLLQAVQATGK-PVILILINGRPLSVNWA--DKFVPA 616
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGY 621
IL A YPG +GG +AD++FG +NPGG+L +T+ V +P + P +P +D
Sbjct: 617 ILEAWYPGSKGGTVVADILFGDYNPGGKLTVTF--PKTVGQIPF-NFPYKPASQIDGGKN 673
Query: 622 PGR--TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
PG NG LYPFGYGLSYT F+Y+ L T
Sbjct: 674 PGPDGNMSRING-ALYPFGYGLSYTTFEYSDLEIT------------------------- 707
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
P V+ + + ++ N G G +VV +Y++ TY K + GF+R+ +
Sbjct: 708 --PKVITPNQKAT----IRLKVTNTGKRAGDEVVQLYTRDILSSVTTYEKNLAGFERIHL 761
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ G +K I F + K L +++ + GE I G
Sbjct: 762 KPGESKEIVFTLDR-KHLELLNADMKWTVEPGEFAIMAG 799
>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
uniformis ATCC 8492]
Length = 990
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 232/809 (28%), Positives = 360/809 (44%), Gaps = 144/809 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D S P R+++L+ +MTLDEK Q+ +G R+ LP EW W
Sbjct: 100 VYEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKDGIGA 158
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 159 IDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVES 218
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWG
Sbjct: 219 YRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWG 272
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + VRGLQ H + +V++ KH+AAY +
Sbjct: 273 RYEEVYGESPYLVAELGIEMVRGLQ----HNH---------QVAATGKHFAAYSNNKGAR 319
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + PF+ ++E VM SYN +GIP L +RG
Sbjct: 320 EGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRG 379
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E GY+V+D D+++ + H D KE AV Q+++AGL++ C +
Sbjct: 380 EMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLREL 438
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
V++G + E I+ ++ + V +G FD +P L D + +EN +A +A+RE
Sbjct: 439 VKEGGLSEEVINDRVRDILRVKFLIGLFD-APYQTDLADADREVEKEENEAIALQASRES 497
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----GYA 468
IVLLKN LPL+ K +AV GP+AN + +Y + + + G G A
Sbjct: 498 IVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKA 557
Query: 469 NVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAES 514
V Y GCD V S I A E A+ AD I++ G E+
Sbjct: 558 EVLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGEN 617
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q QL+ + K PV+L++++ + I +A + + AIL A YPG +
Sbjct: 618 KSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSK 674
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--Y 629
GG A+AD++FG +NPGG+L +T+ V +P + P +P +D PG T
Sbjct: 675 GGTALADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSRI 731
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
NG LYPFGYGLSYT F+Y+ L T + T + S T
Sbjct: 732 NG-ALYPFGYGLSYTTFEYSDLDITPRV--------------ITPNESAT---------- 766
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
++ N G G +VV +Y + TY K + GFQR+ + G + + F
Sbjct: 767 -------VRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSF 819
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ K L ++D ++ G+ + G
Sbjct: 820 TIDR-KHLELLDADMKWVVEPGDFVLMAG 847
>gi|335433420|ref|ZP_08558246.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
SARL4B]
gi|335434171|ref|ZP_08558974.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
SARL4B]
gi|334898028|gb|EGM36149.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
SARL4B]
gi|334898759|gb|EGM36857.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
SARL4B]
Length = 783
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 222/754 (29%), Positives = 340/754 (45%), Gaps = 121/754 (16%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
E + +L + RLG+P E E L G PG T FP I +++
Sbjct: 89 ETINELQRYLVEETRLGIPAIEH-EECLTGYRG-----------PGGTIFPQSIGLASTW 136
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
+ +L + I ++ T A+ + SP ++V+RD RWGR+ ET GEDP +VG
Sbjct: 137 SPALVESITDSIRTRLDAV-----GTVQALSPVLDVSRDMRWGRVEETYGEDPQLVGALG 191
Query: 196 VNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
YV GLQ D EG + + KH+AA+ G +R ++ E+++
Sbjct: 192 AAYVAGLQSDGEG-------------IDATLKHFAAHG-SGEGGKNRSSV--QIGERELR 235
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
E L PFE+ ++E DA +VM +Y+ ++G+P + LL +RGEW G++VAD S+
Sbjct: 236 EVHLYPFEVAIQEADARAVMNAYHDIDGVPCASSEWLLTDVLRGEWGFDGHVVADYFSVD 295
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSL 369
++ + H +AD++ +A L+AGLD+ DC Y AV+ G++ E +D ++
Sbjct: 296 LLKEEHG-IADTQREAGVAALEAGLDVELPATDC---YDENLRKAVEDGELSEATVDTAV 351
Query: 370 KYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
+ + + G FD + +DE ELAA AARE I LL+ND LPL +
Sbjct: 352 RRVLRAKIESGVFDDPYVDPDAATEPFDTDEQTELAARAARESITLLEND-GLLPLAGGE 410
Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----------------FSGYANVTYK 473
+ +VA+VGP A+ A +G+Y R+ + AG +G+ +V Y
Sbjct: 411 LDSVALVGPQADDGRAQVGDYTHA-ARFDTEEAGDFESVTPRDALEARGETAGF-DVEYV 468
Query: 474 TGCDDVACKSNNSIFAASEAAKTADATIILAGL----------------DLSVEAESLDR 517
G + S + AA E AD + G D+ E+ D
Sbjct: 469 EGA-TMTGPSTDGFDAAEETVADADLAVACVGARSDIDFADRENPAELPDVPTSGENCDV 527
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
DL LPG Q L++++AE P+I+V +S G A E ++ A+L A PG+EGG
Sbjct: 528 TDLELPGVQEALVDRLAET-DTPLIVVQVS--GKPHAIPEIAESVPALLHAWLPGQEGGT 584
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
AIADV+FG++NP G LP++ Q + + P + + +G LY F
Sbjct: 585 AIADVLFGEYNPSGHLPVSVPKSVGQQPVYYSRKP-------NSANEEHVYMDGEPLYSF 637
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
GYGLSYT F+Y L DA G L
Sbjct: 638 GYGLSYTDFEYGDLEV---------------------DAETVAPMGTLTA---------- 666
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
V N G G DVV +Y A +++++GF+RV + G KR+ F F+A + L
Sbjct: 667 SVTVTNAGDVAGDDVVQLYQHAENPSQARPVQELLGFERVHLEPGETKRVTFSFDATQ-L 725
Query: 758 NIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
D N + G + + VG +F
Sbjct: 726 AYHDLDMNLAVEEGPYELRVGKSAAEIVDTADFE 759
>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 942
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 231/808 (28%), Positives = 360/808 (44%), Gaps = 142/808 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D S P R+++L+ +MTLDEK Q+ +G R+ LP EW W
Sbjct: 52 VYEDPSAPLEARIENLLQQMTLDEKTCQVVTL-YGYKRVLKDDLPTPEWKELLWKDGIGA 110
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 111 IDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVES 170
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWG 224
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + VRGLQ H + +V++ KH+AAY +
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGLQ----HNH---------QVAATGKHFAAYSNNKGAR 271
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + PF+ ++E VM SYN +GIP L +RG
Sbjct: 272 EGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRG 331
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E GY+V+D D+++ + H D KE AV Q+++AGL++ C +
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLREL 390
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGI 413
V++G + E I+ ++ + V +G FD Q G +++ +EN +A +A+RE I
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKEENEAIALQASRESI 450
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----GYAN 469
VLLKN LPL+ K +AV GP+AN + +Y + + + G G A
Sbjct: 451 VLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAE 510
Query: 470 VTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAESL 515
V Y GCD V S I A E A+ AD I++ G E+
Sbjct: 511 VLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENK 570
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
R L LPG Q QL+ + K PV+L++++ + I +A + + AIL A YPG +G
Sbjct: 571 SRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKG 627
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--YN 630
G A+AD++FG +NPGG+L +T+ V +P + P +P +D PG T N
Sbjct: 628 GTALADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSRIN 684
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
G LYPFGYGLSYT F+Y+ L T + T + S T
Sbjct: 685 G-ALYPFGYGLSYTTFEYSDLDITPRV--------------ITPNESAT----------- 718
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
++ N G G +VV +Y + TY K + GFQR+ + G + + F
Sbjct: 719 ------VRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFT 772
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ K L ++D ++ G+ + G
Sbjct: 773 IDR-KHLELLDADMKWVVEPGDFVLMAG 799
>gi|374320547|ref|YP_005073676.1| glycoside hydrolase [Paenibacillus terrae HPL-003]
gi|357199556|gb|AET57453.1| glycoside hydrolase family protein [Paenibacillus terrae HPL-003]
Length = 767
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 207/692 (29%), Positives = 328/692 (47%), Gaps = 98/692 (14%)
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
T FP + +++N L++ + +AV++E RA + G +SP ++V RDPRWGR
Sbjct: 124 GTVFPVPLSIGSTWNVDLYRDMCRAVASETRA-----QGGAVTYSPVLDVVRDPRWGRTE 178
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVD 240
E GEDP+++G +AV V GLQ E+ +S V++ KH+A Y + +
Sbjct: 179 ECFGEDPYLIGEFAVAAVEGLQG----ESLLSEHS----VAATLKHFAGYGSSEGGRNAG 230
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
H R + E L PF+ V+ G A SVM +YN ++G+P + +LL+ +R W
Sbjct: 231 PVHMGWR----EFLEVDLYPFQKAVEAG-AQSVMPAYNEIDGVPCTVNAELLDGILRQTW 285
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGK 359
G I+ DC +I+++ + H +A+ DA Q ++AG+D++ G+ + + AV GK
Sbjct: 286 GFDGLIITDCGAIEMLANGHD-VAEDGSDAAVQAIRAGIDMEMSGEMFGSHLVEAVHAGK 344
Query: 360 VKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKND 419
++ + +D++++ + T+ RLG FD +Q I E+I LA + A EGIVLLKN
Sbjct: 345 LETSVLDRAVRRVLTLKFRLGLFDKPYVDAERAEQVIGQTEHIRLARQLATEGIVLLKNV 404
Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFSG-----YANVTY 472
TLPL K +A++GP+A+ +G+Y R ++ + G G A V Y
Sbjct: 405 DGTLPLPKTS-KRIAIIGPNADQVYNQLGDYTSPQPRSRVITVLDGIRGKLGKDQAGVLY 463
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAG-----------LDLSVEA--------- 512
GC + +S A A D +++ G +DL A
Sbjct: 464 APGC-RIKGESREGFENALACAAEVDTVVMVVGGSSARDFGEGTIDLKTGASKVSDHDWN 522
Query: 513 -----ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
E +DR L L G Q QL+ +V + K LV++ G IA + AI+
Sbjct: 523 DMESGEGIDRMTLGLAGVQLQLMQEVYRLGKE---LVVVYMNGRPIAEPWVEEHAHAIVE 579
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
A YPG+EGG AIAD++FG NP GRL ++ +V LP+ R G+ Y
Sbjct: 580 AWYPGQEGGHAIADILFGDVNPSGRLTLSIPK--HVGQLPVYYNGKRS------RGKRYL 631
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
+ YPFGYGLSYT F Y L+ + N
Sbjct: 632 EDDAEPRYPFGYGLSYTTFSYERLTLS-------------------------------AN 660
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+R D+ VD N G +G++VV +Y I+++ GF +V ++ G + +
Sbjct: 661 SIRADESVTVTVDVTNTGEREGAEVVQLYISDTVSSVTRPIRELKGFCKVVLKPGETRTV 720
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+FV + K L + +++ AG +I VG
Sbjct: 721 EFVVGSDK-LQYIGRDLKSVVEAGRFSIEVGR 751
>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 866
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 157/440 (35%), Positives = 237/440 (53%), Gaps = 30/440 (6%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R +DL SR+TL+EK + + + + +PRLG+PQ+EWWSEALHG++ G AT
Sbjct: 35 RAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG----------FAT 84
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP AS+++ L ++ A S EA A NL R G++ W+PNIN+ RDP
Sbjct: 85 VFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIKRYQGVSIWTPNINIFRDP 144
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDV 233
RWGR ET GEDP++ R + V GLQ + RP K +C KHYA +
Sbjct: 145 RWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRYYKTLACAKHYAVHSG 204
Query: 234 DNWKGVDRYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
W +R+ FD R+ E+D+ ET+L F+ V+EG+ VMC+Y R++G P C + + L
Sbjct: 205 PEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAYQRIDGSPCCGNTRYL 261
Query: 293 NQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
+Q +RGEW +G +V+DC +I + H + ++ +A A ++AG D++CG Y
Sbjct: 262 HQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVRAGTDVECGAVYATLP 321
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAA 409
AV+QG + ID S+ L +G FD + G + I S+ + LA + A
Sbjct: 322 -RAVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPWKLTGPEVIASETHRRLALDMA 380
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SGYA 468
RE + LL+N LPL+ ++ +AV+GP+AN +V + GNY G P + + G S
Sbjct: 381 RESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWGNYTGYPISTTTILKGIRSKVP 439
Query: 469 NVTYKTGCDDVACKSNNSIF 488
+ GC + + S F
Sbjct: 440 AARFVEGCGYIRNEIRQSHF 459
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 144/315 (45%), Gaps = 68/315 (21%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
D+A KS + + A AD + + G+ +E E + DR + LP Q
Sbjct: 592 DIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQR 651
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
++I + + K ++V ++ G +A A+L A Y GE GG+A+ADV+FG +
Sbjct: 652 EVIRLLRQAGK---LVVFVNCSGGAVALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDY 708
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQ 645
NP G+LP+T+Y D +P D L Y GRTY+++ G L+PFG+GLSYT
Sbjct: 709 NPSGKLPVTFYKSD-------ADLP----DFLDYRMTGRTYRYFRGTPLFPFGFGLSYTS 757
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F + K ++ + Y V+ N G
Sbjct: 758 FAF-------------GKPRYENGMLY--------------------------VEVTNTG 778
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
DG++VV VY K PA+ A +K + GF R+ ++AG +R++ + D AN
Sbjct: 779 KRDGAEVVQVYVKNPAD-ADGPVKTLRGFARIDLKAGERRRVEIAMPR-ERFEGWDATAN 836
Query: 766 TL-LPAGEHTIFVGN 779
T+ + G H + VG+
Sbjct: 837 TMRVKPGNHLLMVGS 851
>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
Length = 866
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 157/440 (35%), Positives = 237/440 (53%), Gaps = 30/440 (6%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R +DL SR+TL+EK + + + + +PRLG+PQ+EWWSEALHG++ G AT
Sbjct: 35 RAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG----------FAT 84
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP AS+++ L + A S EA A NL R G++ W+PNIN+ RDP
Sbjct: 85 VFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLARKSGDIKRYQGVSIWTPNINIFRDP 144
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDV 233
RWGR ET GEDP++ R + V GLQ + RP K +C KHYA +
Sbjct: 145 RWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRYYKTLACAKHYAVHSG 204
Query: 234 DNWKGVDRYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
W +R+ FD R+ E+D+ ET+L F+ V+EG+ VMC+Y R++G P C + + L
Sbjct: 205 PEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAYQRIDGSPCCGNTRYL 261
Query: 293 NQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
+Q +RGEW+ +G +V+DC +I + H + ++ +A A ++AG D++CG Y
Sbjct: 262 HQILRGEWEYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVRAGTDVECGAVYATLP 321
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAA 409
AV+QG + ID S+ L +G FD + G + I S+ + LA + A
Sbjct: 322 -RAVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPWKLTGPEVIASETHRRLALDMA 380
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SGYA 468
RE + LL+N LPL+ ++ +AV+GP+AN +V + GNY G P + + G S
Sbjct: 381 RESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWGNYTGYPISTTTILKGIRSKVP 439
Query: 469 NVTYKTGCDDVACKSNNSIF 488
+ GC + + S F
Sbjct: 440 AARFVEGCGYIRNEIRQSHF 459
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 88/315 (27%), Positives = 140/315 (44%), Gaps = 68/315 (21%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
D+A KS + + A AD + + G+ +E E + DR + LP Q
Sbjct: 592 DIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQR 651
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
++I + + K ++V ++ G +A A+L A Y GE GG+A+ADV+FG +
Sbjct: 652 EVIRLLRQAGK---LVVFVNCSGGAVALVPETEACDAVLQAWYAGEAGGQAVADVLFGDY 708
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQ 645
NP G+LP+T+Y D +P D L Y GRTY+++ G L+PFG+GLSYT
Sbjct: 709 NPSGKLPVTFYKSD-------ADLP----DFLDYRMTGRTYRYFRGIPLFPFGFGLSYTS 757
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F + + + V+ N G
Sbjct: 758 FAFGKPRYENG---------------------------------------KLYVEVTNTG 778
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
DG++VV VY K PA+ A +K + GF R+ ++AG +R++ + D N
Sbjct: 779 KRDGAEVVQVYVKNPAD-ADGPVKTLRGFARIDLKAGERRRVEIAMPR-ERFEGWDATTN 836
Query: 766 TL-LPAGEHTIFVGN 779
T+ + G H + VG+
Sbjct: 837 TMRVKPGNHLLMVGS 851
>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
Length = 875
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 159/425 (37%), Positives = 237/425 (55%), Gaps = 38/425 (8%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
F F ++ L + RV++LVS++TL+EKV Q+ + A +PRLG+P Y+WW+E LHGV+
Sbjct: 27 FPFQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNETLHGVAR--- 83
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGRA-----GL 162
T F T FP I A+F+++ K+ + E RA+YN L R GL
Sbjct: 84 -TPFK-----TTVFPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELNRTKERYLGL 137
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
TYW+PNIN+ RDPRWGR ET GEDP++ +V+GLQ + + LK +
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDD---------PKYLKAA 188
Query: 223 SCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
+C KHYA + G + R+ FD VT ++ +T+L F+ V + VMC+YN
Sbjct: 189 ACAKHYAVHS-----GPESLRHTFDVDVTPYELWDTYLPAFKKLVTNSKVAGVMCAYNAF 243
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
P CA L+N +R +W GY+ +DC +I NHK D+ + L G D
Sbjct: 244 RTQPCCASDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHPDAASASADAVLH-GTD 302
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICS 398
+DCG AV+ G++ E ID S+K L+ + RLG FD +Y + S
Sbjct: 303 IDCGTDAYKSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMFDPVSMVKYAQTPSSVLES 362
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
+E+ E A + AR+ IVLLKN++NTLPL S K+K + V+GP+A+ +++++GNY G P +
Sbjct: 363 EEHKEHALKMARQSIVLLKNEKNTLPL-SKKLKKIVVLGPNADNSISILGNYNGTPSKLT 421
Query: 459 SPIAG 463
+ + G
Sbjct: 422 TVLQG 426
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 99/318 (31%), Positives = 144/318 (45%), Gaps = 59/318 (18%)
Query: 475 GCDDVACKSNNSI---FA-ASEAAKTADATIILAGLDLSVEAESL----------DREDL 520
G +VA ++ N I FA E K ADA I G+ +E E + DR +
Sbjct: 580 GKAEVALQTGNFIKTDFANLIERHKNADAFIFAGGISPQLEGEEMPVDAPGFNGGDRTSI 639
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
LP QT+L+ + K PV+ +IM+ G IA NI AIL Y G+ G A A
Sbjct: 640 LLPEVQTRLLKALQSSGK-PVVFLIMT--GSAIAVPWEAENIPAILNIWYGGQSAGTASA 696
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
DV+FG +NP GRLP+T+Y GD L+S +D+ +TY+++ G LY FGYG
Sbjct: 697 DVIFGDYNPAGRLPVTFYKGDS----DLSSFVDYKMDN-----KTYRYFKGIPLYGFGYG 747
Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
LSYT+FKY+ L T D K P + V
Sbjct: 748 LSYTEFKYSGLK--------------------TPDKIKKGQPVTI------------SVK 775
Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
N G +G +V +Y P + +K + GF+R ++ G++ + F + + L+ V
Sbjct: 776 VTNTGKMEGEEVAQLYLINPNTSIKSPLKSLKGFERFNLKPGQSTVVNFTL-SPEDLSYV 834
Query: 761 DYAANTLLPAGEHTIFVG 778
+ N G+ I VG
Sbjct: 835 TESGNLKPYEGKIQIAVG 852
>gi|398386387|ref|ZP_10544389.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
gi|397718418|gb|EJK79007.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
Length = 791
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 220/733 (30%), Positives = 338/733 (46%), Gaps = 101/733 (13%)
Query: 78 VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNE 137
V L +A RLG+P + E LHG + VG ATSFP I +S++
Sbjct: 126 VNALQKWAMTETRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDP 173
Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
++ +++ Q + E RA R SP +++ARDPRWGRI ET GEDP++VG V
Sbjct: 174 TMLRQVNQVIGREIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVA 228
Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEET 256
V GLQ EG RP V + KH + ++ V A V+E+++ E
Sbjct: 229 AVEGLQG-EGRSRLL----RPGHVFATLKHLTGHGQPESGTNVG----PAPVSERELREN 279
Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
F PFE VK +VM SYN ++G+PS A+ LL+ +R EW G +V+D ++ +
Sbjct: 280 FFPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQL 339
Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTV 375
+ H +A + E+A + L AG+D D + + T G V++GKV E +D +++ + +
Sbjct: 340 MSIH-HIAANLEEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLEL 398
Query: 376 LMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
R G F+ + +DE LA AA+ I LLKND LPL T+AV
Sbjct: 399 KFRAGLFENPYADANAAAAITNNDEARALARTAAQRSITLLKND-GMLPLKPE--GTIAV 455
Query: 436 VGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGC---------DDVACK 482
+GP +A VA +G Y G P +S + G AN+ + G +D K
Sbjct: 456 IGP--SAAVARLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITENDDWWEDKVVK 513
Query: 483 SNNS-----IFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLIN 531
S+ + I A EAA+ D I+ G E DR L L G Q +L +
Sbjct: 514 SDPAENRKLIAQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFD 573
Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
+ + K P+ +V+++ G + + + AIL Y GE+GG A+AD++FG NPGG
Sbjct: 574 ALKALGK-PITVVLIN--GRPASTVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGG 630
Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
+LP+T V LP+ ++P R Y F LYPFG+GLSYT F +
Sbjct: 631 KLPVTVPRS--VGQLPMF-YNMKPSAR-----RGYLFDTTDPLYPFGFGLSYTNFSLS-- 680
Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
P + + VD +N G+ +G +
Sbjct: 681 -----------------------------APRLSATKIGTGGKTSVSVDVRNTGAREGDE 711
Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAG 771
VV +Y + +K++ GFQRV ++ G ++ + F ++L + + ++ G
Sbjct: 712 VVQLYIRDKVSSVTRPVKELKGFQRVTLKPGESRTVTFTV-GPEALQMWNDQMRRVVEPG 770
Query: 772 EHTIFVGNGGVSF 784
+ I GN V+
Sbjct: 771 DFEIMTGNSSVAL 783
>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
Length = 866
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 157/440 (35%), Positives = 237/440 (53%), Gaps = 30/440 (6%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R +DL SR+TL+EK + + + + +PRLG+PQ+EWWSEALHG++ G AT
Sbjct: 35 RAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG----------FAT 84
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP AS+++ L ++ A S EA A NL R G++ W+PNIN+ RDP
Sbjct: 85 VFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIKRYQGVSIWTPNINIFRDP 144
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDV 233
RWGR ET GEDP++ R + V GLQ + RP K +C KHYA +
Sbjct: 145 RWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRYYKTLACAKHYAVHSG 204
Query: 234 DNWKGVDRYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
W +R+ FD R+ E+D+ ET+L F+ V+EG+ VMC+Y R++G P C + + L
Sbjct: 205 PEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAYQRIDGSPCCGNTRYL 261
Query: 293 NQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
+Q +RGEW +G +V+DC +I + H + ++ +A A ++AG D++CG Y
Sbjct: 262 HQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVRAGTDVECGAVYATLP 321
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAA 409
AV+QG + ID S+ L +G FD + G + I S+ + LA + A
Sbjct: 322 -RAVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPWKLTGPEVIASETHRRLALDMA 380
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SGYA 468
RE + LL+N LPL+ ++ +AV+GP+AN +V + GNY G P + + G S
Sbjct: 381 RESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWGNYTGYPISTTTILKGIRSKVP 439
Query: 469 NVTYKTGCDDVACKSNNSIF 488
+ GC + + S F
Sbjct: 440 AARFVEGCGYIRNEIRQSHF 459
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 88/315 (27%), Positives = 140/315 (44%), Gaps = 68/315 (21%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
D+A KS + + A AD + + G+ +E E + DR + LP Q
Sbjct: 592 DIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFNGGDRTSIELPEAQR 651
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
++I + + K ++V ++ G +A A+L A Y GE GG+A+ADV+FG +
Sbjct: 652 EVIRLLRQAGK---LVVFVNCSGGAVALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDY 708
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQ 645
NP G+LP+T+Y D +P D L Y GRTY+++ G L+PFG+GLSYT
Sbjct: 709 NPSGKLPVTFYKSD-------ADLP----DFLDYRMTGRTYRYFRGTPLFPFGFGLSYTS 757
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F + + + V+ N G
Sbjct: 758 FVFGTPRYENG---------------------------------------KLYVEVTNTG 778
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
DG++VV VY K PA+ A +K + GF R+ ++AG +R++ + D N
Sbjct: 779 KRDGAEVVQVYVKNPAD-ADGPVKTLRGFARIDLKAGERRRVEIAMPR-ERFEGWDATTN 836
Query: 766 TL-LPAGEHTIFVGN 779
T+ + G H + VG+
Sbjct: 837 TMRVKPGNHLLMVGS 851
>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
ATCC 35937]
Length = 391
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 165/397 (41%), Positives = 223/397 (56%), Gaps = 38/397 (9%)
Query: 45 LGLQMSSFLFC---DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
LGL + F D S R LV++M+ DEKV Q + A +PRL +P YEWWSE
Sbjct: 13 LGLCLPCIAFAAPADRSGTPEQRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSE 72
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--- 158
LHG++ G AT FP I AS+N +L +++G VSTEARA +N
Sbjct: 73 GLHGIARNG----------YATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGP 122
Query: 159 ------RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
AGLT WSPNIN+ RDPRWGR ET GEDPF+ G+ AV ++RGLQ
Sbjct: 123 GKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GD 175
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
DLN P +++ KH A V + R+ FD V+ +DME T+ F + +G A S
Sbjct: 176 DLN-HPRTIATP-KHIA---VHSGPEPGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWS 230
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
VMC+YN ++G P+CA LLN VRG+W G++V+DCD++ M H F D+ + A
Sbjct: 231 VMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 290
Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVS 390
LKAG DL+CG Y G A+++G+V E +D+SL L+ RLG + + Y
Sbjct: 291 -ALKAGHDLNCGHAYREL-GTAIERGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYAR 348
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
LG +D+ + + LA +AA E IVLLKN TLPL +
Sbjct: 349 LGAKDVDNAAHRALALQAAAESIVLLKNTATTLPLKA 385
>gi|393788557|ref|ZP_10376684.1| hypothetical protein HMPREF1068_02964 [Bacteroides nordii
CL02T12C05]
gi|392654237|gb|EIY47885.1| hypothetical protein HMPREF1068_02964 [Bacteroides nordii
CL02T12C05]
Length = 859
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 224/819 (27%), Positives = 362/819 (44%), Gaps = 146/819 (17%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD------ 83
S S + C S + + + +LP RV DL+SRMTL+EKV Q+
Sbjct: 6 SISRLLFCSSLFLSGISVFAQELPYKQPNLPIEERVNDLLSRMTLEEKVAQIRHIHSWNI 65
Query: 84 ---------------------FAHGVP---------------------RLGLPQYEWWSE 101
F G P RLG+P + +E
Sbjct: 66 FNGQTLDTEKLKAFSKGMSWGFVEGFPLTGANCRKNMQLVQKFMVENTRLGIPVFTV-AE 124
Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE--ARAMYNLGR 159
+LHG V G+ +P + ++F+ L + ++ + A+ M+ +
Sbjct: 125 SLHG-----------SVHEGSVIYPQNVALGSTFSPELAYRKAAMITKDLHAQGMHQV-- 171
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
+P I+V RD RWGR+ E+ GEDP + G + + V+G D N
Sbjct: 172 -----LAPCIDVVRDLRWGRVEESFGEDPILCGLFGIAEVKGYMD-----NG-------- 213
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+S KHY + + G++ + + +D+ E +L+PFEM ++ +VM +YN
Sbjct: 214 -ISPMLKHYGPHG-NPLSGLNLASVECGL--RDLHEVYLKPFEMVIRNTSVLAVMSTYNS 269
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
N IP+ A LL + +R ++ GY+ +D +I+++ H + A + E+A Q AGL
Sbjct: 270 WNRIPNSASHYLLTEVLRNQFGFKGYVYSDWGAIEMLKTLH-YTAHNSEEAAMQAFTAGL 328
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSD 399
D++ + +++GK+ E +++S++ + V ++G F+ P +
Sbjct: 329 DVEASSNCYPLLADLIKEGKLDEEILNESVRRVLYVKFKMGLFE-DPYGEQYAHCKMHPQ 387
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY-- 457
E ++L+ E A E +VLLKN+ LPLN+ K+++VAV+GP NA G+Y
Sbjct: 388 EGVQLSKEIADESVVLLKNENGLLPLNAEKLRSVAVIGP--NADQVQFGDYTWSRNNKDG 445
Query: 458 MSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG-------- 505
M+P+AG V Y+ GC V+ + + I A E A+ ++ I+ G
Sbjct: 446 MTPLAGIRQLLGDKVTVRYEKGCSLVSLDT-SGIKKAVEVARQSEVAIVFCGSASAALAR 504
Query: 506 -LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
S E D DL L G Q+QLI +V E PV+LV+++ I++ + +I A
Sbjct: 505 DYKSSTCGEGFDLNDLNLTGAQSQLIKEVYETGT-PVVLVLVTGKPFTISWEK--KHIPA 561
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSL 619
IL Y GE+ G +IAD++FGK +P GRL ++ Y LP + S
Sbjct: 562 ILTQWYAGEQAGNSIADILFGKISPSGRLTFSFPQSTGHLPVYYDYLPSDKGFYKNPGSY 621
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
PGR Y F + L+ FG+GL+YT F Y + K +H
Sbjct: 622 ETPGRDYVFSSPDPLWAFGHGLTYTSFVYKSMETDK---------EH------------- 659
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
D KVD +N G DG +VV +Y + T +KQ+ F++V V
Sbjct: 660 ---------YDPTDTIYVKVDIKNTGKRDGKEVVQLYVRDKVSTVVTPVKQLRDFEKVLV 710
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
AG + ++ A K L IVD ++ GE + VG
Sbjct: 711 EAGSTRTVRLKV-AVKDLYIVDAGDRRIVEPGEFELQVG 748
>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
Length = 894
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 166/468 (35%), Positives = 245/468 (52%), Gaps = 48/468 (10%)
Query: 39 PGRFSKLGLQM-SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYE 97
P F++ Q S+ + + SLP +R +DLVSRMTL EK QL + A +PRL +P Y
Sbjct: 23 PSAFAQSQTQSPSTPAYLNPSLPPVVRARDLVSRMTLKEKASQLVNAARAIPRLKVPAYN 82
Query: 98 WWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
WWSEALHGV+ + G T FP I A+F+ ++ + TE R +Y
Sbjct: 83 WWSEALHGVA-----------VNGTTEFPEPIGLGATFDVPAIHEMAVDIGTEGRVVYEE 131
Query: 158 GRA--------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
GL +W+PN+N+ RDPRWGR ET GEDPF+ G+ V +V G+Q
Sbjct: 132 NEKDGSSKIFHGLDFWAPNLNIFRDPRWGRGQETYGEDPFLTGKMGVAFVSGMQGD---- 187
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
N + +V + KH+ DV + R+ D V+ D +T+ F + +G
Sbjct: 188 -----NPKYYRVIATPKHF---DVHSGPEPTRHFADVDVSLHDQLDTYEPAFRAAIMQGH 239
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVMCSYN +NG P+CA+ L +RG W GY+V+DCD++ + HK+ +
Sbjct: 240 ADSVMCSYNAINGQPACANQFTLQHQLRGAWGFKGYVVSDCDAVHDIYSGHKYRP-TLAQ 298
Query: 330 AVAQTLKAGLDLDCGQY--------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
A A +++ G+D DC + Y + +AVQQG + + +D +L L+T ++LG
Sbjct: 299 AAAISMERGMDNDCADFAQPKGDDDYKAYI-DAVQQGYLSQQAMDTALVRLFTARIKLGL 357
Query: 382 FD--GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
FD G Y ++ S + A + A E +VLLKND TLPL V ++AVVGP
Sbjct: 358 FDPKGMDPYADTPHSELNSPAHRAYARKLADESMVLLKND-GTLPLKPGSVHSIAVVGPL 416
Query: 440 ANATVAMIGNYAGIPCRYMSPIAGFSG-YAN--VTYKTGCDDVACKSN 484
A+ T ++GNY G+P +S + G Y N +TY G ++ +N
Sbjct: 417 ADQTAVLLGNYNGVPTHTVSFLEGLRAEYPNTKITYVPGTQFLSDTAN 464
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 135/290 (46%), Gaps = 55/290 (18%)
Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
I + G+ +E E + DR +L +P + L+ VA+ K PV++V+M+
Sbjct: 627 IAVVGITSKLEGEEMPVDQPGFLGGDRTNLQMPEPEEALVEAVAKTGK-PVVVVLMNGSA 685
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
+ + + + N A+L A Y GEEGG AIAD + GK +P GRLP+T+Y V LP
Sbjct: 686 LAVNWISQHAN--AVLEAWYSGEEGGAAIADTLSGKNDPAGRLPVTFYKS--VNQLPN-- 739
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
+ RTY+++ G LYPFGYGLSYT F+Y+ LS
Sbjct: 740 -----FEDYSMENRTYRYFKGKPLYPFGYGLSYTTFRYSDLSIPHA-------------- 780
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
T DA + E N G G +VV +Y K P A I
Sbjct: 781 --TVDAGQP---------------VEASATVTNTGKVAGDEVVQLYLKFPKVDGAPDIA- 822
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
+ GFQR+ + G+++++ F + L++V ++ G++T+ +G G
Sbjct: 823 LRGFQRIHLEPGQSQQVHFELKK-RDLSMVTALGQIIVAQGDYTLSIGGG 871
>gi|329957143|ref|ZP_08297710.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523411|gb|EGF50510.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 803
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 212/724 (29%), Positives = 331/724 (45%), Gaps = 113/724 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P ++ +E +HG+++ AT P I +++N+ L ++ G
Sbjct: 142 RLGIP-VDFTNEGIHGLNHTK-----------ATPLPAPIAIGSTWNKELVRRAGVIAGQ 189
Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EA+A+ G T ++P ++V RDPRWGR E GE+PF++ V G+Q +G
Sbjct: 190 EAKAL------GYTNVYAPILDVVRDPRWGRTLECYGEEPFLIAALGTEMVNGIQS-QG- 241
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
V++ KHYA Y V D V +++ E FL PF+ ++
Sbjct: 242 ------------VAATLKHYAVYSVPKGGRDGHCRTDPHVAPRELHELFLYPFKKVIQNS 289
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
VM SYN +G+P A L + +R E+ GY+V+D +++ + H +AD+ +
Sbjct: 290 HPMGVMSSYNDWDGVPVSASYYFLTELLREEYGFDGYVVSDSQAVEFVESKH-HVADTYD 348
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNA---------VQQGKVKETDIDKSLKYLYTVLMRL 379
+AV Q L+AGL++ T+FT + +++ K+ IDK + + V RL
Sbjct: 349 EAVRQVLEAGLNV-----RTHFTPPSDFILPIRRLLEEKKISMATIDKRVSEVLRVKFRL 403
Query: 380 GFFDGSPQYVSLGKQDIC--SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG 437
G FD P G D +D N++ E ++ +VLLKN+ N LPL+ ++K V V G
Sbjct: 404 GLFD-RPYVTDTGAADNVGGADRNMDFVKEMQQQALVLLKNENNILPLDKQRIKKVLVTG 462
Query: 438 PHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDVAC------------ 481
P A+ M Y ++ +AG Y A V Y GCD V
Sbjct: 463 PLADEDNFMTSRYGPNGLETVTVLAGLRAYLQGVAEVDYAKGCDIVDAGWPATEILPVPM 522
Query: 482 --KSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
+ I A A +D I + G D ES R L LPG Q QL+ + K
Sbjct: 523 NEREKRGIAEAVAKAGESDVVIAVLGEDEYRTGESRSRTSLDLPGRQQQLLEALHATGK- 581
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PVILV+++ + + +A N I AIL + +PG +GG IA+ +FG+ NPGG+L +T+
Sbjct: 582 PVILVLINGQPLTVNWA--NAYIPAILESWFPGCQGGTVIAETLFGEHNPGGKLTVTFPK 639
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT-----LYPFGYGLSYTQFKYNLLSFT 654
V + L + P +P S G ++ +G T LYPFG+GLSYT F Y+ L +
Sbjct: 640 S--VGQIEL-NFPFKP-GSHGSQPKSGPNGSGATRVIGELYPFGFGLSYTTFAYSDLEVS 695
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
Q R + KV+ N G G +VV
Sbjct: 696 PLRQ-------------------------------RTQGEYTVKVNVTNTGKRAGDEVVQ 724
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
+Y + TY Q+ GF+RV ++ G +++ F + L I+D N + GE
Sbjct: 725 LYVRDKVSSVITYDSQLRGFERVSLKPGETRQVTFSLKP-EDLQILDRNMNWTVEPGEFE 783
Query: 775 IFVG 778
+ +G
Sbjct: 784 VMIG 787
>gi|440747308|ref|ZP_20926567.1| Periplasmic beta-glucosidase [Mariniradius saccharolyticus AK6]
gi|436484228|gb|ELP40232.1| Periplasmic beta-glucosidase [Mariniradius saccharolyticus AK6]
Length = 763
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 227/785 (28%), Positives = 361/785 (45%), Gaps = 131/785 (16%)
Query: 60 PYSIRVKDLVSRMTLDEKVQQL-----GDFAHG------------------------VPR 90
P+ RV +++ MTL+EK+ QL GDF G V +
Sbjct: 30 PFRDRVDSVMALMTLEEKIGQLNLPAAGDFTTGQASSSNIAEKIKAGLVGGLFNIKSVAK 89
Query: 91 LGLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVST 149
+ Q E+ G+ P DVI G T FP I + S++ +L +K + +
Sbjct: 90 IRDVQRVAVEESRLGI----PLIFAMDVIHGYETVFPIPIGMSCSWDMALMEKSARIAAQ 145
Query: 150 EARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EA A G+ + +SP +++RDPRWGR++E GEDP++ + A ++G Q +
Sbjct: 146 EASA------DGINWTFSPMTDISRDPRWGRMSEGSGEDPYLGAQIAKAMIKGYQGDDLS 199
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
N T L +C KH+A Y G D D ++ Q M + P++ ++ G
Sbjct: 200 LNNTIL--------ACVKHFALYGAPE-AGRDYNTVD--MSRQRMFNEYFLPYQAAIEAG 248
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
SVM S+N V+GIP+ A+ L+ + +R W G++V D +I M D+ L D ++
Sbjct: 249 -VGSVMTSFNDVDGIPASANKWLMTEVLRERWGFEGFVVTDYTAINEMTDHG--LGDLQQ 305
Query: 329 DAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
A + AG+D+D G+ + +V++GKV E +ID + + + T +LG FD +
Sbjct: 306 -VSALAMNAGVDMDMVGEGFLTTLKKSVEEGKVSEAEIDAACRRILTAKFKLGLFDDPYR 364
Query: 388 Y--VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
Y V K++I SD + ++A E A + VLLKN+ TLPL K T+A+VGP A+ T
Sbjct: 365 YCDVERAKREIFSDAHRKVAREIATQTFVLLKNENQTLPLK--KEGTIALVGPMADNTEN 422
Query: 446 MIGNYAGIPCRYMSPIAGFSGYAN-------VTYKTGCD---DVACKSNNSIFA------ 489
M G ++ + R+ + I+ G N + Y G + D +S SIF
Sbjct: 423 MTGTWS-VAARFENSISLRKGLENALGDRAKIVYAKGSNIYPDSLLESRVSIFGKPTYRD 481
Query: 490 ----------ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
A +AA+ A+ + G + ES R D+ +P Q L+ + + K
Sbjct: 482 NRPAQVLIQEALQAARNANVIVAAMGESAEMSGESSSRTDIEIPENQRALLEALLKTGK- 540
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV+LV+ + G +A N+ AIL + G E G AIADV+FG NP G+L T+
Sbjct: 541 PVVLVLFT--GRPLAIKWEQENLHAILNVWFAGSEAGHAIADVLFGDVNPSGKLSATFPQ 598
Query: 600 GDYVQMLPL------TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
V +P+ T PL Y + LYPFG+GLSYT F+Y +
Sbjct: 599 N--VGQVPIYYNHKSTGRPLAAGQWFQKFRTNYLDVSNDPLYPFGFGLSYTDFEYGEIKL 656
Query: 654 TKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVV 713
+K+ +L D+ +D +N G DG++VV
Sbjct: 657 SKS-------------------------------ELVGDERIRVSIDVKNAGGVDGAEVV 685
Query: 714 IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
+Y + +K++ GF++VF++AG +K ++F + L + N + GE
Sbjct: 686 QLYVRDIVASMTRPVKELKGFEKVFLKAGESKTVRFELGQ-EQLKFYNNDLNFIFEPGEF 744
Query: 774 TIFVG 778
I VG
Sbjct: 745 EIMVG 749
>gi|294673871|ref|YP_003574487.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294474367|gb|ADE83756.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 782
Score = 273 bits (697), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 220/728 (30%), Positives = 334/728 (45%), Gaps = 129/728 (17%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G T FPT A++N +L +K G+ +
Sbjct: 130 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGFGMAATWNPALIEKTGEVIGQ 177
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R + G + P +++AR+PRW R+ ET GEDP + G V+GL
Sbjct: 178 EIRL-----QGGHISYGPVLDLAREPRWSRVEETMGEDPVLAGELGAAMVKGL------- 225
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT---EQDMEETFLRPFEMCVK 266
+ S+P + KH+ Y G + +T ++++E+FL PF+ +
Sbjct: 226 -GGGILSKPYSTIATLKHFIGY------GTTEAGQNGGITIAGARELQESFLPPFKKAIN 278
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
G A SVM SYN ++GIPS LL +R +W +G++V+D SI + H+ +A++
Sbjct: 279 AG-ALSVMTSYNSLDGIPSTCSKALLTDLLRTQWGFNGFVVSDLYSIDGIHGTHR-VAET 336
Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
K+ A LKAG+D D G +AVQ+G V E +ID ++K + + +G F+
Sbjct: 337 KQQAGVMALKAGVDADLGALAFGRLEDAVQKGMVTEAEIDVAVKRILKMKFEMGLFEHPY 396
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ KQ + SD N +A + ARE I LLKN + LPL +K + V V GP+A+ M
Sbjct: 397 VDAAQAKQLVRSDNNKAVALQVAREIITLLKNQNHVLPL--SKTQKVLVCGPNADNVYNM 454
Query: 447 IGNYA-----GIPCRYMSPIAGFSGYANVTYKTGC---DDVA------------------ 480
+G+Y G ++ I + VTY GC D A
Sbjct: 455 LGDYTAPQEEGNVKTILAGIRSKLPASQVTYVKGCAVRDTTASNIAEAVAAAKQADVVVV 514
Query: 481 ------CKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVA 534
+ + + + AA T TI + +D E DR L G+Q QL+ +
Sbjct: 515 AVGGSSARDFKTSYKETGAAVTDSKTI--SDMDC---GEGFDRATLTPLGHQMQLLKALK 569
Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
+ K P+++V + +D ++A + + A+L A YPG+EGG AIADV+FG +NP GRLP
Sbjct: 570 AIGK-PLVVVYIEGRPMDKSWAAQHAD--ALLTAYYPGQEGGTAIADVLFGDYNPAGRLP 626
Query: 595 ITWYNGDYVQMLPL--TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
++ V +P+ P P D + R LY FGYGLSYT FKY+ L
Sbjct: 627 VSVPAN--VGQIPVYYNKKPPMPHDYVEMSAR--------PLYAFGYGLSYTTFKYDDL- 675
Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGS 710
N+ T D +FKV F N G DG
Sbjct: 676 ----------------NIEETGDT-------------------QFKVTFNVTNTGDMDGD 700
Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA 770
+VV +Y A + Q+ F R+F+ G K++ F A + L IVD N ++
Sbjct: 701 EVVQLYLHDEFASTAQPMMQLKKFSRIFIPKGETKQVSFTLEA-EDLEIVDQEMNHVVET 759
Query: 771 GEHTIFVG 778
G+ T+ +G
Sbjct: 760 GDFTVMIG 767
>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 763
Score = 273 bits (697), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 222/735 (30%), Positives = 351/735 (47%), Gaps = 108/735 (14%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
E V + +A RLG+P + E HG +G AT FP + +++
Sbjct: 90 EAVNAIQRYAMEHSRLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTW 137
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
N L++ I +AV+ E RA + G +SP ++V RDPRWGR ET GEDP +V +A
Sbjct: 138 NTELFRSISRAVAAETRA-----QGGAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFA 192
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGVDRYHFDARVTEQDME 254
V V+GLQ E ++ T L + KH+A Y + + H R ++
Sbjct: 193 VAAVQGLQG-ERLDSHTSL-------LATLKHFAGYGASEGGRNGAPVHMGLR----ELH 240
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
E L PF V+ G A S+M +YN ++G+P + LL +R W G+++ DC +I
Sbjct: 241 EVDLLPFRKAVESG-ALSIMTAYNEIDGVPCTSSRYLLQNVLREAWGFDGFVITDCGAIH 299
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
++ H A S +A Q+LKAG+D++ G + A++QG + E D++++ +
Sbjct: 300 MLACGHN-TAGSGVEAATQSLKAGVDMEMSGTMFRAHLQQALEQGLITEDDLNRAAGRVL 358
Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
+ RLG FD + +Q I E+I LA +AA EGIVLLKN+ N LPL+S+ T+
Sbjct: 359 ELKFRLGLFDRPYVDPAWAEQVIGCKEHIALAYQAAAEGIVLLKNEGNLLPLDSSS-GTI 417
Query: 434 AVVGPHANATVAMIGNYAG--IPCRYMSPIAGFS---GYANVTYKTGCDDVACKSNNSIF 488
AV+GP+A+ +G+Y P + ++ + G G + V Y GC + S
Sbjct: 418 AVIGPNAHTPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVLYAPGC-RIQGDSREGFP 476
Query: 489 AASEAAKTADATIILAG-----------LDLSVEA--------------ESLDREDLWLP 523
A A+ AD +++ G +DL A E +DR L L
Sbjct: 477 RALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGDAKSDMECGEGIDRSTLTLM 536
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q +L+ ++ ++ K PVI+V ++ G I + I AI+ A YPG+EGG AIAD++
Sbjct: 537 GVQLELLQELQKLGK-PVIVVYIN--GRPITEPWIDEFIPAIIEAWYPGQEGGGAIADML 593
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
FG NP GRLP++ V LP++ R G+ Y + YPFG+GLSY
Sbjct: 594 FGDINPSGRLPLSIPK--EVGQLPISYNARR------TRGKRYLETDLAPRYPFGFGLSY 645
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T+F+Y L+ + + +A+ ++D N
Sbjct: 646 TEFRYGRLTVEPAV------------VPIGGEAT-------------------VRIDVTN 674
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G+ DG++VV +Y A K + GF++VF++AG + + F + + L ++
Sbjct: 675 AGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVTFTIGS-EQLELIGLD 733
Query: 764 ANTLLPAGEHTIFVG 778
++ GE I VG
Sbjct: 734 LKPVVEPGEFRIQVG 748
>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 837
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 160/430 (37%), Positives = 246/430 (57%), Gaps = 41/430 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + + P RV+DL+S++T++EKV L + G+ R+G+ +Y +EALHG+ + PG
Sbjct: 13 LYKNMNAPIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGI--IRPG 70
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I + +N L I +S EARA +N G L
Sbjct: 71 KF--------TVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQFSDLL 122
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDP++ G +V+GLQ + R LK
Sbjct: 123 TFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGD---------HPRYLKAV 173
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+ KH+AA + ++ +R++ DA +TE D+ E + FE C++EG A S+M +YN +NG
Sbjct: 174 ATPKHFAANNEEH----NRFYCDAAITETDLREYYFPAFEKCIREGKAESIMTAYNAING 229
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P A+ LLN+ ++ +W +GYIV+DC + +++ +H+++ + E A +KAGLD++
Sbjct: 230 VPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVK-TPEAAAMIAIKAGLDVE 288
Query: 343 CGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y + N NA +Q V +ID + + MRLG FD + Y L + +
Sbjct: 289 CGDYVFANPLLNAYKQYMVSAAEIDSAAYRVLRARMRLGMFDDPEKNPYNHLSPEIVGCK 348
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
++ +LA EAAR+ IVLLKN QNTLPLN+ K+K++AVVG NA G+Y+G P +
Sbjct: 349 KHHDLALEAARQSIVLLKNQQNTLPLNAQKIKSIAVVG--INAANCEFGDYSGTPVN--A 404
Query: 460 PIAGFSGYAN 469
P++ G N
Sbjct: 405 PVSVLDGIRN 414
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 92/289 (31%), Positives = 135/289 (46%), Gaps = 46/289 (15%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
AS+ + +D I + G++ S+E E DR + LP Q I + + P +V++ AG
Sbjct: 581 ASKIIRESDVVIAVMGINQSIEREGQDRNSIELPKDQQIFIREAYKA--NPNTIVVLVAG 638
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + +I AI+ A YPGE+GG AIA+V+FG +NP GRLP+T+YN ++ LP
Sbjct: 639 S-SMAIGWMDQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS--IEDLPAF 695
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
D RTY ++ G LY FGYGLSYT+F Y RN
Sbjct: 696 D------DYNVKNNRTYMYFEGKPLYAFGYGLSYTKFDY-------------------RN 730
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
LN D V +N +N G +G +V VY K P + T +K
Sbjct: 731 LNIKQDTQ-----NVTLN-----------FSIKNSGKYNGDEVAQVYVKFPDQGIKTPLK 774
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
Q+ GF+RV ++ G ++I + D P+G + VG
Sbjct: 775 QLKGFKRVHIKKGATEQISIEIPKEELRLWDDQKKQFYTPSGTYHFMVG 823
>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
UW101]
gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 875
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 158/423 (37%), Positives = 230/423 (54%), Gaps = 34/423 (8%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
F F + SL + RV DLVSR+TL+EKV Q+ + + + RLG+P Y+WW+E LHGV+
Sbjct: 27 FQFQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVARTPF 86
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGRA-----GL 162
T T +P I A+F+++ + + E RA+YN L R GL
Sbjct: 87 KT---------TVYPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVELKRTNERYLGL 137
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
TYW+PNIN+ RDPRWGR ET GEDP++ +V+GLQ + + LK +
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDD---------PKYLKAA 188
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+C KHYA V + R+ FD VT ++ +T+L F + E + + VMC+YN
Sbjct: 189 ACAKHYA---VHSGPESLRHTFDVDVTPYELWDTYLPAFRKLITESNVAGVMCAYNAFRT 245
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
P CA L+N +R EW GY+ +DC +I NHK D+ E A A + G D+D
Sbjct: 246 QPCCASDILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHPDA-ESAAADAVFHGTDID 304
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
CG AV+ GK+ E ID S+K L+ + RLG FD +Y + S E
Sbjct: 305 CGTDAYKALVQAVKNGKISEKQIDISVKRLFMIRFRLGMFDPVSMVKYAQTPSSVLESKE 364
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ A + AR+ IVLLKN++N LPLN +K + V+GP+A+ ++++GNY G P + +
Sbjct: 365 HQLHALKMARQSIVLLKNEKNILPLNK-NLKKIVVLGPNADNAISILGNYNGTPSKLTTV 423
Query: 461 IAG 463
+ G
Sbjct: 424 LQG 426
Score = 109 bits (272), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 86/268 (32%), Positives = 117/268 (43%), Gaps = 54/268 (20%)
Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
E K ADA I G+ +E E + DR + P QT+L+ + K PV
Sbjct: 601 EHHKNADAFIFAGGISPQLEGEEMPVDFPGFKGGDRTSILFPEVQTKLLKALQSSGK-PV 659
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
+ +M+ G IA NI AIL Y G+ G A ADV+FG +NP GRLP+T+Y D
Sbjct: 660 VFAMMT--GSAIAIPWEAENIPAILNIWYGGQSAGTAAADVIFGDYNPAGRLPVTFYKND 717
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
L S +D+ +TY+++ G LY FGYGLSYT FKY+ L V +
Sbjct: 718 S----DLPSFVDYKMDN-----KTYRYFKGTPLYGFGYGLSYTSFKYSDLK----TPVKI 764
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
K Q L V N G T+G +V +Y
Sbjct: 765 KKGQSVSIL----------------------------VKVANTGKTEGEEVAQLYLINQD 796
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKF 749
T +K + GF+R ++ G NK I F
Sbjct: 797 TAIKTPLKSLKGFERFNLKPGENKTITF 824
>gi|218132023|ref|ZP_03460827.1| hypothetical protein BACEGG_03648 [Bacteroides eggerthii DSM 20697]
gi|217985783|gb|EEC52123.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 762
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 219/750 (29%), Positives = 346/750 (46%), Gaps = 124/750 (16%)
Query: 60 PYSIRVKDLVSRMTLDEKVQQLGDFA-------------------HGVP-------RLGL 93
P +RV DL+ RMTL+EK+ Q+ D G+ RL +
Sbjct: 34 PVEVRVADLLKRMTLEEKIAQMQDLKFKDFSVDGKVDTVKMDSVLKGMSYASVFGSRLSV 93
Query: 94 PQYEWWSEALH---------GVSNVGPGTHFDDVI-PGATSFPTVILTTASFNESLWKKI 143
Q + A++ G+ +G +I GAT FP I +++FN + ++
Sbjct: 94 EQMQESMFAINKYMAEHNRLGIPVLGEAESLHGLIHDGATIFPQSIALSSTFNPDITHRV 153
Query: 144 GQAVSTEARAMYNLGRAGL-TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGL 202
++ EA+A G+ SP +++AR+ RWGR+ ET GEDP++VGR V YV
Sbjct: 154 ATVIAQEAKA------TGVDQVLSPVLDLARELRWGRVEETYGEDPYLVGRMGVAYVSAF 207
Query: 203 QDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT--EQDMEETFLRP 260
EG V + KH+ A+ G++ A VT E+D+ +L+P
Sbjct: 208 NK-EG-------------VMTTLKHFLAHGSPT-GGLNL----ASVTGCERDLRSLYLKP 248
Query: 261 FEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNH 320
F+ ++E SVM SYN +P A +L+ +RGE GYI +D S++++ H
Sbjct: 249 FQDVMREAMPYSVMNSYNSYESVPVAASHWILDDILRGEMGFKGYISSDWGSVEMLRSLH 308
Query: 321 KFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
D K DA Q + AG+D++ G Y + V+ G + E +IDK + + T +
Sbjct: 309 HTAKD-KADAACQAVIAGVDVEVDGDCYETLD-SLVRSGVLPEKEIDKCVSRVLTAKFAM 366
Query: 380 GFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
G FD + Q + + E +ELA AARE +L+KN+ + LPL++ K+++VAV+GP
Sbjct: 367 GLFDKDYTKRANLSQTVHTPEAVELALVAARESAILVKNENSLLPLDANKLRSVAVIGP- 425
Query: 440 ANATVAMIGNYAGIPCRY--MSPIAGFS----GYANVTYKTGCDDVACKSNNSIFAASEA 493
NA G+Y ++P+ G G + Y GC ++ + + A A
Sbjct: 426 -NAAQVQFGDYMWTNSNEYGITPLQGIEAVTQGKVKINYAKGC-EIHTQDRSGFSQAVTA 483
Query: 494 AKTADATIILAGL---------DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
A+ +D ++ G SV ES D D+ LPG Q LI V K P I+V
Sbjct: 484 ARNSDVALLFVGAMSGSPGRPWPNSVSGESFDLSDISLPGCQEALIRAVKATGK-PTIVV 542
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD--- 601
+++ I + + N + W Y GE+ GRAIA+++FG+ NP GRL +++
Sbjct: 543 LVAGKPFAIPWVKDNCEAVIVQW--YGGEQEGRAIAEILFGEVNPSGRLNVSFPQSTGHL 600
Query: 602 --YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
+ P +L PGR Y F + ++ FG+GLSYT FKY K++Q+
Sbjct: 601 PVFYNYYPSDKGFYHDHGTLEKPGRDYVFSSPDPVWAFGHGLSYTTFKY------KSMQI 654
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+ N +T DD E V+ N G DG +VV +Y
Sbjct: 655 S--------NKEFTD-----------------DDTCEITVEVANTGKRDGKEVVQLYVND 689
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
T +K++ F++VF+ AG + +KF
Sbjct: 690 IVSSVVTPVKELRRFEKVFIPAGETRTVKF 719
>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
12145]
Length = 866
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 165/434 (38%), Positives = 232/434 (53%), Gaps = 37/434 (8%)
Query: 43 SKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEA 102
S++ Q + F D+ LP+ RV DL+ R+T++EKV + D + + RLG+ QY WW+EA
Sbjct: 15 SQISAQNKLYPFQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEA 74
Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN------ 156
LHGV+ G AT FP I ASF+ + AVS EARA +N
Sbjct: 75 LHGVARAGL----------ATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQG 124
Query: 157 -LGR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
GR GLT W+P IN+ RDPRWGR ET GEDP++ V V+GLQ
Sbjct: 125 SYGRYEGLTMWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPS-------- 176
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD-ARVTEQDMEETFLRPFEMCVKEGDASSV 273
N + K+ +C KH+A + W +R+ FD A + ++D+ ET+L FE VKE V
Sbjct: 177 NGKYDKLHACAKHFAVHSGPEW---NRHSFDAANIKQRDLYETYLPAFEALVKEAKVQEV 233
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAV 331
MC+YNR G P C +LL Q +R +W G +VADC +I HK D+ A
Sbjct: 234 MCAYNRFEGDPCCGSDRLLQQILRKKWGFEGIVVADCGAIADFFKENAHKTHPDAAS-AS 292
Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYV 389
A + +G DLDCG Y T AV++G ++E DID S++ L RLG D +
Sbjct: 293 AAAVYSGTDLDCGSSYKALT-EAVKKGLIEEKDIDVSVRRLLMARFRLGEMDDQSLVPWS 351
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
+ + S + ++A + AR+ I LL+N N LPL S +K +AV+GP+A +V GN
Sbjct: 352 KISYNVVASKAHNQIALDMARKSITLLQNKNNILPLKSGGLK-IAVMGPNAQDSVMQWGN 410
Query: 450 YAGIPCRYMSPIAG 463
Y G P ++ + G
Sbjct: 411 YNGTPANTITILEG 424
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 85/311 (27%), Positives = 139/311 (44%), Gaps = 54/311 (17%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
D+ K +I + + AD + + G+ S+E E + DR D+ LP Q
Sbjct: 584 DIGYKEEANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKLPGFRGGDRTDIQLPTIQR 643
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
Q + + E K ++ ++ G I A+ N +AI+ A YPG+ GG+A+ADV+FGK+
Sbjct: 644 QFVKALKEAGKR---VIFINCSGSPIGLADEMANSEAIVQAWYPGQAGGQAVADVLFGKY 700
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
NP GRLPIT+Y T +P ++ GRTY++ L+PFGYGLSYTQF+
Sbjct: 701 NPSGRLPITFYR-------DTTQLP--DFENYDMAGRTYRYMQDKPLFPFGYGLSYTQFQ 751
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
Y + + N +Q V N G
Sbjct: 752 YGNPILNQQVITNGQTIQ-------------------------------LTVPVTNTGKR 780
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
G +VV VY + + A +K + F+R+ AG+ +++ F + + +
Sbjct: 781 SGDEVVQVYLRKKGD-ATGPVKTLRDFRRLSFNAGQTQQVVFKITPKQLEWWNEQSKAMQ 839
Query: 768 LPAGEHTIFVG 778
+ +G++ + VG
Sbjct: 840 VQSGDYELLVG 850
>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
CL02T12C01]
Length = 863
Score = 272 bits (696), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 167/450 (37%), Positives = 239/450 (53%), Gaps = 38/450 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
F +S LP R +DL+ R+TL EKV + D++ +PRLG+ +Y WW+EALHGV G
Sbjct: 24 FNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYNWWNEALHGVGRAGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
AT FP I A+F++ ++ + VS EARA Y+ GLT+W
Sbjct: 82 --------ATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHHSENKEGSERYQGLTFW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ + + VRGLQ S+ K+ +C
Sbjct: 134 TPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQGPS--------ESKYDKLHACA 185
Query: 226 KHYAAYDVDNWKGVDRYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KHYA + W +R+ FD ++ +D+ ET+L F+ V++G VMC+YNR G P
Sbjct: 186 KHYALHSGPEW---NRHSFDVDSISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEP 242
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
C +LL +R EW G +V+DC +I + H +KE AVA +KAG DLDC
Sbjct: 243 CCGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHPTKEAAVAAAVKAGTDLDC 302
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
G Y AV++G + E ID SL L LG D + + + S+++
Sbjct: 303 GVDYYALQ-KAVEEGIITEKQIDVSLFRLLKARFELGLMDEEHLVSWSDIPYTVVDSEKH 361
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
E A E AR+ + LLKND TLPL S +AV+GP+AN +V M GNY G P ++ +
Sbjct: 362 REKALEMARKSMTLLKNDHGTLPL-SKHCGKIAVIGPNANDSVMMWGNYNGFPSHTVTIL 420
Query: 462 AGFS---GYANVTYKTGCDDVACKSNNSIF 488
G + G + Y GC+ + S+F
Sbjct: 421 EGITHKLGAEQIIYDKGCELTTGDTFVSLF 450
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 139/299 (46%), Gaps = 58/299 (19%)
Query: 493 AAKTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGP 540
AA+ DA +I+ G+ VE E L DR + LP Q L+ ++ + K P
Sbjct: 594 AARVGDAEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK-P 652
Query: 541 VILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNG 600
VIL++ S + ++ AE + AI+ A Y G+ GG A+ADV+FG +NP GRLP+T+Y
Sbjct: 653 VILILCSGSAIGLS-AEVDL-ADAIIQAWYLGQAGGTAVADVLFGDYNPAGRLPVTFYKA 710
Query: 601 DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
+ LP + GRTY+++ G L+PFGYGLSYT F+
Sbjct: 711 --TEQLP-------DFEDYSMQGRTYRYFEGEALFPFGYGLSYTSFEIG----------- 750
Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
+ SK R +R ++ K+ +N G DG +V+ +Y +
Sbjct: 751 ------------KARLSKKR--------IRENESVSLKLTVENTGKLDGDEVIQIYIRKL 790
Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVG 778
+ +K + F+R +RAG K + F N D +NT+ + GE+ I G
Sbjct: 791 QDKEGP-LKTLRAFKRFHLRAGEKKDVTFHLQN-DHFNFFDTESNTMRVMPGEYEILYG 847
>gi|329956938|ref|ZP_08297506.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523695|gb|EGF50787.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 944
Score = 272 bits (695), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 210/719 (29%), Positives = 337/719 (46%), Gaps = 102/719 (14%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P ++ +E + GV + AT+FPT + ++N L +++G
Sbjct: 153 RLGIP-VDFTNEGIRGVESYK-----------ATNFPTQLGLGHTWNRELIRQVGLITGR 200
Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EAR + G T ++P ++V RD RWGR E GE P++V + VRGLQ H
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQ----H 250
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+ +V++ KH+AAY + D +++ +++E + PF+ ++E
Sbjct: 251 NH---------QVAATAKHFAAYSNNKGAREGMSRVDPQMSPREVENIHIYPFKRVIRET 301
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
+M SYN +GIP L +R E GY+V+D D+++ + H D KE
Sbjct: 302 GLLGIMSSYNDYDGIPVQGSYYWLTTRLRQEMGFRGYVVSDSDAVEYLYTKHNTAKDMKE 361
Query: 329 DAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
AV Q+++AGL++ C + V++G + E I+ ++ + V +G FD
Sbjct: 362 -AVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFD- 419
Query: 385 SPQYVSLGKQD--ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
SP L D + N +A +A+RE +VLLKN NTLPLN K+K +AV GP+A+
Sbjct: 420 SPYQTDLAGADNEVEKAANEAVALQASRESVVLLKNADNTLPLNIDKIKKIAVCGPNADE 479
Query: 443 TVAMIGNYAGIPCRYMSPIAGF----SGYANVTYKTGCDDVACKSNNS------------ 486
+ +Y + + + G G A V Y GCD V S
Sbjct: 480 EGYALTHYGPLAVEVTTVLEGIREKAQGKAEVLYTKGCDLVDAHWPESEIIEYPLTPDEQ 539
Query: 487 --IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
I A+ A+ AD +++ G E+ R L LPG+Q +L+ V K PV+LV
Sbjct: 540 AEIDRAAANARQADVAVVVLGGGQRTCGENKSRTSLDLPGHQLKLLQAVQATGK-PVVLV 598
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
+++ + + +A + + AIL A YPG +GG A+AD++FG +NPGG+L +T+ V
Sbjct: 599 LINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAVADILFGDYNPGGKLTVTFPK--TVG 654
Query: 605 MLPLTSMPLRP---VDSLGYPGR--TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
+P + P +P +D PG NG LYPFGYGLSYT F+Y+ L +
Sbjct: 655 QIPF-NFPCKPASQIDGGKNPGADGNMSRING-ALYPFGYGLSYTTFEYSDLEIS----- 707
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
P V+ D + ++ N G G +VV +Y++
Sbjct: 708 ----------------------PKVITPDQKAT----VRLKVTNTGKRAGDEVVQLYTRD 741
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
TY K + GF+R+ ++ G K + F + K L +++ ++ GE I G
Sbjct: 742 ILSSITTYEKNLAGFERIRLKPGETKEVTFTLDR-KHLELLNADMKWIVEPGEFAIMAG 799
>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
DSM 14838]
Length = 942
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 231/810 (28%), Positives = 360/810 (44%), Gaps = 146/810 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D + R++DL+S+MTL+EK Q+ +G R+ LP EW W
Sbjct: 52 VYEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGA 110
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 111 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVES 170
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L ++IG EAR + G T ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQIGLITGREARML------GYTNVYAPILDVGRDQRWG 224
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + VRG+Q H + +V++ KH+ AY +
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKHFVAYSNNKGAR 271
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + PF+ +KE VM SYN +G+P L +RG
Sbjct: 272 EGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRG 331
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E GY+V+D D+++ + H D KE AV Q+++AGL++ C Y
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 390
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
V++G + E I+ ++ + V +G FD +P L D + EN LA +A+RE
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLAGADKEVEKAENESLALQASRES 449
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYA 468
+VLLKN+ N LPL+ VK +AV GP+A+ + +Y + + + G G A
Sbjct: 450 LVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKA 509
Query: 469 NVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAES 514
V Y GCD V S I A E A+ AD +++ G E+
Sbjct: 510 EVLYTKGCDLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGEN 569
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q +L+ V K PV+LV+++ + I +A + + AIL A YPG +
Sbjct: 570 KSRSSLELPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSK 626
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFY 629
GG A+ADV+FG +NPGG+L +T+ V +P + P +P +D PG
Sbjct: 627 GGTAVADVLFGDYNPGGKLTVTFPKS--VGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRV 683
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
NG LY FGYGLSYT F+Y+ + + K I N C+
Sbjct: 684 NG-ALYSFGYGLSYTTFEYSDIEISPKVITPNQKATVRCK-------------------- 722
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
N G G +VV +Y + TY K + GF+R+ ++ G K +
Sbjct: 723 ------------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVV 770
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
F + K L ++D ++ G+ +I +G
Sbjct: 771 FTLDR-KQLELLDKHMEWVVEPGDFSIMIG 799
>gi|29350122|ref|NP_813625.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
[Bacteroides thetaiotaomicron VPI-5482]
gi|29342034|gb|AAO79819.1| periplasmic beta-glucosidase precursor, xylosidase/arabinosidase
[Bacteroides thetaiotaomicron VPI-5482]
Length = 769
Score = 271 bits (694), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 218/719 (30%), Positives = 336/719 (46%), Gaps = 110/719 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G T FPT I A+++ +L +++G ++
Sbjct: 114 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPTLIEEVGNVIAK 161
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R+ + + P ++++RDPRW R+ ET GEDP + GR + GL
Sbjct: 162 EIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMILGL------- 209
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ DL+ +++ KH+ AY V Y A V +D+ E FL PF + G
Sbjct: 210 GSGDLSCEYATIATL-KHFLAYAVPEGGQNGNY---ASVGTRDLHENFLPPFREAIDAG- 264
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++G+P A+ LL Q +R EW G++V+D SI+ + ++H F+A + E+
Sbjct: 265 ALSVMTSYNSIDGVPCTANHYLLTQLLRNEWRFRGFVVSDLYSIEGVHESH-FVAPTIEE 323
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A Q + AG D+D G + N T +AVQ GK+ E ID ++ + + +G F+
Sbjct: 324 AAMQAVSAGADIDLGGDAFMNLT-HAVQFGKISEAVIDTAVCRVLRMKFEIGLFEHPYVN 382
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+ + S ++I+LA + A+ IVLLKN+ + LPLN K+K VAVVGP+A+ M+G
Sbjct: 383 PKTATKIVRSKDHIKLARKVAQSSIVLLKNENSILPLNK-KIKKVAVVGPNADNRYNMLG 441
Query: 449 NYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
+Y I I+ S + V Y GC + + N I A EAA ++ I
Sbjct: 442 DYTAPQEDENIKTVLDGVISKLSP-SKVEYVRGCA-IRDTTVNEIAEAVEAASRSEVIIA 499
Query: 503 LAGLDLSVE-----------------------AESLDREDLWLPGYQTQLINQVAEVAKG 539
+ G + + E DR L L G Q L+ + K
Sbjct: 500 VVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLIALKATGK- 558
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
P+I+V + +D +A + A+L A YPG+EGG AIADV+FG +NP GRLP++
Sbjct: 559 PLIVVYIEGRPLDKVWASEYAD--ALLTASYPGQEGGYAIADVLFGDYNPAGRLPVSIPR 616
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
V +P+ P + Y LY FGYGLSYT F+Y+ L +
Sbjct: 617 S--VGQIPVYYNKKAPRN------HDYVEQAASPLYTFGYGLSYTTFEYSDLQVIR---- 664
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
K+ C +FE +N GS DG +V +Y +
Sbjct: 665 ------------------KSPC------------HFEVSFKVKNTGSYDGEEVAQLYLRD 694
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++Q+ F+R F++ G K I F K L+I+D ++ G+ I +G
Sbjct: 695 EYASVVQPLRQLKCFERFFLKRGEEKEIFFTLTE-KDLSIIDRNMKRVVETGDFRIMIG 752
>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
Length = 888
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 175/507 (34%), Positives = 257/507 (50%), Gaps = 68/507 (13%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+ LP R DLV+RMTL+EK +Q+G A +PRLG+P Y WW+E LHGV+ G
Sbjct: 38 YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAGI-- 95
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---------RAGLTY 164
AT FP I A+++ + + TE RA Y GLT
Sbjct: 96 --------ATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDWYRGLTV 147
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPNIN+ RDPRWGR ET GEDP++ GR V ++RGLQ D N K +
Sbjct: 148 WSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQ-------GQDPNF--FKTIAT 198
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KHYA V + +R+ D + D+E+T+L F V EG +VMC+YN V+G+P
Sbjct: 199 AKHYA---VHSGPESNRHREDVHPSAYDLEDTYLPAFRAAVTEGKVQAVMCAYNAVDGVP 255
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCD-SIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
+CA L++Q +R +W G++V+DC + + ++ + E+ + + L AG+DL C
Sbjct: 256 ACASEDLMDQRLRRDWGFSGHVVSDCGAAANIYREDSLAYVKTPEEGITRALNAGMDLVC 315
Query: 344 GQYYTNF------TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC 397
G Y ++ T +AV++G + ET +D +L L+ +RLG FD P V K
Sbjct: 316 GDYRADWNTEAEATVSAVRKGMLDETVLDGALVRLFADRIRLGLFD-PPAEVPFSKITAA 374
Query: 398 SDENIE---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
++ E ++ E A+ + LLKND LPL + + +AVVGP+A++ A+IGNY G P
Sbjct: 375 QNDTPEHRAMSLEMAKASMTLLKND-GVLPLK-GEPRRIAVVGPNADSVDALIGNYYGTP 432
Query: 455 CRYMSPIAGFSGY---ANVTYKTG----------------CDDVACKS---NNSIF--AA 490
++ +AG A V Y G C D AC++ +F A
Sbjct: 433 SNPVTVLAGIRARFPKAEVVYAEGTGLVGPASLPVPDAVLCADAACRTKGLKQEVFEGVA 492
Query: 491 SEAAKTADATIILAGLDLSVEAESLDR 517
E A T+ A D + + +S R
Sbjct: 493 LEGAPVETRTVANATFDWTGDRQSSAR 519
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 131/293 (44%), Gaps = 55/293 (18%)
Query: 498 DATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
D + + GL VE E + DR L LP Q L+ ++ K PV+LV+M+
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRLHATGK-PVVLVLMN 671
Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
+ + +A + N+ AI+ A YPG EGG A+A ++ G ++P GRLP+T+Y
Sbjct: 672 GSALSVNWA--DANLPAIVEAWYPGGEGGHAVAQLLAGDYSPAGRLPVTFYR-------- 721
Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
++ L P GRTY+++ G LYPFGYGLSYT+F Y
Sbjct: 722 -SAGDLPPFADYAMKGRTYRYFGGEVLYPFGYGLSYTRFSYG------------------ 762
Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY 727
P + + D N G DG +VV +Y P T
Sbjct: 763 -------------APQLSARSVSADGEITVTTQVTNTGGMDGEEVVQLYVSHPGR-DGTP 808
Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
I+ + GFQR+ ++ G + + F + L++VD N + G ++VG G
Sbjct: 809 IRALQGFQRIGLKRGETRPVSFTLKD-RQLSVVDAEGNRRVEPGRVEVWVGGG 860
>gi|427411073|ref|ZP_18901275.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
51230]
gi|425710258|gb|EKU73280.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
51230]
Length = 791
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 231/782 (29%), Positives = 355/782 (45%), Gaps = 110/782 (14%)
Query: 29 GSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
G SP F G F++ SF P + +D + L V L +A
Sbjct: 86 GKLSPTFPSGIGHFTRPSDGRGSFS------PRVVPGRDPRRTVAL---VNGLQKWAMTQ 136
Query: 89 PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
RLG+P + E LHG + VG ATSFP I +S++ ++ +++ Q ++
Sbjct: 137 TRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPAMLRQVNQVIA 184
Query: 149 TEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
E RA R SP +++ARDPRWGRI ET GEDP++VG V V GLQ V
Sbjct: 185 REIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGV--- 236
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+P V + KH + G + A V+E+++ E F PFE VK
Sbjct: 237 --GRSRTLQPNHVFATLKHLTGHGQPE-SGTN--IGPAPVSERELRENFFPPFEQVVKRT 291
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
+VM SYN ++G+PS A+ LL+ +R EW G +V+D ++ ++ H +A + E
Sbjct: 292 GIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIH-HIAANLE 350
Query: 329 DAVAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
+A + L AG+D D + + T G V++GKV E +D +++ + + R G F+ +P
Sbjct: 351 EAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLFE-NPY 409
Query: 388 YVSLGKQDICSDENIE-LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ I ++E+ LA AA+ I LLKND LPL T+AV+GP +A VA
Sbjct: 410 ADANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPE--GTIAVIGP--SAAVAR 464
Query: 447 IGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCD---------DVACKSNNS-----IF 488
+G Y G P +S + G AN+ + G D KS+ + I
Sbjct: 465 LGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTKSDPAENRKLIA 524
Query: 489 AASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLINQVAEVAKGPVI 542
A EAA+ D I+ G E DR L L G Q +L + + + K P+
Sbjct: 525 QAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDALKALGK-PIT 583
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
+V+++ G + + + AIL Y GE+GG A+AD++FG NPGG+LP+T
Sbjct: 584 VVLIN--GRPASTVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVTVPRS-- 639
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
LPL ++P R Y F LYPFG+GLSYT F +
Sbjct: 640 AGQLPLF-YNMKPSAR-----RGYLFDTTDPLYPFGFGLSYTSFSLS------------- 680
Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
P + + VD +N G+ +G +VV +Y +
Sbjct: 681 ------------------APRLSATRIGTGGKTSVSVDVRNTGAREGDEVVQLYIRDKVS 722
Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGV 782
+K++ GFQRV ++ G ++ I F ++L + + ++ G+ I GN V
Sbjct: 723 SVTRPVKELKGFQRVTLKPGESRTITFTV-GPEALQMWNDQMRRVVEPGDFEIMTGNSSV 781
Query: 783 SF 784
+
Sbjct: 782 AL 783
>gi|404406439|ref|ZP_10998023.1| glycoside hydrolase 3 [Alistipes sp. JC136]
Length = 925
Score = 271 bits (694), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 194/686 (28%), Positives = 327/686 (47%), Gaps = 88/686 (12%)
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRI 180
AT+FP+ + ++N L +K G+ V EAR + G T ++P ++V RD RWGR
Sbjct: 180 ATNFPSQLGMGHTWNRELLRKTGRIVGREARLL------GYTNIYAPVLDVGRDQRWGRY 233
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GE P++V V G+Q TD +V+S KH+AAY +
Sbjct: 234 EEVFGESPYLVAELGVAMASGMQ--------TDY-----QVASTAKHFAAYSNNKGAREG 280
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
D ++ +++E L PF ++ VM SYN +G+P L + +RGE
Sbjct: 281 MSRVDPQMPPREVENIHLMPFREVIRRAGILGVMSSYNDYDGVPIQGSRYWLTERLRGEM 340
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQ 356
GY+V+D S++ + + H A ++ DAV Q+++AGL++ C + Y ++
Sbjct: 341 GFRGYVVSDSGSVEYLHNKH-HTAVNQLDAVRQSIEAGLNVRCNFWHPETYVMPLRQLLR 399
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY-VSLGKQDICSDENIELAAEAAREGIVL 415
+G + E +D ++ + V +G FD Q ++ +++ E+ E+A +A+RE IVL
Sbjct: 400 EGLITEELLDSRVRDVLRVKFLVGLFDRPYQTDLAAADREVDGPEHNEVALQASRESIVL 459
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----GYANVT 471
LKN+ +TLPL++ K++ +AV+GP+A+A +G+Y + S + G +
Sbjct: 460 LKNENSTLPLDARKIRRIAVLGPNADARGFALGHYGPLAVEVTSVLDGLKRNLGARCEIV 519
Query: 472 YKTGC--------------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
Y+ GC +++ + I A+EAA +D +++ G E+ R
Sbjct: 520 YEKGCELVDAAWPLSEIFREEMTPEEKAGIRRAAEAASESDVAVVVLGGGSRTCGENCSR 579
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
L LPG Q +L+ V K P +LV+++ I +A + ++ AI+ A YPG GG+
Sbjct: 580 SSLDLPGRQEELLRAVEATGK-PTVLVMINGRPNSINWA--DAHVDAIVEAWYPGAHGGQ 636
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG-----YPGRTYKFYNGP 632
A+ +V+FG++NPGG+L +T+ +V +P + P +P + PG NG
Sbjct: 637 AVYEVLFGEYNPGGKLTVTFPR--HVGQIPF-NFPYKPAANTDGGLTPGPGGNQTRING- 692
Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
LY FGYGLSYT F+Y L +R D
Sbjct: 693 ALYDFGYGLSYTTFEYADLRIEPQT-------------------------------IRQD 721
Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
+ F D N G DG +VV +Y TY K + GF RV ++AG +R+
Sbjct: 722 EPFRVSFDVTNTGQRDGDEVVQLYIHDVLSSVTTYEKNLRGFDRVHLKAGETRRVTMQVR 781
Query: 753 ACKSLNIVDYAANTLLPAGEHTIFVG 778
+ L++++ ++ G+ + +G
Sbjct: 782 P-QDLSLLNERMERVVEPGDFDVLIG 806
>gi|365877135|ref|ZP_09416640.1| glycoside hydrolase family protein [Elizabethkingia anophelis Ag1]
gi|442587941|ref|ZP_21006755.1| glycoside hydrolase family protein [Elizabethkingia anophelis R26]
gi|365754995|gb|EHM96929.1| glycoside hydrolase family protein [Elizabethkingia anophelis Ag1]
gi|442562440|gb|ELR79661.1| glycoside hydrolase family protein [Elizabethkingia anophelis R26]
Length = 827
Score = 271 bits (693), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 221/816 (27%), Positives = 361/816 (44%), Gaps = 158/816 (19%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQ-LGDFAHG-VPRLGLPQYEWWSEA-LHGVSNV 109
+F D P RV++L+S+MTL EK Q + + +G + + P +W +E +HG++N+
Sbjct: 66 IFEDRKEPIDKRVENLISQMTLQEKANQTVTLYGYGRILKDEQPTSQWKNEVWVHGLANI 125
Query: 110 G------------------------------------------PGTHFDDVIPG-----A 122
P ++ I G A
Sbjct: 126 DEMLNSLPYHKSAVTKYSYPYSNHTEALNNIQKWFIEETRLGIPVDFTNEGIHGLTHDRA 185
Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
T FP I +++++ L KIG + EA Y LG + ++P ++V+RDPRWGR+ E
Sbjct: 186 TPFPAPINIGSTWDKDLVGKIGNTIGKEA---YYLGYTNV--YAPILDVSRDPRWGRVVE 240
Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
T GEDPF++G Y V+G+Q +N V+S KHYA Y V
Sbjct: 241 TYGEDPFMIGEYGKRMVKGIQ-----QNG---------VASTLKHYAVYSVPKGGRDGLA 286
Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
D V ++M +L PF+ +++ VM SYN +G+P + L +R E+
Sbjct: 287 RTDPHVAPKEMHTMYLYPFKEVIRKEHPLGVMASYNDYDGVPVISSKYFLTDLLRKEYGF 346
Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG---------N 353
GY+V+D D+++ + H +A E+ + + L+AGLD+ TNFT +
Sbjct: 347 DGYVVSDSDALEFLHGKH-HVAKDYEEGIQKALEAGLDVR-----TNFTQPKEYLTALMD 400
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREG 412
A++ GK+KE +++ ++ + RLG FD + ++ + + + E+ L+ + R
Sbjct: 401 ALKSGKIKEEVLNERVRSVLKTKFRLGLFDEPIRNFIKEADRKVHTKEDEALSVDVNRRS 460
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA---- 468
+VLLKN++ TLPL++ K+K + + GP A+A Y + G YA
Sbjct: 461 VVLLKNEKQTLPLDTGKLKNILITGPLADAVNYTTSRYGPSNNPVTTIRKGIEDYASLHH 520
Query: 469 -NVTYKTGCD--------------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
N +Y G D + K + I A+ +D I + G E
Sbjct: 521 INTSYTKGVDVIDEGWPETEIIPVEPTEKEKSEISKTISMAEKSDVIIAVMGESEKEVGE 580
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
S R L LPG QT + Q+ + K P++LV+++ + I + N + AIL + G
Sbjct: 581 SRSRSSLNLPGKQTYFLQQLYKTRK-PIVLVLVNGRPLTINWE--NKYLPAILETWFLGP 637
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
+ G +A+ +FG+ NPGG+LPI++ + L + + P +P G PG GP
Sbjct: 638 QSGNIVAETLFGENNPGGKLPISFPKS--IGQLEM-NFPTKPAAQAGQPG------TGPN 688
Query: 634 ----------LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
LYPFGYGLSYT F++ S +SK G
Sbjct: 689 GSGSSRVTGFLYPFGYGLSYTNFEFTDFSL----------------------SSKKIKAG 726
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
N+L K+ N G G +VV +Y TY + GF+RV + G
Sbjct: 727 ---NELHA------KLKVTNTGKVKGDEVVQLYLSDLVSSVTTYEMDLRGFERVTLEPGE 777
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
K ++F N + + +++ ++ GE + VGN
Sbjct: 778 AKEVQFTLNK-EHMQLLNDKMEWVVEPGEFRVSVGN 812
>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
Length = 875
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 167/468 (35%), Positives = 252/468 (53%), Gaps = 54/468 (11%)
Query: 47 LQMSSFL-FCDSS-----LPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGL 93
L MS F+ FC ++ LPY + R DL+SR+TLDEKV + D + +PRLG+
Sbjct: 5 LMMSLFVGFCATAMDAQGLPYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGI 64
Query: 94 PQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
PQ++WW+EALHG+ G AT FP + AS++++L ++ AVS EAR
Sbjct: 65 PQFQWWNEALHGIGRNG----------FATVFPITMAMAASWDDALLHQVFTAVSDEARV 114
Query: 154 MYNLGR--------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
+ L++W+PNIN+ RDPRWGR ET GEDP++ + + VRGLQ V
Sbjct: 115 KAQQAKCTGDIKRYQSLSFWTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQGV 174
Query: 206 EGHENATDLN-SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEM 263
G+ N DL S+ K+ +C KH+A + W +R+ F+ + E+D+ ET+L F+
Sbjct: 175 -GY-NGEDLGVSKYRKLLACAKHFAVHSGPEW---NRHEFNIENLPERDLWETYLPAFKA 229
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
V+EG + VMC+Y R++G CA + Q +R EW G I +DC +I+ + +
Sbjct: 230 LVQEGKVAEVMCAYQRIDGQACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNV 289
Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
+ +A A+ + AG D++CG Y + AV++G VKE DID+SL+ L LG D
Sbjct: 290 SKDGAEASAKAVLAGTDVECGSEYKHLP-EAVRRGDVKEADIDRSLRRLLIARFELGDMD 348
Query: 384 GSP--QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL------NSAKVKTVAV 435
+ + + + S + +LA + A + IVLL+N LPL + K + V
Sbjct: 349 SDDLNAWTKIPETVVASQAHKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVV 408
Query: 436 VGPHANATVAMIGNYAGIPCRYMSPIAG-------FSGYANVTYKTGC 476
+GP+AN +V M GNYAG P ++ + G S A V + GC
Sbjct: 409 MGPNANDSVMMWGNYAGYPTHTVTALDGITRMAKTLSPDATVRFIQGC 456
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/292 (27%), Positives = 123/292 (42%), Gaps = 68/292 (23%)
Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
I + G+ ++E E + DR + LP Q L+ + + K ++ ++ G
Sbjct: 624 IFVGGISPNLEGEEMRVNEPGFKGGDRTSIELPQAQRDLLAVLHKAGKK---VIFVNCSG 680
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
+A A AIL Y GE+GG A+A +FG P G+LP+T+Y LP
Sbjct: 681 SAMALAPELETCDAILQWWYGGEQGGAALATTLFGMVAPSGKLPVTFYKS--TDELP--- 735
Query: 611 MPLRPVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
D L Y RTY++Y G L+PFG+GL YT F + K I N NK+Q
Sbjct: 736 ------DFLDYTMKNRTYRYYEGEPLFPFGFGLGYTTF-----NIDKPIYKN-NKVQ--- 780
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
V +N+G+T G++ V VY + A+
Sbjct: 781 ------------------------------VRVKNLGTTAGTETVQVYIRHLADKEGPK- 809
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVGN 779
K + +Q+V + A K I KS D NT+ + G++ + VGN
Sbjct: 810 KSLRAYQQVTLNAAEAKTISIEL-PRKSFEGWDVKTNTMRVVPGKYEVMVGN 860
>gi|423301682|ref|ZP_17279705.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
CL09T03C10]
gi|408471675|gb|EKJ90206.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
CL09T03C10]
Length = 1365
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 230/797 (28%), Positives = 352/797 (44%), Gaps = 146/797 (18%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGD---------------------------FAH 86
+ + LP RVKDL+ RMT +EK+ Q+ F
Sbjct: 536 YQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERKLEEKAQGMSWGFVE 595
Query: 87 GVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
G P RLG+P + +E+LHGV V GAT F
Sbjct: 596 GFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGV-----------VHEGATVF 643
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPG 185
P I ++F+ L + ++ E A+ SP I+V RD RWGR+ E+ G
Sbjct: 644 PQNIALGSTFDTDLAYRKTSMIADELHAV-----GMRQVLSPCIDVVRDLRWGRVEESFG 698
Query: 186 EDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
EDP++ GR+ + V+G D +S KHY + + G++ +
Sbjct: 699 EDPYLCGRFGIAEVKGYMDN--------------GISPMLKHYGPHG-NPLSGLNLASVE 743
Query: 246 ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY 305
+ +D+ E +L+PFEM +K+ +VM +YN N IP+ A LL +R EW GY
Sbjct: 744 TSI--RDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLLTDVLRKEWGFKGY 801
Query: 306 IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDI 365
+ +D +I+ M+ N F A + E+A Q L AGLD++ +++G++ +
Sbjct: 802 VYSDWGAIE-MLKNFHFTARNSEEAALQALTAGLDVEASSDCYPAIPGLIERGELNREIV 860
Query: 366 DKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
D++++ + R+G FD P K I S + I L+ + A E VLLKND+ LPL
Sbjct: 861 DEAVRRVLYAKFRIGLFD-DPYGEKFAKGAIHSGKAIALSKKIADESTVLLKNDRQLLPL 919
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGI-PCRY-MSPIAGFSGYA----NVTYKTGCDDV 479
+ K+K++AV+GP NA G+Y R+ ++P+ G +A V Y GC V
Sbjct: 920 SIGKLKSIAVIGP--NADQIQFGDYTWTRDNRFGVTPLQGIRKWAGTNVKVNYVKGCSLV 977
Query: 480 ACKSNNSIFAASEAAKTADATIILAG---------LDLSVEAESLDREDLWLPGYQTQLI 530
+ + I A EAA+ +D ++ G S E D DL L G Q LI
Sbjct: 978 SM-DESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGFDLNDLTLTGAQPALI 1036
Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
V K PVILV+++ G A NI AIL Y GE+ G +IAD++FGK +P
Sbjct: 1037 KAVQATGK-PVILVLVT--GKPFAIPWEKKNIPAILVQWYAGEQSGNSIADILFGKVSPS 1093
Query: 591 GRLPITWYNGDYVQMLPLTSMPLR-------PVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GRL ++ + LP+ LR S PGR Y F L+ FG+GL+Y
Sbjct: 1094 GRLTFSF--PESTGHLPVFYNHLRSDRGFYKSPGSYDSPGRDYVFSAPVPLWSFGHGLTY 1151
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T F+Y+ L +T + LN H R +D +N
Sbjct: 1152 TTFEYSNLQTDRTSYL-LNDTVHVR------------------------------IDLKN 1180
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G +G +VV +Y A + Q+ F++V ++AG + ++ L I++
Sbjct: 1181 TGKREGKEVVQLYVSDVYSSVAMPVHQLRDFRKVALQAGETQTVRLSI-PVSELTILNEK 1239
Query: 764 ANTLLPAGEHTIFVGNG 780
++ GE I VG+
Sbjct: 1240 NEAIVEPGEFEIQVGSA 1256
>gi|224535195|ref|ZP_03675734.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523186|gb|EEF92291.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
DSM 14838]
Length = 733
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 220/792 (27%), Positives = 376/792 (47%), Gaps = 126/792 (15%)
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA------------------- 85
L ++ ++ D+ P RVKDL++RMTL EKV QL +
Sbjct: 16 LSVRSQKPVYKDAGQPVETRVKDLLNRMTLHEKVLQLNQYTFGENDNPNNIGTEVKNLPA 75
Query: 86 --------HGVPRL-GLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASF 135
H P+L Q + E+ G+ P DVI G T +P + SF
Sbjct: 76 EIGSLIYLHTDPKLRNRIQRKAMEESRLGI----PILFGFDVIHGLRTVYPISLAQACSF 131
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
N L + QA A+ +G+ + +SP I+VARDPRWGRI+E GEDP+
Sbjct: 132 NPDL---VTQACGMAAKESV---LSGIDWTFSPMIDVARDPRWGRISECYGEDPY----- 180
Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
+N V G+ V+G++ + S P +++C KHY Y V G D + D ++ Q +
Sbjct: 181 -LNTVFGVASVKGYQG--EKLSDPYSIAACLKHYVGYGVSE-GGRDYRYTD--ISPQALW 234
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
ET+L P+E CVK G A+++M S+N ++G+P+ ++ +L + ++ +W G++V+D ++I+
Sbjct: 235 ETYLPPYEACVKAG-AATLMSSFNDISGVPATSNHYILTEILKNKWRHDGFVVSDWNAIE 293
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
++ ++ +A ++++A + AG+++D Y + V + K++ + ID ++ +
Sbjct: 294 QLI--YQGVAKNRKEAAYKAFHAGVEMDMRDNVYYEYLEQLVAEKKIEISQIDDAVARIL 351
Query: 374 TVLMRLGFFDGSPQYVSLGKQD-ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
V RLG FD P L +Q+ E+I LAA A E +VLLKN++N LPL+S VK
Sbjct: 352 RVKFRLGLFD-EPYTKELTEQERYLQKEDIALAARLAEESMVLLKNEKNLLPLSST-VKR 409
Query: 433 VAVVGPHANATVAMIGNYA------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS 486
VA++GP ++G +A + Y F + Y+ GC A N+
Sbjct: 410 VALIGPMVKDRSDLLGAWAFKGQAEDVETIYEGMQKEFGDKVRLDYEQGC---ALDGNDE 466
Query: 487 --IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
AA + A+ +D ++ G E+ R + LP Q +L+ + + K P++LV
Sbjct: 467 SGFSAALKTAEASDVVVVCLGESKQWSGENASRSTIALPDIQEKLLLHLKQANK-PIVLV 525
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
+ S G + ++AI+ PG GG +A ++ G+ NP G+L +T+
Sbjct: 526 LSS--GRPLELIRLEPQVEAIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVTF------- 576
Query: 605 MLPLTS--MPL--------RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
PL++ +P+ RP D++G Y+ LYPFGYGLSYT F Y+
Sbjct: 577 --PLSTGQIPVYYNMRQSARPFDAMG----DYQDIPTEPLYPFGYGLSYTTFTYS----- 625
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
L+ L+ +N T++ + T N G +G + V+
Sbjct: 626 ---DAKLSSLKIKKNQKITAEVTVT-----------------------NAGKVEGKETVL 659
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
Y P + +K++ F++ ++ G ++ +F + + L+ D L AGE
Sbjct: 660 WYVSDPFCSISRPMKELKFFEKQSLKVGESRVFRFEIDPMRDLSYTDATGKRFLEAGEFI 719
Query: 775 IFVGNGGVSFPI 786
+ VG ++F +
Sbjct: 720 VSVGGRKLTFEV 731
>gi|364284956|gb|AEW47953.1| GHF3 protein [uncultured bacterium D1_14]
gi|364284964|gb|AEW47958.1| GHF3 protein [uncultured bacterium E2_1]
Length = 752
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 218/733 (29%), Positives = 347/733 (47%), Gaps = 102/733 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHG-----VPRL------GLPQYEWWSEALHGVSNVG-- 110
RV+ L+++MTL+EK+ Q+ + V RL G E E ++ + V
Sbjct: 36 RVESLLTKMTLEEKIGQMNQVSFSGNIEEVSRLIKNGEVGSILNEVDPERVNALQRVAIE 95
Query: 111 ------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
P DVI G T FP + ASFN + +K + + EA ++ G+
Sbjct: 96 ESRLGIPILIGRDVIHGFKTIFPIPLGQAASFNPQIVEKGARVSAVEASSV------GVR 149
Query: 164 Y-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
+ ++P I+++RDPRWGRI E+ GEDP++ V+G Q D + P ++
Sbjct: 150 WTFTPMIDISRDPRWGRIAESCGEDPYLTSVMGAAMVKGFQG--------DSLNNPNSIA 201
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+C KH+ Y R + +TE+ + +L PFE VK+G A+ M S+N +G
Sbjct: 202 ACAKHFVGYGAAEG---GRDYNTTCITERQLRNVYLPPFEAAVKQGVAT-FMTSFNANDG 257
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
IPS +P +L + +R EW G++V+D SI MV H F D K DA + + AG+D++
Sbjct: 258 IPSSGNPFILKKVLRDEWGFDGFVVSDWASIIEMVA-HGFCTDDK-DAAMKAVNAGVDME 315
Query: 343 CGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDEN 401
Y Y N + + KV E ID +++ + V RLG FD +P I S EN
Sbjct: 316 MVSYTYMNHLKDLKNENKVSEETIDNAVRNILRVKFRLGLFD-NPYVDEKAPSPIYSKEN 374
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMS 459
+ +A EAA + +LLKND+ LP+N + VKT+AVVGP A+A +G +A G +
Sbjct: 375 LAIAKEAAIQSAILLKNDKQILPINES-VKTIAVVGPMADAPYEQMGTWAFDGEKSMTQT 433
Query: 460 PIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
P+ + N ++ G K+ + I A AA AD + G + + E+
Sbjct: 434 PLMALRQFYGDKVNFIFEPGLAYTRDKNTSGISKAVSAANRADLVLAFVGEEAILSGEAH 493
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
+L L G Q+ LIN +A+ K P++ V+++ G + + KA+L++ +PG G
Sbjct: 494 CLANLNLQGAQSDLINALAKTGK-PIVTVVIA--GRPLTIGKEAELSKAVLYSFHPGTMG 550
Query: 576 GRAIADVVFGKFNPGGRLPITW---------YNGDYVQMLPLTSMPLRPVDSLGY-PGRT 625
G AIAD++FGK P G+ P+T+ Y Y P + +D++ G+T
Sbjct: 551 GPAIADLLFGKAVPSGKTPVTFPKEVGQIPIYYSHYNTGRPANRNEIL-LDNIAVGAGQT 609
Query: 626 ----YKFY---NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
FY LYPFG+GLSYT F+Y+ NL +S
Sbjct: 610 SLGNTSFYLDAGFDPLYPFGFGLSYTTFEYS-------------------NLKLSS---- 646
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
N+L D D +N G+ +G++V +Y + +K++ F R+
Sbjct: 647 --------NELSAKDELTVTFDLKNTGNYEGAEVAQLYVRDMVGSVVRPVKELKRFNRIT 698
Query: 739 VRAGRNKRIKFVF 751
++ G + + F
Sbjct: 699 LKPGETRNVSMTF 711
>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 854
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 164/434 (37%), Positives = 248/434 (57%), Gaps = 41/434 (9%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
G+ + L+ D P R+ DL+SR+T++EK+ L + G+ RL +P+Y +EALHG
Sbjct: 20 GVAQAQELYKDEKAPMHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHG 79
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG---- 161
V V PG T FP I A++N L ++ +S EARA +N G
Sbjct: 80 V--VRPGRF--------TVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQK 129
Query: 162 ------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
LT+WSP +N+ARDPRWGR ET GEDP++ G +V+GLQ G ++
Sbjct: 130 SQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GDDD----- 181
Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
R LK+ S KH+AA + ++ +R+ + +++E+ + E +L FE CVK+G ++S+M
Sbjct: 182 -RYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMS 236
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
+YN +N +P + LL + +R +W GY+V+DC ++V+ HK++ +KE A A ++
Sbjct: 237 AYNALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAAALSI 295
Query: 336 KAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG 392
KAGLDL+CG Y +A +Q V + DID + + M LG FD Q Y +
Sbjct: 296 KAGLDLECGDDVYDQPLLSAYRQYMVTDADIDSAAYRVLRARMELGLFDSGEQNPYTKIS 355
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
I S E+ E+A AARE IVLLKN + LPLN+ KVK++AVVG NA + G+Y+G
Sbjct: 356 PAVIGSAEHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGSSEFGDYSG 413
Query: 453 IPCRYMSPIAGFSG 466
+P ++PI+ G
Sbjct: 414 LPV--IAPISVLQG 425
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 100/293 (34%), Positives = 147/293 (50%), Gaps = 54/293 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 595 AGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKV--NPNIVVVLVAG 652
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + +I AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y L
Sbjct: 653 S-SLAINWMDEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 704
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTYK++ G LYPFGYGLSYT FKY+ +QV
Sbjct: 705 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYS------NLQV--------- 747
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGSDVVIVYSKPPAEIAAT 726
D E V FQ N G G +V VY K P
Sbjct: 748 ----------------------ADGEEEINVSFQLKNSGKYAGDEVAQVYVKLPERDEVM 785
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
IK++ GF+RV +++G NK++ L D A + + P+G++TI VG
Sbjct: 786 PIKELKGFERVTLKSGENKKVTLKLRK-DLLRYWDEAKDKFVCPSGDYTIMVG 837
>gi|389696043|ref|ZP_10183685.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
gi|388584849|gb|EIM25144.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
Length = 751
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 230/759 (30%), Positives = 354/759 (46%), Gaps = 114/759 (15%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG-VSNVGPG---------- 112
RV +L+ RMTL+EKV QL +HG P ++E SE G V N
Sbjct: 39 RVNELLGRMTLEEKVGQLNLVSHGPPL----RWEDISEGKAGAVLNFNSAEDVARAQALV 94
Query: 113 --THFD-------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGL 162
+H DV+ G T FP + A+F+ + + + + EA + G+
Sbjct: 95 RESHLKIPLLFGLDVLHGFRTQFPLPLGEAAAFSPRVSRLASEWAAREASYV------GV 148
Query: 163 TY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
+ ++P +++RD RWGRI E GEDP + V G R +
Sbjct: 149 NWTFAPMADLSRDSRWGRIVEGFGEDPTLGAALTAARVEGF--------------RKGGL 194
Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
++ KH+A Y R + + +M +T+L PF V+ G AS M ++N +N
Sbjct: 195 AAAAKHFAGYGAPQG---GRDYDTTYIPRAEMYDTYLPPFRAAVEAGTAS-FMAAFNALN 250
Query: 282 GIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
G PS A+P LL +R +W G++ +D I +V NH AD E A + + AG+D+
Sbjct: 251 GEPSTANPWLLTDVLRTQWGFDGFVTSDWVGIGELV-NHGIAADGAE-AARKAILAGVDM 308
Query: 342 DC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
D GQ Y N + V+ G+V E+ ID+S++ + RLG FD S + S E
Sbjct: 309 DMMGQLYINHLPDEVRAGRVPESVIDESVRRVLRTKFRLGLFDRPDVDSSHLDSEFPSPE 368
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--------- 451
+ + A E ARE VLL+N + LP+ S KV+++AVVGP A+A +G +A
Sbjct: 369 SRQAAREVARETFVLLQNRDDVLPIPS-KVRSIAVVGPLADAPQDQMGPHAARGHKEDSV 427
Query: 452 ----GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
GI R S AG + V + GCD + C++ +++ A EAA+ +D I + G
Sbjct: 428 TILEGIRRRAQS--AGIA----VRHAPGCD-LFCRNTDALPGALEAARQSDFVIAVFGEP 480
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
+ E+ R ++ L G Q +++ ++A+ K PV LVIM GG I +IL
Sbjct: 481 QELSGEAASRANMELNGKQIEVLEELAKTGK-PVALVIM--GGRPQVLGPVADRIPSILM 537
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL--TSMPL-RPVDSLGYPGR 624
A YPG E G A+ADV+FG +P G+LP+TW LPL +P RP +
Sbjct: 538 AWYPGTEAGPAVADVLFGDVSPSGKLPLTWPRA--TGQLPLYYNRLPTGRPTLANNRFTL 595
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
Y + LYPFG+GLSYT F Y SDA +
Sbjct: 596 HYIDESIAPLYPFGWGLSYTHFAY-------------------------SDAR------I 624
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
L E +D +N G+ DG +VV +Y++ P + ++++ F+++ +++G
Sbjct: 625 ASRQLDEGQVLEVSLDVKNTGARDGQEVVQLYTRDPVASRSRPLRELKAFEKIALKSGET 684
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
KR+ +SL L+ AG +FVG ++
Sbjct: 685 KRVTLRV-PVESLGFHLDDGTYLVEAGAIQVFVGGSSLA 722
>gi|393787054|ref|ZP_10375186.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
CL02T12C05]
gi|392658289|gb|EIY51919.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
CL02T12C05]
Length = 958
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 223/809 (27%), Positives = 366/809 (45%), Gaps = 144/809 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D + P R++DL+S+M L+EK Q+ +G R+ LP EW W
Sbjct: 64 VYEDPTAPIDARIEDLLSQMNLNEKTCQMVTL-YGYKRVLKDALPTPEWKQMLWKDGMGA 122
Query: 101 --EALHGVSNVG-PGTHFDDVIPG------------------------------------ 121
E L+G G P + ++V P
Sbjct: 123 IDEHLNGFQQWGLPPSDNENVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVES 182
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L ++G EAR + G T ++P ++V RD RWG
Sbjct: 183 YKATNFPTQLGLGHTWNRKLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWG 236
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + V+G+Q +V++ KH+ AY +
Sbjct: 237 RYEEVYGESPYLVAELGIEMVKGMQ-------------HNYQVAATGKHFIAYSNNKGAR 283
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + PF+ ++E VM SYN +G+P + L +RG
Sbjct: 284 EGMARVDPQMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGLPVQSSYYWLMTRLRG 343
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
+ GY+V+D D+++ + H D KE AV Q+++AGL++ C Y
Sbjct: 344 QMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 402
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE--LAAEAAREG 412
VQ+G + E I+ ++ + V +G FD +P L D ++ +A +A+RE
Sbjct: 403 VQEGGLSEEIINDRVRDILRVKFLVGLFD-TPYQTDLKGADEEVEKEENEIVALQASRES 461
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYA 468
IVLLKND+N LPL+ A ++ +AV GP+A+ T + +Y + + ++G G A
Sbjct: 462 IVLLKNDKNALPLDVASIRKIAVCGPNADETAYALTHYGPLAVDVTTVLSGIRQKVDGKA 521
Query: 469 NVTYKTGCDDVACK--------------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
V Y GC+ V N I A AK AD +++ G E+
Sbjct: 522 EVLYTKGCELVDANWPESEIIDYPLTNDEQNKIDKAVAQAKEADVAVVVLGGGQRTCGEN 581
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q L+ V K PV+LV+++ + + +A+ + AI+ A YPG +
Sbjct: 582 KSRSSLDLPGRQLDLLKAVQATGK-PVVLVLINGRPLSVNWAD--KFVPAIIEAWYPGSK 638
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--Y 629
GG A+ADV+FG +NPGG+L +T+ V +P + P +P +D PG
Sbjct: 639 GGTAVADVLFGDYNPGGKLTVTFPKS--VGQIPF-NFPCKPSSQIDGGKNPGPKGNMSRV 695
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
NG LYPFG+GLSYT F+Y+ +S + + K+Q +
Sbjct: 696 NG-ALYPFGHGLSYTTFEYSDISISPKVITPNQKVQ-----------------------V 731
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
RC N G G +VV +Y + TY K + GF+R+ ++ G K + F
Sbjct: 732 RC--------KITNTGKRAGDEVVQLYVRDILSSVTTYEKNLEGFERIHLQPGETKEVSF 783
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ K+L +++ + ++ G+ +I +G
Sbjct: 784 TLDR-KALELLNAKNDWVVEPGDFSIMLG 811
>gi|315499711|ref|YP_004088514.1| beta-glucosidase [Asticcacaulis excentricus CB 48]
gi|315417723|gb|ADU14363.1| Beta-glucosidase [Asticcacaulis excentricus CB 48]
Length = 869
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 167/420 (39%), Positives = 229/420 (54%), Gaps = 40/420 (9%)
Query: 69 VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
++RMT+++K Q+ + A +P GL YEWW+E LHGV+ G AT FP
Sbjct: 40 IARMTVEQKAAQMQNRAPDLPSAGLTAYEWWNEGLHGVARAGE----------ATVFPQA 89
Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRWGRI 180
I A++N +L K++G VSTEARA +N GLT WSPNIN+ RDPRWGR
Sbjct: 90 IGLAATWNPALLKQVGDVVSTEARAKFNSTDPAGDHQRYYGLTLWSPNINIFRDPRWGRG 149
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
ET GEDPF+ R A +V GLQ + KV + KH A V +
Sbjct: 150 QETYGEDPFLTSRLAEGFVTGLQGPDPQHP---------KVVASVKHLA---VHSGPEAG 197
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
R+ F A V+ D+E T+L F V A SVMC+YN V G+P+CA LL VR W
Sbjct: 198 RHGFAASVSPYDLEMTYLPAFRYSVMTTKAQSVMCAYNAVGGVPACASDLLLKTYVREAW 257
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
GY+V DCD+I M H + + E + A++LKAG+DL+CG Y AVQ+G +
Sbjct: 258 GFKGYVVTDCDAIYDMTRFHFYRLNDAESS-AESLKAGVDLNCGNAYAALP-EAVQKGLI 315
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVLLKND 419
E+ +D+SL L V RLG DG+P + + + I + + LA +AA + +VLLKN+
Sbjct: 316 PESLMDQSLNRLLDVRKRLG-IDGAPSPWARISPEAINTPQAQGLALQAAEQSLVLLKNN 374
Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYKTGC 476
LPL +TVAV+GP+A+ + GNY GI + ++P+ G G A V Y G
Sbjct: 375 -GVLPLKPG--QTVAVIGPNADTEETLRGNYNGIARQPVTPLTGLRAQLGAAKVLYAQGA 431
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 127/298 (42%), Gaps = 56/298 (18%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL +E E L DR DL LP Q L+ V K P+++V++S V +
Sbjct: 608 GLSPDIEGEELQILVPGFDRGDRTDLGLPRTQEDLLKAVKATGK-PLVVVLLSGSAVALN 666
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+ + + W YPGE GG AIA + G+ NP GRLP+T+Y VQ LP
Sbjct: 667 WADAHADAVVAAW--YPGEAGGTAIARTLTGEANPSGRLPVTFYRS--VQDLP------- 715
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P GRTY+++ G LYPFG+GLSYTQF Y+ L +
Sbjct: 716 PFIDYRMEGRTYRYFKGKPLYPFGHGLSYTQFSYSDLKLDTST----------------- 758
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
L V +N G G +VV +Y K P + F
Sbjct: 759 --------------LTAGQPLRVSVRVRNNGQRAGDEVVQLYVKRPDTFGLN--ASLAAF 802
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFNY 792
RV ++AG ++ + + + L+ V + AG + + VG G F LN ++
Sbjct: 803 ARVSLKAGESRTVVMTIDP-RDLSTVTLEGERAIRAGAYGLSVGGGQPGFAPTLNADF 859
>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 882
Score = 271 bits (692), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 166/423 (39%), Positives = 238/423 (56%), Gaps = 40/423 (9%)
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
LV++MT EK+ Q + A +PRLG+P Y+WWSE LHG++ G AT FP
Sbjct: 37 LVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNL----GR-----AGLTYWSPNINVARDPRWG 178
I AS+N L + +G STEARA +NL G+ AGLT WSPNIN+ RDPRWG
Sbjct: 87 AIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R ET GEDP++ G+ AV+++RGLQ D P +++ KH+A V +
Sbjct: 147 RGMETYGEDPYLTGQLAVSFIRGLQG--------DTPDHPRTIATP-KHFA---VHSGPE 194
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
R+ FD V+ D+E T+ F + +G A SVMC+YN ++G P+CA LLN +R
Sbjct: 195 QGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRN 254
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
+W +G++V+DCD+I+ M H F D+ A A LK+G DL+CG Y + A+ +G
Sbjct: 255 DWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGDDLNCGNTYRDLN-QAIARG 312
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLL 416
+ E+ +D++L L+T RLG Y ++G + I + + LA +AA + +VLL
Sbjct: 313 DIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAHRALALQAAAQSLVLL 372
Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYK 473
KN NTLPL T+AV+GP A++ A+ NY G ++P+ G G A V Y
Sbjct: 373 KNSGNTLPL--PPETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLRTRFGTAKVHYA 430
Query: 474 TGC 476
G
Sbjct: 431 QGA 433
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 99/301 (32%), Positives = 140/301 (46%), Gaps = 55/301 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A A ADA + GL VE E L DR + LP Q L+ V K
Sbjct: 604 AERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK- 662
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
P+I+V+MS V + +A+ + + AIL A YPG+ GG AIA + G NPGGRLP+T+Y
Sbjct: 663 PLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYR 720
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
Q LP P S GRTY+++ G LYPFGYGLSYTQF Y
Sbjct: 721 S--TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAYE---------- 761
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
P + L+ + +N G+ G +VV +Y +P
Sbjct: 762 ---------------------APQLSTATLKAGNTLTVTAHVRNTGTRAGDEVVQLYLEP 800
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
P A ++ ++GF+RV +R G ++ + F +A + L+ V + AG + +FVG
Sbjct: 801 PYSPQAP-LRSLVGFKRVTLRPGESRLLTFTLDA-RQLSGVQQTGQRSVEAGHYHLFVGG 858
Query: 780 G 780
G
Sbjct: 859 G 859
>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 904
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 171/446 (38%), Positives = 237/446 (53%), Gaps = 43/446 (9%)
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
LGL +S D + R LV++MT EK+ Q + A +PRLG+P YEWWSE LH
Sbjct: 39 LGLLVSPLAHADDA---EDRATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLH 95
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA---- 160
G++ G AT FP I AS+N L +G STEARA +NL
Sbjct: 96 GIARNGE----------ATVFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKN 145
Query: 161 -----GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
GLT WSPNIN+ RDPRWGR ET GEDP++ G+ AV ++ GLQ D
Sbjct: 146 HARYGGLTIWSPNINIFRDPRWGRGMETYGEDPYLTGQLAVGFIHGLQG--------DDP 197
Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
+ P +++ KH A V + R+ FD V+ D E T+ F + EG A SVMC
Sbjct: 198 THPRTIATP-KHLA---VHSGPESGRHGFDVDVSPHDFEATYSPAFRAAIVEGHAGSVMC 253
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
+YN ++GIP+CA L++ VRG W G++V+DCD+I M H + + A L
Sbjct: 254 AYNALHGIPACAADWLIDGRVRGNWGFKGFVVSDCDAIDDMTQFH-YYRADNAGSAAAAL 312
Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGK 393
KAG DL+CG Y + G A+ +G+ +E +D+SL L+ RLG + Y LG
Sbjct: 313 KAGHDLNCGYAYRDL-GTALDRGEAEEAMLDRSLVRLFAARYRLGELQPRSKDPYARLGA 371
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
+DI S + LA +AA++ +VLL+N +TLPL +AV+GP+A+A A+ NY G
Sbjct: 372 KDIDSPTHRALALQAAQQSLVLLQNRNDTLPLRPG--LRLAVIGPNADALAALEANYQGT 429
Query: 454 PCRYMSPIAGFS---GYANVTYKTGC 476
++P+ G G V Y G
Sbjct: 430 SVAPVTPLQGLRARFGTTQVHYTQGA 455
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 92/286 (32%), Positives = 136/286 (47%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL VE E L DR DL LP Q L+ + A+ + P+I+V+MS V +
Sbjct: 641 GLSPDVEGEELRIDVPGFDGGDRNDLSLPAAQQALLER-AKASGKPLIVVLMSGSAVALN 699
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+ + + AIL A YPG+ GG AIA + G NPGGRLP+T+Y ++ L
Sbjct: 700 WAKQHAD--AILAAWYPGQSGGTAIAQALAGDINPGGRLPVTFYR---------STKDLP 748
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P S GRTY+++ G L+PFGYGLSYT F Y + T
Sbjct: 749 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTHFAYTAPQLSSTT----------------- 791
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
L+ D +N G+ G +VV VY + P A + ++ ++GF
Sbjct: 792 --------------LQAGDTLHVTTTVRNTGARAGDEVVQVYLQYPPR-AQSPLRALVGF 836
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV ++ G + + F + L+ VD + + AG++ +FVG G
Sbjct: 837 QRVSLQPGEARTLSFALEP-RQLSDVDRSGQRAVEAGDYRLFVGGG 881
>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
17393]
gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 943
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 229/809 (28%), Positives = 360/809 (44%), Gaps = 144/809 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D + R++DL+S+MTL+EK Q+ +G R+ LP EW W
Sbjct: 52 VYEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGA 110
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 111 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGIES 170
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQVGLITGREARIL------GYTNVYAPILDVGRDQRWG 224
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + VRG+Q H + +V++ KH+ AY +
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKHFVAYSNNKGAR 271
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + PF+ +KE VM SYN +G+P L +RG
Sbjct: 272 EGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRG 331
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E GY+V+D D+++ + H D KE AV Q+++AGL++ C Y
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 390
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGI 413
V++G + E I+ ++ + V +G FD Q G +++ EN LA +A+RE +
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKAENESLALQASRESL 450
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYAN 469
VLLKN+ N LPL+ VK +AV GP+A+ + +Y + + + G G A
Sbjct: 451 VLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKSEGKAE 510
Query: 470 VTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAESL 515
V Y GCD V S I A E A+ AD +++ G E+
Sbjct: 511 VLYTKGCDLVDANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVVLGGGQRTCGENK 570
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
R L LPG Q +L+ V K PV+LV+++ + I +A + + AIL A YPG +G
Sbjct: 571 SRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKG 627
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFYN 630
G A+ADV+FG +NPGG++ +T+ V +P + P +P +D PG N
Sbjct: 628 GTAVADVLFGDYNPGGKMTVTFPKS--VGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRVN 684
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
G LY FGYGLSYT F+Y+ + + K I N C+
Sbjct: 685 G-ALYSFGYGLSYTTFEYSGIEISPKVITPNQKATVRCK--------------------- 722
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
N G G +VV +Y + TY K + GF+R+ ++ G K + F
Sbjct: 723 -----------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVF 771
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ K L ++D ++ G+ +I VG
Sbjct: 772 TLDR-KQLELLDKHMEWVVEPGDFSIMVG 799
>gi|365121914|ref|ZP_09338824.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643627|gb|EHL82934.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1073
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 167/442 (37%), Positives = 248/442 (56%), Gaps = 48/442 (10%)
Query: 45 LGLQMSSFL-------FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYE 97
L LQ+SSF F D++L + R+KDL+SR+ + EK+ L + +PRLG+ +Y
Sbjct: 13 LLLQISSFAVAQINYPFRDTTLSHHERIKDLLSRLNVSEKISLLRATSPAIPRLGIDKYY 72
Query: 98 WWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
+EALHGV V PG T FP I + +N +++ A+S EAR +N
Sbjct: 73 HGNEALHGV--VRPGKF--------TVFPQAIGLASMWNPDFLQEVSTAISDEARGRWNE 122
Query: 158 GRAG----------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
G LT+WSP IN+ARDPRWGR ET GEDPF+ G +VRGLQ G
Sbjct: 123 LNQGKDQTAGASDLLTFWSPTINMARDPRWGRTPETYGEDPFLTGTLGTAFVRGLQ---G 179
Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
++ + +KV S KH+AA + ++ +R +A ++E+D+ E + FE C+KE
Sbjct: 180 ND------PKYIKVVSTPKHFAANNEEH----NRASGNAVISERDLREYYFPAFEKCIKE 229
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
G A SVM +YN VNGIP + LL +R +W GY+V+DC + + +V H ++ D+
Sbjct: 230 GQAQSVMSAYNAVNGIPCTLNKWLLTDVLRDDWGFDGYVVSDCSAPEYIVSQHHYV-DTY 288
Query: 328 EDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
E+A + +KAGLDL+CG Y NA +G V ++ID + + MRLG FD
Sbjct: 289 EEAASLCIKAGLDLECGDNVYITPLLNAYNRGMVTMSEIDSAAYRVLRGRMRLGLFDDPN 348
Query: 387 Q--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
+ Y + + +++ ELA EAAR+ +VLLKND++ LP+ + +K++AVVG NA
Sbjct: 349 ENPYNKISPSIVGCEKHRELALEAARQSLVLLKNDKDMLPIQTDNIKSIAVVG--INAAN 406
Query: 445 AMIGNYAGIPCRYMSPIAGFSG 466
G+Y+G P +PI+ G
Sbjct: 407 CEFGDYSGTPVN--TPISVLEG 426
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 84/258 (32%), Positives = 121/258 (46%), Gaps = 46/258 (17%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A E + +D TI + G+D ++E E DR + LP Q I + + V++++
Sbjct: 736 AGEIIRGSDLTIAVLGIDRTIEREGQDRSTIELPEDQQIFIEEAYKANPNTVVVLV---A 792
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
G +A + NI A+L A YPGE+GG A+A+ +FG +NPGGRLP+T+YN L+
Sbjct: 793 GSSLAINWIDQNIPAVLDAWYPGEQGGTAVAEALFGDYNPGGRLPLTFYNS-------LS 845
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+P D RTY ++ G LYPFGYGLSYT F Y R
Sbjct: 846 DLPAFD-DYNVRNNRTYMYFEGKPLYPFGYGLSYTDFAY-------------------RG 885
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
L+ T D K N G+ DG +V VY + P + +K
Sbjct: 886 LDVTQDEENVTV----------------KFFVSNTGNYDGDEVAQVYIQFPDQGTTLPLK 929
Query: 730 QVIGFQRVFVRAGRNKRI 747
Q+ GF+RV + G+ I
Sbjct: 930 QLKGFKRVHISKGQETEI 947
>gi|383115356|ref|ZP_09936112.1| hypothetical protein BSGG_2769 [Bacteroides sp. D2]
gi|313695234|gb|EFS32069.1| hypothetical protein BSGG_2769 [Bacteroides sp. D2]
Length = 735
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 218/775 (28%), Positives = 352/775 (45%), Gaps = 111/775 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ D+ P R+ DL+SRMTL+EKV QL + G E E S +G
Sbjct: 29 LYKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85
Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
+FD D I G T +P + S+N L ++
Sbjct: 86 IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G +A VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
D S ++++C KHY Y R + ++ Q + +T+L P+EM
Sbjct: 200 G--------DDMSAENRIAACLKHYIGYGASE---AGRDYVYTEISAQTLWDTYLLPYEM 248
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
VK G A+++M S+N ++G+P A+ + ++ W G+IV+D +++ + ++ L
Sbjct: 249 GVKAG-AATLMSSFNDISGVPGSANHYTMTAILKERWKHDGFIVSDWGAVEQL--KNQGL 305
Query: 324 ADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
A +K+DA AGL++D + Y V++GKV +D+S++ + V RLG F
Sbjct: 306 AATKKDAAWYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLF 365
Query: 383 DGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
+ V+ K +++ +AA+ A E +VLLKND LPL + K +AVVGP A
Sbjct: 366 ERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KRIAVVGPMAKN 423
Query: 443 TVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAAK 495
++G++ G + Y A F G A + Y GC ++ S FA A + +
Sbjct: 424 GWDLLGSWCGHGKDTDVEMLYDGLTAEFGGEAELRYAMGCKPQG--NDRSGFAGALDVVR 481
Query: 496 TADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAF 555
+D I+ G L+ E+ R + LP Q +L+ ++ E K P+ILV+ + G +
Sbjct: 482 WSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVL--SNGRPLEL 538
Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
AIL PG G R++A ++ G+ NP G+L IT P ++ +
Sbjct: 539 NRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAIT---------FPYSTGQIPI 589
Query: 616 VDSLGYPGRTYK-FYNGPT---LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
+ GR ++ FY T Y FGYGLSYT+F+Y +++ + T KL
Sbjct: 590 YYNRRKSGRWHQGFYKDITSDPFYSFGYGLSYTEFQYGVVTPSSTTVKRGEKLS------ 643
Query: 672 YTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
+V N G DG++ V + P +K++
Sbjct: 644 -------------------------VEVTVTNAGKRDGAETVHWFISDPYCSITRPVKEL 678
Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
F++ F++ G + +F + + L VD L AGE+ I+V + V +
Sbjct: 679 KHFEKQFIKVGETRTFRFDVDLERDLGFVDGNGKRFLEAGEYNIWVQDQKVKIEL 733
>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
BAA-286]
Length = 957
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 228/754 (30%), Positives = 358/754 (47%), Gaps = 107/754 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ + +LP RV+DL+S MT+++K++ L G G+P LG+P EA+HG S
Sbjct: 170 YMNPNLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK-VEAIHGFSYGS- 227
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GAT FP I A++N+ L + A+ E + + WSP ++V
Sbjct: 228 ---------GATIFPQSIGMGATWNKRLIEAAAMAIGDET-----VSANAVQAWSPVLDV 273
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
A+D RWGR ET GEDP +V +++G Q + L + P KH+AA+
Sbjct: 274 AQDARWGRCEETYGEDPVLVTEIGGAWIKGYQ-------SKGLMTTP-------KHFAAH 319
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++E++M E L PF K+ S+M SY+ G+P +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREIHLVPFRDIYKKYKYQSIMMSYSDFLGVPVAKSKEL 376
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN-F 350
L +R EW G+IV+DC +I + + A K +A Q L AG+ +CG Y +
Sbjct: 377 LKGILRDEWGFDGFIVSDCGAIGNLTARKHYTAVDKVEAARQALAAGIATNCGDTYNDPD 436
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGK--QDICSDENIELAAE 407
A ++G++ D+D + K L L R G F+ +P + + K S E+ LA +
Sbjct: 437 VIAAAKRGELNMDDLDFTCKTLLRTLFRNGLFENNPCKPLDWNKIYPGWNSPEHQALARK 496
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFS 465
A+E IVLL+N N LPL S +KT+AV+GP A+ G+Y P + S + G
Sbjct: 497 TAQESIVLLENKGNILPL-SKSLKTIAVIGPGADNL--QPGDYTSKPQPGQLKSVLTGIK 553
Query: 466 GYAN----VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA--------- 512
N V Y+ GC + + + I A +AA+ AD +++ G + EA
Sbjct: 554 AAVNSSTKVLYEEGCRFIGTEGTD-IAKAVKAAENADVAVLVLGDCSTSEALKGITNTSG 612
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E+ D L LPG Q +L+ V + K PV+L++ + ++++A N + W PG
Sbjct: 613 ENHDLATLILPGEQQKLLEAVCKTGK-PVVLILQAGRPYNLSYAAENCQAVLVNW--LPG 669
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
+EGG A ADV+FG +NP GRLP+T+ P + L + GR Y + + P
Sbjct: 670 QEGGYATADVLFGDYNPAGRLPMTF---------PRDAAQLPLYYNFKTSGRVYDYVDMP 720
Query: 633 --TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
LY FGYGLSYT F Y+ L+ ++L K N N + +A+ T
Sbjct: 721 YYPLYQFGYGLSYTSFNYSDLN------ISLEK-----NGNVSVNATVT----------- 758
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
N G G +VV +Y T + ++ F RV++ G +K++ FV
Sbjct: 759 ------------NTGKVAGDEVVQLYITDMYASVKTRVMELKDFDRVYLNPGESKKVSFV 806
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSF 784
+ L++++ + ++ G I VG S+
Sbjct: 807 LTPYQ-LSLLNDEMDRVVEKGLFKIMVGGKSPSY 839
>gi|300773468|ref|ZP_07083337.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33861]
gi|300759639|gb|EFK56466.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33861]
Length = 777
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 211/724 (29%), Positives = 330/724 (45%), Gaps = 117/724 (16%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G T FPT I +++N +L +K+ V+
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGQASTWNPALLQKMSATVAK 173
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R + + P ++++RDPRW R+ E+ GEDP + G A VRGL G
Sbjct: 174 EVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVRGL----GSG 224
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
N +D P KH+ AY + A V E+++ E FL PF+ V G
Sbjct: 225 NLSD----PFATIPTLKHFVAYGIPEG---GHNGSAASVGERELREYFLPPFQSAVAAG- 276
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM +YN V+GIP ++ LL +R EW +G+ V+D SI+ + +H+ D K+
Sbjct: 277 AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWSFNGFTVSDLGSIEGIKGSHRVAKDHKQA 336
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
A+ ++AGLD D G AV+QG+V+E ID+++ + + +G F+ V
Sbjct: 337 AIL-AIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRILALKFEMGLFEKPFVDV 395
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
K+++ ++ NI L+ + ARE IVLL+N N LPL K +A+VGP+A+ M+G+
Sbjct: 396 KTAKKEVKTESNIALSRQVARESIVLLENKNNILPLR--KDVKIAIVGPNADNVYNMLGD 453
Query: 450 YA-----GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
Y G I+ A V+Y GC + +N+ I AA AA+ +D + +
Sbjct: 454 YTAPQPDGAVTTVRQAISARLPKAQVSYVKGC-AIRDTTNSDIPAAVTAARQSDIIVAVV 512
Query: 505 G----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVAEVAKGPV 541
G D E E DR L L G Q +L+ + + K P+
Sbjct: 513 GGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLDLLGRQMELLKALKQTGK-PL 571
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
+++ + +++ +A T + A+L A YPG+EGG AIADV+FG +NP G++P++
Sbjct: 572 VVIYIQGRPLNMNWAATQAD--ALLCAWYPGQEGGHAIADVLFGDYNPAGKMPLSVPRS- 628
Query: 602 YVQMLPL-----TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
V +P+ +S+ R V+ P LY FGYG SY+ F+Y L K
Sbjct: 629 -VGQIPVHYNRKSSLDHRYVEEAATP-----------LYAFGYGKSYSDFEYKDLKIQK- 675
Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
N +Y N G DG +V +Y
Sbjct: 676 -----------ENTDY-----------------------HVSFTLTNTGKYDGDEVPQLY 701
Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
+ + ++Q+ F+R+ ++ G +K + FV A I L P I
Sbjct: 702 IRNQYASVSQPVQQLKHFERIHLKTGESKTVSFVLTAGDFSVINTQMKKVLEPGSSFKIR 761
Query: 777 VGNG 780
VG+
Sbjct: 762 VGSA 765
>gi|189464325|ref|ZP_03013110.1| hypothetical protein BACINT_00666 [Bacteroides intestinalis DSM
17393]
gi|189438115|gb|EDV07100.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 935
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 226/760 (29%), Positives = 357/760 (46%), Gaps = 111/760 (14%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHG 105
+ +S + D +LP RV+ L+S MT ++K++ + G G+P L +P EA+HG
Sbjct: 145 EKTSLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHG 203
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
S GAT FP + A++N+ L +++ AV E L + W
Sbjct: 204 FSYGS----------GATIFPQALAMGATWNKKLTEEVAMAVGDE-----TLSAGTMQAW 248
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SP ++VA+D RWGR ET GEDP +V + +++G Q + + +
Sbjct: 249 SPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------MGLYTTP 294
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+ + G D + D ++E++M E L PF ++ D S+M +Y+ G+P
Sbjct: 295 KHFGGHGAP-LGGRDSH--DIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGVPV 351
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+LL+ +R EW G+IV+DC +I + + A +K +A Q L AG+ +CG
Sbjct: 352 AKSRELLHNILREEWGFSGFIVSDCGAIGNLTARKHYTAKNKIEAANQALAAGIATNCGD 411
Query: 346 YYTNF-TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDE 400
Y + A + G++ ++D+ + + ++ R F+ +P L I SD
Sbjct: 412 TYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKAPNK-PLDWNKIYPGWNSDS 470
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI--PCRYM 458
+ E+A +AARE IVLL+N N LPL S ++T+AV+GP AN G+Y P +
Sbjct: 471 HKEMARQAARESIVLLENKDNILPL-SKDMRTIAVLGPGANDLQP--GDYTPKLQPGQLK 527
Query: 459 SPIAGFS----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-- 512
S + G V Y+ GCD + NN I A + A +D +++ G + EA
Sbjct: 528 SVLTGIKQAVGKQTKVIYEQGCDFTSLGENN-IAKAVKVASQSDVVLLVLGDCSTSEATT 586
Query: 513 -------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
E+ D L LPG Q +L+ V K PVIL++ + G ++ + KAI
Sbjct: 587 DVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQA--GRPYNLSKASELCKAI 643
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
L PG+EGG A ADV+FG +NP GRLP+T+ +V LPL + GR
Sbjct: 644 LVNWLPGQEGGPATADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRR 694
Query: 626 YKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
Y++ + LY FGYGLSYT F+Y+ L K+Q N N T A+
Sbjct: 695 YEYSDMEYYPLYYFGYGLSYTSFEYSGL-----------KIQEKENGNITVQAT------ 737
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
+N+G G +VV +Y T I ++ F R+ ++ G
Sbjct: 738 -----------------VKNIGQRAGDEVVQLYVTDMYASVKTRITELKDFTRIHLKPGE 780
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
K + F + L++++ + ++ G I V GGVS
Sbjct: 781 AKTVSFELTPYE-LSLLNDHMDRVVEKGAFKILV--GGVS 817
>gi|427387416|ref|ZP_18883472.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
12058]
gi|425725577|gb|EKU88448.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
12058]
Length = 733
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 220/792 (27%), Positives = 374/792 (47%), Gaps = 126/792 (15%)
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA------------------- 85
L ++ ++ D+ P RVKDL+ RMTL EKV QL +
Sbjct: 16 LSVRSQKPVYKDAGQPVETRVKDLLKRMTLHEKVLQLNQYTFGENDNPNNIGTEVKNLPA 75
Query: 86 --------HGVPRL-GLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASF 135
H P+L Q + E+ G+ P DVI G T +P + SF
Sbjct: 76 EIGSLIYLHTDPKLRNQIQRKAMEESRLGI----PILFGFDVIHGLRTVYPISLAQACSF 131
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
N L + QA A+ +G+ + +SP I+VARDPRWGRI+E GEDP+
Sbjct: 132 NPDL---VTQACGMAAKESV---LSGIDWTFSPMIDVARDPRWGRISECYGEDPY----- 180
Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
+N V G+ V+G++ + S P +++C KHY Y G D + D ++ Q +
Sbjct: 181 -LNTVFGVASVQGYQG--EKLSDPYSIAACLKHYVGYGASE-GGRDYRYTD--ISPQALW 234
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
ET+L P+E CVK G A+++M S+N ++G+P+ ++ +L + ++ +W G++V+D ++I+
Sbjct: 235 ETYLPPYEACVKAG-AATLMSSFNDISGVPATSNHYILTEILKNKWRHDGFVVSDWNAIE 293
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
++ ++ +A +++A + AG+++D Y + V + K++ + ID ++ +
Sbjct: 294 QLI--YQGVAKDRKEAAYKAFHAGVEMDMRDNIYYEYLEQLVAEKKIQMSQIDDAVARIL 351
Query: 374 TVLMRLGFFDGSPQYVSLGKQD-ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
V RLG FD P L +Q+ E+I LAA A E +VLLKN+ N LPL+S VK
Sbjct: 352 RVKFRLGLFD-EPYTKELTEQERYLQKEDIALAARLAEESMVLLKNENNLLPLSST-VKR 409
Query: 433 VAVVGPHANATVAMIGNYA------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS 486
VA++GP A + ++G +A + Y F + Y+ GC A N+
Sbjct: 410 VALIGPMAKDSANLLGAWAFKGHAEDVETIYEGMQKEFGDKVQLDYEQGC---ALDGNDE 466
Query: 487 --IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
AA + A+ +D ++ G E+ R + LP Q +L+ + + K P++LV
Sbjct: 467 SGFSAALKTAEASDVVVVCLGESKQWSGENASRSTIALPDIQEKLLLHLKQANK-PIVLV 525
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
+ S G + ++AI+ PG GG +A ++ G+ NP G+L +T+
Sbjct: 526 LSS--GRPLELIRLEPQVEAIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVTF------- 576
Query: 605 MLPLTS--MPL--------RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
PL++ +P+ RP D++G Y+ LYPFG+GLSYT F Y+
Sbjct: 577 --PLSTGQIPVYYNMRQSARPFDAMG----DYQDIPTKPLYPFGHGLSYTTFVYS----- 625
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
L+ L+ +N T++ + T N G +G + V+
Sbjct: 626 ---DAKLSSLKIRKNQKITAEVTVT-----------------------NAGKMEGKETVL 659
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
Y P + +K++ F++ + AG ++ +F + + L+ D L AGE
Sbjct: 660 WYVSDPFCSISRPMKELKFFEKHSLNAGESRVFRFEIDPMRDLSYTDATGKRFLEAGEFI 719
Query: 775 IFVGNGGVSFPI 786
+ VG ++F +
Sbjct: 720 VSVGGRKLTFEV 731
>gi|386819249|ref|ZP_10106465.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
gi|386424355|gb|EIJ38185.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
Length = 878
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 165/446 (36%), Positives = 246/446 (55%), Gaps = 46/446 (10%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q + F ++ LP RV DL++R+T+DEK+ QL + + RLG+P Y WW+E+LHGV+
Sbjct: 20 QSEKYPFQNTELPEDERVNDLINRLTVDEKIAQLLYQSPAIERLGIPAYNWWNESLHGVA 79
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA----- 160
G AT FP I AS+++ L ++ +S EARA ++ L R
Sbjct: 80 RAG----------YATVFPQSITIAASWDDELVAEVANVISDEARAKHHEYLRRGQHDIY 129
Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT+WSPNIN+ RDPRWGR ET GEDP++ G YV+GLQ N++ L
Sbjct: 130 QGLTFWSPNINIFRDPRWGRGHETYGEDPYLTGVLGTEYVKGLQGN---------NAKYL 180
Query: 220 KVSSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
KV + KH+A + G + R+ FD +++D+ ET+L F VK+G+ S+M +Y
Sbjct: 181 KVVATAKHFAVHS-----GPEPLRHEFDVAPSQRDLWETYLPAFRTLVKDGNVYSIMTAY 235
Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
NR+ G + A L + +R +W +GY+V+DC +I M H D+ E A A +K
Sbjct: 236 NRIYGEAASASNSLYS-ILRDKWGFNGYVVSDCGAIADMWKTHHVAKDAAE-ASAMAVKE 293
Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC 397
G DL+CG Y T +A+Q G + E D+D +L L +LG FD S + V K
Sbjct: 294 GCDLNCGNSYEKLT-DALQDGLITEADLDVALHRLMRARFKLGMFD-SDEKVPYAKIPFS 351
Query: 398 SDENIE---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
+ N + LA +AA++ IVLLKN+ LPL S +K +AV+GP+A+ ++ GNY G+P
Sbjct: 352 VNNNPKHKVLALKAAQKSIVLLKNENAILPL-SKNLKNIAVIGPNADNIQSLWGNYNGMP 410
Query: 455 CRYMSPIAGFS----GYANVTYKTGC 476
++ + G NV ++ G
Sbjct: 411 KNPVTVLEGIKNKVGAQVNVHFEEGA 436
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 97/322 (30%), Positives = 156/322 (48%), Gaps = 55/322 (17%)
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
+ + N + A AA +D ++ GL+ +E E + DR L LP Q +L
Sbjct: 582 SIPTENQLEKAVLAANKSDVVVLALGLNERLEGEEMKVEVEGFADGDRTSLNLPKKQVEL 641
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ +V K PV+LV+++ + I +A + NI AI+ AGYPG+EGG AIA+V+FG +NP
Sbjct: 642 MKEVVATGK-PVVLVLLNGSALSINWA--SENIPAIISAGYPGQEGGNAIANVLFGDYNP 698
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
GRLP+T+Y V LP P + GRTYK++ LYPFGYGLSYT+FKY+
Sbjct: 699 AGRLPVTYYKS--VDDLP-------PFEDYNMDGRTYKYFKKEPLYPFGYGLSYTKFKYS 749
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
L I++N + + V N G DG
Sbjct: 750 NLEIPLEIKIN--------------------------------EPIKVSVQVANEGDFDG 777
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+VV +Y + I +++GF+R+ ++ G ++++F + L +++ ++
Sbjct: 778 DEVVQLYVRDEEGSTPRPICELVGFKRIHLKKGARQKVEFTIQP-RELAMINKDDKFVIE 836
Query: 770 AGEHTIFVGNGGVSFPIHLNFN 791
G +I VG +F + + N
Sbjct: 837 PGWFSISVGGSQPNFTENKHIN 858
>gi|293372493|ref|ZP_06618877.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|299144770|ref|ZP_07037838.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|292632676|gb|EFF51270.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|298515261|gb|EFI39142.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 735
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 214/765 (27%), Positives = 362/765 (47%), Gaps = 109/765 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------------VP-RLGLPQYE 97
L+ D P RV DL+SRMTL+EKV QL + G VP +G Y
Sbjct: 29 LYKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYF 88
Query: 98 WWSEALHGV--------SNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQAV 147
+ AL S +G F D I G T +P + S+N L ++
Sbjct: 89 ETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVS 148
Query: 148 STEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G + V+G Q
Sbjct: 149 AQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKGYQ--- 199
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
DL++ ++++C KHY Y R + +++Q + +T+L P+EM VK
Sbjct: 200 ----GDDLSAEN-RMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYEMGVK 251
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
G A+++M S+N ++G+P A+P ++ + ++ W G+IV+D +I+ + ++ LA +
Sbjct: 252 AG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAAT 308
Query: 327 KEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
K++A AGL++D + Y V++G+V +D++++ + + RLG F+
Sbjct: 309 KKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERP 368
Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
+ K+ +++++AA A E +VLLKN+ TLPL K +AV+GP A
Sbjct: 369 YTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTDK--KKIAVIGPMAKNGWD 426
Query: 446 MIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS--IFAASEAAKTA 497
++G++ G + Y F+G A + Y GC A K +N A EAA+ +
Sbjct: 427 LLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGC---ATKGDNKEGFAEALEAARWS 483
Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
D ++ G ++ E+ R + LP Q +L ++ + K P++LV+++ +++ E
Sbjct: 484 DVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELNRLE 542
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL-- 613
++ AIL PG G +A ++ G+ NP G+L +T P ++ +P+
Sbjct: 543 LISD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTGQIPIYY 591
Query: 614 -RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
R G+ G YK LYPFG+GLSYT+FKY ++ +
Sbjct: 592 NRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTVTPS------------------ 632
Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
V ++ D +V NVG+ DG++ V + P +K++
Sbjct: 633 -------------VTKVKRGDRLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELK 679
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
F++ +RAG K +F + + V+ L AGE+ I V
Sbjct: 680 HFEKQLIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|313204584|ref|YP_004043241.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443900|gb|ADQ80256.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 727
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 219/740 (29%), Positives = 349/740 (47%), Gaps = 107/740 (14%)
Query: 47 LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
+ ++F F ++ LP + R+ +L+S MTLDEKV L GVPRLG+ + SE LHG+
Sbjct: 20 VSQTTFPFQNTGLPDNERLDNLLSLMTLDEKVNALST-NLGVPRLGI-RNTGHSEGLHGM 77
Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTA-----SFNESLWKKIGQAVSTEAR---AMYNLG 158
+ GPG A ++PT I A +++ L +K+ +TE R NL
Sbjct: 78 ALGGPGNWGGSERGVAKTYPTTIFPQAYGLGETWDTELIQKVADIEATEIRFYAQNANLQ 137
Query: 159 RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
+ G+ +PN ++ARDPRWGR E+ GED F+ R V +V+GLQ G++ +
Sbjct: 138 KGGMVMRAPNADLARDPRWGRTEESYGEDAFLGSRLTVAFVKGLQ---GND------PKY 188
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K +S KH+ A ++ + +FD R+ E + PF + EG + + M SYN
Sbjct: 189 WKSASLMKHFLANSNEDGRDSTSSNFDERL----FREYYSFPFYKGITEGGSRAFMASYN 244
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
NG+P +P +L + R EW +G I D ++ ++V+ H E A A +KA
Sbjct: 245 AWNGVPMTVNP-ILKKIARDEWGNNGIICTDGGALSLLVNAHHAFPTLTEGAAA-VVKAS 302
Query: 339 LDLDCGQYYTNFTG---NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLG 392
+ GQ+ NF A+++G + E +ID ++ + V ++LG D Y +G
Sbjct: 303 V----GQFLDNFRSYIYEALKKGLLTEKNIDNVIRGNFYVALKLGLLDADQSKVPYTGIG 358
Query: 393 KQDICSDENIE----LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
D S N + + + +VLLKN LPLN +K+K++AV+GP AN ++
Sbjct: 359 VTDTVSPWNKQDTKAFVRKVTAKSVVLLKNTAGLLPLNKSKIKSIAVIGPRANE--VLLD 416
Query: 449 NYAGIPCRYMSPIAGFSGYANVTYKTGCD-DVACKSNNSIFAASEAAKTADATIILAGLD 507
Y+G P +S + G + G D +V ++ + A+ AA+ AD I+ G
Sbjct: 417 WYSGTPPYAVSILQG------IKNAVGKDIEVFYAPSDEMDKATLAARKADVAIVCVGNH 470
Query: 508 -------------LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
S E++DR+ + L Q L+ V + A ++V++S A
Sbjct: 471 PYGTDARWKISPVPSDGREAVDRKSITLE--QEDLVKLVMQ-ANPKTVMVLVS--NFPFA 525
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+ N+ AIL +E G +ADV+FG +P GR TW + P+ +R
Sbjct: 526 INWSQENVPAILHVTNNSQELGNGLADVIFGDVSPAGRTTQTWVK-SITDLPPMMDYDIR 584
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
GRTY+++ LYPFG+GLSYT F+Y+ L TS
Sbjct: 585 -------HGRTYQYFKSKPLYPFGFGLSYTSFEYSGLE--------------------TS 617
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+ + T D V +N+G DG +V+ +Y P +KQ+ GF
Sbjct: 618 NPTLT-------------DSIFVSVKVKNIGKRDGDEVIQLYVSYPDSKVERPMKQLKGF 664
Query: 735 QRVFVRAGRNKRIKFVFNAC 754
+RVF+ AG++K ++ A
Sbjct: 665 KRVFIPAGKSKTVEIPLKAS 684
>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 942
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 230/810 (28%), Positives = 359/810 (44%), Gaps = 146/810 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D + R++DL+S+MTL+EK Q+ +G R+ LP EW W
Sbjct: 52 VYEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGA 110
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 111 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVES 170
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWG 224
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + VRG+Q H + +V++ KH+ AY +
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGMQ----HSH---------QVAATGKHFVAYSNNKGAR 271
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + PF+ +KE VM SYN +G+P L +RG
Sbjct: 272 EGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRG 331
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E GY+V+D D+++ + H D KE AV Q+++AGL++ C Y
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 390
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
V++G + E I+ ++ + V +G FD +P L D + EN LA +A+RE
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLAGADKEVEKAENESLALQASRES 449
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYA 468
+VLLKN+ N LPL+ VK +AV GP+A+ + +Y + + + G G A
Sbjct: 450 LVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKA 509
Query: 469 NVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAES 514
V Y GCD V S I A E A+ AD +++ G E+
Sbjct: 510 EVLYTKGCDLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGEN 569
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q +L+ V K PV+LV+++ + I +A + + IL A YPG +
Sbjct: 570 KSRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPVILEAWYPGSK 626
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFY 629
GG A+ADV+FG +NPGG+L +T+ V +P + P +P +D PG
Sbjct: 627 GGTAVADVLFGDYNPGGKLTVTFPKS--VGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRV 683
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
NG LY FGYGLSYT F+Y+ + + K I N C+
Sbjct: 684 NG-ALYSFGYGLSYTTFEYSDIEISPKVITPNQKATVRCK-------------------- 722
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
N G G +VV +Y + TY K + GF+R+ ++ G K +
Sbjct: 723 ------------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVV 770
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
F + K L ++D ++ G+ +I VG
Sbjct: 771 FTLDR-KQLELLDKHMEWVVEPGDFSIMVG 799
>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
17393]
gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 879
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 172/459 (37%), Positives = 239/459 (52%), Gaps = 47/459 (10%)
Query: 52 FLFCDSSLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
FL C S PY R DLV R+TL+EK + + + +PRLG+ Y+WW+EALH
Sbjct: 34 FLSC-SQPPYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALH 92
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------L 157
GV G AT FP I ASFN L + A+S EARA L
Sbjct: 93 GVGRAGL----------ATVFPQAIGMGASFNNELLYDVFTAISDEARAKNTEFSKEGGL 142
Query: 158 GR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
R GLT W+PNIN+ RDPRWGR ET GEDP++ + + VRGLQ EG +
Sbjct: 143 KRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEGEKYD----- 197
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMC 275
K+ +C KHYA + W +R+ F+A + +D+ ET+L F+ V++ VMC
Sbjct: 198 ---KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMC 251
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-SKEDAVAQT 334
+YNR G P C +LL +R EW +V+DC +I + D K+ A A+
Sbjct: 252 AYNRFEGEPCCGSNRLLMHILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKA 311
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG 392
+ +G D++CG Y + AV++G + E ID SLK L LG D Q + +
Sbjct: 312 VLSGTDIECGDSYGSLP-EAVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIP 370
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
+ S E+ ELA ARE +VLL+N+Q+ LPLN K VAVVGP+AN +V GNY G
Sbjct: 371 YSVVDSKEHRELALRMARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNG 428
Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
P ++ + G Y + + Y+ GCD + + S+F
Sbjct: 429 FPSHTITLLEGIREYLPESQIIYEPGCDLTSDVTLQSVF 467
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 126/295 (42%), Gaps = 56/295 (18%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
K AD I G+ +VE E + DRE + LP Q++L+ AE+ K +V
Sbjct: 614 KEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLL---AELKKAGKKIV 670
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
++ G IA + AIL A YPG+ GG AIA+V+FG +NP GRLP+T+Y
Sbjct: 671 FVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYK----- 725
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
++ L + RTY++ L+PFG+GLSYT F+Y S
Sbjct: 726 ----STSQLPGFEDYSMKERTYRYMTEAPLFPFGHGLSYTTFRYGDASL----------- 770
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
N D +T +L + NVG DG +VV VY + P +
Sbjct: 771 ----NTQEVKDGEQT----ILT------------IPVSNVGEYDGEEVVQVYLRRPGDKE 810
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
+ F+R + G + + + D NT+ P G++ I G
Sbjct: 811 GPS-HALRAFKRANIAKGATSNVTVSLSK-EDFEWFDTETNTMRPIEGDYEILYG 863
>gi|346226088|ref|ZP_08847230.1| glycoside hydrolase family 3 domain protein [Anaerophaga
thermohalophila DSM 12881]
Length = 749
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 223/731 (30%), Positives = 343/731 (46%), Gaps = 106/731 (14%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGL---PQYEWWSEALHGV 106
S+ F + L R+ DL+SRMTLDEKV L VPRLG+ P E + HGV
Sbjct: 50 ESYPFQNPELDSEARIDDLLSRMTLDEKVSALSTDP-SVPRLGVKGAPHIEGY----HGV 104
Query: 107 SNVGPGT---HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN---LGRA 160
+ GP D+ +P T+FP A++N L + G+ S EAR ++ + +
Sbjct: 105 AMGGPANWAPKGDEAVP-TTTFPQAYGMGATWNPELIRLAGEIESIEARYIFQNPEIAKG 163
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
GL +PN ++ RDPRWGR E GEDPF+VG A + +GLQ + + +
Sbjct: 164 GLVVRAPNADLGRDPRWGRTEECFGEDPFLVGTSATAFTKGLQGD---------DDQYWR 214
Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
+S KH+ A +N + FD ++ + +F R F EG +++ M +YN +
Sbjct: 215 TASLLKHFLANSNENGRESSSSDFDMQLYHEYYGASFRRAF----IEGGSNAYMAAYNAI 270
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
NG+P+ + + W + G D Q++V HK+ D A +KAGL+
Sbjct: 271 NGVPAHVH-DMHKEITERMWGVDGIKCTDGGGYQLLVYGHKYY-DDLYLAAEGVIKAGLN 328
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQD- 395
Y G A+ G + E DID+ L+ +Y V+++LG D PQ Y ++G+
Sbjct: 329 QFLDNYREGVYG-ALAHGYITEADIDEVLRGVYRVMIKLGQLD--PQEKVPYSAIGRDGK 385
Query: 396 ---ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
+ ++ + A ARE IVLLKN+ TLPLN+ K+ VAV+G A+ ++ Y+G
Sbjct: 386 PAPWTTQKHKDAALRMARESIVLLKNNNKTLPLNADKLNKVAVIGYLAD--TVLLDWYSG 443
Query: 453 IPCRYMSPIAGFSGYANVTYKTGCDD-VACKSNNSIFAASEAAKTADATIILAG------ 505
+P ++P+ G + K G D V +N AA EAA AD I++ G
Sbjct: 444 LPPYRITPLEG------IREKLGNDSKVLYAPDNDYNAAVEAASEADVAIVILGNYPTCN 497
Query: 506 -------LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
D + E++DR+ L L L+ V E A I V+ S+ I +++
Sbjct: 498 SEIWADCPDPGMGREAIDRKTLRLT--DEYLVKLVME-ANPNTIFVLQSSFPYAINWSQ- 553
Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
N+ AIL + G+E G A+ADV+FG +NPGG+L TW + Q+ + +R
Sbjct: 554 -QNVPAILHLTHNGQETGSALADVLFGDYNPGGKLTQTWPKSE-DQLPDMMEYDIR---- 607
Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
G TY ++ LYPFG+GLSYT F + +S K + ++D
Sbjct: 608 ---KGHTYMYFEDKPLYPFGHGLSYTTFAWEDISINKPV--------------VSAD--- 647
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
D+ V +N G G +VV +Y+ P K + GF+RV
Sbjct: 648 -------------DEEVIITVKLKNTGDVKGDEVVQLYASFPESTVRRPAKALKGFKRVT 694
Query: 739 VRAGRNKRIKF 749
+ G K+I+
Sbjct: 695 LEPGEKKKIEI 705
>gi|29347190|ref|NP_810693.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339089|gb|AAO76887.1| periplasmic beta-glucosidase precursor [Bacteroides
thetaiotaomicron VPI-5482]
Length = 950
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 228/749 (30%), Positives = 354/749 (47%), Gaps = 109/749 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ D+SLP RV+ L++ MT ++K++ + G G+P L +P EA+HG S
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 223
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GAT FP + A++N L +++ + E A N +A WSP ++V
Sbjct: 224 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
A+D RWGR ET GEDP +V + +++G Q SR L + KH+ +
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 315
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++E++M E L PF ++ D S+M +Y+ G+P +L
Sbjct: 316 GAP-LGGRDSH--DIGLSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKEL 372
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
L Q +R EW +G+IV+DC +I + + A K +A Q L AG+ +CG Y N
Sbjct: 373 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKE 432
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
A + G++ D+D + + + R F+ +P L + I SD + E+A
Sbjct: 433 VIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 491
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
+AARE IV+L+N N LPL S ++T+AV+GP A+ G+Y +P + S + G
Sbjct: 492 QAARESIVMLENKDNLLPL-SKTLRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 548
Query: 465 SG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
G V Y+ GCD N I A +AA +D I++ G + EA
Sbjct: 549 KGAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTC 607
Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ D L LPG Q +L+ V K PVIL++ + DI A + KAIL P
Sbjct: 608 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 664
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+EGG A+ADV+FG +NP GRLP+T+ +V LPL + GR Y++ +
Sbjct: 665 GQEGGPAMADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 715
Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
LY FG+GLSYT F+Y+ L K+Q N N
Sbjct: 716 EYYPLYRFGFGLSYTSFEYSNL-----------KIQEKANGN------------------ 746
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
E + +NVGS G +V +Y T + ++ F R+ ++ G +K + F
Sbjct: 747 -----VEVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFARIHLQPGESKTVSF 801
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+++++ + ++ GE I VG
Sbjct: 802 EMTPY-DISLLNDRMDRVVEKGEFKIMVG 829
>gi|423300729|ref|ZP_17278753.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
CL09T03C10]
gi|408472616|gb|EKJ91142.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
CL09T03C10]
Length = 735
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 212/767 (27%), Positives = 362/767 (47%), Gaps = 113/767 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ D+ P RV DL+SRMTL+EKV QL + G E E + +G
Sbjct: 29 LYKDAKAPIEKRVDDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGE---EVKKVPAEIGSL 85
Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
+F+ D I G T +P + S+N L ++
Sbjct: 86 IYFETNPELRNNMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQAC 145
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G + VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYANGVFGAASVRGYQ 199
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
+N + N +V++C KHY Y R + +++Q + +T+L P++M
Sbjct: 200 G----DNMSAEN----RVAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYKM 248
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
VK G A+++M S+N ++G+P A+P + + ++ W G+IV+D +I+ + ++ L
Sbjct: 249 GVKAG-AATLMSSFNDISGVPGSANPYTMTEILKNRWRHDGFIVSDWGAIEQL--KNQGL 305
Query: 324 ADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
A +K++A AGL++D + Y V++GKV +D++++ + + RLG F
Sbjct: 306 AATKKEAARHAFTAGLEMDMMSHAYDRHLQELVEEGKVSMAQVDEAVRRVLLLKFRLGLF 365
Query: 383 DGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
+ V+ K+ +++++AA A E +VLLKN+ N LPL A K +AV+GP A
Sbjct: 366 ERPYTPVTTEKERFLRPQSMDIAARLAAESMVLLKNENNVLPL--ADKKKIAVIGPMAKN 423
Query: 443 TVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAAK 495
++G++ G + Y A F+G A + Y GC+ N FA A AA+
Sbjct: 424 GWDLLGSWRGHGKDTDVVMLYDGLAAEFAGKAELRYALGCNTKG--DNREGFAEALGAAR 481
Query: 496 TADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAF 555
+D ++ G ++ E+ R + LP Q +L ++ +V K PV+L++++ +++
Sbjct: 482 WSDVVVLCLGEMMTWSGENASRSSIALPQMQEELAKELKKVGK-PVVLILVNGRPLELNR 540
Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL 613
E ++ AIL PG G +A ++ G+ NP G+L +T+ P ++ +P+
Sbjct: 541 LEPVSD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMTF---------PYSTGQIPI 589
Query: 614 ---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
R G+ G YK LYPFG+GLSYT+FKY T+ + K++ L
Sbjct: 590 YYNRRKSGRGHQG-FYKDMTSDPLYPFGHGLSYTEFKYG------TVTPSATKVKRGEKL 642
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
+ +V N+G+ DG++ V + P +K+
Sbjct: 643 SA-------------------------EVTVTNIGARDGAETVHWFISDPYCSITRPVKE 677
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
+ F++ ++AG K +F + + V+ L GE+ I V
Sbjct: 678 LKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLETGEYNIHV 724
>gi|333381842|ref|ZP_08473521.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829771|gb|EGK02417.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
BAA-286]
Length = 861
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 162/441 (36%), Positives = 246/441 (55%), Gaps = 39/441 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D++L R +DL+SR+TL EKV +GD + V RLG+ ++ WWSEALHGV+N G
Sbjct: 23 YKDANLTPEERAQDLLSRLTLKEKVGLMGDNSIEVTRLGVKKFAWWSEALHGVANQG--- 79
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA---------GLTY 164
G T FP I ASFN+ L + A+S EARA ++ GL+
Sbjct: 80 -------GVTVFPEPIGMAASFNDELLYHVFDAISDEARARFHFREKKGDERRQDNGLSV 132
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
W+PN+N+ RDPRWGR ET GEDP++ R ++ V GLQ + +++ K+ +C
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGISVVNGLQGPK--------DAKYKKLLAC 184
Query: 225 CKHYAAYDVDNWKGVDRYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
KHYA + W +R+ + + + + ET++ F++ V++ D S VMC+Y+R +
Sbjct: 185 AKHYAVHSGPEW---NRHVLNLNNLDNRHLWETYMPAFQVLVQKADVSQVMCAYHRQDDD 241
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
P C + LL + +R EW +V+DC +I +HK +D+ AV L AG D++C
Sbjct: 242 PCCGNNHLLKRILRDEWGFKRMVVSDCGAIADFYTSHKVSSDALHSAVKGVL-AGTDVEC 300
Query: 344 GQYYT-NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDE 400
G YT + +AV +G + E DIDKS+ L T RLG FD + + ++ I +
Sbjct: 301 GFGYTYHELVDAVSRGLIYEADIDKSVLRLLTERFRLGDFDDNSIVPWANIPDTIINCKK 360
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ LA E AR+ + LL+N N LPL+S K +AV+GP+A+ M GNY GIP + ++
Sbjct: 361 HQALALEMARQSMTLLQNKNNILPLSSK--KKIAVIGPNADDAKLMWGNYNGIPVKTVTI 418
Query: 461 IAGFSGYA--NVTYKTGCDDV 479
+ G A ++ Y+ GCD V
Sbjct: 419 LEGIKSIAGKDIFYEKGCDIV 439
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 22/162 (13%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
K D + G+ +E E + DR D+ LP Q I + + K ++
Sbjct: 597 KDIDVVVFAGGISGELEGEEMPIEMPGFKGGDRTDIELPASQRNCIKALKKAGKR---VI 653
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
+++ G I + + +AIL A Y G+ GG+AIA+V+FGK+NP G+LPIT+Y +
Sbjct: 654 MVNCSGSAIGLMPESESCEAILQAWYGGQSGGQAIAEVLFGKYNPSGKLPITFYKN--ID 711
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
LP + GRTY++ L+PFGYGLSYT F
Sbjct: 712 QLP-------DFEEYDMKGRTYRYLEDKPLFPFGYGLSYTTF 746
>gi|237721771|ref|ZP_04552252.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
gi|229448640|gb|EEO54431.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
Length = 735
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 213/765 (27%), Positives = 362/765 (47%), Gaps = 109/765 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------------VP-RLGLPQYE 97
L+ D P RV DL+SRMTL+EK+ QL + G VP +G Y
Sbjct: 29 LYKDPKAPIEKRVNDLLSRMTLEEKMMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYF 88
Query: 98 WWSEALHGV--------SNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQAV 147
+ AL S +G F D I G T +P + S+N L ++
Sbjct: 89 ETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVS 148
Query: 148 STEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G + V+G Q
Sbjct: 149 AQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKGYQ--- 199
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
DL++ ++++C KHY Y R + +++Q + +T+L P+EM VK
Sbjct: 200 ----GDDLSAEN-RMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYEMGVK 251
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
G A+++M S+N ++G+P A+P ++ + ++ W G+IV+D +I+ + ++ LA +
Sbjct: 252 AG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAAT 308
Query: 327 KEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
K++A AGL++D + Y V++G+V +D++++ + + RLG F+
Sbjct: 309 KKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERP 368
Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
+ K+ +++++AA A E +VLLKN+ TLPL K +AV+GP A
Sbjct: 369 YTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTDK--KKIAVIGPMAKNGWD 426
Query: 446 MIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS--IFAASEAAKTA 497
++G++ G + Y F+G A + Y GC A K +N A EAA+ +
Sbjct: 427 LLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGC---ATKGDNKEGFAEALEAARWS 483
Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
D ++ G ++ E+ R + LP Q +L ++ + K P++LV+++ +++ E
Sbjct: 484 DVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELNRLE 542
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL-- 613
++ AIL PG G +A ++ G+ NP G+L +T P ++ +P+
Sbjct: 543 LISD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTGQIPIYY 591
Query: 614 -RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
R G+ G YK LYPFG+GLSYT+FKY ++ +
Sbjct: 592 NRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTVTPS------------------ 632
Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
V ++ D +V NVG+ DG++ V + P +K++
Sbjct: 633 -------------VTKVKRGDRLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELK 679
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
F++ +RAG K +F + + V+ L AGE+ I V
Sbjct: 680 HFEKQLIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|363583088|ref|ZP_09315898.1| b-glucosidase [Flavobacteriaceae bacterium HQM9]
Length = 779
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 223/774 (28%), Positives = 357/774 (46%), Gaps = 118/774 (15%)
Query: 63 IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ---YEWWSEALHGVSNVGPGTHFD--- 116
++++ L+++MTLD+KV QL LP+ E + NV + D
Sbjct: 62 LKIEALIAKMTLDQKVGQLSLRGTSSRTKLLPEALKKEVKQGKIGAFLNVMNRAYVDELQ 121
Query: 117 -----------------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
DVI G T FP + AS++ K + + EA +
Sbjct: 122 RIAVEESPLGIPLIFARDVIHGFKTIFPIPLGLAASWDAETAKAAARVSAIEASSY---- 177
Query: 159 RAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
G+ + ++P +++ +D RWGRI E+PGEDP++ A YV G QD DL S+
Sbjct: 178 --GIRWTFAPMLDITQDSRWGRIAESPGEDPYLASVLAKAYVEGFQD-------NDL-SK 227
Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
+++C KH+ Y R + A + E + T+L+PFE + G A++VM S+
Sbjct: 228 STSLAACAKHFIGYGAAIG---GRDYNTAIIHEPLLRNTYLKPFEAAIDAG-AATVMTSF 283
Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
N +NG+P+ + LLN+ +R E HG++V+D +SI M+ H + A++++ A A + A
Sbjct: 284 NELNGVPASGNKWLLNEVLRKELGFHGFVVSDWNSITEMIA-HSY-AENEKHAAALGINA 341
Query: 338 GLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDI 396
GLD++ + Y N+ +++ K+ ET +D + + V RL F+ P + +
Sbjct: 342 GLDMEMTSKSYENYIKQLLKEKKITETQLDFLVSNILRVKFRLNLFE-KPYRLKKHTGNF 400
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIP 454
S E+++LA AA VLLKN+Q LPLN K+ VAV+GP ANA +G + G
Sbjct: 401 YSQEHMDLAKNAAIRSSVLLKNNQGLLPLN--KLTKVAVIGPLANAPHEQLGTWTFDGDQ 458
Query: 455 CRYMSPIAGF-SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
++P+ F + N + +S + A A+++D + G + + E
Sbjct: 459 AYSVTPLQAFKNNKVNFNFAETLTYSRDQSTKAFDKALRTAQSSDVILFFGGEEAILSGE 518
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
+ R + LPG Q LI +A+ K P++ VIM+ G I + + AIL +PG
Sbjct: 519 AHSRAHINLPGQQEALIKALAKTGK-PIVFVIMA--GRPITLTKVIDQVDAILMTWHPGT 575
Query: 574 EGGRAIADVVFGKFNPGGRLPITW----------YN----------GDYVQMLPLTSMPL 613
GG AI ++++GK PGGRLPITW YN +VQM S+P+
Sbjct: 576 MGGEAIYEMLWGKNEPGGRLPITWPKTSGQLPLFYNHKNTGRPPSIKSFVQM---DSIPV 632
Query: 614 RP-VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
SLG P +PFGYGL YT FKY+ + + T
Sbjct: 633 GAWQSSLGNTSHYLDVGFTPQ-FPFGYGLGYTTFKYSDVKISTT---------------- 675
Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
+ ++ E V N G G+++V +Y + +K++
Sbjct: 676 ---------------SITKNESLEVSVTLTNTGDRAGAELVQLYVQDVVGSLTRPVKELK 720
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA---GEHTIFVGNGGVS 783
GF+ + + G + +KF NA N + + NTL P GE IFVG+ S
Sbjct: 721 GFKHIHLDKGASTIVKFTLNA----NDLMFVNNTLKPVLEKGEFNIFVGSSSQS 770
>gi|336417087|ref|ZP_08597416.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
3_8_47FAA]
gi|335936712|gb|EGM98630.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
3_8_47FAA]
Length = 954
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 226/749 (30%), Positives = 358/749 (47%), Gaps = 109/749 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ D+SLP RV+ L++ MT ++K++ + G G+P L +P EA+HG S
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GAT FP + A++N L +++ + E A N +A WSP ++V
Sbjct: 228 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
A+D RWGR ET GEDP +V + +++G Q SR L + KH+ +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 319
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++E++M E L PF ++ D S+M +Y+ GIP +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTEL 376
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
L Q +R EW +G+IV+DC +I + + A K +A Q L AG+ +CG Y N
Sbjct: 377 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKE 436
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
A + G++ ++D + + + + R F+ +P L + I SD + E+A
Sbjct: 437 VIQAAKDGRIDMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 495
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
+AARE IV+L+N +N LPL + ++T+AV+GP A+ G+Y +P + S + G
Sbjct: 496 QAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 552
Query: 465 S----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
V Y+ GCD N I A +AA +D +++ G + EA
Sbjct: 553 KEAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTC 611
Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ D L LPG Q +L+ V K PVIL++ + DI A + KAIL P
Sbjct: 612 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 668
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+EGG A+ADV+FG +NPGGRLP+T+ +V LPL + GR Y++ +
Sbjct: 669 GQEGGPAMADVLFGDYNPGGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 719
Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
LY FG+GLSYT F+Y+ L K+Q N N T A+
Sbjct: 720 EYYPLYRFGFGLSYTSFEYSDL-----------KIQEKPNGNVTVQAT------------ 756
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
+N+GS G +V +Y T + ++ F R++++ G +K + F
Sbjct: 757 -----------VKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDFDRIYLQPGESKTVSF 805
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+++++ + ++ GE I VG
Sbjct: 806 ELTPY-DISLLNDHMDRVVEKGEFKICVG 833
>gi|383113360|ref|ZP_09934132.1| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
gi|382948727|gb|EFS32364.2| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
Length = 954
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 226/749 (30%), Positives = 358/749 (47%), Gaps = 109/749 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ D+SLP RV+ L++ MT ++K++ + G G+P L +P EA+HG S
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GAT FP + A++N L +++ + E A N +A WSP ++V
Sbjct: 228 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
A+D RWGR ET GEDP +V + +++G Q SR L + KH+ +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 319
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++E++M E L PF ++ D S+M +Y+ GIP +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTEL 376
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
L Q +R EW +G+IV+DC +I + + A K +A Q L AG+ +CG Y N
Sbjct: 377 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKE 436
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
A + G++ ++D + + + + R F+ +P L + I SD + E+A
Sbjct: 437 VIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 495
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
+AARE IV+L+N +N LPL + ++T+AV+GP A+ G+Y +P + S + G
Sbjct: 496 QAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 552
Query: 465 S----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
V Y+ GCD N I A +AA +D +++ G + EA
Sbjct: 553 KEAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTC 611
Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ D L LPG Q +L+ V K PVIL++ + DI A + KAIL P
Sbjct: 612 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 668
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+EGG A+ADV+FG +NPGGRLP+T+ +V LPL + GR Y++ +
Sbjct: 669 GQEGGPAMADVLFGDYNPGGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 719
Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
LY FG+GLSYT F+Y+ L K+Q N N T A+
Sbjct: 720 EYYPLYRFGFGLSYTSFEYSDL-----------KIQEKPNGNVTVQAT------------ 756
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
+N+GS G +V +Y T + ++ F R++++ G +K + F
Sbjct: 757 -----------VKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDFDRIYLQPGESKTVSF 805
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+++++ + ++ GE I VG
Sbjct: 806 ELTPY-DISLLNDHMDRVVEKGEFKICVG 833
>gi|423226659|ref|ZP_17213124.1| hypothetical protein HMPREF1062_05310 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392628186|gb|EIY22220.1| hypothetical protein HMPREF1062_05310 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 750
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 219/765 (28%), Positives = 359/765 (46%), Gaps = 114/765 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDF-AHGVPRLGLPQYEWWSEALHGV---------------- 106
R++ L+ +MTL+EK+ Q+ P L + ++ +
Sbjct: 35 RIEALLGKMTLEEKIGQMNQLHCENFPYLKTETRKGRVGSVMSITDPNIFNEVQRIAVED 94
Query: 107 SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
S +G P + DVI G T FP + ASFN + + + +TEA A AG+ +
Sbjct: 95 SRLGIPLINARDVIHGFKTIFPIPLGQAASFNPEIAETGARIAATEASA------AGIRW 148
Query: 165 -WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
++P I++ DPRWGRI E GEDP +V + V ++G Q + LN P +++
Sbjct: 149 TFAPMIDITHDPRWGRIAEGFGEDPLLVSQMGVAAIKGFQ-------GSSLN-HPTSIAA 200
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
C KH+A Y R + +TE+ +LRPFE V G A+++M ++N +GI
Sbjct: 201 CAKHFAGYGASEG---GRDYNSTYITERQFRNLYLRPFEAAVNAG-AATLMTAFNDNDGI 256
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD- 342
PS A+P LL +R EW+ G +V+D S+ M+ H F D KE A+ T AG D++
Sbjct: 257 PSSANPFLLKDVLRNEWNYRGTVVSDWASVSEMI-RHGFCEDEKEAALKAT-NAGTDIEM 314
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
+ Y +++GKV ID +++ + + RLG F+ P K+ + +
Sbjct: 315 VSETYIKHLPQLIKEGKVSMETIDNAVRNILRLKFRLGLFE-HPYIADQRKETFYRPDFL 373
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG--------NYAGIP 454
E A AA + VLLKN++ TLP+ S +KT+ V GP A+A +G +Y+ P
Sbjct: 374 EAAQTAAEQSAVLLKNERGTLPIQS-NIKTILVTGPLADAPHEQLGTWVFDGDASYSQTP 432
Query: 455 CRYMSPIAGFSGYANVTYKTGCD---DVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
+ + I+G S V Y G + D A N + E A+ AD + G + +
Sbjct: 433 LQALRRISGDS--IKVLYAPGLNYSRDTATSQFNKVV---ELAREADLILAFVGEEAILS 487
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ +L L G Q++L+++++E K P++ V+M+ + I E N + A+L+A +P
Sbjct: 488 GEAHCLANLNLQGAQSRLLHRLSETGK-PLVTVVMAGRPLTIG-REVNIS-DALLYAFHP 544
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLR---------PV 616
G GG A+A+++FGK P G+LP+T+ +P+ T P PV
Sbjct: 545 GTMGGPALANLLFGKVVPSGKLPVTF--PKETGQIPIYYNHTSTGRPASGSEKNIFTIPV 602
Query: 617 DSLGYPGRTYKFY---NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
+ FY L+PFGYGLSYT F Y+ L + T Q+ RN
Sbjct: 603 GAEQTSLGNTSFYLDAGKDPLFPFGYGLSYTTFAYSNLQLSST--------QYTRN---- 650
Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
+ D N G TDG+++ +Y + A +K++
Sbjct: 651 -------------------EVIIITFDLTNTGKTDGTEIAQLYFRDLAASVTRPVKELAA 691
Query: 734 FQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
F+R+ ++AG + I+ K L+ +YA + + G+ +++G
Sbjct: 692 FERIHLKAGETRHIRMEL-PVKQLSFWNYAMDYCVEPGKFDLWIG 735
>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
CL03T00C23]
gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
CL03T12C37]
Length = 942
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 227/808 (28%), Positives = 358/808 (44%), Gaps = 142/808 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D S P R+++L+ +MTLDEK Q+ +G R+ LP EW W
Sbjct: 52 VYEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKDGIGA 110
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 111 IDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVES 170
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWG 224
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + VRGLQ H + +V++ KH+AAY +
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGLQ----HNH---------QVAATGKHFAAYSNNKGAR 271
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + PF+ ++E VM SYN +GIP L +RG
Sbjct: 272 EGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRG 331
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E GY+V+D D+++ + H D KE AV Q+++AGL++ C +
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLREL 390
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGI 413
V++G + E I+ ++ + V +G FD Q G +++ +EN +A +A+ E +
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKEENEAIALQASHESV 450
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----GYAN 469
VLLKN LPL+ K +AV GP+AN + +Y + + + G A
Sbjct: 451 VLLKNADELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSKAE 510
Query: 470 VTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAESL 515
V Y GCD V S I A E A+ AD +++ G E+
Sbjct: 511 VLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCGENK 570
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
R L LPG Q QL+ + K PV+L++++ + I +A + + AIL A YPG +G
Sbjct: 571 SRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKG 627
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--YN 630
G A+AD++FG +NPGG+L +T+ V +P + P +P +D PG T N
Sbjct: 628 GTALADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSRIN 684
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
G LYPFGYGLSYT F+Y+ L T + T + S T
Sbjct: 685 G-ALYPFGYGLSYTTFEYSDLDITPRV--------------ITPNESAT----------- 718
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
++ N G G +VV +Y + TY K + GFQR+ + G + + F
Sbjct: 719 ------VRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFT 772
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ K L ++D ++ G+ + G
Sbjct: 773 IDR-KHLELLDADMKWVVEPGDFVLMAG 799
>gi|299149395|ref|ZP_07042452.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298512582|gb|EFI36474.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 950
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 226/749 (30%), Positives = 358/749 (47%), Gaps = 109/749 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ D+SLP RV+ L++ MT ++K++ + G G+P L +P EA+HG S
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 223
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GAT FP + A++N L +++ + E A N +A WSP ++V
Sbjct: 224 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
A+D RWGR ET GEDP +V + +++G Q SR L + KH+ +
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 315
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++E++M E L PF ++ D S+M +Y+ GIP +L
Sbjct: 316 GAP-LGGRDSH--DIGLSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTEL 372
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
L Q +R EW +G+IV+DC +I + + A K +A Q L AG+ +CG Y N
Sbjct: 373 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKE 432
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
A + G++ ++D + + + + R F+ +P L + I SD + E+A
Sbjct: 433 VIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 491
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
+AARE IV+L+N +N LPL + ++T+AV+GP A+ G+Y +P + S + G
Sbjct: 492 QAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 548
Query: 465 S----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
V Y+ GCD N I A +AA +D +++ G + EA
Sbjct: 549 KEAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTC 607
Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ D L LPG Q +L+ V K PVIL++ + DI A + KAIL P
Sbjct: 608 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 664
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+EGG A+ADV+FG +NPGGRLP+T+ +V LPL + GR Y++ +
Sbjct: 665 GQEGGPAMADVLFGDYNPGGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 715
Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
LY FG+GLSYT F+Y+ L K+Q N N T A+
Sbjct: 716 EYYPLYRFGFGLSYTSFEYSDL-----------KIQEKPNGNVTVQAT------------ 752
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
+N+GS G +V +Y T + ++ F R++++ G +K + F
Sbjct: 753 -----------VKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDFDRIYLQPGESKTVSF 801
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+++++ + ++ GE I VG
Sbjct: 802 ELTPY-DISLLNDHMDRVVEKGEFKICVG 829
>gi|441498970|ref|ZP_20981160.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
gi|441437215|gb|ELR70569.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
Length = 752
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 218/775 (28%), Positives = 364/775 (46%), Gaps = 127/775 (16%)
Query: 64 RVKDLVSRMTLDEKVQQL----GDFAHGVPRLGLPQYEWWSEAL-----------HGVSN 108
+++ L+ +MTL+EKV QL GD + P + + + + + + HG +
Sbjct: 32 KIEALIRQMTLEEKVGQLNFYVGDLFNTGPTVRTTESDKFDQLIREGKLTGLFNVHGAAY 91
Query: 109 VG--------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
G P DVI G T FP + + AS++ +K + + E+ A
Sbjct: 92 TGRLQKIAVEESRLGIPLLFGADVIHGFKTVFPIPLASAASWDLEAIEKAERVAAIESTA 151
Query: 154 MYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
AG+ + ++P ++++RDPRWGRI E GEDPF+ A VRG Q+ ++ T
Sbjct: 152 ------AGINFNFAPMVDISRDPRWGRIAEGAGEDPFLGSEVAKARVRGFQE----QSLT 201
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
D P +++C KH+AAY + G D D ++E+ + E +L P++ + G A++
Sbjct: 202 D----PQTMAACVKHFAAYGAPD-GGRDYNTVD--MSERLLREMYLPPYKAGIDAG-AAT 253
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
+M S+N +NGI + LL +R EW G +V+D S+ MV + A + +A
Sbjct: 254 IMTSFNELNGIAASGSQFLLRDILRKEWGFKGMVVSDWQSVNEMVAHGN--AANNAEAAM 311
Query: 333 QTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSL 391
LKAG+D+D G Y V +GK+ +D++++ + + LG FD +Y
Sbjct: 312 MALKAGVDMDMMGDVYLEEVPRLVNEGKLDIKFVDEAVRNVLKLKYDLGLFDDPYRYSDT 371
Query: 392 --GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
K +I + E++E A + A++ IVLLKN + LPL + + T+AV+GP A+ M G
Sbjct: 372 IREKNNIRAVEHLEAARDVAKKSIVLLKNKEKLLPLKKS-IGTIAVIGPLADNQADMNGT 430
Query: 450 Y-----AGIPCRYMSPIA-GFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
+ A P ++ I SG + V Y GC+ + +S + A AK AD I+
Sbjct: 431 WSFFGEAQHPITFLQGIKDAVSGQSRVLYAEGCN-LYDRSKDKFAEAVNIAKKADVVILA 489
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
G + E+ R D+ LPG Q +L+ ++A+ K PV+ ++MS +D+++ + NI
Sbjct: 490 VGESAVMNGEAGSRSDIRLPGIQPELVMEIAKTGK-PVVALVMSGRPLDLSW--LDENIP 546
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW-------------------YNGDYVQ 604
AIL G E G A ADV+FG +NP G+LP+T+ Y GDY +
Sbjct: 547 AILEVWTLGSEAGNAAADVLFGDYNPSGKLPVTFPRNVGQVPIYYNHKNTGRPYEGDYSE 606
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
P+ Y + N P LYPFGYGLSY+ F+Y+ ++ +
Sbjct: 607 ----------PLSERIYRSKYRDVQNSP-LYPFGYGLSYSTFEYSDITLS---------- 645
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
+ L + V N G DG +VV +Y +
Sbjct: 646 ---------------------ADTLNAGESITASVSITNEGPYDGEEVVQLYIRDLVGSV 684
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+K++ GF+++ ++ G ++ F + L+ + + G+ IF+G+
Sbjct: 685 TRPVKELKGFKKLMIKNGETVKVDFTL-SSDDLSFYRHDMTYGIEPGDFQIFIGS 738
>gi|386821036|ref|ZP_10108252.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
gi|386426142|gb|EIJ39972.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
19592]
Length = 725
Score = 269 bits (687), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 228/730 (31%), Positives = 343/730 (46%), Gaps = 104/730 (14%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS---N 108
+ F + + RV +L+S MT+DEKV L VPRLG+ + E LHG++
Sbjct: 30 YPFQNPKIATEKRVDNLLSLMTIDEKVNALSTNPE-VPRLGV-KGTGHVEGLHGLALGGP 87
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR-AMYNLGRAGLTYWSP 167
G G + +P T+FP +++ L K+I + EAR A+ GR GL +P
Sbjct: 88 AGWGGKGKEPLP-TTTFPQAYGLGETWDTELLKEIAKIEGYEARYALQKYGRGGLVIRAP 146
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
N ++ARDPRWGR E+ GED F G+ V +V+GLQ + + +S KH
Sbjct: 147 NADLARDPRWGRTEESYGEDAFFNGKMTVAFVKGLQGSD---------KTYWQTASLMKH 197
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
+ A ++ + FD R+ E + PF+M V EG + + M +YN+VNGIP+
Sbjct: 198 FLANSNEDGRTYTSSDFDERL----WREYYALPFKMGVVEGGSRAYMAAYNKVNGIPAMV 253
Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
P L + TV EW +G I D + ++++ +HK+ D K A T+KAG++ Q+
Sbjct: 254 HPMLKDITV-DEWGQNGIICTDGGAYKLLLSDHKYYKD-KYLGAAATIKAGIN----QFL 307
Query: 348 TNFTG---NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD--- 399
+FT A+ G + E D+D+ L+ Y V+++LG D S Y +G + D
Sbjct: 308 DDFTEGVYGALANGYLTEADLDEVLRGNYRVMIKLGMLDSSANNPYAKIGAEADSMDPWE 367
Query: 400 --ENIELAAEAAREGIVLLKND--QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
+ +LA EA + IVLLKND + LPL KVK +A++G +A+A ++ Y+G P
Sbjct: 368 LEAHKKLALEATEKSIVLLKNDPAKRLLPLQKKKVKKIAIIGEYADAV--LLDWYSGTPP 425
Query: 456 RYMSPIAGFSGYANVTYKTGCD-DVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-- 512
+SP+ G + K G + +V NN+ A E AK AD I+ G + A
Sbjct: 426 YTISPLQG------IKNKVGENVEVLFAKNNADGKAVEIAKNADVAIVFIGNHPTCNAGW 479
Query: 513 ----------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
E++DR+ L ++ + V V K V+ T NI
Sbjct: 480 AQCPVPSNGKEAVDRQAL-----NSEYEDLVKLVYKANPNTVVGLISSFPYTINWTQENI 534
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
AI +E G AIA+V+FG +NP GRL TW D + PL +R
Sbjct: 535 PAIFHVTQNSQELGTAIANVLFGAYNPAGRLTQTWVK-DISDLPPLMDYNIR-------N 586
Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
GRTY ++ G LY FG+GLSYT FKY + K I+ N
Sbjct: 587 GRTYMYFKGKPLYAFGHGLSYTTFKYKDMEIPKQIKEN---------------------- 624
Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
+ KV+ N G DG +VV +Y K IK++ F+R+ ++AG
Sbjct: 625 ----------EEVSVKVNITNAGEVDGDEVVQLYVKHINSTVERPIKELKSFKRIHIKAG 674
Query: 743 RNKRIKFVFN 752
K + + N
Sbjct: 675 ETKTVSLLLN 684
>gi|383125188|ref|ZP_09945842.1| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
gi|382983435|gb|EES66611.2| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
Length = 954
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 228/749 (30%), Positives = 354/749 (47%), Gaps = 109/749 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ D+SLP RV+ L++ MT ++K++ + G G+P L +P EA+HG S
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GAT FP + A++N L +++ + E A N +A WSP ++V
Sbjct: 228 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
A+D RWGR ET GEDP +V + +++G Q SR L + KH+ +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 319
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++E++M E L PF ++ D S+M +Y+ G+P +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKEL 376
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
L Q +R EW +G+IV+DC +I + + A K +A Q L AG+ +CG Y N
Sbjct: 377 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKE 436
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
A + G++ D+D + + + R F+ +P L + I SD + E+A
Sbjct: 437 VIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 495
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
+AARE IV+L+N N LPL S ++T+AV+GP A+ G+Y +P + S + G
Sbjct: 496 QAARESIVMLENKDNLLPL-SKTLRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 552
Query: 465 SG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
G V Y+ GCD N I A +AA +D I++ G + EA
Sbjct: 553 KGAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTC 611
Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ D L LPG Q +L+ V K PVIL++ + DI A + KAIL P
Sbjct: 612 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 668
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+EGG A+ADV+FG +NP GRLP+T+ +V LPL + GR Y++ +
Sbjct: 669 GQEGGPAMADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 719
Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
LY FG+GLSYT F+Y+ L K+Q N N
Sbjct: 720 EYYPLYRFGFGLSYTSFEYSNL-----------KIQEKANGN------------------ 750
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
E + +NVGS G +V +Y T + ++ F R+ ++ G +K + F
Sbjct: 751 -----VEVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFARIHLQPGESKTVSF 805
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+++++ + ++ GE I VG
Sbjct: 806 EMTPY-DISLLNDRMDRVVEKGEFKIMVG 833
>gi|336412663|ref|ZP_08593016.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
3_8_47FAA]
gi|335942709|gb|EGN04551.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
3_8_47FAA]
Length = 735
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 215/765 (28%), Positives = 363/765 (47%), Gaps = 109/765 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------------VP-RLGLPQYE 97
L+ D P RV DL+SRMTL+EKV QL + G VP +G Y
Sbjct: 29 LYKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYF 88
Query: 98 WWSEALHGV--------SNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQAV 147
+ AL S +G F D I G T +P + S+N L ++
Sbjct: 89 ETNPALRNSMQKKAMEKSRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVS 148
Query: 148 STEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G + V+G Q
Sbjct: 149 AQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKGYQ--- 199
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
DL++ ++++C KHY Y R + +++Q + +T+L P+EM VK
Sbjct: 200 ----GDDLSAEN-RMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYEMGVK 251
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
G A+++M S+N ++G+P A+P ++ + ++ W G+IV+D +I+ + ++ LA +
Sbjct: 252 AG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAAT 308
Query: 327 KEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
K++A AGL++D + Y V++G+V +D++++ + + RLG F+
Sbjct: 309 KKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERP 368
Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
+ K+ +++++AA A E +VLLKN+ TLPL K +AV+GP A
Sbjct: 369 YTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTDK--KKIAVIGPMAKNGWD 426
Query: 446 MIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN--SIFAASEAAKTA 497
++G++ G + Y F+G A + Y GC A K +N A EAA+ +
Sbjct: 427 LLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGC---ATKGDNREGFAEALEAARWS 483
Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
D ++ G ++ E+ R + LP Q +L ++ + K P++LV+++ +++ E
Sbjct: 484 DVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELNRLE 542
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL-- 613
++ AIL PG G +A ++ G+ NP G+L +T P ++ +P+
Sbjct: 543 PISD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTGQIPIYY 591
Query: 614 -RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
R G+ G YK LYPFG+GLSYT+FKY +
Sbjct: 592 NRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV--------------------- 629
Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
T A+K ++ D +V NVG+ DG++ V + P +K++
Sbjct: 630 TPSATK----------VKRGDRLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELK 679
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
F++ ++AG K +F + + V+ L AGE+ I V
Sbjct: 680 HFEKQLIKAGETKTFRFDIDMERDFGFVNEDGKRFLEAGEYHILV 724
>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
Length = 1552
Score = 268 bits (686), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 220/758 (29%), Positives = 334/758 (44%), Gaps = 131/758 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--------------------GDFAHGVPRLGL 93
+ +++LP +IRV DL+ RMTLDEK+ Q+ ++ H +
Sbjct: 721 YQNAALPSAIRVHDLLQRMTLDEKLAQMRHIHFKHYNTDGHVDLTKLRNNYTHSMSFGCF 780
Query: 94 PQYEW----WSEALHGVS-NVGPGTHFD-DVIP-----------GATSFPTVILTTASFN 136
+ + + +A+ + N T F VIP G T FP I A+FN
Sbjct: 781 EAFPYSSTQYRQAVSTIQQNAADSTRFGIPVIPVIEGIHGIVQDGCTIFPQAIAQGATFN 840
Query: 137 ESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAV 196
L ++ Q + TE RA+ +P++++AR+ RWGR+ ET GEDP+++ R
Sbjct: 841 PQLVFRMAQHIGTEMRAI-----GARQVLAPDLDIAREQRWGRVEETFGEDPYLISRMGY 895
Query: 197 NYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-------DVDNWKGVDRYHFDARVT 249
NYV+G+Q G KH+ A+ ++ + KG R FD
Sbjct: 896 NYVKGIQSRGG--------------IPTLKHFVAHGTPQGGLNLASVKGGQRELFD---- 937
Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
+++PFE ++ A SVM Y+ + + P L +R GYI +D
Sbjct: 938 ------VYVKPFEYVIRHTKAGSVMNCYSAYDNEAITSSPFFLRTLLRDSLHFKGYIYSD 991
Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSL 369
SI ++ H ADS+ +A Q + AG+DL+ G Y + QG + + ID +
Sbjct: 992 WGSIPMLRYFHH-TADSETEAAQQAINAGVDLEAGSDYYRTAPTLIAQGLLDKARIDSAA 1050
Query: 370 KYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
++ G FD +Q I + E + +A + A E +VLL+N + LPL+ +
Sbjct: 1051 AHVLYTKFEAGLFDELASDTLHWRQQIHTPEAVAVAKQLADESLVLLENRNHFLPLDLNR 1110
Query: 430 VKTVAVVGPHANATVAMIGNYAGIP-CRY-MSPIAGFSGYA----NVTYKTGCDDVACKS 483
+ ++AVVGP NA G+Y+ R+ ++P+AG A V Y GCD ++
Sbjct: 1111 LHSIAVVGP--NAAQVQFGDYSWTADNRHGITPLAGIQQVAGMRTKVRYVKGCD-YYSQN 1167
Query: 484 NNSIFAASEAAKTADATIILAGLDL---------SVEAESLDREDLWLPGYQTQLINQVA 534
+SI A AK +D T+++ G S E D DL LPG Q QLI ++A
Sbjct: 1168 TDSIDEAVALAKQSDVTVVVVGTQSMLLARPSQPSTSGEGYDLSDLILPGVQQQLIERIA 1227
Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
A G +V+M G + A N A+L Y GE+ G ++A +FG+ NP GRLP
Sbjct: 1228 --ATGKPFIVVMVTGRPLLTEAFKN-KADALLVQWYGGEQAGLSLAQALFGQLNPSGRLP 1284
Query: 595 ITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
I++ Y LP + PGR Y F + YPFGYGLSYT FKY+
Sbjct: 1285 ISFPKATGQLPVYYNHLPTDKGYYNKKGTPDKPGRDYVFADPYPAYPFGYGLSYTTFKYS 1344
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
L+ +K Q N N D QN G G
Sbjct: 1345 QLALSKK-QTNEN------------------------------DTIAVTFRVQNTGKRAG 1373
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+V +Y + AT IKQ+ GF++ ++ G K I
Sbjct: 1374 KEVAQLYIRDMKSSVATPIKQLFGFEKCALQPGETKTI 1411
>gi|227536644|ref|ZP_03966693.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33300]
gi|227243445|gb|EEI93460.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
33300]
Length = 777
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 207/721 (28%), Positives = 330/721 (45%), Gaps = 111/721 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G T FPT I +++N +L +K+ V+
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGQASTWNPALLQKMSATVAK 173
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R + + P ++++RDPRW R+ E+ GEDP + G A V GL G
Sbjct: 174 EVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVTGL----GSG 224
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
N +D P KH+ AY + A + E+++ E FL PF+ V G
Sbjct: 225 NLSD----PFATIPTLKHFVAYGIPEG---GHNGSAASIGERELREYFLPPFQSAVAAG- 276
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM +YN V+GIP ++ LL +R EW+ +G+ V+D SI+ + +H+ D K+
Sbjct: 277 AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWNFNGFTVSDLGSIEGIKGSHRVAKDHKQA 336
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
A+ ++AGLD D G AV+QG+V+E ID+++ + + +G F+
Sbjct: 337 AIL-AIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRVLALKFEMGLFEKPFVDA 395
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
K+++ ++ NI L+ + ARE IVLL+N N LPL K +A++GP+A+ M+G+
Sbjct: 396 KTAKKEVKTEANIALSRQVARESIVLLENKNNILPLR--KDVKIAIIGPNADNIYNMLGD 453
Query: 450 YA-----GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
Y G I+ A V+Y GC + +N+ I AA AA+ +D + +
Sbjct: 454 YTAPQPDGAVTTVRQAISARLPKAQVSYVKGC-SIRDTTNSDIPAAVTAAQQSDIIVAVV 512
Query: 505 G----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVAEVAKGPV 541
G D E E DR L L G Q +L+ + + K P+
Sbjct: 513 GGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLDLLGRQMELLKALKQTGK-PL 571
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
+++ + +++ +A T+ + A+L A YPG+EGG AIADV+FG +NP G++P++
Sbjct: 572 VVIYIQGRPLNMNWAATHAD--ALLCAWYPGQEGGHAIADVLFGDYNPAGKMPLSVPRS- 628
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
V +P+ P+D Y LY FGYG SY+ F+Y L K
Sbjct: 629 -VGQIPVHYNRKSPLD------HRYVEEAATPLYAFGYGKSYSDFEYKDLKIQK------ 675
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF--QNVGSTDGSDVVIVYSKP 719
D +++V F N G DG +V +Y +
Sbjct: 676 -------------------------------DNKDYRVSFTLTNTGKYDGDEVAQLYIRN 704
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ ++Q+ F+R+ ++ G +K + FV A I L P I VG+
Sbjct: 705 QYASVSQPVQQLKHFERIHLKTGESKTVSFVLTAGDLSVINTQMKKVLEPGSSFKIRVGS 764
Query: 780 G 780
Sbjct: 765 A 765
>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
Length = 881
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 168/432 (38%), Positives = 238/432 (55%), Gaps = 43/432 (9%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
+ F + L S RV DL+ R+T++EK+ QL + + RLG+P+Y WW+E+LHGV+
Sbjct: 25 QQYPFQNPELDDSARVADLLERLTVEEKIDQLLYTSPAIERLGIPEYNWWNESLHGVARA 84
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------G 161
G AT FP I A+++ L K++ A+S EARA ++ + R G
Sbjct: 85 G----------YATVFPQSITIAAAWDSDLLKEVADAISDEARAKHHEYIRRGQRGIYQG 134
Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
LT+WSPNIN+ RDPRWGR ET GEDP++ G+ + YV+GLQ D N LK+
Sbjct: 135 LTFWSPNINIFRDPRWGRGHETYGEDPYLTGQLGIAYVKGLQ-------GNDPNY--LKL 185
Query: 222 SSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+ KH+A + G + R+ FD +++D+ ET+L F VK+GD SVM +YNR
Sbjct: 186 VATAKHFAVH-----SGPEPLRHEFDVSPSKRDLWETYLPAFRYLVKQGDVKSVMTAYNR 240
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
V G + A L +R WD GY+V+DC +I + HK D+ E A A + G
Sbjct: 241 VYGEAASASDTLFT-ILRDYWDFDGYVVSDCFAISDIWKYHKIAKDAAE-ASAMAVIEGC 298
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDIC 397
DL+CG Y A QQG V E DID +L L ++LG FD Y +
Sbjct: 299 DLNCGDSYEKLN-QAYQQGMVTEKDIDIALSRLMEARIKLGMFDPEQLVPYAQIPFNVNT 357
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
S+++ +LA +AA+E IVLLKN + LPL S +K+VAV+GP+A+ ++ GNY G P
Sbjct: 358 SEKHNQLALKAAKESIVLLKNQGDLLPL-SKDLKSVAVIGPNADNIQSLWGNYNGNP--- 413
Query: 458 MSPIAGFSGYAN 469
PI G N
Sbjct: 414 KDPITVLQGIQN 425
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 141/290 (48%), Gaps = 48/290 (16%)
Query: 503 LAGLDLSVEAESL---DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
L G ++ V E DR L LP Q L+ +VA+ K P++LV+++ + I +A N
Sbjct: 615 LEGEEMDVVVEGFAGGDRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSALSINWAAEN 673
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
I AI+ AGY G++GG A+A+V+FG +NP RLP+T+Y V+ LP +
Sbjct: 674 --IPAIMTAGYAGQQGGNAVAEVLFGDYNPAARLPVTYYKS--VEDLP-------DFEDY 722
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
GRTY+++ LYPFGYGLSYT F Y+ I +N
Sbjct: 723 NMDGRTYRYFEKEPLYPFGYGLSYTTFDYSKFQLPSKIDMN------------------- 763
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+ E V+ N G+ DG +VV VY I++++GF+R+ +
Sbjct: 764 -------------ESIELSVEVTNTGAYDGDEVVQVYLTDEKGSTPRPIRELVGFKRIHL 810
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
+ G +++++F + L+++D + ++ G +I VG F LN
Sbjct: 811 KKGESQKVQFTIEP-RQLSMIDDKGDLVIEPGVFSISVGGEQPGFNAKLN 859
>gi|319953334|ref|YP_004164601.1| beta-glucosidase [Cellulophaga algicola DSM 14237]
gi|319421994|gb|ADV49103.1| Beta-glucosidase [Cellulophaga algicola DSM 14237]
Length = 756
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 216/734 (29%), Positives = 347/734 (47%), Gaps = 105/734 (14%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
EK++ DFA RLG+P + + S+ +HG T+FP + ++S+
Sbjct: 83 EKIKTAQDFAVKKTRLGIPLF-FGSDIIHGYK---------------TTFPIPLGLSSSW 126
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
+ L K+ Q + EA A G+ + +SP ++++RDPRWGRI+E GEDP++ +
Sbjct: 127 DMELLKRTAQVAALEATA------DGINWNFSPMVDISRDPRWGRISEGAGEDPYLGSQI 180
Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
A V G Q DL ++ +++ KH+A Y G D D ++ M
Sbjct: 181 AKAMVTGYQ-------GEDLMAKNTMLATV-KHFALYGAAE-AGRDYNSVD--MSRLKMY 229
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
+L P++ + G SVM S+N ++GIP+ + LL +R +W +G++V+D S+
Sbjct: 230 NEYLPPYKAAIDAG-VGSVMSSFNDIDGIPASGNKWLLTDLLRDDWKFNGFVVSDYTSVN 288
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
M+ + L D + A +LKAGLD+D G+ + ++ +GKV +I + + +
Sbjct: 289 EMIAHG--LGDLQA-VSALSLKAGLDMDMVGEGFLTTLKKSLDEGKVTAEEITTACRRIL 345
Query: 374 TVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
+LG FD +Y+ + +DI DEN LA EAA++ VLLKND LP+N K
Sbjct: 346 EAKFKLGLFDDPYKYIDKKRPAKDILKDENRALAREAAKKSFVLLKNDTKNLPIN--KSS 403
Query: 432 TVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSGYA---NVTYKTGC---DDVACKS 483
+A++G AN+ M+G +A G P +S + GF A +T+ G DD A
Sbjct: 404 KIALIGDLANSKDNMLGTWAPTGDPQLSVSILQGFKNVAPNAQITHAKGANITDDAALAK 463
Query: 484 NNSIFA----------------ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
++F A E AK +D + + G ES R D+ +P Q
Sbjct: 464 KINVFGERVTIDKRSAEEMLNEAVELAKKSDIIVAVVGEATEFTGESSSRTDISIPQSQK 523
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
+LI +A K P++LV+MS G + E +IL +PG E G AIADVVFG +
Sbjct: 524 KLIRALAATGK-PLVLVLMS--GRPLVLEEELALSASILQVWFPGVEAGNAIADVVFGDY 580
Query: 588 NPGGRLPITW-YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT--LYPFGYGLSYT 644
NP G+L TW N + + RP + + T + + P L PFGYGLSYT
Sbjct: 581 NPSGKLTATWPRNVGQIPIYHSIKNTGRPQLTSEFEKFTSNYLDAPNTPLLPFGYGLSYT 640
Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
+F+Y+ L+ + Q+N N+ P ++ V N
Sbjct: 641 EFEYSNLNVNAS-QINQNE------------------PLIVT------------VSVTNT 669
Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
G+ DG +VV +Y + +KQ+ GF++V ++ G K++ L +
Sbjct: 670 GNFDGEEVVQLYLRDVVRSITQPVKQLKGFKKVMLKKGETKQVTLTLTP-DDLKFYNSNL 728
Query: 765 NTLLPAGEHTIFVG 778
+ + G+ I+VG
Sbjct: 729 DFVAEPGDFEIYVG 742
>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
17393]
gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 863
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 158/441 (35%), Positives = 229/441 (51%), Gaps = 32/441 (7%)
Query: 35 FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
+C FS ++ F + LP RV DLV R+TL+EK+ Q+ + A + RLG+P
Sbjct: 7 LICSLLLFSVTVAGQATCKFLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIP 66
Query: 95 QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
Y WW+E LHGV+ P TSFP I A+++ ++ S E RA+
Sbjct: 67 AYNWWNECLHGVAR--------SPYP-VTSFPQAIAMAATWDTESVHQMAVYASDEGRAI 117
Query: 155 YNLGRA--------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
Y+ GLTYWSPNIN+ RDPRWGR ET GEDPF+ V++V+GLQ
Sbjct: 118 YHDATRKGTPGIFRGLTYWSPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQ--- 174
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
G + LK S+C KHYA + W +R+ +DA+V D+ +T+L F+ V
Sbjct: 175 GDDPVY------LKSSACAKHYAVHSGPEW---NRHTYDAKVNNHDLWDTYLPAFKELVV 225
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
EG + VMC+YN G P C + L+ +R W GY+ +DC +++ + HK D+
Sbjct: 226 EGKVTGVMCAYNSFFGQPCCGNDLLMMDILRNHWKFGGYVTSDCGAVEDFYNTHKTHQDA 285
Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
+ L G D +CG +AV +G + E ID+SLK L+ + RLG FD
Sbjct: 286 AAASADAVLH-GTDCECGNGAYRALADAVLRGLITEKQIDESLKKLFEIRFRLGMFDPDD 344
Query: 387 Q--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
+ Y ++ + D + A + AR+ IVLLKN LPLN K+K +AVVGP+A+
Sbjct: 345 RVPYSNIPLSVLECDAHKAHALKIARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADDKS 404
Query: 445 AMIGNYAGIPCRYMSPIAGFS 465
++ NY G P + + G
Sbjct: 405 VLLANYYGYPSHITTALEGIQ 425
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 147/298 (49%), Gaps = 57/298 (19%)
Query: 493 AAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVI 542
A K AD I + GL VE E + DR + +P Q L+ ++ K PV+
Sbjct: 595 AVKDADVIIFVGGLSAKVEGEEMGVEIEGFKRGDRTSISIPSVQQNLLKELYATGK-PVV 653
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
V+M+ + + + + ++ AIL A Y G+ GG+AIADV+FG +NP GRLP+T+Y
Sbjct: 654 FVMMTGSALGLEWE--SAHLPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKS-- 709
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
V LP + RTY+++ G +YPFGYGLSYT F+Y+ L +Q + +
Sbjct: 710 VNDLP-------DFEDYSMENRTYRYFTGTPVYPFGYGLSYTTFQYSSLK----LQPSPD 758
Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
K R++ T+ + N G +G +V +Y P +
Sbjct: 759 K----RSVKVTAKIT-------------------------NTGKMEGEEVAQLYVSNPRD 789
Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
T I+ + GF+R+ ++ G ++ ++FV + K L++VD + ++ G+ I +G G
Sbjct: 790 F-VTPIRALKGFKRINLKPGESQTVEFVLTS-KELSVVDISGKSVPMKGKVQISLGGG 845
>gi|224537403|ref|ZP_03677942.1| hypothetical protein BACCELL_02281 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520981|gb|EEF90086.1| hypothetical protein BACCELL_02281 [Bacteroides cellulosilyticus
DSM 14838]
Length = 750
Score = 268 bits (685), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 218/765 (28%), Positives = 359/765 (46%), Gaps = 114/765 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDF-AHGVPRLGLPQYEWWSEALHGV---------------- 106
R++ L+ +MTL+EK+ Q+ P L + ++ +
Sbjct: 35 RIEALLGKMTLEEKIGQMNQLHCENFPYLKTETRKGRVGSVMSITDPNIFNEVQRIAVED 94
Query: 107 SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
S +G P + DVI G T FP + ASFN + + + +TEA A AG+ +
Sbjct: 95 SRLGIPLINARDVIHGFKTIFPIPLGQAASFNPEIAETGARIAATEASA------AGIRW 148
Query: 165 -WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
++P I++ DPRWGRI E GEDP +V + V ++G Q + LN P +++
Sbjct: 149 TFAPMIDITHDPRWGRIAEGFGEDPLLVSQMGVAAIKGFQ-------GSSLN-HPTSIAA 200
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
C KH+A Y R + +TE+ +LRPFE V G A+++M ++N +GI
Sbjct: 201 CAKHFAGYGASEG---GRDYNSTYITERQFRNLYLRPFEAAVNAG-AATLMTAFNDNDGI 256
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD- 342
PS A+P LL +R EW+ G +V+D S+ M+ H F D KE A+ T AG D++
Sbjct: 257 PSSANPFLLKDVLRNEWNYRGTVVSDWASVSEMI-RHGFCEDEKEAALKAT-NAGTDIEM 314
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
+ Y + +++GKV ID +++ + + RLG F+ P K+ + +
Sbjct: 315 VSETYIKYLPQLIKEGKVSMETIDNAVRNILRLKFRLGLFE-HPYIADQRKETFYRPDFL 373
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG--------NYAGIP 454
E A AA + VLLKN++ TLP+ S +KT+ V GP A+A +G +Y+ P
Sbjct: 374 EAAQTAAEQSAVLLKNERGTLPIQS-NIKTILVTGPLADAPHEQLGTWVFDGDASYSQTP 432
Query: 455 CRYMSPIAGFSGYANVTYKTGCD---DVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
+ + +G S V Y G + D A N + E A+ AD + G + +
Sbjct: 433 LQALRRTSGDS--IKVLYAPGLNYSRDTATSQFNKVV---ELAREADLILAFVGEEAILS 487
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ +L L G Q++L+++++E K P++ V+M+ + I E N + A+L+A +P
Sbjct: 488 GEAHCLANLNLQGAQSRLLHRLSETGK-PLVTVVMAGRPLTIG-REVNIS-DALLYAFHP 544
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLR---------PV 616
G GG A+A+++FGK P G+LP+T+ +P+ T P PV
Sbjct: 545 GTMGGPALANLLFGKVVPSGKLPVTF--PKETGQIPIYYNHTSTGRPASGSEKNIFTIPV 602
Query: 617 DSLGYPGRTYKFY---NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
+ FY L+PFGYGLSYT F Y+ L + T Q+ RN
Sbjct: 603 GAEQTSLGNTSFYLDAGKDPLFPFGYGLSYTTFAYSNLQLSST--------QYTRN---- 650
Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
+ D N G TDG+++ +Y + A +K++
Sbjct: 651 -------------------EVIIITFDLTNTGKTDGTEIAQLYFRDLAASVTRPVKELAA 691
Query: 734 FQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
F+R+ ++AG + I+ K L+ +YA + + G+ +++G
Sbjct: 692 FERIHLKAGETRHIRMEL-PVKQLSFWNYAMDYCVEPGKFDLWIG 735
>gi|255693560|ref|ZP_05417235.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260620625|gb|EEX43496.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 770
Score = 268 bits (685), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 222/818 (27%), Positives = 383/818 (46%), Gaps = 148/818 (18%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
+C S L Q S + D +LP S RV L+S+MTL+EKV Q+ + G+ + +
Sbjct: 10 ICCAIGISTLACQDKSKDYTDPTLPVSERVSSLMSQMTLEEKVAQMCQYV-GLEHMKKAE 68
Query: 96 YEWWSEALHGVSNVG--PGTHFDDV----------------------------------I 119
+ +E L + G P H DV I
Sbjct: 69 KDMSAEDLKHSHSQGFYPNLHSSDVEEMTKKGLISSFLHVVKAEEANYLQSLAQQSRLKI 128
Query: 120 P---------------GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
P G+T +PT I A+F+ +L +++ + + E RA +G+ +
Sbjct: 129 PLLIGIDAIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHW 182
Query: 165 -WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKV 221
++PN+ VARD RWGR+ ET GEDP++VG+ VRG Q D G++ KV
Sbjct: 183 TFTPNVEVARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGND----------KV 232
Query: 222 SSCCKHY--AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+C KH + + G A ++E+ ++E F PF+ C++ G +VM ++N
Sbjct: 233 IACAKHLVGGSQPANGINGAP-----AELSERTLQEVFFPPFKDCLEAG-VFTVMTAHNE 286
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
+NGIP + L+ + +R +W G++V+D I+ M D H +A++ +DA ++ AG+
Sbjct: 287 LNGIPCHGNKYLMTEVLRNQWKFDGFVVSDWMDIERMHDYHN-VAETLKDAYQISVDAGM 345
Query: 340 DLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--I 396
+ G + V++G + E ID ++ + V RLG F+ ++ L K+D +
Sbjct: 346 GMHMHGPEFYEAIIECVKEGSIPEKQIDAAVSKILEVKFRLGLFENP--FIDLKKKDEIV 403
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-GIPC 455
++++ + A E AR+ IVLLKN+ N LPL+++K K V V G +AN +++G++A P
Sbjct: 404 FNEKHQQTALEGARKSIVLLKNEGNMLPLDASKYKKVFVTGHNAN-NQSILGDWAMEQPE 462
Query: 456 RYMSPIAGFSGYANVTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAG---- 505
+++ + G ++ +T + +V S+N I A + A+++D I++ G
Sbjct: 463 EHVTTV--LKGLKAISPETNYNFLDLGWNVRLLSDNQIKEAVQQARSSDLAILVVGENSM 520
Query: 506 ---LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
+ E+ DR +L LPG Q +L+ VA V++++ G + + N+
Sbjct: 521 RYHWNEKTCGENSDRYELSLPGRQQELVEAVAATGVPTVVILV---NGRPLTTEWIDENM 577
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
I+ A PG GG+A+A++++GK NP G+LPIT +P ++ ++ + + +
Sbjct: 578 PCIIEAWEPGVAGGQALAEILYGKVNPSGKLPIT---------IPRSTGQIQCMYNHKFT 628
Query: 623 GRTYKFYNGPTL--YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
+ + G +L Y FGYGLSYT +KY L ++ T
Sbjct: 629 NHWFPYATGNSLPLYEFGYGLSYTTYKYENLKLSEA----------------------TI 666
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
P D + VD N G DG + V +Y + A +K++ F R+ ++
Sbjct: 667 TP---------DKSVKVTVDVTNTGKMDGEETVQLYIRDEYSSATRPVKELKDFARIPLK 717
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
AG K + F + L+ D + + G I VG
Sbjct: 718 AGETKEVSFTLTP-EMLSYYDANMHYGVEKGTFKIMVG 754
>gi|347736643|ref|ZP_08869226.1| xylosidase/arabinosidase [Azospirillum amazonense Y2]
gi|346919803|gb|EGY01181.1| xylosidase/arabinosidase [Azospirillum amazonense Y2]
Length = 775
Score = 268 bits (685), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 218/725 (30%), Positives = 335/725 (46%), Gaps = 109/725 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + E LHG +GP TSFP I +S++ L +++ V+
Sbjct: 121 RLGIPVL-FHEEGLHGYPAIGP-----------TSFPQAIAQASSWDPDLIREVDSVVAR 168
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R R SP ++VARDPRWGRI ET GEDP++ G V V+GLQ
Sbjct: 169 EIRV-----RGVSLVLSPVVDVARDPRWGRIEETFGEDPYLAGEMGVAAVQGLQG----- 218
Query: 210 NATDLNSRPL---KVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
+S PL KV + KH + ++ V A V E+ + E F PFE +
Sbjct: 219 -----DSLPLADGKVFATLKHLTGHGQPESGTNVG----PASVGERTLREMFFPPFEQVI 269
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
+ +VM SYN ++G+PS + LL+ +RGEW G I++D +I +V H + D
Sbjct: 270 HRTNVRAVMASYNEIDGVPSHVNTWLLHDILRGEWGYKGSIISDYSAIDQLVSIHHVVPD 329
Query: 326 SKEDAVAQTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
A+ + ++AG+D D G+ Y + ++V+ GK+KE ID++++ + + + G F+
Sbjct: 330 LPSAAI-RAIQAGVDADLPDGESYASLA-DSVRAGKIKEEVIDRAVRRILELKFQAGLFE 387
Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
+ + E +A +AA++ +VLLKND LPL+ AKVKT+AV+GP NA
Sbjct: 388 HPYADADKAEALTANGEARAVALKAAQKSVVLLKND-GVLPLDMAKVKTLAVIGP--NAA 444
Query: 444 VAMIGNYAGIPCRYMSPIAGFS----GYANVTYKTGC----DD--------VACKSNNS- 486
A +G Y+G P + +S + G VTY G DD +A + N+
Sbjct: 445 KAHLGGYSGEPKQTVSILDGIKAKVGARVKVTYAEGVRITKDDDWYGDTVELADPAENAR 504
Query: 487 -IFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLINQVAEVAKG 539
I A AKTAD +++ G + E DR+ L L G Q L + + K
Sbjct: 505 LIQQAVAVAKTADHIVLVIGDNEQTSREGWANNHLGDRDSLDLVGQQNDLAKALFALGK- 563
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PV++V+ + G ++ + A++ Y G+EGG A+ADV+FG NPGG+LP+T
Sbjct: 564 PVVVVLQN--GRPLSVVDVAARANALVEGWYLGQEGGTAMADVLFGDVNPGGKLPVTVAR 621
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
V LP+ + R Y F L+PFGYGLSYT F
Sbjct: 622 S--VGQLPMF------YNKKPSARRGYLFDTTDPLFPFGYGLSYTTFDVG---------- 663
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
P + + D VD +N G G +VV +Y
Sbjct: 664 ---------------------SPRLSTPTIAKDGAITVAVDVRNTGKRAGDEVVQLYLHQ 702
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+K++ GFQR+ + G ++ + F + K+L + + ++ G I VG+
Sbjct: 703 QVASVTRPVKELKGFQRITLAPGESRTVTFTVDG-KALALWNQDMKRVVEPGAFDIMVGD 761
Query: 780 GGVSF 784
V
Sbjct: 762 NSVDL 766
>gi|380696432|ref|ZP_09861291.1| beta-glucosidase [Bacteroides faecis MAJ27]
Length = 954
Score = 268 bits (685), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 226/749 (30%), Positives = 355/749 (47%), Gaps = 109/749 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ D SLP RV+ L++ MT ++K++ + G G+P L +P EA+HG S
Sbjct: 170 YMDVSLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GAT FP + A++N+ L +++ + E A N +A WSP ++V
Sbjct: 228 ---------GATIFPQALAMGATWNKKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
A+D RWGR ET GEDP +V + +++G Q SR L + KH+ +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQMGGAWIKGYQ------------SRGLFTTP--KHFGGH 319
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++E++M E L PF ++ D S+M +Y+ G+P +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKEL 376
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
L Q +R EW +G+IV+DC +I + + A K +A Q L AG+ +CG Y N
Sbjct: 377 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKE 436
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
A + G++ D+D + + + + R F+ +P L + I SD + E+A
Sbjct: 437 VIQAAKDGRINMEDLDNVCRTMLSTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 495
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
+AARE IV+L+N +N LPL S ++T+AVVGP A+ G+Y +P + S + G
Sbjct: 496 QAARESIVMLENKENLLPL-SKTLRTIAVVGPGADDLQP--GDYTPKLLPGQLKSVLTGI 552
Query: 465 SG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
V Y+ GCD + N I A + A +D I++ G + EA
Sbjct: 553 KSAVGKQTKVLYEQGCDFTNPDATN-IPKAVKTASQSDVVIMVLGDCSTSEATNDVRKTC 611
Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ D L LPG Q +L+ V K PVIL++ + DI A + KAIL P
Sbjct: 612 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 668
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+EGG A+ADV+FG +NP GRLP+T+ +V LPL + GR Y++ +
Sbjct: 669 GQEGGPAMADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 719
Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
LY FG+GLSYT F+Y+ L K+Q N N
Sbjct: 720 EYYPLYRFGFGLSYTSFEYSNL-----------KIQEKANGN------------------ 750
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
E + +NVGS G +V +Y T + ++ F R+ ++ G +K + F
Sbjct: 751 -----VEVQATVKNVGSCAGDEVAQLYVTDMYASVKTRVMELKDFTRIHLQPGESKTVSF 805
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+++++ + ++ GE I +G
Sbjct: 806 EMTPY-DISLLNDRMDRVVEKGEFKIMIG 833
>gi|423217451|ref|ZP_17203947.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
CL03T12C61]
gi|392628610|gb|EIY22636.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
CL03T12C61]
Length = 946
Score = 268 bits (685), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 235/819 (28%), Positives = 371/819 (45%), Gaps = 146/819 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS------ 100
+ D + P R++DL+S+MTL+EK Q+ +G R+ LP EW W
Sbjct: 53 YEDPTAPIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTSEWKNQLWKDGIGAI 111
Query: 101 -EALHGVSNVG-PGTHFDDVIPG------------------------------------- 121
E L+G G P + + V P
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171
Query: 122 -ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGR 179
AT+FPT + ++N L ++G EAR + G T ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRQLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
E GE P++V + VRG+Q H + +V++ KH+ AY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKHFIAYSNNKGARE 272
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
D +++ +++E PF+ ++E VM SYN +G P + L +RGE
Sbjct: 273 GMARVDPQMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAV 355
GY+V+D D+++ + H D KE AV Q+++AGL++ C Y V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGI 413
++G + E I+ ++ + V +G FD +P L D + EN E+A +A+RE I
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLKGADEEVEKKENEEVALQASRESI 450
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYAN 469
VLLKN++N LPL+ +K++ +AV GP+A+ + +Y + S + G A+
Sbjct: 451 VLLKNEKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKAD 510
Query: 470 VTYKTGCDDVAC--------------KSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
V Y GCD V + I A AK AD I++ G E+
Sbjct: 511 VLYTKGCDLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENK 570
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
R L LPG Q L+ V K PV+LV+++ + I +A+ + AIL A YPG +G
Sbjct: 571 SRSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWAD--KFVPAILEAWYPGSKG 627
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFYN 630
G A+AD++FG +NPGG+L +T+ V +P + P +P +D PG N
Sbjct: 628 GIAVADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRAN 684
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
G LYPFGYGLSYT F+Y+ L + P ++ + +
Sbjct: 685 G-ALYPFGYGLSYTTFEYSDLKIS---------------------------PAIITPNQK 716
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
Y KV N G G +V+ +Y + TY K ++GF+RV ++ G K I F
Sbjct: 717 A--YVTCKV--TNTGKRSGDEVIQLYVRDVLSSVTTYEKNLVGFERVHLKPGETKEITFP 772
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
+ K+L +++ + ++ G+ T+ + G S I LN
Sbjct: 773 IDR-KALELLNADMHWVVEPGDFTLML--GASSTDIRLN 808
>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
Length = 877
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 163/450 (36%), Positives = 243/450 (54%), Gaps = 47/450 (10%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
+++ + D++L R DLVSRMTL+EK QLG A +PRLG+P+Y WW+E LHGV+
Sbjct: 18 VAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWNEGLHGVAR 77
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-------- 160
G AT FP I A+++E + +G VSTE RA Y + R
Sbjct: 78 AGV----------ATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKY-VERVHPDGGTDW 126
Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
GLT WSPNIN+ RDPRWGR ET GEDP++ R + Y+ GLQ + +
Sbjct: 127 YRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQGN---------DPKF 177
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K + KH+A V + +R+ D ++ D+E+T+L F V EG A SVMC YN
Sbjct: 178 FKTVATSKHFA---VHSGPESNRHKEDVYPSKFDLEDTYLPAFRATVTEGKAYSVMCVYN 234
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCD-SIQVMVDNHKFLADSKEDAVAQTLKA 337
V G+P CA L+ + +R W G++V+DC + + ++ + E+ VA LKA
Sbjct: 235 AVYGVPGCASDFLMEEKLRQNWGFPGFVVSDCGAAANIFREDALHYTKTAEEGVAVGLKA 294
Query: 338 GLDLDCGQYYTNFTG------NAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYV 389
G+DL CG Y + NAV+ G++ +D++L L+ +RLG FD S +
Sbjct: 295 GMDLICGDYRNKMSTEVQPIINAVKAGQLPIAVVDQALVRLFEGRIRLGMFDPPASLPFA 354
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
+ D + + +A + A++ +VLLKND LPL A+ KT+AV+GP+A++ A++GN
Sbjct: 355 HITADDSDTPAHHAVALDMAKKSMVLLKND-GLLPLK-AEPKTIAVIGPNADSLDALVGN 412
Query: 450 YAGIPCRYMSPIAGFSGY---ANVTYKTGC 476
Y G P + ++ + G A + Y G
Sbjct: 413 YYGKPSKPVTVLDGIRARFPTAKIVYAEGT 442
Score = 122 bits (305), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 96/309 (31%), Positives = 144/309 (46%), Gaps = 73/309 (23%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + AKTAD + + GL VE E + DR + LP Q QL+ +V K
Sbjct: 591 AVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKVIGTGK- 649
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
P +LV+MS + + +A + ++ AI+ A YPG EGG A+A ++ G ++P GRLP+T+Y
Sbjct: 650 PTVLVLMSGSALGVNWA--DKHVPAIIEAWYPGGEGGHAVAQLIAGDYSPAGRLPVTFY- 706
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPG--------RTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
R VD+L PG RTY+++NG LYPFG+GLSYT F Y
Sbjct: 707 --------------RSVDAL--PGFSDYTMKNRTYRYFNGEVLYPFGHGLSYTTFAYA-- 748
Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
P V + VD N G+ D +
Sbjct: 749 -----------------------------NPKVSAASVAAGSSVTVSVDVSNSGAMDSDE 779
Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAG 771
VV +Y P T I+ + GFQRV ++ G K ++F + ++L++VD + AG
Sbjct: 780 VVQLYVSHP---GGTAIRSLQGFQRVSLKKGETKTVQFKLDD-RALSVVDEHGGRKVQAG 835
Query: 772 EHTIFVGNG 780
+ +++G G
Sbjct: 836 QVDLWIGGG 844
>gi|170731072|ref|YP_001776505.1| beta-glucosidase [Xylella fastidiosa M12]
gi|167965865|gb|ACA12875.1| Beta-glucosidase [Xylella fastidiosa M12]
Length = 882
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 164/423 (38%), Positives = 238/423 (56%), Gaps = 40/423 (9%)
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
LV++MT EK+ Q + A +PRLG+P Y+WWSE LHG++ G AT FP
Sbjct: 37 LVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNL----GR-----AGLTYWSPNINVARDPRWG 178
I AS+N L + +G STEARA +NL G+ AGLT WSPNIN+ RDPRWG
Sbjct: 87 AIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R ET GEDP++ + AV+++RGLQ ++ P +++ KH+A V +
Sbjct: 147 RGMETYGEDPYLTSQLAVSFIRGLQG--------NIPDHPRTIATP-KHFA---VHSGPE 194
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
R+ FD V+ D+E T+ F + +G A SVMC+YN ++G P+CA LLN +R
Sbjct: 195 PGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRN 254
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
+W +G++V+DCD+I+ M H F D+ A A LK+G DL+CG Y + A+ +G
Sbjct: 255 DWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGDDLNCGNTYRDLN-QAIARG 312
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLL 416
+ E+ +D++L L+T RLG Y ++G + I + + LA +AA + +VLL
Sbjct: 313 DIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAHRALALQAAAQSLVLL 372
Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYK 473
KN NTLPL T+AV+GP A++ A+ NY G ++P+ G G A V Y
Sbjct: 373 KNSGNTLPLTPG--TTLAVLGPDADSLTALEANYQGTSSTPVTPLIGLRTRFGTAKVHYA 430
Query: 474 TGC 476
G
Sbjct: 431 QGA 433
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 100/301 (33%), Positives = 140/301 (46%), Gaps = 55/301 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A A ADA + GL VE E L DR + LP Q L+ V K
Sbjct: 604 AERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK- 662
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
P+I+V+MS V + +A+ + + AIL A YPG+ GG AIA + G NPGGRLP+T+Y
Sbjct: 663 PLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNPGGRLPMTFYR 720
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
Q LP P S GRTY+++ G LYPFGYGLSYTQF Y
Sbjct: 721 S--TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAYE---------- 761
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
P + L+ D +N G+ G +VV +Y +P
Sbjct: 762 ---------------------APQLSTATLKAGDTLTVTAHVRNTGTRAGDEVVQLYLEP 800
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
P A ++ ++GF+RV +R G ++ + F +A + L+ V + AG + +FVG
Sbjct: 801 PHSPQAP-LRNLVGFKRVTLRPGESRLLTFTLDA-RQLSSVQQTGQRSVEAGHYHLFVGG 858
Query: 780 G 780
G
Sbjct: 859 G 859
>gi|261880507|ref|ZP_06006934.1| xylosidase [Prevotella bergensis DSM 17361]
gi|270332847|gb|EFA43633.1| xylosidase [Prevotella bergensis DSM 17361]
Length = 948
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 223/806 (27%), Positives = 350/806 (43%), Gaps = 140/806 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL-------------------------------G 82
+ D+S P + R+ DL+ +MT++EK Q+
Sbjct: 61 YEDASAPLNDRINDLLEQMTIEEKTNQMVTLYGYKRVLEDDLPNAGWKQKLWKDGIGAID 120
Query: 83 DFAHGVPRLGLPQYE--W-WSEALHG----------VSNVGPGTHFDDVIPG-------- 121
+ +G + GLP + W W + H V G D G
Sbjct: 121 EHLNGFVQWGLPPSDNPWVWPASKHAWAINEVQRFFVEETRLGIPVDFTNEGIRGIESYK 180
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRI 180
AT+FPT + ++N L +++G EAR + G T ++P ++V RD RWGR
Sbjct: 181 ATNFPTQLGLGTTWNRQLIRQVGYITGREARLL------GYTNVYAPILDVGRDQRWGRY 234
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GE PF+V + RGLQ TD +V+S KH+AAY +
Sbjct: 235 EEIYGESPFLVAELGIQMTRGLQ--------TDF-----QVASTAKHFAAYSNNKGGREG 281
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
D ++ +++E L P+E V+E M SYN +GIP L + +R +
Sbjct: 282 MSRVDPQMPPREVENIHLYPWERVVQEAGLLGAMSSYNDYDGIPIQGSYHWLTEVLRHRF 341
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQ 356
GYIV+D D+++ + H AD KE AV Q + AGL++ C + ++
Sbjct: 342 GFRGYIVSDSDALEYLFSKHHTAADMKE-AVYQAVMAGLNVRCTFRSPDSFVLPLRELIR 400
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY-VSLGKQDICSDENIELAAEAAREGIVL 415
+G++ + ID+ + + V G FD Q + Q++ S+ N +A +A+R+ IVL
Sbjct: 401 EGRIPMSVIDRLVGDILRVKFITGIFDNPYQMNLKAADQEVNSERNQAVALQASRQSIVL 460
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVT 471
LKN LPL+ +K++ + V GP+A+ + +Y + + + G V+
Sbjct: 461 LKNQDRLLPLDRSKLRRILVCGPNADDASYALTHYGPLAVDVTTVLEGIRDKVENNIEVS 520
Query: 472 YKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
Y GCD V + I A AK +D I++ G + E+ R
Sbjct: 521 YAKGCDVVDPHWPESEIIGYPMTSQEQQDIDHAVALAKESDVAIVVLGGNSRTCGENKSR 580
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
L LPG Q L+ V K PV+LV+++ + + +A+ I AI+ A YPG +GG
Sbjct: 581 SSLDLPGRQLDLLKAVQATGK-PVVLVLINGRPLSVNWAD--RFIPAIVEAWYPGSQGGT 637
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLT--SMPLRPVD---SLGYPGRTYKFYNGP 632
A+ADV+FG +NPGG+L +T+ V +P S P VD LG G + NG
Sbjct: 638 AVADVLFGDYNPGGKLTVTFPKS--VGQIPFNFPSKPASQVDGGNKLGLQGNASRI-NG- 693
Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
LY FG+GLSYT FKY+ L +K + +
Sbjct: 694 ALYSFGHGLSYTTFKYSNLRLSKET-------------------------------MTLN 722
Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
D D N G +G +VV +Y + TY K + GF R+ ++ G K + F
Sbjct: 723 DSINISCDVSNTGDREGDEVVQLYIRDVISSVTTYEKNLRGFDRIHLKPGETKTLTFTIK 782
Query: 753 ACKSLNIVDYAANTLLPAGEHTIFVG 778
+ L +V+ ++ GE I +G
Sbjct: 783 P-EHLKLVNKDFEKVVEPGEFKIMIG 807
>gi|313204470|ref|YP_004043127.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443786|gb|ADQ80142.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 746
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 223/762 (29%), Positives = 356/762 (46%), Gaps = 111/762 (14%)
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG----------------- 110
L+ +MTLDEK+ QL ++ G E E VG
Sbjct: 32 LIRQMTLDEKIGQLNQYSSDWESTGKITAEGDKETQIRQGKVGSMLNVTGVDKTRKLQEL 91
Query: 111 --------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG 161
P DVI G T+FP + TAS++ +L +K + +TEA A G
Sbjct: 92 AMQSRLHIPMIFGLDVIHGFRTTFPIPLGETASWDLALIEKSARIAATEASAY------G 145
Query: 162 LTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPL 219
+ + ++P +++ARDPRWGR+ E GED ++ A V G Q + G+ +A
Sbjct: 146 VQWTFAPMVDIARDPRWGRVMEGAGEDTYLGSLVAKARVHGFQGNGLGNVDA-------- 197
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+ +C KH+AAY G D D + + + ET+L PF+ V E + ++ M S+N
Sbjct: 198 -IMACAKHFAAYGA-AIGGRDYNSVDMSLRQ--LNETYLPPFKAAV-EANVATFMNSFND 252
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
+NGIP+ A+ + ++G+W+ G++V+D SI M+ H + DS DA + + AG
Sbjct: 253 INGIPATANKYIQRDILKGQWNFKGFVVSDWGSIGEMI-AHGYAKDSY-DAAMKAINAGS 310
Query: 340 DLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICS 398
D+D + Y N VQ GKV + ID+++K + LG FD ++ + ++ +
Sbjct: 311 DMDMESRCYRNNLKQLVQDGKVDISVIDEAVKRILVKKFELGLFDDPYRFCNAAREKKQT 370
Query: 399 D--ENIELAAEAAREGIVLLKND-----QNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ EN A E ++ IVLLKN+ + LPL S + KTVA++GP AT A G ++
Sbjct: 371 NNPENRAFAREIGKKSIVLLKNEPLSNGKTLLPL-SKQTKTVALIGPLFKATKANHGFWS 429
Query: 452 -GIPCRYMSPIAGFSGYAN-------VTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
P I+ + G N + Y GC+ + A AAK+AD I+
Sbjct: 430 IAFPDDSTRIISQYQGIKNQLDKSSSIVYAKGCN-INDNDKTGFAEAINAAKSADVVIMS 488
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
G + E+ + +L LPG Q +L+ ++ + K PV+L++ + G + F + NI
Sbjct: 489 LGEAADMSGEAKSKSNLQLPGVQEELLKEIYKTGK-PVVLLLNA--GRPLIFNWASDNIP 545
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVD 617
+IL+ + G E G AIADV+FG +NP G+LPI++ + +P+ T P + +
Sbjct: 546 SILYTWWLGTEAGNAIADVLFGDYNPAGKLPISFPRTE--GQIPIYYNHFNTGRPAKDEN 603
Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
Y N P YPFGYGLSYT+F NL +SD
Sbjct: 604 DKNYVSAYIDLQNSPK-YPFGYGLSYTKF-------------------DISNLKLSSDK- 642
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
L + VD N G+ DG +VV +Y + +K++ GFQ++
Sbjct: 643 -----------LSSGNKLTVTVDIANTGNYDGEEVVQLYVRDLVGSVVRPVKELKGFQKL 691
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
++ G K++ F + L + + AG++ +FVGN
Sbjct: 692 MLKKGETKQLTFTLTP-EDLKFFNNEIQYINEAGDYELFVGN 732
>gi|383115617|ref|ZP_09936373.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
gi|313694979|gb|EFS31814.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
Length = 946
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 238/831 (28%), Positives = 375/831 (45%), Gaps = 149/831 (17%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW 98
F+K G++ ++ D S P R++DL+ +MTL+EK Q+ +G R+ LP EW
Sbjct: 44 FNKNGMKD---IYEDPSAPVDARIEDLLKQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEW 99
Query: 99 ----WS-------EALHGVSNVG-PGTHFDDVIPG------------------------- 121
W E L+G G P + + V P
Sbjct: 100 KNQLWKDGIGAIDEHLNGFQQWGLPPSDNEYVWPASRHAWALNEVQRFFIEETRLGIPTD 159
Query: 122 -------------ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSP 167
AT+FPT + ++N L +++G EAR + G T ++P
Sbjct: 160 FTNEGIRGVESYKATNFPTQLGLGHTWNRELIRQVGVITGREARML------GYTNVYAP 213
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
++V RD RWGR E GE P++V + VRG+Q + +V++ KH
Sbjct: 214 ILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQ-------------QDYQVAATGKH 260
Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
+ AY + D +++ +++E + PF+ ++E VM SYN +G P +
Sbjct: 261 FIAYSNNKGGREGMSRVDPQMSPREVEMVHVYPFKRVIREAGLLGVMSSYNDYDGFPIQS 320
Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG--- 344
L +RGE GY+V+D D+++ + H D KE AV Q+++AGL++ C
Sbjct: 321 SYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHNTAKDMKE-AVRQSVEAGLNVRCTFRS 379
Query: 345 -QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDEN 401
Y V++G + E I+ ++ + V +G FD P L D + EN
Sbjct: 380 PDSYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFD-HPYQTDLKGADEEVEKAEN 438
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
E+A +A+RE IVLLKNDQ+ LPL+ + +K +AV GP+A+ +G+Y + S +
Sbjct: 439 EEVALQASRESIVLLKNDQDVLPLDISGIKKIAVCGPNADECSYALGHYGPLAVEVTSVL 498
Query: 462 AGFS----GYANVTYKTGCDDVAC--------------KSNNSIFAASEAAKTADATIIL 503
G G V Y GC+ V + I A AK AD +++
Sbjct: 499 KGIQEKTDGKVEVLYSKGCELVDANWPESELIDFPLTEEEQKEIDRAVSQAKEADVAVVV 558
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
G E+ R L LPG Q L+ V K PV+LV+++ + I +A + +
Sbjct: 559 LGGGQRTCGENKSRSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVP 615
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLG 620
AIL A YPG +GG+A+ADV+FG +NPGG+L +T+ V +P + P +P +D
Sbjct: 616 AILEAWYPGAKGGKAVADVLFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGK 672
Query: 621 YPGR--TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
PG NG LY FG+GLSYT F+Y+ L T
Sbjct: 673 NPGMDGNMSRANG-ALYAFGHGLSYTSFEYSDLKIT------------------------ 707
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
P V+ + + Y KV N G G +VV +Y + TY K + GF+R+
Sbjct: 708 ---PAVITPNQKT--YVTCKV--TNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERIH 760
Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
++ G K + F + K+L +++ + ++ G+ T+ V G S I LN
Sbjct: 761 LKPGETKEVFFPIDR-KALELLNADMHWVVEPGDFTLMV--GASSTDIRLN 808
>gi|298387489|ref|ZP_06997041.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
gi|298259696|gb|EFI02568.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
Length = 950
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 228/749 (30%), Positives = 354/749 (47%), Gaps = 109/749 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ D+SLP RV+ L++ MT ++K++ + G G+P L +P EA+HG S
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 223
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GAT FP + A++N L +++ + E A N +A WSP ++V
Sbjct: 224 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
A+D RWGR ET GEDP +V + +++G Q SR L + KH+ +
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 315
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++E++M E L PF ++ D S+M +Y+ G+P +L
Sbjct: 316 GAP-LGGRDSH--DIGLSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKEL 372
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
L Q +R EW +G+IV+DC +I + + A K +A Q L AG+ +CG Y N
Sbjct: 373 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKE 432
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
A + G++ D+D + + + R F+ +P L + I SD + E+A
Sbjct: 433 VIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 491
Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
+AARE IV+L+N +N LPL S + T+AV+GP A+ G+Y +P + S + G
Sbjct: 492 QAARESIVMLENKENLLPL-SKTLCTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 548
Query: 465 SG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
G V Y+ GCD N I A +AA +D I++ G + EA
Sbjct: 549 KGAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTC 607
Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ D L LPG Q +L+ V K PVIL++ + DI A + KAIL P
Sbjct: 608 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 664
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+EGG A+ADV+FG +NP GRLP+T+ +V LPL + GR Y++ +
Sbjct: 665 GQEGGPAMADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 715
Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
LY FG+GLSYT F+Y+ L K+Q N N
Sbjct: 716 EYYPLYRFGFGLSYTSFEYSNL-----------KIQEKANGN------------------ 746
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
E + +NVGS G +V +Y T + ++ F R+ ++ G +K + F
Sbjct: 747 -----VEVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFARIHLQPGESKTVSF 801
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+++++ + ++ GE I VG
Sbjct: 802 EMTPY-DISLLNDRMDRVVEKGEFKIMVG 829
>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
Length = 826
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 219/831 (26%), Positives = 373/831 (44%), Gaps = 157/831 (18%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF--------AHGVPRLGLPQYEWWSEALHG 105
F + SLP + RV DL++R+T +E +QQ+ + A G+ RL + Y+W
Sbjct: 29 FRNPSLPANFRVDDLLARLTNEELIQQVSNGGAGPQHGPAPGIARLNISAYQW------- 81
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY- 164
+N G G T FP + A+F+ ++ +A E RA +N +A TY
Sbjct: 82 RTNPGDGR--------ITPFPQPVNLGATFDVHTVYRVARATGLEMRARWNRAKAKKTYR 133
Query: 165 -------WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE----NATD 213
++P +N+ R P WGR ET GEDPF++G+ A +VRGL + E + +
Sbjct: 134 DGNGIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGGWKNAEPQSLDEQN 193
Query: 214 LNSRP--LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
L+S+P L V + CKH+A + V R F+A VT+ D+ +T+L F C++ G A
Sbjct: 194 LSSQPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTYLPAFRACLEAG-AV 252
Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
SVMC+Y+ +NG P C + LL + +R +W G++V DC ++Q ++ H+ E A+
Sbjct: 253 SVMCAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVIWKHQIFNHYNETAM 312
Query: 332 AQTLKAGLDLDCGQYYT----NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF---DG 384
A ++AG++L+ Y + + + G + + + + L+ + G F +
Sbjct: 313 A-AVRAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLFLTRLMQGEFNPVEM 371
Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS------AKVKTVAVVGP 438
P + ++ I ++++ +A IVLL+N LPL + ++ +A+VGP
Sbjct: 372 DPYRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLKNNMSDSGGPLRHIAIVGP 431
Query: 439 HANATVAMIGNYAGIPCRYMS-PIAGFSGYANVTYKTGCDDVA-----CKS-NNSIFAAS 491
A + + G+Y P + P++ G + ++ + D+ C S N+ ++
Sbjct: 432 FATSVTELYGHYRTAPEPEIEVPLS--KGLSQLSRRMHASDICTDGGRCSSLNDDALHST 489
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-----------P 540
D ++ G VE E++DR+++ LPG Q +L+ + +++ G P
Sbjct: 490 LGYDDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEETLKLSSGLGNSGLSKRTVP 549
Query: 541 VILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK-------------- 586
+IL++ SAG ++I+ A N N+KAI W G+PG G A+ ++ G
Sbjct: 550 IILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHLLLGSSGELFGPSKPISVG 609
Query: 587 -------------------FNPGGRLPITWYNG-DYVQMLPLTSMPLRPVDSLGYPGRTY 626
+ P RLP TWY D + + + M +TY
Sbjct: 610 FHSFQEAYRWDVTPDDGYWWIPAARLPFTWYESIDQLANITVYEM----------TNQTY 659
Query: 627 KFYNG-----------PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
++ P LYPFGYGLSY +NL + + +L
Sbjct: 660 RYLPTQCHMSSEDCKIPVLYPFGYGLSY---NFNLSGASGFVYSDL-------------- 702
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK-----Q 730
P V+ + F V QN G +VV VY+K + Q
Sbjct: 703 ----IAPSSAVSS---NQRIVFYVTVQNEGPIACEEVVQVYTKWLNRTENDNSRNGPLIQ 755
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG 781
+ GF+RV + G K++KF + L + + NT++P G + + GG
Sbjct: 756 LAGFERVRLDVGEYKQLKFTLIPSEHLAVWSLSENTMIP-GRGVLQISVGG 805
>gi|255689951|ref|ZP_05413626.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624557|gb|EEX47428.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 735
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 215/779 (27%), Positives = 363/779 (46%), Gaps = 119/779 (15%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG----VPRLGLPQYEWWSEALHGVSN 108
L+ D+ +P RV DL+SRMTL+EK+ QL + G V +G E +
Sbjct: 29 LYKDAKVPIEKRVDDLLSRMTLEEKILQLNQYTMGRNNNVNNIG-------EEVKKVPAE 81
Query: 109 VGPGTHFD---------------------------DVIPG-ATSFPTVILTTASFNESLW 140
+G ++D D I G T +P + S+N L
Sbjct: 82 IGSLIYYDTNPTLRNNVQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLGQACSWNPELV 141
Query: 141 KKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYV 199
+K + EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G +A V
Sbjct: 142 EKACAVTAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASV 195
Query: 200 RGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLR 259
RG Q D+++ ++++C KHY Y R + ++ Q + +T+L
Sbjct: 196 RGYQ-------GDDMSAED-RIAACLKHYIGYGASE---AGRDYVYTEISRQTLWDTYLL 244
Query: 260 PFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN 319
P+EM VK G A+++M S+N ++GIP A+ + + ++ W G+IV+D +I+ +
Sbjct: 245 PYEMGVKAG-AATLMSSFNDISGIPGSANHYTMTEILKERWGHDGFIVSDWGAIEQL--K 301
Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMR 378
++ LA +K++A AGL++D + Y + V++GK+ +D+S++ + V R
Sbjct: 302 NQGLAANKKEAAVYAFNAGLEMDMMSHAYDRYMKELVEEGKITMAQVDESVRRVLRVKFR 361
Query: 379 LGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGP 438
LG F+ V+ K+ +++++AA+ A E +VLLKN+ LPL K +AVVGP
Sbjct: 362 LGLFERPYTPVTSEKERFFRPQSMDIAAQLAAESMVLLKNENQILPLTDK--KKIAVVGP 419
Query: 439 HANATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASE 492
A ++G++ G + Y F G A + Y GC + A E
Sbjct: 420 MAKNGWDLLGSWCGHGKDTDVVMLYNGLATEFVGKAELRYALGC-RTQGDNRKGFEEALE 478
Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
AA+ +D ++ G ++ E+ R + LP Q +L ++ +V K P++LV+++ ++
Sbjct: 479 AARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAKELKKVGK-PIVLVLVNGRPLE 537
Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT--WYNGDYVQMLPLTS 610
+ E ++ AIL PG G +A ++ G+ NP G+L +T + NG
Sbjct: 538 LNRLEPISD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMTFPYSNG---------Q 586
Query: 611 MPL---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
+P+ R G+ G YK LYPFG+GLSYT+FKY +++ +
Sbjct: 587 IPIYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGVVTLS------------- 632
Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY 727
ASK ++ + +V N G DG + V + P
Sbjct: 633 --------ASK----------VKRGEKLSAEVTVTNTGKRDGLETVHWFISDPYCSITRP 674
Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
+K++ F++ ++AG K +F + + L VD L AGE+ I V + V +
Sbjct: 675 VKELKYFEKQSIKAGETKIFRFDIDLERDLGFVDGNGKRFLEAGEYYIQVKDQKVKIEL 733
>gi|383115540|ref|ZP_09936296.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
gi|313695055|gb|EFS31890.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
Length = 770
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 222/818 (27%), Positives = 382/818 (46%), Gaps = 148/818 (18%)
Query: 36 VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
+C S L Q S + D +LP S RV L+S+MTL+EKV Q+ + G+ + +
Sbjct: 10 ICCAIGISTLACQDKSKDYTDPTLPVSERVSSLMSQMTLEEKVAQMCQYV-GLEHMKKAE 68
Query: 96 YEWWSEALHGVSNVG--PGTHFDDV----------------------------------I 119
+ +E L + G P H DV I
Sbjct: 69 KDMSAEDLKHSHSQGFYPNLHSSDVEEMTKKGLISSFLHVVKAEEANYLQSLAQQSRLKI 128
Query: 120 P---------------GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
P G+T +PT I A+F+ +L +++ + + E RA +G+ +
Sbjct: 129 PLLIGIDAIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHW 182
Query: 165 -WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKV 221
++PN+ VARD RWGR+ ET GEDP++VG+ VRG Q D G++ KV
Sbjct: 183 TFTPNVEVARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGND----------KV 232
Query: 222 SSCCKHY--AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+C KH + + G A ++E+ ++E F PF+ C++ G +VM ++N
Sbjct: 233 IACAKHLVGGSQPANGINGAP-----AELSERTLQEVFFPPFKDCLEAG-VFTVMTAHNE 286
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
+NGIP + L+ + +R +W G++V+D I+ M D H +A++ +DA ++ AG+
Sbjct: 287 LNGIPCHGNKYLMTEVLRNQWKFDGFVVSDWMDIERMHDYHN-VAETLKDAYRISVDAGM 345
Query: 340 DLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--I 396
+ G + V++G + E ID ++ + V RLG F+ ++ L K+D +
Sbjct: 346 GMHMHGPEFYEAIIECVKEGSIPEKQIDAAVSKILEVKFRLGLFENP--FIDLKKKDEIV 403
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-GIPC 455
++++ + A E AR+ IVLLKN+ N LPL+++K K V V G +AN +++G++A P
Sbjct: 404 FNEKHQQTALEGARKSIVLLKNEGNMLPLDASKYKKVFVTGHNAN-NQSILGDWAMEQPE 462
Query: 456 RYMSPIAGFSGYANVTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAG---- 505
+++ + G ++ +T + +V S+N I A + A+ +D I++ G
Sbjct: 463 EHVTTV--LKGLKAISPETNYNFLDLGWNVRLLSDNQIKEAVQQARNSDLAILVVGENSM 520
Query: 506 ---LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
+ E+ DR +L LPG Q +L+ VA V++++ G + + N+
Sbjct: 521 RYHWNEKTCGENSDRYELSLPGRQQELVKAVAATGVPTVVILV---NGRPLTTEWIDENM 577
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
I+ A PG GG+A+A++++GK NP G+LPIT +P ++ ++ + + +
Sbjct: 578 PCIIEAWEPGVAGGQALAEILYGKVNPSGKLPIT---------IPRSTGQIQCMYNHKFT 628
Query: 623 GRTYKFYNGPTL--YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
+ + G +L Y FGYGLSYT +KY L ++ T
Sbjct: 629 NHWFPYATGNSLPLYEFGYGLSYTTYKYENLKLSEA----------------------TI 666
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
P D + VD N G DG + V +Y + A +K++ F R+ ++
Sbjct: 667 TP---------DKSVKVTVDVTNTGKMDGEETVQLYIRDEYSSATRPVKELKDFARIPLK 717
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
AG K + F + L+ D + + G I VG
Sbjct: 718 AGETKEVSFTLTP-EMLSYYDANMHYGVEKGTFKIMVG 754
>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
Length = 882
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 165/423 (39%), Positives = 237/423 (56%), Gaps = 40/423 (9%)
Query: 68 LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
LV++MT EK+ Q + A +PRLG+P Y+WWSE LHG++ G AT FP
Sbjct: 37 LVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNL----GR-----AGLTYWSPNINVARDPRWG 178
I AS+N L + +G STEARA +NL G+ AGLT WSPNIN+ RDPRWG
Sbjct: 87 AIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R ET GEDP++ + AV+++RGLQ D P +++ KH+A V +
Sbjct: 147 RGMETYGEDPYLTSQLAVSFIRGLQG--------DTPDHPRTIATP-KHFA---VHSGPE 194
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
R+ FD V+ D+E T+ F + +G A SVMC+YN ++G P+CA LLN +R
Sbjct: 195 QGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRN 254
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
+W +G++V+DCD+I+ M H F D+ A A LK+G DL+CG Y + A+ +G
Sbjct: 255 DWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGNDLNCGNTYRDLN-QAIARG 312
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLL 416
+ E+ +D++L L+T RLG Y ++G + I + + LA +AA + +VLL
Sbjct: 313 DIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAHRALALQAAAQSLVLL 372
Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYK 473
KN NTLPL T+AV+GP A++ A+ NY G ++P+ G G A V Y
Sbjct: 373 KNSGNTLPL--PPETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLRTRFGTAKVHYA 430
Query: 474 TGC 476
G
Sbjct: 431 QGA 433
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 99/301 (32%), Positives = 140/301 (46%), Gaps = 55/301 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A A ADA + GL VE E L DR + LP Q L+ V K
Sbjct: 604 AERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK- 662
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
P+I+V+MS V + +A+ + + AIL A YPG+ GG AIA + G NPGGRLP+T+Y
Sbjct: 663 PLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYR 720
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
Q LP P S GRTY+++ G LYPFGYGLSYTQF Y
Sbjct: 721 S--TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAYE---------- 761
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
P + L+ + +N G+ G +VV +Y +P
Sbjct: 762 ---------------------APQLSTATLKAGNTLTVTTHVRNTGTRAGDEVVQLYLEP 800
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
P A ++ ++GF+RV +R G ++ + F +A + L+ V + AG + +FVG
Sbjct: 801 PYSPQAP-LRSLVGFKRVTLRPGESRLLTFTLDA-RQLSSVQQTGQRSVEAGHYHLFVGG 858
Query: 780 G 780
G
Sbjct: 859 G 859
>gi|223936933|ref|ZP_03628842.1| Beta-glucosidase [bacterium Ellin514]
gi|223894502|gb|EEF60954.1| Beta-glucosidase [bacterium Ellin514]
Length = 774
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 242/808 (29%), Positives = 370/808 (45%), Gaps = 162/808 (20%)
Query: 64 RVKDLVSRMTLDEKVQQL---------------GDF-----------AHGVPRLGLPQ-- 95
RVKDL++RMTL+EK Q+ G+F HG+ ++G P
Sbjct: 21 RVKDLLARMTLEEKAAQMMCVWQEKAAKLLDGNGNFDPAKAKAAFKKGHGLGQVGRPSDA 80
Query: 96 ------------YEWWSEALHGV-------SNVG-PGTHFDDVIPG-----ATSFPTVIL 130
+E + + S +G P ++ + G TSFP I
Sbjct: 81 GSDPATPANGKTARGMAELTNAIQKFFLEHSRLGIPVMFHEECLHGHAARDGTSFPQPIG 140
Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFV 190
A+FN +L +K+ + E R R G +P ++VARD RWGR+ ET GEDPF+
Sbjct: 141 LGATFNPALVEKLYAMTAHETRV-----RGGHQALTPVVDVARDARWGRVEETYGEDPFL 195
Query: 191 VGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV----DNWKGVDRYHFDA 246
+ + VRG Q G + D V + KH+AA+ N V+
Sbjct: 196 NTQLGIAAVRGFQ---GDASFKDKKH----VIATLKHFAAHGQPESGQNCAPVN------ 242
Query: 247 RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYI 306
V+E+ + ETFL PF C+K+G A SVM SYN ++G+PS A LL +R EW G++
Sbjct: 243 -VSERLLRETFLHPFRDCLKKGGAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFV 301
Query: 307 VADCDSIQVMV---DNH-KFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQ 357
V+D +I + D+H +A K++A +KAG+++ DC ++ V++
Sbjct: 302 VSDYYAIWELSHRPDSHGHHVAADKKEACVLAVKAGVNIEFPEPDCYRHLVEL----VRK 357
Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
+ ET++D+ + + ++G FD + + + + ELA+EAARE I LLK
Sbjct: 358 KVLHETELDELIAPMLLWKFKMGLFDDPYVDPEEAARVVGCEVHRELASEAARETITLLK 417
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYANVTYK 473
N+ + LPLN AK+KTVAV+GP+AN + ++G Y+G+P ++ + G G V +
Sbjct: 418 NENDLLPLNPAKLKTVAVIGPNANRS--LLGGYSGVPAHNVTVLDGIKARLGGAVKVVHA 475
Query: 474 TGC----------DDV----ACKSNNSIFAASEAAKTADATIILAGLD--LSVEAESL-- 515
GC D+V + I A + A +AD I+ G + S EA SL
Sbjct: 476 EGCKITVGGSWQQDEVLASDPAEDRKQIDEAVKVAWSADVVIVAIGGNEQTSREAWSLKH 535
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR L L G+Q +LI + K PV+ ++ + G +A N+ AIL Y G+
Sbjct: 536 MGDRTSLDLIGHQDELIRALLATGK-PVVALVFN--GRPLAINHVAQNVPAILECWYLGQ 592
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNG 631
E G A+A V+FG NPGG+LPI+ +P + L PV P R + +
Sbjct: 593 ECGSAVAAVLFGDHNPGGKLPIS---------IPRSVGQL-PVFYNHKPSARRGFLWDEA 642
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
L+PFG+GLSYT+F + + K I S+T V
Sbjct: 643 TPLFPFGFGLSYTKFTFKNVRLAKKI------------------ISRTGSTHV------- 677
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
VD N G G++VV VY + +K++ FQ++ + G K +
Sbjct: 678 ------SVDVTNAGKRAGTEVVQVYVRDLISSVTRPVKELKVFQKITLAPGETKTVSLDL 731
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGN 779
+SL D ++ GE I VGN
Sbjct: 732 TP-ESLAFYDVNMKYVVEPGEFEIMVGN 758
>gi|317503000|ref|ZP_07961085.1| beta-glucosidase, partial [Prevotella salivae DSM 15606]
gi|315665888|gb|EFV05470.1| beta-glucosidase [Prevotella salivae DSM 15606]
Length = 770
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 231/816 (28%), Positives = 368/816 (45%), Gaps = 166/816 (20%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV------------------- 88
+M L+ + + + RV DL+ RMTL+EKV Q+ G+
Sbjct: 20 KMEKPLYKNPNASVAQRVDDLLRRMTLEEKVGQMNQLV-GIEHFKTNSITMSAEELATNT 78
Query: 89 -----PRLGLPQYEWW------SEALHGVSNVGPGTHFD----------------DVIPG 121
P + + + E+W S LH V + + D I G
Sbjct: 79 ATAFYPGVTVSEIEYWVRRGWVSSFLH-VLTLEEANYLQKLSMQSRLQIPLIIGIDAIHG 137
Query: 122 A------TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS--PNINVAR 173
T +PT I +SF+ L KI + + E RAM +W+ PN+ VAR
Sbjct: 138 NAKCKNNTVYPTNIGLASSFDVDLAYKIARQTAEEMRAMN-------MHWNFNPNVEVAR 190
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY--AAY 231
D RWGR ET GEDP++V + V +G Q +N +D V C KH+ +Y
Sbjct: 191 DGRWGRCGETFGEDPYLVMQMGVATNKGYQ--RNLDNTSD-------VLGCVKHFVGGSY 241
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
++ G V+E+ + E F PF+ +++G +VM S+N +NGIP + L
Sbjct: 242 SINGTNGAP-----CDVSERTLREVFFPPFKATLQQGGDWNVMMSHNELNGIPCHTNRWL 296
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNF 350
+ +R EW G+IV+D I+ VD H D+KE A Q++ AG+D+ G +
Sbjct: 297 MTDVLRKEWGFQGFIVSDWMDIEHCVDQHHTAKDNKE-AFYQSIMAGMDMHMHGPEWQKD 355
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAR 410
V++G++ E+ ID+S++ + TV RLG F+ V + I + + A +A+R
Sbjct: 356 VVELVREGRIPESRIDESVRRILTVKFRLGLFEHPYSDVKTRDRVINDPVHKQTALDASR 415
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC--RYMSPIAGF---S 465
E IVLLKN++ LPL+ K K V V G +AN M G+++ + + + + G S
Sbjct: 416 ESIVLLKNEKQLLPLDEQKYKKVLVTGINANDQNIM-GDWSELQPEDKVWTVLKGLKLVS 474
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG-------LDLSVEAESLDRE 518
+ + + D S + + AA EAAK +D I+ G + E DR+
Sbjct: 475 PHTDFRFVDQGWDPRNMSQSQVDAAVEAAKESDLNIVCCGEYMMRFRWNERTSGEDTDRD 534
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
+L L G Q QLI ++ E K P IL+I+S + + +A ++ AI+ A PG+ GG+A
Sbjct: 535 NLELVGLQEQLIRRLNETGK-PTILIIISGRPLSVRYAA--DHVPAIVNAWEPGQYGGQA 591
Query: 579 IADVVFGKFNPGGRLPIT----------WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
IA++++GK NP +L +T WYN R+ F
Sbjct: 592 IAEILYGKINPSAKLAMTIPRHVGQISSWYNHK----------------------RSAYF 629
Query: 629 Y------NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
+ N P LYPFGYGLSYT+FKY+ L + T+ N K
Sbjct: 630 HPAVCADNTP-LYPFGYGLSYTKFKYSNLVLSDTVIENDGK------------------- 669
Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
+ ++ +N+G+ +G++V +Y A +K++ F+RV ++AG
Sbjct: 670 ----------SAIKAQITIENIGNREGTEVCQLYINDIVSSVARPVKELKDFRRVTLKAG 719
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ I+F+ K L D + GE + +G
Sbjct: 720 EKQTIEFIITPDK-LAFYDVDMKLKIEPGEFKVMIG 754
>gi|146298537|ref|YP_001193128.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
UW101]
gi|146152955|gb|ABQ03809.1| Candidate beta-glycosidase; Glycoside hydrolase family 3
[Flavobacterium johnsoniae UW101]
Length = 745
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 237/797 (29%), Positives = 355/797 (44%), Gaps = 141/797 (17%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD---FAH-GVPRLGLPQYEWWSEAL 103
Q ++ + S + + L+S+MTL+EK+ L FA+ GV RLG+P+ + L
Sbjct: 33 QTEEYVGKEISTDHDAEIDKLISQMTLEEKIGMLHGNSMFANAGVKRLGIPELKMADGPL 92
Query: 104 HGV------SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
GV N P +D AT +P A++N + G ++ E RA
Sbjct: 93 -GVREEISRDNWAPAGWTNDF---ATYYPAGGALAATWNAEMAHTFGTSLGEELRA---- 144
Query: 158 GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
R SP IN+ R P GR E EDPF+ + AV V GLQ+ +
Sbjct: 145 -RDKDMLLSPAINMVRTPLGGRTYEYMSEDPFLNKKIAVPLVVGLQEKD----------- 192
Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
V +C KHYAA N + +R D ++ E+ + E +L FE VKE A S+M +Y
Sbjct: 193 ---VMACVKHYAA----NNQETNRDFVDVQIDERTLREIYLPAFEATVKEAKAYSIMGAY 245
Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
N+ G C + +LN+ +R EW G +V+D ++ + A++LK
Sbjct: 246 NKFRGEYLCENDYMLNKILRDEWGFKGVVVSDWAAVH---------------STAKSLKN 290
Query: 338 GLDLDCGQ-------YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS 390
GLD++ G + + AV+ G+V E +ID +K + VL ++ G +
Sbjct: 291 GLDIEMGTPKPFNEFFLADKLIAAVKSGEVSEKEIDLHVKRILRVLFQVKAMGGGER--- 347
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
K I ++ + + A + A E I+LLKN+ N LPL VK++AV+G +A A+ G
Sbjct: 348 -AKGSIATEAHYQDAYKIAAEAIILLKNENNALPLKLDGVKSIAVIGNNATKKNALGGFG 406
Query: 451 AGIPC-RYMSPIAGF-------------SGY------------ANVTYKTGCDDVACKSN 484
AG+ R ++P+ G GY N+T TG +
Sbjct: 407 AGVKTKREVTPLEGLKNRLPSSVKINYAEGYLEKYEEKNKGNLGNIT-STGPVTIDKLDP 465
Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
+ A EAAK +D II AG + E E+ DR DL LP Q +LI +V E P +V
Sbjct: 466 AKVQEAVEAAKKSDVAIIFAGSNRDYETEASDRRDLHLPFGQEELIKKVIEA--NPKTIV 523
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
+M AG E + A++W+ + G EGG A+ADV+ GK NP G+LP W ++
Sbjct: 524 VMIAGA-PFDLNEVSQKSSALVWSWFNGSEGGNALADVILGKVNPSGKLP--WTMPKQLK 580
Query: 605 MLPLTSMPLRPVDS---------LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTK 655
P + P D +GY R + N LYPFGYGLSYT F
Sbjct: 581 DSPAHATNSFPGDKAVNYAEGILIGY--RWFDTKNVAPLYPFGYGLSYTTFAL------- 631
Query: 656 TIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIV 715
D +KT ND+ E VD +N G DG +VV +
Sbjct: 632 -------------------DNAKTDKDSYAQNDV-----IEVTVDVKNTGKVDGKEVVQL 667
Query: 716 YSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN--TLLPAGEH 773
Y+ +++ GF++ V+AG +++I K L D AA T+ P G++
Sbjct: 668 YTSKSDSKITRAAQELKGFKKADVKAGGSEKITIKV-PVKELAYYDVAAKKWTVEP-GKY 725
Query: 774 TIFVGNGGVSFPIHLNF 790
TI +G +NF
Sbjct: 726 TIKLGTSSRDIKKEINF 742
>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
6_1_58FAA_CT1]
Length = 819
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 228/823 (27%), Positives = 354/823 (43%), Gaps = 151/823 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WSEALHG 105
+F + P RV+DL+S+M LDEK QL +G R+ LP EW W + +
Sbjct: 52 VFENPKQPIEKRVQDLLSQMNLDEKTCQLATL-YGYKRVMSDSLPTPEWKNKIWKDGIAN 110
Query: 106 V----SNVGPGTHF-----------------------------------DDVIPG----- 121
+ + VG G ++ I G
Sbjct: 111 IDEQLNGVGRGAKIAQDLIYPFSKHAEAINKTQKWFIEETRLGIPVDFSNETIHGLNHTK 170
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
AT P I +++N L K G EA+A LG + ++P +++ARDPRWGR+
Sbjct: 171 ATPLPAPIGIGSTWNAPLVYKAGSIAGKEAKA---LGYTNI--YAPILDLARDPRWGRVL 225
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
E GEDPF+V V+G+Q+ +G V++ KH+A Y V
Sbjct: 226 ECYGEDPFLVATLGTQMVKGIQE-QG-------------VAATLKHFAVYSVPKGGRDGS 271
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
D V ++M + L PF+ +++ VM SYN +G+P A L Q +R E+
Sbjct: 272 VRTDPHVAPREMHQMHLYPFKKVIQDAHPMGVMSSYNDWDGVPVTASYYFLTQLLRQEFG 331
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG--------- 352
GY+V+D D+++ + + H +A++ E+AV L+AGL++ T F
Sbjct: 332 FDGYVVSDSDAVEYVYNKH-HVAETYEEAVRMVLEAGLNV-----RTTFAAPDIFILPAR 385
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAR 410
V++G++ ID+ + + V RLG FD P D + +D+N + + R
Sbjct: 386 KLVKEGRLSMKVIDERVADVLRVKFRLGLFD-QPFVADPKAADKIVGADKNKDFVLDIQR 444
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN- 469
+ +VLLKN+ N LPL+ K+ + + GP A M+ Y ++ G Y
Sbjct: 445 QSLVLLKNENNLLPLDKNKLSRILITGPLAKEENYMVSRYGPQELENITVYEGIKNYLGN 504
Query: 470 ---VTYKTGC--------------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
V Y GC + + I A E AK +D I + G D
Sbjct: 505 KVAVDYALGCKVKDAKWPESEIIHSPLTTEEQQEIQNAVEKAKLSDIVIAVLGEDEESTG 564
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
ES R L LPG Q QL+ + K PV+LV+++ + I +A + I AIL A +PG
Sbjct: 565 ESKSRSGLDLPGRQQQLLEALYATGK-PVVLVLINGQPLTINWA--DRYIPAILEAWFPG 621
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP-----GRTYK 627
+ GG AIA+ +FG +NPGG+LP+T+ + + L + P +P P G
Sbjct: 622 QMGGTAIAETLFGDYNPGGKLPVTF--PKTLGQIEL-NFPFKPASQSKQPEAGPNGYGKT 678
Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
NG LYPFG+GLSYT F+Y+ L + Q +Q
Sbjct: 679 RVNG-ALYPFGFGLSYTTFEYSNLKVSPERQGPKGDIQ---------------------- 715
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
D N G G ++V +Y K +Y + GF+RV ++ G K I
Sbjct: 716 ---------VSFDITNTGKRAGDEIVQLYVKDKVSSVISYESLLRGFERVSLQPGETKNI 766
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
+F + + L I+D N + GE + +G + +F
Sbjct: 767 QFTLHP-EDLEILDINMNWNVEPGEFEVRIGASSEDIKLKKSF 808
>gi|224536377|ref|ZP_03676916.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522015|gb|EEF91120.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
DSM 14838]
Length = 954
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 228/760 (30%), Positives = 358/760 (47%), Gaps = 111/760 (14%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHG 105
+ +S + D +LP RV+ L+S MT ++K++ + G G+P L +P EA+HG
Sbjct: 164 EKTSLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHG 222
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
S GAT FP + A++N+ L + + AV E L + W
Sbjct: 223 FSYGS----------GATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAAGTMQAW 267
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SP ++VA+D RWGR ET GEDP +V + +++G Q + L + P
Sbjct: 268 SPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-------SKGLFTTP------- 313
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+ + G D + D ++E++M E L PF ++ D SVM +Y+ G+P
Sbjct: 314 KHFGGHGAP-LGGRDSH--DIGLSEREMREVHLVPFRHVIRNYDCQSVMMAYSDYLGVPV 370
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+LL+ +R EW G+IV+DC +I + + A K +A Q L AG+ +CG
Sbjct: 371 AKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGD 430
Query: 346 YYTNF-TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDE 400
Y + A + G++ ++D+ + + ++ R F+ +P L I SD
Sbjct: 431 TYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPNK-PLDWNKIYPGWNSDS 489
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYM 458
+ E+A +AARE IV+L+N N LPL + ++T+AVVGP A+ G+Y +P +
Sbjct: 490 HKEMARQAARESIVMLENKDNILPL-AKDMRTIAVVGPGADDLQP--GDYTPKLLPGQLK 546
Query: 459 SPIAGFS----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-- 512
S + G V Y+ GCD + N I A +AA +D +++ G + E+
Sbjct: 547 SVLTGIKQAVGKQTKVVYEQGCDFTSSNGTN-IPKAVKAASQSDVVVLVLGDCSTSESTT 605
Query: 513 -------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
E+ D L LPG Q +L+ V K PVIL++ + G ++ + KAI
Sbjct: 606 DVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQA--GRPYNLSKASELCKAI 662
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
L PG+EGG A ADV+FG +NP GRLP+T+ +V LPL + GR
Sbjct: 663 LVNWLPGQEGGPATADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRR 713
Query: 626 YKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
Y++ + LY FGYGLSYT F+Y+ L K+Q N N A+
Sbjct: 714 YEYSDMEFYPLYYFGYGLSYTSFEYSGL-----------KIQEKDNGNVAIQAT------ 756
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
+NVG G +VV +Y T I ++ F RV ++ G
Sbjct: 757 -----------------VKNVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVHLQPGE 799
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+K + F + L++++ + ++ GE I V GGVS
Sbjct: 800 SKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILV--GGVS 836
>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
Length = 721
Score = 267 bits (683), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 223/731 (30%), Positives = 343/731 (46%), Gaps = 100/731 (13%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ F + +L R+ DL+SRMTL EK+Q LGD GVPRLG+P E LHG + GP
Sbjct: 24 YPFQNPALSPDQRIDDLLSRMTLQEKIQALGDDP-GVPRLGIPG-ALTEEGLHGAAIGGP 81
Query: 112 GTHFDD----VIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR-AMYNLGRAGLTYWS 166
H++ V+P T FP +++ +L +K + E R A+ GL +
Sbjct: 82 A-HWEGRGRAVVP-TTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHDGGLIVRA 139
Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
PN N++RDPRWGR E+ GEDP++VG AV +++GLQ N R + ++ K
Sbjct: 140 PNANLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQGN---------NPRYWETAALMK 190
Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
H+ AY + + +F R+ E + PF M +++G + + M SYN NGIP
Sbjct: 191 HFDAYSNEANRDGSSSNFGKRL----FYEYYSVPFRMGIEQGHSDAFMTSYNAWNGIPMT 246
Query: 287 ADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
A+P +L V +W +G I D ++ MV + + E A A + AG++ +Y
Sbjct: 247 ANP-VLKSVVMKKWGFNGIICTDAGALSNMVTHFHYYKTMPE-AAAGAVHAGINQFLDRY 304
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG-------KQDIC 397
A+QQ + E ID+ LK +Y V++RLG D S Y +G K D
Sbjct: 305 QQPVE-EALQQKLLTEQQIDQDLKGVYRVVLRLGLMDPSSMSPYSMIGLTNDNPAKGDPW 363
Query: 398 S-DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
+I L + E IVLLKN + LPL++ K+ ++AV+GP AN + + Y+G P
Sbjct: 364 DWPSHIALDRKVTDESIVLLKNQNHALPLDAKKLHSIAVIGPWAN--IVALDWYSGTPPF 421
Query: 457 YMSPIAGFSGYANVTYKTGCD-DVACKSNNSIFAASEAAKTADATIILAGLDLSVEA--- 512
++P+ G + + G D V +++ AA+ AK +D I++ G + +A
Sbjct: 422 GVTPVEG------IRQRVGPDVKVTFNDGSNLQAAAALAKQSDEAIVIIGNHPTCDAGWG 475
Query: 513 ---------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
E+ DR L LP + I + A ++V+ ++ + T +I
Sbjct: 476 KCALPSEGKEAFDRTALNLP---DESIAKAVYAANPHTVVVLQTSFPYTTDW--TQAHIP 530
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AIL + EE G A+ADV+FG ++P GRL TW Q+ P+ +R G
Sbjct: 531 AILEMAHNSEEQGTALADVLFGDYDPAGRLAQTWV-ASIGQLPPMMDYNIR-------DG 582
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
RTY + LYPFG+GLSYT FKY+ NL +S
Sbjct: 583 RTYMYLKSKPLYPFGFGLSYTTFKYS-------------------NLRLSS--------- 614
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
+ L VD N G +G +VV +Y K + ++ + GF RV + G+
Sbjct: 615 ---HTLPAGGQLTVSVDVTNTGKYNGDEVVQMYVKHLDSKVSRPLEALKGFDRVSIPVGQ 671
Query: 744 NKRIKFVFNAC 754
+ + A
Sbjct: 672 TRTVTLPLKAS 682
>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
CL02T12C05]
Length = 854
Score = 267 bits (683), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 158/427 (37%), Positives = 244/427 (57%), Gaps = 41/427 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ D + P R+ DL+S++T++EK+ L + G+PRL + +Y +EALHGV V PG
Sbjct: 27 VYLDMNAPRHERILDLLSKLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGV--VRPG 84
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A +N L +I +S EARA +N G L
Sbjct: 85 NF--------TVFPQAIGLAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQFSDLL 136
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G+ V++V+GLQ + R LK+
Sbjct: 137 TFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDD---------PRYLKIV 187
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + ++E+D+ E +L FE C+ EG A+S+M +YN +N
Sbjct: 188 STPKHFAANNEEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAIND 243
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC +V +HK++ + E A A +++AGLDL+
Sbjct: 244 VPCTLNNWLLKKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVK-TLEAAAALSIQAGLDLE 302
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSD 399
CG + Y NA +Q V E +ID + ++ MRLG FD Y + + +
Sbjct: 303 CGDEVYMEPLLNAYKQYMVSEAEIDSAAYHVLRARMRLGLFDDPALNPYNKISPSIVGCE 362
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
++ +LA EAAR+ IVLLKN++ LPL+S K+K++AVVG NA + G+Y+G P
Sbjct: 363 KHSKLALEAARQSIVLLKNEKKFLPLDSKKIKSIAVVG--INAGNSEFGDYSGTPVN--Q 418
Query: 460 PIAGFSG 466
P++ G
Sbjct: 419 PVSILEG 425
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 132/292 (45%), Gaps = 50/292 (17%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A + + D T+ + G++ S+E E DR + LP Q I + ++ V++++
Sbjct: 595 AGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVVVLV---A 651
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-L 608
G +A + +I AI+ A YPGE GG A+A+V+FG +NPGG+LP+T+Y + LP
Sbjct: 652 GSSLAINWMDEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRS--LDELPAF 709
Query: 609 TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+R GRTY+F+ G LY FG+GLSYT F Y LS
Sbjct: 710 DDYDIR-------KGRTYQFFEGDPLYAFGHGLSYTTFSYKKLSI--------------- 747
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY- 727
DA+ D +N G +G +V +Y K +
Sbjct: 748 ------DAA--------------GDVVSVSFTLKNTGKYEGDEVAQLYVKYQGSDSQVKL 787
Query: 728 -IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+KQ+ GF+R+ ++ G +K+I + + PAG++ VG
Sbjct: 788 PLKQLKGFERIHLKKGESKQINLTVPKSELRFWNEEKGEFYTPAGDYLFMVG 839
>gi|224536087|ref|ZP_03676626.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522306|gb|EEF91411.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
DSM 14838]
Length = 791
Score = 267 bits (683), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 227/818 (27%), Positives = 362/818 (44%), Gaps = 153/818 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEAL--HGVS 107
++ D S P RV DL+S+MTL+EK+ Q+ +G R+ LP+ E W +AL G+
Sbjct: 46 IYEDPSAPMEERVNDLLSQMTLEEKICQMATL-YGSGRVLEDALPE-EHWKQALWKDGIG 103
Query: 108 NV-----GPGTHFDDV-------------------------IP--------------GAT 123
N+ G GT + IP AT
Sbjct: 104 NIDEEHNGLGTFGSEYSFPYNKHVKAKHEIQRWFVEETRLGIPVDFTNEGIRGLCHDRAT 163
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITET 183
FP+ +++N+ L +IG+ + EA A LG + +SP +++ +DPRWGR E
Sbjct: 164 FFPSQSGQGSTWNKELIARIGEVEAKEAIA---LGYTNI--YSPILDICQDPRWGRSVEC 218
Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYH 243
GEDP++VG+ ++ LQ ++ S KH+A Y + +
Sbjct: 219 YGEDPYLVGQLGKQMIQSLQK--------------HRLVSTVKHFAVYSIPVGGRDGKTR 264
Query: 244 FDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLH 303
D V+ ++M +L PF E A VM SYN +G P + L + +R E+
Sbjct: 265 TDPHVSPREMRTLYLEPFRRAFCEAGALGVMSSYNDYDGEPITSSHHFLTEILRQEYGFK 324
Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG---------NA 354
GY+V+D ++++ + H +++ E VAQ + AGL++ T+FT A
Sbjct: 325 GYVVSDSEAVEFITTKHHVVSNEVE-GVAQAVNAGLNIR-----THFTKPEDFVLPLRQA 378
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAARE 411
+++GKV I+ + + + LG FD Y KQ+ + E+ ++A EAAR+
Sbjct: 379 IKEGKVSPETINSRVADILRIKFWLGLFDNP--YRGDEKQEEKIVHCKEHQQVALEAARQ 436
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYA 468
+VLLKN+ LPL VK+VAV+GP+AN +I Y A P + + I
Sbjct: 437 SLVLLKNENQLLPLKKT-VKSVAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPET 495
Query: 469 NVTYKTGCD--------------DVACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAE 513
V Y+ GC+ + + + A AA+ A+ + +L G +L+V E
Sbjct: 496 EVVYRKGCEIIDSHFPESEILPFEKTTEEQQMLDEAVAAARNAEVVVLVLGGSELTVR-E 554
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
R L LPG+Q +L+ + K P +LV++ I +A N I AIL A +PGE
Sbjct: 555 DRSRTSLDLPGHQQELMQAIHATGK-PTVLVLLDGRAATINYA--NQYIPAILHAWFPGE 611
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
G A+A+ +FG +NPGGRL +T+ V +P + P +P Y
Sbjct: 612 FAGTAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDEPCETAVYG-----A 663
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
LYPFGYGLSYT+F Y L T Q ++ + C
Sbjct: 664 LYPFGYGLSYTKFSYKNLQITPEEQGPQGEI-----------------------TVSC-- 698
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
+ N+G G +VV +Y + TY+K + GF+R+ + G K++ F+
Sbjct: 699 ------EVTNIGDRTGDEVVQLYLRDEVSSVTTYMKVLRGFERITLNPGETKKVTFILTP 752
Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
+ L + D ++ G + +G + FN
Sbjct: 753 -QDLGLWDKNNKFVVEPGMFKVMIGAASTDIRLEGKFN 789
>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
CL02T12C05]
Length = 855
Score = 267 bits (683), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 167/435 (38%), Positives = 243/435 (55%), Gaps = 40/435 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q + LF D + P R+ DL++R+T++EKV L + A +PRL + +Y +EALHGV
Sbjct: 23 QNKTELFRDMTAPQHERILDLLNRLTVEEKVSLLVNDAREIPRLNIDKYNHGNEALHGV- 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL---------- 157
V PG T FP I A++N +L ++ A+S EAR +
Sbjct: 82 -VRPGEF--------TVFPQAIGLAATWNPNLIFRVSTAISDEARGRWKELDYGKKQIAG 132
Query: 158 GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
G LT+WSP +N+ARDPRWGR ET GEDPF+ GR +V+GLQ N R
Sbjct: 133 GSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGRIGCEFVKGLQGD---------NPR 183
Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
LK S KH+AA + ++ +R +AR++E+D+ E +L FE C+ +G A S+M +Y
Sbjct: 184 YLKTVSTPKHFAANNEEH----NRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAY 239
Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
N VN +P + L+ + +RG+W+ +GYIV+DC + + MV HK++ + E A LKA
Sbjct: 240 NAVNDVPCTVNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNL-EAAATLALKA 298
Query: 338 GLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQ 394
GLDL+CG + YT A + V E +ID + ++ M LG FD Q Y +
Sbjct: 299 GLDLECGDRVYTAPLLKAYNEYMVSEAEIDSAAYHILRGRMLLGLFDDPSQNPYNKIEPS 358
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
I E+ ELA E AR+ +VLLKN +N LPLN K++++AVVG +A G+Y+G P
Sbjct: 359 VIGCKEHQELALETARQSMVLLKNQKNFLPLNRKKIRSIAVVG--ISAAHCEFGDYSGNP 416
Query: 455 CRY-MSPIAGFSGYA 468
+S + G YA
Sbjct: 417 KNTPVSVLDGIKKYA 431
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 94/299 (31%), Positives = 138/299 (46%), Gaps = 49/299 (16%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A + AK D T+ + G++ S+E E DR L LP Q + I ++ +V V++++
Sbjct: 596 AGKVAKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKVNPNTVVVLV---A 652
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
G +A + N+ AIL A YPGE+GG A+A+V+FG +NPGGRLP+T+YN L
Sbjct: 653 GSSMAINWMDENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS-------LD 705
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+P D RTY+++ G LY FGYGLSYT FKY S
Sbjct: 706 ELP--AFDDYSVKNRTYQYFEGKPLYEFGYGLSYTNFKYKKKSI---------------- 747
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
++ +D + + NVG DG +V VY + P +K
Sbjct: 748 -------------------MQSNDTVDITFNLSNVGKYDGDEVAQVYVRYPETGTYMPLK 788
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVGNGGVSFPIH 787
Q+ GF RV ++ G++ I K L D + P GE+ VG + I
Sbjct: 789 QLKGFSRVHLKKGKSADITISIPK-KELRYWDEKTRQFVTPTGEYVFQVGGSSENISIE 846
>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
CL03T12C04]
Length = 782
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 218/727 (29%), Positives = 340/727 (46%), Gaps = 124/727 (17%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT FPT I A+++ L K++GQ ++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R+ + G + P +++ RDPRW R+ ET GEDP + G + V GL G
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL----GGG 227
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
N S+ + KH+ AY V Y A V +D+ + FL PF + G
Sbjct: 228 NL----SQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG- 279
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++GIP ++ LL Q +R EW G++V+D SI+ + ++H F+A +KE+
Sbjct: 280 ALSVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVAPTKEN 338
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A Q++ AG+D+D G YTN +AVQ G++ +T ID ++ + + +G F+
Sbjct: 339 AAIQSVTAGVDVDLGGDAYTNLC-HAVQSGQMDKTVIDTAVCRVLRMKFEMGLFEHPYVD 397
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+ + + E+IELA + A+ I LLKN+ + LPL S + VAV+GP+A+ M+G
Sbjct: 398 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKTINKVAVIGPNADNRYNMLG 456
Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
+Y GI + +SP V Y GC + + N I A EAA+
Sbjct: 457 DYTAPQEDSNVKTVLDGILTK-LSPF-------RVEYVRGCA-IRDTTVNEIEQAIEAAR 507
Query: 496 TAD----------------------ATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQ 532
++ A + G +E E DR L L G Q +L+
Sbjct: 508 RSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLES 567
Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
+ + K P+I+V + ++ +A + A+L A YPG+EGG AIADV+FG +NP GR
Sbjct: 568 LQKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGR 624
Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
LPI+ V +P+ P + Y + LY FGYG+SYT F+Y+ L
Sbjct: 625 LPISVPRS--VGQIPVYYNKKAPRN------HDYVEVSSSPLYSFGYGMSYTTFEYSDLQ 676
Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
V+ RC FE +N G DG +V
Sbjct: 677 -------------------------------VVQKSARC---FEVSFKVKNTGKYDGEEV 702
Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
+Y + +KQ+ F+R ++ G K++ FV + +V+Y ++ +G
Sbjct: 703 SQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGT 761
Query: 773 HTIFVGN 779
+ +G+
Sbjct: 762 FQVMIGS 768
>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 858
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 160/427 (37%), Positives = 247/427 (57%), Gaps = 41/427 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + P R+ DL+SR+T++EK+ L + G+ RL +P+Y +EALHGV V PG
Sbjct: 28 LYKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG 85
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L K++ +S EARA +N G L
Sbjct: 86 RF--------TVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLL 137
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDP++ G +V+GLQ G+ +SR LK+
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GN------DSRYLKIV 188
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E +L FE CVKEG ++S+M +YN +N
Sbjct: 189 STPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALND 244
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK++ +KE A ++KAGLDL+
Sbjct: 245 VPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKAGLDLE 303
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y NA +Q V + DID + + M+LG FD Y + + I S
Sbjct: 304 CGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGENNPYTKISPKVIGSK 363
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ ++A +AARE IVLLKN LPL++ K+K++AVVG NA + G+Y+G+P ++
Sbjct: 364 EHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INAGRSEFGDYSGLPV--IA 419
Query: 460 PIAGFSG 466
P++ G
Sbjct: 420 PVSILQG 426
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 91/293 (31%), Positives = 143/293 (48%), Gaps = 54/293 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A + + + + G++ ++E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 596 AGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKV--NPNIVVVLVAG 653
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
++ + +I AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y L
Sbjct: 654 S-SLSINWMDEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 705
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTY+++ G LYPFGYGLSYT FKY+ L T+ Q
Sbjct: 706 ELP--PFDDYDITKGRTYQYFKGNVLYPFGYGLSYTSFKYSDLQVTEGNQ---------- 753
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAAT 726
E V F +NVG G +V +Y K P
Sbjct: 754 ---------------------------EVNVSFCLKNVGKYAGDEVAQIYVKLPERDKIM 786
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
IK++ GF+R+ ++ G ++++ L D + P+G++TI VG
Sbjct: 787 PIKELKGFERISLKRGGSRKVTIRLKK-DLLRYWDEEKGCFVHPSGDYTIMVG 838
>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
eggerthii DSM 20697]
Length = 858
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 160/427 (37%), Positives = 247/427 (57%), Gaps = 41/427 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + P R+ DL+SR+T++EK+ L + G+ RL +P+Y +EALHGV V PG
Sbjct: 28 LYKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG 85
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L K++ +S EARA +N G L
Sbjct: 86 RF--------TVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLL 137
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDP++ G +V+GLQ G+ +SR LK+
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GN------DSRYLKIV 188
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E +L FE CVKEG ++S+M +YN +N
Sbjct: 189 STPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALND 244
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK++ +KE A ++KAGLDL+
Sbjct: 245 VPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKAGLDLE 303
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y NA +Q V + DID + + M+LG FD Y + + I S
Sbjct: 304 CGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGENNPYTKISPKVIGSK 363
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ ++A +AARE IVLLKN LPL++ K+K++AVVG NA + G+Y+G+P ++
Sbjct: 364 EHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INAGRSEFGDYSGLPV--IA 419
Query: 460 PIAGFSG 466
P++ G
Sbjct: 420 PVSILQG 426
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 90/293 (30%), Positives = 142/293 (48%), Gaps = 54/293 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A + + + + G++ ++E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 596 AGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKV--NPNIVVVLVAG 653
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
++ + +I AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y L
Sbjct: 654 S-SLSINWMDEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 705
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTY+++ G LYPFGYGLSYT FKY+ L T Q
Sbjct: 706 ELP--PFDDYDITKGRTYQYFKGNVLYPFGYGLSYTSFKYSDLQVTDGNQ---------- 753
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAAT 726
E V F +NVG G +V +Y K P
Sbjct: 754 ---------------------------EVNVSFCLKNVGKYAGDEVAQIYVKLPERDKIM 786
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
IK++ GF+R+ ++ G ++++ L D + P+G++TI +G
Sbjct: 787 PIKELKGFERISLKRGESRKVTIRLKK-DLLRYWDEEKECFVHPSGDYTIMIG 838
>gi|404487205|ref|ZP_11022392.1| hypothetical protein HMPREF9448_02853 [Barnesiella intestinihominis
YIT 11860]
gi|404335701|gb|EJZ62170.1| hypothetical protein HMPREF9448_02853 [Barnesiella intestinihominis
YIT 11860]
Length = 860
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 223/802 (27%), Positives = 348/802 (43%), Gaps = 144/802 (17%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD------------------------ 83
+ S + + +LP RV+DL++RMT+DEK+ Q+
Sbjct: 23 KAQSLPYKNKNLPIEERVEDLLNRMTVDEKIAQIRHIHSSKIFNGQELDMKKLTDWAGNT 82
Query: 84 ---FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVI 119
F G P RLG+P + +E+LHG V
Sbjct: 83 SWGFVEGFPLTGDNCAKSMYLIQKYMVEKTRLGIPIFTV-AESLHGA-----------VH 130
Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
GAT +P I ++FN L +K Q +S + +M SP I+V RD RWGR
Sbjct: 131 DGATIYPQNIALGSTFNPELARKKTQMISDDLHSM-----GFRQVLSPCIDVVRDLRWGR 185
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGH-ENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
+ E+ GEDP++ G + G+++V G+ EN +S KHY + + G
Sbjct: 186 VEESYGEDPYLCGLF------GIEEVSGYLENG---------ISPMLKHYGPHG-NPLSG 229
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
++ + + +D+ E +L+PFEM VK +VM +YN N IP+ A LL +R
Sbjct: 230 LNLASVECGL--RDLHEIYLKPFEMVVKNTGILAVMSTYNSWNHIPNSASHYLLTDILRD 287
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
EW GY+ +D +I+++ H F A + +A Q + AGLD + F +++G
Sbjct: 288 EWGFKGYVYSDWGAIEMLKTLH-FTARNSSEAAIQAISAGLDAEASSKCYPFLKGLIEKG 346
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKN 418
+ E +D +++ + +G F+ P + + S E+++LA A E VLLKN
Sbjct: 347 QFDEKILDTAVRRVLFAKFAMGLFE-DPYGKTFKNRKRHSPESVKLAKTIADESTVLLKN 405
Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--MSPIAGFSGYAN----VTY 472
+ LPL++ +K++A++GP NA G+Y ++P+ G N + Y
Sbjct: 406 ENQLLPLDAKSLKSIAIIGP--NADQVQFGDYTWSRNNKDGVTPLQGIKNRVNKNTAIHY 463
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAG---------LDLSVEAESLDREDLWLP 523
GC + + I A EAAK ++ +I G S E D DL L
Sbjct: 464 AKGCS-LTSLDTSGIAEAVEAAKNSEVAVIFGGSASAALARDYKSSTCGEGFDLNDLNLT 522
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q+QLI +V PVILV+++ I + + N + AIL Y GE+ G +IAD++
Sbjct: 523 GAQSQLIREVYRTGT-PVILVLVTGKPFVIEWEKNN--LPAILVQWYAGEQAGNSIADIL 579
Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
FG+ P GRL ++ Y LP + S PGR Y F LY FG
Sbjct: 580 FGEVVPSGRLTFSFPRSTGHLPVYYNYLPSDRGFYKNPGSYDSPGRDYVFSAPSALYSFG 639
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
YGLSYT F Y LS K + +D
Sbjct: 640 YGLSYTSFVYKNLSTDK-------------------------------DKYELNDTIHAT 668
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
V+ +N G G +VV +Y + A T +KQ+ F+++ + G + ++ L
Sbjct: 669 VEVKNTGKYTGKEVVQLYVRDKASTYVTPVKQLRDFKKIELAPGETRTVQLQV-PISDLY 727
Query: 759 IVDYAANTLLPAGEHTIFVGNG 780
+VD + AGE + VG
Sbjct: 728 LVDEKNQRFVEAGEFILEVGQA 749
>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
12058]
Length = 1050
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 159/427 (37%), Positives = 248/427 (58%), Gaps = 41/427 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + + P R+ DL+SR+T++EK+ L + G+ RL +P+Y +EALHGV V PG
Sbjct: 28 LYKNENAPTHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG 85
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L +++ +S EARA +N G L
Sbjct: 86 RF--------TVFPQAIGLAATWNPVLQEQVATVISDEARARWNELDQGREQKSQFSDLL 137
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDP++ G +V+GLQ G+ +SR LK+
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GN------DSRYLKIV 188
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E +L FE CVK+G ++S+M +YN +N
Sbjct: 189 STPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALND 244
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK++ +KE A ++KAGLDL+
Sbjct: 245 VPCTLNAWLLTKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIKAGLDLE 303
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Y +A +Q V + DID + + M+LG FD + Y + I S
Sbjct: 304 CGDDVYDEPLLSAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEKNPYTKISPAVIGSK 363
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ E+A AARE IVLLKN + LPLN+ K+K++AVVG NA + G+Y+G+P ++
Sbjct: 364 EHQEVALNAARECIVLLKNQKKMLPLNAKKIKSIAVVG--INAGSSEFGDYSGLPV--IA 419
Query: 460 PIAGFSG 466
P++ G
Sbjct: 420 PVSVLQG 426
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/293 (32%), Positives = 144/293 (49%), Gaps = 54/293 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 596 AGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIVVVLVAG 653
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + ++ AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y L
Sbjct: 654 S-SLAVNWMDEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 705
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTYK++ G LYPFGYGLSYT FKY+ +QV
Sbjct: 706 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYTSFKYS------NLQV--------- 748
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGSDVVIVYSKPPAEIAAT 726
D E V FQ N G G +V VY K P
Sbjct: 749 ----------------------ADGEEEVSVSFQLKNTGRYAGDEVAQVYVKLPEREEVM 786
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
+K++ GF+RV +++G +K++ L D A + P+G + I VG
Sbjct: 787 PVKELKGFERVSLKSGESKKVTIKLRK-DLLRYWDEAKGKFIYPSGNYNIMVG 838
>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
17393]
gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 865
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 160/450 (35%), Positives = 245/450 (54%), Gaps = 38/450 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + +L S R DL+ RMTL+EK+ Q+ + + + RLG+P Y+WW+EALHGV+ G
Sbjct: 25 YRNPNLSPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYDWWNEALHGVARAGK-- 82
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
AT FP I A+F+ + VS EARA Y+ G GLT+W
Sbjct: 83 --------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERGGYKGLTFW 134
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ + V+GLQ + + K +C
Sbjct: 135 TPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG--------NGAGKYDKAHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KHYA + W +R+ FD++ ++++D+ ET+L F+ V EG VMC+YNR G P
Sbjct: 187 KHYAVHSGPEW---NRHSFDSKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRFEGEP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
C++ +LL + +R +W +V+DC +I NH S E A A + +G DL+C
Sbjct: 244 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPSAEAASADAVVSGTDLEC 303
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
G Y++ AV++G + E I++S+ L +LG FD + + + S E+
Sbjct: 304 GGSYSSLN-EAVKKGLITEDKINESVFRLLRARFQLGMFDDDTLVSWSEIPYSVVESKEH 362
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
++ A E AR+ +VLL N N+LPL S ++ VAV+GP+AN +V + NY G P + ++ +
Sbjct: 363 VDKALEMARKSMVLLTNKNNSLPL-SKSIRKVAVLGPNANDSVMLWANYNGFPTKSVTIL 421
Query: 462 AGFSGY---ANVTYKTGCDDVACKSNNSIF 488
G V Y+ GCD V+ ++ S F
Sbjct: 422 EGIRSKLPEGAVYYEKGCDFVSTQTLFSDF 451
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 123/263 (46%), Gaps = 54/263 (20%)
Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
I + GL ++E E + DR ++ LP Q +++ + + K PVI V+ S G
Sbjct: 605 IFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQEEMLKALKKTGK-PVIFVVCS--G 661
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
+A N+ A+L A YPG++GG A+ADV+FG +NP GRLP+T+Y D +
Sbjct: 662 STLALPWEAENLDAMLEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASD-------SD 714
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
+P + RTY+++ G L+PFGYGLSYT F Y K
Sbjct: 715 LP--DFEDYNMSNRTYRYFKGKPLFPFGYGLSYTTFDYGKAKVDK--------------- 757
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
++ D + +N G DG +VV VY + PA+ IK
Sbjct: 758 ----------------KSIKTGDSMTLTIPLKNTGKMDGDEVVQVYLRNPADKEGP-IKM 800
Query: 731 VIGFQRVFVRAGRNKRIKFVFNA 753
+ F+RV ++AG+ + I+ A
Sbjct: 801 LRAFRRVSLKAGQAENIQIELPA 823
>gi|371776218|ref|ZP_09482540.1| beta-glucosidase [Anaerophaga sp. HS1]
Length = 774
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 220/783 (28%), Positives = 364/783 (46%), Gaps = 111/783 (14%)
Query: 64 RVKDLVSRMTLDEKVQQL----GDFAHGVPRLGLPQYEWWSEALHGVS-NVGPGTHFD-- 116
+V +++ MTLDEK+ QL G+FA E+ + + G + NV H
Sbjct: 43 KVDSVLNLMTLDEKIGQLNQYSGNFAVTGEVTDTKSGEYLKKGMIGSTFNVFGADHVRML 102
Query: 117 ------------------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
DVI G T+FP + S++ L +K + + EA A
Sbjct: 103 QEQNLKYSRLKIPMLFAADVIHGLETTFPIPLAEACSWDLQLMEKSARIAAEEATA---- 158
Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
+G+ + ++P ++++RDPRWGRI E GEDPF+ A VRG Q ++ +++ S
Sbjct: 159 --SGVAWNFAPMVDISRDPRWGRIMEGAGEDPFLGSLIARARVRGFQGIDSYKDF----S 212
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
+P + +C KH+ Y G D + D ++E+ + ET+L PF+ V EG +S M +
Sbjct: 213 KPNTMMACAKHFVGYGAAQ-AGRDYHTVD--ISERTLFETYLPPFKAAVDEG-VASFMTA 268
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
+N +NG+P + + +R +W+ +G +V D +IQ MV H F D K+ A +
Sbjct: 269 FNELNGVPCTGNKYIFQDILRHQWNFNGMVVTDYTAIQEMV-AHGFAKDLKQ-ASKLAID 326
Query: 337 AGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY--VSLGK 393
AG+D+D + + + V++G+V E ID ++ + + LG FD +Y K
Sbjct: 327 AGIDMDMISEGFVTYLKELVEEGQVSEKQIDVAVARILEMKFLLGLFDDPYKYCDAEREK 386
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-- 451
+ + + ++++ A E A+ IVLLKN+ N LPL K VA++GP ++ G +A
Sbjct: 387 EVLMNPQHLQAAREVAQRSIVLLKNENNVLPLRKDIPKRVALIGPFVKERESLNGEWAIK 446
Query: 452 -----------GIPCRYMSPIAGFSGYANVTYKTGCD----DVACKS--NNSIFA-ASEA 493
G+ +Y F+ YA T D V+ + + S FA A
Sbjct: 447 GDRSKSVTLWEGLQEKYADTPVRFN-YAKGTSLPLIDGATRHVSLEQGFDKSGFAEALRV 505
Query: 494 AKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDI 553
AKT+D ++ G E+ R D+ LPG Q +L+ ++ + K P++LV+ + +D+
Sbjct: 506 AKTSDLILVAMGEHYHWSGEAASRTDITLPGNQRELLKELKKTGK-PIVLVLFNGRPLDL 564
Query: 554 AFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL----- 608
++ N+ AI+ A YPG G A+ADV+ G +NP RL +T+ V +P+
Sbjct: 565 SWEA--ENVDAIVEAWYPGIMAGHAVADVLSGDYNPSARLVVTFPRN--VGQIPIFYNMK 620
Query: 609 -TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
T P Y N P L+PFG+GLSYT F+Y+
Sbjct: 621 NTGRPFDENHPADYKSSYIDSPNSP-LFPFGFGLSYTSFQYD------------------ 661
Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY 727
N T + K G L+ VD N G+ DG +VV +Y
Sbjct: 662 ---NATISSQKLTKGGSLI----------VSVDVTNTGNVDGEEVVQLYIHDKVGSVTRP 708
Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIH 787
+K++ GF+++F++ G K ++F N + L + + + + GE ++V H
Sbjct: 709 VKELKGFKKIFLKKGETKTVEFTINE-EMLKMYNINMDWVAEPGEFDVWVACNSADESNH 767
Query: 788 LNF 790
L F
Sbjct: 768 LEF 770
>gi|294647557|ref|ZP_06725134.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294807095|ref|ZP_06765914.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345508184|ref|ZP_08787819.1| periplasmic beta-glucosidase [Bacteroides sp. D1]
gi|292637099|gb|EFF55540.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294445794|gb|EFG14442.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345455214|gb|EEO50370.2| periplasmic beta-glucosidase [Bacteroides sp. D1]
Length = 783
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 210/717 (29%), Positives = 323/717 (45%), Gaps = 107/717 (14%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT FPT I A+++ L +++G+A+
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R + G + P +++ARDPRW R+ ET GEDP + G V GL
Sbjct: 179 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGL------- 226
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
DL S P + KH+ AY + F +++ E FL PF + G
Sbjct: 227 GGGDL-SHPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQAIDAG- 281
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++G+P A+ LL + +R EW G +V+D SI+ + +H F+A + E+
Sbjct: 282 ALSVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSH-FVAPTMEE 340
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A L AG+D+D G Y N NAV G++ +T +D S+ + + +G F+
Sbjct: 341 AAILALSAGVDVDLGGDAYMNLM-NAVNTGRISKTALDASVARVLRLKFEMGLFENPYVD 399
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
K+++ S+E++ LA A+ I LLKN+ + LPLN K + VA++GP+A+ M+G
Sbjct: 400 PEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLN--KNRKVALIGPNADNRYNMLG 457
Query: 449 NYAGIPCR-----YMSPIAGFSGYANVTYKTGC---DDVACKSNNSIFAASE-------- 492
+Y + I + V Y GC D V ++ AA
Sbjct: 458 DYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEVIIAVV 517
Query: 493 ---AAKTADATIILAGLDLSVE--------AESLDREDLWLPGYQTQLINQVAEVAKGPV 541
+A+ + G ++ E E DR L L G Q +L+ + K P+
Sbjct: 518 GGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELLKALKATGK-PL 576
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
I+V + +D +A N + A+L A YPG+EGG AIADV+FG FNP GRLP +
Sbjct: 577 IVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIAIADVLFGDFNPAGRLPFSVPRS- 633
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
V +PL P Y + LYPFGYGLSYT F Y+ L + + +
Sbjct: 634 -VGQIPLYYNKKAP------QSHDYVEMSASPLYPFGYGLSYTSFDYSDLHLSALMPRS- 685
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
FE +N G DG +V +Y +
Sbjct: 686 ---------------------------------FEISFKVRNTGKYDGEEVAQLYLRDEY 712
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+KQ+ F R +++ G + +KF+ + + ++VD ++ G I +G
Sbjct: 713 ASVVQPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKKIVEPGTFQIMIG 768
>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
Length = 864
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 166/453 (36%), Positives = 235/453 (51%), Gaps = 40/453 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ +S+L R +DL+ ++TL+EKV + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASF I AVS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ VN V+GLQ TD N + K+ +C
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKEG VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +++DC +I HK D+ E A A + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
CG Y +A ++G + E DID S+K L LG D ++ + +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ L+ + AR+ + LL N N LPL +T+AV+GP+AN +V GNY G P ++
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420
Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIFA 489
+ G + Y+ GC V S+F+
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVFS 453
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 144/326 (44%), Gaps = 62/326 (19%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
FSG A + + D+ K +I K AD I G+ S+E E +
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR D+ LP Q +LI + + K VI V S G IA +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A A+V+FG +NP GRLP+T+Y + LP + GRTY+++ G
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFGYGLSYT F Y+ + +TI+V + +K P
Sbjct: 737 LFPFGYGLSYTTFNYDNIKLDQTIKV--------------GETAKMVIP----------- 771
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
N G+ DG +VV VY K E A K + F+RV + AG+ ++
Sbjct: 772 -------VTNAGNRDGEEVVQVYLK-KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP 823
Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
K L D NT+ AG I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
Length = 864
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 171/466 (36%), Positives = 246/466 (52%), Gaps = 40/466 (8%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
++ ++ D+S + R +DLV ++TL+EKV + D + V RLG+ Y WW+EALHGV+
Sbjct: 19 LAQSIYKDNSYSPAERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVAR 78
Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-------- 160
G AT FP I ASF+ AVS EARA A
Sbjct: 79 SG----------WATVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAAYSAEGSYKRYQ 128
Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
GLT W+P +N+ RDPRWGR ET GEDP++ V+ V+GLQ D N + K
Sbjct: 129 GLTIWTPTVNIYRDPRWGRGIETYGEDPYLASVMGVSVVKGLQ-------CLDENEKYDK 181
Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
V +C KH+A + W +R+ F+A ++ +D+ ET+L PFE VKEG VMC+YNR
Sbjct: 182 VHACAKHFAVHSGPEW---NRHSFNAENISPRDLYETYLPPFEALVKEGKVKEVMCAYNR 238
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLKA 337
G P C +LLN +R EW G +VADC +I ++ HK AD+ + A L +
Sbjct: 239 FEGEPCCGSNRLLNHILRREWGYDGIVVADCSAISDFHNDKGHKTHADAASASSAAVL-S 297
Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQD 395
G DL+CG Y + T V++G + E DID+S+K L LG D Q + +
Sbjct: 298 GTDLECGSNYRSLT-EGVKKGFIDEADIDRSVKRLLQARFELGEMDEPDQVRWAQIPYSV 356
Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
+CSD++ L+ + AR+ + LL N N LPL T+AV+GP+AN +V GNY G+P
Sbjct: 357 VCSDKHDSLSLDMARKSMTLLLNKNNALPLERGGT-TIAVMGPNANDSVMQWGNYNGLPK 415
Query: 456 RYMSPIAGFSGYA----NVTYKTGCDDVACKSNNSIFAASEAAKTA 497
R ++ + G + Y+ GC V S+F ++ + A
Sbjct: 416 RTITILDGIRSAMGKDDKLIYEQGCSWVERTLIRSVFNQCKSKEGA 461
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 132/300 (44%), Gaps = 56/300 (18%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
D+ K I + K AD I G+ +E E + DR D+ LP Q
Sbjct: 583 DLGFKEEADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIELPAVQR 642
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
++I + + K ++ ++ G IA +AIL A YPG+ GG+A+A+V+FG +
Sbjct: 643 EMIKALHDAGKK---VIFVNCSGSPIAMEPETEYCQAILQAWYPGQSGGKAVAEVLFGDY 699
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
NP GRLP T+Y L +P + G TY+F+NG L+PFGYGLSYT FK
Sbjct: 700 NPAGRLPATFYRN-------LAQLP--DFEDYNMAGHTYRFFNGEPLFPFGYGLSYTTFK 750
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
Y + + Q D+ + V N GS
Sbjct: 751 YGKIQLKSSAQT--------------------------------DETVKITVPVTNTGSR 778
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
+G +VV VY K E +K + F+RV++ AG+ +++ K L D A NT+
Sbjct: 779 NGEEVVQVYLKKQGETDGP-VKTLRAFKRVYIPAGKTVKVELELTP-KQLEWWDSATNTM 836
>gi|94497563|ref|ZP_01304132.1| xylosidase/arabinosidase [Sphingomonas sp. SKA58]
gi|94422980|gb|EAT08012.1| xylosidase/arabinosidase [Sphingomonas sp. SKA58]
Length = 774
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 228/738 (30%), Positives = 348/738 (47%), Gaps = 111/738 (15%)
Query: 78 VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNE 137
V L +A RLG+P + E LHG + VG ATSFP I +S++
Sbjct: 108 VNALQKWAMTQTRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIALASSWDP 155
Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
L +++ ++ E R R SP +++ARDPRWGRI ET GEDP++VG V
Sbjct: 156 HLVQQVNSVIAREIRV-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVA 210
Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEET 256
V GLQ G + DL RP KV + KH + ++ V A ++E+++ E
Sbjct: 211 AVEGLQ---GEGRSHDL--RPGKVFATLKHLTGHGQPESGTNVG----PAPISERELREN 261
Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
F PFE VK ++VM SYN ++G+PS + LL+ +RGEW G +V+D + +
Sbjct: 262 FFPPFEQVVKRTGINAVMASYNEIDGVPSHMNRWLLDDVLRGEWGFRGAVVSDYSGVDQL 321
Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTV 375
++ H +A S ++A + L AG+D D + + T G+ V+ GKV E +DK+++ + +
Sbjct: 322 MNIH-HVAGSLDEAARRALDAGVDADLPEGLSYATLGDQVRAGKVSEAQVDKAVRRMLEL 380
Query: 376 LMRLGFFD----GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
R G F+ + Q V+L E LA AA+ I LLKND LPL KV+
Sbjct: 381 KFRAGLFEHPYADAAQAVAL----TNDAEARALARTAAQRSITLLKND-GMLPL---KVE 432
Query: 432 -TVAVVGPHANATVAMIGNYAGIPCRYMSPIAG------------FSGYANVT-----YK 473
++AV+GP +A VA +G Y G P +S + G F+ +T +
Sbjct: 433 GSIAVIGP--SAAVARLGGYYGQPPHVVSILDGIKARVGDRVRIVFAQGVKITQDDDWWA 490
Query: 474 TGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQ 526
D N + A A EAA+ D ++ G E DR L L G Q
Sbjct: 491 DKVDKADPAENRRLIAQAVEAARNVDRIVLTLGDTEQSSREGWAANHLGDRPSLDLVGEQ 550
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
+L + + + K P+ +V+++ G + + + A+L Y GE+GG A+AD++FG
Sbjct: 551 QELFDALKTLGK-PITVVLIN--GRPASTVKVSEEANALLEGWYLGEQGGHAVADILFGD 607
Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
NPGG+LP+T V LP ++P GR Y F LYPFG+GLSYT F
Sbjct: 608 VNPGGKLPVTVPRS--VGQLP-AFYNVKP-----SAGRGYLFDTNAPLYPFGFGLSYTNF 659
Query: 647 KYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGS 706
LS + Q ++ PG + VD +N G+
Sbjct: 660 T---LSPPRLAQSSIG-------------------PGGTTS---------VTVDVRNDGA 688
Query: 707 TDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
DG +VV +Y IK++ GF+RV ++ G + ++F +SL + + +
Sbjct: 689 RDGDEVVQLYIHDKVSSVTRPIKELKGFERVSLKPGEVRTVRFTIT-PESLQMWNDKMHR 747
Query: 767 LLPAGEHTIFVGNGGVSF 784
++ GE I GN V+
Sbjct: 748 VVEPGEFEIMTGNSSVAL 765
>gi|375254464|ref|YP_005013631.1| glycosyl hydrolase family 3, C-terminal domain-containing protein
[Tannerella forsythia ATCC 43037]
gi|363407375|gb|AEW21061.1| glycosyl hydrolase family 3, C-terminal domain protein [Tannerella
forsythia ATCC 43037]
Length = 775
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 222/754 (29%), Positives = 356/754 (47%), Gaps = 122/754 (16%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
E + L +A RLG+P + + E +HG +G T FPT I +++
Sbjct: 107 EALNALQKYAMENTRLGIPIF-FAEECMHGHMAIG-----------TTVFPTSIGQASTW 154
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
N +L +K+G A++ E R+ + + P +++AR+PRW R+ ET GEDP + G
Sbjct: 155 NRTLIEKMGAAIAHETRS-----QGAHIAYGPVLDLAREPRWSRVEETFGEDPVLSGILG 209
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEE 255
+VRGLQ G + A ++ S KH AAY + R A++ +++
Sbjct: 210 SAFVRGLQ---GKDFADGRHTY-----STLKHLAAYGIPVGGHNGR---QAQIGARELIA 258
Query: 256 TFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQV 315
L PFEM VK G A SVM SYN V+G+P ++ +L + +RGEWD +G++V+D SI+
Sbjct: 259 EHLLPFEMAVKAG-AQSVMTSYNAVDGVPCTSNTYILKKILRGEWDFNGFVVSDLGSIEG 317
Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYT 374
+ H+ D K A A L AG+++D G YT A + ++ID ++ +
Sbjct: 318 IATTHRVAPDIKH-AAAMALNAGVEMDLGGVAYTRNMEQAHTDSLISMSEIDDAVSRILR 376
Query: 375 VLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
+ +G F+ S + I S E+ LA + A E IVLLKN+ N LPL S + ++A
Sbjct: 377 LKFEMGLFESPYVQPSRTTEIIRSKEHNRLARKVAEESIVLLKNNANLLPL-SKNIGSIA 435
Query: 435 VVGPHANATVAMIGNY-AGIPCRYMSPIAGFSGYAN-------VTYKTGCDDVACKSNNS 486
V+GP+A+ +G+Y A P ++ I G N + Y GC V + ++
Sbjct: 436 VIGPNADNLYNQLGDYTAPQPEEHIVTI--LEGIRNAVSPTTVIRYVKGC-AVRDTTQSN 492
Query: 487 IFAASEAAKTADATIILAG-------------------------LDLSVEA-ESLDREDL 520
I A AA ++A +++ G L +E+ E DR+ L
Sbjct: 493 IDEAVRAANASNAVVLVVGGSSARDFHTKYIETGAATVSSRENELIPDMESGEGYDRKSL 552
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L G+Q +LI +A K P+I+V + +++ A+ + A+L A YPGEEGG A+A
Sbjct: 553 TLLGHQEKLIESIAATGK-PLIMVYIQGRPLNMNLADKKAS--ALLTAWYPGEEGGNAVA 609
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
+V+FG NP GRLPI+ +P ++ L SLG + + P LY FGYG
Sbjct: 610 NVIFGDVNPSGRLPIS---------VPRSTGQLPVYYSLGKSNDYVEGTSTP-LYAFGYG 659
Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
LSYT F+Y L+ ++ N T + T
Sbjct: 660 LSYTAFEYGNLTISR------------EGGNITVSCTVT--------------------- 686
Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI--GFQRVFVRAGRNKRIKFVFNACKSLN 758
N G+TDG +VV +Y + +A+ + V+ F ++ ++ G + R+ FV + L
Sbjct: 687 --NTGNTDGDEVVQLYLRD--HVASVSVPPVLLKDFAKISLKKGESARVNFVLTP-EQLA 741
Query: 759 IVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFNY 792
+ ++ GE T+ +G + +F Y
Sbjct: 742 FFNTDLKRVVEPGEFTVMIGAASNDIRLKESFVY 775
>gi|262405113|ref|ZP_06081663.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
gi|262355988|gb|EEZ05078.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
Length = 769
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 210/717 (29%), Positives = 323/717 (45%), Gaps = 107/717 (14%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT FPT I A+++ L +++G+A+
Sbjct: 117 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 164
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R + G + P +++ARDPRW R+ ET GEDP + G V GL
Sbjct: 165 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGL------- 212
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
DL S P + KH+ AY + F +++ E FL PF + G
Sbjct: 213 GGGDL-SHPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQAIDAG- 267
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++G+P A+ LL + +R EW G +V+D SI+ + +H F+A + E+
Sbjct: 268 ALSVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSH-FVAPTMEE 326
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A L AG+D+D G Y N NAV G++ +T +D S+ + + +G F+
Sbjct: 327 AAILALSAGVDVDLGGDAYMNLM-NAVNTGRISKTALDASVARVLRLKFEMGLFENPYVD 385
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
K+++ S+E++ LA A+ I LLKN+ + LPLN K + VA++GP+A+ M+G
Sbjct: 386 PEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLN--KNRKVALIGPNADNRYNMLG 443
Query: 449 NYAGIPCR-----YMSPIAGFSGYANVTYKTGC---DDVACKSNNSIFAASE-------- 492
+Y + I + V Y GC D V ++ AA
Sbjct: 444 DYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEVIIAVV 503
Query: 493 ---AAKTADATIILAGLDLSVE--------AESLDREDLWLPGYQTQLINQVAEVAKGPV 541
+A+ + G ++ E E DR L L G Q +L+ + K P+
Sbjct: 504 GGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELLKALKATGK-PL 562
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
I+V + +D +A N + A+L A YPG+EGG AIADV+FG FNP GRLP +
Sbjct: 563 IVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIAIADVLFGDFNPAGRLPFSVPRS- 619
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
V +PL P Y + LYPFGYGLSYT F Y+ L + + +
Sbjct: 620 -VGQIPLYYNKKAP------QSHDYVEMSASPLYPFGYGLSYTSFDYSDLHLSALMPRS- 671
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
FE +N G DG +V +Y +
Sbjct: 672 ---------------------------------FEISFKVRNTGKYDGEEVAQLYLRDEY 698
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+KQ+ F R +++ G + +KF+ + + ++VD ++ G I +G
Sbjct: 699 ASVVQPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKKIVEPGTFQIMIG 754
>gi|153809292|ref|ZP_01961960.1| hypothetical protein BACCAC_03604 [Bacteroides caccae ATCC 43185]
gi|149128062|gb|EDM19283.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
caccae ATCC 43185]
Length = 946
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 235/819 (28%), Positives = 370/819 (45%), Gaps = 146/819 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS------ 100
+ D + P R++DL+S+MTL+EK Q+ +G R+ LP EW W
Sbjct: 53 YEDPTAPIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTSEWKNQLWKDGIGAI 111
Query: 101 -EALHGVSNVG-PGTHFDDVIPG------------------------------------- 121
E L+G G P + + V P
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171
Query: 122 -ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGR 179
AT+FPT + ++N L ++G EAR + G T ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRQLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
E GE P++V + VRG+Q H + +V++ KH+ AY +
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKHFIAYSNNKGARE 272
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
D +++ +++E PF+ ++E VM SYN +G P + L +RGE
Sbjct: 273 GMARVDPQMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAV 355
GY+V+D D+++ + H D KE AV Q+++AGL++ C Y V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGI 413
++G + E I+ ++ + V +G FD +P L D + EN E+A +A+RE I
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLKGADEEVEKKENEEVALQASRESI 450
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYAN 469
VLLKN++N LPL+ +K++ +AV GP+A+ + +Y + S + G A+
Sbjct: 451 VLLKNEKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKAD 510
Query: 470 VTYKTGCDDVAC--------------KSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
V Y GCD V + I A AK AD I++ G E+
Sbjct: 511 VLYTKGCDLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENK 570
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
R L LPG Q L+ V K PV+LV+++ + I +A+ + AIL A YPG +G
Sbjct: 571 SRSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWAD--KFVPAILEAWYPGSKG 627
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFYN 630
G A+AD++FG +NPGG+L +T+ V +P + P +P +D PG N
Sbjct: 628 GIAVADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRAN 684
Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
G LYPFGYGLSYT F+Y+ L + P ++ + +
Sbjct: 685 G-ALYPFGYGLSYTTFEYSDLKIS---------------------------PAIITPNQK 716
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
Y KV N G G +V+ +Y + TY K + GF+RV ++ G K I F
Sbjct: 717 A--YVTCKV--TNTGKRSGDEVIQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFP 772
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
+ K+L +++ + ++ G+ T+ + G S I LN
Sbjct: 773 IDR-KALELLNADMHWVVEPGDFTLML--GASSTDIRLN 808
>gi|255690486|ref|ZP_05414161.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260623937|gb|EEX46808.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 1365
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 227/797 (28%), Positives = 352/797 (44%), Gaps = 146/797 (18%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGD---------------------------FAH 86
+ + LP RVKDL+ RMT +EK+ Q+ F
Sbjct: 536 YQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERKLEEKAQGMSWGFVE 595
Query: 87 GVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
G P RLG+P + +E+LHGV V GAT F
Sbjct: 596 GFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGV-----------VHEGATVF 643
Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPG 185
P I ++F+ L + ++ E A+ SP I+V RD RWGR+ E+ G
Sbjct: 644 PQNIALGSTFDTDLAYRKTSMIADELHAV-----GMRQVLSPCIDVVRDLRWGRVEESFG 698
Query: 186 EDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
EDP++ GR+ + V+G D +S KHY + + G++ +
Sbjct: 699 EDPYLCGRFGIAEVKGYMDN--------------GISPMLKHYGPHG-NPLSGLNLASVE 743
Query: 246 ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY 305
+ +D+ E +L+PFEM +K+ +VM +YN N IP+ A LL +R EW GY
Sbjct: 744 TSI--RDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLLTDVLRKEWGFKGY 801
Query: 306 IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDI 365
+ +D +I+ M+ N F A + E+A Q L AGLD++ +++G++ +
Sbjct: 802 VYSDWGAIE-MLKNFHFTARNSEEAALQALTAGLDVEASSDCYPAIPGLIERGELNREIV 860
Query: 366 DKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
D++++ + R+G FD P K I S + I L+ + A E VLLKN++ LPL
Sbjct: 861 DEAVRRVLYAKFRIGLFD-DPYGEKFAKGAIHSGKAIALSKKIADESTVLLKNERQLLPL 919
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGI-PCRY-MSPIAGFSGYA----NVTYKTGCDDV 479
+ K+K++AV+GP NA G+Y R+ ++P+ G +A V Y GC V
Sbjct: 920 SIGKLKSIAVIGP--NADQIQFGDYTWTRDNRFGVTPLQGIRKWAGTNVKVNYAKGCSLV 977
Query: 480 ACKSNNSIFAASEAAKTADATIILAG---------LDLSVEAESLDREDLWLPGYQTQLI 530
+ + I A EAA+ +D ++ G S E D DL L G Q LI
Sbjct: 978 SM-DESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGFDLNDLTLTGAQPALI 1036
Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
V K PVILV+++ G A NI AIL Y GE+ G +IAD++FGK +P
Sbjct: 1037 KAVQATGK-PVILVLVT--GKPFAIPWEKKNIPAILVQWYAGEQSGNSIADILFGKVSPS 1093
Query: 591 GRLPITWYNGDYVQMLPLTSMPLR-------PVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
GRL ++ + LP+ LR S PGR Y F L+ FG+GL+Y
Sbjct: 1094 GRLTFSF--PESTGHLPVYYNHLRSDRGFYKSPGSYDSPGRDYVFSAPVPLWSFGHGLTY 1151
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T F+Y+ L +T L+ND ++ +N
Sbjct: 1152 TTFEYSNL--------------------------QTDRASYLLNDT-----VHVRIGLKN 1180
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G +G +VV +Y A ++Q+ F++V ++AG + ++ L I++
Sbjct: 1181 TGKCEGKEVVQLYVSDVCSSVAMPVRQLRDFRKVALQAGETQIVRLSI-PVSELTILNEK 1239
Query: 764 ANTLLPAGEHTIFVGNG 780
++ GE I VG+
Sbjct: 1240 NEAIVEPGEFEIQVGSA 1256
>gi|399025438|ref|ZP_10727439.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
gi|398078072|gb|EJL69004.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
Length = 740
Score = 266 bits (680), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 221/763 (28%), Positives = 357/763 (46%), Gaps = 111/763 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFD------- 116
+V +L+S+MTL+EKV QL ++ G PQ + L + + G+ +
Sbjct: 26 KVSELLSKMTLEEKVGQLVQYS-GFEYATGPQNSNSATVLEEIKSGKVGSMLNVAGVEET 84
Query: 117 --------------------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
DVI G T+FP + AS++ L +K + +TEA A Y
Sbjct: 85 RSFQKLALQSRLKIPLLFGQDVIHGYRTTFPVNLGQAASWDLGLIEKSERIAATEASA-Y 143
Query: 156 NLGRAGLTYWS--PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
+ +W+ P +++ARDPRWGR+ E GED ++ + + ++G Q +G N
Sbjct: 144 GI------HWTFAPMVDIARDPRWGRVMEGSGEDTYLGTQIGLARIKGFQG-KGLGNID- 195
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
+ +C KH+AAY G D D + + + ET+L PF+ + G ++
Sbjct: 196 ------AIMACAKHFAAYGA-AVGGRDYNSVDMSLRQ--LNETYLPPFKAAAEAG-VATF 245
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
M S+N +NG+P+ A+ +L ++G+W+ G++V+D SI M H + D K +A +
Sbjct: 246 MNSFNDINGVPATANTYILRDLLKGKWNYKGFVVSDWGSIGEMT-YHGYTKD-KTEAAQK 303
Query: 334 TLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG 392
+ AG D+D + Y V++GKV ID++ + + T +G FD ++
Sbjct: 304 AILAGSDMDMESRVYMAELPKLVKEGKVDPKFIDEAARRILTKKFEMGLFDDPYRFSDDK 363
Query: 393 KQ--DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
+Q + EN + E + +VLLKN +N LP+ S KTVA++GP TVA G +
Sbjct: 364 RQKDQTNNQENRKFGREFGSKSMVLLKNQKNILPI-SKSTKTVALIGPFGKETVANHGFW 422
Query: 451 A-GIPCRYMSPIAGFSGYAN-------VTYKTGCDDVACKSNNSIFA-ASEAAKTADATI 501
A G ++ F G N + Y GC+ + S+FA A E AK AD I
Sbjct: 423 AVGFKDDSQRIVSQFDGIRNQLDQNSALLYAKGCN--VDDQDRSMFAEAVETAKKADVVI 480
Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
+ G ++ E+ R ++ G Q L+ ++A+ K P++L+I + G + F N
Sbjct: 481 MTLGEGHAMSGEAKSRSNIHFSGVQEDLLKEIAKTGK-PIVLMINA--GRPLVFDWAADN 537
Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRP 615
I I++ + G E G +IADV+FGK NPGG+LP+T+ + +P+ T P +
Sbjct: 538 IPTIMYTWWLGTEAGNSIADVLFGKVNPGGKLPMTFPRTE--GQIPVYYNHYNTGRPAKT 595
Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
Y N P +PFGYGLSYTQFKY+ + +
Sbjct: 596 NTERNYVSAYIDLDNDPK-FPFGYGLSYTQFKYSDMILSSA------------------- 635
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
DL+ + KV+ N G+ DG +VV +Y + +K++ GFQ
Sbjct: 636 ------------DLKGNQTLNIKVNISNTGNYDGEEVVQLYIRDLFGKVVRPVKELKGFQ 683
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++F++ G K + F ++L D A N GE I VG
Sbjct: 684 KIFLKKGETKIVSFNLTP-ENLKFYDDALNYDWEGGEFDIMVG 725
>gi|189461690|ref|ZP_03010475.1| hypothetical protein BACCOP_02354 [Bacteroides coprocola DSM 17136]
gi|189431577|gb|EDV00562.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 499
Score = 266 bits (680), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 161/428 (37%), Positives = 241/428 (56%), Gaps = 43/428 (10%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ + P R+ DL+SR+T++EKV L + G+ RL +P+Y +EALHGV V PG
Sbjct: 27 LYKNEDAPLHERIMDLLSRLTVEEKVSLLRATSPGISRLDIPKYYHGNEALHGV--VRPG 84
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A++N L ++ +S EARA +N G L
Sbjct: 85 RF--------TVFPQAIGLAATWNPELQYQVATVISDEARARWNELDQGKLQKGQFSDLL 136
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDP++ G +VRGLQ + +R LKV
Sbjct: 137 TFWSPTVNMARDPRWGRTPETYGEDPYLSGTMGTAFVRGLQGDD---------ARYLKVV 187
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R+ + +++E+ + E +L FE C+K+G A+S+M +YN +N
Sbjct: 188 STPKHFAANNEEH----NRFECNPQISEKQLREYYLPAFEACIKDGKAASIMSAYNAINN 243
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + LL + +R +W GY+V+DC ++V+ HK++ +KE A ++KAGLDL+
Sbjct: 244 VPCTLNSWLLTKVLRHDWGFQGYVVSDCGGPSLLVNAHKYV-KTKEAAATLSIKAGLDLE 302
Query: 343 CGQ--YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICS 398
CG YY NA +Q V + DID + ++ MRLG FD Y + I S
Sbjct: 303 CGDDVYYEPLL-NAYKQYMVSDADIDSTAYHVLKARMRLGLFDNGKNNPYTKISPSIIGS 361
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
+ +A EAAR+ IVLLKN LPL++ K+K++AVVG NA G+Y+G P +
Sbjct: 362 KLHQRVALEAARQCIVLLKNHNWVLPLDTKKLKSIAVVG--INAGNCEFGDYSGSPV--I 417
Query: 459 SPIAGFSG 466
+PI+ G
Sbjct: 418 APISILQG 425
>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
5_1_36/D4]
Length = 864
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 166/452 (36%), Positives = 234/452 (51%), Gaps = 40/452 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ +S+L R +DL+ ++TL+EKV + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASF I AVS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ VN V+GLQ TD N + K+ +C
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKEG VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +++DC +I HK D+ E A A + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
CG Y +A ++G + E DID S+K L LG D ++ + +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ L+ + AR+ + LL N N LPL +T+AV+GP+AN +V GNY G P ++
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420
Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
+ G + Y+ GC V S+F
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVF 452
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 144/326 (44%), Gaps = 62/326 (19%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
FSG A + + D+ K +I K AD I G+ S+E E +
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR D+ LP Q +LI + + K VI V S G IA +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A A+V+FG +NP GRLP+T+Y + LP + GRTY+++ G
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFGYGLSYT F Y+ + +TI+V + +K P
Sbjct: 737 LFPFGYGLSYTTFNYDNIKLDQTIKV--------------GETAKMVIP----------- 771
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
N G+ DG +VV VY K E A K + F+RV + AG+ ++
Sbjct: 772 -------VTNAGNRDGEEVVQVYLK-KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP 823
Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
K L D NT+ AG I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
3_1_33FAA]
Length = 864
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 166/452 (36%), Positives = 234/452 (51%), Gaps = 40/452 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ +S+L R +DL+ ++TL+EKV + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASF I AVS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ VN V+GLQ TD N + K+ +C
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKEG VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +++DC +I HK D+ E A A + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
CG Y +A ++G + E DID S+K L LG D ++ + +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ L+ + AR+ + LL N N LPL +T+AV+GP+AN +V GNY G P ++
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420
Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
+ G + Y+ GC V S+F
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVF 452
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/326 (31%), Positives = 142/326 (43%), Gaps = 62/326 (19%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
FSG A + + D+ K +I K AD I G+ S+E E +
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR D+ LP Q +LI + + K VI V S G IA +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETQYCQAILQAWYPGQ 685
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A A+V+FG +NP GRLP+T+Y + LP + GRTY+++ G
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFGYGLSYT F Y + +TI+V + +K P
Sbjct: 737 LFPFGYGLSYTTFNYGNIKLEQTIKV--------------GETAKMVIP----------- 771
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
N G+ DG +VV VY K + K + F+RV + AG+ ++
Sbjct: 772 -------VTNTGNRDGEEVVQVYLKKQEDTEGP-AKTLRAFKRVQIPAGKTVNVELELTP 823
Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
K L D NT+ AG I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|383115340|ref|ZP_09936096.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
gi|313695250|gb|EFS32085.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
Length = 735
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 212/765 (27%), Positives = 361/765 (47%), Gaps = 109/765 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------------VP-RLGLPQYE 97
L+ D P RV DL+SRMTL+EKV QL + G VP +G Y
Sbjct: 29 LYKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYF 88
Query: 98 WWSEALHGV--------SNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQAV 147
+ AL S +G F D I G T +P + S+N L ++
Sbjct: 89 ETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVS 148
Query: 148 STEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G + V+G Q
Sbjct: 149 AQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKGYQ--- 199
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
DL++ ++++C KHY Y R + +++Q + +T+L P+EM VK
Sbjct: 200 ----GDDLSAEN-RMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYEMGVK 251
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
G A+++M S+N ++G+P A+ ++ + ++ W G+IV+D +I+ + ++ LA +
Sbjct: 252 AG-AATLMSSFNDISGVPGSANSYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAAT 308
Query: 327 KEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
K++A AGL++D + Y V++G+V +D++++ + + RLG F+
Sbjct: 309 KKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERP 368
Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
+ K+ +++++AA A E +VLLKN+ TLPL K +AV+GP A
Sbjct: 369 YTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTDK--KKIAVIGPMAKNGWD 426
Query: 446 MIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS--IFAASEAAKTA 497
++G++ G + Y F+G A + Y GC A K +N A EAA+ +
Sbjct: 427 LLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGC---ATKGDNKEGFAEALEAARWS 483
Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
D ++ G ++ E+ R + LP Q +L ++ + K P++LV+++ +++ E
Sbjct: 484 DVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELNRLE 542
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL-- 613
++ AIL PG G +A ++ G+ NP G+L +T P ++ +P+
Sbjct: 543 PISD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTGQIPIYY 591
Query: 614 -RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
R G+ G YK LYPFG+GLSYT+FKY ++ +
Sbjct: 592 NRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTVTPS------------------ 632
Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
V ++ D +V NVG+ DG++ V + P +K++
Sbjct: 633 -------------VTKVKRGDRLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELK 679
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
F++ ++AG K +F + + V+ L AGE+ I V
Sbjct: 680 HFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724
>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
CL02T00C15]
gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
CL02T12C06]
Length = 864
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 166/452 (36%), Positives = 234/452 (51%), Gaps = 40/452 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ +S+L R +DL+ ++TL+EKV + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASF I AVS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ VN V+GLQ TD N + K+ +C
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKEG VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +++DC +I HK D+ E A A + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
CG Y +A ++G + E DID S+K L LG D ++ + +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ L+ + AR+ + LL N N LPL +T+AV+GP+AN +V GNY G P ++
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420
Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
+ G + Y+ GC V S+F
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVF 452
Score = 125 bits (315), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 102/326 (31%), Positives = 143/326 (43%), Gaps = 62/326 (19%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
FSG A + + D+ K +I K AD I G+ S+E E +
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR D+ LP Q +LI + + K VI V S G IA +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A A+V+FG +NP GRLP+T+Y + LP + GRTY+++ G
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFGYGLSYT F Y+ + +TI+V + +K P
Sbjct: 737 LFPFGYGLSYTTFNYDNIKLEQTIKV--------------GETAKMVIP----------- 771
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
N G+ DG +VV VY K + K + F+RV + AG+ ++
Sbjct: 772 -------VTNTGNRDGEEVVQVYLKKQEDTEGP-TKTLRAFKRVQIPAGKTVNVELELTP 823
Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
K L D NT+ AG I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
dorei DSM 17855]
Length = 864
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 166/452 (36%), Positives = 234/452 (51%), Gaps = 40/452 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ +S+L R +DL+ ++TL+EKV + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASF I AVS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ VN V+GLQ TD N + K+ +C
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKEG VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +++DC +I HK D+ E A A + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
CG Y +A ++G + E DID S+K L LG D ++ + +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ L+ + AR+ + LL N N LPL +T+AV+GP+AN +V GNY G P ++
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420
Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
+ G + Y+ GC V S+F
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVF 452
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 144/326 (44%), Gaps = 62/326 (19%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
FSG A + + D+ K +I K AD I G+ S+E E +
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR D+ LP Q +LI + + K VI V S G IA +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A A+V+FG +NP GRLP+T+Y + LP + GRTY+++ G
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFGYGLSYT F Y+ + +TI+V + +K P
Sbjct: 737 LFPFGYGLSYTTFNYDNIKLDQTIKV--------------GETAKMVIP----------- 771
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
N G+ DG +VV VY K E A K + F+RV + AG+ ++
Sbjct: 772 -------VTNAGNRDGEEVVQVYLK-KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP 823
Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
K L D NT+ AG I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|381200965|ref|ZP_09908097.1| beta-glucosidase [Sphingobium yanoikuyae XLDN2-5]
Length = 774
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 219/733 (29%), Positives = 338/733 (46%), Gaps = 101/733 (13%)
Query: 78 VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNE 137
V L +A RLG+P + E LHG + VG ATSFP I +S++
Sbjct: 109 VNGLQKWAMTQTRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDP 156
Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
++ +++ Q ++ E RA R SP +++ARDPRWGRI ET GEDP++VG V
Sbjct: 157 AMLRQVNQVIAREIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVA 211
Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETF 257
V GLQ V G N V + KH + G + A V+E+++ E F
Sbjct: 212 AVEGLQGV-GRSRTLQSN----HVFATLKHLTGHGQPE-SGTN--IGPAPVSERELRENF 263
Query: 258 LRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV 317
PFE VK +VM SYN ++G+PS A+ LL +R EW G +V+D ++ ++
Sbjct: 264 FPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLENILREEWGFRGAVVSDYSAVDQLM 323
Query: 318 DNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTVL 376
H +A + E+A + L AG+D D + + T G V++GKV E +D +++ + +
Sbjct: 324 SIH-HIAANLEEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELK 382
Query: 377 MRLGFFDGSPQYVSLGKQDICSDENIE-LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
R G F+ +P + I ++E+ LA AA+ I LLKND LPL T+AV
Sbjct: 383 FRAGLFE-NPYADANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPE--GTIAV 438
Query: 436 VGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCD---------DVACK 482
+GP +A VA +G Y G P +S + G AN+ + G D K
Sbjct: 439 IGP--SAAVARLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTK 496
Query: 483 SNNS-----IFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLIN 531
S+ + I A EAA+ D I+ G E DR L L Q +L +
Sbjct: 497 SDPAENRKLIAQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVSEQQELFD 556
Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
+ + K P+ +V+++ G + + + AIL Y GE+GG A+AD++FG NPGG
Sbjct: 557 ALKALGK-PITVVLIN--GRPASTVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGG 613
Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
+LP+T LPL ++P R Y F LYPFG+GLSYT F +
Sbjct: 614 KLPVTVPRS--AGQLPLF-YNMKPSAR-----RGYLFDTTDPLYPFGFGLSYTSFSLS-- 663
Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
P + + VD +N G+ +G +
Sbjct: 664 -----------------------------APRLSATKIGTGGKTSVSVDVRNTGAREGDE 694
Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAG 771
VV +Y + +K++ GFQRV ++ G ++ + F ++L + + + ++ G
Sbjct: 695 VVQLYIRDKVSSVTRPVKELKGFQRVTLKPGESRTVTFTV-GPEALQMWNDQMHRVVEPG 753
Query: 772 EHTIFVGNGGVSF 784
+ I GN V+
Sbjct: 754 DFEIMTGNSSVAL 766
>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
CL02T12C01]
Length = 929
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 149/415 (35%), Positives = 230/415 (55%), Gaps = 30/415 (7%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
F D SL + R K+LVS +TL+EK+ Q+G +PRL + Y +W+EA+HGV+ G
Sbjct: 42 FQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARSGL-- 99
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
ATSFP +++++ L A S EAR N GL YW P IN++R
Sbjct: 100 --------ATSFPVSKAMSSTWDLPLIFDCAVATSDEARVYSNTKDKGLIYWCPTINMSR 151
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGR E GEDPF+ G+ AV Y++G+Q + + K + KH+AA +
Sbjct: 152 DPRWGRDEENYGEDPFLTGKIAVEYIKGMQGDD---------PKYYKTIATAKHFAANNY 202
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
+ + DAR ++ E +L FEM VKEG+ SVM +YN +NGIP A+ +LL
Sbjct: 203 EKGRHSTSSDMDAR----NLREYYLPAFEMAVKEGNVRSVMSAYNALNGIPCGANHELLI 258
Query: 294 QTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
+R EW +G++ +DC ++ V N ++ +A A ++ G DL+CG + ++
Sbjct: 259 DILRTEWGFNGFVTSDCGAVDDVYQSNRHHFVNTAAEASAVSIVNGEDLNCGNTFQDYCK 318
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAR 410
A+++G ++E D+D +L ++ +G FD + + S+ + +E+ +LA +AA+
Sbjct: 319 EAIEKGYMQEADLDTALVRVFEARFSVGEFDNASNVPWRSISDDVLDCEEHRQLAYKAAQ 378
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
E IVLLKND N LPL+ K K+VAV+GP N +G Y+G P +P G +
Sbjct: 379 EAIVLLKNDNNILPLD--KTKSVAVIGPFGNTIT--LGGYSGSPTALTTPFGGIA 429
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 131/277 (47%), Gaps = 50/277 (18%)
Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVA 534
GC V + ++ A E A AD I AG DL+V ES DR +L LPG Q +L+ V
Sbjct: 592 GCA-VTGTAETNLERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAVY 650
Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
A VIL++ + V I +A+ + + AI+ A Y G+ G+AIADV++G +NP G+L
Sbjct: 651 S-ANPNVILLLQTCSSVTINWAKEH--VPAIIEAWYGGQAQGKAIADVLYGDYNPSGKLT 707
Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGR----TYKFYNGPTLYPFGYGLSYTQFKYNL 650
TWYN ++ P L Y R TY +++ LYPFGYG+SYT F+Y
Sbjct: 708 STWYN----------ALSDLPNGMLNYDIRDAKYTYMYHDKTPLYPFGYGMSYTTFEYQK 757
Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
L+ +K+ L + D N G G+
Sbjct: 758 LNISKS-------------------------------RLAAGEELIVSADITNTGKYAGA 786
Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
++V +Y+ + I +KQ++GF RV + G K +
Sbjct: 787 EIVQLYAHVNSSIERP-LKQLVGFARVELEPGETKTV 822
>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
Length = 782
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 215/726 (29%), Positives = 341/726 (46%), Gaps = 124/726 (17%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT FPT I A+++ L K++GQ ++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R+ + G + P +++ RDPRW R+ ET GEDP + G + V GL
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL------- 224
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+L+ + +++ KH+ AY V Y A V +D+ + FL PF + G
Sbjct: 225 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDSG- 279
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++GIP ++ LL Q +R EW G++V+D SI+ + ++H F+A +KE+
Sbjct: 280 ALSVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVALTKEN 338
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A Q++ AG+D+D G YTN +AVQ G++ + ID ++ + + +G F+
Sbjct: 339 AAIQSVTAGVDVDLGGDAYTNLC-HAVQSGQMDKAVIDTAVCRVLRMKFEMGLFEHPYVD 397
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+ + + E+IELA + A+ I LLKN+ + LPL S + VAV+GP+A+ M+G
Sbjct: 398 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKTINKVAVIGPNADNRYNMLG 456
Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
+Y GI + +SP + V Y GC + + N I A EAA+
Sbjct: 457 DYTAPQEDSNVKTVLDGIITK-LSP-------SRVEYVRGCA-IRDTTVNEIEQAIEAAR 507
Query: 496 TAD----------------------ATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQ 532
++ A + G +E E DR L L G Q +L+
Sbjct: 508 RSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLES 567
Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
+ + K P+I+V + ++ +A + A+L A YPG+EGG AIADV+FG +NP GR
Sbjct: 568 LQKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGR 624
Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
LPI+ V +P+ P + Y + LY FGYG+SYT F+Y+ L
Sbjct: 625 LPISVPRS--VGQIPVYYNKKAPRN------HDYVEVSSSPLYSFGYGMSYTTFEYSALQ 676
Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
V+ RC FE +N G DG +V
Sbjct: 677 -------------------------------VVQKSARC---FEVSFKVKNTGKYDGEEV 702
Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
+Y + +KQ+ F+R ++ G K++ FV + +V+Y ++ +G
Sbjct: 703 SQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGN 761
Query: 773 HTIFVG 778
+ +G
Sbjct: 762 FHLMIG 767
>gi|427384392|ref|ZP_18880897.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
12058]
gi|425727653|gb|EKU90512.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
12058]
Length = 954
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 226/760 (29%), Positives = 357/760 (46%), Gaps = 111/760 (14%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHG 105
+ ++ + D +LP RV+ L+S MT ++K++ + G G+P L +P EA+HG
Sbjct: 164 EKTALRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHG 222
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
S GAT FP + A++N+ L ++I AV E L + W
Sbjct: 223 FSYGS----------GATIFPQALAMGATWNKKLTEEIAMAVGDE-----TLAAGTMQAW 267
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SP ++VA+D RWGR ET GEDP +V + +++G Q + L + P
Sbjct: 268 SPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-------SKGLFTTP------- 313
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+ + G D + D ++E++M E L PF ++ D S+M +Y+ G+P
Sbjct: 314 KHFGGHGAP-LGGRDSH--DIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGVPV 370
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+LL+ +R EW G+IV+DC +I + + A K +A Q L AG+ +CG
Sbjct: 371 AKSKELLHNILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGD 430
Query: 346 YYTNF-TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDE 400
Y + A + G++ ++D + + ++ R F+ +P L I SD
Sbjct: 431 TYNDKEVIQAAKDGRLNMENLDNVCRTMLRMMFRNELFEKAPNK-PLDWNKIYPGWNSDN 489
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYM 458
+ E+A +AARE IV+L+N +N LPL+ ++++AV+GP A+ G+Y +P +
Sbjct: 490 HKEMARQAARESIVMLENKENILPLDKG-IRSIAVLGPGADDLQP--GDYTPKLLPGQLK 546
Query: 459 SPIAGFS----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-- 512
S + G V Y+ GCD N I A +AA +D +++ G + EA
Sbjct: 547 SVLTGIKQAVGKQTKVIYEQGCDFTNLSETN-IPKAVKAASQSDVVVMVLGDCSTSEATT 605
Query: 513 -------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
E+ D L LPG Q +L+ V K PVILV+ + G + + KAI
Sbjct: 606 DVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILVLQA--GRPYNLTKASKLCKAI 662
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
+ PG+EGG A ADV+FG +NP GRLP+T+ +V LPL + GR
Sbjct: 663 IVNWLPGQEGGPATADVLFGDYNPAGRLPMTF--PQHVGQLPLYY-------NFKTSGRR 713
Query: 626 YKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
Y++ + LY FGYGLSYT F+Y+ L K+Q N N T A+
Sbjct: 714 YEYSDLEYYPLYYFGYGLSYTSFEYSGL-----------KVQEKDNGNITVQAT------ 756
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
+NVG G +VV +Y T I ++ F R+ ++ G
Sbjct: 757 -----------------VKNVGQRAGDEVVQLYVTDMYASVKTRITELKDFTRINLKPGE 799
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+K + F L++++ + ++ GE I V GGVS
Sbjct: 800 SKTVSFELTPY-DLSLLNDHMDRVVEKGEFKILV--GGVS 836
>gi|375149998|ref|YP_005012439.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361064044|gb|AEW03036.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 875
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 162/456 (35%), Positives = 235/456 (51%), Gaps = 36/456 (7%)
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
L Q S F F + L + RV DLVSR+TL+EKV Q+ + A G+PRL +P Y+WW+E LH
Sbjct: 21 LQAQNSKFPFQNYRLSFEDRVNDLVSRLTLEEKVAQMLNAAPGIPRLDIPAYDWWNETLH 80
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA---- 160
GV+ T ++ T FP I A+++ + ++ + E R ++N A
Sbjct: 81 GVAR----TPYN-----VTVFPQAIAMAATWDTAALYRMADCSALEGRVIHNKAIAAGKE 131
Query: 161 -----GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
GLTYW+PNIN+ RDPRWGR ET GEDP++ A +VRGLQ G++
Sbjct: 132 KDRYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAALADAFVRGLQ---GND------ 182
Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
+ LK ++C KHYA V + R+ FD VT D+ +T+L F+ V + + VMC
Sbjct: 183 PKYLKAAACAKHYA---VHSGPEPSRHVFDVDVTPYDLWDTYLPSFKKLVTVSNVAGVMC 239
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
+YN P CA L+ +R +W GY+ +DC +I NHK D+ +
Sbjct: 240 AYNAFRKQPCCASDVLMTDILRNQWSFKGYVTSDCGAIDDFYRNHKTHPDAAAASADAVF 299
Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD 395
G D+DCG AV++ K+ E ID S+K L+ + RLG FD P V +
Sbjct: 300 H-GTDIDCGNEAYRALVQAVKENKITEKQIDISVKRLFMIRFRLGMFD-PPSMVKYAQTP 357
Query: 396 ICSDENIELAAEA---AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
E+ A A A E IVLLKN NTLPL +K + V+GP+A +A +GNY+G
Sbjct: 358 ATELESAAHAKHALLMAHESIVLLKNANNTLPLKKG-LKKIVVLGPNATNVIAPLGNYSG 416
Query: 453 IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIF 488
P + ++ G A + + +NN++
Sbjct: 417 TPSKLITLFQGIKEKAGAATQVVYEKAVNYTNNNVL 452
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 128/292 (43%), Gaps = 55/292 (18%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
ADA I G+ +E E + DR + LP QT+L+ + K PV+ V+M
Sbjct: 607 ADAFIFAGGISPQLEGEEMKVSDPGFKGGDRTTILLPAIQTELMKALQASGK-PVVFVMM 665
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ G +A + NI AI+ A Y G+ G A+ADV+FG +NP GRLP+T+Y D
Sbjct: 666 T--GSALATPWESENIPAIVNAWYGGQAAGTALADVLFGDYNPSGRLPVTFYGSD----- 718
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
L + RTY+++ G LY FGYGLSYT F+Y+ L+ T Q+
Sbjct: 719 ----NDLPSFEDYSMKNRTYRYFTGKPLYGFGYGLSYTTFRYDQLTMPVTA-------QN 767
Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
+ + T V N G T G +V +Y T
Sbjct: 768 GKPVKVT-------------------------VRVTNTGKTTGDEVAQIYVVNENTSIQT 802
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+K + GFQR+ +R +K + FV + L VD G+ I VG
Sbjct: 803 ALKTLKGFQRISLRPAESKMVSFVLQS-DDLTYVDADGQRKPLTGKIQICVG 853
>gi|255530706|ref|YP_003091078.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
gi|255343690|gb|ACU03016.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
DSM 2366]
Length = 801
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 232/823 (28%), Positives = 356/823 (43%), Gaps = 150/823 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-FAHG-VPRLGLPQYEW----WSEAL--- 103
+F D S P RVKDL+ +M LDEK Q + +G V + +P EW W + +
Sbjct: 41 VFEDPSRPVDARVKDLLGQMNLDEKTCQTATLYGYGRVLKDEMPTAEWKTSIWKDGIANI 100
Query: 104 -------------------------HGVSNVG-----------PGTHFDDVIPG-----A 122
H ++ V P ++ I G A
Sbjct: 101 DEELNSLPYNKKAVTQYSFPFSKHAHAINTVQKWFVEETRLGIPVDFSNEGIHGLCHDRA 160
Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
T FP + +++N+S+ + G V EA+A LG + ++P ++VARD RWGR+ E
Sbjct: 161 TPFPAPVNIGSTWNKSIVYQAGSIVGREAKA---LGYTNV--YAPILDVARDQRWGRVVE 215
Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
EDPF++ G+QD +G ++ KHYA Y V +
Sbjct: 216 CYAEDPFLIAELGKQMTMGIQD-QG-------------TAATLKHYAVYSVPKGGRDGQA 261
Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
D V ++M E FL PF ++E +M SYN NG P L + +R ++
Sbjct: 262 RTDPHVAPREMHEMFLYPFRRVIQEAKPMGIMSSYNDWNGEPVTGSYYFLTELLRKQYGF 321
Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG---------N 353
GY+V+D ++++ + H D K+ AV Q ++AGL++ T+FT
Sbjct: 322 DGYVVSDSEAVEFISGKHHVAEDYKQ-AVKQAIEAGLNV-----RTHFTKPENFILPLRE 375
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV---SLGKQDICSDENIELAAEAAR 410
V++G V +D+ + + V RLG FD YV + + + + + ELA + R
Sbjct: 376 LVKEGSVSMKTLDERVADVLRVKFRLGLFDDP--YVKDPAAADKKVHTRADEELAVQLNR 433
Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-- 468
E +VLLKND+N LPL+ AK K + V GP A Y +S + G YA
Sbjct: 434 ESMVLLKNDKNLLPLDIAKYKRILVSGPLATEINYTTSRYGPSNNPIVSILDGIKAYAGK 493
Query: 469 --NVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEA 512
+ Y GC+ + K S I A AAK +D I + G
Sbjct: 494 NSTIAYSKGCEVIDAKWPESEIIPVELTTEEQLQIDQAVAAAKASDVIIAVVGETDEQVG 553
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
ES R L LPG Q L+ + K PV++V+++ + I + N + AIL AG+PG
Sbjct: 554 ESKSRTGLNLPGRQLMLLQALHATGK-PVVMVMVNGRPLTINWE--NRYLPAILQAGFPG 610
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
G+ +A+ +FG NPGG+L +T+ + + L + P +P G NG
Sbjct: 611 PSAGKVVAETLFGDNNPGGKLTMTYPKS--IGQIEL-NFPFKPGSQAGQGKNDDPNGNGK 667
Query: 633 T-----LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
T LYPFGYGLSYT F+++ L NL+K + ++ +D
Sbjct: 668 TRVLGALYPFGYGLSYTTFEFSNL--------NLDK----KEIHNQADV----------- 704
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+ VD +N G G +VV +Y K TY + GF+RV + G K +
Sbjct: 705 --------QVSVDVKNTGQRKGDEVVQLYLKDVVSSVTTYESVLRGFERVSLAPGETKTL 756
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
KF + L I+D N + G+ + +GN + F
Sbjct: 757 KFTLHP-DDLAILDKNMNRTVEPGKFIVMIGNSSEDIKLKKEF 798
>gi|404405497|ref|ZP_10997081.1| glycoside hydrolase family protein [Alistipes sp. JC136]
Length = 804
Score = 265 bits (678), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 211/726 (29%), Positives = 325/726 (44%), Gaps = 118/726 (16%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P E+ +E +HG+++ AT P I +++N +L + G+
Sbjct: 145 RLGIP-VEFTNEGIHGLNH-----------SRATPLPAPIAIGSTWNRALVHRAGEIAGH 192
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EAR LG + ++P ++VARDPRWGR+ E GEDPF++ V VRG+Q
Sbjct: 193 EARV---LGYKNV--YAPILDVARDPRWGRVVECYGEDPFLIAELGVEMVRGIQS----- 242
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
V+S KHYAAY V D + +++ + +L PF ++E
Sbjct: 243 ---------QGVASTLKHYAAYSVPKGGRDGNCRTDPHIAPRELHQMYLYPFRRVIRESG 293
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
VM SYN +G+P A L +R E+ GY+V+D ++++ + H +A++ ED
Sbjct: 294 PMGVMSSYNDWDGVPVTASRYFLTDLLRHEYGFDGYVVSDSEAVEYVHTKHA-VAETYED 352
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNA---------VQQGKVKETDIDKSLKYLYTVLMRLG 380
AV Q L+AGL++ TNF+ A V++G++ +D+ ++ + V RLG
Sbjct: 353 AVRQVLEAGLNV-----RTNFSPPARFILPVRKLVREGRLSMEVVDQRVREVLRVKFRLG 407
Query: 381 FFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
FD + +D++ + + R+ +VLLKN+ TLPL+ K V V GP A
Sbjct: 408 LFDNPYNDPREAVAEAGADKHRDFVLDIQRQSLVLLKNEDKTLPLDKKKTARVLVAGPLA 467
Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAAS----- 491
+ MI Y ++ + G Y A V Y GCD V +S A+
Sbjct: 468 DEDNFMISRYGPNDLPTVTVLDGIRNYLGDGAEVRYAKGCDVVDAGFPDSELTATPLTAA 527
Query: 492 ------EAAKTA---DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
EA K A D + + G D ES R L LPG Q QL+ + PV+
Sbjct: 528 ERAGINEAVKQAAGCDVIVAVLGEDDERVGESHSRTSLELPGRQQQLLEALHATGV-PVV 586
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
LV+++ + + +A N+ AIL +P EGG AIA+ +FG +NPGG+L IT+
Sbjct: 587 LVLINGQPLTVNWAA--QNVPAILEGWFPSVEGGTAIAETLFGDYNPGGKLTITF----- 639
Query: 603 VQMLPLTSMPLR---PVDSLGYPGRTYKFYNG-------PTLYPFGYGLSYTQFKYNLLS 652
P ++ + P + + K NG ++YPFGYGLSYT F Y
Sbjct: 640 ----PRSTGQIELNFPYKKGSHGAQPRKGPNGGGVTRVLGSIYPFGYGLSYTTFAY---- 691
Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
+NL + S+T+ F + N G G +V
Sbjct: 692 ---------------KNLRIAPEPSRTQGS------------FRVSCEVTNTGDRRGDEV 724
Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
V +Y TY + GF+RV + G K + F L ++D N + GE
Sbjct: 725 VQLYISDKFSSVVTYESVLRGFERVTLEPGETKTVSFEVTPSH-LELLDSNMNWTVEPGE 783
Query: 773 HTIFVG 778
I +G
Sbjct: 784 FEIRIG 789
>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
CL02T12C04]
gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
CL03T12C18]
Length = 863
Score = 265 bits (677), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 164/458 (35%), Positives = 243/458 (53%), Gaps = 45/458 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q S + + D+ L R DL+ R+TL+EKV + + + +PRLG+ YEWW+EALHGV+
Sbjct: 22 QPSKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA---MYNLG-----R 159
G AT FP I ASFN+ L ++ AVS EARA +N
Sbjct: 82 RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRY 131
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT W+PN+N+ RDPRWGR ET GEDP++ GR + VRGLQ E E
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD-------- 183
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K+ +C KH+A + W +R+ F+A + +D+ ET+L F+ V++ VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-----QVMVDNHKFLADSKEDAVAQ 333
R G P C +LL Q +R +W G +V DC +I + + H A + DAV
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVL- 299
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
+G DL+CG + + T +AV++G + E I+ S+K L LG + + + ++
Sbjct: 300 ---SGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNSTHPWSNIPF 355
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
I ++ ELA + A E +VLL+N+ N LPLN + VAV+GP+AN +V GNY G
Sbjct: 356 SVIDCPKHKELALKMAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGF 413
Query: 454 PCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
P ++ + G A + Y+ C + +S+F
Sbjct: 414 PSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 131/296 (44%), Gaps = 56/296 (18%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
++AD I G+ +E ES+ DR ++ LP Q +++ A + K V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
++ G +A N AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y +Q
Sbjct: 655 FVNFSGSAMAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS--MQ 712
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
LP + GRTY+F LYPFGYGLSYT+F Y + +N +KL
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKAT------LNQSKL 759
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
T + NVG DG +VV VY P +
Sbjct: 760 TKGEKAILT-------------------------IPVSNVGQRDGEEVVQVYICRPDDKE 794
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA-GEHTIFVGN 779
K + GFQRV + G+ + ++ S D A NT+ P G + I GN
Sbjct: 795 GPQ-KTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYGN 848
>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 265 bits (677), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 163/450 (36%), Positives = 242/450 (53%), Gaps = 38/450 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + L S R DL+ RMTL+EKV Q+ + + + RLG+P Y+WW+EALHGV+ G
Sbjct: 24 YKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHGVARAGK-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
AT FP I A+F+ + VS EARA Y+ G GLT+W
Sbjct: 82 --------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERDGYKGLTFW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ + V+GLQ G D K +C
Sbjct: 134 TPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG--GGTGKYD------KAHACA 185
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KHYA + W +R+ FDA+ ++++D+ ET+L F+ VKEG VMC+YNR G P
Sbjct: 186 KHYAVHSGPEW---NRHSFDAKNISQRDLWETYLSAFKTLVKEGKVKEVMCAYNRFEGEP 242
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
C++ +LL + +R +W +V+DC +I NH + A A + +G DL+C
Sbjct: 243 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLEC 302
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
G Y++ AV++G + E I++S+ L +LG FD + + + S E+
Sbjct: 303 GGSYSSLN-EAVRKGLISEEKINESVFRLLRARFQLGMFDDDALVSWSEIPYSVVESKEH 361
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
+ A E AR+ +VLL N +TLPL S ++ VAV+GP+AN +V + NY G P + ++ +
Sbjct: 362 VTKALEMARKSMVLLTNKNHTLPL-SKSIRKVAVLGPNANDSVMLWANYNGFPTKSVTIL 420
Query: 462 AGFSGY---ANVTYKTGCDDVACKSNNSIF 488
G V Y+ GCD V ++ S F
Sbjct: 421 EGIKSKLPEGTVYYEKGCDYVNTQTVFSYF 450
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 132/286 (46%), Gaps = 54/286 (18%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
D+ K + ++ A ADA I + GL ++E E + DR ++ LP Q
Sbjct: 581 DIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQA 640
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
+++ + + K PVI V+ S G +A N+ AIL A YPG++GG A+ADV+FG +
Sbjct: 641 EMLKALKKTGK-PVIFVLCS--GSTLALPWEAENLDAILEAWYPGQQGGTAVADVLFGDY 697
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
NP GRLP+T+Y +S L + RTY+++ G L+PFG+GLSYT F
Sbjct: 698 NPAGRLPLTFY---------ASSNDLPDFEDYDMSNRTYRYFKGKALFPFGHGLSYTIFD 748
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
Y K ++R + + +N G
Sbjct: 749 YGKAKVDK-------------------------------QNVRAGEGMTLTIPLKNTGKL 777
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
DG +V+ VY + PA+ IK + F+RV + AG+ + I+ A
Sbjct: 778 DGDEVIQVYLRNPADKEGP-IKTLRAFRRVSLPAGQTENIRIELPA 822
>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 215/726 (29%), Positives = 341/726 (46%), Gaps = 124/726 (17%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT FPT I A+++ L K++GQ ++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R+ + G + P +++ RDPRW R+ ET GEDP + G + V GL
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL------- 224
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+L+ + +++ KH+ AY V Y A V +D+ + FL PF + G
Sbjct: 225 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG- 279
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++GIP ++ LL Q +R EW G++V+D SI+ + ++H F+A +KE+
Sbjct: 280 ALSVMTSYNSIDGIPCTSNHNLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVAPTKEN 338
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A Q++ AG+D+D G YTN +AVQ G++ + ID ++ + + +G F+
Sbjct: 339 AAIQSVTAGVDVDLGGDAYTNLC-HAVQSGQMDKAVIDTAVCRVLRMKFEMGLFEHPYVD 397
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+ + + E+IELA + A+ I LLKN+ + LPL S + VAV+GP+A+ M+G
Sbjct: 398 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKTINKVAVIGPNADNRYNMLG 456
Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
+Y GI + +SP + V Y GC + + N I A EAA+
Sbjct: 457 DYTAPQEDSNVKTVLDGIITK-LSP-------SRVEYVRGCA-IRDTTVNEIEQAIEAAR 507
Query: 496 TAD----------------------ATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQ 532
++ A + G +E E DR L L G Q +L+
Sbjct: 508 RSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLES 567
Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
+ + K P+I+V + ++ +A + A+L A YPG+EGG AIADV+FG +NP GR
Sbjct: 568 LQKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGR 624
Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
LPI+ V +P+ P + Y + LY FGYG+SYT F+Y+ L
Sbjct: 625 LPISVPRS--VGQIPVYYNQKAPRN------HDYVEVSSSPLYSFGYGMSYTTFEYSDLQ 676
Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
V+ RC FE +N G DG +V
Sbjct: 677 -------------------------------VVQKSARC---FEVSFKVKNTGKYDGEEV 702
Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
+Y + +KQ+ F+R ++ G K++ FV + +V+Y ++ +G
Sbjct: 703 SQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGN 761
Query: 773 HTIFVG 778
+ +G
Sbjct: 762 FHLMIG 767
>gi|449527525|ref|XP_004170761.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
Length = 241
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 125/195 (64%), Positives = 145/195 (74%), Gaps = 8/195 (4%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
FC SL RVKDL+ R+TL EK++ L + A VPRLG+ YEWWSEALHGVSNVGPGT
Sbjct: 46 FCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSEALHGVSNVGPGT 105
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
F PGATSFP VI T ASFN+SLW IG+ VS EARAMYN G AGLTYWSPN+N+ R
Sbjct: 106 KFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTAGLTYWSPNVNIFR 165
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGR ETPGEDP + +YA NYV+GLQ +G + LKV++CCKHY AYD+
Sbjct: 166 DPRWGRGQETPGEDPILAAKYAANYVQGLQGNDG--------KKRLKVAACCKHYTAYDL 217
Query: 234 DNWKGVDRYHFDARV 248
DNW GVDRYHF+A+V
Sbjct: 218 DNWNGVDRYHFNAKV 232
>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
CL02T12C01]
Length = 850
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 168/459 (36%), Positives = 246/459 (53%), Gaps = 41/459 (8%)
Query: 45 LGLQMSSFL-FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEAL 103
LG +S+ L + + L R DL+ R+T++EK+ + + + G+PRLG+ YEWW+EAL
Sbjct: 6 LGTTLSAQLPYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEAL 65
Query: 104 HGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA----MYNLGR 159
HGV+ G AT FP I ASFN+SL +K+ AVS EARA + G+
Sbjct: 66 HGVARAGL----------ATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQ 115
Query: 160 ----AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
GLT W+PN+N+ RDPRWGR ET GEDP++ R V V+GLQ + +
Sbjct: 116 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPD--------S 167
Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVM 274
+R K+ +C KH+A + W +R+ F+A + +D+ ET+L F+ V+E D VM
Sbjct: 168 ARYDKLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKTLVQEADVKEVM 224
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM--VDNHKFLADSKEDAVA 332
C+YNR G P C +LL Q +R EW +G +V+DC +I H D+ A A
Sbjct: 225 CAYNRFEGDPCCGSNRLLTQILRDEWGFNGIVVSDCGAISDFWGAKKHNTHPDAAH-ASA 283
Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG 392
+ +G DL+CG Y T +AV+ G + E ID S+K L LG + S + +L
Sbjct: 284 DAVLSGTDLECGSNYRKLT-DAVKAGIISEEQIDISVKRLLKARFELGEMEESHPW-ALP 341
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
+ E+ LA + A E + LL+N +N LPL+ K VAV+GP+AN +V GNY G
Sbjct: 342 YSIVDCPEHRHLALQIAHETMTLLQNKENILPLD--KHAKVAVIGPNANDSVMQWGNYNG 399
Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
P + ++ A + Y+ C + NS+F
Sbjct: 400 TPSHTSTLLSALRSKLPAAQLIYEPVCGLTDDITFNSLF 438
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/302 (28%), Positives = 131/302 (43%), Gaps = 58/302 (19%)
Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
A E K + I G+ +E E + DR D+ LP Q ++ + + K
Sbjct: 579 ATLEKLKDTEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELPAVQRNVLAALKKAGK 638
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
VI V S G +A N AIL A YPG+EGG A+ADV+FG +NP GRLP+T+Y
Sbjct: 639 -KVIFVNFS--GSAMALTPETENCDAILQAWYPGQEGGTAVADVLFGDYNPAGRLPVTFY 695
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
++ LP + GRTY++ L+PFGYGLSYT F Y
Sbjct: 696 KN--MEQLP-------DFEDYSMQGRTYRYMKEAPLFPFGYGLSYTTFTYG--------- 737
Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
+ A K R + + + N+GS DG +VV VY +
Sbjct: 738 --------------KARADKKR--------ISTGEKMTLTIPVSNIGSRDGEEVVQVYLR 775
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGR--NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
+ K + F+RV + G+ N +I+ + A + + + +++ GE+ +
Sbjct: 776 REDDPEGP-TKTLRAFKRVEITKGKSLNVKIELPYTAFEWFDNSTHTMHSM--KGEYEVL 832
Query: 777 VG 778
G
Sbjct: 833 YG 834
>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
Length = 863
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 164/458 (35%), Positives = 243/458 (53%), Gaps = 45/458 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q S + + D+ L R DL+ R+TL+EKV + + + +PRLG+ YEWW+EALHGV+
Sbjct: 22 QPSKYPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA---MYNLG-----R 159
G AT FP I ASFN+ L ++ AVS EARA +N
Sbjct: 82 RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRY 131
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT W+PN+N+ RDPRWGR ET GEDP++ GR + VRGLQ E E
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD-------- 183
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K+ +C KH+A + W +R+ F+A + +D+ ET+L F+ V++ VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-----QVMVDNHKFLADSKEDAVAQ 333
R G P C +LL Q +R +W G +V DC +I + + H A + DAV
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVL- 299
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
+G DL+CG + + T +AV++G + E I+ S+K L LG + + + ++
Sbjct: 300 ---SGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNSTHPWSNIPF 355
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
I ++ ELA + A E +VLL+N+ N LPLN + VAV+GP+AN +V GNY G
Sbjct: 356 SVIDCPKHKELALKMAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGF 413
Query: 454 PCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
P ++ + G A + Y+ C + +S+F
Sbjct: 414 PSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451
Score = 119 bits (297), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 131/296 (44%), Gaps = 56/296 (18%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
++AD I G+ +E ES+ DR ++ LP Q +++ A + K V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
++ G +A N AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y +Q
Sbjct: 655 FVNFSGSAMAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS--MQ 712
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
LP + GRTY+F LYPFGYGLSYT+F Y + +N +KL
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKAT------LNQSKL 759
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
T + NVG DG +VV VY P +
Sbjct: 760 TKGEKAILT-------------------------IPVSNVGQRDGEEVVQVYICRPDDKE 794
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA-GEHTIFVGN 779
K + GFQRV + G+ + ++ S D A NT+ P G + I GN
Sbjct: 795 GPQ-KTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYGN 848
>gi|374374543|ref|ZP_09632202.1| Beta-glucosidase [Niabella soli DSM 19437]
gi|373233985|gb|EHP53779.1| Beta-glucosidase [Niabella soli DSM 19437]
Length = 799
Score = 265 bits (677), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 213/725 (29%), Positives = 334/725 (46%), Gaps = 107/725 (14%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P ++ +E +HG++ H AT+FP I +++N+ L ++GQ +
Sbjct: 138 RLGIP-VDFTNEGIHGLNQ----DH-------ATAFPAPIGIGSTWNKELVHQMGQIIGR 185
Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EA+A+ G T ++P ++VARD RWGR+ ET GEDPF+V G+Q
Sbjct: 186 EAKAL------GYTNVYAPILDVARDQRWGRVVETYGEDPFLVAGLGTALAGGIQ----- 234
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
EN V+S KH+A Y V D V ++M++ FL PF ++
Sbjct: 235 ENG---------VASTLKHFAVYSVPKGGRDGNARTDPHVAPREMQQLFLYPFRKVIQNV 285
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
VM SYN +G+P A L Q +R ++ GY+V+D +++ + + H D KE
Sbjct: 286 HPLGVMSSYNDWDGMPVTASNYFLTQLLRQQFGFDGYVVSDSRAVEFVYEKHHVAKDYKE 345
Query: 329 DAVAQTLKAGLDLDCG-QYYTNFT---GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
AV ++AGL++ +NF +++G + +++ + + +V RLG FD
Sbjct: 346 -AVKMVMEAGLNVRTEFNAPSNFILPLRQLIKEGGLSMETLNQRVGEVLSVKFRLGLFDA 404
Query: 385 SPQYVSLGK---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
YV K + + ++ + +A + RE +VLLKND+N LPL+ + + + V GP A+
Sbjct: 405 P--YVKDPKAADKIVATEASEAVALQMNRESLVLLKNDKNILPLSLGQYRNILVTGPLAD 462
Query: 442 ATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDVACKSNNS----------- 486
I Y + +S + G + A + Y GC+ S
Sbjct: 463 EKEHAISRYGPSNKKVISVLEGIRHFAAKKATINYIKGCEAADATWPESEIIDTPPTPQE 522
Query: 487 ---IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
+ A EAAK D I + G + ESL R L LPG Q +L+ ++ + K P++L
Sbjct: 523 IAEMNKAVEAAKQNDIIIAVMGENDKQVGESLSRTGLNLPGRQLRLLEELKKTGK-PMVL 581
Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW-YNGDY 602
++++ + I + N + AIL +PG GG A+A+ +FG +NPGG+L T+
Sbjct: 582 ILINGQPLTINWE--NRYLDAILETWFPGPAGGTAVAEAIFGAYNPGGKLTTTFPKTTGQ 639
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYN-----GPTLYPFGYGLSYTQFKYNLLSFTKTI 657
++M + P +P G PG Y GP LYPFGYGLSYT F+Y
Sbjct: 640 IEM----NFPFKPASHAGQPGDGPNGYGKTAVVGP-LYPFGYGLSYTTFEY--------- 685
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
N D K R + D VD +N G G +VV +Y
Sbjct: 686 ------------ANLKVDPEKART--------QAD--ISVAVDVKNTGKVKGDEVVQLYV 723
Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
K TY + GF+RV + G K + F L+I+D N ++ G I V
Sbjct: 724 KQLVSSVTTYESILRGFERVSLSPGETKTVHFKLTP-DDLSILDKNMNFVVEPGAFDIMV 782
Query: 778 GNGGV 782
G+ V
Sbjct: 783 GSSSV 787
>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 265 bits (677), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 215/726 (29%), Positives = 341/726 (46%), Gaps = 124/726 (17%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT FPT I A+++ L K++GQ ++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R+ + G + P +++ RDPRW R+ ET GEDP + G + V GL
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGASMVDGL------- 224
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+L+ + +++ KH+ AY V Y A V +D+ + FL PF + G
Sbjct: 225 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG- 279
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++GIP ++ LL Q +R EW G++V+D SI+ + ++H F+A +KE+
Sbjct: 280 ALSVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVAPTKEN 338
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A Q++ AG+D+D G YTN +AVQ G++ + ID ++ + + +G F+
Sbjct: 339 AAIQSVMAGVDVDLGGDAYTNLC-HAVQSGQMDKAVIDTAVCRVLRMKFEMGLFEHPYVD 397
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+ + + E+IELA + A+ I LLKN+ + LPL S + VAV+GP+A+ M+G
Sbjct: 398 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKMINKVAVIGPNADNRYNMLG 456
Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
+Y GI + +SP + V Y GC + + N I A EAA+
Sbjct: 457 DYTAPQEDSNVKTVLDGIITK-LSP-------SRVEYVRGCA-IRDTTVNEIEQAIEAAR 507
Query: 496 TAD----------------------ATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQ 532
++ A + G +E E DR L L G Q +L+
Sbjct: 508 RSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLES 567
Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
+ + K P+I+V + ++ +A + A+L A YPG+EGG AIADV+FG +NP GR
Sbjct: 568 LQKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGR 624
Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
LPI+ V +P+ P + Y + LY FGYG+SYT F+Y+ L
Sbjct: 625 LPISVPRS--VGQIPVYYNQKAPRN------HDYVEVSSSPLYSFGYGMSYTTFEYSDLQ 676
Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
V+ RC FE +N G DG +V
Sbjct: 677 -------------------------------VVQKSARC---FEVSFKVKNTGKYDGEEV 702
Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
+Y + +KQ+ F+R ++ G K++ FV + +V+Y ++ +G
Sbjct: 703 SQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGN 761
Query: 773 HTIFVG 778
+ +G
Sbjct: 762 FHLMIG 767
>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
Length = 896
Score = 265 bits (677), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 155/453 (34%), Positives = 245/453 (54%), Gaps = 41/453 (9%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ F + LP + R+++L++ +T +EK+ + D + VPRL +P Y WW+EALHGV+ G
Sbjct: 44 YPFRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAGI 103
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGR-AGL 162
AT FP I A+++ K + +S EARA YN GR GL
Sbjct: 104 ----------ATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKTGRYEGL 153
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+W+PNIN+ RDPRWGR ET GEDP++ V V+GLQ + + K
Sbjct: 154 TFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGND---------PKFFKTH 204
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+C KH+A + W +R+ ++A ++++D+ ET+L F+ V+EG+ VMC+YN +G
Sbjct: 205 ACAKHFAVHSGPEW---NRHSYNAEISKRDLYETYLPAFKALVQEGNVREVMCAYNAFDG 261
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLKAGLD 340
P CA+ LL + +RG+W G +V+DC ++ H D K A A LK D
Sbjct: 262 QPCCANNTLLTEILRGKWKYDGMVVSDCWALADFFQKKYHGTHPDEKTTA-ADALKHSTD 320
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICS 398
L+CG Y N ++ G + E DID+S++ + LG D S + ++ + S
Sbjct: 321 LECGDTYNNLN-KSLASGLITEKDIDESMRRILKGWFELGMLDPKSSVHWNTIPYSVVDS 379
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
+E+ + A + A++ IVL+KN++N LPLN +K +AVVGP+A+ + +GNY G P +
Sbjct: 380 EEHKKQALKMAQKSIVLMKNEKNILPLNR-NIKKIAVVGPNADDGLMQLGNYNGTPSSIV 438
Query: 459 SPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ + G A + Y+ G + S S++
Sbjct: 439 TILDGIKTKFPNAEIIYEKGSEVTDPSSRTSLY 471
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 134/302 (44%), Gaps = 48/302 (15%)
Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
+ E K AD + GL S+E E + D+ + LP Q L+ ++ + K
Sbjct: 615 SVREKVKNADVIVFAGGLSPSLEGEEMMVNAEGFKGGDKTSIALPKVQRDLLAELRKTGK 674
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
PV+ V+ + G + + N A+L A Y G+ GG A+ADV+ G +NP G+LPIT+Y
Sbjct: 675 -PVVFVLCT--GSALGLEQDEKNYDALLNAWYGGQSGGTAVADVLAGDYNPSGKLPITFY 731
Query: 599 -NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
N + + + ++ GRTY++ LYPFG+GLSY++F Y +K
Sbjct: 732 KNLEQLDNALSKTSKHEGFENYDMQGRTYRYMTEKPLYPFGHGLSYSKFVYGDSKLSK-- 789
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
N + ++ + N+ +G +VV VY
Sbjct: 790 -----------------------------NSISVNENVTITIPVTNISEREGEEVVQVYI 820
Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA-GEHTIF 776
K + A +K + F+R +++ K I+ + + S D A+ L+ G++TIF
Sbjct: 821 KRNNDAQAP-VKTLRAFERTPIKSKETKNIQLILSK-DSFAFYDEKADDLVSKPGDYTIF 878
Query: 777 VG 778
G
Sbjct: 879 YG 880
>gi|390957160|ref|YP_006420917.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
gi|390412078|gb|AFL87582.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
18391]
Length = 908
Score = 265 bits (676), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 166/439 (37%), Positives = 237/439 (53%), Gaps = 42/439 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + +L R DLV RMTL+EK Q+ + A +PRL +P Y++W+E LHGV+ G
Sbjct: 24 YLNPALTPQQRAADLVGRMTLEEKSLQMVNGAAAIPRLNVPAYDYWNEGLHGVARSG--- 80
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------GLTYW 165
AT FP I A+++ L K+IG ++TEARA N L R GLT+W
Sbjct: 81 -------YATMFPQAIGMAATWDAPLLKQIGDVIATEARAKNNEALRRNNHDIYFGLTFW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SPNIN+ RDPRWGR ET GEDP + + VN++ GLQ + + KV +
Sbjct: 134 SPNINIFRDPRWGRGQETYGEDPHLTTQLGVNFIEGLQGTD---------PKFYKVIATP 184
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A V + R+ FD T D+ +T+L F + + A S+MC+YNR++G P+
Sbjct: 185 KHFA---VHSGPEEGRHKFDVEPTPHDLWDTYLPQFRAAIVDAKADSIMCAYNRIDGQPA 241
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
C LL +R +W G++ +DC +I + H+ D+ E A L AG D +C
Sbjct: 242 CGSKLLLVDILRNDWKFQGFVTSDCGAIDDFFRPNTHQTEPDA-EHADKAALLAGTDTNC 300
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
G Y G+AV+ G +KE+DID SL+ L+ +RLG FD GS Y + + S N
Sbjct: 301 GSTYRKL-GDAVKSGLIKESDIDVSLRRLFEARVRLGLFDPAGSVPYAQIPFSQVNSPAN 359
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
+A AA E +VLLKND LPL + K KT+AV+GP+ + ++ GNY G+ P+
Sbjct: 360 AAVAKRAAEESMVLLKND-GILPLKAGKYKTIAVIGPNGASLSSLEGNYNGMAHDPRMPV 418
Query: 462 ----AGFSGYANVTYKTGC 476
+ SG NV Y G
Sbjct: 419 DALRSALSG-TNVVYAPGA 436
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 136/301 (45%), Gaps = 56/301 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A EAA +D + + GL +E E + DR D+ LP Q L+ + K
Sbjct: 624 ALEAANKSDLVVAMLGLSPDLEGEEMPVKLPGFVGGDRTDISLPASQQALLQGLIATGK- 682
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
P I+V+++ + I A+ N AIL + YPGE G A+AD + G+ NP GRLPIT+Y
Sbjct: 683 PTIVVLLNGSALAINLADEKAN--AILESWYPGEAGSTALADTLVGRNNPSGRLPITFYK 740
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
+ + +P + RTY+++ G LY FG+GLSYT+F Y+ L K
Sbjct: 741 SE-------SDLP--GFEDYSMQNRTYRYFKGAPLYGFGFGLSYTKFAYSGLKLAKA--- 788
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
L D +V +N G G +V +Y P
Sbjct: 789 ----------------------------KLNAGDTLTAEVTVKNTGKVAGEEVAELYLLP 820
Query: 720 PAEIAA--TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
PAE A + +Q+ GFQRV ++ G ++++ F + L+ VD + G + I +
Sbjct: 821 PAEGNAGLSPKQQLEGFQRVMLKPGESRKLTFTLTP-RQLSEVDAKGTRAIQPGTYAIAI 879
Query: 778 G 778
G
Sbjct: 880 G 880
>gi|167765233|ref|ZP_02437346.1| hypothetical protein BACSTE_03621 [Bacteroides stercoris ATCC
43183]
gi|167696861|gb|EDS13440.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
stercoris ATCC 43183]
Length = 818
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 209/729 (28%), Positives = 332/729 (45%), Gaps = 123/729 (16%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P ++ +E +HG+++ AT P I +++N+ L ++ G
Sbjct: 157 RLGIP-VDFTNEGIHGLNHTK-----------ATPLPAPIAIGSTWNKELVRRAGVIAGQ 204
Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EA+A+ G T ++P +++ RDPRWGR E GE+P+++ V G+Q +G
Sbjct: 205 EAKAL------GYTNVYAPILDIVRDPRWGRTLECYGEEPYLIAALGTEMVNGIQS-QG- 256
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
V++ KHYA Y V D V +++ E FL PF+ ++
Sbjct: 257 ------------VAATLKHYAVYSVPKGGRDGNCRTDPHVAPRELHELFLYPFKKVIQNS 304
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
VM SYN +G+P A L + +R E+ GY+V+D ++++ + H +AD+ +
Sbjct: 305 HPMGVMSSYNDWDGVPVSASYYFLTELLREEYGFDGYVVSDSEAVEFVESKH-HVADTYD 363
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNA---------VQQGKVKETDIDKSLKYLYTVLMRL 379
+AV Q L+AGL++ T+FT + +++ K+ IDK + + V RL
Sbjct: 364 EAVRQVLEAGLNVR-----THFTPPSDFILPIRRLLEEKKISMAVIDKRVSEVLRVKFRL 418
Query: 380 GFFDGSPQYVSLGKQDIC--SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG 437
G FD P D +D N++ + ++ +VLLKN+ N LPL+ ++K V V G
Sbjct: 419 GLFD-QPYVADTKAADRVGGADRNMDFVKQMQQQALVLLKNENNILPLDKRQIKKVLVTG 477
Query: 438 PHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDV-------------- 479
P A+ M Y ++ +AG Y A V Y GCD V
Sbjct: 478 PLADEDNFMTSRYGPNGLETVTVLAGLRNYLKGIAEVDYAKGCDIVDAGWPATEILPAPM 537
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
+ + I A A +D I + G D ES R L LPG Q QL+ + K
Sbjct: 538 SEQEKQGIAEAVAKAGESDVIIAVLGEDEYRTGESRSRTSLDLPGRQQQLLEALHATGK- 596
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PVILV+++ + + +A N I AIL + +PG +GG IA+ +FG+ NPGG+L +T+
Sbjct: 597 PVILVLINGQPLTVNWA--NAYIPAILESWFPGCQGGTVIAETLFGEHNPGGKLTVTFPK 654
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT----------LYPFGYGLSYTQFKYN 649
V + L + P +P P ++GP LYPFG+GLSYT F Y+
Sbjct: 655 S--VGQIEL-NFPFKPGSHGAQP------HSGPNGSGATRIIGELYPFGFGLSYTTFAYS 705
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
L ++ LQ YT KV+ N G G
Sbjct: 706 DLE--------VSPLQQHTQGEYT-----------------------IKVNVTNTGKRAG 734
Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
+VV +Y + TY Q+ GF+RV ++ G +++ F + L I+D N +
Sbjct: 735 DEVVQLYVRDKVSSVITYDSQLRGFERVSLQPGETRQVTFSLKP-EDLQILDRNMNWTVE 793
Query: 770 AGEHTIFVG 778
GE + +G
Sbjct: 794 PGEFEVMIG 802
>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
Length = 863
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 164/456 (35%), Positives = 241/456 (52%), Gaps = 41/456 (8%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q S + + D+ L R DL+ R+TL+EKV + + + +PRLG+ YEWW+EALHGV+
Sbjct: 22 QPSKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA----MYNLGR---- 159
G AT FP I ASFN+ L ++ AVS EARA G+
Sbjct: 82 RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRY 131
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT W+PN+N+ RDPRWGR ET GEDP++ GR + VRGLQ E E
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQGPEDAEYD-------- 183
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K+ +C KH+A + W +R+ F+A + +D+ ET+L F+ V++ VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA- 337
R G P C +LL Q +R +W G +V DC +I K ++ DAV + A
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH--ETHPDAVHASADAV 298
Query: 338 --GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD 395
G DL+CG + + T +AV++G + E I+ S+K L LG + + + ++
Sbjct: 299 LNGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNIPYSV 357
Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
I ++ ELA + A E +VLL+N N LPLN + VAV+GP+AN +V GNY G P
Sbjct: 358 IDCPKHKELALKMAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPS 415
Query: 456 RYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
++ + G A + Y+ C + +S+F
Sbjct: 416 HTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 95/296 (32%), Positives = 131/296 (44%), Gaps = 56/296 (18%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
K AD I G+ +E ES+ DR ++ LP Q +++ A + K V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
++ G +A + AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y +Q
Sbjct: 655 FVNFSGSAMAIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS--IQ 712
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
LP + GRTY+F LYPFGYGLSYT+F Y + Q LNK
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKATLN---QSKLNKG 762
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
+ +L + NVG DG +VV VY P +
Sbjct: 763 EKA----------------ILT------------IPVSNVGQRDGEEVVQVYICRPDDKE 794
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVGN 779
K + GFQRV + G+ + + S D A NT+ P +G + I GN
Sbjct: 795 GPQ-KTLRGFQRVNIAKGKTQNVSIEL-PYDSFEWFDTATNTIRPLSGTYKILYGN 848
>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 163/450 (36%), Positives = 242/450 (53%), Gaps = 38/450 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + L S R DL+ RMTL+EKV Q+ + + + RLG+P Y+WW+EALHGV+ G
Sbjct: 24 YKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHGVARAGK-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
AT FP I A+F+ + VS EARA Y+ G GLT+W
Sbjct: 82 --------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERDGYKGLTFW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PNIN+ RDPRWGR ET GEDP++ + V+GLQ G D K +C
Sbjct: 134 TPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG--GGTGKYD------KAHACA 185
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KHYA + W +R+ FDA+ ++++D+ ET+L F+ VKEG VMC+YNR G P
Sbjct: 186 KHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVKEGKVKEVMCAYNRFEGEP 242
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
C++ +LL + +R +W +V+DC +I NH + A A + +G DL+C
Sbjct: 243 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLEC 302
Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
G Y++ AV++G + E I++S+ L +LG FD + + + S E+
Sbjct: 303 GGSYSSLN-EAVRKGLISEEKINESVFRLLRARFQLGMFDDDALVSWSEIPYSVVESKEH 361
Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
+ A E AR+ +VLL N +TLPL S ++ VAV+GP+AN +V + NY G P + ++ +
Sbjct: 362 VAKALEMARKSMVLLTNKNHTLPL-SKSIRKVAVLGPNANDSVMLWANYNGFPTKSVTIL 420
Query: 462 AGFSGY---ANVTYKTGCDDVACKSNNSIF 488
G V Y+ GCD V ++ S F
Sbjct: 421 EGIKSKLPEGTVYYEKGCDYVNTQTVFSYF 450
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 132/286 (46%), Gaps = 54/286 (18%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
D+ K + ++ A ADA I + GL ++E E + DR ++ LP Q
Sbjct: 581 DIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQA 640
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
+++ + + K PVI V+ S G +A N+ AIL A YPG++GG A+ADV+FG +
Sbjct: 641 EMLKALKKTGK-PVIFVLCS--GSTLALPWEAENLDAILEAWYPGQQGGTAVADVLFGDY 697
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
NP GRLP+T+Y +S L + RTY+++ G L+PFG+GLSYT F
Sbjct: 698 NPAGRLPLTFY---------ASSDDLPDFEDYDMSNRTYRYFKGKALFPFGHGLSYTIFD 748
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
Y K ++R + + +N G
Sbjct: 749 YGKAKVDK-------------------------------QNVRAGEGMTLTIPLKNTGKL 777
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
DG +V+ VY + PA+ IK + F+RV + AG+ + I+ A
Sbjct: 778 DGDEVIQVYLRNPADKEGP-IKTLRAFRRVSLPAGQTENIRIELPA 822
>gi|325286191|ref|YP_004261981.1| beta-glucosidase [Cellulophaga lytica DSM 7489]
gi|324321645|gb|ADY29110.1| Beta-glucosidase [Cellulophaga lytica DSM 7489]
Length = 754
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 222/741 (29%), Positives = 352/741 (47%), Gaps = 119/741 (16%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
+K++ DFA RL +P + + S+ +HG T+FP + T +S+
Sbjct: 81 KKLKIAQDFAVNDTRLKIPLF-FGSDVIHGYK---------------TTFPIPLATASSW 124
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
+ L KK+ + + EA A G+ + +SP ++VARDPRWGRI E GEDP++
Sbjct: 125 DMDLIKKMAETAALEATA------DGINWNFSPMVDVARDPRWGRIAEGAGEDPYLGSAI 178
Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
A V G Q N T N+ + + KH+A Y G D D T+ M
Sbjct: 179 AKAMVHGYQ----GNNLTAKNT----MLATVKHFALYGAAE-AGRDYNSVDMSRTK--MF 227
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
+L P++ + G A+S+M S+N V+GIP+ + LL +R +W G++V+D S+
Sbjct: 228 NQYLPPYKAGIDAG-AASIMTSFNDVDGIPASGNKWLLTDLLRKKWGFKGFVVSDYTSVN 286
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
M+ + L D +D A +LKAGLD+D G+ + ++ +G+V E +I + + +
Sbjct: 287 EMIAHG--LGDL-QDVSALSLKAGLDMDMVGEGFLTTLKKSLDEGRVTEEEITNACRRIL 343
Query: 374 TVLMRLGFFDGSPQYVSLG--KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
+LG FD +Y+ K+DI + ++ LA EAA+ VL KN N LPL +K
Sbjct: 344 EAKYKLGLFDDPYKYIDAKRPKKDILTKKSKTLAREAAKRSFVLFKNHNNILPL--SKTA 401
Query: 432 TVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGF---SGYANVTYKTGC---DDVACKS 483
+A+VGP AN M+G +A G P + + GF + A +TY G DD
Sbjct: 402 KIALVGPLANNKNNMLGTWAPTGDPQLSIPILNGFKNVASKAKITYAKGANITDDTELAK 461
Query: 484 NNSIFA----------------ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
++F A + AKT+D + + G + E+ R D+ +P Q
Sbjct: 462 KVNVFGTRVDIDKRSSEELLQEALDLAKTSDVVVAVVGEASEMSGEAASRTDISIPNSQK 521
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
+LI ++ + K PV+LV+MS + I E N + +IL +PG E G A+ADV+FG +
Sbjct: 522 RLIQELVKTGK-PVVLVLMSGRPLTIE-EEFNLPV-SILQVWHPGIEAGNAVADVIFGDY 578
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF----------YNGPTLYPF 637
NP G+L TW V +P+ + + G P + F N P L PF
Sbjct: 579 NPSGKLTATWPRN--VGQIPI----YHSIKNTGRPAPSPAFEKFKSNYLDVKNAP-LLPF 631
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
GYGLSYT FKY+ +NL+K + + + T
Sbjct: 632 GYGLSYTSFKYS--------NINLSKKEIAQGEDVT-----------------------V 660
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
V +N G+ DG +VV +Y + ++Q+ GF++VF++ G +K ++ V A L
Sbjct: 661 SVTVKNTGNFDGEEVVQLYLRDVVRSITPPMRQLKGFKKVFLKKGESKTVELVLTA-DDL 719
Query: 758 NIVDYAANTLLPAGEHTIFVG 778
+ + + G+ IFVG
Sbjct: 720 KFYNSTLDFVAEPGDFEIFVG 740
>gi|160882671|ref|ZP_02063674.1| hypothetical protein BACOVA_00625 [Bacteroides ovatus ATCC 8483]
gi|423289150|ref|ZP_17268000.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
gi|423298450|ref|ZP_17276507.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
gi|156111986|gb|EDO13731.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
gi|392662991|gb|EIY56545.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
gi|392667846|gb|EIY61351.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
Length = 1049
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 220/769 (28%), Positives = 362/769 (47%), Gaps = 108/769 (14%)
Query: 56 DSSLPYSIR----VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG- 110
+S LP++ VKDL+SRMT++EK+ QL + G L P+ E+ S++L VG
Sbjct: 328 NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386
Query: 111 -------------------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIG 144
P DVI G T FPT + + S++ + ++
Sbjct: 387 VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ + E+ A AGL + ++P +++ARD RWGR+ E GED ++ A V G Q
Sbjct: 447 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
N + NS V +C KH+ AY + R + ++E+ + +T+L PF+
Sbjct: 501 -----WNLWENNS----VLACAKHWVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKA 548
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
C+ G + M ++N +NGIP+ A P LL +RG+W+ +G++V+D ++++ +V + +
Sbjct: 549 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLV--AQGV 605
Query: 324 ADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
A+ +DA +G+D+D Y + ++ GK+ D+D S+ + + LG F
Sbjct: 606 AEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665
Query: 383 DGSPQYVS--LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
++ + Q I E ++ A + A + VLLKND +TLPL + V+++AVVGP A
Sbjct: 666 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724
Query: 441 NATVAMIGNY-AGIPCRYMSPIAGFSGYAN--------VTYKTGCDDVACKSNNSIFAAS 491
+ ++G++ A R+++ + G N V Y GC D + + A
Sbjct: 725 DNQTELLGSWRARGEDRHVTTV--LQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAV 781
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
+ A +D I + G + ES R L LPG Q +LI ++ K PV++V+M+ +
Sbjct: 782 KLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPL 840
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD-YVQMLPLTS 610
I + + N+ AIL + G G AIAD++FG +NP GRL I++ + V +
Sbjct: 841 SIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPIYYNYK 898
Query: 611 MPLRPVDSL-GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
RP D L R N P LYPFGYGLSYT F Y+ T+
Sbjct: 899 KSGRPGDMLHSSTTRHIDVPNAP-LYPFGYGLSYTTFSYSAPQSTQK------------- 944
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
YT + V N G DG + V +Y +K
Sbjct: 945 -EYTRQET-----------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVK 986
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++ F+++F++AG +K ++F + +L D A N ++ GE I G
Sbjct: 987 ELKAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034
>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
3_8_47FAA]
Length = 863
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 164/458 (35%), Positives = 241/458 (52%), Gaps = 45/458 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q S + + D+ L R DL+ R+TL+EKV + + + +PRLG+ YEWW+EALHGV+
Sbjct: 22 QPSKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA----MYNLGR---- 159
G AT FP I ASFN+ L ++ AVS EARA G+
Sbjct: 82 RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRY 131
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT W+PN+N+ RDPRWGR ET GEDP++ GR + VRGLQ E E
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD-------- 183
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K+ +C KH+A + W +R+ F+A + +D+ ET+L F+ V++ VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-----QVMVDNHKFLADSKEDAVAQ 333
R G P C +LL Q +R +W G +V DC +I + + H A + DAV
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLN 300
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
G DL+CG + + T +AV++G + E I+ S+K L LG + + + ++
Sbjct: 301 ----GTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNIPY 355
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
I ++ ELA + A E +VLL+N N LPLN + VAV+GP+AN +V GNY G
Sbjct: 356 SVINCPKHKELALKMAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGF 413
Query: 454 PCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
P ++ + G A + Y+ C + +S+F
Sbjct: 414 PSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451
Score = 119 bits (297), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 130/296 (43%), Gaps = 56/296 (18%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
K AD I G+ +E ES+ DR ++ LP Q +++ A + K V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
++ G +A + AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y +Q
Sbjct: 655 FVNFSGSAMAIVPETQSCDAILQAWYPGQAGGTAVADVLFGNYNPAGRLPITFYKS--IQ 712
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
LP + GRTY+F LYPFGYGLSYT+F Y + +N +KL
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKAT------LNQSKL 759
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
T + NVG DG +VV VY P +
Sbjct: 760 AKGEKAILT-------------------------IPVSNVGQRDGEEVVQVYICRPDDKG 794
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVGN 779
K + GFQRV + G+ + + S D A NT+ P +G + I GN
Sbjct: 795 GPQ-KTLRGFQRVNIAKGKTQNVNIEL-PYDSFEWFDTATNTIRPLSGTYKILYGN 848
>gi|333377833|ref|ZP_08469566.1| hypothetical protein HMPREF9456_01161 [Dysgonomonas mossii DSM
22836]
gi|332883853|gb|EGK04133.1| hypothetical protein HMPREF9456_01161 [Dysgonomonas mossii DSM
22836]
Length = 780
Score = 265 bits (676), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 214/720 (29%), Positives = 340/720 (47%), Gaps = 107/720 (14%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + E HG +G T FPT I A++N +L +++ +S
Sbjct: 125 RLGIPIF-LAEECPHGHMAIG-----------TTVFPTAIGQAATWNPNLIQQMSAVISK 172
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EAR+ + + P +++AR+ RW R+ ET GEDP ++ + +V G
Sbjct: 173 EARS-----QGSHIGYGPVLDLAREARWSRVEETYGEDPVLISKMGEAFVTGF------- 220
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ DL S+P + S KH+ AY + + G + ++ V +D++E +L PFE VK G
Sbjct: 221 GSGDL-SKPYSLISTLKHFVAYGIPD--GGHNGNSNS-VGMRDLKENYLPPFEKAVKAG- 275
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM +YN V+GIP ++ LL + +W G+ V+D SI+ + +H ++ + ++
Sbjct: 276 ALSVMTAYNSVDGIPCTSNEYLLKDVLCKDWGFKGFTVSDLGSIEGLKGSH-YVVSTIQE 334
Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
A +L +GLD D G +AV++G V ET ID ++ + + +G F+
Sbjct: 335 AAILSLTSGLDCDLGGNAFFTLSDAVKKGMVGETQIDSAVYKILKLKFDMGLFENPYVDE 394
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
+ +Q + + ENI LA + ARE IVLL+N N LPLN +K+K +AV+GP+A+ +G+
Sbjct: 395 NNARQVVRTQENIVLARQVARESIVLLENKNNVLPLNKSKIKKIAVIGPNADNVYNQLGD 454
Query: 450 YAGIP-----CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATI--- 501
Y + I + + Y GC + N I A +AA +D +
Sbjct: 455 YTAPQDDSNVKTVLDGIRSKLKQSQIEYVKGCA-IRDTLNTDIDKAVQAALRSDVAVVVV 513
Query: 502 ------------ILAGLDLSVE--------AESLDREDLWLPGYQTQLINQVAEVAKGPV 541
I G ++ E E DR L L G Q +L+ + K PV
Sbjct: 514 GGSSARDFKTKYIETGAAVADEHSISDMESGEGFDRVSLDLMGKQLELLKAIKATGK-PV 572
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
++V + +++ +A N + A+L A YPG+EGG AIADV+FG++NP GRLP++
Sbjct: 573 VVVYIQGRPLNMNWASENAD--ALLSAWYPGQEGGNAIADVLFGEYNPAGRLPMSV--AK 628
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
V LP+ P S Y T K LY FGYGLS+T F+Y+ L K+
Sbjct: 629 SVGQLPVYYNHRNPA-SHDYVEMTSK-----PLYSFGYGLSFTSFEYSNLKINKS----- 677
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
+ E V+ +N G+ DG +VV +Y +
Sbjct: 678 ------------------------------NSGVEVTVELRNSGNFDGDEVVQLYLRNNR 707
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVGNG 780
I Q+ F+RV ++ G K IK + +I+D N ++ P G+ T VG+
Sbjct: 708 ASVVQPIMQLKAFERVNLKKGETKTIKLLLTK-DDFSIIDKKMNRVVEPNGDFTFMVGSA 766
>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 863
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 164/458 (35%), Positives = 241/458 (52%), Gaps = 45/458 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q S + + D+ L R DL+ R+TL+EKV + + + +PRLG+ YEWW+EALHGV+
Sbjct: 22 QPSKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA----MYNLGR---- 159
G AT FP I ASFN+ L ++ AVS EARA G+
Sbjct: 82 RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRY 131
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT W+PN+N+ RDPRWGR ET GEDP++ GR + VRGLQ E E
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD-------- 183
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K+ +C KH+A + W +R+ F+A + +D+ ET+L F+ V++ VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-----QVMVDNHKFLADSKEDAVAQ 333
R G P C +LL Q +R +W G +V DC +I + + H A + DAV
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLN 300
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
G DL+CG + + T +AV++G + E I+ S+K L LG + + + ++
Sbjct: 301 ----GTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNIPY 355
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
I ++ ELA + A E +VLL+N N LPLN + VAV+GP+AN +V GNY G
Sbjct: 356 SVINCPKHKELALKMAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGF 413
Query: 454 PCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
P ++ + G A + Y+ C + +S+F
Sbjct: 414 PSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 130/296 (43%), Gaps = 56/296 (18%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
K AD I G+ +E ES+ DR ++ LP Q +++ A + K V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
++ G +A + AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y +Q
Sbjct: 655 FVNFSGSAMAIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS--IQ 712
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
LP + GRTY+F LYPFGYGLSYT+F Y + +N +KL
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKAT------LNQSKL 759
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
T + NVG DG +VV VY P +
Sbjct: 760 AKGEKAILT-------------------------IPVSNVGQRDGEEVVQVYICRPDDKG 794
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVGN 779
K + GFQRV + G+ + + S D A NT+ P +G + I GN
Sbjct: 795 GPQ-KTLRGFQRVNIAKGKTQNVNIEL-PYDSFEWFDTATNTIRPLSGTYKILYGN 848
>gi|282878201|ref|ZP_06286997.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
gi|281299619|gb|EFA91992.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
buccalis ATCC 35310]
Length = 947
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 204/719 (28%), Positives = 333/719 (46%), Gaps = 102/719 (14%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P ++ +E + GV + AT+FPT + ++N L ++G
Sbjct: 159 RLGIP-VDFTNEGIRGVESFK-----------ATNFPTQLGLGTTWNRKLIHQVGYITGR 206
Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EAR + G T ++P ++V RD RWGR E GE PF+V + RGLQ
Sbjct: 207 EARLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPFLVAELGIQMTRGLQT---- 256
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+V+S KH+AAY + D +++ ++++ L P+ V+E
Sbjct: 257 ---------NYQVASTGKHFAAYSNNKGAREGMARVDPQMSPREVQNIHLYPWGRVVREA 307
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
M SYN +G+P L + +R ++ GY+V+D D+++ + H+ A+ KE
Sbjct: 308 GLLGAMSSYNDYDGVPIQGSFHWLTEVLRQQFGFKGYVVSDSDALEYLFSKHRTAANMKE 367
Query: 329 DAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
AV + + AGL++ C + V++G++ ID+ L+ + V +G FD
Sbjct: 368 -AVYKAVMAGLNVRCTFRSPDSFVLPLRELVKEGRIPMKVIDERLRDILRVKFMVGIFDR 426
Query: 385 SPQY-VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
Q + +++ + ++A +A+RE IVLLKN NTLPLN A +K +AV GP+AN
Sbjct: 427 PYQMNLQAADKEVDGKSHQQVALQASRESIVLLKNQNNTLPLNKASIKKIAVCGPNANDA 486
Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYA----NVTYKTGCDDV--------------ACKSNN 485
+ +Y + + G VTY GCD V N
Sbjct: 487 AYALTHYGPLAVEVTTVFEGIRNKVGSDVEVTYTKGCDLVDAHWPESELVDYPMTADEQN 546
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
I A E + +D +++ G + E+ R L LPG Q QL+ V K VILV+
Sbjct: 547 EIDKAVEQVRQSDVAVVVLGGNSRTCGENKSRSSLELPGRQLQLLKAVQATGK-TVILVL 605
Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
++ + + +A+ + AI+ A YPG +GG A+ADV+FG +NPGG+L +T+ V
Sbjct: 606 INGRPLSVNWADKF--VPAIVEAWYPGSQGGTAVADVLFGDYNPGGKLTVTF--PKTVGQ 661
Query: 606 LPLTSMPLRPV------DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
+P + P +P + LG G + NG LY FG+GLSYT FKY+ L +
Sbjct: 662 IPF-NFPSKPAALVDGGNKLGLHGNASR-ANG-ALYYFGHGLSYTTFKYSNLRLS----- 713
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
+N++ T D+ C D N G G +VV +Y +
Sbjct: 714 -------AQNISPT-DSVVVSC------------------DITNTGQRAGDEVVQLYIQD 747
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
TY K + GF+RV ++ G + + FV + L +++ ++ G+ + +G
Sbjct: 748 VLSTVTTYEKNLRGFERVHLKPGETRTLSFVIKP-EHLQLINEQYQHVVEPGDFKVMMG 805
>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
17393]
gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
intestinalis DSM 17393]
Length = 864
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 160/434 (36%), Positives = 246/434 (56%), Gaps = 41/434 (9%)
Query: 46 GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
G+ + L+ D P R+ DL+SR+T++EK+ L + G+PRL +P+Y +EALHG
Sbjct: 20 GVAQAQELYKDEKAPMHERIMDLLSRLTVEEKISLLRATSPGIPRLDIPKYYHGNEALHG 79
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG---- 161
V V PG T FP I A++N L ++ +S EARA +N G
Sbjct: 80 V--VRPGRF--------TVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQK 129
Query: 162 ------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
LT+WSP +N+ARDPRWGR ET GEDP++ G +V+GLQ G ++
Sbjct: 130 SQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVMGTAFVKGLQ---GDDD----- 181
Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
R LK+ S KH+AA + ++ +R+ + +++E+ + E +L FE CVK+G ++S+M
Sbjct: 182 -RYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMS 236
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
+YN +N +P + LL + +R +W GY+V+DC ++V+ HK++ +KE A ++
Sbjct: 237 AYNALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSI 295
Query: 336 KAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG 392
KAGLDL+CG + +A +Q V DID + + M+LG FD + Y +
Sbjct: 296 KAGLDLECGDDVFDEPLLSAYRQYMVTNADIDSAAYRVLRARMQLGLFDSGEKNPYTKIS 355
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
+ S ++ E+A AARE IVLLKN + LPLN+ KVK++AVVG NA G+Y+G
Sbjct: 356 PAVVGSAKHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGNCEFGDYSG 413
Query: 453 IPCRYMSPIAGFSG 466
P ++PI+ G
Sbjct: 414 SPV--IAPISVLQG 425
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/293 (33%), Positives = 146/293 (49%), Gaps = 54/293 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 595 AGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKV--NPNIVVVLVAG 652
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + ++ AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y L
Sbjct: 653 S-SLAVNWMDEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 704
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTYK++ G LYPFGYGLSYT FKY+ +QV
Sbjct: 705 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYS------NLQV--------- 747
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGSDVVIVYSKPPAEIAAT 726
D E V FQ N G G +V VY K P
Sbjct: 748 ----------------------ADGEEEINVSFQLKNAGKYAGDEVAQVYVKLPERDEVM 785
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
+K++ GF+RV +++G NK++ L D A + P+G++TI VG
Sbjct: 786 PVKELKGFERVALKSGENKKMTLKLRK-DLLRYWDEAKGKFVYPSGDYTIMVG 837
>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
CL02T12C01]
Length = 854
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 158/432 (36%), Positives = 243/432 (56%), Gaps = 41/432 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q ++ D + P R+ DL+S++T++EK+ L + G+PRL + +Y +EALHGV
Sbjct: 22 QKGKDVYLDMNAPQHERILDLLSKLTIEEKISLLRATSPGIPRLQIDKYYHGNEALHGV- 80
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG------ 161
V PG T FP I A +N L +I A+S EARA +N G
Sbjct: 81 -VRPGNF--------TVFPQAIGLAAMWNPQLLNEISTAISDEARARWNELEQGKKQLGQ 131
Query: 162 ----LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
LT+WSP +N+ARDPRWGR ET GEDPF+ G+ V++V+GLQ + R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDD---------PR 182
Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
LK+ S KH+AA + ++ +R+ + ++E+D+ E +L FE C+ EG A+S+M +Y
Sbjct: 183 YLKIVSTPKHFAANNEEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAY 238
Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
N +N +P + LL + +R +W GY+V+DC + +V +HK++ + E A +++A
Sbjct: 239 NAINDVPCTLNNWLLKKVLRHDWGFDGYVVSDCGAPDFLVTHHKYVK-TLEAAATLSIQA 297
Query: 338 GLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS--PQYVSLGKQ 394
GLDL+CG Y NA +Q V E +ID + ++ MRLG FD Y +
Sbjct: 298 GLDLECGDNVYMEPLLNAYKQYMVTEAEIDSAAYHILRARMRLGLFDDPNLNPYNKISPS 357
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
+ +++ +LA EAAR+ IVLLKN++ LPL+ K+K++AVVG NA G+Y+G P
Sbjct: 358 VVGCEKHSQLALEAARQSIVLLKNEKKFLPLDLKKIKSIAVVG--INAGNCEFGDYSGTP 415
Query: 455 CRYMSPIAGFSG 466
P++ G
Sbjct: 416 VN--QPVSILEG 425
Score = 126 bits (316), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 91/293 (31%), Positives = 137/293 (46%), Gaps = 50/293 (17%)
Query: 489 AASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSA 548
AA +A + D TI + G++ S+E E DR + LP Q I + ++ V++++
Sbjct: 594 AAGDAMRKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVVVLV--- 650
Query: 549 GGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP- 607
G +A + +I AI+ A YPGE GG A+A+V+FG +NPGG+LP+T+Y + LP
Sbjct: 651 AGSSLAINWMDEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRS--LDELPA 708
Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
+R GRTY+F+ G LY FG+GLSYT F Y L
Sbjct: 709 FDDYDIR-------KGRTYQFFEGNPLYAFGHGLSYTTFSYKKL---------------- 745
Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA--EIAA 725
N++ T DA K F K N G DG +V +Y K +
Sbjct: 746 -NIDSTGDAVKVS--------------FALK----NTGKYDGDEVAQLYVKYQGNDSLVK 786
Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+KQ+ GF+RV ++ G +KR+ + + PAG++ VG
Sbjct: 787 LPLKQLKGFERVHLKKGESKRVTLTVPKSELRFWDEEKGEFYTPAGDYLFMVG 839
>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
CL03T12C01]
Length = 864
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 165/452 (36%), Positives = 234/452 (51%), Gaps = 40/452 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ +S+L R +DL+ ++TL+EKV + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASF I AVS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ VN V+GLQ TD N + K+ +C
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKEG VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R EW G +++DC +I HK ++ E A A + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPNA-ESASAAAVLSGTDLE 302
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
CG Y +A ++G + E DID S+K L LG D ++ + +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ L+ + AR+ + LL N N LPL +T+AV+GP+AN +V GNY G P ++
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420
Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
+ G + Y+ GC V S+F
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVF 452
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 144/326 (44%), Gaps = 62/326 (19%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
FSG A + + D+ K +I K AD I G+ S+E E +
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR D+ LP Q +LI + + K VI V S G IA +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A A+V+FG +NP GRLP+T+Y + LP + GRTY+++ G
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFGYGLSYT F Y+ + +TI+V + +K P
Sbjct: 737 LFPFGYGLSYTTFNYDNIKLDQTIKV--------------GETAKMVIP----------- 771
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
N G+ DG +VV VY K E A K + F+RV + AG+ ++
Sbjct: 772 -------VTNAGNRDGEEVVQVYLK-KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP 823
Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
K L D NT+ AG I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|374310554|ref|YP_005056984.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358752564|gb|AEU35954.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 739
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 235/760 (30%), Positives = 353/760 (46%), Gaps = 119/760 (15%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ DS RV L++ MTLDEK+ L VPRLG+ E LHG++ GPG
Sbjct: 44 VYLDSHADPESRVTALLAAMTLDEKIHALSTDP-SVPRLGVAGTNH-VEGLHGLALGGPG 101
Query: 113 THFD---------DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR-AMYNLGRAGL 162
H++ +VIP T FP +++ +L +K + E R A R GL
Sbjct: 102 -HWEGHSEGRTMLNVIP-TTQFPQSRGLGQTWDPALLQKAAAQEAYETRFAFGKYHRGGL 159
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
+PN +++RDPRWGR E+ GEDPF+VG A + GLQ + H T +
Sbjct: 160 VVRAPNADLSRDPRWGRGEESYGEDPFLVGTLATAFAHGLQGDDPHVWMT---------A 210
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+ A ++ + +FDAR+ E + PF M ++EG A ++M SYN N
Sbjct: 211 SLLKHFLANSNEDGRDGSSSNFDARL----FHEYYAVPFRMAIEEGHADAMMTSYNAWNS 266
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD-- 340
+P A+P ++ V +W L G + D ++ MV H A E A A + AG++
Sbjct: 267 VPMTANP-VVRDVVMAQWGLDGIVCTDAGALTNMVKQHHTYATMPE-AAAAAIHAGINQF 324
Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDIC 397
LD Y +A+QQ + E DID++L+ +Y V++ LG D SP Y +G D
Sbjct: 325 LDD---YQQPVRDALQQKLITEQDIDRNLRGVYRVMLHLGLLDPTANSP-YSHIGAFDQA 380
Query: 398 SDE--NIE----LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
+ N E L A E IVLLKN LPL++AK+K++AV+G + TVA+ Y+
Sbjct: 381 QSDPWNTEAPRALVRRATDESIVLLKNTGGALPLDAAKLKSIAVIGQWGD-TVAL-DWYS 438
Query: 452 GIPCRYMSPIAGF---SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDL 508
G P ++P+ G + A+V + G D+ A + A ++A I++ G
Sbjct: 439 GTPLLSVTPVEGIRRRAAGASVVFNDGKDEAAAAA---------LAARSEAVIVIVGNHP 489
Query: 509 SVEA------------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFA 556
+ +A E++DR+ L LP L+ V +A P +V++
Sbjct: 490 TCDAGWNKCALPSEGKEAIDRKSLTLP--DESLVKAV--LAANPHAVVVLQT-SFPYTTN 544
Query: 557 ETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPV 616
T + AIL + EE G A+ADV+FG +NP GRL TW Q+ P+ LR
Sbjct: 545 WTQEHAPAILEITHNSEEQGTALADVLFGDYNPAGRLTQTW-PASLEQLPPMMDYDLR-- 601
Query: 617 DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
GRTY + LYPFG+GLSYT F Y+ L+ T+
Sbjct: 602 -----HGRTYLYAEKAPLYPFGFGLSYTSFAYSDLTVTQ--------------------- 635
Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
R + V +V N GS G +VV +Y+ I+++ F+R
Sbjct: 636 ---RGKSIAV-----------QVTVANTGSRAGDEVVQIYAAHQGSTVPRPIEELKAFRR 681
Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
V +RAG + ++F SL D A + + G+ F
Sbjct: 682 VALRAGEKQVVRFEM-PVTSLAYWDEATHRFIVEGDRVEF 720
>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
sp. HGB5]
Length = 771
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 208/718 (28%), Positives = 333/718 (46%), Gaps = 108/718 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT+FPT +++N L +++G+ ++
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R + G + P +++ RDPRW R E+ GED ++ R YVRG
Sbjct: 168 EIRL-----QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGT------- 215
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ DL S+ S KH+ AY + + E+++ ET+L PFE VK G
Sbjct: 216 GSGDL-SQSRHALSTLKHFIAYGASEGGQNGGSNL---LGERELRETYLPPFEAAVKAG- 270
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM +YN V+GIP A+ ++L +RGEW G++V+D SI+ + + H +E
Sbjct: 271 ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREA 330
Query: 330 AVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
AV Q L+AG+D D G + + A + G V E +ID++++ + + +G F+ +P
Sbjct: 331 AV-QALRAGVDADLKGGAFASLR-EAAEAGDVAEAEIDRAVERVLALKFEMGLFE-NPYI 387
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
++ + ELA EAAR+ + LL+N TLPL+ +++ VAV+GP+A+ +G
Sbjct: 388 DEAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLG 447
Query: 449 NYAGIPCRYMSPIAGFS---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
+Y + G G V Y GC V + I AA AA+ DA +++ G
Sbjct: 448 DYTAQQTAANTVRDGLEKLLGRDRVVYSRGC-TVRGGDRSEIAAAVSAARGTDAAVVVIG 506
Query: 506 ----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
D E E DR L L G Q +L+ ++ P+I
Sbjct: 507 GSSARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRIKATGT-PLI 565
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
+V ++ +D+ A + A+L A YPG GG A+A+ + G+ NP GRLPIT +
Sbjct: 566 VVCIAGRPLDLRRASEQAD--ALLMAWYPGARGGDAVAETILGRNNPAGRLPITIPRAE- 622
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
+P+ RP + Y LYPFGYGLSY+ F+Y L + Q N
Sbjct: 623 -GQIPVYYNKKRPANH------DYTDLTAAPLYPFGYGLSYSTFEYGSL---EARQSGDN 672
Query: 663 KLQ-HCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
L+ CR N +D D+ + + SD+V +PP
Sbjct: 673 VLEVSCRIRN--------------TSDREGDEVVQLYI----------SDMVASTVRPP- 707
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+Q+ GF+R+ + G +++ F ++L ++D ++ G+ I VG+
Sbjct: 708 -------RQLGGFRRIRLAPGEQRQVSFTLGD-EALALIDPQGRRVVEKGDFVIAVGS 757
>gi|375309610|ref|ZP_09774891.1| glycoside hydrolase [Paenibacillus sp. Aloe-11]
gi|375078919|gb|EHS57146.1| glycoside hydrolase [Paenibacillus sp. Aloe-11]
Length = 769
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 227/817 (27%), Positives = 366/817 (44%), Gaps = 150/817 (18%)
Query: 49 MSSFLFCDSSLPYSIRVKDLVSRMTLDEK----VQQLG---------------DFAH--- 86
M+ ++ D S P RVK L+ MT++EK VQ G DF
Sbjct: 1 MTMLIYKDKSKPIEERVKHLIGLMTIEEKVGQLVQPFGWQVYEHTDGELSLHHDFKQQVQ 60
Query: 87 --GVPRL-GLPQYEWWS--------------EALHGVSN-------------VGPGTHFD 116
GV L G+ + + W+ EA++ + +G
Sbjct: 61 NGGVGSLYGVLRADPWTGVTLENGLSAKQGAEAVNLIQRYAVEHSRLGIPILIGEECSHG 120
Query: 117 DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPR 176
+ T FP + +++N L++ + +AV++E RA + G +SP ++V RDPR
Sbjct: 121 HMAIDGTVFPVPLSIGSTWNVDLYRDMCRAVASETRA-----QGGAVTYSPVLDVVRDPR 175
Query: 177 WGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDN 235
WGR E GEDP+++G +AV V GLQ E+ +S V++ KH+A Y +
Sbjct: 176 WGRTEECFGEDPYLIGEFAVAAVEGLQG----ESLLSEHS----VAATLKHFAGYGSSEG 227
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
+ H R ++ E L PF+ V G A S+M +YN ++G+P + +LL+
Sbjct: 228 GRNAGPVHMGWR----ELLEVDLYPFQKAVVAG-AQSIMPAYNEIDGVPCTVNAELLDDI 282
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNA 354
+R W G ++ DC +I+++V+ H + ++ DA Q ++AG+D++ G+ + + A
Sbjct: 283 LRQSWGFDGLVITDCGAIEMLVNGHD-VTENGSDAAVQAIRAGIDMEMSGEMFGSHLVEA 341
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
GK++ + +D++ + + T+ RLG FD +Q I E+I LA + A EGIV
Sbjct: 342 AHAGKLETSVLDQAGRRVLTLKYRLGLFDNPYVNAERAEQVIGRAEHIRLARQLATEGIV 401
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFSGYA---- 468
LLKN TLPL K +AV+GP+A+ +G+Y R ++ + G
Sbjct: 402 LLKNVNRTLPL-PKNSKRIAVIGPNADQVYNQLGDYTSPQPRSRVVTVLDGIRSKLSKHQ 460
Query: 469 -NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG-----------LDLSVEA---- 512
+V Y GC + +S A A AD +++ G +DL A
Sbjct: 461 DDVLYTPGC-RIKGESREGFENALACAAEADTVVMVVGGSSARDFGEGTIDLKTGASKVA 519
Query: 513 ----------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
E +DR L L G Q QL+ ++ + K LV++ G IA +
Sbjct: 520 DHDWNDMECGEGIDRMTLGLAGVQLQLMQEIYSLGKE---LVVVYMNGRPIAEPWVEEHA 576
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
AI+ A YPG+EGG AIAD++FG NP GRL ++ +V LP+ R
Sbjct: 577 HAIVEAWYPGQEGGHAIADILFGDVNPSGRLTLSIPK--HVGQLPVYYNGKRS------R 628
Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
G+ Y + YPFGYGLSYT F Y L+ +
Sbjct: 629 GKRYLEDDAEPRYPFGYGLSYTTFSYERLTLS---------------------------- 660
Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
N +R D+ VD N G +G++VV +Y ++++ GF +V ++ G
Sbjct: 661 ---TNSIRADESVTVTVDVTNTGEREGAEVVQLYISDTVSSVTRPVRELKGFCKVVLQPG 717
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ ++FV + K L + ++ AG +I VG
Sbjct: 718 ETRTVEFVVGSDK-LQYIGRDLQPVVEAGRFSIQVGR 753
>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
Length = 855
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 157/443 (35%), Positives = 243/443 (54%), Gaps = 40/443 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R +DLV+R+TL+EK + D + +PRLG+ ++ WWSEALHG +N DDV T
Sbjct: 24 RAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHGFANN------DDV----T 73
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG-RAG--------LTYWSPNINVARD 174
FP + ASF++ L ++ A S E RA Y+ R G L+ W+PN+N+ RD
Sbjct: 74 VFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENKRFLSLSVWTPNVNIFRD 133
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR ET GEDP++ R V V+GLQ E +++ K+ +C KHYA +
Sbjct: 134 PRWGRGQETYGEDPYLTSRMGVQVVKGLQGPE--------DAKYKKLLACAKHYAVHSGP 185
Query: 235 NWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
W R+ + V+++D+ ET+L F++ V++ + VMC+Y R++ P C +LL
Sbjct: 186 EW---SRHELNLNNVSQRDLWETYLPAFKVLVQDANVRQVMCAYQRLDDEPCCGSDRLLQ 242
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT-- 351
Q +R +W +V+DC +IQ +H +D+ A A+ + AG D++C N+
Sbjct: 243 QILREKWGFEHLVVSDCGAIQDFYTSHNVSSDAVH-AAAKAVLAGTDVECQWDKHNYKLL 301
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAA 409
AV++G VKE DID+S+K + LG D Y + I ++E+ +LA + A
Sbjct: 302 PEAVEKGLVKEEDIDRSVKRVLIGRFELGEMDPDEIVPYAQIPASVINNEEHRQLALKMA 361
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---G 466
RE + LL+N N LPL+ + + +AV+GP+A+ + GNY G P R +S + G + G
Sbjct: 362 RESMTLLQNKNNILPLSKGQDR-IAVIGPNADDEPMLWGNYNGTPVRTISILDGITSKIG 420
Query: 467 YANVTYKTGCDDVACKSNNSIFA 489
++ Y CD V K S F+
Sbjct: 421 EKSIVYDKACDLVEDKVTQSYFS 443
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 127/296 (42%), Gaps = 56/296 (18%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
K + I + GL +E E + DR D+ LP Q + + + K ++
Sbjct: 590 KGIETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQRNCLKTLKDAGKK---VI 646
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
++ G I T+ AIL A Y GE GG+A+ADV+FG +NP G+LP+T+Y D Q
Sbjct: 647 FVNNSGSAIGLVPETTSCDAILQAWYGGESGGQAVADVLFGDYNPSGKLPVTFYK-DTTQ 705
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
+ + GRTY+F L+PFG+GLSYT FK Q++ +++
Sbjct: 706 LPDFEDYSMN--------GRTYRFMKAEPLFPFGHGLSYTNFKIG------EAQLDKSEI 751
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
++N T + N G T+G +++ VY +
Sbjct: 752 DTSSSVNIT-------------------------ISISNEGKTEGVEIIQVYVHKQG-LE 785
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVGN 779
IK + GF+RV ++ K + S D A ++ + G + IF GN
Sbjct: 786 EGPIKTLKGFKRVNLKPNEMKNVTINL-PSNSFEFYDKKARSMKVMPGNYEIFYGN 840
>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
17242]
Length = 771
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 208/718 (28%), Positives = 333/718 (46%), Gaps = 108/718 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT+FPT +++N L +++G+ ++
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R + G + P +++ RDPRW R E+ GED ++ R YVRG
Sbjct: 168 EIRL-----QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGT------- 215
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+ DL S+ S KH+ AY + + E+++ ET+L PFE VK G
Sbjct: 216 GSGDL-SQSRHALSTLKHFIAYGASEGGQNGGSNL---LGERELRETYLPPFEAAVKAG- 270
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM +YN V+GIP A+ ++L +RGEW G++V+D SI+ + + H +E
Sbjct: 271 ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREA 330
Query: 330 AVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
AV Q L+AG+D D G + + A + G V E +ID++++ + + +G F+ +P
Sbjct: 331 AV-QALRAGVDADLKGGAFASLR-EAAEAGDVAEAEIDRAVERVLALKFEMGLFE-NPYI 387
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
++ + ELA EAAR+ + LL+N TLPL+ +++ VAV+GP+A+ +G
Sbjct: 388 DEAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLG 447
Query: 449 NYAGIPCRYMSPIAGFS---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
+Y + G G V Y GC V + I AA AA+ DA +++ G
Sbjct: 448 DYTAQQTAANTVRDGLEKLLGRDRVVYSRGC-TVRGGDRSEIAAAVSAARGTDAAVVVIG 506
Query: 506 ----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
D E E DR L L G Q +L+ ++ P+I
Sbjct: 507 GSSARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRIKATGT-PLI 565
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
+V ++ +D+ A + A+L A YPG GG A+A+ + G NP GRLPIT +
Sbjct: 566 VVCIAGRPLDLRRASEQAD--ALLMAWYPGARGGDAVAETILGHNNPAGRLPITIPRAE- 622
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
+P+ RP + Y LYPFGYGLSY+ F+Y L + Q N
Sbjct: 623 -GQIPVYYNKKRPAN------HDYTDLTAAPLYPFGYGLSYSTFEYGSL---EARQSGDN 672
Query: 663 KLQ-HCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
L+ CR N +D D+ + + SD+V +PP
Sbjct: 673 VLEVSCRIRN--------------TSDREGDEVVQLYI----------SDMVASTVRPP- 707
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+Q+ GF+R+ + G +++ F ++L+++D ++ G+ I VG+
Sbjct: 708 -------RQLGGFRRIRLAPGEQRQVSFTLGD-EALSLIDPQGRRVVEKGDFVIAVGS 757
>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 862
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 157/438 (35%), Positives = 237/438 (54%), Gaps = 37/438 (8%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ + +L R KDLV+R+TL EKV + D + VPRLG+ ++ WWSEALHG +N GP
Sbjct: 24 YQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGYANQGP-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
T FP + ASF++ + AVS EARA N R L+ W
Sbjct: 82 --------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQRFHDLSVW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+PN+N+ RDPRWGR ET GEDP++ R V+ V+GLQ D R K+ +C
Sbjct: 134 TPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQ------GPADAKYR--KLLACA 185
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KHYA + W R+ + VT +D+ ET+L F+ V++ D VMC+Y R++ P
Sbjct: 186 KHYAVHSGPEWS---RHEMNVTDVTPRDLWETYLPAFKSLVQDADVREVMCAYQRLDDEP 242
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
C + +LL Q +R +W +V+DC +I ++H +D+ A A+ + +G D++C
Sbjct: 243 CCGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSDATH-ASAKAVLSGTDVECV 301
Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENI 402
Y + +AV +G +KE DI+ S+ L T LG D + + + S+++
Sbjct: 302 GYAFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKDELVPWTKIPLSVVNSEDHQ 361
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
+LA + ARE + LL+N+ N LPL S + +AV+GP+AN + + GNY G P R ++ +
Sbjct: 362 KLALDMARETMTLLQNNNNILPL-SKSIGKLAVIGPNANDSQMLSGNYNGTPLRTINILE 420
Query: 463 GFS---GYANVTYKTGCD 477
G G +V Y GCD
Sbjct: 421 GIKTKLGADHVIYDAGCD 438
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 117/266 (43%), Gaps = 54/266 (20%)
Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
E K AD + + G+ +E E + DR D+ LP Q I + + K
Sbjct: 594 EKVKDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQRNCIEALRKAGKK-- 651
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
+V ++ G IA N AIL A Y GE GG+A+ADV+FG +NP G LP+T+Y
Sbjct: 652 -IVFVNCSGSAIAMVPETQNCDAILQAWYAGESGGQAVADVLFGDYNPSGHLPVTFYRN- 709
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
VQ LP S GRTY++ L+PFG+GLSYT F TK
Sbjct: 710 -VQQLPDFS-------DYSMKGRTYRYLKSAPLFPFGFGLSYTTFNIGEAKLTK------ 755
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
N++ + + +V N G TDG++++ VY +
Sbjct: 756 -------------------------NNITKGEAIQLRVPVANAGKTDGTELLQVYIRKVD 790
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRI 747
+ K + GF+R+ V AG+ + +
Sbjct: 791 DPDGAS-KTLRGFKRIPVSAGKTEMV 815
>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
12058]
Length = 865
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 167/467 (35%), Positives = 245/467 (52%), Gaps = 52/467 (11%)
Query: 51 SFLFCDSSL-------PY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
SF FC +L PY S R DL+ RMTL+EK+ Q+ + + + RLG+P Y
Sbjct: 8 SFCFCAVALVATAQNEPYKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAY 67
Query: 97 EWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN 156
WW+EALHGV+ G AT FP I A+F+ + VS EARA Y+
Sbjct: 68 NWWNEALHGVARAGK----------ATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYH 117
Query: 157 --------LGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
G GLT+W+PNIN+ RDPRWGR ET GEDP++ + V+GLQ
Sbjct: 118 DFQRKGERDGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG---- 173
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKE 267
D + K +C KHYA + W +R+ FDA+ ++++D+ ET+L F+ V E
Sbjct: 174 ----DGTGKYDKTHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVTE 226
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADS 326
G VMC+YNR G P C++ +LL + +R +W +V+DC +I NH +
Sbjct: 227 GKVKEVMCAYNRYEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPT 286
Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
A A + +G DL+CG Y++ AV++G + E I++S+ L +LG FD +
Sbjct: 287 AAAASADAVVSGTDLECGGSYSSLN-EAVRKGLISEDKINESVFRLLRARFQLGMFDDNT 345
Query: 387 --QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
+ + + S E++ A E AR+ +VLL N N LPL S V+ VAV+GP+AN +V
Sbjct: 346 LVSWSEIPYSVVESKEHVAKALEMARKSMVLLTNKNNILPL-SKSVRKVAVLGPNANDSV 404
Query: 445 AMIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
+ NY G P + ++ + G V Y+ GCD V ++ S F
Sbjct: 405 MLWANYNGFPTKSVTILEGIRNKLPEGAVYYEKGCDFVNTQTVFSYF 451
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 129/282 (45%), Gaps = 54/282 (19%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
D+ K + ++ A AD I + GL S+E E + DR ++ LP Q
Sbjct: 582 DIGIKKEINYKEMADKAAEADVIIFVGGLSSSLEGEEMPVDLPGFRKGDRTNIDLPQVQE 641
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
+++ + + K PV+ V+ S G +A N+ AI+ A YPG++GG A+ADV+FG +
Sbjct: 642 EMLKALKKTGK-PVVFVLCS--GSTLALPWEAENLDAIIEAWYPGQQGGTAVADVLFGDY 698
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
NP GRLP+T+Y +S L + RTY+++ G L+PFG+GLSYT F
Sbjct: 699 NPAGRLPLTFY---------ASSSDLPDFEDYDMSNRTYRYFKGRPLFPFGHGLSYTTFD 749
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
Y K I LR + + +N+G
Sbjct: 750 YGKAKADKKI-------------------------------LRAGEGLTLTIPLKNIGKL 778
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
G +VV VY + P + IK + F+R+ + AG+ + + F
Sbjct: 779 SGDEVVQVYLRNPGDKEGP-IKTLRAFRRISLEAGQAEDVLF 819
>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
CF314]
Length = 875
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 160/459 (34%), Positives = 241/459 (52%), Gaps = 45/459 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q + F + +LP R+++L+ +T+DEK+ + D + VPRL +P Y WW+EALHGV+
Sbjct: 19 QNYKYPFRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWWNEALHGVA 78
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGR 159
G AT FP I A+++ K + +S EARA YN GR
Sbjct: 79 RAGT----------ATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSFDEASKTGR 128
Query: 160 -AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
GLT+W+PNIN+ RDPRWGR ET GEDP++ V V+GLQ + +
Sbjct: 129 YEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGND---------PKY 179
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K +C KH+A + W +R+ ++A V+++D+ ET+L F+ V EG+ VMC+YN
Sbjct: 180 FKTHACAKHFAVHSGPEW---NRHSYNAEVSKRDLYETYLPAFKSLVLEGNVREVMCAYN 236
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLK 336
+G P CA LLN+ +RG+W G +V+DC ++ H D K A A LK
Sbjct: 237 AFDGQPCCASNTLLNEILRGKWKYDGMVVSDCWALADFYQEKYHGTHPDEKSTA-ADALK 295
Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD- 395
DL+CG Y N ++ G + E DID S++ + LG D P+ L Q
Sbjct: 296 HSTDLECGDTYNNLN-KSLAGGLITEKDIDISMRRILKGWFELGMLD--PKSSVLWNQIP 352
Query: 396 ---ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
+ SDE+ + A + A++ IVL+KN+ N LP N +K +AVVGP+A+ + +GNY G
Sbjct: 353 YSVVDSDEHKKQALKMAQKSIVLMKNENNILPFNK-NIKKIAVVGPNADDEMMQLGNYNG 411
Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
P ++ + G + Y+ G + S S++
Sbjct: 412 TPSSIVTILEGIKAKFPNTEIIYEKGSEVADPSSRASLY 450
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 136/302 (45%), Gaps = 48/302 (15%)
Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
+ E K AD + GL S+E E + D+ + LP Q +L+ ++ + K
Sbjct: 594 SVKEKVKDADVIVFAGGLSPSLEGEEMLVNAEGFKGGDKTSIELPKVQRELLAELRKTGK 653
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
PV+ V+ + G + + N +L A Y G+ GG A+ADV+ G +NP GRLP+T+Y
Sbjct: 654 -PVVFVLCT--GSSLGLEQDEKNYDVLLNAWYGGQSGGTAVADVLAGDYNPSGRLPVTFY 710
Query: 599 -NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
N + + + + ++ GRTY++ LY FG+GLSY++F Y +K
Sbjct: 711 KNLEQLDNALSKTSKHQGFENYDMQGRTYRYMTENPLYAFGHGLSYSKFNYGNAKLSK-- 768
Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
N + ++ V N+ DG +VV VY
Sbjct: 769 -----------------------------NSISPNEDIIITVPVTNISDRDGEEVVQVYV 799
Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIF 776
K ++ A +K + F+RV +R+ K I+ + +S D A+ L+ +G++TI
Sbjct: 800 KRNNDVLAP-VKTLRAFERVLIRSKETKNIQLTISK-ESFKFYDEKADDLISKSGDYTIL 857
Query: 777 VG 778
G
Sbjct: 858 YG 859
>gi|322371968|ref|ZP_08046510.1| glycoside hydrolase family 3 domain protein [Haladaptatus
paucihalophilus DX253]
gi|320548390|gb|EFW90062.1| glycoside hydrolase family 3 domain protein [Haladaptatus
paucihalophilus DX253]
Length = 776
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 214/745 (28%), Positives = 338/745 (45%), Gaps = 126/745 (16%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
++ +L DF RLG+P E L G P T+FP ++ +++
Sbjct: 81 KRTNELQDFLGSETRLGIPAIPH-EECLSGYMG-----------PSGTTFPQMLGVASTW 128
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
+ L +I + + A+ G T+ SP +++ARD RWGR+ ET GEDP++V
Sbjct: 129 SPDLVAEITDTIRGQLEAI------GTTHALSPVLDIARDLRWGRVEETFGEDPYLVAAM 182
Query: 195 AVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDM 253
A YV GLQ D +G +S+ KH+A + G +R + V +++
Sbjct: 183 ARGYVNGLQGDGDG-------------ISATLKHFAGHGAGE-GGKNRSSVN--VGRREL 226
Query: 254 EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI 313
ET L PFE +K DA SVM +Y+ ++GIP +D LL +RGEW G +V+D S+
Sbjct: 227 RETHLFPFEAVIKTADAESVMNAYHDIDGIPCASDGWLLTDVLRGEWGFDGTVVSDYYSV 286
Query: 314 QVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKS 368
+ + H +A SK+ A ++AGLD+ DC Y + NAV+ G V E ++ +
Sbjct: 287 EFLQSEHG-VAASKQAAGVMAVEAGLDVELPYTDC---YGDHLVNAVEDGDVAEATVNTA 342
Query: 369 LKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA 428
++ + G D V ++ +L AARE + LLKN+ + LP +
Sbjct: 343 VRRVLRAKAEKGLLDDPTVDVDAAAAPFNTENARDLTTRAARESMTLLKNEDDFLPFDGE 402
Query: 429 KVKTVAVVGPHANATVAMIGNYAGIPCRY---------MSPI---------AGFSGYANV 470
+++TVAVVGP A+ ++G+YA P Y +P+ AGF +V
Sbjct: 403 ELETVAVVGPKADNAQELMGDYA-YPAHYPTEEVDLDATTPLDAIEARGEHAGF----DV 457
Query: 471 TYKTGCDDVACKSNN---------------SIFAASEAAKTADATIILAGL-DLSVEAES 514
Y+ GC + + + A A +D A L + E
Sbjct: 458 RYEQGCTTTGSSTEDFDSAAEAAEAADVAVTFVGARSAVDFSDIDEKQADLPSVPTSGEG 517
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
D DL LPG Q +L+ +V E P+++V++S + + A+L+A PGE
Sbjct: 518 CDVVDLDLPGVQQELVERVHETGT-PLVVVVVSGKPHSVEW--IAEEAPALLYAWLPGER 574
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
GG IA+V+FG+ NPGGRLP++ + +P+ + + L
Sbjct: 575 GGEGIAEVLFGEHNPGGRLPVSIPRS-------VGQLPVYYNRKPNTANEEHVYTESTPL 627
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFG+GLSYT F+Y LS L+ S A R
Sbjct: 628 YPFGHGLSYTDFEYGDLS-----------------LSTDSIAPSGRVSA----------- 659
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
+V N G DG +VV +Y+ + A +++++GF+R+F+ AG +KRI F +A
Sbjct: 660 ---EVTVSNTGDRDGHEVVQLYASAKSPSQARPVQELVGFERIFLAAGESKRIIFEIDAS 716
Query: 755 KSLNIVDYAANTLLPAGEHTIFVGN 779
+ L D N + G + + VG
Sbjct: 717 Q-LAFHDRDMNLAVERGPYELRVGR 740
>gi|435848436|ref|YP_007310686.1| beta-glucosidase-like glycosyl hydrolase [Natronococcus occultus
SP4]
gi|433674704|gb|AGB38896.1| beta-glucosidase-like glycosyl hydrolase [Natronococcus occultus
SP4]
Length = 771
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 202/693 (29%), Positives = 327/693 (47%), Gaps = 102/693 (14%)
Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWG 178
P AT+FP +I ++++ L +++ + + E A+ G T+ SP ++VARD RWG
Sbjct: 113 PEATTFPQMIGMASTWDPELLEEVTETIRGELEAL------GTTHALSPVLDVARDLRWG 166
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R+ ET GEDP +V A YV GLQ + R VS+ KH+ + + G
Sbjct: 167 RVEETFGEDPLLVAAMACGYVSGLQG----------DGRADGVSATLKHFVGHGATDG-G 215
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
+R + V +++ E L P+E ++ DA SVM +Y+ ++GIP + LL +RG
Sbjct: 216 KNRSSLN--VGPRELREVHLFPYEAAIRTADAESVMNAYHDIDGIPCASSEWLLTDLLRG 273
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC--GQYYTNFTGNAVQ 356
E+ G +V+D S++ +V H A++K +A L+AGLD++ YY AV+
Sbjct: 274 EFGFDGTVVSDYYSVRHLVTEHG-TANTKPEAATAALEAGLDVELPYTDYYGEHLITAVE 332
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLL 416
G++ E +D+S++ + R G D +DE L AAR + LL
Sbjct: 333 NGELSEKTLDESVRRVLREKARKGLLDDPSVDAEAAADAFRTDEAAALNRRAARRSMTLL 392
Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY---------MSPIAGFSGY 467
KN+ LPL + +VAV+GP A+A ++G+YA Y +P+A
Sbjct: 393 KNENELLPLTA---DSVAVIGPKADAKKELLGDYA-YAAHYPEEEYASDATTPLAALESR 448
Query: 468 --ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE-------------- 511
V+Y+ GC V+ S + A++ A+ AD + G +V+
Sbjct: 449 DGLEVSYEQGC-TVSGPSTDGFEPAAQVAEDADVALAFVGARSAVDFSDGDASKEEKPSV 507
Query: 512 ---AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
E D DL LPG Q +LI+++ E P+ +VI+S G + ++ A+L+A
Sbjct: 508 PTSGEGCDVTDLGLPGVQEELIDRLQETGT-PLAVVIVS--GRPHSIERITADVPAVLYA 564
Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP--LTSMPLRPVDSLGYPGRTY 626
PG+EGG AI DV+FG+ NP GRLP++ LP + +P+ ++Y
Sbjct: 565 WLPGDEGGSAIVDVLFGEHNPSGRLPVS---------LPKSVGQLPVYYNRKANTANKSY 615
Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
+ +G +YPFG+GLSYT+F+Y LS ++ L +
Sbjct: 616 VYTDGEPVYPFGHGLSYTEFEYGTLSLSEKRVSPLETVVAS------------------- 656
Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
V N G G++VV +Y+ A ++++IGF+RV + AG KR
Sbjct: 657 ------------VPVTNEGDRSGAEVVQLYAHAANPSQARPVQELIGFERVPLEAGETKR 704
Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ F + + L D + + G + I VG
Sbjct: 705 VSFELSPTQ-LAFHDESMTLTVEEGPYEIRVGR 736
>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
[Teredinibacter turnerae T7901]
gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
turnerae T7901]
Length = 888
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 168/484 (34%), Positives = 249/484 (51%), Gaps = 50/484 (10%)
Query: 10 CFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
L++A L+F+ + D N PV S+ + D++L RV DLV
Sbjct: 11 ILGLTLASLLFTGCSPDNNPVPKPV--------SERSTANEQPAYMDTTLDIDTRVDDLV 62
Query: 70 SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVI 129
SRM L EK+ Q+ + + + LG+ +Y+WW+EALHGV+ G AT FP I
Sbjct: 63 SRMDLAEKISQMYNESPAIEHLGIAEYDWWNEALHGVARAG----------KATVFPQAI 112
Query: 130 LTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYWSPNINVARDPRWGRIT 181
A ++ I +AVS EARA ++ GLT+WSPNIN+ RDPRWGR
Sbjct: 113 GMAAMWDRETMFDIAEAVSDEARAKHHYFVENGVHFRYTGLTFWSPNINIFRDPRWGRGQ 172
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ET GEDP++ G A+ Y+ GLQ N + LK ++ KH+A V + R
Sbjct: 173 ETYGEDPYLTGELALPYISGLQGE---------NPKYLKTAAMAKHFA---VHSGPEKSR 220
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
+ + + +D+ ET+L FE V EGD SVMC+YNRVN P+C + LL +T+RG+W
Sbjct: 221 HSDNYIASPKDLNETYLPAFEKAVVEGDVESVMCAYNRVNDEPACGNDMLLKETLRGKWG 280
Query: 302 LHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN---AVQQ 357
G++V+DC +I + + A A +++G DL+CG + N A+Q+
Sbjct: 281 FKGHVVSDCGAIADFYAPEAHHVVMAPAAAAAWAVRSGTDLNCGTDRLSTFANLHFALQR 340
Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVL 415
+ + +ID+S+K L +LG FD Q Y + + S ++ L +AA + VL
Sbjct: 341 EMITQDEIDQSVKRLMKTRFKLGMFDPDDQVPYSKIPMDVVGSQAHLALTQKAAEKSFVL 400
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTY 472
LKN LPL K VA++GP+A ++GNY G P + ++P+ G Y NV Y
Sbjct: 401 LKN-SGILPLK--KSSKVAIIGPNATNPTVLVGNYFGDPIKPVTPLDGIQQYLGEENVFY 457
Query: 473 KTGC 476
G
Sbjct: 458 APGS 461
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 128/281 (45%), Gaps = 48/281 (17%)
Query: 503 LAGLDLSVEAESLD---REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
L G ++SVE E D R D+ LP Q +L+ + ++ K P++LV S G IA N
Sbjct: 634 LEGEEMSVEIEGFDHGDRTDIRLPEPQRKLLATLKKLNK-PIVLVNFS--GSAIALNWAN 690
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
N+ AIL YPGE G A+A +++G+ +P GRLPIT+Y L +P
Sbjct: 691 NNVDAILQGFYPGEATGTALARILWGEVSPSGRLPITFYRS-------LDDLP--GFKDY 741
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
RTYK+Y G LYPFGYGLSYTQF Y+ LS T + L T+ S
Sbjct: 742 AMTNRTYKYYQGDVLYPFGYGLSYTQFAYSELSAPAT-------MASGEPLAITAQVS-- 792
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
N G +VV VY + +++ F+R+++
Sbjct: 793 -----------------------NSGKVASDEVVQVYVSMKVPGLSLPQRELKEFKRIYL 829
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
G ++ ++F A K L+ VD G T+ VG G
Sbjct: 830 EPGASQTVEFSI-AGKDLSYVDDQGVRHPYHGPLTLSVGGG 869
>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 1286
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 153/425 (36%), Positives = 232/425 (54%), Gaps = 29/425 (6%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ ++S + R DL+SR+TL+EK LG+ +PRLG+ WSEALHG+ G
Sbjct: 32 IYLNTSYSFEERAADLISRLTLEEKESLLGNSMAAIPRLGIKSMNVWSEALHGILG---G 88
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
+ I G TSFP + ++++ +L ++ A++ EARA+ G GLTYWSP +
Sbjct: 89 ANQSVGISGPTSFPNSVALGSAWDPALMQREAMAIADEARAINQTGTKGLTYWSPVVEPI 148
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR E+ GEDPF+ A +VRG+ + T L S P C KHY A
Sbjct: 149 RDPRWGRTGESYGEDPFLAAEIAGGFVRGMV----GNDPTYLKSVP-----CAKHYFA-- 197
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
N DR+ + + +DM E +L P++ +++ + S+M SYN VNG+P+ A L
Sbjct: 198 --NNSEFDRHVSSSNMDSRDMREFYLAPYKKLIEQDNLPSIMSSYNAVNGVPTSASQLYL 255
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
+ R + L GYI DC +I+ + H ++ + E+A A+ LKAG+D DCG Y +
Sbjct: 256 DTIARRTYGLKGYITGDCAAIEDIYTGHYYV-KTAEEATAKGLKAGVDSDCGSIYQRYAI 314
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAR 410
A+++G + DID++L ++ V MR G FD + Y + S N LA E A
Sbjct: 315 AALKKGLITMADIDRALLNIFIVRMRTGEFDPPAKVLYAQFQPNIVNSPANKALAKEIAT 374
Query: 411 EGIVLLKN------DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR--YMSPIA 462
+ VLLKN ++ LPLN A +K +A++GPHA+ +G Y+G P + ++P A
Sbjct: 375 KTPVLLKNNISLKTNRKALPLNPADLKKIALIGPHADK--VELGPYSGRPAQENMITPFA 432
Query: 463 GFSGY 467
G Y
Sbjct: 433 GIKKY 437
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 89/265 (33%), Positives = 125/265 (47%), Gaps = 40/265 (15%)
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
G D E DR L LPG Q +LI VA V I+V+ + G V++ + NI
Sbjct: 619 GTDEKTATEEADRLTLLLPGNQVELIKAVAAVNPN-TIVVMQTLGCVEVEEFKNLQNIPG 677
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPG 623
I+W GY G+ G AIA V+FG+ NPGG+L TWY V+ LP +T LR + G G
Sbjct: 678 IIWVGYNGQAQGDAIASVLFGEVNPGGKLNGTWYKS--VKDLPEITDYTLRGGN--GKNG 733
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
RT+ +++ Y FG+G+SYT F+Y+ +K
Sbjct: 734 RTFWYFDKDVSYEFGFGMSYTTFEYSNFRISK---------------------------- 765
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY--IKQVIGFQRVFVRA 741
N + D VD +N G +G +V+ VY K P A+ IK++ GF+RV + A
Sbjct: 766 ---NSIIPHDKITVSVDVKNTGKVEGDEVIQVYMKTPDSPASLQRPIKRLKGFKRVTLPA 822
Query: 742 GRNKRIKFVFNACKSLNIVDYAANT 766
G+ K + N C L D NT
Sbjct: 823 GQTKTVNIDIN-CADLWFWDMDKNT 846
>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 871
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 152/419 (36%), Positives = 225/419 (53%), Gaps = 34/419 (8%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q S + + + L ++ RV DLV RMTL+EKV Q+ + + +PRL +P Y+WW+E LHGV+
Sbjct: 22 QTSDYPYQNYHLDFTTRVNDLVKRMTLEEKVSQMLNSSPAIPRLKIPAYDWWNEVLHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
T F T +P I A+F+ ++ + E RA++N
Sbjct: 82 R----TPFK-----VTVYPQAIAMAATFDRQSLNQMADYAALEGRAVHNKALQMRKPGEK 132
Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
GLTYW+PNIN+ RDPRWGR ET GEDPF+ G +V GLQ + +
Sbjct: 133 YLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGAMGSAFVSGLQGND---------PKY 183
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
LK ++C KHYA V + R+ F+A ++ D+ +T+L F+ V + + VMC+YN
Sbjct: 184 LKAAACAKHYA---VHSGPEPLRHVFNADISTYDLWDTYLPAFKKLVVDDKVAGVMCAYN 240
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
P C L+ +R +W GY+ +DC I NHK A + EDA + G
Sbjct: 241 AFKTQPCCGSDLLMVDILRNQWKFSGYVTSDCGGIDDFFKNHKTHA-TAEDASTDAVLHG 299
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDI 396
D++CG AV++GK+ ET ID S+K L+ + RLG FD S +Y +
Sbjct: 300 TDIECGTDAYKSLVAAVKEGKISETQIDISVKRLFMIRFRLGMFDPSDVVKYAQTPVSVL 359
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
S E+ A + AR+ +VLLKN +TLPL S ++ + V+GP+A+ +A++GNY G P
Sbjct: 360 ESPEHQAHALKMARQSVVLLKNANHTLPL-SKTIRKIVVLGPNADNPIAILGNYNGTPS 417
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 96/335 (28%), Positives = 142/335 (42%), Gaps = 70/335 (20%)
Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL---------- 515
G AN+ + G + A + ADA + + G+ +E E +
Sbjct: 577 GKANIRFSAGN-----YKKTDVAALVKRVADADAIVYVGGISPQLEGEEMQVNYPGFNGG 631
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
DR + LP QT L+ + K PV+ V+M+ G +A NI AI+ A Y G+
Sbjct: 632 DRTSIQLPAAQTNLMKTLQATGK-PVVFVMMT--GSALATPWEAENIPAIVNAWYGGQAA 688
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
G A+ADV+FG +NP GRLP+T+Y D T +P RTY+++ G LY
Sbjct: 689 GTAVADVLFGDYNPAGRLPVTFYKSD-------TDLP--DFTDYSMTNRTYRYFKGIPLY 739
Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
FGYGLSYTQFKY+ L T+ +
Sbjct: 740 GFGYGLSYTQFKYDKLIVPATV--------------------------------KSGKAI 767
Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN--- 752
V N G G +VV +Y K ++ +K + GF RV+++AG + + F+ +
Sbjct: 768 HLSVTVTNSGQIAGDEVVQIYMKHHSQRIKVPLKALKGFARVYLKAGERRTLNFILSPDD 827
Query: 753 -ACKSLN--IVDYAANTLLPAG-----EHTIFVGN 779
A S N +V + AG EH + GN
Sbjct: 828 LAVTSSNGGLVPIKGKITISAGGSQPDEHNVTSGN 862
>gi|237718444|ref|ZP_04548925.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
gi|229452377|gb|EEO58168.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
Length = 746
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 220/777 (28%), Positives = 362/777 (46%), Gaps = 124/777 (15%)
Query: 56 DSSLPYSIR----VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG- 110
+S LP++ VKDL+SRMT++EK+ QL + G L P+ E+ S++L VG
Sbjct: 25 NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 83
Query: 111 -------------------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIG 144
P DVI G T FPT + + S++ + ++
Sbjct: 84 VLNISGAKTLRDLQEKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 143
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ + E+ A AGL + ++P +++ARD RWGR+ E GED ++ A V G Q
Sbjct: 144 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 197
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
N + NS V +C KH+ AY + R + ++E+ + +T+L PF+
Sbjct: 198 -----WNLWENNS----VLACAKHWVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKA 245
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
C+ G + M ++N +NGIP+ A P LL +RG+W+ +G++V+D ++++ +V + +
Sbjct: 246 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLV--AQGV 302
Query: 324 ADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
A+ +DA +G+D+D Y + ++ GK+ D+D S+ + + LG F
Sbjct: 303 AEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 362
Query: 383 DGSPQYVS--LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
++ + Q I E ++ A + A + VLLKND +TLPL + V+++AVVGP A
Sbjct: 363 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 421
Query: 441 NATVAMIGNY-AGIPCRYMSPIAGFSGYAN--------VTYKTGCDDVACKSNNSIFAAS 491
+ ++G++ A R+++ + G N V Y GC D + + A
Sbjct: 422 DNQTELLGSWRARGEDRHVTTV--LQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAV 478
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
+ A +D I + G + ES R L LPG Q +LI ++ K PV++V+M+ +
Sbjct: 479 KLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPL 537
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT----------WYNGD 601
I + + N+ AIL + G G AIAD++FG +NP GRL I+ +YN
Sbjct: 538 SIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYN-- 593
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
Y + MP R N P LYPFGYGLSYT F Y++ T+
Sbjct: 594 YKKSGRPGDMPHSSTT------RHIDVPNAP-LYPFGYGLSYTTFSYSVPQSTQK----- 641
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
YT + V N G DG + V +Y
Sbjct: 642 ---------EYTRQET-----------------ISVSVTVTNTGDRDGEETVQLYVNDKV 675
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+K++ F+++F++AG +K ++F + +L D A N ++ GE I G
Sbjct: 676 ASVVRPVKELKAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 731
>gi|423223721|ref|ZP_17210190.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638096|gb|EIY31949.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 954
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 226/760 (29%), Positives = 356/760 (46%), Gaps = 111/760 (14%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHG 105
+ +S + D +LP RV+ L+S MT ++K++ + G G+P L +P EA+HG
Sbjct: 164 EKTSLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHG 222
Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
S GAT FP + A++N+ L + + AV E L + W
Sbjct: 223 FSYGS----------GATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAAGTMQAW 267
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
SP ++VA+D RWGR ET GEDP +V + +++G Q + L + P
Sbjct: 268 SPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-------SKGLFTTP------- 313
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+ + G D + D ++E++M E L PF ++ D SVM +Y+ G+P
Sbjct: 314 KHFGGHGAP-LGGRDSH--DIGLSEREMREVHLVPFRHVIRNYDCQSVMMAYSDYLGVPV 370
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
+LL+ +R EW G+IV+DC +I + + A K +A Q L AG+ +CG
Sbjct: 371 AKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGD 430
Query: 346 YYTNF-TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDE 400
Y + A + G++ ++D+ + + ++ R F+ +P L I SD
Sbjct: 431 TYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPNK-PLDWNKIYPGWNSDS 489
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYM 458
+ E+A +AARE IV+L+N N LPL + ++T+AVVGP A+ G+Y +P +
Sbjct: 490 HKEMARQAARESIVMLENKDNILPL-AKDMRTIAVVGPGADDLQP--GDYTPKLLPGQLK 546
Query: 459 SPIAGFS----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-- 512
S + G V Y+ GC D + I A +AA +D +++ G + E+
Sbjct: 547 SVLTGIKQAVGKQTKVVYEQGC-DFTSSNGTDIPKAVKAASQSDVVVLVLGDCSTSESTT 605
Query: 513 -------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
E+ D L LPG Q +L+ V K PVIL++ + G ++ + KAI
Sbjct: 606 DVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQA--GRPYNLSKASELCKAI 662
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
L PG+EGG A ADV+FG +NP GRLP+T+ +V LPL + GR
Sbjct: 663 LVNWLPGQEGGPATADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRR 713
Query: 626 YKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
Y++ + LY FGYGLSYT F+Y+ L K+Q N N A+
Sbjct: 714 YEYSDMEFYPLYYFGYGLSYTSFEYSGL-----------KIQEKDNGNVAIQAT------ 756
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
+NVG G +VV +Y T I ++ F RV ++
Sbjct: 757 -----------------VKNVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVHLQPDE 799
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
+K + F + L++++ + ++ GE I V GGVS
Sbjct: 800 SKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILV--GGVS 836
>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
Length = 873
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 160/424 (37%), Positives = 235/424 (55%), Gaps = 36/424 (8%)
Query: 51 SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
F F + L R+ DLVSRMTL+EK+ QL A + RL +P+Y WW+E+LHGV+ G
Sbjct: 23 QFPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYNWWNESLHGVARAG 82
Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGR------AGL 162
AT FP I AS++ L +++ A+S EARA ++ L R GL
Sbjct: 83 ----------YATVFPQSISIAASWDAQLVREVATAISDEARAKHHEYLRRDQHDIYQGL 132
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T WSPNIN+ RDPRWGR ET GEDPF+ G YV+GLQ + LKV
Sbjct: 133 TMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQGD---------DPEYLKVV 183
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+ KH+A V + R++FDA +E+D+ ET+L F M VK+ SVM +YNR G
Sbjct: 184 ATAKHFA---VHSGPEESRHYFDANTSERDLWETYLPAFRMLVKDAQVQSVMTAYNRFRG 240
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+ ++ KLL +R +W GY+V+DC +I + ++HK + A A L+ G DL+
Sbjct: 241 EAASSN-KLLFDILRNKWGFDGYVVSDCGAINDIWEDHK-ITADAASASALALETGTDLN 298
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDE 400
CG Y + A+ G + E I+ +++ L+ ++LG FD Y ++ +
Sbjct: 299 CGATYKSLK-EAIANGLITEEKINIAIERLFRARLKLGMFDTEENLSYATIPFSVNTNAS 357
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ LA +AA+E IVLLKN+ + LPL S +K +AV+GP+A+ ++ GNY G P ++
Sbjct: 358 HTALARKAAQESIVLLKNEAHMLPL-SKDLKQIAVIGPNAHNVQSLWGNYNGTPKNPVTV 416
Query: 461 IAGF 464
+ G
Sbjct: 417 VQGI 420
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 101/313 (32%), Positives = 155/313 (49%), Gaps = 59/313 (18%)
Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
+ N + A A+ +D TI++ GL+ +E E + DR L LP Q +L
Sbjct: 582 STPEKNKLERAVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQREL 641
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ + K P++LV+++ + I +A+ + + AIL AGYPG+EGG AIADV+FG +NP
Sbjct: 642 MRALVATGK-PIVLVLLNGSALAINYAQEH--VPAILSAGYPGQEGGNAIADVLFGDYNP 698
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
GRLP+T+Y V LP + GRTY+++ G LYPFGYGLSYTQF Y
Sbjct: 699 AGRLPVTYYKS--VDDLP-------DFEDYSMKGRTYRYFEGEALYPFGYGLSYTQFSY- 748
Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
DA KT L D +V N G DG
Sbjct: 749 -------------------------DAIKTS------GRLAADKVLNVQVTVTNSGDRDG 777
Query: 710 SDVVIVYSKPPAEIAATYIKQV--IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
+VV +Y K E+A+T QV +GF+R+ ++ G + ++F +A + ++++ +
Sbjct: 778 DEVVQLYLKD--EVASTTRPQVQLVGFKRIHLQKGETQTVEFRLDA-RQFSMINDQEQLV 834
Query: 768 LPAGEHTIFVGNG 780
+ G T++ G G
Sbjct: 835 VEPGWFTLYAGGG 847
>gi|395492941|ref|ZP_10424520.1| glycoside hydrolase family protein [Sphingomonas sp. PAMC 26617]
Length = 865
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 168/443 (37%), Positives = 240/443 (54%), Gaps = 49/443 (11%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
L+ D P RV DL+ RMTL+EK Q+ + A +PRLG+P Y++W+EALHGV+ G
Sbjct: 13 LYFDPGQPIEARVDDLMRRMTLEEKAAQMQNVAPAIPRLGIPPYDYWNEALHGVARAGE- 71
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTY 164
AT FP I A+++ + GQ V+TE RA YN +A GLT+
Sbjct: 72 ---------ATVFPQAIGMAATWDRDMMLAEGQTVATEGRAKYNQAQAQKNYDRYYGLTF 122
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
WSPNIN+ RDPRWGR ET GEDP++ G AV +V G+Q TD N LK +
Sbjct: 123 WSPNINIFRDPRWGRGQETLGEDPYLTGTMAVPFVHGVQ-------GTDANY--LKAIAT 173
Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A V + R+ F+ + +D+ ET+L F + +G A S+MC+YN V+
Sbjct: 174 PKHFA---VHSGPEQLRHQFNVDPSPRDLSETYLPAFRRAIVDGRAESLMCAYNAVDTKA 230
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
+CA+ LL T+RG W G++ +DC +I + H + E A A +KAG D C
Sbjct: 231 ACANTMLLKDTLRGAWGFKGFVTSDCGAIDDITTGHHNSPTNPEGA-ALAVKAGTDTGC- 288
Query: 345 QYYTNFTGN------AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDI 396
+F AV+ G + E D+D +L+ L+T M+LG FD + + + ++ +
Sbjct: 289 ----DFKDEMLDLPRAVKAGYLTEGDMDVALRRLFTARMKLGMFDPAARVPFSTISIAEN 344
Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
S + LA AARE IVLLKND LPL +A + +AVVGP A + +A+ GNY G P
Sbjct: 345 HSPAHRALALRAARESIVLLKND-GVLPL-AAGARRIAVVGPTAASLIALEGNYNGTPVG 402
Query: 457 YMSPIAGFS---GYANVTYKTGC 476
+ P+ G + G + Y G
Sbjct: 403 AVLPVDGMTAAFGADRIVYAQGS 425
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 127/286 (44%), Gaps = 55/286 (19%)
Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
GL+ +E E + DR + LP Q+QL++ + K P+++V+ S G IA
Sbjct: 602 GLNAWLEGEEMPLQVPGFAGGDRTAIALPAAQSQLLDALFATGK-PLVIVLQS--GSAIA 658
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+A+L A YPGE GG+AIA+V+ G NP GRLP+T+Y LP
Sbjct: 659 LGAQEAKARAVLEAWYPGEAGGQAIAEVLSGTVNPSGRLPVTFYAS--TDQLPA------ 710
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
D RTY+++ G YPFG+GLSYT+F Y+ L +
Sbjct: 711 -FDDYRMANRTYRYFAGRVEYPFGHGLSYTRFAYSALR--------------------PA 749
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+S G V+ V +N G G +V +Y P A I+ + G+
Sbjct: 750 TSSVAAGQGTSVS-----------VAVRNTGVLAGDEVAQLYLSVPGREGAP-IRSLKGY 797
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
QRV + AG K + F + L + + A + + I+VG G
Sbjct: 798 QRVHLAAGETKTLTFALEP-RDLALANAAGAMAVTKATYQIWVGGG 842
>gi|299146513|ref|ZP_07039581.1| beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298517004|gb|EFI40885.1| beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 736
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 214/726 (29%), Positives = 341/726 (46%), Gaps = 124/726 (17%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G T FPT I A+++ L K++GQ ++
Sbjct: 83 RLGIPMF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPELVKEVGQVIAK 130
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R+ + G + P +++ RDPRW R+ ET GEDP + G + V GL
Sbjct: 131 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL------- 178
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+L+ + +++ KH+ AY V Y A V +D+ + FL PF + G
Sbjct: 179 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG- 233
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++GIP ++ LL + +R EW G++V+D SI+ + ++H F+A +KE+
Sbjct: 234 ALSVMTSYNSIDGIPCTSNHYLLTKLLRNEWKFRGFVVSDLYSIEGIHESH-FVAPTKEN 292
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A Q++ AG+D+D G YTN +AVQ G++ +T ID ++ + + +G F+
Sbjct: 293 AAIQSVMAGVDVDLGGDAYTNLC-HAVQSGQMDKTVIDTAVCRVLRMKFEMGLFEHPYVD 351
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+ + + E+IELA + A+ I LLKN+ + LPL S + VAV+GP+A+ M+G
Sbjct: 352 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKMINKVAVIGPNADNRYNMLG 410
Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
+Y GI + +SP + V Y GC + + N I A EAA+
Sbjct: 411 DYTAPQEDSNVKTVLDGIITK-LSP-------SRVEYVRGCA-IRDTTVNEIEQAIEAAR 461
Query: 496 TAD----------------------ATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQ 532
++ A + G +E E DR L L G Q +L+
Sbjct: 462 RSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLES 521
Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
+ + K P+I+V + ++ +A + A+L A YPG+EGG AIADV+FG +NP GR
Sbjct: 522 LQKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGR 578
Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
LPI+ V +P+ P + Y + LY FGYG+SYT F+Y+ L
Sbjct: 579 LPISVPRS--VGQIPVYYNQKAPRN------HDYVEVSSSPLYSFGYGMSYTTFEYSDLQ 630
Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
V+ RC FE +N G DG +V
Sbjct: 631 -------------------------------VVQKSARC---FEVSFKVKNTGKYDGEEV 656
Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
+Y + +KQ+ F+R ++ G K++ FV + +V+Y ++ +G
Sbjct: 657 SQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGN 715
Query: 773 HTIFVG 778
+ +G
Sbjct: 716 FHLMIG 721
>gi|423289665|ref|ZP_17268515.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
CL02T12C04]
gi|423298158|ref|ZP_17276217.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
CL03T12C18]
gi|392663699|gb|EIY57246.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
CL03T12C18]
gi|392667376|gb|EIY60886.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
CL02T12C04]
Length = 955
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 223/748 (29%), Positives = 353/748 (47%), Gaps = 107/748 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ D+SLP RV+ L++ MT ++K++ + G G+P L +P EA+HG S
Sbjct: 171 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 228
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GAT FP + A++N L +++ + E + N +A WSP ++V
Sbjct: 229 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDET-VVANTKQA----WSPVLDV 274
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
A+D RWGR ET GEDP +V + +++G Q SR L + KH+ +
Sbjct: 275 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 320
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++E++M E L PF V+ D S+M +Y+ GIP +L
Sbjct: 321 GAP-LGGRDSH--DIGLSEREMREVHLVPFRHVVRNYDCQSLMMAYSDYMGIPVAGSTEL 377
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
L Q +R EW +G+IV+DC +I + + A K +A Q L AG+ +CG Y +
Sbjct: 378 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKE 437
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGK--QDICSDENIELAAE 407
A + G++ ++D + + + R F+ +P + + K SD + E+A +
Sbjct: 438 VIQAAKDGRINMVNLDNVCRTMLATMFRNELFEKNPCKPLDWNKIYPGWNSDRHREMARQ 497
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI--PCRYMSPIAGFS 465
AARE IV+L+N N LPL S +KT+AV+GP A+ G+Y P + S ++G
Sbjct: 498 AARESIVMLENKDNLLPL-SKTLKTIAVLGPGADDLQP--GDYTPKLQPGQLKSVLSGIK 554
Query: 466 G----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA--------- 512
V Y+ GCD + N I A +AA +D +++ G + EA
Sbjct: 555 AAVGKQTKVLYEQGCDFTTPDATN-IPKAVKAASQSDVVVMVLGDCSTSEATNNVRKTCG 613
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E+ D L LPG Q +L+ V K PV+L++ + D+ A + KAIL PG
Sbjct: 614 ENNDWATLILPGKQQELLEAVCATGK-PVVLILQAGRPYDLLKA--SEMCKAILVNWLPG 670
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
+EGG A ADV+FG +NPGGRLP+T+ +V LPL + GR Y++ +
Sbjct: 671 QEGGPATADVLFGDYNPGGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDME 721
Query: 633 --TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
LY FGYGLSYT F+Y+ L K+Q N N A+
Sbjct: 722 FYPLYRFGYGLSYTSFEYSDL-----------KIQEKSNGNVMVQAT------------- 757
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+NVG G +V +Y T + ++ F R+ ++ G +K + F
Sbjct: 758 ----------VKNVGGCAGDEVAQLYITDMYASVKTRVMELKDFTRIHLQPGESKNVSFE 807
Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
+++++ + ++ GE + VG
Sbjct: 808 LTPY-DISLLNDRMDRVVEKGEFKVMVG 834
>gi|260642727|ref|ZP_05417108.2| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260620819|gb|EEX43690.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 768
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 211/731 (28%), Positives = 345/731 (47%), Gaps = 104/731 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVP----------RLGLPQYEWWSEALHGVSNVG--- 110
+V+ L+ +MTL+EK+ Q+ + P +G E ++ + +
Sbjct: 53 KVEALLDKMTLEEKLGQMNQLSPWDPNELANKVRNGEIGSILNYMNPEEVNKIQKIAMEE 112
Query: 111 -----PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
P DVI G T FP + A+FN + + + + EA A G+ +
Sbjct: 113 SRLGIPLLVSRDVIHGYKTIFPIPLGQAATFNPQIVENGARVAAIEASA------DGIRW 166
Query: 165 -WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
++P I+++RDPRWGRI E+ GEDP++ V ++G Q LNS P +++
Sbjct: 167 TFAPMIDISRDPRWGRIAESCGEDPYLTSVMGVAMIKGFQ-------GDSLNS-PTSMAA 218
Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
C KH+ AY +G Y+ + E+ + +L PF+ V G ++ M S+N +G+
Sbjct: 219 CAKHFVAYGAS--EGGKDYN-STFIPERVLRNVYLPPFKAAVDAG-CATFMTSFNDNDGV 274
Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD- 342
PS A+ +L +R EW G +V D S M+ NH F AD KE A +++ AG+D+D
Sbjct: 275 PSTANKFVLKDILRDEWKYDGMVVTDWASAAEMI-NHGFCADGKE-AAEKSVNAGVDMDM 332
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
+ + ++ + KV ID +++ + + R+G F+ Y+ + ++E++
Sbjct: 333 VSETFIKNLKQSLAENKVSIESIDDAVRNILRLKYRMGLFENP--YIVTPQNVKYAEEHL 390
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSP 460
++A EA + ++LLKND TLPL + K++TVAVVGP A+A +G + G +P
Sbjct: 391 KIAKEAVEQSVILLKNDTQTLPLTN-KIRTVAVVGPMADAPYEQMGTWVFDGEKDHTQTP 449
Query: 461 IAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLD 516
+ + NV ++ K+ N I A AA+ AD + G + + E+
Sbjct: 450 LKAIREMYGDQVNVIFEPALGYSRDKNLNGIAKAVNAARHADVVLAFVGEEAILSGEAHS 509
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
+L L G Q+QLI ++ K P++ ++M+ G + A A+L+A +PG GG
Sbjct: 510 LANLNLQGAQSQLIQALSTTGK-PLVTIVMA--GRQLTIASEVEASDAVLYAFHPGTMGG 566
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSL-----GYPGRT 625
AIAD++FGK NP + P+T+ +P+ T P P + L G+T
Sbjct: 567 PAIADILFGKVNPSAKTPVTFPR--MTGQVPIYYAHNSTGRPANPKEMLIDEIPVEAGQT 624
Query: 626 ----YKFY---NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
FY LYPFGYGLSYT F+Y+ NL TSD
Sbjct: 625 SVGCRSFYLDAGASPLYPFGYGLSYTTFEYS-------------------NLKLTSD--- 662
Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
L + VD +N G DG++VV +Y + +K++ FQRV
Sbjct: 663 ---------KLAINGEISVTVDLKNTGKYDGTEVVQLYIQDKVGSVTRPVKELKAFQRVE 713
Query: 739 VRAGRNKRIKF 749
++AG +K + F
Sbjct: 714 LKAGESKNVSF 724
>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
4_3_47FAA]
gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
Length = 864
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 168/454 (37%), Positives = 233/454 (51%), Gaps = 42/454 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ DSSL R +DL+ ++TL+EKV + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASF I AVS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ VN V+GLQ D N + K+ +C
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CMDANQKYDKIHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKE VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R +W G +++DC +I HK D+ E A A + +G DL+
Sbjct: 244 CCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSD 399
CG Y +A ++G + E DID S+K L LG D P V K +CS
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSA 360
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ L+ + AR+ + LL N N LPL +T+AV+GP+AN +V GNY G P ++
Sbjct: 361 EHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTIT 419
Query: 460 PIAGFSGYA----NVTYKTGCDDVACKSNNSIFA 489
+ G + Y+ GC V S+F+
Sbjct: 420 LLEGIRSAMGENDKLIYEQGCSWVERSLIRSVFS 453
Score = 129 bits (324), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 145/326 (44%), Gaps = 62/326 (19%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
FSG A + + D+ K +I K AD I G+ S+E E +
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFR 628
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR D+ LP Q +LI + + K VI V S G IA +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A A+V+FG +NP GRLP+T+Y +T +P + GRTY+++ G
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN-------ITQLP--DFEDYNMTGRTYRYFKGDP 736
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFGYGLSYT F Y + +TI+V + +K P
Sbjct: 737 LFPFGYGLSYTTFNYGNIKLEQTIKV--------------GETAKIIVP----------- 771
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
N G+ DG +VV VY K E A +K + F+RV + AG+ ++
Sbjct: 772 -------VTNTGNRDGEEVVQVYLK-KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP 823
Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
K L D NT+ AG I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
vulgatus PC510]
Length = 864
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 168/454 (37%), Positives = 233/454 (51%), Gaps = 42/454 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ DSSL R +DL+ ++TL+EKV + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASF I AVS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ VN V+GLQ D N + K+ +C
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CMDANQKYDKIHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKE VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R +W G +++DC +I HK D+ E A A + +G DL+
Sbjct: 244 CCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSD 399
CG Y +A ++G + E DID S+K L LG D P V K +CS
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSA 360
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ L+ + AR+ + LL N N LPL +T+AV+GP+AN +V GNY G P ++
Sbjct: 361 EHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTIT 419
Query: 460 PIAGFSGYA----NVTYKTGCDDVACKSNNSIFA 489
+ G + Y+ GC V S+F+
Sbjct: 420 LLEGIRSAMGENDKLIYEQGCSWVERSLIRSVFS 453
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 143/326 (43%), Gaps = 62/326 (19%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
FSG A + + D+ K +I K AD I G+ S+E E +
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR D+ LP Q +LI + + K VI V S G IA +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A A+V+FG +NP GRLP+T+Y + L + GRTY+++ G
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYR---------NTAQLPDFEDYNMTGRTYRYFKGDP 736
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFGYGLSYT F Y + +TI+V + +K P
Sbjct: 737 LFPFGYGLSYTTFNYGNIKLEQTIKV--------------GETAKIIVP----------- 771
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
N G+ DG +VV VY K E A +K + F+RV + AG+ ++
Sbjct: 772 -------VTNTGNRDGEEVVQVYLK-KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP 823
Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
K L D NT+ AG I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848
>gi|293371439|ref|ZP_06617870.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292633636|gb|EFF52194.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 1049
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 219/769 (28%), Positives = 362/769 (47%), Gaps = 108/769 (14%)
Query: 56 DSSLPYSIR----VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG- 110
+S LP++ VKDL+SRMT++EK+ QL + G L P+ E+ S++L VG
Sbjct: 328 NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386
Query: 111 -------------------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIG 144
P DVI G T FPT + + S++ + ++
Sbjct: 387 VLNISGAKTLRDLQEKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ + E+ A AGL + ++P +++ARD RWGR+ E GED ++ A V G Q
Sbjct: 447 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
N + NS V +C KH+ AY + R + ++E+ + +T+L PF+
Sbjct: 501 -----WNLWENNS----VLACAKHWVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKA 548
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
C+ G + M ++N +NGIP+ A P LL +RG+W+ +G++V+D ++++ +V + +
Sbjct: 549 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLV--AQGV 605
Query: 324 ADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
A+ +DA +G+D+D Y + ++ GK+ D+D S+ + + LG F
Sbjct: 606 AEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665
Query: 383 DGSPQYVS--LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
++ + Q I E ++ A + A + VLLKND +TLPL + V+++AVVGP A
Sbjct: 666 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724
Query: 441 NATVAMIGNY-AGIPCRYMSPIAGFSGYAN--------VTYKTGCDDVACKSNNSIFAAS 491
+ ++G++ A R+++ + G N V Y GC D + + A
Sbjct: 725 DNQTELLGSWRARGEDRHVTTV--LQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAV 781
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
+ A +D I + G + ES R L LPG Q +LI ++ K PV++V+M+ +
Sbjct: 782 KLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPL 840
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD-YVQMLPLTS 610
I + + N+ AIL + G G AIAD++FG +NP GRL I++ + V +
Sbjct: 841 SIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYK 898
Query: 611 MPLRPVD-SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
RP D R N P LYPFGYGLSYT F Y++ T+
Sbjct: 899 KSGRPGDMPHSSTTRHIDVPNAP-LYPFGYGLSYTTFSYSVPQSTQK------------- 944
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
YT + V N G DG + V +Y +K
Sbjct: 945 -EYTRQET-----------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVK 986
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++ F+++F++AG +K ++F + +L D A N ++ GE I G
Sbjct: 987 ELKAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034
>gi|375143423|ref|YP_005005864.1| Beta-glucosidase [Niastella koreensis GR20-10]
gi|361057469|gb|AEV96460.1| Beta-glucosidase [Niastella koreensis GR20-10]
Length = 793
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 231/801 (28%), Positives = 350/801 (43%), Gaps = 139/801 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA--HGVPRLGLPQYEW----WS------ 100
++ D + R DL+S+MTLDEK Q+ H V + LP W W
Sbjct: 42 IYEDPKQSVNARTADLLSKMTLDEKTCQMATLYGWHRVLKDSLPTDSWKNAIWKDGIANI 101
Query: 101 -EALHGVSNVGPGTHFDDV------------------------IPG-------------- 121
E L+G + G D V IP
Sbjct: 102 DEHLNGFAGWGKTAPIDLVKDMEKHVWAMNETQRFFIEQTRLGIPADFTNEGIRGVEAYE 161
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRI 180
AT FPT + ++N+ L + G EARA+ G T ++P ++VARD RWGR+
Sbjct: 162 ATGFPTELNMGMTWNKELVHQEGIITGREARAL------GYTNVYAPIMDVARDQRWGRL 215
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E+ GEDP++V + +G+Q +G KV+S KH+A Y +
Sbjct: 216 EESYGEDPYLVASMGIALAKGIQQ-DG------------KVASTAKHFAVYSANKGAREG 262
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
+ D +V +++E L PF+ +KE VM SYN +GIP L Q +R E
Sbjct: 263 QARTDPQVAPREVENLLLYPFKKVIKEAGIMGVMSSYNDYDGIPVSGSNYWLIQRLRVEM 322
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL----DCGQYYTNFTGNAVQ 356
GY+V+D D+++ + H A+ KE AV Q AG+++ + V+
Sbjct: 323 GFTGYVVSDSDALEYLATKHHVAANLKE-AVFQAFMAGMNVRTTFKAPDSIIIYLRQLVK 381
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG--KQDICSDENIELAAEAAREGIV 414
+G++ I+ + + V RLG FD P S ++ + SD + ++A +A+RE +V
Sbjct: 382 EGRIPMDTINHRVADVLRVKFRLGLFD-HPYVESAAETRKVVNSDASQQIALQASRESVV 440
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVT 471
LLKN+ N LPL + + +AVVGP+A +Y + ++ + G G V
Sbjct: 441 LLKNNNNILPLVKS-LDKIAVVGPNATDDDYAHTHYGPLGSPSVNVLQGIQAKLGAGKVL 499
Query: 472 YKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAESLDR 517
Y G D V S + +A K A I++ G + ES R
Sbjct: 500 YAKGVDLVDKNWPESEILPEPMDAGEQAMLDSAVNITKQAQMAIVVLGGNTRTAGESKSR 559
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
DL LPG+Q +L+ + K PV++V++ + I + + I I++AGYPG +GG
Sbjct: 560 TDLDLPGHQLELVKAIKATGK-PVVVVLLGTQPMTINW--IDKYIDGIVYAGYPGVKGGI 616
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
A+ADV+FG +NPGG+L +TW V +PL + P +P + G K LYPF
Sbjct: 617 AVADVLFGDYNPGGKLTLTWPKS--VGQIPL-NFPSKP-GAQSDEGEHAKIKG--LLYPF 670
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
G+GLSYT F Y L + KT V V
Sbjct: 671 GFGLSYTSFGYTNLKIS---------------------TGKTAADPVAVT---------- 699
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
VD N G G +VV Y + TY K + GF+RV ++AG K I F + L
Sbjct: 700 -VDVTNTGKLAGDEVVQCYIRDVLSSVTTYEKLLKGFERVHLQAGETKTISFTI-PREEL 757
Query: 758 NIVDYAANTLLPAGEHTIFVG 778
+ + +L GE ++ +G
Sbjct: 758 KLYNREMKFVLEPGEFSVMIG 778
>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
Length = 819
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 234/740 (31%), Positives = 340/740 (45%), Gaps = 119/740 (16%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SLP + R+ L + L++ + QL + A V + LP Y W ++ HGV GT
Sbjct: 71 YLDASLPEADRLAWLADNVPLEDMIGQLVNAAPAVDAVDLPAYNWLNDNEHGVK----GT 126
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-----LGRA-------- 160
AT +P AS++ L ++G A+ E+RA +N G A
Sbjct: 127 AH------ATVYPMGASLGASWSVDLAWRVGAAIGNESRATHNGLADKSGNACGSTSTGE 180
Query: 161 ------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
G+T ++PN+N+ RDPRWGR E GEDP + AV V GLQ + +
Sbjct: 181 VVANGCGITLYAPNVNLVRDPRWGRAEEVYGEDPHLTAELAVGMVTGLQG-NAEGSTSGP 239
Query: 215 NSRPLKVSSCCKHYAA----YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
PL +CCKH+AA Y ++ DR DA V+ +D+ ET+L + CV A
Sbjct: 240 GGGPLVTGACCKHFAAHFAVYQNEDLP-ADRMVLDANVSSRDLWETYLPVMKACVVRAKA 298
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
+ VNG P+CA P+LLN +R W G++V+D D+ +V HK+++ + E+A
Sbjct: 299 T-------HVNGKPTCAHPELLNDVLRESWGFDGFVVSDYDAWSNLVTTHKYVS-TWEEA 350
Query: 331 VAQTLKAGLDLDC--GQYY-TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
A + AG+D + G Y + +AV+ G V + +S + L V +RLG FD
Sbjct: 351 AAAGINAGMDQEGGFGDYSPVDALPDAVRNGTVAAATVRRSFERLMRVRLRLGMFDPPAS 410
Query: 388 YVSLGKQDIC-----SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
G+ C + + LA EAAREGIVL KN LPL AK +A+VGP +
Sbjct: 411 TAVYGEAYQCDYQCETAAKLALAREAAREGIVLFKNAGGALPL--AKGARIALVGPQVDD 468
Query: 443 TVAMIG--NYAGIPCRYMSPIA---GFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTA 497
++G NYA ++P+ G ANV+ GCD VAC + + A A A
Sbjct: 469 WRVLLGAVNYAFEDGPDVAPVTIQKGLEAVANVSVAAGCDSVACAALVDVDGAKRLAAAA 528
Query: 498 DATIILAG---------------LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
DAT+++ G D E+ES DR + LPG Q L+ + + V
Sbjct: 529 DATVVVLGDSFGATDGWPLCRGTRDDGCESESHDRATIELPGEQVALVAALRAASSRLVC 588
Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
+++ A A+ + A+L PG+ GG A+ADV+FG ++P GR PIT Y
Sbjct: 589 VLVHGGAVALGAAAD---DCDAVLDLWVPGQMGGAALADVLFGDYSPAGRSPITMYAA-- 643
Query: 603 VQMLPLTSMPLRPVDSLGYP---GRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQ 658
LP P+ D G TY++Y GP Y FG GLSY F Y + T
Sbjct: 644 TSDLP----PMGVFDEYAGESSNGTTYRYYAGPAPTYAFGDGLSYASFSYAWAAAPPT-- 697
Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
T DA +V N GS +VV VY++
Sbjct: 698 --------------TVDACGA---------------IRLRVAVTNTGSVASDEVVQVYAR 728
Query: 719 -PPAEIAATYIKQVIGFQRV 737
P A + A I+ ++ F RV
Sbjct: 729 VPDATVPAPAIR-LVAFDRV 747
>gi|317474379|ref|ZP_07933653.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
gi|316909060|gb|EFV30740.1| glycosyl hydrolase family 3 N terminal domain-containing protein
[Bacteroides eggerthii 1_2_48FAA]
Length = 733
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 216/790 (27%), Positives = 367/790 (46%), Gaps = 122/790 (15%)
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA------------------- 85
L +Q ++ D+ P IRVKDL+ RMTL EKV QL +
Sbjct: 16 LSVQSQKPIYQDAGQPVEIRVKDLLKRMTLHEKVLQLNQYTFGENDNPNNIGKEVKNLPA 75
Query: 86 --------HGVPRL-GLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASF 135
H P+L Q + E+ G+ P DVI G T +P + SF
Sbjct: 76 EIGSLIYLHTDPKLRNQIQRKAMEESRLGI----PILFGFDVIHGLRTVYPISLAQACSF 131
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
N L + + E+ +G+ + +SP I+VARDPRWGRI+E GEDP+
Sbjct: 132 NPDLVTLACRVAAKESVL------SGIDWTFSPMIDVARDPRWGRISECYGEDPY----- 180
Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
+N V G+ V+G++ + S P +++C KHY Y V G D + D ++ Q +
Sbjct: 181 -LNTVFGIASVKGYQG--EKLSDPYSIAACLKHYVGYGVSE-GGRDYRYTD--ISPQALW 234
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
ET+L P+E VK G A+++M S+N ++GIP+ ++ +L + ++ +W G++V+D ++I+
Sbjct: 235 ETYLPPYEAGVKAG-AATLMSSFNDISGIPATSNHYILTEILKNKWQHDGFVVSDWNAIE 293
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
++ ++ +A +++A + AG+++D Y + V + K++ + ID ++ +
Sbjct: 294 QLI--YQGVAKDRKEAAYKAFHAGVEMDMRDNVYCEYLEQLVAEKKIQVSQIDDAVARIL 351
Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
+ RLG FD + ++ E+I LA A E +VLLKN N LP +S +K V
Sbjct: 352 RLKFRLGLFDEPYAKELIEQERYLQQEDIALAGRLAEESMVLLKNANNLLPFSSM-IKKV 410
Query: 434 AVVGPHANATVAMIGNYA------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
AV+GP A +V ++G +A + Y F + Y+ GC S+ S
Sbjct: 411 AVIGPIAKDSVNLLGAWAFKGKAEDVETIYEGMQKEFGDKVRLDYEQGC--ALDGSDESG 468
Query: 488 FAAS-EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
F+A+ + A+ +D ++ G E+ R + LP Q +L+ + + K P++LV+
Sbjct: 469 FSAALKTAEASDVVVLCLGESKQWSGENASRSTIALPDIQEKLLLHLKQANK-PIVLVLS 527
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
S G + ++AI+ PG GG +A ++ G+ NP G+L +T+
Sbjct: 528 S--GRPLELIRLEPQVEAIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVTF--------- 576
Query: 607 PLTS--MPL--------RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
PL++ +P+ RP D++G Y+ LY FGYGLSYT F Y
Sbjct: 577 PLSTGQIPVYYNMRQSARPFDAMG----DYQDIPTEPLYSFGYGLSYTTFVY-------- 624
Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
SDA + +R D +V N G +G + V+ Y
Sbjct: 625 -----------------SDAKLSSL------KIRKDQKITAEVTVTNAGKVEGKETVLWY 661
Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
P + +K++ F++ + AG ++ +F + + L+ D L GE +
Sbjct: 662 VSDPFCTISRPMKELKFFEKQSLNAGESRVFRFDIDPMRDLSYTDATGKRFLEPGEFIVS 721
Query: 777 VGNGGVSFPI 786
VG ++F +
Sbjct: 722 VGGRKLTFEV 731
>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
6_1_58FAA_CT1]
Length = 850
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 163/447 (36%), Positives = 245/447 (54%), Gaps = 47/447 (10%)
Query: 42 FSKLGLQMSSFLFC------DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
F L L S LF D P R+ DL+SR+T++EK+ L + G+PRL + +
Sbjct: 9 FVVLALVFSGTLFAQKEVYKDMDAPQHERIMDLLSRLTIEEKISLLRATSPGIPRLEIEK 68
Query: 96 YEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
Y +EALHG+ V PG T FP I + +N +I +S EARA +
Sbjct: 69 YYHGNEALHGI--VRPGNF--------TVFPQAIGLASMWNPDFLYEISTVISDEARARW 118
Query: 156 NLGRAG----------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
N G LT+WSP +N+ARDPRWGR ET GEDPF+ G+ V +V+GLQ
Sbjct: 119 NELNRGKDQKRLFSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQ-- 176
Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
G++ R LKV S KH+AA + ++ +R+ + +++E+D+ E +L FE C+
Sbjct: 177 -GND------PRYLKVVSTPKHFAANNEEH----NRFECNPQISERDLREYYLPAFERCI 225
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
+G A S+M +YN +N +P + LL + +R +W +GY+V+DC + ++V +HK++
Sbjct: 226 IDGKAQSIMTAYNAINDVPCTLNTWLLKKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVK- 284
Query: 326 SKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
+ E A LKAGLDL+CG Y NA +Q V E +ID + + M LG FD
Sbjct: 285 TPEAAATLALKAGLDLECGDNVYIEPLMNAYKQYMVSEAEIDTAAYRILRARMMLGLFDD 344
Query: 385 SPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
+ Y +L + +++ +A EAAR+ +VLLKN+ N LP+N K+K++AVVG NA
Sbjct: 345 PAKNPYNALSPSIVGCEKHKNMALEAARQSLVLLKNENNFLPINPKKIKSIAVVG--INA 402
Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGYAN 469
G+Y+G P P++ G N
Sbjct: 403 GNCEFGDYSGKPVNV--PVSVLDGIRN 427
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 131/291 (45%), Gaps = 50/291 (17%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + D TI + G++ S+E E DR+ + LP Q I + ++ P + V++ AG
Sbjct: 594 AKKAIQECDMTIAVMGINKSIEREGRDRDHIELPKDQELFIEEAYKL--NPKMAVVLVAG 651
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + ++ AIL A YPGE+GG A+A+ +FG +NP GRLP+T+Y L
Sbjct: 652 S-SLAVNWMDEHVPAILNAWYPGEQGGTAVAEALFGDYNPAGRLPLTYYRS-------LD 703
Query: 610 SMPLRPVDSLG-YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D RTY ++ G LY FGYGLSYT+F Y R
Sbjct: 704 DLP--PFDDYAVQKNRTYMYFTGKPLYAFGYGLSYTKFDY-------------------R 742
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
L+ DA R +N G +G +V VY + P I
Sbjct: 743 KLSVDQDAENVR----------------LSFTIKNSGKYNGDEVAQVYVQFPEIGVKVPI 786
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
KQ+ GF+RV + G+ + K L I + P+G + VG
Sbjct: 787 KQLKGFERVHIAKGKTLPVTITV-PKKELRIWNERKGEFFTPSGNYVFMVG 836
>gi|160887545|ref|ZP_02068548.1| hypothetical protein BACOVA_05565 [Bacteroides ovatus ATCC 8483]
gi|156107956|gb|EDO09701.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 736
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 218/741 (29%), Positives = 340/741 (45%), Gaps = 154/741 (20%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT FPT I A+++ L K++GQ ++
Sbjct: 83 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 130
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R+ + G + P +++ RDPRW R+ ET GEDP + G + V GL
Sbjct: 131 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL------- 178
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+L+ + +++ KH+ AY V Y A V +D+ + FL PF + G
Sbjct: 179 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG- 233
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++G P ++ LL Q +R EW G++V+D SI+ + ++H F+A +KE+
Sbjct: 234 ALSVMTSYNSIDGTPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVAPTKEN 292
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A Q++ AG+D+D G YTN +AVQ G++ +T ID ++ + + +G F+
Sbjct: 293 AAIQSVMAGVDVDLGGDAYTNLC-HAVQSGQMDKTVIDTAVCRVLRMKFEMGLFEHPYVD 351
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+ + + E+IELA + A+ I LLKN+ + LPL S + VAV+GP+A+ M+G
Sbjct: 352 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKTINKVAVIGPNADNRYNMLG 410
Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGC---DDVACKSNNSIFAASE 492
+Y GI + +SP V Y GC D + +I AA
Sbjct: 411 DYTAPQEDSNVKTVLDGILTK-LSPF-------RVEYVRGCAIRDTTVNEIEQAIKAARR 462
Query: 493 AA------------------KTADATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQV 533
+ K A + G +E E DR L L G Q +L+ +
Sbjct: 463 SEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESL 522
Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
+ K P+I+V + ++ +A + A+L A YPG+EGG AIADV+FG +NP GRL
Sbjct: 523 QKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGRL 579
Query: 594 PIT----------WYNG------DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
PI+ +YN DYV+M +S P LY F
Sbjct: 580 PISVPRSVGQIPVYYNKKAPRNHDYVEM---SSFP---------------------LYSF 615
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
GYG+SYT F+Y+ L V+ RC FE
Sbjct: 616 GYGMSYTTFEYSDLQ-------------------------------VVQKSARC---FEV 641
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
+N G DG +V +Y + +KQ+ F+R ++ G K++ FV +
Sbjct: 642 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDF 700
Query: 758 NIVDYAANTLLPAGEHTIFVG 778
+V+Y ++ +G + +G
Sbjct: 701 FLVNYTLKKVVESGNFHLMIG 721
>gi|373956830|ref|ZP_09616790.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
gi|373893430|gb|EHQ29327.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
paludis DSM 18603]
Length = 823
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 226/822 (27%), Positives = 361/822 (43%), Gaps = 141/822 (17%)
Query: 35 FVCDPGRFSK----LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
F DP + K L ++ DS+ P R+ DL+ +MTL+EK QL +G R
Sbjct: 50 FKADPPIYKKGWIDLNKNGKKDIYEDSTQPIEARLNDLIGQMTLEEKTCQLATL-YGYKR 108
Query: 91 L---GLPQYEWWSEALH-GVSNVG------------------------------------ 110
+ +P EW +E G++N+
Sbjct: 109 ILKDSVPTPEWKNEIWKDGIANIDEHLNGFITWGKTSDLPLVTDVKKHVWAMNQTQRFFI 168
Query: 111 -------PGTHFDDVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
P ++ I G AT+FPT + ++++ L ++G EARA LG
Sbjct: 169 EQTRLGIPVDFTNEGIRGVEAYQATAFPTQLNMGMTWDKPLVNQMGNITGMEARA---LG 225
Query: 159 RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
+ ++P ++VARD RWGR+ E GEDP++V R V +G+Q
Sbjct: 226 YTNV--YAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMAKGMQQNN------------ 271
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
++++ KH+A Y + D +V +++E L PF+ +KE VM SYN
Sbjct: 272 -QIAATAKHFAVYSANKGGREGLARTDPQVAPREVENILLYPFKKVIKEAGLMGVMSSYN 330
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
+GIP L Q +R E+ GY+V+D D+++ + + H AD K DAV Q AG
Sbjct: 331 DYDGIPISGSSYWLIQRLRQEFGFKGYVVSDSDALEYLYNKHHVAADLK-DAVYQAFMAG 389
Query: 339 LDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGK 393
+++ + + V++GK+ I+ ++ + V +LG FD Q
Sbjct: 390 MNVRTTFRTPDSIIIYARQLVKEGKLPIDTINSRVRDVLRVKFKLGLFDHPYVQDAEASA 449
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
+ + N +A +A++E IVLLKN LPL +K +T+AV+GP+A +Y +
Sbjct: 450 KLVNCAANQAVALQASKESIVLLKNKGAILPL--SKQQTLAVIGPNALNDDYAHTHYGPL 507
Query: 454 PCRYMSPIAGFS---GYANVTYKTGCD--------------DVACKSNNSIFAASEAAKT 496
+ ++ + G G V Y GC+ D I +A A+
Sbjct: 508 ASKSINILEGIQAKVGAGKVLYALGCNLVDKHWPESEILPQDPDQAEQAKIDSAVTIARH 567
Query: 497 ADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFA 556
AD +++ G + E+ R L LPGYQ +L+ V K PV++V++ + + I +
Sbjct: 568 ADVAVVVLGGNTQTAGENKSRTSLDLPGYQLRLVKAVKATGK-PVVVVLIGSQPMTINW- 625
Query: 557 ETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPV 616
+ +I I++AGYPG +GG A+ADV+FG +NPGG+L +T+ V LP + P +P
Sbjct: 626 -IDQHIDGIIYAGYPGTQGGTAVADVLFGDYNPGGKLTLTFPKS--VGQLPF-NFPTKP- 680
Query: 617 DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
+S G K LYPFG+GLSYT F Y+ L + IQ + N T
Sbjct: 681 NSETDEGELAKIKG--LLYPFGFGLSYTTFAYSDLKISPAIQSDQG--------NVTVSC 730
Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
T N G G +VV +Y + TY K + GF R
Sbjct: 731 KVT-----------------------NTGKVAGDEVVQLYLRDVLSTVTTYEKVLRGFDR 767
Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ ++ G K + F L + + ++ GE + VG
Sbjct: 768 LSLKPGETKEVMFTI-VPDDLKLYNRQMKYVVEPGEFKVMVG 808
>gi|423215778|ref|ZP_17202304.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
CL03T12C04]
gi|392691421|gb|EIY84666.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
CL03T12C04]
Length = 1049
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 219/769 (28%), Positives = 362/769 (47%), Gaps = 108/769 (14%)
Query: 56 DSSLPYSIR----VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG- 110
+S LP++ VKDL+SRMT++EK+ QL + G L P+ E+ S++L VG
Sbjct: 328 NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386
Query: 111 -------------------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIG 144
P DVI G T FPT + + S++ + ++
Sbjct: 387 VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ + E+ A AGL + ++P +++ARD RWGR+ E GED ++ A V G Q
Sbjct: 447 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
N + NS V +C KH+ AY + R + ++E+ + +T+L PF+
Sbjct: 501 -----WNLWENNS----VLACAKHWVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKA 548
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
C+ G + M ++N +NGIP+ A P LL +RG+W+ +G++V+D ++++ +V + +
Sbjct: 549 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLV--AQGV 605
Query: 324 ADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
A+ +DA +G+D+D Y + ++ GK+ D+D S+ + + LG F
Sbjct: 606 AEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665
Query: 383 DGSPQYVS--LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
++ + Q I E ++ A + A + VLLKND +TLPL + V+++AVVGP A
Sbjct: 666 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724
Query: 441 NATVAMIGNY-AGIPCRYMSPIAGFSGYAN--------VTYKTGCDDVACKSNNSIFAAS 491
+ ++G++ A R+++ + G N V Y GC D + + A
Sbjct: 725 DNQTELLGSWRARGEDRHVTTV--LQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAV 781
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
+ A +D I + G + ES R L LPG Q +LI ++ K PV++V+M+ +
Sbjct: 782 KLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPL 840
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD-YVQMLPLTS 610
I + + N+ AIL + G G AIAD++FG +NP GRL I++ + V +
Sbjct: 841 SIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYK 898
Query: 611 MPLRPVD-SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
RP D R N P LYPFGYGLSYT F Y++ T+
Sbjct: 899 KSGRPGDMPHSSTTRHIDVPNAP-LYPFGYGLSYTTFSYSVPQSTQK------------- 944
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
YT + V N G DG + V +Y +K
Sbjct: 945 -EYTRQET-----------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVK 986
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++ F+++F++AG +K ++F + +L D A N ++ GE I G
Sbjct: 987 ELKAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034
>gi|336399403|ref|ZP_08580203.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
gi|336069139|gb|EGN57773.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
Length = 757
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 221/769 (28%), Positives = 348/769 (45%), Gaps = 130/769 (16%)
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH------GVSNVG-------- 110
V+DL+ +MTL EK+ QL + G G PQ S++L + NVG
Sbjct: 46 VRDLIKKMTLTEKIGQLSQYVGGSLLTG-PQSGALSDSLFVRGMVGSILNVGGVESLRKL 104
Query: 111 ------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
P DVI G T FPT + + S++ +G T A
Sbjct: 105 QEKNMQSSRLKIPVLFAFDVIHGYKTIFPTPLAESCSWD------LGLMFETAKAAAIEA 158
Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
+G+ + ++P +++ARDPRWGRI E GED ++ + A VRG Q G
Sbjct: 159 SASGIHWTFAPMVDIARDPRWGRIVEGAGEDTYLACKIAETRVRGFQWNLG--------- 209
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
+P V +C KH+ AY G D D ++ + E +L PF+ CV G + M +
Sbjct: 210 KPNSVYACAKHFVAYGAPQ-AGRDYAPVDLSLST--LAEVYLPPFKACVDAG-VHTFMSA 265
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
+N +NG+P+ + L+ +R +W HG++V+D +++Q + + +A++ DA
Sbjct: 266 FNSLNGVPATGNRWLMTDILRNQWKFHGFVVSDWNAVQELKAHG--VAETDTDAALMAFD 323
Query: 337 AGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ- 394
AG+D+D Y AV +GK+ ID S++ + LG FD +++ + ++
Sbjct: 324 AGVDMDMTDGLYNRCLEKAVCEGKLDMQAIDTSVERILRAKYALGLFDDPYRFLDVKRER 383
Query: 395 -DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-- 451
+I S+ +LA +AA +VLLKND TLPL S K +A++GP A+ ++G++
Sbjct: 384 REIRSEAVTKLARKAAASSMVLLKNDHATLPL-SKHTKRIALIGPLADNRSEVMGSWKAR 442
Query: 452 -----------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
GI + S +A VTY GCD + S AA EAAK +D
Sbjct: 443 GEESDVVTVLDGIKKKLGSDVA-------VTYVQGCDFLE-PSTREFPAAFEAAKQSDVV 494
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
I + G + ES R L LPG Q L++ + + + P+++V+M+ G + + +
Sbjct: 495 IAVVGEKALMSGESRSRAVLRLPGQQEALLDTLQKAGR-PLVVVLMN--GRPLCLQKVDR 551
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPL---RPVD 617
A+L A +PG + G A+AD++FG P +L ++ PLT +
Sbjct: 552 QADALLEAWFPGTQCGNAVADILFGDAVPSAKLTTSF---------PLTEGQIPNNYNYK 602
Query: 618 SLGYPG-----RTYKFYNGPT--LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
G PG T + + P LYPFGYGLSYT F Y
Sbjct: 603 RSGRPGDMSHSSTVRHIDVPNRNLYPFGYGLSYTTFSYG--------------------- 641
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
+ +CP D + VD N G DG ++V +Y +K+
Sbjct: 642 -------EMQCP----KQFNADGTLQVSVDVTNTGGYDGEEIVQLYVADKVASMVRPVKE 690
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+ GFQ+VF+ G+ KRI F NA + L + + ++ G I VG
Sbjct: 691 LKGFQKVFIPKGQTKRIDFTLNA-RDLGFWNNSMQYIVEPGTFEIMVGT 738
>gi|423346097|ref|ZP_17323785.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
CL03T12C32]
gi|409220895|gb|EKN13848.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
CL03T12C32]
Length = 955
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 221/810 (27%), Positives = 361/810 (44%), Gaps = 146/810 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D ++P RV+DL+S+M ++EK Q+ +G R+ LP +W W
Sbjct: 60 VYEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGA 118
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 119 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 178
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N +L K+G E R + G T ++P ++V RD RWG
Sbjct: 179 YIATNFPTQLGLGHTWNRNLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWG 232
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V + +G+Q TD +V++ KHY AY +
Sbjct: 233 RYEEVYGESPYLVAELGIEMAKGMQ--------TDH-----QVAATSKHYIAYSNNKGGR 279
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + P++ +KE VM SYN +G P + L +RG
Sbjct: 280 EGMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRG 339
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E+ GY+V+D D+++ + H AD KE +V Q++ AGL++ C Y
Sbjct: 340 EFGFRGYVVSDSDAVEYLFSKHGTAADMKE-SVLQSVLAGLNIRCTFRSPDSYVLPLREL 398
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
+ +G + + ID ++ + V +G FD P + L + D + EN ++A +A++E
Sbjct: 399 IAEGAIPMSTIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCAENQQVALQASKES 457
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----A 468
+VLLKN LPL+ K+ +AV GP+A+ + +Y + + + G
Sbjct: 458 LVLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGT 517
Query: 469 NVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
NV + GCD V + + I A E AK +D T+++ G E+
Sbjct: 518 NVLFTKGCDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSDRTCGEN 577
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q L+ V K PV+L++++ + I +A+ + AIL A YPG +
Sbjct: 578 KSRSSLDLPGRQLDLLQAVVATGK-PVVLILINGRPLSINWAD--KYVPAILEAWYPGSQ 634
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VD---SLGYPGRTYKF 628
GG AIAD +FG +NPGG+L +T+ V +P + P +P VD + G G +
Sbjct: 635 GGTAIADALFGDYNPGGKLTVTF--PKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV 691
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
NGP LYPFGYGLSYT F+Y+ +S I + +
Sbjct: 692 -NGP-LYPFGYGLSYTTFEYSDISIQPAIVTQVQPVT----------------------- 726
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
+RC N G G +VV +Y + TY K ++GF R+ + G K +
Sbjct: 727 VRC--------KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELT 778
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
F + L +++ + ++ G+ + VG
Sbjct: 779 FTIEP-RDLQLLNSDNHWVVEPGDFKVMVG 807
>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
CL09T03C04]
Length = 864
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 168/454 (37%), Positives = 233/454 (51%), Gaps = 42/454 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ DSSL R +DL+ ++TL+EKV + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 24 YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASF I AVS EARA A GLT W
Sbjct: 82 --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ VN V+GLQ D N + K+ +C
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CMDANQKYDKIHACA 186
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKE VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDP 243
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +R +W G +++DC +I HK D+ E A A + +G DL+
Sbjct: 244 CCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSD 399
CG Y +A ++G + E DID S+K L LG D P V K +CS
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSA 360
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
E+ L+ + AR+ + LL N N LPL +T+AV+GP+AN +V GNY G P ++
Sbjct: 361 EHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTIT 419
Query: 460 PIAGFSGYA----NVTYKTGCDDVACKSNNSIFA 489
+ G + Y+ GC V S+F+
Sbjct: 420 LLEGIRSAMGENDKLIYEQGCSWVERSLIRSVFS 453
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 146/326 (44%), Gaps = 62/326 (19%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
FSG A + + D+ K +I K AD I G+ S+E E +
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFR 628
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR D+ LP Q +LI + + K VI V S G IA +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A+A+V+FG +NP GRLP+T+Y +T +P + GRTY+++ G
Sbjct: 686 SGGKAVAEVLFGDYNPAGRLPVTFYRN-------ITQLP--NFEDYNMTGRTYRYFKGDP 736
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFGYGLSYT F Y + +TI+V + +K P
Sbjct: 737 LFPFGYGLSYTTFNYGNIKLEQTIKV--------------GETAKIIVP----------- 771
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
N G+ DG +VV VY K E A +K + F+RV + AG+ ++
Sbjct: 772 -------VTNTGNRDGEEVVQVYLK-KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP 823
Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
K L D NT+ AG I VG
Sbjct: 824 -KQLEWWDTQTNTMRTLAGNFDIMVG 848
>gi|409198288|ref|ZP_11226951.1| glycoside hydrolase 3 [Marinilabilia salmonicolor JCM 21150]
Length = 747
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 225/766 (29%), Positives = 357/766 (46%), Gaps = 117/766 (15%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPR---------------LGLPQYEWWSE----ALH 104
RV+ L+SRMTL+EK+ Q+ P L + Q E +E AL
Sbjct: 33 RVESLLSRMTLEEKIGQMNQLNGRNPDEKLMSRIRNGEVGSLLNIEQPELINEIQRIALE 92
Query: 105 GVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
P DVI G T FP + ASFN S+ +G AR G
Sbjct: 93 ESRLGIPLLIARDVIHGYKTIFPIPLGQAASFNPSI---VGTGARVAAREATQDG----I 145
Query: 164 YWS--PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
W+ P ++++RDPRWGRI E+ GED ++ + + +RG Q DL + P +
Sbjct: 146 RWTFAPMMDISRDPRWGRIAESFGEDTYLTTKLSSAMIRGFQ-------GNDLKN-PSSM 197
Query: 222 SSCCKHYAAYD-VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
++C KH+ Y V+ K + + R + +L PF+ V+EG +++M S+N
Sbjct: 198 AACAKHFIGYGAVEGGKDYNSTYIPPR----QLRNVYLPPFKAAVEEG-VATIMTSFNSN 252
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
+GIP DP LL +R EW G +V+D S++ M+ H F + KE A+ + + AGLD
Sbjct: 253 DGIPPSGDPWLLTGILRDEWKFDGVVVSDWASVKEMI-AHGFAENGKEAAL-KAVNAGLD 310
Query: 341 LDCGQ--YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC- 397
++ Y+TN + + +GKV E ID +++ + + +RLG FD Y+S +
Sbjct: 311 MEMVSECYFTNIK-DLINEGKVSEKTIDDAVRNILRLKLRLGLFDNP--YISEEDPRVAY 367
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPC 455
S E+++ A AA E +VLLKN+ TLP++S VKT+ VVGP A+A +G + G
Sbjct: 368 SKEHLDAAKMAAEESMVLLKNEDQTLPISSV-VKTICVVGPLADAPHDQMGTWVFDGEKE 426
Query: 456 RYMSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
+ ++P+ + N+ Y+ K + AA+ +D I G + +
Sbjct: 427 KTITPLKALRQLYGDKVNIIYEPTLKYSRDKDRSKFSKTLAAARKSDVVIAFVGEESILS 486
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI-KAILWAGY 570
E+ DL L G Q +LI+ ++E A P++ V+M+ + I T + K++++A +
Sbjct: 487 GEAHSLADLNLRGAQLELISALSE-AGTPLVTVVMAGRPLTIG---TEVELSKSVIYAWH 542
Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRP----VDSLG 620
PG GG AIAD++FGK P G+LP+T+ V +P+ T P R +D +
Sbjct: 543 PGTMGGPAIADILFGKTVPSGKLPVTFPK--MVGQIPVFYNHNSTGRPARGTEVLIDDIP 600
Query: 621 YPGRTYKFYNGP--------TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
R N L+ FGYGLSYT F+Y+ L NL+
Sbjct: 601 LEARQSSLGNTSYYLDAGFDPLFHFGYGLSYTSFEYSDL-----------------NLSN 643
Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
+S D V N G G+++V +Y+ + +K++
Sbjct: 644 SS--------------FHPSDTLRVSVQLSNTGDFQGTEIVQLYTADKSASVVRPVKELK 689
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
GFQRV V+ G K + F + + + ++ AGE +I VG
Sbjct: 690 GFQRVLVQPGETKDVVFHLPMSE---LSFWNDGDVVEAGEFSIMVG 732
>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
Length = 854
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 151/445 (33%), Positives = 246/445 (55%), Gaps = 37/445 (8%)
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
L ++ F + ++ L R DL SR+TL+EK + + + + +PRLG+PQ+EWWSEALH
Sbjct: 16 LPMKAQQFPYQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALH 75
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR----- 159
G+ G AT FP + +S++++L +K+ AVS E R +
Sbjct: 76 GIGRNG----------FATVFPITMGMASSWDDALLQKVFDAVSDEGRVKAQQAKRSGTI 125
Query: 160 ---AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
GL++W+PNIN+ RDPRWGR ET GEDP++ R + VRGLQ +S
Sbjct: 126 KRYQGLSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPS--------DS 177
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMC 275
+ K+ +C KH+A + W +R+ F+ + E+D+ ET+L F+ V++GD + VMC
Sbjct: 178 KYRKLLACAKHFAVHSGPEW---NRHTFNVEDLPERDLWETYLPAFKALVQQGDVAEVMC 234
Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQT 334
+Y R++G P C + + L +R EW+ G +V+DC ++ H ++ A A+
Sbjct: 235 AYQRIDGQPCCGNNRFLKSILRNEWNYQGMVVSDCWAVPDFWKKGHHEVSPDATHASAKA 294
Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLG 392
+ +G D++CG Y+N AV+ G +KE D+D S++ L LG FD + +
Sbjct: 295 VLSGTDVECGSDYSNLP-EAVRAGIIKEADVDVSVRRLLEARFALGDFDPDELVPWTKIS 353
Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
+ + S + +LA + AR+ +VLL+N+ + LPL + K V VVG +A + M GNY+G
Sbjct: 354 ESVVASKAHKQLALDMARKSMVLLQNN-DILPLKRSGQKIV-VVGANAIDSTMMWGNYSG 411
Query: 453 IPCRYMSPIAGFSGYAN-VTYKTGC 476
P + ++ + G ++ VT+ GC
Sbjct: 412 YPTQTVTILQGLQTKSDQVTFIPGC 436
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 140/300 (46%), Gaps = 68/300 (22%)
Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
AD I + G+ +E E + DR + LP Q ++I ++E + +V +
Sbjct: 599 ADVVIFVGGISPRLEGEEMEVSDPGFKGGDRTTIELPQAQREVIKALSEAGRR---IVFV 655
Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
+ G IA + + AIL A YPGE+GG A+ADV+FG +NP G+LP+T+Y D L
Sbjct: 656 NCSGSAIALTPESQRVDAILQAWYPGEQGGTAVADVLFGDYNPSGKLPVTFYKND--AQL 713
Query: 607 PLTSMPLRPVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
P D L Y GRTY+++ L+PFGYGLSYTQF + +
Sbjct: 714 P---------DFLDYRMAGRTYRYFKETPLFPFGYGLSYTQF-------------TIGQP 751
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
++ N + +V N G DG +VV VY + + A
Sbjct: 752 RYINN--------------------------QVQVSVSNTGKRDGDEVVQVYIR-RTDDA 784
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVGNGGVS 783
A IK + GFQRV ++ G K++ +S D ++NT+ + G + + VG+ ++
Sbjct: 785 AGPIKTLRGFQRVSLKVGETKQVSVSL-PRESFEWWDASSNTMRVIPGNYEVMVGSSSMA 843
>gi|293371041|ref|ZP_06617583.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292633971|gb|EFF52518.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 791
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 212/735 (28%), Positives = 333/735 (45%), Gaps = 142/735 (19%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G T FPT I A+++ L K++GQ ++
Sbjct: 138 RLGIPMF-LAEEAPHGHMAIG-----------ITVFPTGIGMAATWSPELVKEVGQVIAK 185
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R+ + G + P +++ RDPRW R+ ET GEDP + G V GL + G+
Sbjct: 186 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGAAMVDGL--INGN- 237
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
SR + KH+ AY V + A V +++ E FL PF+ + G
Sbjct: 238 -----ISRKNSTIATLKHFLAYAVPEG---GQNGNQALVGMRELHENFLPPFKKAIDAG- 288
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++GIP A+ LLNQ +R EW G++V+D SI+ + ++H + A S ED
Sbjct: 289 ALSVMTSYNSIDGIPCTANSYLLNQLLRNEWKFRGFVVSDLYSIEGIYESH-YTASSIED 347
Query: 330 AVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A Q + AG+D+D G+ YTN AV++ ++ E ID+ + + + +G F+
Sbjct: 348 AAIQAVSAGVDVDLGGEAYTNIY-RAVKEKRLSEAIIDEVVCRVLRLKFEMGLFENPYVD 406
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+ + + + +I A A+ + LLKN + LPL S ++ VAV+GP+A+ M+G
Sbjct: 407 PQIAIERVRNANHIANARRMAQASVTLLKNRHDILPL-SKNIRKVAVIGPNADNCYNMLG 465
Query: 449 NYAGIPCR------YMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
+Y P + + I + V Y GC + +NN I A EAA AD I
Sbjct: 466 DYTA-PQKDENIKTVLDGIISKLSLSRVEYVRGC-AIRDTTNNEIAKAVEAANRADVVIA 523
Query: 503 LAGLDLSVE-----------------------AESLDREDLWLPGYQTQLINQVAEVAKG 539
+ G + + E DR L L G Q +L+ + K
Sbjct: 524 VVGGSSARDFKTTYKETGAAIADKSQISDMECGEGFDRATLSLLGKQLELLESLKSTRK- 582
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT--- 596
P+I+V + ++ +A + + A+L A YPG+EGG AIADV+FG +NP GRLP++
Sbjct: 583 PLIVVYIEGRPLNKNWAAEHAD--ALLTAYYPGQEGGDAIADVLFGDYNPAGRLPVSVPR 640
Query: 597 -------WYNG------DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
+YN DYV+M + LY FGYGLSY
Sbjct: 641 SEGQIPVYYNKKTPKCHDYVEM------------------------SASPLYSFGYGLSY 676
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
+ F+Y+ L T+ + +FE D +N
Sbjct: 677 STFEYSNLKVTQQAPL----------------------------------HFEISFDVEN 702
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G DG +V +Y + ++Q+ F+R F++ G K I F + L+I++
Sbjct: 703 TGKYDGEEVAQLYIRDEYASVVRALRQLKHFKRFFLKQGEKKTIVFTL-VEEDLSIINQK 761
Query: 764 ANTLLPAGEHTIFVG 778
++ G + +G
Sbjct: 762 MERIVEPGSFQLMIG 776
>gi|395802372|ref|ZP_10481625.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
gi|395435613|gb|EJG01554.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
Length = 745
Score = 262 bits (669), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 234/790 (29%), Positives = 354/790 (44%), Gaps = 151/790 (19%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD---FAH-GVPRLGLPQYEWWSEAL 103
Q ++ + S + + L+S+MTL+EKV L FA+ GV RLG+P+ + L
Sbjct: 33 QTEEYVGKEISTDHDAEIDKLISQMTLEEKVGMLHGNSMFANAGVKRLGIPELKMADGPL 92
Query: 104 HGV------SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
GV N P +D AT +P A++N + G ++ E RA
Sbjct: 93 -GVREEISRDNWAPAGWTNDF---ATYYPAGGALAATWNAEMAHTFGTSLGEELRA---- 144
Query: 158 GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
R SP IN+ R P GR E EDPF+ + AV + GLQ+ +
Sbjct: 145 -RDKDMLLSPAINMVRTPLGGRTYEYMSEDPFLNKKIAVPLIVGLQEKD----------- 192
Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
V +C KHYAA N + +R D ++ E+ + E +L FE VKE A S+M +Y
Sbjct: 193 ---VMACVKHYAA----NNQETNRDFVDVQIDERTLREIYLPAFEASVKEAKAYSIMGAY 245
Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
N+ G C + +LN+ +R EW G +V+D ++ + A++LK
Sbjct: 246 NKFRGEYLCENDYMLNKILRDEWGFKGVVVSDWAAVH---------------STAKSLKN 290
Query: 338 GLDLDCGQ---YYTNFTGN----AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS 390
GLD++ G + F + AV+ G+V E +ID +K + VL ++ G +
Sbjct: 291 GLDIEMGTPKPFNEFFLADKLIVAVKSGEVSEKEIDLHVKRILRVLFQVKAMGGGER--- 347
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
K I ++ + + A + A E IVLLKN+ N LPL VK++AV+G +A A+ G
Sbjct: 348 -AKGSIATEAHYQDAYKIAAEAIVLLKNENNALPLQLDGVKSIAVIGNNATKKNALGGFG 406
Query: 451 AGIPC-RYMSPIAGFSGY----ANVTYKTGCDDVACKSNN-------------------- 485
AG+ R ++P+ G + Y G + K N
Sbjct: 407 AGVKTKREVTPLEGLKNRLPSSVKINYAEGYLERYDKKNRGNLGNITANGPVTIDELDPA 466
Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
+ A +AAK +D II AG + E E+ DR DL LP Q +LI +V +A P +V+
Sbjct: 467 KVQEAVDAAKNSDVAIIFAGSNRDYETEASDRRDLHLPFGQEELIKKV--LAVNPKTIVV 524
Query: 546 MSAGG-VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
M AG DI E + A++W+ + G EGG A+ADV+ GK NP G+LP +
Sbjct: 525 MIAGAPFDIN--EVSKKSSALVWSWFNGSEGGNALADVILGKVNPSGKLP-------WTM 575
Query: 605 MLPLTSMPLRPVDSLGYPG--------------RTYKFYNGPTLYPFGYGLSYTQFKYNL 650
+ L P +S +PG R + N LYPFGYGLSYT F
Sbjct: 576 PIALKDSPAHATNS--FPGDKAVNYAEGLLIGYRWFDTKNVAPLYPFGYGLSYTSF---A 630
Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
L KT +K + +N D E VD +N G DG
Sbjct: 631 LDNAKT-----DKTSYAQN-----------------------DVIEVTVDVKNTGKVDGK 662
Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN--TLL 768
+VV +Y+ +++ GF++ V+AG + ++ K L D A+ T+
Sbjct: 663 EVVQLYTSKSDSKITRAAQELKGFKKAEVKAGSSTKVTIKV-PVKELAYYDVASKKWTVE 721
Query: 769 PAGEHTIFVG 778
P G++TI +G
Sbjct: 722 P-GKYTIKLG 730
>gi|315500297|ref|YP_004089100.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
excentricus CB 48]
gi|315418309|gb|ADU14949.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
excentricus CB 48]
Length = 882
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 160/468 (34%), Positives = 241/468 (51%), Gaps = 45/468 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+S P R DLVSRMTL+EK QL + A +PRL + +Y WW+E LHGV+ G
Sbjct: 35 YQDASKPPEARAADLVSRMTLEEKTAQLINDAPAIPRLNVREYNWWNEGLHGVAAAG--- 91
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTY 164
AT FP + A+++E L ++ + +S E RA Y R GLT
Sbjct: 92 -------YATVFPQAVGLAATWDEPLIHRVAETISVEFRAKYLKERHRFGGSDWFGGLTV 144
Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL--KVS 222
WSPNIN+ RDPRWGR ET GEDP++ R V +VRGLQ P+ +
Sbjct: 145 WSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQ-----------GDDPVYYRTV 193
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+ KHYA V + R+ + + D+ +T+L F + EG A S+MC+YN +NG
Sbjct: 194 ATPKHYA---VHSGPEAGRHRDNVNPSPYDLADTYLPAFRATITEGQAGSIMCAYNAING 250
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDL 341
P+CA+ LL + +R +W GY+V+DCD++ + + E+ V + G DL
Sbjct: 251 QPACANEDLLVKYLRKDWGFKGYVVSDCDAVGDIYYKTSHAYRPTPEEGVTAAYQVGTDL 310
Query: 342 DCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSD 399
CG + AV+QG + E +D +L L+T +LG FD + + + +D +
Sbjct: 311 ICGNANEADHLTRAVRQGLLPEKTLDTALIRLFTARFKLGQFDPPAKVFPKITAEDYDTP 370
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
N + + + A +VLLKN+ N LPL + + +AV+GP+A++ +++GNY G P ++
Sbjct: 371 ANRDFSQKVAESAMVLLKNENNLLPLK-GEPRQIAVIGPNADSMDSLVGNYNGDPSHPVT 429
Query: 460 PIAGFSGY---ANVTYKTGC---DDVACKSNNSIFAASEAAKTADATI 501
++G A VTY G D V +S F EA T+
Sbjct: 430 VLSGIRARFPKATVTYAPGSGLIDPVMTAVPDSAFCRDEACTQTGVTV 477
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 147/308 (47%), Gaps = 55/308 (17%)
Query: 483 SNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQ 532
S+ +A AAK AD + +AGL VE E + DR L LP Q +++ Q
Sbjct: 592 SDTGAQSAVAAAKEADLVVFVAGLSQRVEGEEMRVETEGFSGGDRTTLNLPPAQQKVLEQ 651
Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
V+ K PV+LV+++ + I +A+ N + AI+ A YPG +GG A+A ++ G ++P GR
Sbjct: 652 VSAAGK-PVVLVLINGSALGINWADKN--VPAIIEAWYPGGQGGAAVARLIAGDYSPAGR 708
Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
LP+T+Y LP + GRTY+++ G LYPFGYGLS+T F+Y L+
Sbjct: 709 LPVTFYRS--ADQLPA-------FNDYNMKGRTYRYFKGEALYPFGYGLSFTTFRYAPLT 759
Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
+ + D D N GS D +V
Sbjct: 760 LS-------------------------------ARQVAGDGQVSVSADVTNSGSRDSDEV 788
Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
V +Y P + A I+ + F+R+ ++AG K ++F + ++L+ V+ + + G+
Sbjct: 789 VQLYVSYPGQKLAP-IRALARFERIHLKAGETKTVRFTLDP-QALSTVNADGSRSVKPGK 846
Query: 773 HTIFVGNG 780
+++G G
Sbjct: 847 VELWLGGG 854
>gi|410096880|ref|ZP_11291865.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225497|gb|EKN18416.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
CL02T12C30]
Length = 799
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 213/730 (29%), Positives = 335/730 (45%), Gaps = 125/730 (17%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P ++ +E +HG+++ AT P I +++N L + G
Sbjct: 139 RLGIP-VDFSNEGIHGLNHTK-----------ATPLPAPINIGSTWNRDLVHQAGDIAGK 186
Query: 150 EARAM-YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EA+A+ YN ++P ++VARDPRWGR+ ET GEDP++VG + V+G+Q
Sbjct: 187 EAKALGYN------NVYAPILDVARDPRWGRVLETYGEDPYLVGELGIQMVKGIQ----- 235
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+N V+S KH+A Y + D V +++ E L PF+ V++
Sbjct: 236 QNG---------VASTLKHFAVYSIPKGGRDAAVRTDPHVAPRELHEIHLYPFKRVVQKA 286
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
VM SYN +G+P A L Q +R E+ GYIV+D ++++ + H +ADS E
Sbjct: 287 HPKGVMSSYNDWDGVPVTASYYFLTQLLRQEYGFKGYIVSDSEAVEFVQTKH-HVADSYE 345
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTG---------NAVQQGKVKETDIDKSLKYLYTVLMRL 379
+AV Q ++AGL++ TNFT V++GK+ +D+ + + V L
Sbjct: 346 EAVRQVVEAGLNV-----RTNFTHPKDYILPVRKLVKEGKLSMKSVDRMVADVLRVKFEL 400
Query: 380 GFFDGSPQYVSLGK---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVV 436
G FD SP YV K + + +D++ + + ++ +VLLKN+ N LPL+ + K V +
Sbjct: 401 GLFD-SP-YVKDPKAADKIVGADKHRDFVLDMQKQSLVLLKNENNLLPLDKNQTKKVLIA 458
Query: 437 GPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCD--------------D 478
GP A T MI Y ++ G Y V Y GC+
Sbjct: 459 GPLAKETNYMISRYGPQGLDNITVYDGIKDYLGNQTEVVYAKGCEVKDANWPDSEIVPTP 518
Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
+ + I A+ AA D I + G D S ES R L LPG Q QL+ + K
Sbjct: 519 LTDEEKKGIAEAATAAADCDVIIAVLGEDESCTGESKSRTGLDLPGRQQQLLEALHATGK 578
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
PV+LV+++ + I +A + NI +IL A +PG+ GG AIA +FG +NPGGRL +T+
Sbjct: 579 -PVVLVLINGQPLTINWA--DRNIPSILEAWFPGQLGGEAIAQTLFGDYNPGGRLSVTFP 635
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP----------TLYPFGYGLSYTQFKY 648
+ + + P +P G +++ GP LYPFGYGLSYT F Y
Sbjct: 636 RS--IGQIEF-NFPFKPGSQDG------QYFEGPNGSGRTRVNGALYPFGYGLSYTTFAY 686
Query: 649 NLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTD 708
+ NL+ + ++ P + D+ G
Sbjct: 687 S-------------------NLSVKQETPYSQSPVTVTVDVTN------------TGKRA 715
Query: 709 GSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL 768
G +VV +Y + Y + GF+R+ ++ G K + FV + L I+D +
Sbjct: 716 GDEVVQLYIRDKVSSVIAYESVLRGFERISLQPGETKTVSFVL-LPEDLQILDRHMEWTV 774
Query: 769 PAGEHTIFVG 778
GE + +G
Sbjct: 775 EPGEFEVRIG 784
>gi|299149090|ref|ZP_07042152.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
gi|298513851|gb|EFI37738.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
Length = 1049
Score = 261 bits (668), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 219/769 (28%), Positives = 361/769 (46%), Gaps = 108/769 (14%)
Query: 56 DSSLPYSIR----VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG- 110
+S LP++ VKDL+SRMT++EK+ QL + G L P+ E+ S++L VG
Sbjct: 328 NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386
Query: 111 -------------------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIG 144
P DVI G T FPT + + S++ + ++
Sbjct: 387 VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446
Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
+ + E+ A AGL + ++P +++ARD RWGR+ E GED ++ A V G Q
Sbjct: 447 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
N + NS V +C KH+ AY + R + ++E+ + +T+L PF+
Sbjct: 501 -----WNLWENNS----VLACAKHWVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKA 548
Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
C+ G + M ++N +NGIP+ A P LL +RG+W+ +G++V+D ++++ +V + +
Sbjct: 549 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLV--AQGV 605
Query: 324 ADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
A+ +DA +G+D+D Y + ++ GK+ D+D S+ + + LG F
Sbjct: 606 AEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665
Query: 383 DGSPQYVS--LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
++ + Q I E ++ A + A + VLLKND +TLPL + V+++AVVGP A
Sbjct: 666 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724
Query: 441 NATVAMIGNY-AGIPCRYMSPIAGFSGYAN--------VTYKTGCDDVACKSNNSIFAAS 491
+ ++G++ A R+++ + G N V Y GC D + + A
Sbjct: 725 DNQTELLGSWRARGEDRHVTTV--LQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAV 781
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
+ A +D I + G + ES R L LPG Q +LI ++ K PV++V+M+ +
Sbjct: 782 KLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPL 840
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD-YVQMLPLTS 610
I + + N+ AIL + G G AIAD++FG +NP GRL I++ + V +
Sbjct: 841 SIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYK 898
Query: 611 MPLRPVD-SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
RP D R N P LYPFGYGLSYT F Y+ T+
Sbjct: 899 KSGRPGDMPHSSTTRHIDVPNAP-LYPFGYGLSYTTFSYSAPQSTQK------------- 944
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
YT + V N G DG + V +Y +K
Sbjct: 945 -EYTRQET-----------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVK 986
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++ F+++F++AG +K ++F + +L D A N ++ GE I G
Sbjct: 987 ELKAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034
>gi|374312362|ref|YP_005058792.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358754372|gb|AEU37762.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 874
Score = 261 bits (668), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 160/425 (37%), Positives = 228/425 (53%), Gaps = 42/425 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R+ +L+++MT+ E++ QL D A + RLGLP Y WW+E LHG++ G AT
Sbjct: 38 RIDELIAKMTVSERIAQLQDRAPAIERLGLPSYNWWNEGLHGLARDG----------YAT 87
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMY------NLGR-AGLTYWSPNINVARDPR 176
FP I A+++ L ++G VSTEARA + N R GLT WSPNIN+ RDPR
Sbjct: 88 VFPQAIGLAATWDAPLLHEVGDVVSTEARAKFYSHGGENTPRFGGLTVWSPNINIFRDPR 147
Query: 177 WGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDVD 234
WGR ET GEDPF+ +V G+Q + P LK + KH+AA+
Sbjct: 148 WGRGQETYGEDPFLTATLGTQFVEGVQ-----------GNDPFYLKADATPKHFAAHSGP 196
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
+G D F+A V+ D+ +T+L F A+++MCSYN ++G PSCA L
Sbjct: 197 E-EGRDS--FNAVVSPHDLADTYLPAFHALTTNAHAAALMCSYNEIDGTPSCASGNNLQD 253
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
VR W GY+V+DCD++ + H F D+ A A L AG+DLDCG Y + +
Sbjct: 254 LVRERWGFKGYVVSDCDAVGNIAGYHHFATDNAHGA-ADALNAGVDLDCGNTYAALS-KS 311
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAARE 411
+ Q E ++++L L +RLG D SP Y +G +++ S + LA AA E
Sbjct: 312 LDQNLTTEAKLNQALHRLLLARVRLGMLDPLSCSP-YRDIGAEELDSPAHHTLALRAAEE 370
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SGYANV 470
IVLLKND LPL A + V+V+GP A+ + NY G ++P+ GF S + +V
Sbjct: 371 SIVLLKND-GVLPLQ-ASTQKVSVIGPTADMVKVLEANYHGTALHPITPLDGFRSRFHDV 428
Query: 471 TYKTG 475
+Y G
Sbjct: 429 SYAQG 433
Score = 125 bits (314), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 140/299 (46%), Gaps = 55/299 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
A + A +D + GL +E E+L DR L LP Q L++++ ++ K
Sbjct: 598 AVQTAAKSDVIVAFVGLSPDLEGEALQLRLKGFNGGDRTSLDLPEAQRTLLSRLTQLHK- 656
Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
PVI+V+ S GV A + +L A YPGE GG A+A ++ G NP GRLP+T+Y
Sbjct: 657 PVIIVLTSGSGV--ALGPEAKDAAGVLEAWYPGEAGGEALAGILAGNVNPSGRLPVTFYR 714
Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
+ +P S+ + RTY++++GP L+PFGYGLSY+ F+Y
Sbjct: 715 S-------VDDLPAFTDYSMAH--RTYRYFDGPVLFPFGYGLSYSHFQYG---------- 755
Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
L ++ KT P V + V N +G++V +Y +P
Sbjct: 756 ---------QLRLSTHMLKTSEPLVAM------------VTVHNESQREGTEVAELYLQP 794
Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
P A + + G QRV +R G + + F A L+ VD + + AGE+ +FVG
Sbjct: 795 PQASGAPRLT-LQGVQRVALRPGETRELTFKL-APGQLSTVDTSGARTVRAGEYKLFVG 851
>gi|316980598|dbj|BAJ51947.1| putative beta-D-xylosidase [Glycyrrhiza uralensis]
Length = 285
Score = 261 bits (668), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 128/277 (46%), Positives = 184/277 (66%), Gaps = 10/277 (3%)
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
GLD S+EAE DR L LPG+Q +L+++VA VA+GPVILV+MS G +D++FA+ + I A
Sbjct: 2 GLDQSIEAEFRDRVGLLLPGHQQELVSRVARVARGPVILVLMSGGPIDVSFAKNDPKISA 61
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
ILW GYPG+ GG AIADV+FG NPGGRLP+TWY +Y+ +P+T+M +RP + GYPGR
Sbjct: 62 ILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQNYLAKVPMTNMDMRPNPATGYPGR 121
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
TY+FY GP ++PFG+GLSYT+F ++L K + V LQ N ++ + V
Sbjct: 122 TYRFYKGPVVFPFGHGLSYTRFTHSLAIAPKQVSVPFATLQAFTNSTVSTSKA------V 175
Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSK-PPAEIAATYIKQVIGFQRVFVRAG 742
V+ CD F VD +N GS DG++ ++V+SK PP + +AT KQ++ F + +V AG
Sbjct: 176 RVSHANCDAMEVGFHVDVKNEGSMDGTNTLLVFSKPPPGKWSAT--KQLVSFHKTYVPAG 233
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
+R+K + CK L++VD +P GEH + +G+
Sbjct: 234 SKQRVKVGVHVCKHLSVVDEFGIRRIPMGEHELQIGD 270
>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 858
Score = 261 bits (668), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 176/466 (37%), Positives = 242/466 (51%), Gaps = 56/466 (12%)
Query: 47 LQMSSFLFCDS----SLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
L +FLF S LPY +R DL++R+TL EK + + + +PRLG+
Sbjct: 6 LTFIAFLFTVSLVAQQLPYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKA 65
Query: 96 YEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
YEWW+EALHGV G AT FP I ASFN L AVS EARA
Sbjct: 66 YEWWNEALHGVGRSGV----------ATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKS 115
Query: 156 N-------LGR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
N L R GLTYW+PN+N+ RDPRWGR ET GEDP++ V V+GLQ +
Sbjct: 116 NKFSEQGGLKRYQGLTYWTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQGPD- 174
Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVK 266
N+ K+ +C KH+A + W +R+ F+A + +D+ ET+L F+ V+
Sbjct: 175 -------NAEYDKLHACAKHFAVHSGPEW---NRHSFNAENINPRDLWETYLPAFKALVQ 224
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV--DNHKFLA 324
+ D VMC+YNR P C +LL Q +R +W G +V+DC +I + H
Sbjct: 225 KADVKEVMCAYNRFEDEPCCGSNRLLTQILRNDWKFDGLVVSDCWAISDFYKPNAHATQP 284
Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
D+ A A + G DL+CG + N AV+ G ++E ID SLK L LG +
Sbjct: 285 DATH-AAANAVLNGTDLECGSDFRNLP-EAVKAGLIEEKRIDVSLKRLLKARFELGEMN- 341
Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
S Q + + S+++ LA A E IVLL+N+ N LPL S K+K +AV+GP+AN +V
Sbjct: 342 SDQVWPISYSVVNSEKHQNLALRMAEESIVLLQNNNNILPL-SKKLK-IAVMGPNANDSV 399
Query: 445 AMIGNYAGIPCRYMSPIAG----FSGYANVTYKTGCD---DVACKS 483
GNY G P ++ + F G A + Y+ GCD DVA S
Sbjct: 400 MQWGNYNGFPAHTVTLLEAMRKSFPG-AQLIYEPGCDRTMDVAVSS 444
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/301 (25%), Positives = 133/301 (44%), Gaps = 56/301 (18%)
Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
A+ K AD + G+ S+E E + DR D+ LP Q +L+ + + K
Sbjct: 587 ASIAKVKDADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQALKDAGK 646
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
+V ++ G + + +AIL A YPG+ GG A+A+V+ G +NP GRLP+T+Y
Sbjct: 647 K---VVFVNFSGSAMGLVPETQSCEAILQAWYPGQAGGTAVANVLLGNYNPSGRLPVTFY 703
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
V LP + GRTY++ L+ FGYGLSYT+F +L K
Sbjct: 704 KN--VAQLP-------DFEDYSMKGRTYRYMTEKPLFSFGYGLSYTKF---VLGTAK--- 748
Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
LNK + ++ ++ + V N G G++V+ VY +
Sbjct: 749 --LNK-----------------------SSIKANETLKITVPVTNAGKVAGTEVLQVYVR 783
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFV 777
++ K + GF++V + G+ +I + + D+ ++ GE+ ++
Sbjct: 784 KVKDVDGP-AKTLRGFKKVNIEPGKTSQISIDLTSS-AFEFYDWTQRKMMVTPGEYEVYY 841
Query: 778 G 778
G
Sbjct: 842 G 842
>gi|313204103|ref|YP_004042760.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312443419|gb|ADQ79775.1| glycoside hydrolase family 3 domain protein [Paludibacter
propionicigenes WB4]
Length = 1278
Score = 261 bits (668), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 157/427 (36%), Positives = 240/427 (56%), Gaps = 35/427 (8%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
++ +++ + R DLVSRMTL+EK QLG+ +PRLG+ +Y+ W EALHGV VG
Sbjct: 38 IYLNTAYSFKERAADLVSRMTLEEKQSQLGNTMPPIPRLGVNKYDVWGEALHGV--VGRN 95
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
+ + ATSFP + ++++ +L K+ V+ EAR + LTYWSP I A
Sbjct: 96 NNSGMI---ATSFPNSVAVGSTWDPALIKRETSVVADEARGFNHDLIFTLTYWSPVIEPA 152
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ET GEDPF+V + +V+GL ++ T L + P C KHY A
Sbjct: 153 RDPRWGRTAETFGEDPFLVSQIGSGFVQGLM----GDDPTYLKTVP-----CGKHYFA-- 201
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
N +R++ A + ++DM E +L P+ +++ S+M +Y+ VNG+P A L+
Sbjct: 202 --NNSEFNRHNGSANMDDRDMREFYLTPYRTLIQKDKLPSIMTAYSAVNGVPMSASKFLV 259
Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
+ + + L GY+ DCD++ +V++H++ A SK +A A LK G+D DCG Y
Sbjct: 260 DTIAKRTYGLDGYVTGDCDAVADVVNSHRY-AKSKAEAAAMGLKTGVDSDCGGIYQTSAL 318
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEA 408
A++QG + E D+DK+L +YT+ MRLG FD PQ Y + I + +LA E
Sbjct: 319 EALKQGLISEADMDKALVNIYTIRMRLGEFD--PQNIVPYAGIKPSIINDPSHNDLALEI 376
Query: 409 AREGIVLLKND------QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI--PCRYMSP 460
A + VLLKN+ + LPLN+ +K +AV+GP A+ +G+Y+G P ++P
Sbjct: 377 ATKSPVLLKNNLVGKSGKKALPLNAGTIKKIAVLGPQADK--VELGDYSGEADPKYKITP 434
Query: 461 IAGFSGY 467
+ G Y
Sbjct: 435 LEGIKNY 441
Score = 132 bits (332), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 87/259 (33%), Positives = 126/259 (48%), Gaps = 39/259 (15%)
Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
+ A +AD ++ G D + E DR + LPG Q +LI +A V I+VI G V
Sbjct: 610 DMAASADVAVVFVGTDQTTGREESDRFAITLPGNQNELIKSIAAVNPN-TIVVIQGMGMV 668
Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTS 610
++ + N N+ I++ GY G+ G A+A V+FG NPGG+ +TWY + LP LT
Sbjct: 669 EVEQFKNNPNVAGIIFTGYNGQAQGTAMAKVLFGDVNPGGKTSLTWYKS--INDLPALTD 726
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
LR G GRTY ++N Y FGYGLSYT F Y+ + +KT
Sbjct: 727 YTLR--GGAGKNGRTYMYFNKDVSYEFGYGLSYTTFAYSNFNISKT-------------- 770
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY--I 728
+ +D VD +N G+ DG +VV +Y K P A+ I
Sbjct: 771 -----------------SITPNDKVTVTVDVKNTGTVDGDEVVQIYVKTPDSPASLERPI 813
Query: 729 KQVIGFQRVFVRAGRNKRI 747
K++ GF+RV + AG+ K +
Sbjct: 814 KRLKGFKRVAIPAGQTKTV 832
>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 863
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 162/458 (35%), Positives = 242/458 (52%), Gaps = 45/458 (9%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
Q S + + D+ L R DL+ R+TL+EKV + + + +PRLG+ YEWW+EALHGV+
Sbjct: 22 QPSKYPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81
Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA---MYNLG-----R 159
G AT FP I ASFN+ L ++ AVS EARA +N
Sbjct: 82 RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRY 131
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
GLT W+PN+N+ RDPRWGR ET GEDP++ GR + VRGLQ E E
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD-------- 183
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
K+ +C KH+A + W +R+ F+A + +D+ ET+L F+ V++ VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-----QVMVDNHKFLADSKEDAVAQ 333
R G P C +LL Q +R +W G +V DC +I + + H A + DAV
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVL- 299
Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
+G DL+CG + + T +AV++ + E I+ S+K + LG + + + ++
Sbjct: 300 ---SGTDLECGGNFKSIT-DAVKKDLISEEKINTSVKRVLKARFELGEMNSTHPWSNIPF 355
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
I ++ ELA + A E +VLL+N+ N LPLN + VAV+GP+AN +V GNY G
Sbjct: 356 SVIDCPKHKELALKMAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGF 413
Query: 454 PCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
P ++ + G A + Y+ C + +S+F
Sbjct: 414 PSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 131/296 (44%), Gaps = 56/296 (18%)
Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
++AD I G+ +E ES+ DR ++ LP Q +++ A + K V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654
Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
++ G +A N AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y +Q
Sbjct: 655 FVNFSGSAMAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS--MQ 712
Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
LP + GRTY+F LYPFGYGLSYT+F Y + +N +KL
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKAT------LNQSKL 759
Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
T + NVG DG +VV VY P +
Sbjct: 760 TKGEKAILT-------------------------IPVSNVGQRDGEEVVQVYICRPDDKE 794
Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA-GEHTIFVGN 779
K + GFQRV + G+ + ++ S D A NT+ P G + I GN
Sbjct: 795 GPQ-KTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYGN 848
>gi|423287910|ref|ZP_17266761.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
CL02T12C04]
gi|392671925|gb|EIY65396.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
CL02T12C04]
Length = 782
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 218/741 (29%), Positives = 340/741 (45%), Gaps = 154/741 (20%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA HG +G AT FPT I A+++ L K++GQ ++
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSLELVKEVGQVIAK 176
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R+ + G + P +++ RDPRW R+ ET GEDP + G + V GL
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL------- 224
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
+L+ + +++ KH+ AY V Y A V +D+ + FL PF + G
Sbjct: 225 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDSG- 279
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
A SVM SYN ++GIP ++ LL Q +R EW G++V+D SI+ + ++H F+A +KE+
Sbjct: 280 ALSVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFCGFVVSDLYSIEGIHESH-FVALTKEN 338
Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
A Q++ AG+D+D G YTN +AVQ G++ + ID ++ + + +G F+
Sbjct: 339 AAIQSVTAGVDVDLGGDAYTNLC-HAVQSGQMDKAVIDTAVCRVLRMKFEMGLFEHPYVD 397
Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
+ + + E+IELA + A+ I LLKN+ + LPL S + VAV+GP+A+ M+G
Sbjct: 398 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKTINKVAVIGPNADNRYNMLG 456
Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGC---DDVACKSNNSIFAASE 492
+Y GI + +SP V Y GC D + +I AA
Sbjct: 457 DYTAPQEDSNVKTVLDGILTK-LSPF-------RVEYVRGCAIRDTTVNEIEQAIKAARR 508
Query: 493 AA------------------KTADATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQV 533
+ K A + G +E E DR L L G Q +L+ +
Sbjct: 509 SEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESL 568
Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
+ K P+I+V + ++ +A + A+L A YPG+EGG AIADV+FG +NP GRL
Sbjct: 569 QKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGRL 625
Query: 594 PIT----------WYNG------DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
PI+ +YN DYV+M +S P LY F
Sbjct: 626 PISVPRSVGQIPVYYNKKAPRNHDYVEM---SSFP---------------------LYSF 661
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
GYG+SYT F+Y+ L V+ RC FE
Sbjct: 662 GYGMSYTTFEYSDLQ-------------------------------VVQKSARC---FEV 687
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
+N G DG +V +Y + +KQ+ F+R ++ G K++ FV +
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDF 746
Query: 758 NIVDYAANTLLPAGEHTIFVG 778
+V+Y ++ +G + +G
Sbjct: 747 FLVNYTLKKVVESGNFHLMIG 767
>gi|393782348|ref|ZP_10370533.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
CL02T12C01]
gi|392673619|gb|EIY67078.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
CL02T12C01]
Length = 852
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 163/415 (39%), Positives = 235/415 (56%), Gaps = 39/415 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
LF D + P R+ DL+SR+T++EK+ L + A +PRL + +Y +EALHG+ V PG
Sbjct: 29 LFRDMNAPQHERLLDLLSRLTIEEKISLLVNDAREIPRLNIDKYYHGNEALHGI--VRPG 86
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY---NLGR---AG----L 162
T FP I A++N L ++ A+S EAR + + G+ AG L
Sbjct: 87 EF--------TVFPQAIGLAATWNPGLIFEVSSAISDEARGRWKELDYGKKQIAGASDLL 138
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDPF+ G +V+GLQ + R LK
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPFLTGVIGCEFVKGLQGD---------HPRYLKTV 189
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+AA + ++ +R +AR++E+D+ E +L FE C+ + A S+M +YN VNG
Sbjct: 190 STPKHFAANNEEH----NRSSCNARMSERDLREFYLPSFERCIVDAKAQSIMMAYNAVNG 245
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + L+ +RG+W +GYIV+DC + + MV HK++ D + A +KAGLDL+
Sbjct: 246 VPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMVTKHKYVRDL-DAAATLAIKAGLDLE 304
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG + YT A + V + DID + + M LG FD Q Y + I
Sbjct: 305 CGDRVYTAPLLKAYNESMVSKADIDSAAYRVLRGRMLLGLFDDPSQNPYNQIEPSVIGCK 364
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
++ ELA E AR+ +VLLKN +N LPLN KVK++AVVG NA G+Y+GIP
Sbjct: 365 KHQELALETARQSMVLLKNQKNFLPLNLKKVKSIAVVG--INAGHCEFGDYSGIP 417
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 95/291 (32%), Positives = 141/291 (48%), Gaps = 49/291 (16%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +AAK D T+ + G++ S+E E DR L LP Q + I ++ +V V++++
Sbjct: 597 AGKAAKECDVTVAVLGINKSIEREGQDRYSLELPTDQQEFIRELYKVNPNTVVVLV---A 653
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
G +A + N+ AIL A YPGE+GG AIA+V+FG +NPGGRLP+T+YN L
Sbjct: 654 GSSLAINWIDENVPAILNAWYPGEQGGTAIAEVLFGDYNPGGRLPLTYYNS-------LD 706
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+P D+ RTY+++ G LY FGYGLSYT+F Y K ++
Sbjct: 707 ELP--SFDNYSVQNRTYQYFKGKPLYEFGYGLSYTKFNY--------------KKKNVSI 750
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
N T D + FKV N G DG +V VY + P +K
Sbjct: 751 ANDTIDIT-------------------FKV--SNAGKYDGDEVAQVYVQYPETGTYMPLK 789
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVGN 779
Q+ GF RV ++ G++ + K L D + P G++ +G+
Sbjct: 790 QLRGFSRVHIKKGKSADVTISVPK-KELRYWDEKTRQFVTPEGKYVFLIGS 839
>gi|383302737|gb|AFH08276.1| hypothetical protein [uncultured bacterium]
Length = 768
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 211/710 (29%), Positives = 342/710 (48%), Gaps = 123/710 (17%)
Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
+A+HG +N P T +PT I +SF+ + KI + + E RAM NL
Sbjct: 134 DAIHGNANA----------PDNTVYPTNIGLASSFDPEMAYKIARQTAAEMRAM-NL--- 179
Query: 161 GLTYWS--PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
+W+ PN++V RDPRWGR+ ET GEDP+++ V G + V+G++ D P
Sbjct: 180 ---HWTFNPNVDVVRDPRWGRVGETFGEDPYLIS------VLGAESVKGYQGTLDT---P 227
Query: 219 LKVSSCCKHY--AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
V +C KH+ + + G V+E+ + E L PFE V+ G A S+M S
Sbjct: 228 NDVLACIKHFVGGGFPANGTNGSP-----TDVSERTLREVLLPPFEAGVEAG-AGSLMTS 281
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
+N VNGIP+ ++ L+ +RGEW G++V+D I+ + D H+ A++ ++A Q++
Sbjct: 282 HNEVNGIPAHSNEWLMRDVLRGEWGFKGFVVSDWMDIEHIYDLHR-TAENLKEAFYQSIM 340
Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD 395
AG+D+ G Y+ V++G++ E+ ID+S++ + V RLG F+ + +
Sbjct: 341 AGMDMHMHGIYWNELVCELVREGRIPESRIDESVRRILDVKFRLGIFENPYADEARTMEV 400
Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI-- 453
S + A EAAR IVLLKND LPL+++K K V V G +A+ ++G+++
Sbjct: 401 RLSPGHRATALEAARNSIVLLKND-GVLPLDASKYKRVMVTGINADDE-NILGDWSASQR 458
Query: 454 PCRYMSPIAGFSGYANVTYKTGCD---DVACKSNNSIFAASEAAKTADATIILAG----- 505
P + + G A T+ D + S + A+E A+ AD I++AG
Sbjct: 459 PENVTTILEGLREVAPDTHFEFVDQGWNPQTMSPAQVEKAAEHARHADLNIVVAGEYMMR 518
Query: 506 --LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
L E DR D+ L G Q +LI +VA K P IL++++ + + +A N+
Sbjct: 519 HRWALRTGGEDTDRSDIDLVGLQNELIEKVAASGK-PTILILVNGRQLGVEWAA--ENLP 575
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
AI+ A PG GG+A+A++++G NP +LP+T +P R V G
Sbjct: 576 AIVEAWEPGMYGGQAVAEILYGTVNPSAKLPVT---------IP------RSV------G 614
Query: 624 RTYKFYN-GPTLY--------------PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+ +YN P+LY PFG+GLSYT ++Y+
Sbjct: 615 QIQMYYNHKPSLYFHPYAAGKSSSPLWPFGFGLSYTTYEYS------------------- 655
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
+L +SD ++ D + V +N GS DG +++ +Y + +
Sbjct: 656 DLRLSSD------------EIAADGTLDVTVRVKNTGSRDGVEIIQLYIRDLYSSVTRPV 703
Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
K++ F RV ++AG K I F K L +D ++ GE + VG
Sbjct: 704 KELKDFGRVALKAGETKDITFTITPDK-LQFLDKDLRPVVEPGEFVVMVG 752
>gi|410613210|ref|ZP_11324278.1| beta-glucosidase [Glaciecola psychrophila 170]
gi|410167352|dbj|GAC38167.1| beta-glucosidase [Glaciecola psychrophila 170]
Length = 743
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 206/727 (28%), Positives = 344/727 (47%), Gaps = 104/727 (14%)
Query: 59 LPYSIR---VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF 115
L YSIR V +++ + L V +L A RLG+P +G
Sbjct: 53 LAYSIRQGRVGSILNEVRL-HTVNELQRLAVEESRLGIPLL------------IG----- 94
Query: 116 DDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVAR 173
DVI G T FP + AS+ K+ + EA ++ G+ + ++P I+++R
Sbjct: 95 RDVIHGFNTIFPIPLGQAASWCVETVKQCAHISALEAASV------GVNWTFAPMIDISR 148
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGRI E+ GEDP++ V ++G Q E H+N + +++C KH+A Y
Sbjct: 149 DPRWGRIAESLGEDPYLCSVLGVAMLQGFQGDELHKNGS--------IAACAKHFAGYGA 200
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
R + + E ++ +L PF+ G A+ M +++ +NG+P+ + L+
Sbjct: 201 GE---SGRDYSTTNIPENELRNVYLPPFKAAADAGVAT-FMAAFSDLNGVPASGNKWLMT 256
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTG 352
+R EWD G++V+D +S+ + + H F D+K DA + AG+D++ Y
Sbjct: 257 DILREEWDYKGFVVSDWESV-IQLTTHGFSKDNK-DAAYEAANAGIDMEMVSSAYFEHLP 314
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREG 412
+ V +G++ I+ ++K + + +LG FD SL + + S +N++ A +AA +
Sbjct: 315 DLVAEGRIDMRQINNAVKKILHLKWQLGLFDSPYTDASLLPKPLNS-QNLQAAKDAAIKS 373
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAG----FSG 466
VLLKND+N LPL++ + +VA++GP A+ +G + G P + + SG
Sbjct: 374 CVLLKNDKNILPLSAGSLHSVAIIGPLADDPYEQLGTWIFDGDPQHSQTCLTAITQELSG 433
Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
AN+ + A ++A TAD I++ G + + E+ R ++ LPG Q
Sbjct: 434 KANIHHVKAMQTSRSHDQADFKQAVKSASTADVAILILGEESILSGEAHCRAEIDLPGCQ 493
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
QLIN +AE P++LVIM+ G + + A+L+A +PG GG AIAD++FGK
Sbjct: 494 EQLINAIAETGT-PIVLVIMA--GRPLTIETVLPKVDAVLFAWHPGTMGGPAIADLLFGK 550
Query: 587 FNPGGRLPITWYNGD------YVQ-----------MLPLTSMPLR-PVDSLGYPGRTYKF 628
P G+LP+T+ Y Q + + ++P+ P SLG
Sbjct: 551 ACPSGKLPVTFPRKVGQVPIYYAQKHSGKPATEQAFIHMDNIPVHSPQTSLGMAATHLDT 610
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
+ P L+PFG+GLSYTQF Y L +L H
Sbjct: 611 HFSP-LFPFGFGLSYTQFSYQNL-----------ELSH--------------------KT 638
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
L+ + +V NVG TDG ++ +Y + +K++ F+RV + AG+N+ +
Sbjct: 639 LKLGETLVVRVLLTNVGDTDGEEIAQLYIRDLVGSVTRPVKELKDFKRVKLTAGKNEWVT 698
Query: 749 FVFNACK 755
F + K
Sbjct: 699 FELSTDK 705
>gi|354582345|ref|ZP_09001247.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
gi|353199744|gb|EHB65206.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
154]
Length = 765
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 199/708 (28%), Positives = 330/708 (46%), Gaps = 109/708 (15%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
E V ++ +A RLG+P E HG +G AT FP + +++
Sbjct: 89 EAVNEIQRYAVEHSRLGIPIL-IGEECSHGHMAIG-----------ATVFPVPLSLGSTW 136
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
N L++++ +AV+ E R+ + G +SP ++V RDPRWGR E GEDP+++G +A
Sbjct: 137 NTELYREMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLIGEFA 191
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDME 254
V GLQ A+ V++ KH+ Y + + H R ++
Sbjct: 192 AASVEGLQGESLDGEAS--------VAATLKHFVGYGSSEGGRNAGPVHMGTR----ELM 239
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
E + PF+ V+ G A+S+M +YN ++G+P + +LL+ +R EW G ++ DC +I
Sbjct: 240 EVDMYPFKKAVEAG-AASIMPAYNEIDGVPCTVNEELLDGVLRKEWGFDGMVITDCGAIN 298
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
++ H D DA + AG+D++ G+ + + AVQ+ ++ + +D++++ +
Sbjct: 299 MLAAGHDTAEDGM-DAAVSAISAGIDMEMSGEMFGMYLERAVQEKRLDVSVLDEAVRRVL 357
Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
T+ +LG F+ + +Q I + E+A + A EGIVLLKN+ +TLPL S + +
Sbjct: 358 TLKFKLGLFENPYADPARAEQVIGCSRHREMARQLAAEGIVLLKNEGSTLPL-SKEDGVI 416
Query: 434 AVVGPHANATVAMIGNYAG--IPCRYMSPIAGFSG-----YANVTYKTGCDDVACKSNNS 486
AV+GP+A+ +G+Y P R ++ + G V Y GC + S
Sbjct: 417 AVIGPNADQGYNQLGDYTSPQPPSRVVTVLEGIRAKLGGDKGRVLYAPGC-RINGDSREG 475
Query: 487 IFAASEAAKTADATIILAG-----------LDLSVEA--------------ESLDREDLW 521
A A AD +++ G +DL A E +DR L
Sbjct: 476 FELALSCAGQADTVVLVLGGSSARDFGEGTIDLRTGASKVTGNDWSDMDCGEGIDRMTLQ 535
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
L G Q +L ++ ++ K LV++ G IA + + AIL A YPG+EGG A+AD
Sbjct: 536 LSGVQLELAREIHKLGK---RLVVVYINGRPIAEPWIDRHADAILEAWYPGQEGGHAVAD 592
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
++FG NP G+L I+ +V LP+ R G+ Y + YPFGYGL
Sbjct: 593 ILFGDVNPSGKLTISIPK--HVGQLPVYYNGKRS------RGKRYLEEDSQPQYPFGYGL 644
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
SYT+F+Y+ L T +R + V+
Sbjct: 645 SYTEFRYSDLQVTP-------------------------------QTIRTGETAVVTVNV 673
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
+N GS G++VV +Y A K++ GF+++++ G +RI+F
Sbjct: 674 ENSGSVAGAEVVQLYINDAASRFTRPAKELKGFRKIYLEPGEKQRIEF 721
>gi|295135338|ref|YP_003586014.1| glycoside hydrolase [Zunongwangia profunda SM-A87]
gi|294983353|gb|ADF53818.1| glycoside hydrolase family protein [Zunongwangia profunda SM-A87]
Length = 764
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 211/734 (28%), Positives = 336/734 (45%), Gaps = 107/734 (14%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
EK++ D+A R+G+P S+ +HG T+FP + T AS+
Sbjct: 90 EKIRVAQDYAVNDTRMGIPLL-IGSDVIHGYK---------------TTFPIPLGTAASW 133
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
+ + KK + + EA A G+ + +SP +++ARDPRWGRI E GEDP++ +
Sbjct: 134 DMEMIKKTAEIAAQEATA------DGINWNFSPMVDIARDPRWGRIAEGAGEDPYLGSQI 187
Query: 195 AVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDM 253
A V G Q D EN + + KH+A Y R + ++ M
Sbjct: 188 AKAMVEGYQGDDLAKENT---------MIATVKHFALYGASE---AGRDYNTTDMSRVKM 235
Query: 254 EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI 313
+L P++ + G A SVM S+N V+G+P+ + LL +R W G++ +D S+
Sbjct: 236 FNEYLPPYKAAIDAG-AESVMSSFNDVDGVPATGNKWLLTDLLRDRWGFEGFVTSDYTSL 294
Query: 314 QVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYL 372
M+ + + D + A LKAGLD+D G+ Y ++ +GKV E +I + + +
Sbjct: 295 NEMIAHG--MGDLQA-VSALALKAGLDMDMVGEGYLKTLKKSLDEGKVTEAEITTAARRI 351
Query: 373 YTVLMRLGFFDGSPQYV--SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKV 430
+LG FD +Y+ S ++DI S+EN + + A VLLK D PL K
Sbjct: 352 LEAKYKLGLFDDPYKYLDESRPEKDILSEENRTFSRKVAAHSFVLLKKDAGVFPLK--KN 409
Query: 431 KTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSGY---ANVTYKTGC---DDVACK 482
+A++GP AN M+G +A G P + + G A VTY G DD
Sbjct: 410 AKIALIGPLANNKNNMLGTWAPTGNPQLSVPVLQGVKNVAPKAKVTYAQGANITDDAQLA 469
Query: 483 SNNSIFA----------------ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
N ++F A + AK +D + + G + E+ R +L +P Q
Sbjct: 470 ENINVFGPRAEISETSPEKMLEEALKVAKKSDVIVAVVGEATEMSGEAASRTNLLIPESQ 529
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
+LI ++A+ K P+ LV+MS ++I+ E+ NI IL +PG E G AIADV+FG
Sbjct: 530 KKLIRELAKTGK-PMALVLMSGRPLNIS-EESEMNID-ILQVWHPGVEAGNAIADVIFGD 586
Query: 587 FNPGGRLPITW-YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT--LYPFGYGLSY 643
+NP G++ +W N V + RP + G+ +F + P LYPFGYGLSY
Sbjct: 587 YNPSGKITASWPRNVGQVPVYYAMKRTGRPGEVEGFQKFKSEFLDTPNSPLYPFGYGLSY 646
Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
T+F+Y SD + ++L+ D N
Sbjct: 647 TEFEY-------------------------SDVKAS------ADELKMDGTLTLSAIITN 675
Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
G DG +VV +Y +KQ+IGF+++ ++ G +K + F +A + L + +
Sbjct: 676 TGDYDGEEVVQLYIHDKVRSITPPMKQLIGFEKIMLKKGESKTVTFEISA-EDLKFYNSS 734
Query: 764 ANTLLPAGEHTIFV 777
+ GE F+
Sbjct: 735 LEYVAEPGEFEFFI 748
>gi|423342899|ref|ZP_17320613.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
CL02T12C29]
gi|409217154|gb|EKN10133.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
CL02T12C29]
Length = 955
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 222/810 (27%), Positives = 358/810 (44%), Gaps = 146/810 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D + P RV+DL+S+M ++EK Q+ +G R+ LP +W W
Sbjct: 60 VYEDPTAPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTPDWKNQLWKDGMGA 118
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 119 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 178
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L K+G E R + G T ++P ++V RD RWG
Sbjct: 179 YIATNFPTQLGLGHTWNRDLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWG 232
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V V +G+Q TD +V++ KHY AY +
Sbjct: 233 RYEEVYGESPYLVAELGVEMAKGMQ--------TDY-----QVAATSKHYIAYSNNKGGR 279
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + P++ +KE VM SYN +G P + L +RG
Sbjct: 280 EGMARVDPQMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRG 339
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E+ GY+V+D D+++ + H AD KE +V Q++ AGL++ C Y
Sbjct: 340 EFGFRGYVVSDSDAVEYLFSKHGTAADMKE-SVLQSVLAGLNIRCTFRSPDSYVLPLREL 398
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
+ +G + + ID ++ + V +G FD P + L + D + S EN ++A +A++E
Sbjct: 399 IAEGALPMSTIDDRVRDILRVKFLVGLFD-QPYQIDLKQADKEVNSAENQQVALQASKES 457
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----A 468
+VLLKN LPL+ K+ +AV GP+A+ + +Y + + + G
Sbjct: 458 LVLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGT 517
Query: 469 NVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
V + GCD V + + I A E AK +D +++ G E+
Sbjct: 518 EVLFTKGCDLVDANWPESELIRYPLTSEEQSEIDKAVENAKKSDVAVVVLGGSNRTCGEN 577
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q L+ V K PV+LV+++ + I +A+ + AIL A YPG +
Sbjct: 578 KSRSSLELPGRQLDLLQAVVATGK-PVVLVLINGRPISINWAD--KYVPAILEAWYPGSQ 634
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VD---SLGYPGRTYKF 628
GG AIAD +FG +NPGG+L +T+ V +P + P +P VD + G G +
Sbjct: 635 GGTAIADALFGDYNPGGKLTVTF--PKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV 691
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
NGP LYPFGYGLSYT F+Y+ +S I + +
Sbjct: 692 -NGP-LYPFGYGLSYTTFEYSDISIQPAIVTQVQPVT----------------------- 726
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
+RC N G G +VV +Y + TY K ++GF R+ + G K +
Sbjct: 727 VRC--------KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELT 778
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
F + L +++ + ++ G+ + VG
Sbjct: 779 FTIEP-RDLQLLNSDNHWVVEPGDFKVMVG 807
>gi|218258058|ref|ZP_03474485.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
DSM 18315]
gi|218225777|gb|EEC98427.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
DSM 18315]
Length = 955
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 222/810 (27%), Positives = 358/810 (44%), Gaps = 146/810 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D + P RV+DL+S+M ++EK Q+ +G R+ LP +W W
Sbjct: 60 VYEDPTAPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTPDWKNQLWKDGMGA 118
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 119 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 178
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
AT+FPT + ++N L K+G E R + G T ++P ++V RD RWG
Sbjct: 179 YIATNFPTQLGLGHTWNRDLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWG 232
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R E GE P++V V +G+Q TD +V++ KHY AY +
Sbjct: 233 RYEEVYGESPYLVAELGVEMAKGMQ--------TDY-----QVAATSKHYIAYSNNKGGR 279
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
D +++ +++E + P++ +KE VM SYN +G P + L +RG
Sbjct: 280 EGMARVDPQMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRG 339
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
E+ GY+V+D D+++ + H AD KE +V Q++ AGL++ C Y
Sbjct: 340 EFGFRGYVVSDSDAVEYLFSKHGTAADMKE-SVLQSVLAGLNIRCTFRSPDSYVLPLREL 398
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
+ +G + + ID ++ + V +G FD P + L + D + S EN ++A +A++E
Sbjct: 399 IAEGALPMSTIDDRVRDILRVKFLVGLFD-QPYQIDLKQADKEVNSAENQQVALQASKES 457
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----A 468
+VLLKN LPL+ K+ +AV GP+A+ + +Y + + + G
Sbjct: 458 LVLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGT 517
Query: 469 NVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
V + GCD V + + I A E AK +D +++ G E+
Sbjct: 518 EVLFTKGCDLVDANWPESELIRYPLTSEEQSEINKAVENAKKSDVAVVVLGGSNRTCGEN 577
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
R L LPG Q L+ V K PV+LV+++ + I +A+ + AIL A YPG +
Sbjct: 578 KSRSSLELPGRQLDLLQAVVATGK-PVVLVLINGRPISINWAD--KYVPAILEAWYPGSQ 634
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VD---SLGYPGRTYKF 628
GG AIAD +FG +NPGG+L +T+ V +P + P +P VD + G G +
Sbjct: 635 GGTAIADALFGDYNPGGKLTVTF--PKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV 691
Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
NGP LYPFGYGLSYT F+Y+ +S I + +
Sbjct: 692 -NGP-LYPFGYGLSYTTFEYSDISIQPAIVTQVQPVT----------------------- 726
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
+RC N G G +VV +Y + TY K ++GF R+ + G K +
Sbjct: 727 VRC--------KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELT 778
Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
F + L +++ + ++ G+ + VG
Sbjct: 779 FTIEP-RDLQLLNSDNHWVVEPGDFKVMVG 807
>gi|254295141|ref|YP_003061164.1| glycoside hydrolase [Hirschia baltica ATCC 49814]
gi|254043672|gb|ACT60467.1| glycoside hydrolase family 3 domain protein [Hirschia baltica ATCC
49814]
Length = 897
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 181/531 (34%), Positives = 264/531 (49%), Gaps = 72/531 (13%)
Query: 1 MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLP 60
M V S LL +IA L F+T + + + + S+ F F D SL
Sbjct: 1 MKSVKSILLG---TIASLAFATACSSSQTDTETAQTTEEAKSSE-------FRFMDPSLS 50
Query: 61 YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
R DLVS MTL+EK Q+ D A +PRLGL +Y WW+EALHGV+ G
Sbjct: 51 PKERALDLVSHMTLEEKAAQMYDKAAAIPRLGLHEYNWWNEALHGVARAG---------- 100
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--------GRAGLTYWSPNINVA 172
AT FP I A+++E L ++ +S E RA ++ GLT+WSPNIN+
Sbjct: 101 HATVFPQAIGMAATWDEDLMLEVANVISDEGRAKHHFYANEDVYAMYGGLTFWSPNINIF 160
Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
RDPRWGR ET GEDP++ GR AVN++ GLQ G ++ + K + KHYA
Sbjct: 161 RDPRWGRGQETYGEDPYLTGRMAVNFINGLQ---GDDD------KYFKSVATVKHYA--- 208
Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
V + R+ + T+ D+ ET+L F+ E + +SVMC+YN V G P+C +L+
Sbjct: 209 VHSGPEPSRHRDNYIATDADLYETYLPAFKTAFDETEVASVMCAYNAVWGDPACGSERLM 268
Query: 293 NQTVRGEWDLHGYIVADCDSI-QVMVDNHKFL-----------ADSKEDAVAQTLKAGLD 340
+R E GY+V+DC +I D K D++ A A ++ G D
Sbjct: 269 KDLLREELGFDGYVVSDCGAIGDFYYDEEKKAEGTAPYAAHDHVDTRAQAAALSVNMGTD 328
Query: 341 LDCGQYYTNFTG---NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV---SLGKQ 394
L+CG N AV++G + E ID+S+ LY+ L +LG +D P V ++
Sbjct: 329 LNCGDGEGNKMDALPQAVKEGLITEETIDQSVVRLYSALFKLGMYD-DPSLVPWSNISID 387
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
+ S ++E + EAAR +VLLKND LPL VAV+GP+A+ ++ NY G P
Sbjct: 388 TVASPSHLEKSEEAARASLVLLKND-GILPLKPD--TKVAVIGPNADNWWTLVANYYGQP 444
Query: 455 CRYMSPIAGFS---GYANVTYKTGC-------DDVACKSNNSIFAASEAAK 495
++ + G G NV+Y G + +N++F +EA +
Sbjct: 445 TAPVTALKGIKAKIGAENVSYSVGSTIAGDIYSNYKAVPSNTLFHKNEAGE 495
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 69/198 (34%), Positives = 99/198 (50%), Gaps = 22/198 (11%)
Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
+ G+D ++E E + DR + LP Q +L+ ++ K PV+LV S G
Sbjct: 634 LFFGGIDANLEGEEMGVELDGFLGGDRTHINLPAPQEKLLKELHATGK-PVVLVNFS--G 690
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
+A + N+ AI+ A YPGE+ G AIAD+++G+F+P GRLP+T+Y L
Sbjct: 691 SAMALNWEDENLPAIVQAFYPGEKSGTAIADLLWGEFSPSGRLPVTFYKS-------LEG 743
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
MP D RTYK+Y G LYPFG+GLSYT F+Y+ L N N +
Sbjct: 744 MPA--FDDYSMENRTYKYYEGEQLYPFGHGLSYTSFEYSDLKLETAYAANENLQVSVKVT 801
Query: 671 NYTSDASKTRCPGVLVND 688
N AS+ + D
Sbjct: 802 NSGDKASREIVQAYVTRD 819
>gi|305663349|ref|YP_003859637.1| glycoside hydrolase family protein [Ignisphaera aggregans DSM
17230]
gi|304377918|gb|ADM27757.1| glycoside hydrolase family 3 domain protein [Ignisphaera aggregans
DSM 17230]
Length = 757
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 218/790 (27%), Positives = 352/790 (44%), Gaps = 149/790 (18%)
Query: 64 RVKDLVSRMTLDEKVQQL----------------------------------GDFAHGVP 89
RV++L+ RM+++EK+ QL G A P
Sbjct: 6 RVRELIGRMSIEEKIAQLISIPLESVLDGKKFSVEKAREVLKYGVGEILRIGGSSARLSP 65
Query: 90 RLGLPQYEWWSEALHGVSNVG-PGTHFDDVI-----PGATSFPTVILTTASFNESLWKKI 143
R + Y L + +G P ++ I P AT FP + ++++ L ++
Sbjct: 66 REAVEIYNAIQRFLTRETRLGIPAIVHEESIAGLLAPTATVFPIPLALASTWDPDLVYRV 125
Query: 144 GQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
A+ + A+ +P +++ R+PRWGR ET GED ++ + YV+G+Q
Sbjct: 126 AVAIRRQIMAI-----GSRHTLAPVLDLCREPRWGRCEETYGEDSYLAASMGIAYVKGIQ 180
Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGVDRYHFDARVTEQDMEETFLRPFE 262
D+ V + KH+ + V + + + H R ++ E ++ PFE
Sbjct: 181 -------GDDIR---YGVIATGKHFVGHGVPEGGRNIASIHVGLR----ELLEIYMYPFE 226
Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
VKE + S+M +Y+ ++ +P A+ LL +RG W G V+D + ++ + H+
Sbjct: 227 ATVKEANLLSIMPAYHDIDNVPCHANKWLLTDILRGSWGFKGIAVSDYEGVKQLHTIHRV 286
Query: 323 LADSKEDAVAQTLKAGLDLD--CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLG 380
D E AV + +KAG+D++ G+ + AV++G + E I+++++ + + LG
Sbjct: 287 ARDCMEAAV-KAIKAGVDIEYPSGECFKQLV-EAVRKGLIDEDTINRAVERVLKLKFMLG 344
Query: 381 FFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
F+ + + ++ + ELA E AR+ IVLLKND LPL +KT+AV+GP+A
Sbjct: 345 LFENPFIDETKVPTTLDNEADRELAREVARKAIVLLKND-GILPLKR-DIKTIAVIGPNA 402
Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGY------------------------ANVTYKTGC 476
N AM+G+Y Y + I F G V Y GC
Sbjct: 403 NDPWAMLGDY-----HYDAHIGSFDGTYGKISPSVRIVTVLEAIKSRVSPSTEVLYAKGC 457
Query: 477 DDVACKSNNSIFAASEAAKTADATIILAG-------LDLSVEAESLDREDLWLPGYQTQL 529
D + + A E AK AD I + G L + E +DR L LPG Q +L
Sbjct: 458 DTIG-DDRSGFGEAIEIAKRADIIIAVMGDRSGLFNLKMFTSGEGVDRASLKLPGVQEEL 516
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ ++A + K P+ILV+++ G +A + + AI+ A PGEEGG AIAD++FG ++P
Sbjct: 517 LKELASLGK-PIILVLIN--GRPLALSSILPYVNAIVEAWRPGEEGGNAIADILFGDYSP 573
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFK 647
GGRLP++ LP L P+ P R Y Y L+PFGYGLSYTQF
Sbjct: 574 GGRLPVS---------LPYDVGQL-PIYYSRKPNCFRDYVEYPAKPLFPFGYGLSYTQFA 623
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
Y N ++++ R P D VD +NVGS
Sbjct: 624 YE---------------------NLVVESTEVRDP---------DTVIRVSVDVKNVGSM 653
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
G +VV +Y + ++ GF+R+ + G K + F + L D N +
Sbjct: 654 AGDEVVQLYISRDYASVTRPVAELKGFKRITLEPGEKKTVVFEI-PLELLAYYDMDMNYV 712
Query: 768 LPAGEHTIFV 777
+ GE+T +
Sbjct: 713 VEPGEYTFMI 722
>gi|383119099|ref|ZP_09939838.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
gi|251946311|gb|EES86688.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
Length = 859
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 216/768 (28%), Positives = 337/768 (43%), Gaps = 144/768 (18%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
++F + ++SLP +RV+DL+SRMTL+EK+ Q+
Sbjct: 22 TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81
Query: 84 -FAHGV---------------------PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
F G+ PRLG+P + +E+LHG V G
Sbjct: 82 GFIEGITLPGKECLTLMNEVQKYMREKPRLGIPVFTL-TESLHG-----------SVHDG 129
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T FP I ++FN L ++ A++ E L G+T +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GEDPF+V R V+ VRG D + VS KH+ A+ G++
Sbjct: 184 EECFGEDPFLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGTPQ-GGLN 228
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
++++ +L+ FE VKE +VM SYN N P+ + L+ + +R W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
D GY+ +D +I ++ HK +S E A+ Q L AGLD + V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID+++ + T +G F+ + + + ++ LA + A E IVLL+N+
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSGYANVTYK 473
N LPL K+K++AV+GP NA G+Y G+ + S + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL-LEALKERVSNQLTLNYA 462
Query: 474 TGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLPG 524
GC D+ + A + AK +D I++ G + A E D DL L G
Sbjct: 463 KGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTG 521
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q L+ + K PVI+V++S G A + NI I+ YPGE+GG A+AD++
Sbjct: 522 VQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADMLL 578
Query: 585 GKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
GK NP G+L ++ Y LP R S PG+ Y F + L+ FG+
Sbjct: 579 GKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGH 638
Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV 699
GLSYT F+Y LS T + + D C+D E +
Sbjct: 639 GLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVTI 667
Query: 700 DFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+N G DG +V VY + ++++ GF++V ++ G K++
Sbjct: 668 AIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715
>gi|423722678|ref|ZP_17696831.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
CL09T00C40]
gi|409241951|gb|EKN34716.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
CL09T00C40]
Length = 955
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 222/809 (27%), Positives = 360/809 (44%), Gaps = 144/809 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D ++P RV+DL+S+M ++EK Q+ +G R+ LP +W W
Sbjct: 60 VYEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGA 118
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 119 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 178
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
AT+FPT + ++N +L K+G E R LG + ++P ++V RD RWGR
Sbjct: 179 YIATNFPTQLGLGHTWNRNLVHKVGYITGREGRL---LGYTNV--YAPILDVGRDQRWGR 233
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
E GE P++V V +G+Q TD +V++ KHY AY +
Sbjct: 234 YEEVYGESPYLVAELGVEMAKGMQ--------TDY-----QVAATSKHYIAYSNNKGGRE 280
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
D +++ +++E + P++ +KE VM SYN +G P + L +RGE
Sbjct: 281 GMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAV 355
+ GY+V+D D+++ + H AD KE +V Q++ AGL++ C Y +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKE-SVLQSVLAGLNIRCTFRSPDSYVLPLRELI 399
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGI 413
+G + + ID ++ + V +G FD P + L + D + EN +A +A++E +
Sbjct: 400 AEGAIPMSTIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCAENQLVALQASKESL 458
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----AN 469
VLLKN LPL+ K+ +AV GP+A+ + +Y + + + G +
Sbjct: 459 VLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTD 518
Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
V + GCD V + + I A E AK +D T+++ G E+
Sbjct: 519 VLFTKGCDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENK 578
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
R L LPG Q L+ V K PV+LV+++ + I +A+ + AIL A YPG +G
Sbjct: 579 SRSSLDLPGRQLDLLQAVVATGK-PVVLVLINGRPLSINWAD--KYVPAILEAWYPGSQG 635
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VD---SLGYPGRTYKFY 629
G AIAD +FG +NPGG+L +T+ V +P + P +P VD + G G +
Sbjct: 636 GTAIADALFGDYNPGGKLTVTF--PKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV- 691
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
NGP LYPFGYGLSYT F+Y+ +S I + + +
Sbjct: 692 NGP-LYPFGYGLSYTTFEYSDISIQPAIVTQVQPVT-----------------------V 727
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
RC N G G +VV +Y + TY K ++GF R+ + G K + F
Sbjct: 728 RC--------KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTF 779
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ L +++ + ++ G+ + VG
Sbjct: 780 TIEP-RDLQLLNSDNHWVVEPGDFKVMVG 807
>gi|160882475|ref|ZP_02063478.1| hypothetical protein BACOVA_00426 [Bacteroides ovatus ATCC 8483]
gi|156112056|gb|EDO13801.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 859
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 220/798 (27%), Positives = 345/798 (43%), Gaps = 142/798 (17%)
Query: 51 SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD--------------------------- 83
SF + + LP +RV DL+ RMTL+EK+ Q+
Sbjct: 24 SFSYKNPLLPTELRVNDLLGRMTLEEKIAQIRHLHSWDVFDGQILNQEKLDKMCGGIGYG 83
Query: 84 FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
F G P RLG+P + +E+LHGV V G
Sbjct: 84 FFEGFPLTAASCRKTFREIQTYMVEKTRLGIPGFPV-AESLHGV-----------VHEGT 131
Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
T +P I ++FN L + + ++ E M +P I+V RD RWGR+ E
Sbjct: 132 TIYPQNIAMGSTFNPELAYEKTKHIAGELNTM-----GVKQVLAPCIDVVRDLRWGRVEE 186
Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
+ GEDPF+ + AV V+G + H +S KHY + + G++
Sbjct: 187 SFGEDPFLCSKMAVAEVKGYME---H-----------GISPMLKHYGPHG-NPLGGLNLA 231
Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
+ V +D+ + +L+PFE + E + +VM SYN N IP+ A +L +R +
Sbjct: 232 SVECGV--RDLFDIYLKPFEAVLAETEIMAVMSSYNSWNRIPNSASRFMLTDILRNRFGF 289
Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
GY+ +D + ++ HK AD E A Q L AG+D++ + ++ G+
Sbjct: 290 RGYVYSDWGVVSMLKTFHKTAADDFE-AARQVLTAGMDVEASSSCYAVLADKIRNGEFDI 348
Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT 422
+ ID++++ + LG F+ Q ++ + + S E+++L+ A E VLLKND
Sbjct: 349 SYIDQAVRRVLRAKFELGLFEDPYQEQAVYRLPLRSKESVKLSRRIADESTVLLKNDGQL 408
Query: 423 LPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--MSPIAGFSGY----ANVTYKTGC 476
LPLN +K+VAV+GP NA G+Y + ++P+ G + Y GC
Sbjct: 409 LPLNVRNLKSVAVIGP--NADNVQFGDYTWSKKKEDGVTPLQGIKNLLGDRVKINYAKGC 466
Query: 477 DDVACKSNNSIFAASEAAKTADATIILAG----------LDLSVEAESLDREDLWLPGYQ 526
+A + I A +AA+ +D +I G + S E +D D+ L G Q
Sbjct: 467 -SLASLDTSGIAEAVDAARHSDVALIFVGSSSTAFVRHTQEPSTSGEGIDLSDISLTGAQ 525
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
QLI +V V K PV++++++ G A NI AIL Y GE+ G +IAD++FG
Sbjct: 526 EQLIREVFAVGK-PVVVILVA--GKPFAIPWVKENIPAILAQWYAGEQEGNSIADILFGN 582
Query: 587 FNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
NP G+L ++ Y LP + + PGR Y F N L+ FGYGL
Sbjct: 583 VNPSGKLTFSFPQSTGHLPVYYNYLPTDKGYYKEPGTYEKPGRDYVFSNSSPLWAFGYGL 642
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
SYTQF+Y L +D + ND C V
Sbjct: 643 SYTQFEY---------------------LKAVTDKELYQA-----NDTVC-----VTVQL 671
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
+N G G +V+ VY + T +KQ+ GF++V + G+ + + + D
Sbjct: 672 KNTGKRTGKEVIQVYMRDVVSSVMTQVKQLKGFRKVDLLPGQTRETTIMI-PVHEFYLTD 730
Query: 762 YAANTLLPAGEHTIFVGN 779
N L +G+ + VG
Sbjct: 731 DLGNRYLESGKFELQVGT 748
>gi|86142030|ref|ZP_01060554.1| putative beta-glucosidase [Leeuwenhoekiella blandensis MED217]
gi|85831593|gb|EAQ50049.1| putative beta-glucosidase [Leeuwenhoekiella blandensis MED217]
Length = 803
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 216/724 (29%), Positives = 330/724 (45%), Gaps = 113/724 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + EA+HG +G T FP+ I ++FN L KK+G AV+
Sbjct: 135 RLGIPLF-LAEEAMHGHMAIG-----------TTEFPSAIGQASTFNPQLNKKMGAAVAK 182
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E RA + + P +++AR+PRW R+ ET GEDP+++ + + G Q EG E
Sbjct: 183 ELRA-----QGAHIGYGPILDLAREPRWSRVEETFGEDPYLISEMGLGVIEGFQG-EGIE 236
Query: 210 NATDLNSRPLKVSSCCKHYAAYDV-DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
N P V S KH+AAY V + H R QD ++ PF+ + G
Sbjct: 237 N-------PESVISTLKHFAAYGVSEGGHNGGAVHIGQRELMQD----YMYPFKKAIDAG 285
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSK 327
SVM +Y+ V+GIPS ++ LL +R +W G++V+D SI+ + D+H
Sbjct: 286 -VLSVMTAYSSVDGIPSTSNKALLTGLLREQWGFEGFVVSDLASIEGIKGDHHAAATFED 344
Query: 328 EDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
A+A + AG+D D G + + NA + GKV E +D+++KY+ + ++G F+
Sbjct: 345 AAALA--MNAGVDADLGGNGFDDELLNAFKNGKVSEARLDEAVKYVLRLKFKMGLFENPY 402
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
K+ + S +I +A E A EG+ LLKN+ LPL S ++K +AV+GP+A+
Sbjct: 403 VEEKAPKKVVRSAAHIAIAKEMALEGVTLLKNENGLLPL-SKELKKIAVIGPNADMMYNQ 461
Query: 447 IGNYAG--IPCRYMSPIAGFSG---YANVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
+G+Y P ++P+ G A +TY G + A + A A +
Sbjct: 462 LGDYTAPQEPEFIVTPLEGIRAKMPKAEITYVKGTAIRDTTQTDIPAAVAAAKSAEVAIV 521
Query: 502 ILAG---LDLSVE----------------------AESLDREDLWLPGYQTQLINQVAEV 536
+L G D E E DR L L G Q +L+ Q E
Sbjct: 522 VLGGSSARDFKTEYLETGAATVSSKEDQVLSDMESGEGYDRSTLDLMGKQLELL-QAVEA 580
Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
P ILV+++ G + +I AI+ YPG +GG A+ADV+FG +NP GRLP++
Sbjct: 581 TGTPTILVLIT--GRPLLINWPAKHIPAIIDTWYPGSQGGHALADVLFGDYNPAGRLPVS 638
Query: 597 WYNGDYVQMLPLTSMPLRPV--DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
+P S+ PV + R Y LY FG+GLSYT F Y+ L
Sbjct: 639 ---------IP-KSVGQSPVYYNHWWPKRRDYVEETSAPLYAFGHGLSYTTFDYSDL--- 685
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
K+ N T+ E V+ N G DG +VV
Sbjct: 686 --------KISQSGNATNTT--------------------IEVSVEVTNTGDRDGDEVVQ 717
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
+Y T +KQ+ GF+R+ + G +K + F+ + L + D N + AGE
Sbjct: 718 LYLSDVVSSVVTPVKQLRGFERIHLDKGESKTVTFILTPAE-LALFDAEMNHVAEAGEFE 776
Query: 775 IFVG 778
+ +G
Sbjct: 777 VQLG 780
>gi|325104789|ref|YP_004274443.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
gi|324973637|gb|ADY52621.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
12145]
Length = 802
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 227/829 (27%), Positives = 358/829 (43%), Gaps = 147/829 (17%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-FAHG-VPRLGLPQYEW- 98
F+K G++ +F D S P RV+DL+S+MT+ EK Q + +G V + +P EW
Sbjct: 39 FNKNGIKD---VFEDQSQPIEKRVEDLLSQMTVAEKTNQTATLYGYGRVLKDEMPTSEWK 95
Query: 99 ---WS-------EALHGVSN-------------------------------VGPGTHF-D 116
W EAL+ + N +G F +
Sbjct: 96 KSIWKDGIANMDEALNSLPNNKKAQTEYSFPYSKHATAINTLQKWFIEETRLGIPVDFTN 155
Query: 117 DVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNIN 170
+ I G AT F I +S+N++L +K G+ E +A+ G T ++P ++
Sbjct: 156 EGIHGLCHDRATPFCAPIGIGSSWNKNLVRKAGEIAGREGKAL------GYTNVYAPILD 209
Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
+ARDPRWGR+ E GEDPF+VG N V GLQ +++ KHYA
Sbjct: 210 LARDPRWGRVVECYGEDPFLVGELGKNMVSGLQSN--------------GIAATLKHYAV 255
Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
Y V D VT +++ + L PF+ V+E VM SYN +GIP
Sbjct: 256 YSVPKGGRDGHARTDPHVTPRELHQIHLYPFKKVVQEAKPLGVMSSYNDWDGIPVTGSYY 315
Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QY 346
L + +R ++ +GY+V+D ++++ + H+ D KE +V LKAGL++
Sbjct: 316 FLTELLRKQYGFNGYVVSDSEAVEFIASKHRVAKDFKEASVI-ALKAGLNVWTNFRQPDN 374
Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELA 405
Y N +V G + +++ ++ + +V RLG FD + + + + + E+ + A
Sbjct: 375 YINNLRASVADGSLDMETLNQRVREVLSVKFRLGLFDRPFTENPAASDKKVQTPEDKKFA 434
Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
+ +E IVLLKN + LPL+ K + + V GP A I Y S + G
Sbjct: 435 EQMNKESIVLLKNGNDFLPLDKNKNQKILVTGPLAAEVGYTISRYGPSNNPSTSILDGLK 494
Query: 466 GY----ANVTYKTGC--------------DDVACKSNNSIFAASEAAKTADATIILAGLD 507
Y N+ Y GC + V K I A AK D I + G +
Sbjct: 495 QYNNGKLNIDYAKGCEIVNEGWPGTEIIDEPVTEKEKAMIADAVAKAKNVDVIIAVVGEN 554
Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
+ ESL R L LPG Q +L+ + K PV++V+++ + I + N + AIL
Sbjct: 555 EKIVGESLSRTSLNLPGRQLELLKALHATGK-PVVMVLVNGRPLTINWE--NHYLTAILE 611
Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNG-DYVQMLPLTSMPLRPVDSLGYP---- 622
+ G G+ +A+ +FG +NPGG+L +T+ ++M + P +P P
Sbjct: 612 TWFLGPSAGKVVAETLFGDYNPGGKLSVTFPKSIGQIEM----NFPFKPGSHANQPSSGD 667
Query: 623 -GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
G NG LYPFGYGLSYT+F Y+ L
Sbjct: 668 NGFGKSRVNG-VLYPFGYGLSYTKFSYSDLKL---------------------------- 698
Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRA 741
D D +N+G DG +VV +Y + TY Q+ F+R+ ++A
Sbjct: 699 ------DFSKPDSISASFVLKNIGKRDGDEVVQLYFRDLISSVITYDTQLRAFERIHLKA 752
Query: 742 GRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
G K++ F A K L I+D N + G+ + +G+ + F
Sbjct: 753 GETKQLNLKF-ARKDLAILDKDMNWAVEPGDFEVLIGSSSEDIRLKEKF 800
>gi|409197254|ref|ZP_11225917.1| glycoside hydrolase 3 [Marinilabilia salmonicolor JCM 21150]
Length = 734
Score = 259 bits (662), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 210/761 (27%), Positives = 352/761 (46%), Gaps = 109/761 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPR--------LGLPQYEWWSEALHGVSNVG----- 110
RV+ L+ MTLDEK+ Q+ + G +G E E ++ + +
Sbjct: 23 RVEQLLGEMTLDEKIGQMCQVSGGQGNEESIRQGMIGSILNEVDPENINRLQKIAVEESR 82
Query: 111 ---PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-W 165
P DVI G T FP + A++N L +K + ++EA + G+ + +
Sbjct: 83 LGIPIIVARDVIHGFKTVFPIPLGQAATWNPELVQKGSRIAASEAAS------TGVRWTF 136
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P I+++RD RWGRI E+ GEDP++ V G Q LN +++C
Sbjct: 137 APMIDISRDARWGRIAESLGEDPYLTSVLGAAMVTGFQ-------GDSLNGE-TSIAACA 188
Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
KH+A Y R + + +++ + +L PF+ V G + M +N V+G+P+
Sbjct: 189 KHFAGYGAAEG---GRDYNTTSIPPRELRDIYLPPFKAAVDAG-VRTFMSGFNEVDGVPA 244
Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
A+ LL +R EW G++V+D S M+ NH F AD KE A + +K G+D++
Sbjct: 245 TANKYLLTDVLRNEWQFDGFVVSDWASTWEMI-NHGFAADEKE-AAHRAIKVGVDMEMAT 302
Query: 346 Y-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD-ICSDENIE 403
Y + +++G + DI+++++ + V LG FD Y++ KQ+ E +E
Sbjct: 303 TTYRDNIAALLKEGALNIEDINQAVRNILRVKFELGLFDNP--YIAEEKQNQFARPEYLE 360
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPI 461
A AA + +VLLKN+Q TLP+NS+ +A++GP A+ +G + G ++P+
Sbjct: 361 AANLAATQSMVLLKNEQKTLPINSSS--KIALIGPMADQPYEQLGTWIFDGDTTLTVTPL 418
Query: 462 AGFS---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
F+ G NV + G + A E AK +D + G + + E+ R
Sbjct: 419 QAFNKTFGQENVLFAEGMPISRTRHQKGFRKAIEQAKNSDVIVFCGGEESILSGEAHSRA 478
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
++ LPG Q +LI ++ + K P++LV+M+ G + E + + A+++A +PG GG A
Sbjct: 479 NIDLPGVQNELIKELKKTGK-PLVLVVMA--GRPLTIGEISEHADAVVYAWHPGTMGGAA 535
Query: 579 IADVVFGKFNPGGRLPIT----------WYN----------GDYVQMLPLTSMPLR-PVD 617
+AD+V GK NP G+LP+T +YN + QM +P++ P
Sbjct: 536 LADIVSGKANPSGKLPVTFPKVVGQIPIYYNHKNTGRPANPDSWTQMY---DIPVKAPQT 592
Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
SLG P LYPFGYGLSYT F+Y+ LS K + + R
Sbjct: 593 SLGNESHYIDAGFIP-LYPFGYGLSYTSFEYSDLSLDKEV--------YAR--------- 634
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
D+ E + N G G +V VY + +K++ F+R+
Sbjct: 635 --------------DETIEVRFTLSNTGEFAGEEVAQVYVRDLVGNVTRPVKELKAFERI 680
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++ G +K + + L + ++ GE ++VG
Sbjct: 681 DLQKGESKTVTLTI-PVQELAFTNIDMKQVVEPGEFQLWVG 720
>gi|409730324|ref|ZP_11271901.1| beta-glucosidase [Halococcus hamelinensis 100A6]
gi|448724096|ref|ZP_21706609.1| beta-glucosidase [Halococcus hamelinensis 100A6]
gi|445786548|gb|EMA37314.1| beta-glucosidase [Halococcus hamelinensis 100A6]
Length = 747
Score = 259 bits (662), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 207/695 (29%), Positives = 338/695 (48%), Gaps = 105/695 (15%)
Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWG 178
P T+FP I +S++ L +++ + +E A+ G T+ SP ++VARD RWG
Sbjct: 88 PEGTTFPQSIGMASSWDPDLMRQVMERTRSEMAAI------GTTHALSPVLDVARDLRWG 141
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
R+ ET GEDP++V A YV GLQ + +S+ KH+AA+ + G
Sbjct: 142 RVEETFGEDPYLVAAMASAYVAGLQGPSIEDG----------ISATLKHFAAHSA-SEGG 190
Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
+R + V +++ ET L P+E + A SVM +Y+ ++GIPS ++ LL +RG
Sbjct: 191 KNRASVN--VGPRELRETHLFPYEAAITTAGAESVMNAYHDIDGIPSASNEWLLTDLLRG 248
Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGN 353
E G +V+D S+ + + H ++DS ++ L+AG+D+ DC ++
Sbjct: 249 ELGFDGTVVSDYYSVDFLREEHG-VSDSDRESAVMALEAGIDVELPATDCYEHLP----E 303
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGI 413
A++ G++ E +D++++ + + R G D S S+ ++ EL AARE I
Sbjct: 304 AIENGELSEATLDEAVRRVLRMKFRKGLVDDSTVDASVAADAFNTEAATELTERAARESI 363
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY---------MSPIAGF 464
VLLKN+ LPL+ ++AVVGP A+ M+G+YA P Y +P+
Sbjct: 364 VLLKNENELLPLD--DTDSLAVVGPKADDGQEMMGDYA-YPAHYPEAEVSLDATTPLDAI 420
Query: 465 SGYAN---VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE---------- 511
+A+ + Y+ GC + S + AA EAA AD T+ G +V+
Sbjct: 421 RVHADGTEIAYEEGC-TTSGPSTDGFDAAVEAAAGADVTLAFVGARSAVDFSDPDAEDVT 479
Query: 512 -------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
E D DL LPG QT+L+ +V E P+++V++S I + + A
Sbjct: 480 NPALPTSGEGSDVTDLGLPGVQTELLERVHETGT-PLVVVVVSGKPHSIEW--VAEEVPA 536
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
++ A PGEEGG IADV+FG +NPGG LP++ V LP+ RP + +
Sbjct: 537 VVQAWLPGEEGGTGIADVLFGDYNPGGHLPVSLARS--VGQLPV-HYDRRPNSA----NK 589
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
+ + LY FG+GLSYT+F+Y+ +V+ + L ++ + A+
Sbjct: 590 DHVYTESEPLYSFGHGLSYTEFEYD------DFEVSTDTLGASGSVTASVTAT------- 636
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
NVG GSDVV +Y+ + A +++++GF+RV + AG +
Sbjct: 637 ------------------NVGGRGGSDVVQLYAHAESPDQARPVQELVGFERVSLDAGES 678
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
RI F +A + L D N + G + + VG+
Sbjct: 679 TRISFEIDATQ-LAYHDRDMNLRVHDGSYELRVGH 712
>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
Y412MC10]
Length = 765
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 196/711 (27%), Positives = 328/711 (46%), Gaps = 109/711 (15%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
E V + +A RLG+P E HG +G T FP + +++
Sbjct: 89 EAVNHIQRYAVEQSRLGIPIL-IGEECSHGHMAIG-----------GTVFPVPLSIGSTW 136
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
N L++ + +AV+ E R+ + G +SP ++V RDPRWGR E GEDP+++ YA
Sbjct: 137 NVDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYA 191
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDME 254
V V GLQ L+S P V++ KH+ Y + + H R ++
Sbjct: 192 VASVEGLQ-------GESLDS-PSSVAATLKHFVGYGSSEGGRNAGPVHMGTR----ELM 239
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
E + PF+ V+ G A+S+M +YN ++G+P + +LL+ +R EW G ++ DC +I
Sbjct: 240 EVDMLPFKKAVEAG-AASIMPAYNEIDGVPCTVNTELLDGILRKEWGFDGMVITDCGAID 298
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
++ H D DA Q ++AG+D++ G+ + AV+ K++ + +D++++ +
Sbjct: 299 MLASGHDTAEDGM-DAAVQAIRAGIDMEMSGEMFGKHLQKAVESNKLEVSVLDEAVRRVL 357
Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
T+ +LG F+ + I S++++ LA + A EGIVLLKN+ LPL S + +
Sbjct: 358 TLKFKLGLFENPYVDPQTAENVIGSEQHVGLARQLAAEGIVLLKNEAKALPL-SKEGGVI 416
Query: 434 AVVGPHANATVAMIGNYAG--IPCRYMSPIAGFSGY-----ANVTYKTGCDDVACKSNNS 486
AV+GP+A+ +G+Y P + + G V Y GC + S
Sbjct: 417 AVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPGC-RIKDDSREG 475
Query: 487 IFAASEAAKTADATIILAG-----------LDLSVEA--------------ESLDREDLW 521
A A+ AD +++ G +DL A E +DR L
Sbjct: 476 FEFALTCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSDMDCGEGIDRMTLQ 535
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
L G Q +L+ ++ ++ K +++ I G IA + + AIL A YPG+EGG A+AD
Sbjct: 536 LSGVQLELVQEIHKLGKRMIVVYI---NGRPIAEPWIDEHADAILEAWYPGQEGGHAVAD 592
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
++FG NP G+L ++ +V LP+ R G+ Y + YPFGYGL
Sbjct: 593 ILFGDVNPSGKLTMSIPK--HVGQLPVYYNGKRS------RGKRYLEEDSQPRYPFGYGL 644
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
SYT+F Y+ + T + + D V+
Sbjct: 645 SYTEFSYSDIQMTPEV-------------------------------IGTDGTAVVSVNV 673
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
N G +GS+VV +Y A +++ GFQ++F++ G ++++F
Sbjct: 674 TNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKIFLQPGERRKVEFTIG 724
>gi|393786524|ref|ZP_10374660.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
CL02T12C05]
gi|392660153|gb|EIY53770.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
CL02T12C05]
Length = 841
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 226/821 (27%), Positives = 352/821 (42%), Gaps = 171/821 (20%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTL-------------------------------------- 74
+F D S P RVKDL+S+MT+
Sbjct: 81 IFEDPSQPVEKRVKDLLSQMTIEEKSCQLATLYGFGRVLKDSLPTPAWKEAIWKDGIANI 140
Query: 75 DEKVQQLGDFAHGVP------------------------RLGLPQYEWWSEALHGVSNVG 110
DE++ +G A VP RLG+P ++ +E +HG+++
Sbjct: 141 DEQLNGVGRGAKRVPHLIVPFSNHVKAINETQRWFIEETRLGIP-VDFSNEGIHGLNHTK 199
Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNI 169
AT P I +++N L ++ G+ V EAR + G T ++P +
Sbjct: 200 -----------ATPLPAPIAIGSTWNTELVREAGEIVGKEARVL------GYTNVYAPIL 242
Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
+V RDPRWGR E GEDP+++G V V G+Q +G V++ KH+A
Sbjct: 243 DVVRDPRWGRTLECYGEDPYLIGELGVQMVDGIQS-QG-------------VAATLKHFA 288
Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
Y D VT +++ E +L PF+ +++ VM SYN NG P +
Sbjct: 289 VYSSPKGGRDGNCRTDPHVTPRELHEIYLYPFKHVIQQSHPMGVMSSYNDWNGEPVTSSY 348
Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
L + +R E+ GY+V+D +++ + H+ +A+ ++AV Q L+AGL++ T+
Sbjct: 349 YFLTKLLREEYGFDGYVVSDSQAVEFVHTKHQ-VAEDYDEAVRQVLEAGLNVR-----TH 402
Query: 350 FTGNA---------VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC-SD 399
FT A + + K+ IDK + + V RLG FD + ++ +D
Sbjct: 403 FTPPADFILPIRRLLAENKISMATIDKRVSEVLAVKFRLGLFDAPYRDNPKEADEVAGAD 462
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRY 457
++ E E R+ +VLLKND LPLN ++K V V GP A+ MI Y G+P
Sbjct: 463 KHSEFVKEMQRQSLVLLKNDGQLLPLNKKEIKKVLVTGPLADEDNFMISRYGPNGLPT-- 520
Query: 458 MSPIAGFSGY----ANVTYKTGCDDV-----ACKSNNSIFAASEAA---------KTADA 499
++ + G Y V Y GC+ + A + ++ A E A ++AD
Sbjct: 521 ITVLQGIKDYLKGDVEVVYSKGCNIIDKEWPASEVLPAVLTAEEVADMDKAVSEAQSADV 580
Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
I + G D ES R L LPG Q +L+ + K PV+LV+++ + I + +
Sbjct: 581 IIAVMGEDEYRVGESRSRTSLELPGRQRELLQALHATGK-PVVLVLINGQPLTINWE--D 637
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN--GDYVQMLPLTSMPLRPVD 617
N+ AIL A +P +GG+ IA+ +FG +NPGG+L +T+ G P
Sbjct: 638 QNLPAILEAWFPSFQGGKIIAETLFGDYNPGGKLTVTFPKSVGQIELNFPFKKGSHGTQP 697
Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
S G G G LYPFGYGLSYT F Y+ NL T+ A
Sbjct: 698 SSGPNGSGSTRVLG-ALYPFGYGLSYTTFAYS-------------------NLEVTAPAK 737
Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
T+ + D N G G +V +Y + TY ++ GFQRV
Sbjct: 738 GTQGE------------VQISFDITNTGKYAGEEVAQLYVRDLVSSVVTYDSRLRGFQRV 785
Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++ KR+ F L ++D + +G + VG
Sbjct: 786 LLQPNETKRMHFTLKPA-DLELLDRNMEWTVESGTFEVRVG 825
>gi|346226406|ref|ZP_08847548.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 775
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 199/708 (28%), Positives = 329/708 (46%), Gaps = 90/708 (12%)
Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
DVI G T+FP + S++ L ++ + + EA A +G+ + ++P I++ARD
Sbjct: 122 DVIHGLETTFPIPLAEACSWDLELMEQSARIAAEEATA------SGIAWNFAPMIDIARD 175
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR+ E GEDP++ A VRG Q +E +++ + +N+ + + KH+ Y
Sbjct: 176 PRWGRVMEGAGEDPYLGSLVARARVRGFQGIETYKDFSKINT----MMATSKHFVGYGAV 231
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
G D + D V + + ET+L PF+ V EG ++ M ++N +NG+P + L +
Sbjct: 232 Q-AGRDYHSVDMSV--RTLHETYLPPFKAAVDEG-VTAFMTAFNDLNGVPCTGNKYLFKE 287
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGN 353
+R W G +V D +IQ MV H F D K A + AG+D+D + + +
Sbjct: 288 ILRDRWGFGGMVVTDYTAIQEMV-AHGFARDLKH-ATELAIDAGIDMDMISEGFVTYLKE 345
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAARE 411
V++GKV E ID ++ + + LG FD +Y + +Q + + E+++ A E A+
Sbjct: 346 LVEEGKVSEKQIDVAVSRILEMKFLLGLFDDPFKYCNAERQKEVVMNPEHLKAAREVAQR 405
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------------GIPCRYM 458
IVLL+N N LPL + K VA++GP ++ G +A G+ +Y
Sbjct: 406 SIVLLENKNNVLPLKKNEPKRVALIGPFVKERESLTGEWAIKGDPDKSVTLMEGLEEKYK 465
Query: 459 SPIAGFSGYAN---------VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
FS YA T K V +S S A A+T+D ++ G
Sbjct: 466 DSQVKFS-YAKGTSLPVIDRTTQKVSTTRVPDRSGFS--EAINLARTSDVILVAMGEKFH 522
Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
E+ R D+ LPG Q +L+ ++ + K P+ILV+ + +D+++ N+ AI+ A
Sbjct: 523 WSGEAASRTDITLPGNQRELLKELKKTGK-PIILVLFNGRPLDLSWEA--ENVDAIVEAW 579
Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSLGYPG 623
YPG G A+ADV+ G +NP +L +T+ V +P+ T P + Y
Sbjct: 580 YPGIMAGHAVADVLSGDYNPSAKLVMTFPRN--VGQIPIFYNVKNTGRPFDEDNPADYRS 637
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
N P LYPFGYGLSYT F+Y+ N + K G
Sbjct: 638 SYIDCPNSP-LYPFGYGLSYTSFEYD---------------------NAKISSKKLERGG 675
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
+L VD N G+ DG +VV +Y +K++ GF+++ ++ G
Sbjct: 676 ILT----------VSVDVTNTGTMDGEEVVQLYIHDKVGSVVRPVKELKGFKKIHLKKGE 725
Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
K ++F + + L + + + GE ++ + HL F+
Sbjct: 726 TKTVEFTIDE-ERLKMYNLDMEWVAEPGEFEAWIASSSADESNHLEFS 772
>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
Ellin6076]
gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 765
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 210/698 (30%), Positives = 333/698 (47%), Gaps = 118/698 (16%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + E LHG + +G TSFP I A+F+ L + + +
Sbjct: 104 RLGIPVI-FHEECLHGHAAIG-----------GTSFPQPIGLGATFDPELVESLFAMTAA 151
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
EARA R +P ++VAR+PRWGR+ ET GEDPF+V R + VRG Q
Sbjct: 152 EARA-----RGTHQALTPVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQGDATFR 206
Query: 210 NATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+ T +V + KH+AA+ ++ + RV + ETFL PF+ + +G
Sbjct: 207 DKT-------RVIATLKHFAAHGQPESGTNCAPVNVSMRV----LRETFLFPFKEALDKG 255
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV---DNH-KFLA 324
A SVM SYN ++G+PS A LL +R EW G++V+D +I + ++H F+A
Sbjct: 256 CAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDYYAIYELSYRPESHGHFVA 315
Query: 325 DSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
K +A A ++AG+++ DC + + V +G ++E+ +D+ ++ + ++
Sbjct: 316 KDKREACALAVQAGVNIELPEPDCYLHLVDL----VHKGVLQESQLDELVEPMLRWKFQM 371
Query: 380 GFFDGSPQYVSLGKQDICS--DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG 437
G FD YV + + + D + ELA +AARE I LLKND +PL+ + +KT+AV+G
Sbjct: 372 GLFDDP--YVDPAEAERIAGCDAHRELAMQAARETITLLKNDGPVVPLDLSAIKTIAVIG 429
Query: 438 PHANATVAMIGNYAGIPCRYMSPIAGFS----GYANVTYKTGC----------DDVA--- 480
P+AN + ++G Y+G+P ++ + G A V Y GC D+V
Sbjct: 430 PNANRS--LLGGYSGVPKHDVTVLDGIRERVGSRAKVVYAEGCKITIGGSWVQDEVTPSD 487
Query: 481 -CKSNNSIFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLINQV 533
+ I A + AK AD ++ G + E+ DR L L G Q +L+ +
Sbjct: 488 PAEDRRQIAEAVKVAKRADVIVLAIGGNEQTSREAWSPKHLGDRPSLDLVGRQEELVRAM 547
Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
K PVI + + + I + ++ AI Y G+E GRA+A+V+FG NPGG+L
Sbjct: 548 VATGK-PVIAFLFNGRPISINY--LAQSVPAIFECWYLGQETGRAVAEVLFGDTNPGGKL 604
Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
PIT +P ++ L P P R Y F LY FGYGLSYT F + L
Sbjct: 605 PIT---------IPRSAGHL-PAFYNHKPSARRGYLFDEVGPLYAFGYGLSYTTFAFQNL 654
Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
K +++ R L VD N G+ +G +
Sbjct: 655 RLAKK---KMHRESTARVL----------------------------VDVTNTGAREGRE 683
Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
VV +Y + IK++ GF+++ ++ G+ + ++F
Sbjct: 684 VVQLYIRDLVSSVTRPIKELKGFRKITLQPGQTQTVEF 721
>gi|334144838|ref|YP_004538047.1| beta-glucosidase [Novosphingobium sp. PP1Y]
gi|333936721|emb|CCA90080.1| beta-glucosidase [Novosphingobium sp. PP1Y]
Length = 889
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 153/403 (37%), Positives = 227/403 (56%), Gaps = 43/403 (10%)
Query: 67 DLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFP 126
DLV++MTLDEK+ QL + A +PRL +P Y WW+E+LHG P T+FP
Sbjct: 36 DLVAKMTLDEKLGQLLNTAPAIPRLDIPAYNWWTESLHGALGSLP----------TTNFP 85
Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARDPRW 177
I A+F+ SL K + A+STE R ++ L R GL WSPNIN+ RDPRW
Sbjct: 86 EPIGLAATFDASLVKDVAGAISTEVRGLHALARKTGRMGRIGTGLDTWSPNINIFRDPRW 145
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
GR ET GEDP++ R V++V G+Q + DL V + KH+A V N
Sbjct: 146 GRGQETYGEDPYLTARMGVSFVEGMQGPD-----PDLPD----VIATPKHFA---VHNGP 193
Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
R+H + V+ D+E+T+L F + EG A SVMC+YNRV+G P+CA +LL + +
Sbjct: 194 ESTRHHANVFVSRHDLEDTYLPAFRAAIVEGRAGSVMCAYNRVDGQPACASQELLQEHLV 253
Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY-------YTNF 350
W GY+V+DCD+++ + DNHK+ D AVA ++ G+D +C + T+
Sbjct: 254 DAWGFQGYVVSDCDAVKDISDNHKYAPDGAA-AVAAAMRMGVDSECHTWTLSDTDGLTDR 312
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSL--GKQDICSDENIELAAEA 408
A+++G + +D+D++L L++ +R G G + + D+ + + LA +A
Sbjct: 313 YREALERGLITVSDVDRTLIRLFSARLRNGDLPGVRKLSTFTSSAADVGTPAHGALALKA 372
Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
A E +VLLKND LP +A +K VAV+GP +AT + GNY+
Sbjct: 373 AEESLVLLKND-GILPFQTAGMK-VAVIGPFGDATRVLRGNYS 413
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 90/303 (29%), Positives = 136/303 (44%), Gaps = 55/303 (18%)
Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
AA+ AD + + GL +EAE D+ L +P Q +L+ Q K P+
Sbjct: 613 RAAQAADVLVAVVGLTSDLEAEESPIEIPGFKGGDKTTLDIPADQQELLEQAKATGK-PL 671
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
I+V M+ +++ +A+ N + AIL A YPG+ GG AIA+V+ GK NP G+LP+T+Y
Sbjct: 672 IVVAMNGSPINLHWAKENAD--AILEAWYPGQSGGLAIANVLTGKANPTGKLPLTFYRS- 728
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
V+ LP P D GRTY+++ G +YPFGYGLSYT F Y ++
Sbjct: 729 -VEDLP-------PFDDYDMKGRTYRYFTGKAVYPFGYGLSYTTFGYGPVA--------- 771
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
AS G+ V N G G D V +Y P
Sbjct: 772 -----------VEPASGGAQDGIRVT-----------TQVSNTGQRAGGDAVQLYLDFPD 809
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG 781
I + GFQ+V ++ G +++ F + ++ +L G + + VG+G
Sbjct: 810 APGTPNIA-LRGFQKVSLQPGETRQVTFTLSPRDLSSVTPDGVRKVL-KGHYRVTVGSGQ 867
Query: 782 VSF 784
F
Sbjct: 868 PGF 870
>gi|154493932|ref|ZP_02033252.1| hypothetical protein PARMER_03276 [Parabacteroides merdae ATCC
43184]
gi|154086192|gb|EDN85237.1| glycosyl hydrolase family 3 C-terminal domain protein
[Parabacteroides merdae ATCC 43184]
Length = 955
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 221/809 (27%), Positives = 360/809 (44%), Gaps = 144/809 (17%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
++ D ++P RV+DL+S+M ++EK Q+ +G R+ LP +W W
Sbjct: 60 VYEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGA 118
Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
E L+G G P ++ I G
Sbjct: 119 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 178
Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
AT+FPT + ++N +L K+G E R LG + ++P ++V RD RWGR
Sbjct: 179 YIATNFPTQLGLGHTWNRNLVHKVGYITGREGRL---LGYTNV--YAPILDVGRDQRWGR 233
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
E GE P++V + +G+Q TD +V++ KHY AY +
Sbjct: 234 YEEVYGESPYLVAELGIEMAKGMQ--------TDH-----QVAATSKHYIAYSNNKGGRE 280
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
D +++ +++E + P++ +KE VM SYN +G P + L +RGE
Sbjct: 281 GMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAV 355
+ GY+V+D D+++ + H AD KE +V Q++ AGL++ C Y +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKE-SVLQSVLAGLNIRCTFRSPDSYVLPLRELI 399
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGI 413
+G + + ID ++ + V +G FD P + L + D + EN +A +A++E +
Sbjct: 400 AEGAIPMSTIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCAENQLVALQASKESL 458
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----AN 469
VLLKN LPL+ K+ +AV GP+A+ + +Y + + + G +
Sbjct: 459 VLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTD 518
Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
V + GCD V + + I A E AK +D T+++ G E+
Sbjct: 519 VLFTKGCDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENK 578
Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
R L LPG Q L+ V K PV+LV+++ + I +A+ + AIL A YPG +G
Sbjct: 579 SRSSLDLPGRQLDLLQAVVATGK-PVVLVLINGRPLSINWAD--KYVPAILEAWYPGSQG 635
Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VD---SLGYPGRTYKFY 629
G AIAD +FG +NPGG+L +T+ V +P + P +P VD + G G +
Sbjct: 636 GTAIADALFGDYNPGGKLTVTF--PKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV- 691
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
NGP LYPFGYGLSYT F+Y+ +S I + + +
Sbjct: 692 NGP-LYPFGYGLSYTTFEYSDISIQPAIVTQVQPVT-----------------------V 727
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
RC N G G +VV +Y + TY K ++GF R+ + G K + F
Sbjct: 728 RC--------KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTF 779
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ L +++ + ++ G+ + VG
Sbjct: 780 TIEP-RDLQLLNSDNHWVVEPGDFKVMVG 807
>gi|330996729|ref|ZP_08320604.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
gi|329572574|gb|EGG54217.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
11841]
Length = 852
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 160/415 (38%), Positives = 227/415 (54%), Gaps = 39/415 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
LF D P R+ DL+SR+T++EK+ L + A + RLG+ +Y +EALHGV V PG
Sbjct: 28 LFRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPG 85
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A +N L +I A+S EAR + G L
Sbjct: 86 DF--------TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLL 137
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDP++ G V +V+GLQ + R LK
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGN---------HPRYLKTV 188
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+A + ++ +R +A+V+E+D+ E +L FE C+ EG A S+M +YN VN
Sbjct: 189 STPKHFAVNNEEH----NRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVND 244
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + L+ +RG+W +GYIV+DC + + M+ H ++ ++E A +KAGLDL+
Sbjct: 245 VPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYVK-TREAAATLAVKAGLDLE 303
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Q Y A +Q V E DID + + M LG FD Q Y + +
Sbjct: 304 CGNQVYGEGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFDDPSQNPYNQIEPSVVGCK 363
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
+ +LA EAAR+ +VLLKN N LPLN KVK++AVVG +A G+Y+G P
Sbjct: 364 AHQDLALEAARQSMVLLKNKDNFLPLNPQKVKSIAVVG--ISAGHCEFGDYSGTP 416
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 140/290 (48%), Gaps = 49/290 (16%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A + A D T+ + G++ S+E E DR L LP Q + I ++ +V V++++
Sbjct: 597 AGKVAAECDVTVAVLGINKSIEREGQDRFTLELPIDQQEFIKELYKVNPNTVVVLV---A 653
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
G +A + N+ AIL A YPGE+GG A+A+V+FG +NPGGRLP+T+YN L
Sbjct: 654 GSSLAVNWMDENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS-------LD 706
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+P D+ GRTY+++ G LY FGYGLSYT+F+Y
Sbjct: 707 EIP--AFDNYSVKGRTYQYFEGQPLYEFGYGLSYTKFRY--------------------- 743
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
+ GV V D + + N G DG +V VY K P +K
Sbjct: 744 ----------KSKGVSV----ARDTVKVSFEVSNTGKYDGDEVAQVYVKYPETGTYMPLK 789
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
Q+ GF+RV ++ G+ ++ V K L D + P GE+T VG
Sbjct: 790 QLHGFKRVHIKKGKTSKVT-VGVPKKDLRYWDEQERKFVTPKGEYTFMVG 838
>gi|441500080|ref|ZP_20982250.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
gi|441436171|gb|ELR69545.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
Length = 704
Score = 258 bits (660), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 203/656 (30%), Positives = 335/656 (51%), Gaps = 79/656 (12%)
Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
DVI G T FP + +S+N L K+ + + EA A +GL + ++P +++ARD
Sbjct: 71 DVIHGHRTIFPLPLAEASSWNLDLIKETARLSAKEAAA------SGLNWTFNPMVDIARD 124
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGRI E GED ++ A V G Q DL S P V +C KH+AAY
Sbjct: 125 PRWGRIAEGSGEDTYLGSLIAKAKVEGYQ-------GDDL-SDPFTVLACVKHFAAYGAS 176
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
G D + D ++++ + ET+L P++ + G A++VM S+N ++G+P+ L+ +
Sbjct: 177 Q-AGRDYHTVD--MSDRVLRETYLPPYKAAIDAG-AATVMTSFNELHGVPASGSRYLMTE 232
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGN 353
+R EW G++V D SI MV H +A+ KE A L AG+D+D G Y +
Sbjct: 233 ILREEWRFKGFVVTDYTSINEMVP-HGVVANEKE-AADLALNAGVDMDMQGGVYNDHLAT 290
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAARE 411
V +GKV E +D++++ + + RLG F +Y+ + Q + S E ++ A +ARE
Sbjct: 291 LVNEGKVSEKQVDEAVRRILEMKWRLGLFKDPYRYLDEKRELQVLFSKELMDHALVSARE 350
Query: 412 GIVLLKND----QNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYMSPIAGFS 465
IVLLKN+ + LP+ + VK++A++GP + + M+G + +G + ++ + G
Sbjct: 351 SIVLLKNEPYNNKKLLPI-ANDVKSIALIGPLGDNQIDMLGTWHASGDANKVVTVLQGLK 409
Query: 466 G---YANVTYKTGCDDVACKSNNSIF-AASEAAKTADATIILAGLDLSVEAESLDREDLW 521
A +TY G D + S+ S F A++ A+ AD I+ G + E+ R L
Sbjct: 410 EAFPKAKITYTKGADFMG--SDKSGFEEATKNARAADLVIMAVGENHQQSGEAASRSGLD 467
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
LPG Q +L+ + + K P++ ++M+ + I + + NI AI+ + G G+AIA+
Sbjct: 468 LPGVQQELVEAIYQTGK-PIVALVMAGRPLTIGW--MDENIPAIVNTWHLGTMAGKAIAE 524
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPL-TSMPL--RPVDS-LGYPGRTYKFYNGPTLYPF 637
V+ GK+NP G+L IT+ V +P+ SM RP D+ Y + N P LYPF
Sbjct: 525 VLAGKYNPSGKLTITFPRN--VGQIPIYYSMKNTGRPFDADSKYTSKYLDVSNEP-LYPF 581
Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
GYGLSYT F+Y + L+K++ + N T
Sbjct: 582 GYGLSYTTFEYG--------EPKLSKIEIKEHENLT-----------------------I 610
Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
+V +N G +G +VV +Y + +K++ GF+++ ++ G +K + F N+
Sbjct: 611 EVMVKNTGEYEGQEVVQLYVRDLVGSVTRPVKELKGFEKISLKPGESKVVTFTINS 666
>gi|319901343|ref|YP_004161071.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
gi|319416374|gb|ADV43485.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
P 36-108]
Length = 781
Score = 258 bits (659), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 212/744 (28%), Positives = 340/744 (45%), Gaps = 140/744 (18%)
Query: 81 LGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLW 140
L +A RLG+P + E HG +G AT FPT + ++++ESL
Sbjct: 118 LQKYAVEETRLGIPVL-FAEECPHGHMAIG-----------ATVFPTALSAASTWDESLM 165
Query: 141 KKIGQAVSTEARAM-YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYV 199
+++G+A++ EAR N+G + P ++VAR+PRW R+ ET GEDP + V +
Sbjct: 166 QQMGEAIALEARLQGANIG------YGPVLDVAREPRWSRMEETFGEDPVLTSVMGVALM 219
Query: 200 RGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETF-- 257
+G+Q D+ + + S KH+AAY GV + M + F
Sbjct: 220 KGMQG--------DVQNDGKHLYSTLKHFAAY------GVPESGHNGSRANSGMRQLFSE 265
Query: 258 -LRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
L PF+ V+ G A ++M SYN ++G+P ++ LL + +R +W G++ +D SI+ +
Sbjct: 266 YLPPFKKAVEAG-AGTIMTSYNSIDGVPCTSNKFLLTEVLRNQWGFKGFVYSDLISIEGI 324
Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTV 375
V + D+KE A A+ L+AGLD+D G + A ++G + D+D+++ + +
Sbjct: 325 V-GMRAAKDNKE-AAAKALRAGLDMDLGGDAFGRNLKQAYEEGLITMDDLDRAVSNVLRL 382
Query: 376 LMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
++G F+ + I S E+ ELA AREG+VLLKND LPL+ +K +AV
Sbjct: 383 KFQMGLFENPYVSPEQAGKHIRSREHKELARRVAREGVVLLKND-GVLPLDK-HLKRIAV 440
Query: 436 VGPHANATVAMIGNYAGIPCRYM------SPIAGFSGYANVTYKTGC-------DDV--- 479
+GP+A+ +G+Y R A S V Y GC D+
Sbjct: 441 IGPNADMMYNQLGDYTAPQDRKEIVTVLDGVRAAVSKTTQVVYVKGCAVRDTTESDIPAA 500
Query: 480 -----------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
+ + + + ++ AA ++ +L +D E DR L L
Sbjct: 501 VAAAQRADAVILVVGGSSARDFKTKYISTGAATVSEDIKVLPDMDC---GEGFDRSSLRL 557
Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
G Q +LIN VA K P++++ ++ +++ A +A+L A YPGE+GG IAD+
Sbjct: 558 LGDQEKLINAVAATGK-PLVVIYIAGRAMNMNLAADKA--RALLAAWYPGEQGGAGIADI 614
Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
+FG +NP GRLP++ +P + L S G R Y G LY FGYGLS
Sbjct: 615 LFGDYNPAGRLPVS---------IPRSEGQLPVFYSQGTQ-RDYVEEKGTPLYAFGYGLS 664
Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
YT+F Y+ L K V + C
Sbjct: 665 YTKFVYSALEMRKGTDVETLQTVSC--------------------------------TVT 692
Query: 703 NVGSTDGSDVVIVY--------SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
N G DG +VV +Y S+PP + A F+R+F++ G ++++ F+
Sbjct: 693 NTGDRDGEEVVQLYICDEVASVSQPPILLKA--------FRRIFLKKGESRKVTFLLKK- 743
Query: 755 KSLNIVDYAANTLLPAGEHTIFVG 778
L I D N ++ G+ + VG
Sbjct: 744 DDLAIYDDEMNYVVEPGDFKVMVG 767
>gi|329851587|ref|ZP_08266344.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
gi|328840433|gb|EGF90005.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
Length = 883
Score = 258 bits (659), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 169/522 (32%), Positives = 263/522 (50%), Gaps = 62/522 (11%)
Query: 28 NGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG 87
N S + VC ++ + S + D++ R DLVSRM+L+EK QL + A
Sbjct: 11 NASVLALLVCLSAPTAQAQNPLESPAYQDTTKTAEQRAADLVSRMSLEEKAAQLINDAPA 70
Query: 88 VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAV 147
+PRLG+ +Y WW+E LHGV+ G AT FP + A+F+E L ++ +
Sbjct: 71 IPRLGVREYNWWNEGLHGVAAHG----------YATVFPQAVGMAATFDEPLIHRVADTI 120
Query: 148 STEARAMYNLGR---------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNY 198
S E RA Y R GLT WSPNIN+ RDPRWGR ET GEDP++ R V +
Sbjct: 121 SVEFRAKYVASRHRFGGSDWFRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTARIGVAF 180
Query: 199 VRGLQDVEGHENATDLNSRPL--KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEET 256
V+GLQ + P+ + + KHYA V + R+ + + D+E+T
Sbjct: 181 VKGLQGED-----------PVYYRTIATPKHYA---VHSGPEASRHRDNINPSRYDLEDT 226
Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-QV 315
+L F + EG A S+MC+YN ++G P+CA+ LL + +R +W G++V+DCD++ +
Sbjct: 227 YLPAFRATIVEGKAVSIMCAYNAIDGQPACANDDLLVKHLRQDWGFKGFVVSDCDAVGDI 286
Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYT 374
+ E+ V +AG DL CG + +AV++G + E+ +D +L L++
Sbjct: 287 YYKTSHHYRPTPEEGVTVAYQAGTDLICGNANEADHVASAVRKGILPESLVDTALVRLFS 346
Query: 375 VLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
+LG FD Q + ++ D + N + + A +VLLKND LPL S + +T+
Sbjct: 347 ARFKLGQFDPPAQVFPAITADDYDTQANRDFSQHVAESAMVLLKND-GLLPLKS-EPRTI 404
Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGC-----------DDV 479
AV+GP+A+ +++GNY G P ++ +AG A V Y G DD
Sbjct: 405 AVIGPNADTMDSLVGNYNGDPSHPVTVLAGIKARFPNATVRYAQGSGLIDPVMTAVPDDS 464
Query: 480 ACKSNN--------SIFAASEAAKTADATIILAGLDLSVEAE 513
C+ + S FA+ E + TA + AG+ + + E
Sbjct: 465 FCRDKDCAAKGVTASHFASPEMSGTAQKSAAEAGIHQAWKGE 506
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 93/308 (30%), Positives = 150/308 (48%), Gaps = 55/308 (17%)
Query: 483 SNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQ 532
S+ A AAK +D I +AGL VE E + DR L LP Q +++ Q
Sbjct: 593 SDTGAQEAVAAAKESDLVIFVAGLSQRVEGEEMRVETPGFSGGDRTSLDLPPVQQKVLEQ 652
Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
V+ K PV+LV+++ + + +A+ N + AI+ A YPG +GG A+A ++ G F+P GR
Sbjct: 653 VSATGK-PVVLVLINGSALSVNWADKN--VPAIVEAWYPGGQGGAAVARLIAGDFSPAGR 709
Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
LP+T+Y Q+ T ++ GRTY+++ G LYPFGYGLSYT+F Y
Sbjct: 710 LPVTFYR-SADQIPAFTDYTMK--------GRTYRYFKGEALYPFGYGLSYTKFSYAPAK 760
Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
+ A+K G + VD N G+ DG +V
Sbjct: 761 LS---------------------AAKVAGNGEVT----------VSVDVTNSGARDGDEV 789
Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
V +Y P + T I+ + F R+ ++AG K + F ++ ++L+ V+ + + G+
Sbjct: 790 VQLYLSHPGQ-KDTPIRALARFDRIHLKAGETKTVTFTLDS-RALSTVNADGSRSVKPGK 847
Query: 773 HTIFVGNG 780
+++G G
Sbjct: 848 VNLWLGGG 855
>gi|336411808|ref|ZP_08592268.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
gi|335940152|gb|EGN02020.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
Length = 859
Score = 258 bits (659), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 216/769 (28%), Positives = 338/769 (43%), Gaps = 146/769 (18%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
++F + ++SLP +RV+DL+SRMTL+EK+ Q+
Sbjct: 22 TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81
Query: 84 -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
F G+ RLG+P + +E+LHG V G
Sbjct: 82 GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T FP I ++FN L ++ A++ E L G+T +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GEDPF+V R V+ VRG D + VS KH+ A+ G++
Sbjct: 184 EECFGEDPFLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
++++ +L+ FE VKE +VM SYN N P+ + L+ + +R W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
D GY+ +D +I ++ HK +S E A+ Q L AGLD + V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID+++ + T +G F+ + + + ++ LA + A E IVLL+N+
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIP-CRYMSPIAGFSGYANVTY 472
N LPL K+K++AV+GP NA G+Y G+ + AG + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTLLEALKERAG--NQLTLNY 461
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
GC D+ + A + AK +D I++ G + A E D DL L
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q L+ + K PVI+V++S G +A + NI I+ YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPLAMSWIKENIPGIVVQWYPGEQGGLALADML 577
Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
GK NP G+L ++ Y LP R S PG+ Y F + L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
+GLSYT F+Y LS T + + D C+D E
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+ +N G DG +V VY + ++++ GF++V ++ G K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715
>gi|198277570|ref|ZP_03210101.1| hypothetical protein BACPLE_03792 [Bacteroides plebeius DSM 17135]
gi|198270068|gb|EDY94338.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
plebeius DSM 17135]
Length = 753
Score = 258 bits (659), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 220/764 (28%), Positives = 362/764 (47%), Gaps = 111/764 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLG-----DFAHGVPRLGLPQYEWWS-------EALHGVSNVG- 110
+V L+S+MTL+EK+ Q+ DF R+ + E S E ++ + +
Sbjct: 38 KVDSLLSQMTLEEKLGQMNQLSPWDFEELAARIR--KGEVGSILNVVNPEEINKIQKIAV 95
Query: 111 -------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGL 162
P DVI G T FP + A+FN + ++ + + EA A G+
Sbjct: 96 EESRLGIPILVARDVIHGYKTIFPIPLGQAATFNPEIAEQGARVAAIEASA------DGI 149
Query: 163 TY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
+ ++P I+V+RDPRWGRI E+ GEDP++ N V G ++G++ D + P +
Sbjct: 150 RWTFAPMIDVSRDPRWGRIAESCGEDPYL------NAVIGTAMIKGYQG--DSLNDPTAI 201
Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
++C KH+ AY R + + E+ + +L PF+ G A+ M S+N +
Sbjct: 202 AACAKHFVAYGAAEG---GRDYNSTFIPERVLRNVYLPPFKAAANAGCAT-FMTSFNDND 257
Query: 282 GIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
G+PS A+ +L +R EW G +V D S MV NH F D K DA +++ AG+D+
Sbjct: 258 GVPSTANSFVLKDVLRKEWKYDGMVVTDWASALEMV-NHGFCTDGK-DAAEKSVNAGVDM 315
Query: 342 D-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
+ + + ++ + KV ID +++ + + RLG FD Y+ + +++
Sbjct: 316 EMVSETFIQNLKQSISENKVSMETIDNAVRNILRLKFRLGLFDNP--YIVTPQSVKYAEK 373
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYM 458
+++ A AA + ++LLKN+ +LPL KVKT+A++GP A+A +G + G
Sbjct: 374 HLQAAKTAAEQSVILLKNENQSLPLTD-KVKTLAIIGPMADAPYEQMGTWVFDGEKEHTQ 432
Query: 459 SPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
+P+ + V ++ G KS I A A+ +DA ++ G + + E+
Sbjct: 433 TPLTAIKKMYGDKVKVLFEKGLAYSRDKSTAGIARAISVARQSDAVVVFVGEESILSGEA 492
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
+L L G Q+QLI ++A K PV+ V+M+ G +A A+ A+L++ +PG
Sbjct: 493 HSLVNLNLQGAQSQLIKELAATGK-PVVTVVMA--GRQLAIADEVKVSDAVLYSFHPGTM 549
Query: 575 GGRAIADVVFGKFNPGGRLPITW--YNGD----YVQMLPLTSMPLRPVDSL-----GYPG 623
GG AIAD++FGK NP G+ P+T+ +G Y Q T P P + L G
Sbjct: 550 GGPAIADILFGKVNPSGKTPVTFPRMSGQVPIYYAQH--KTGRPANPTEMLIDEIPVEAG 607
Query: 624 RT----YKFY----NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
+T FY N P L+PFGYGLSYT F+Y+ NL+ TSD
Sbjct: 608 QTSVGCRSFYLDAGNSP-LFPFGYGLSYTTFEYS-------------------NLSLTSD 647
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
L D +N G+ DG++VV +Y + +K++ FQ
Sbjct: 648 ------------KLTAQDTLSISFTLKNTGNYDGTEVVQLYIQDKVGSVTRPVKELKRFQ 695
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
RV ++AG + ++ L Y N + G+ ++VG+
Sbjct: 696 RVTLKAGESTQVSLNL-PVSELAFWGYDMNYTVEPGDFRLWVGS 738
>gi|189460899|ref|ZP_03009684.1| hypothetical protein BACCOP_01546 [Bacteroides coprocola DSM 17136]
gi|189432473|gb|EDV01458.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 718
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 215/770 (27%), Positives = 349/770 (45%), Gaps = 92/770 (11%)
Query: 43 SKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEA 102
S G+ + +F D ++ R+ DL++RMTLDEKV LG+ VPRLG+ Q E
Sbjct: 15 STTGIIHAQNVFNDPAINEEQRLDDLIARMTLDEKVDALGNNTQ-VPRLGI-QASGSVEG 72
Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN---LGR 159
LHG+ GP T+ D T FP +++ L ++ +STE R ++ +
Sbjct: 73 LHGIVLGGP-TYGDRANTPTTGFPQAYGLGETWDTDLLHRVATYISTENRYLFQNAKYRK 131
Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
+GL W+PN+++ RDPRWGR E GED F+ R AV +++G+Q + +
Sbjct: 132 SGLIMWTPNVDLGRDPRWGRTEECYGEDAFLTSRLAVAFIKGIQGD---------HPKYW 182
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+ +S KH+ + N R + +++ E + PF V EG + ++M +YN
Sbjct: 183 RNASLMKHF----LSNSNEYGRTFSSSNYSDKLFREYYAYPFYKGVTEGGSQALMTAYNA 238
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
NG P P L N V EW L+G ++ D + ++++ +HK + + A A +KAG+
Sbjct: 239 YNGTPCIMHPVLRN-IVMKEWGLNGTLLTDGGAFRLLLSDHKRFDNDRAAAAAACIKAGI 297
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDIC 397
+Y + A+ + + DI+K+++ + ++LG D + Y ++G D
Sbjct: 298 TKFLDEY-KDAVYEALHRKLISVEDIEKAIRGNLRISLKLGLLDHAEDNPYAAIGVTDTI 356
Query: 398 SD----ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
+ E L EA + IVLLKN + LPL+ K+K +AV+G AT + YAG
Sbjct: 357 APWSKPETKALVREATLKSIVLLKNQDHLLPLDRHKIKKIAVIG--QRATEVLQDWYAGK 414
Query: 454 PCRYMSPIAGFSGYANVTYKTGCD-DVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
P ++ + + + G D +V N + +A A AD I+ G + A
Sbjct: 415 PFYTVNVLDA------IREEAGNDIEVRYVKTNRMDSARTVAAWADVAIVCVGNHPTCNA 468
Query: 513 ------------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
E++DR+ L Q L+ QVA+ + ++I S A N
Sbjct: 469 GWEQAPVISEGKEAVDRQSL--QLDQEDLLLQVAQTNPNTIGVLISS---FPYAINRANQ 523
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
+ A+L +E G A++DV+FG +NP GRL TW +T +P +D
Sbjct: 524 TVPALLHLTQCSQELGHAVSDVIFGHYNPAGRLTQTWVKN-------ITDLP-HMMDYDI 575
Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
GRTY ++ LYPFGYGLSYT+F Y+ + + + L+ C NL
Sbjct: 576 THGRTYMYFKEKPLYPFGYGLSYTRFNYSGTTLNDRVIERGDTLRVCFNL---------- 625
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
+N G DG +VV +Y IKQ+ FQR+ +R
Sbjct: 626 ---------------------KNSGDMDGDEVVQLYVSARKHTDKDPIKQLKAFQRISLR 664
Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
G K+++ + + + +LP E T+ +G + F
Sbjct: 665 KGETKKVELTVPYTELQVWDEKQSRFILPDKEMTLEIGASSSDIRLRTTF 714
>gi|427387354|ref|ZP_18883410.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
12058]
gi|425725515|gb|EKU88386.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
12058]
Length = 786
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 232/816 (28%), Positives = 349/816 (42%), Gaps = 149/816 (18%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
++ D S P RVKDL+S+MT++EK Q+ +G R+ LP +W +E G++N
Sbjct: 41 VYEDPSAPLEARVKDLLSQMTMEEKTCQMATL-YGSGRVLKDSLPTEQWKNEIWKDGIAN 99
Query: 109 VG---------------------------------------PGTHFDDVIPG-----ATS 124
+ P ++ I G AT
Sbjct: 100 IDEQANGLGKFGSSLSYPYVNSVENRQAIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATM 159
Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
FP A++N+ L +I + + EA+A LG + +SP +++A+DPRWGR+ E
Sbjct: 160 FPAQCGQGATWNKELISEIAKVTAEEAKA---LGYTNI--YSPILDIAQDPRWGRVVECY 214
Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
GEDPF+VG ++GLQ A L S P KH+A Y +
Sbjct: 215 GEDPFLVGELGKRMIKGLQ-------AEGLVSTP-------KHFAVYSIPVGGRDAGTRT 260
Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
D V ++M ++ PF E A VM SYN +G P L + +R EW G
Sbjct: 261 DPHVAPREMRTLYIEPFRKAFCEAGALGVMSSYNDYDGEPITGSYHFLTEILRHEWGFKG 320
Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
Y+V+D ++++ + H A++ D AQ + AGL++ TNFT A+
Sbjct: 321 YVVSDSEAVEFLYSKHNVAANAV-DGAAQVINAGLNVR-----TNFTLPENFIRPLRQAI 374
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGI 413
+GKV E ID + + V +G FD +P K + + S E+ ++ AA E I
Sbjct: 375 SEGKVSEQTIDSRVADVLRVKFMMGLFD-NPYKGDAKKPEKVVHSKEHQAVSMRAALESI 433
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANV 470
VLLKN+ N LPL S K VAV+GP+A +I Y + G Y A+V
Sbjct: 434 VLLKNENNILPL-SKSTKKVAVIGPNAAEVDNLICRYGPANAPIKTVYQGIKDYLPDADV 492
Query: 471 TYKTGCD------------DVACKSNNS--IFAASEAAKTADATIILAGLDLSVEAESLD 516
Y G D DV + I A AK +D I++ G + E
Sbjct: 493 RYAKGADIIDKYFPESELYDVPLDKDEQAMIDEAVALAKESDVAIMVLGGNEKTVREEYS 552
Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
R +L L G Q +L+ V K PV+L+++ I +AE I I+ A +PGE G
Sbjct: 553 RTNLDLCGRQEKLLQAVYATGK-PVVLLLVDGRAATINWAE--HYIPGIVHAWFPGEFMG 609
Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLY 635
A+A V+FG +NPGG+L +T+ V +P + P +P DS G+ T TLY
Sbjct: 610 DAVAKVLFGDYNPGGKLAVTFPRS--VGQIPF-AFPFKPGSDSKGFVRVT------GTLY 660
Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
PFGYGLSYT F Y+ L + +
Sbjct: 661 PFGYGLSYTTFAYSDLKIENPV-------------------------------IGVQGSV 689
Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
+ +N G G +VV +Y TY+K + GF+RV + G K + FV +
Sbjct: 690 KLSCKVKNTGKVAGDEVVQLYLHDEMSSVTTYVKVLRGFERVHLEPGEEKTVNFVLTP-Q 748
Query: 756 SLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
L + + + ++ G + VG+ + F
Sbjct: 749 ELGLWNKDNHFVVEPGTFAVMVGSSSQDIRLQDKFE 784
>gi|257051950|ref|YP_003129783.1| glycoside hydrolase family 3 domain protein [Halorhabdus utahensis
DSM 12940]
gi|256690713|gb|ACV11050.1| glycoside hydrolase family 3 domain protein [Halorhabdus utahensis
DSM 12940]
Length = 783
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 208/726 (28%), Positives = 323/726 (44%), Gaps = 117/726 (16%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P E E L G PG T FP I ++++ +L + I ++
Sbjct: 103 RLGIPALEH-EECLTGYRG-----------PGGTIFPQSIGLASTWSPALVESITDSIRK 150
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ-DVEGH 208
A+ + SP ++V+RD RWGR+ ET GEDP +VG YV GLQ D +G
Sbjct: 151 RLAAV-----GAVQALSPVLDVSRDMRWGRVEETYGEDPQLVGALGAAYVSGLQNDGDG- 204
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
+ + KH+AA+ G +R ++ E+++ E L PFE+ ++E
Sbjct: 205 ------------IDATLKHFAAHG-SGEGGKNRSSV--QIGERELREVHLYPFEVAIREA 249
Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
DA +VM +Y+ ++G+P + LL +RGEW G++VAD S+ ++ H +AD++
Sbjct: 250 DARAVMNAYHDIDGVPCASSEWLLTDVLRGEWGFDGHVVADYFSVDLLKTEHG-IADTQR 308
Query: 329 DAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
+A L+AGLD+ DC Y N AV+ G++ E +D +++ + + G FD
Sbjct: 309 EAGVAALEAGLDIELPATDC--YGENLL-KAVEDGELSEATVDTAVRRVLRAKIESGVFD 365
Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
+ +DE ELAA AARE + LL+ND + LPL + +VA+VGP A+
Sbjct: 366 DPYVDPEAASEPFDTDEQTELAARAARESMTLLEND-DLLPLAGEDLDSVALVGPQADDG 424
Query: 444 VAMIGNYAG--------------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
A +G+Y + R G + +V Y G + S A
Sbjct: 425 RAQVGDYTHAARFDTEEDGDFECVTPRDALEAKGETAGFDVEYVEGA-TMTGPSTEEFDA 483
Query: 490 ASEAAKTADATIILAGL----------------DLSVEAESLDREDLWLPGYQTQLINQV 533
A E AD + G D+ E+ D DL LPG Q +LI+++
Sbjct: 484 AEETVADADVAVACVGARSDIDFADRENPSELPDVPTSGENCDVTDLELPGVQAELIDRL 543
Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
AE LV++ G A E + A+L A PG+ GG AIADV+FG++NP G L
Sbjct: 544 AETD---TPLVVVQVSGKPHAIPEIAETVPALLHAWLPGQAGGTAIADVLFGEYNPSGHL 600
Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
P++ Q + + P + + +G LY FG+GLSYT F+Y L
Sbjct: 601 PVSIPKSVGQQPVYYSRKP-------NSANEEHVYMDGEPLYSFGHGLSYTDFEYGELEL 653
Query: 654 TKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVV 713
+ + L V N G G DVV
Sbjct: 654 EEGTVEPMGSLSAS-------------------------------VTVTNAGERAGDDVV 682
Query: 714 IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
+Y A +++++GF+RV + G +KR+ F F+A + L D + + G +
Sbjct: 683 QLYQHAENPSQARPVQELLGFERVHLEPGESKRVTFTFDATQ-LAYYDLNMHLAVEEGPY 741
Query: 774 TIFVGN 779
+ VG
Sbjct: 742 ELRVGE 747
>gi|299140913|ref|ZP_07034051.1| periplasmic beta-glucosidase [Prevotella oris C735]
gi|298577879|gb|EFI49747.1| periplasmic beta-glucosidase [Prevotella oris C735]
Length = 767
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 204/696 (29%), Positives = 320/696 (45%), Gaps = 102/696 (14%)
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRI 180
ATSFP +++ +L ++I + EA A+ G T ++P ++V+RDPRWGR+
Sbjct: 119 ATSFPAQCGQGVTWDRALIRQIANVTAQEASAL------GYTNVYAPILDVSRDPRWGRV 172
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E E P++ G V GLQ EN ++ S KH+A Y + +
Sbjct: 173 VECYSESPYLAGELGKQMVLGLQ-----EN---------RIVSTPKHFAVYSLPVGGRDE 218
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
D V ++M+ L PF ++EG A VM SYN +G P P L + +R +W
Sbjct: 219 GTRTDPHVAPKEMKTLLLEPFRKAIQEGGALGVMSSYNDYDGEPITGSPYFLTELLRHQW 278
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT--------- 351
HGY+V+D ++++ + H +A ++E+ A + AGLD+ TNF+
Sbjct: 279 GFHGYVVSDSEAVEFLSSKHH-VAANREEGAAMAINAGLDVR-----TNFSMPETFILPL 332
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAA 409
A+ G V +D +K + V LG FD +P ++ + D + S + +L+ AA
Sbjct: 333 RQALTDGLVSMQILDARVKDVLYVKFWLGLFD-NPYRGNVNEVDQVVHSKAHQQLSLRAA 391
Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-- 467
E IVLLKN+ N LPL S +K +AV+GP+A+AT A + Y S ++G
Sbjct: 392 LESIVLLKNENNLLPL-SKSLKRIAVIGPNADATTAHVCRYGPANAPIKSVLSGIRESMP 450
Query: 468 -ANVTYKTGCD------------DVACKSNNS--IFAASEAAKTADATIILAGLDLSVEA 512
A V Y GC +VA + I A A+ +D +++ G
Sbjct: 451 GAEVRYAKGCSIVDKHFPESELYEVALDTTEQRMIDEAVGVARQSDVAVVVLGGSEETVR 510
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E R DL L G Q QL+ V K PV+LV++ I +A N + AI+ +PG
Sbjct: 511 EEYSRTDLNLMGRQEQLLRAVYATGK-PVVLVLLDGRAATINWA--NQYVPAIVHGWFPG 567
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNG 631
E G A+A V+FG +NPGG+L +T+ V +P + P +P DS G P R +G
Sbjct: 568 EFTGTAVAKVLFGDYNPGGKLAVTFPKS--VGQIPY-AFPFKPGADSKG-PVRV----DG 619
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
LYPFGYGLSYT F Y+ +K + +
Sbjct: 620 -ALYPFGYGLSYTTFAYSDFHISKPV-------------------------------IGI 647
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
E +N G +G ++V +Y + TY K + GF+R+ ++AG ++F+
Sbjct: 648 QGETEVSCKVRNTGQREGDEIVQLYIRDDISSVTTYQKSLRGFERIHLKAGEETTVRFML 707
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIH 787
+ L++ + ++ G TI +G +H
Sbjct: 708 TP-RDLSLWNKHEEFVVEPGTFTIMIGRSSEDICLH 742
>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
Length = 864
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 158/441 (35%), Positives = 233/441 (52%), Gaps = 41/441 (9%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
R DLV R+TL+EK + + + +PRLG+ Y+WW+EALHGV G AT
Sbjct: 37 RANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRAGI----------AT 86
Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
FP I ASF++ L ++ AVS EARA Y R GLT+W+PN+N+ RDP
Sbjct: 87 VFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLKRYQGLTFWTPNVNIFRDP 146
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSCCKHYAAYDVD 234
RWGR ET GEDP++ + + VRGLQ G E+A P K+ +C KH+A +
Sbjct: 147 RWGRGQETYGEDPYLTSQMGMAVVRGLQ---GPEDA------PYDKLHACAKHFAVHSGP 197
Query: 235 NWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
W +R+ F+A + +D+ ET++ F+ V++ VMC+YNR+ G P C + +LL
Sbjct: 198 EW---NRHEFNAENIAPRDLWETYMPAFKDLVQKAHVKEVMCAYNRLEGEPCCGNNRLLT 254
Query: 294 QTVRGEWDLHGYIVADCDSIQVM--VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
+R EW G +V+DC +I +H+ D K A A + +G DL+CG Y +
Sbjct: 255 HILRDEWGYQGIVVSDCGAISDFWRKGDHETHPD-KAHASAGAVLSGTDLECGSNYKSLP 313
Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAARE 411
AV+ G + E+ +D S+K L LG D + ++ + + +LA ARE
Sbjct: 314 -EAVKAGLIAESQLDISVKRLLKARFELGEMDKDVCWDTIPYSVVDCQAHKDLALRMARE 372
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---A 468
IVLL+N N LPL K +A+VGP+AN ++ GNY G P + +
Sbjct: 373 SIVLLQNRNNILPLR--KDMKIALVGPNANDSIMHWGNYNGFPSHTETLYEALKKRLPAS 430
Query: 469 NVTYKTGCDDVACKSNNSIFA 489
+ Y+ GCD + + S+FA
Sbjct: 431 QLIYEFGCDRTSPVALESVFA 451
Score = 108 bits (271), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 82/300 (27%), Positives = 129/300 (43%), Gaps = 54/300 (18%)
Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
A ++ K AD + G+ ++E E + DR + LP Q QL+ ++ ++ K
Sbjct: 593 ATADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQRQLVGELKKLGK 652
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
P++ + S G + A + ++ A YPG+ GG AIADV+FG +NP G+LP+T+Y
Sbjct: 653 -PIVFINYS--GSAMGLAPESEICDGMIQAWYPGQAGGTAIADVLFGDYNPAGKLPVTFY 709
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
+ LP + GRTY++ L+ FG+GLSYT F Y +
Sbjct: 710 RN--TEQLP-------DFEDYAMKGRTYRYMTETPLFRFGHGLSYTTFDYG------KAR 754
Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
++ N L T + N G+ DG + V VY +
Sbjct: 755 LSQNTFSKGETLTLT-------------------------IPVSNTGTRDGEETVQVYLR 789
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
P + A + F+RV+V G K IKF + L N L +GE+ + G
Sbjct: 790 RPGDADAPS-HTLRAFKRVYVPKGGTKEIKFTLSDDNFLWFDTSTNNMNLISGEYELLYG 848
>gi|332881172|ref|ZP_08448831.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
gi|357047867|ref|ZP_09109460.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
11840]
gi|332680886|gb|EGJ53824.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
329 str. F0087]
gi|355529206|gb|EHG98645.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
11840]
Length = 851
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 159/415 (38%), Positives = 226/415 (54%), Gaps = 39/415 (9%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
LF D P R+ DL+SR+T++EK+ L + A + RLG+ +Y +EALHGV V PG
Sbjct: 28 LFRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPG 85
Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
T FP I A +N L +I A+S EAR + G L
Sbjct: 86 DF--------TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLL 137
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
T+WSP +N+ARDPRWGR ET GEDP++ G V +V+GLQ + R LK
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGD---------HPRYLKTV 188
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
S KH+A + ++ +R +A+V+E+D+ E +L FE C+ EG A S+M +YN VN
Sbjct: 189 STPKHFAVNNEEH----NRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVND 244
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+P + L+ +RG+W +GYIV+DC + + M+ H ++ ++E A +K GLDL+
Sbjct: 245 VPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYV-KTREAAATLAVKVGLDLE 303
Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
CG Q Y A +Q V E DID + + M LG FD Q Y + +
Sbjct: 304 CGNQVYGEGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFDAPSQNPYNQIEPSVVGCK 363
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
+ +LA EAAR+ +VLLKN N LPLN KVK++AVVG +A G+Y+G P
Sbjct: 364 AHQDLALEAARQSMVLLKNKDNFLPLNPKKVKSIAVVG--ISAGHCEFGDYSGTP 416
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 139/289 (48%), Gaps = 47/289 (16%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A + A D T+ + G++ S+E E DR L LP Q + I ++ +V V++++
Sbjct: 596 AGKVAAECDVTVAVLGINKSIEREGQDRFSLELPVDQQEFIKELYKVNPNTVVVLV---A 652
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
G +A + N+ AIL A YPGE+GG A+A+V+FG +NPGGRLP+T+YN L
Sbjct: 653 GSSMAVNWMDENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS-------LD 705
Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
+P D+ GRTY+++ G LY FGYGLSYT+F+Y K+ VN+ +
Sbjct: 706 EIP--AFDNYSVKGRTYQYFEGQPLYEFGYGLSYTKFRY------KSKGVNVEQ------ 751
Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
D + + N G DG +V VY K P +K
Sbjct: 752 -----------------------DTVKVSFEVSNTGKYDGDEVAQVYVKYPETGTYMPLK 788
Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
Q+ GF+RV ++ G+ ++ + + P GE+T VG
Sbjct: 789 QLHGFKRVHIKKGKTSKVTIGVPRKDLRYWYEQERKFITPKGEYTFMVG 837
>gi|448410571|ref|ZP_21575276.1| beta-glucosidase [Halosimplex carlsbadense 2-9-1]
gi|445671607|gb|ELZ24194.1| beta-glucosidase [Halosimplex carlsbadense 2-9-1]
Length = 760
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 206/694 (29%), Positives = 316/694 (45%), Gaps = 104/694 (14%)
Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
P T+FP I ++++ L + + + A +G A SP ++VARD RWGR
Sbjct: 102 PEGTTFPQGIGMASTWDPDLMAAVTDTIGDQLEA---IGTA--HALSPVLDVARDLRWGR 156
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
+ ET GEDP++V A YV GLQ +S +S+ KH+ + V G
Sbjct: 157 VEETYGEDPYLVAEMATAYVDGLQG----------DSPADGISATLKHFVGHAV-GAGGK 205
Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
+R D V+ + + E + PFE ++EG+A SVM +Y+ ++G+P D LL +RGE
Sbjct: 206 NRSSVD--VSRRTLREVHMFPFEAAIQEGNAESVMNAYHDIDGVPCAKDEWLLTDVLRGE 263
Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
W G +V+D S+ + + H A +E AV+ ++AG+D+ DC +Y A
Sbjct: 264 WGFDGTVVSDYFSVDFLKEEHGVAATQQEAAVS-AVEAGVDVELPNTDCYEYLA----EA 318
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
V+ G + E +D+S++ + G F+ V + + LA EAAR+ +V
Sbjct: 319 VRDGDLAEESLDESVRRVLRAKFEKGLFEEYTVDVDAATDPYEDEAAVGLAREAARDSLV 378
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY------------MSPIA 462
+LKN+ + LPL+ A +VAVVGP A+ M+G+YA Y +S I
Sbjct: 379 VLKNESDLLPLDDA--DSVAVVGPKADDKKGMLGDYA-YAAHYPEEEYEFEADTPLSAIE 435
Query: 463 GFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE----------- 511
G A+V Y GC S + I A EAA+ AD + G +V+
Sbjct: 436 NRVG-ADVNYAQGC-TATGNSTDKIGRAVEAAENADVALAFVGARSAVDFSDADGVKAEQ 493
Query: 512 ------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
E D DL LPG Q +L+ QV E PV++V++S G A E + A+
Sbjct: 494 PMVPTSGEGCDVTDLGLPGVQNELVAQVEET-DTPVVIVLVS--GKPHAIPEIDAGADAV 550
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
+ A PGEE G AI DVVF + GG LP++ + +P+
Sbjct: 551 VQAWLPGEEAGNAIVDVVFEGHDSGGHLPVSMPKS-------VGQLPVHYSRKPNTYSED 603
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
Y + + +YPFG+GLSY +F+Y+ L +
Sbjct: 604 YVYDDAQPVYPFGHGLSYAEFEYSDLDLSDVDVDPSGT---------------------- 641
Query: 686 VNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNK 745
F V +N DGSDVV +Y A +++++GF+RV + AG +
Sbjct: 642 ---------FSASVTVENTAERDGSDVVQLYVSAENPDLARPVQELVGFRRVELDAGEST 692
Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
I F A L D AN + AG++ + VG+
Sbjct: 693 EITFDL-AASQLAYHDRNANLAVEAGDYELRVGH 725
>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 863
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 167/460 (36%), Positives = 235/460 (51%), Gaps = 43/460 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R + LV +TL+EK + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASFN + ++ AVS EARA + GLT W
Sbjct: 81 --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ R V V+GLQ + + K+ +C
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG--------PADGKYDKLHACA 184
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKEG VMC+YNR G P
Sbjct: 185 KHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDP 241
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +RGEW G +V+DC +I ++ H D+ E A A + +G DL+
Sbjct: 242 CCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDA-ESASAAAVISGTDLE 300
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSD 399
CG Y +V++G + E +D S+K L LG D P+ VS K + S
Sbjct: 301 CGSSYKALI-ESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASA 358
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
+ LA ARE + LL N N LPL + TVAV+GP+AN +V GNY G+P ++
Sbjct: 359 AHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPAHTVT 417
Query: 460 PIAGFSGYA----NVTYKTGCDDVACKSNNSIFAASEAAK 495
+ G + Y+ GC V S F+ ++ K
Sbjct: 418 ILDGVRNLLGTDDKLIYEQGCPWVERTLIQSAFSQCKSDK 457
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/312 (30%), Positives = 144/312 (46%), Gaps = 56/312 (17%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
D+ K + I + E K AD I +G+ S+E E + DR D+ LP Q
Sbjct: 581 DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 640
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
+LI+ + K +++++ G I +AIL A YPG++GG+A+A+V+FG +
Sbjct: 641 ELIDALHRAGKK---IILVNCSGSPIGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDY 697
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
NP G+LP+T+Y V LP + GRTY++ L+PFGYGLSYT F
Sbjct: 698 NPAGKLPVTFYRN--VSQLP-------DFEDYNMTGRTYRYMQDVPLFPFGYGLSYTTFG 748
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
Y KT+ L+K N+L + V N G
Sbjct: 749 YG-----KTV---LDK-----------------------NELTAGQSLKLTVPVTNTGKR 777
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
+G +VV VY + + A IK + F+RV + AG+ ++F K L D +NT+
Sbjct: 778 NGEEVVQVYLRKQGD-AEGPIKTLRAFKRVSIPAGKTVNVEFDLKD-KELEWWDDQSNTV 835
Query: 768 -LPAGEHTIFVG 778
+ G + I VG
Sbjct: 836 RVCPGNYDIMVG 847
>gi|150003144|ref|YP_001297888.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
gi|149931568|gb|ABR38266.1| glycoside hydrolase family 3, candidate beta-glycosidase
[Bacteroides vulgatus ATCC 8482]
Length = 785
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 206/703 (29%), Positives = 326/703 (46%), Gaps = 126/703 (17%)
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAM-YNLGRAGLTYWSPNINVARDPRWGR 179
G T FPT + +++NE L K+G+A++ EAR N+G + P ++VAR+PRW R
Sbjct: 150 GTTVFPTALSAASTWNEGLMLKMGEAIALEARLQGANIG------YGPVLDVAREPRWSR 203
Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
+ ET GEDP + V ++G+Q + + + + KH+AAY V
Sbjct: 204 MEETFGEDPVLTTIMGVAMMKGMQG--------KVQNDGKHLYATLKHFAAYGVP----- 250
Query: 240 DRYHFDARVT--EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
+ H +R + + +L PF VKEG A ++M SYN ++G+P A+ +LL +R
Sbjct: 251 ESGHNGSRANCGMRQLLSEYLPPFRKAVKEG-AGTLMTSYNAIDGVPCTANKELLTDVLR 309
Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQ 356
+W G++ +D SI+ +V + D+KE AV + LKAGLD+D G + A +
Sbjct: 310 NQWGFKGFVYSDLISIEGIV-GMRAAKDNKEAAV-KALKAGLDMDLGGNAFGKNLKKAYE 367
Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLL 416
+G + D+D+++ + + ++G F+ L K+ + S E+ ELA + AREG+VLL
Sbjct: 368 EGLITMADLDRAVGNVLRLKFQMGLFENPYVSPELAKKLVHSKEHKELARQVAREGVVLL 427
Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI------AGFSGYANV 470
KN+ LPL S + +AV+GP+A+ +G+Y R A S V
Sbjct: 428 KNE-GVLPL-SKHIGHLAVIGPNADEMYNQLGDYTAPQVREEVATVLDGIRAAVSESTRV 485
Query: 471 TYKTGC---DDVA------------------------CKSNNSIFAASEAAKTADATIIL 503
TY GC D A + + + ++ AA ++ L
Sbjct: 486 TYVKGCAVRDTTATDIPAAVAAAQKADAVVLVVGGSSARDFKTKYISTGAATVSEDAKTL 545
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
+D E DR L L G Q +LI+ VA K P+++V + +++ A +
Sbjct: 546 PDMDC---GEGFDRSSLRLLGDQEKLISAVASTGK-PLVVVYIQGRTMNMNLAAEKA--Q 599
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
A+L A YPGE+GG IAD++FG ++P GRLP++ +P + L S G
Sbjct: 600 ALLTAWYPGEQGGMGIADILFGDYSPAGRLPVS---------VPRSEGQLPVFYSQGTQ- 649
Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
R Y G LY FGYGLSYT+F Y+ L K ++ + C
Sbjct: 650 RDYVESKGTPLYAFGYGLSYTRFTYSGLELQKGTEMETLQTVAC---------------- 693
Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY--------SKPPAEIAATYIKQVIGFQ 735
N G+ DG +VV +Y S+PP + A FQ
Sbjct: 694 ----------------TVTNTGNRDGEEVVQLYIGDKVASVSQPPLLLKA--------FQ 729
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
R+F++ G ++++ F L I D N ++ GE + VG
Sbjct: 730 RIFLKKGESRQVIFHLKK-DDLGIYDSEMNYVVEPGEFKVMVG 771
>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 887
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 151/424 (35%), Positives = 229/424 (54%), Gaps = 34/424 (8%)
Query: 52 FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
F D++L + +RVKDLVSR+TL+EKV Q+ + A +PRLG+P Y+WW+E LHGV+
Sbjct: 40 FPMWDTNLSFEVRVKDLVSRLTLEEKVGQMLNAAPAIPRLGIPAYDWWNEVLHGVAR--- 96
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGR-----AGL 162
T F T +P I A ++ + + + E RA++N LGR GL
Sbjct: 97 -TPFH-----VTVYPQAIGMAAGWDSTSLAMMAHYSALEGRAVFNKATALGRNNERYLGL 150
Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
TYW+PNIN+ RDPRWGR ET GEDPF+ +VRGLQ + + LK +
Sbjct: 151 TYWTPNINIFRDPRWGRGQETYGEDPFLTSMLGRAFVRGLQGDD---------PKYLKAA 201
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+C KH+A V + R+ + + D+ +T+L F+ V + VMC+YN +G
Sbjct: 202 ACAKHFA---VHSGPEPSRHSDNFSPSNYDLWDTYLPAFKELVTKAKVEGVMCAYNAFHG 258
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
P C L+N +R +W GY+ +DC +I HK D+ +V L G D++
Sbjct: 259 QPCCGSDVLMNDILRKQWQFKGYVTSDCWAIDDFFKFHKTHPDATSASVDAVLH-GTDVE 317
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
CG + V++G + E +D SL L+T RLG FD +Y + + + E
Sbjct: 318 CGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTRYRLGMFDPVSMVKYAQTPESILETAE 377
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
+ + + A++ IVLLKN+ NTLPL S +K +AV+GP+A+ + ++GNY G P ++
Sbjct: 378 HKAHSLKMAQQSIVLLKNEGNTLPL-SKNIKKIAVLGPNADNRIVVLGNYNGQPSEIITA 436
Query: 461 IAGF 464
+ G
Sbjct: 437 LQGI 440
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 147/324 (45%), Gaps = 60/324 (18%)
Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL--------- 515
G ANV + G + KSN S A K ADA + + G+ +E E +
Sbjct: 592 EGKANVHLRAGLLE---KSNLS--AIVNRVKDADAIVYVGGISPQLEGEEMRVDFPGFNG 646
Query: 516 -DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
DR + LP QT+L+ + K P++ V+M+ G IA + NI AI+ A Y G+
Sbjct: 647 GDRTSILLPAVQTELLKMLKGTGK-PLVFVVMT--GSAIALPYEDQNIPAIVNAWYGGQS 703
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
G AIADV+FG +NP GRLP+T+Y D + +P S RTY+++ G L
Sbjct: 704 AGTAIADVLFGDYNPAGRLPVTFYKAD-------SDLP--DFKSYDMNNRTYRYFKGDAL 754
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFG+GLSYT F+Y SK + PG ++
Sbjct: 755 YPFGHGLSYTSFQY----------------------------SKLKTPG----KIKSGAS 782
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
F+ N G DG +VV +Y P I+ + GF R+ ++AG +K + F +
Sbjct: 783 FKVSATLTNTGKKDGDEVVQLYLAYPEVAGKAPIRALKGFNRIRLKAGESKTVSFTLSP- 841
Query: 755 KSLNIVDYAANTLLPAGEHTIFVG 778
+ +V+ P G+ I +G
Sbjct: 842 EQCQLVNEEGALYQPKGKMEISLG 865
>gi|393782958|ref|ZP_10371138.1| hypothetical protein HMPREF1071_02006 [Bacteroides salyersiae
CL02T12C01]
gi|392671316|gb|EIY64790.1| hypothetical protein HMPREF1071_02006 [Bacteroides salyersiae
CL02T12C01]
Length = 759
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 225/811 (27%), Positives = 356/811 (43%), Gaps = 160/811 (19%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG------------------------- 87
L+ DS P RV+DL+ RMTL EK QL + G
Sbjct: 29 LYKDSLAPIESRVEDLLRRMTLHEKTLQLQNKPVGRIDEIESIFQGQSYGCTHEMGKTAE 88
Query: 88 ---------------VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
RLG+P A+ G+ + + +T FP I
Sbjct: 89 ECAGIYNELQKYMLTKTRLGIPILT----AVEGIQGI--------LQNNSTLFPHSIAQG 136
Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRAGL-TYWSPNINVARDPRWGRITETPGEDPFVV 191
++FN L +++ A EA AM G+ SP ++AR+ RWGR+ ET GEDPF++
Sbjct: 137 STFNPELIERMTDAAGKEAAAM------GIHQVLSPVFDIARELRWGRVEETYGEDPFLI 190
Query: 192 GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQ 251
+ +V+G Q H+ ++ KH+ A+ G++ E+
Sbjct: 191 SEMGIGFVKGYQK---HQ-----------ITCTPKHFVAHGTPA-GGLNCAFVSG--GER 233
Query: 252 DMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCD 311
+ +L PF +KE + +M Y+ +GIP A P + +R E GY+ +D
Sbjct: 234 EFRSIYLYPFARVIKETNPLCIMSCYSAYDGIPVSASPYYMTDVLRDELGFKGYVYSDWG 293
Query: 312 SIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKY 371
S+ ++ H + ++E+A +L AG+DLD Y V++GK+ E IDK+++
Sbjct: 294 SVDRVMTFH-YAVPTREEAAKVSLIAGVDLDVDSDYETLE-QQVKEGKIDEAYIDKAVRR 351
Query: 372 LYTVLMRLGFFD----GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
+ V LG FD G P+ V K+ + SD++I LA E A E +LL+N N LPL+
Sbjct: 352 VLYVKFALGLFDRPYYGDPKLV---KKVVRSDKHIALAKEVADESTILLENKNNILPLDL 408
Query: 428 AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKT-------GCDDVA 480
+K K++AVVGP++N TV G+Y+ + + G V K GC+
Sbjct: 409 SKYKSIAVVGPNSNQTV--FGDYSWTTPDTKEGVTLYQGLQQVLGKKKTILQADGCNWWN 466
Query: 481 CKSNNSIFAASEAAKTADATIILAGLD---------LSVEAESLDREDLWLPGYQTQLIN 531
+ I A +A + +D I+ G S E D L LPG Q++L+
Sbjct: 467 RADSKDIEQAVKAVEQSDLAIVAVGTRSTFLGRGPRYSTAGEGFDLSSLELPGNQSELLK 526
Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
V K P+I+V++S + +++A+ N + + W Y GE+ GR++AD++ G NP G
Sbjct: 527 AVKATGK-PMIVVLISGKPLVMSWAKENADAVLVQW--YAGEQQGRSLADILVGNVNPSG 583
Query: 592 RLPIT-------------WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
R+ ++ +Y D VQ RP + P Y F + L+ FG
Sbjct: 584 RVNVSFPRSTGNTPCFYNYYPTDRVQRFD------RP-GTYEEPAGHYIFEHPYALWEFG 636
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
YGLSYT F Y+ + +I SD G +V
Sbjct: 637 YGLSYTNFNYSGCTLNDSIY---------------SDQ------GTIVA----------T 665
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
V+ +N G DG +VV +Y + +T IKQ+ F++VF++AG K++ L
Sbjct: 666 VEVENTGKRDGKEVVQLYVRDKISSVSTPIKQLKAFKKVFIKAGEKKKVTLEV-PMSELA 724
Query: 759 IVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
+ D ++ GE I +G+ S IH N
Sbjct: 725 LYDVRMKPVVEPGEFEIQIGSS--SDRIHFN 753
>gi|260593561|ref|ZP_05859019.1| xylosidase/arabinosidase [Prevotella veroralis F0319]
gi|260534549|gb|EEX17166.1| xylosidase/arabinosidase [Prevotella veroralis F0319]
Length = 771
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 206/682 (30%), Positives = 324/682 (47%), Gaps = 94/682 (13%)
Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS--PNINVARDPRWG 178
G T +PT I +SF+ + KI + + E RAM +W+ PN+ VARD RWG
Sbjct: 144 GNTVYPTNIGLASSFDVDMAYKIARQTAEEMRAMN-------MHWNFNPNVEVARDARWG 196
Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY--AAYDVDNW 236
R ET GEDP++V V +G Q +N D V C KH+ +Y ++
Sbjct: 197 RCGETFGEDPYLVTLMGVATNKGYQ--RNLDNVQD-------VLGCVKHFVGGSYSINGT 247
Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
G V+E+ + E F PF+ +++G +VM S+N +NG+P + L+ +
Sbjct: 248 NGAP-----CEVSERTLREVFFPPFKAAIQQGGDWNVMMSHNDLNGVPCHTNSWLMTDVL 302
Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAV 355
R EW G+IV+D I+ VD H+ A++KE A Q++ AG+D+ G + V
Sbjct: 303 RKEWGFRGFIVSDWMDIEHCVDQHRTAANNKE-AFYQSIMAGMDMHMHGPEWQTAVVELV 361
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
++G++ E+ ID+S++ + TV RLG F+ + I E+ A EA+R IVL
Sbjct: 362 KEGRIPESRIDESVRRILTVKFRLGLFEHPYSDAKTRDRVITDPEHKRTALEASRNSIVL 421
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC--RYMSPIAGFSGYANVTYK 473
LKN+ + LPL++ K K V V G +AN M G+++ + + + + G + T
Sbjct: 422 LKNENDLLPLDAQKYKKVLVTGINANDQNIM-GDWSELQPEDQVWTVLRGLKSVSPTTDF 480
Query: 474 TGCD---DVACKSNNSIFAASEAAKTADATIILAG-------LDLSVEAESLDREDLWLP 523
D D S + AA AAK D I+ G + E DR++L L
Sbjct: 481 KFVDQGWDPRNMSQAQVNAAVAAAKDCDLNIVCCGEYMMRFRWNERTSGEDTDRDNLDLV 540
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q QLI ++ E K P I+VI+S + + +A ++ AI+ A PG+ GG+AIA+++
Sbjct: 541 GLQNQLIQRLNETGK-PTIVVIISGRPLSLRYAA--EHVPAIINAWEPGQFGGQAIAEII 597
Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY-------NGPTLYP 636
+GK NP +L +T +P ++ + S Y + F+ N P LYP
Sbjct: 598 YGKVNPSAKLAMT---------IPRSAGQI----STWYNHKRSAFFHPAVCTDNKP-LYP 643
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FGYGLSYT F+Y+ L +K I N K Q +
Sbjct: 644 FGYGLSYTSFRYSNLKLSKQIIPNDGKTQIIAS--------------------------- 676
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
V +N G DG ++ +Y + +K++ F RV ++AG + ++F K
Sbjct: 677 --VTIENTGQRDGVEICQLYINDLVSSVSRPVKELKDFLRVELKAGEKRTVEFTITPDK- 733
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L D N ++ AGE + +G
Sbjct: 734 LAFYDLNMNPIVEAGEFEVMIG 755
>gi|255532174|ref|YP_003092546.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
gi|255345158|gb|ACU04484.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
DSM 2366]
Length = 799
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 224/797 (28%), Positives = 354/797 (44%), Gaps = 143/797 (17%)
Query: 35 FVCDPGRFSKLGLQMSSF----LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
F DP + K + ++ ++ D P + R+ +L+S+MTL+EK Q+ +G R
Sbjct: 25 FKADPPIYRKGWIDLNKNGKKDIYEDPLQPLNARIDNLLSQMTLEEKTCQMATL-YGWKR 83
Query: 91 L---GLPQYEW----WS-------EALHGVSNVG-------------------------- 110
+ LP EW W E L+G G
Sbjct: 84 VLKDSLPTKEWKTAIWKDGIANIDEHLNGFLTWGVTSTSELVTDIKKHVWAMNETQRFFI 143
Query: 111 -------PGTHFDDVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
P ++ I G AT FPT + ++N +L +K+G+ EARA LG
Sbjct: 144 EQTRLGIPVDFTNEGIRGVEAYEATGFPTQLNMGMTWNRNLIRKMGRITGQEARA---LG 200
Query: 159 RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
+ ++P ++VARD RWGR+ E GEDP++V R V G+Q EN
Sbjct: 201 YTNV--YAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMTLGMQ-----ENN------- 246
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
+++S KH+A Y + D +V+ +++E+ L PF+ ++E VM SYN
Sbjct: 247 -QIASTAKHFAVYSANKGAREGLARTDPQVSPREVEDIMLYPFKKVIQEAGIMGVMSSYN 305
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
NGIP L Q +R ++ GY+V+D D+++ + + H A+ KE AV Q AG
Sbjct: 306 DYNGIPITGSEYWLTQRLRKDFGFGGYVVSDSDALEYLYNKHHVAANLKE-AVFQAFMAG 364
Query: 339 LDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV---SL 391
L++ + V +G++ I+ +K + V +LG FD YV +
Sbjct: 365 LNVRTTFRPPDSIIIYARQLVNEGRIPIETINSRVKDVLRVKFKLGLFDQP--YVKDAAA 422
Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
++ + S + +A +A++E IVLLKN+ LPL S +K +AV+GP+A +Y
Sbjct: 423 SEKLVNSIAHQAVALQASKESIVLLKNNNQILPL-SRSLKKIAVIGPNAADNDYAHTHYG 481
Query: 452 GIPCRYMSPIAGFS---GYANVTYKTGCDDVACK-SNNSIFA-------------ASEAA 494
+ + + + G G V Y GC+ V + IF A A
Sbjct: 482 PLQSKSTNILEGIRNKIGADKVWYAKGCELVDKNWPESEIFPEDPDATAIALIEDAVNTA 541
Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
AD I++ G + E+ R L LPG+Q LI + + K PV+ V++ + I
Sbjct: 542 MKADVAIVVLGGNTKTAGENKSRTTLELPGFQLNLIKAIQKTGK-PVVAVMIGTQPMGIN 600
Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
+ + I I++AGYPG +GG A+ADV+FG +NPGG+L +T+ V LPL + P +
Sbjct: 601 W--IDKYIDGIVYAGYPGVKGGIAVADVLFGDYNPGGKLTLTFPKS--VGQLPL-NFPSK 655
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
P ++ G K LYPFG+GLSYT F Y+ L + Q
Sbjct: 656 P-NAQTDEGELAKIKG--LLYPFGFGLSYTTFAYSNLKISPIEQ---------------- 696
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
D VD N +G ++V +Y + TY K + GF
Sbjct: 697 ---------------EKDGNISISVDITNTAKLEGDEIVQLYIRDVLSTVTTYEKILRGF 741
Query: 735 QRVFVRAGRNKRIKFVF 751
+R+ ++ K +KF
Sbjct: 742 ERISLKPNETKTLKFTL 758
>gi|393779898|ref|ZP_10368130.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 412 str. F0487]
gi|392609318|gb|EIW92128.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 412 str. F0487]
Length = 770
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 209/733 (28%), Positives = 354/733 (48%), Gaps = 99/733 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLG---LPQYEWWSE-----------ALHGVSNV 109
RV ++ MTL+EK+ Q+ F+ G +Y+ + E ++ G+ N+
Sbjct: 46 RVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVAGIRNL 105
Query: 110 G-----------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
P DVI G T FP + + S++ +L +K + + EA A
Sbjct: 106 QKIAVEQTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLTLMRKTAELAAREASA---- 161
Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
G+ + ++P +++ RD RWGR E GEDP++ A V+G Q G +N L+S
Sbjct: 162 --DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQ---GGDNWQMLSS 216
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
P + +C KH+A Y G D + A ++ + +L P+E + S+M S
Sbjct: 217 -PHTLLACGKHFAGYGAAE-SGKD--YNTAELSMHTLRNVYLPPYEATLN-ARVGSIMAS 271
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
N +NG+P+ AD LL + +R EW +G +V+D I +V H D K+ A +
Sbjct: 272 LNEINGVPATADKWLLTEVLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQ-AANLSAN 329
Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGK 393
AG+++D G + + V++GKV E IDK+++++ + LG FD +Y+ + K
Sbjct: 330 AGIEMDMNGATFIKYLSALVKEGKVTEAQIDKAVRHILEIKFLLGLFDDPYRYLDETRAK 389
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-- 451
++ +++ +++A +A +VLLKN+ LP+ KT+AV+GP N T + G++
Sbjct: 390 ENTFTEKYLKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCL 449
Query: 452 GIPCRYMSPIAGFSGYANVT-----YKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
G + +S + G + T Y GC S + A A+ AD ++ G
Sbjct: 450 GDGKQSVSLLTGLTEKYKATNVKLLYAEGCG-FTTISTEQLKEAVAMARKADRVLVAVGE 508
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
S ES R D+ LP Q QL+ + + K P+ ++ S +D+++ N N++AIL
Sbjct: 509 QSSWSGESAVRTDIRLPQAQRQLLEALKTINK-PIAIITFSGRPLDLSWE--NENVQAIL 565
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPL----RPV 616
A +PG +GG IADV+ G NP G L +++ V +P+ T P+ V
Sbjct: 566 QAWFPGTQGGYGIADVIAGDVNPSGHLTMSFPRS--VGQIPIYYNYKSTGRPVHTNNEEV 623
Query: 617 DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
D + Y + LYPFGYGLSYT F + V+LNK ++L +D+
Sbjct: 624 DHRPHYNAGYLDSSITPLYPFGYGLSYTTFAIS--------NVHLNK----KSLKRYNDS 671
Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
++VN QN G+T+G VV +Y++ + +K++ GFQ+
Sbjct: 672 -------IIVN-----------ASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQK 713
Query: 737 VFVRAGRNKRIKF 749
+ ++AG +K+++F
Sbjct: 714 ISLKAGESKQVRF 726
>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
DSM 14838]
Length = 863
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 167/460 (36%), Positives = 235/460 (51%), Gaps = 43/460 (9%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
+ D+SL R + LV +TL+EK + D + V RLG+ Y WW+EALHGV+ G
Sbjct: 23 YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80
Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
AT FP I ASFN + ++ AVS EARA + GLT W
Sbjct: 81 --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132
Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
+P +N+ RDPRWGR ET GEDP++ R V V+GLQ + + K+ +C
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG--------PADGKYDKLHACA 184
Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
KH+A + W +R+ F+A + +D+ ET+L PFE VKEG VMC+YNR G P
Sbjct: 185 KHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDP 241
Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLKAGLDLD 342
C +LL Q +RGEW G +V+DC +I ++ H D+ E A A + +G DL+
Sbjct: 242 CCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDA-ESASAAAVISGTDLE 300
Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSD 399
CG Y +V++G + E +D S+K L LG D P+ VS K + S
Sbjct: 301 CGSSYKALI-ESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASA 358
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
+ LA ARE + LL N N LPL + TVAV+GP+AN +V GNY G+P ++
Sbjct: 359 AHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPAHTVT 417
Query: 460 PIAGFSGYA----NVTYKTGCDDVACKSNNSIFAASEAAK 495
+ G + Y+ GC V S F+ ++ K
Sbjct: 418 ILDGVRNLLGTDDKLIYEQGCPWVERTLIQSAFSQCKSDK 457
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/312 (30%), Positives = 144/312 (46%), Gaps = 56/312 (17%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
D+ K + I + E K AD I +G+ S+E E + DR D+ LP Q
Sbjct: 581 DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 640
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
+LI+ + K +++++ G I +AIL A YPG++GG+A+A+V+FG +
Sbjct: 641 ELIDALHRAGKK---IILVNCSGSPIGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDY 697
Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
NP G+LP+T+Y V LP + GRTY++ L+PFGYGLSYT F
Sbjct: 698 NPAGKLPVTFYRN--VSQLP-------DFEDYNMTGRTYRYMQDVPLFPFGYGLSYTTFG 748
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
Y KT+ L+K N+L + V N G
Sbjct: 749 YG-----KTV---LDK-----------------------NELTAGQSLKLTVPVTNTGKR 777
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
+G +VV VY + + A IK + F+RV + AG+ ++F K L D +NT+
Sbjct: 778 NGEEVVQVYLRKQGD-AEGPIKTLRAFKRVSIPAGKTVNVEFDLKD-KELEWWDDQSNTV 835
Query: 768 -LPAGEHTIFVG 778
+ G + I VG
Sbjct: 836 RVCPGNYDIMVG 847
>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 682
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 199/611 (32%), Positives = 298/611 (48%), Gaps = 62/611 (10%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLG---------DFAHGVPRLGLPQYEWWSEALH 104
+CD SL R++DL+S +TLDEKV +G V R+GLP Y W E
Sbjct: 72 YCDMSLSIDERLEDLLSHLTLDEKVDMIGADPTQDVCMTHTMNVSRIGLPDYYWLVE--- 128
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL------- 157
+N G+ AT F + ASFN S W G TE RA+ N+
Sbjct: 129 --TNTAVGSACIAENKCATEFSGPLSIAASFNRSSWFLKGSVFGTEQRALMNVHGERFHT 186
Query: 158 --GRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
GR GLT + PNIN RDPR+GR +E PGEDPF+ G+YA + V+G+Q+ D
Sbjct: 187 HSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMVQGMQE-------RDA 239
Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
N P KV + KH+ AY + +G D Y+ ++ D+ +T+L +EM + +G A+ VM
Sbjct: 240 NGYP-KVLAYLKHFTAYSREEGRGNDDYN----ISMYDLFDTYLPQYEMGMVQGGATGVM 294
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLH-GYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
CSYN VNGIP+CA+ LLN+ +R W+ ++ DC ++ + A + A A
Sbjct: 295 CSYNAVNGIPACANDYLLNKILRQRWNRSDAHVTTDCGAVNNL-RGKPIQAADEAQAAAM 353
Query: 334 TLKAGLDLDCGQ--YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS--PQYV 389
L G D++ G + N T A+ G E ++++++ Y G FD ++
Sbjct: 354 ALMNGADIEMGSTLFVHNLT-TAITLGYATEEAVNQAIRRSYRPHFIAGRFDDPTLSEWF 412
Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
SLG DI S ++ E+ EAA +G+VLLK++ + LP+ A +AV+GP ++ +
Sbjct: 413 SLGLDDIQSKKHQEIQLEAALQGLVLLKHEDSILPI--AAGTKLAVLGPLGMTRSGLMSD 470
Query: 450 Y--------AGIPC-RYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
Y G C ++ GF T DV ++ + + + A D
Sbjct: 471 YESDQSCFGGGHDCIPTLAESIGFINGKEFTVAAAGVDVDSRNTSDVERILQLAADRDLI 530
Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
++ G + E E DR+D LPG Q L V + K PV+LV+++ G IA
Sbjct: 531 VLCLGNTKTQEQEGFDRKDTALPGQQYALFEAVLTLRK-PVVLVLVNGG--QIALDGMTG 587
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
AI+ A P GG A+A +FG+ N G+LP T Y +Q S ++
Sbjct: 588 YPSAIIEAFNPNGIGGTALAASLFGQENRWGKLPYTIYPYSVMQ-----SFDMKDHSMSA 642
Query: 621 YPGRTYKFYNG 631
PGRTY+++ G
Sbjct: 643 PPGRTYRYFTG 653
>gi|256819849|ref|YP_003141128.1| glycoside hydrolase [Capnocytophaga ochracea DSM 7271]
gi|256581432|gb|ACU92567.1| glycoside hydrolase family 3 domain protein [Capnocytophaga
ochracea DSM 7271]
Length = 804
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 195/655 (29%), Positives = 327/655 (49%), Gaps = 74/655 (11%)
Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
DVI G T FP + + S++ +L +K + + EA A G+ + ++P +++ RD
Sbjct: 158 DVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREASA------DGINWTFAPMVDITRD 211
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
RWGR E GEDP++ A V+G Q G +N L+S P + +C KH+A Y
Sbjct: 212 ARWGRAMEGAGEDPYLGSLIAEARVKGFQ---GGDNWQTLSS-PHTLLACGKHFAGYGAA 267
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
G D + A ++ + +L P+E +K G S+M S N +NG+P+ AD LL +
Sbjct: 268 E-SGKD--YNTAELSMHTLRNVYLPPYEATLKAG-VGSIMASLNEINGVPATADKWLLTE 323
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGN 353
+R EW +G +V+D I +V H D K+ A + AG+++D G + +
Sbjct: 324 VLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQVA-NLSANAGIEMDMNGATFIKYLSA 381
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGKQDICSDENIELAAEAARE 411
V++GKV E IDK+++++ + LG FD +Y+ + K++ ++E +++A +A
Sbjct: 382 LVKEGKVTENQIDKAVRHILEMKFLLGLFDDPYRYLDETRAKENTFTEEYLKVARQAVAS 441
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSGY-- 467
+VLLKN+ LP+ KT+AV+GP N T + G++ G + +S + G +
Sbjct: 442 SVVLLKNEAEALPIKKNSDKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLTGLTEKYK 501
Query: 468 ---ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
+ Y GC S + A A+ AD ++ G S ES R D+ LP
Sbjct: 502 GTNVKLLYAEGCGFTTI-STEQLKEAVAIARKADRVLVAVGEQSSWAGESAVRTDIRLPQ 560
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q QL+ + + K P+ ++ S +D+++ N N++AIL A +PG +GG IADV+
Sbjct: 561 AQRQLLEALKAINK-PIAIITFSGRPLDLSWE--NENVQAILQAWFPGTQGGNGIADVIA 617
Query: 585 GKFNPGGRLPITWYNGDYVQMLPL------TSMPL----RPVDSLGYPGRTYKFYNGPTL 634
G NP G L +++ V +P+ T P+ VD + Y + L
Sbjct: 618 GDVNPSGHLTMSFPRS--VGQIPIYYNYKSTGRPVHTNNEEVDHRPHYNAGYLDSSITPL 675
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFGYGLSYT F + V+LNK +++ +D+ ++VN
Sbjct: 676 YPFGYGLSYTTFAIS--------NVHLNK----KSIKRYNDS-------IIVN------- 709
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
QN G T+G VV +Y++ + +K++ GFQ++ ++AG +K+++F
Sbjct: 710 ----ASVQNTGRTEGEIVVQLYTRQLVASVSRPVKELKGFQKIPLKAGESKQVRF 760
>gi|153809437|ref|ZP_01962105.1| hypothetical protein BACCAC_03751 [Bacteroides caccae ATCC 43185]
gi|423292726|ref|ZP_17271288.1| hypothetical protein HMPREF1069_06331 [Bacteroides ovatus
CL02T12C04]
gi|149127897|gb|EDM19119.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
caccae ATCC 43185]
gi|392661162|gb|EIY54749.1| hypothetical protein HMPREF1069_06331 [Bacteroides ovatus
CL02T12C04]
Length = 859
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 219/798 (27%), Positives = 344/798 (43%), Gaps = 142/798 (17%)
Query: 51 SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD--------------------------- 83
SF + + LP +RV DL+ RMTL+EK+ Q+
Sbjct: 24 SFSYKNPLLPTELRVNDLLGRMTLEEKIAQIRHLHSWDVFDGQILNQEKLDKMCGGIGYG 83
Query: 84 FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
F G P RLG+P + +E+LHGV V G
Sbjct: 84 FFEGFPLTAASCRKTFREIQTYMVEKTRLGIPGFPV-AESLHGV-----------VHEGT 131
Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
T +P I ++FN L + + ++ E M +P I+V RD RWGR+ E
Sbjct: 132 TIYPQNIAMGSTFNPELAYEKTKHIAGELNTM-----GVKQVLAPCIDVVRDLRWGRVEE 186
Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
+ GEDPF+ + AV V+G + H +S KHY + + G++
Sbjct: 187 SFGEDPFLCSKMAVAEVKGYME---H-----------GISPMLKHYGPHG-NPLGGLNLA 231
Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
+ V +D+ + +L+PFE + E + +VM SYN N IP+ A +L +R +
Sbjct: 232 SVECGV--RDLFDIYLKPFEAVLAETEIMAVMSSYNSWNRIPNSASRFMLTDILRNRFGF 289
Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
GY+ +D + ++ HK D E A Q L AG+D++ + ++ G+
Sbjct: 290 RGYVYSDWGVVSMLKTFHKTAVDDFE-AARQVLTAGMDVEASSSCYAVLADKIRNGEFDI 348
Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT 422
+ ID++++ + LG F+ Q ++ + + S E+++L+ A E VLLKND
Sbjct: 349 SYIDQAVRRVLRAKFELGLFEDPYQEQAVYRLPLRSKESVKLSRRIADESTVLLKNDGQL 408
Query: 423 LPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--MSPIAGFSGY----ANVTYKTGC 476
LPLN +K+VAV+GP NA G+Y + ++P+ G + Y GC
Sbjct: 409 LPLNVRNLKSVAVIGP--NADNVQFGDYTWSKKKEDGVTPLQGIKNLLGDRVKINYAKGC 466
Query: 477 DDVACKSNNSIFAASEAAKTADATIILAG----------LDLSVEAESLDREDLWLPGYQ 526
+A + I A +AA+ +D +I G + S E +D D+ L G Q
Sbjct: 467 -SLASLDTSGIAEAVDAARHSDVALIFVGSSSTAFVRHTQEPSTSGEGIDLSDISLTGAQ 525
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
QLI +V V K PV++++++ G A NI AIL Y GE+ G +IAD++FG
Sbjct: 526 EQLIREVFAVGK-PVVVILVA--GKPFAIPWVKENIPAILAQWYAGEQEGNSIADILFGN 582
Query: 587 FNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
NP G+L ++ Y LP + + PGR Y F N L+ FGYGL
Sbjct: 583 VNPSGKLTFSFPQSTGHLPVYYNYLPTDKGYYKEPGTYEKPGRDYVFSNSSPLWAFGYGL 642
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
SYTQF+Y L +D + ND C V
Sbjct: 643 SYTQFEY---------------------LKAVTDKELYQA-----NDTVC-----VTVQL 671
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
+N G G +V+ VY + T +KQ+ GF++V + G+ + + + D
Sbjct: 672 KNTGKRTGKEVIQVYMRDVVSSVMTPVKQLKGFRKVDLLPGQTRETTIMI-PVHEFYLTD 730
Query: 762 YAANTLLPAGEHTIFVGN 779
N L +G+ + VG
Sbjct: 731 DLGNRYLESGKFELQVGT 748
>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
6_1_58FAA_CT1]
Length = 868
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 166/456 (36%), Positives = 238/456 (52%), Gaps = 47/456 (10%)
Query: 47 LQMSSFLFCDSSLPYS-------IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWW 99
L S+F F + PY R DL++RMTL EK Q+ + G+ RLG+ Y+WW
Sbjct: 12 LFFSAFSFRAENPPYKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWW 71
Query: 100 SEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA------ 153
+EALHG++ G AT FP I A+F+++ ++ VS E RA
Sbjct: 72 NEALHGIARAGK----------ATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKYHDFQ 121
Query: 154 ---MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
MYN G GLT+W+PNIN+ RDPRWGR ET GEDPF+ + + V+GLQ
Sbjct: 122 RKGMYN-GYKGLTFWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQ------- 173
Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGD 269
D + K +C KHYA + W +R+ ++A ++ +D+ ET+L F+ V EG
Sbjct: 174 -GDGTQKYDKAHACAKHYAVHSGPEW---NRHSYNAENISIRDLRETYLPAFKALVTEGK 229
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKE 328
VMC+YNR G P C++ LL ++ EW IV+DC +I S
Sbjct: 230 VKEVMCAYNRFEGEPCCSNKTLLINILKDEWGFDDVIVSDCGAIADFYTKGRHETHASAA 289
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-- 386
DA A + +G DL+CG Y A+++G + ET I++S+ L LG FD
Sbjct: 290 DASADAVISGTDLECGGSYWALD-EALEKGLITETKINESVFRLLRARFELGMFDDDSLV 348
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
+ S+ +C D++ A E AR+ +VLL N NTLPL S +K VAV+GP+AN +V +
Sbjct: 349 SWSSIPYSVVCCDKHKAKALEMARKSMVLLSNKNNTLPL-SKSIKKVAVMGPNANDSVML 407
Query: 447 IGNYAGIPCRYMSPIAGFSGY---ANVTYKTGCDDV 479
NY G P R ++ + G +V Y+ GCD V
Sbjct: 408 WANYNGTPDRSVTILEGIKAKLPEGSVIYEKGCDYV 443
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/282 (32%), Positives = 137/282 (48%), Gaps = 53/282 (18%)
Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
DV A +E K ADA I + G+ S+E E + DR ++ LP Q
Sbjct: 583 DVGLSRQIDYKAVAEKVKDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQK 642
Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
++ + E K PVI V+ S G +A + + N+ AIL A YPG+EGG A+ADV+FG +
Sbjct: 643 NMMKALKETGK-PVIFVLCS--GSTMALSWEDKNMDAILQAWYPGQEGGTAVADVLFGDY 699
Query: 588 NPGGRLPITWY-NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
NP GRLP+T+Y + D + +M S G GRTY+++ G LYPFG+GLSYT F
Sbjct: 700 NPAGRLPLTFYASSDDLPDFENYNM------SEG-QGRTYRYFKGKPLYPFGHGLSYTGF 752
Query: 647 KYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGS 706
Y+ + LNK + +D ++ +N G
Sbjct: 753 SYS--------KAKLNK-----------------------KSMSVNDSVFLSLNLKNTGL 781
Query: 707 TDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
DG +VV VY + + K + G++RV V+AG+ +K
Sbjct: 782 RDGDEVVQVYIRNLQDPEGPS-KSLRGYKRVSVKAGQTVPVK 822
>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
Length = 769
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 211/725 (29%), Positives = 339/725 (46%), Gaps = 115/725 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + E LHG++ ATS+P I A+FN L +KI A++
Sbjct: 110 RLGIPVI-FHEECLHGLA-----------AKDATSYPVPIGLAATFNPELIEKIFSAIAE 157
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
+AR+ R +P ++V RDPRWGR+ ET GED ++V + + V+GLQ +G
Sbjct: 158 DARS-----RGAHQALTPVVDVVRDPRWGRVEETFGEDTYLVSQMGIASVKGLQG-DGSL 211
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
N + KV + KH+AA+ G + A +E+ + +TFL PF+ + +
Sbjct: 212 NNNN------KVIATLKHFAAHGQPE-SGTN--CAPANFSERFLRDTFLMPFKEAIDKAG 262
Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF----LAD 325
SVM SYN ++GIPS A+ LL + +R EW+ G++V+D +I + + +A
Sbjct: 263 VISVMASYNEIDGIPSHANKWLLRKVLRDEWNFKGFVVSDYYAITELFHKEETVSHGVAA 322
Query: 326 SKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLG 380
+K +A L+AG+++ DC Y N T V+ G E+DID + + LG
Sbjct: 323 NKVEAAKLALEAGVNIEFPNPDC---YPNLT-EMVKGGLADESDIDALVLPMLKYKFELG 378
Query: 381 FFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
FD G+ + +++ ELA +AARE I LLKN+ N LPL K +AV+GP+A
Sbjct: 379 LFDNPYVEAEPGQFENKLEQDRELALQAARETITLLKNEGNLLPLKD--FKKIAVIGPNA 436
Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGC----------DDV----ACK 482
+ T ++G Y G P Y S G V Y GC D+V +
Sbjct: 437 DRT--LLGGYHGTPKYYTSVYQGIKDKVGKNGEVFYSEGCKITVGGSWNDDEVILPDPAE 494
Query: 483 SNNSIFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLINQVAEV 536
I A A+ +D +++ G + E+ DR L L G Q +L+ ++ +
Sbjct: 495 DEKLINEAVAVAQKSDVAVLVLGGNEQTSREAWNKKHLGDRPSLELVGRQNKLVEEILKT 554
Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
K PV++++ + I F N+ AIL Y G+E GRA+ADV+FG +NP G+LP++
Sbjct: 555 GK-PVVVLLFNGRPNSIGF--IKDNVPAILECWYLGQETGRAVADVLFGDYNPSGKLPVS 611
Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
+P ++ + P P R Y F + L+ FGYGLSYT+F ++ L +
Sbjct: 612 ---------IPRSAGHI-PAHYSHKPSARRGYLFDDVSPLFAFGYGLSYTKFSFDNLRLS 661
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
K + + D+ ++ +N G+ G +VV
Sbjct: 662 K-------------------------------DTISADEKVSVSIEVKNEGAIAGEEVVQ 690
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
+Y + +K++ GF+++ + G+ + F + L + + GE
Sbjct: 691 LYIRDKVSSVTRPVKELKGFRKITLAPGQTSTVVFEL-LPEHLAFTNVDMKFTVEPGEFE 749
Query: 775 IFVGN 779
I VGN
Sbjct: 750 IMVGN 754
>gi|387789382|ref|YP_006254447.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
3403]
gi|379652215|gb|AFD05271.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
3403]
Length = 771
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 219/775 (28%), Positives = 362/775 (46%), Gaps = 121/775 (15%)
Query: 65 VKDLVSRMTLDEKVQQLGDFAHGVPRL----------------------GLPQYEWWSEA 102
V DL+S+MTL+EK+ QL G L G+ E +A
Sbjct: 36 VNDLMSKMTLEEKIGQLNLVTPGGGILTGAVVSQSVEKKIMNGSVGGMFGIIGPEKIRKA 95
Query: 103 LHGVSNVG----PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
N P DVI G T+FP + AS+N L +K Q + EA A
Sbjct: 96 QELAVNKSRLKIPMIFGSDVIHGHKTTFPIPLGLAASWNIELIEKSAQIAAKEATA---- 151
Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
GL + +SP ++VARDPRWGRI E GEDP++ A V+G Q + +AT+L
Sbjct: 152 --DGLNWVFSPMVDVARDPRWGRIAEGSGEDPYLGSLIAKAMVKGYQGDNTYSSATNL-- 207
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
+C KH+A Y G D D ++ Q M E +L P++ V+ G SVM S
Sbjct: 208 -----MACVKHFALYGAAE-AGRDYNSVD--MSRQKMYEFYLPPYKAAVEAG-VGSVMSS 258
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
+N V G+P+ + LL +R +W +G +V+D S+ M+++ + ++ A +K
Sbjct: 259 FNEVEGVPATGNQWLLTDLLRKQWGFNGMVVSDYTSVNEMMEHG---MGNLQEVSALAIK 315
Query: 337 AGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK-- 393
AGLD+D G+ Y + ++Q+GKV ETDI+ + + + +LG F ++++ +
Sbjct: 316 AGLDMDMVGEGYLSTLQKSLQEGKVSETDINLACRRILEAKYKLGLFSDPYKFINEKRAA 375
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
+I + +++ + EAA VLLKN++ LPL K T+A++GP A++ M+G +A +
Sbjct: 376 TEILTTQSLSFSREAATRSFVLLKNEKQVLPLK--KTGTIALIGPLADSKRNMLGTWA-V 432
Query: 454 PCRYMSPIAGFSG-------YANVTYKTGCD------------------DVACKSNNSIF 488
+ + ++ G +A V Y G + D+ +S+ +
Sbjct: 433 SGNWKTSVSVKEGLMNAVGTHAKVLYAKGANISDDSAFARRVNTFGVEIDIDKRSSKELL 492
Query: 489 -AASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
A A+ +D I+ G + E+ R D+ +P Q +L+ + + K PV++V+ +
Sbjct: 493 DEALSIAQQSDVIIVAVGEAADMSGEAASRTDINIPESQKELLKALVQTGK-PVVMVLFN 551
Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW-YNGDYVQML 606
G + + N ++ AIL PG + G AIADV+FG +NP G++ +T+ N V M
Sbjct: 552 --GRPLTLSWENEHLNAILDVWAPGHQAGNAIADVLFGDYNPSGKITVTFPKNVGQVPMY 609
Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPT---LYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
RP D T K+ + P +YPFGYGLSYT F+Y ++ +
Sbjct: 610 YNHKNTGRPYDDRNR--FTSKYLDMPDNAPMYPFGYGLSYTTFQYGDVTIDQ-------- 659
Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEI 723
T PG + KV N G+ DG + V +Y +
Sbjct: 660 --------------DTIKPG---------ETITAKVTITNTGNYDGVETVQLYIQDVIAS 696
Query: 724 AATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
A +K + GF+++ ++ G +K ++FV + + L + + AG+ +F+G
Sbjct: 697 VAPPVKTLKGFKQISLKKGESKVVEFVISE-EDLRFYNANLEHVSEAGDFNLFIG 750
>gi|319901526|ref|YP_004161254.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
gi|319416557|gb|ADV43668.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
P 36-108]
Length = 750
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 211/736 (28%), Positives = 338/736 (45%), Gaps = 112/736 (15%)
Query: 64 RVKDLVSRMTLDEKV---QQLGDFAHGVPRLGLPQYEWWSEALHGV-------------- 106
++++L+S MTL+EK+ Q+ + + +GL + L+ V
Sbjct: 34 KIENLLSDMTLEEKLGQMNQISSYGNIEDMIGLIKKGEVGSILNEVDAVRVNALQRVAVE 93
Query: 107 -SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
S +G P DVI G T FP + A+F+ + K + + EA ++ G+
Sbjct: 94 ESRLGIPLLMARDVIHGFKTIFPIPLGQAATFDPEVAKDGARIAAIEASSV------GVR 147
Query: 164 Y-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
+ ++P I+++RDPRWGRI E+ GED ++ V+G Q LNS P ++
Sbjct: 148 WTFAPMIDISRDPRWGRIAESCGEDVYLSSVMGSAMVKGFQ-------GDSLNS-PTSIA 199
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+C KH+ Y R + ++E+ + + PFE K G A+ M S+N +G
Sbjct: 200 ACAKHFVGYGAAEG---GRDYNSTFISERSLRNVYFPPFEAAAKAGVAT-FMTSFNDNDG 255
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
+PS + +L +RGEW G +V D +S + M+ H F AD K DA + AG+D++
Sbjct: 256 VPSTGNKFILKDVLRGEWGFDGLVVTDWNSAREMI-AHGFAADDK-DAATLAVNAGVDME 313
Query: 343 CGQY--YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
Y + N ++ GKVKE ID+++K + V RLG FD YV + + DE
Sbjct: 314 MVSYAFFKNLP-EQIKSGKVKEEVIDEAVKNILRVKFRLGLFDNP--YVDEKRPSVMYDE 370
Query: 401 -NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRY 457
++ A AA E ++LLKN++ LPL V+TVAVVGP A+A +G + G
Sbjct: 371 SHLAAAKRAAEESVILLKNEREVLPLKET-VRTVAVVGPMADAPYEQLGTWVFDGEKSHT 429
Query: 458 MSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
+P+A + V Y+ G K+ I A AD I G + + E
Sbjct: 430 QTPLAAIRSIYGDKVQVVYEPGLTYSRDKNVAGIAKAVSVTAHADVVIAFVGEEAILSGE 489
Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
+ DL L G Q++LI +A+ K P++ V+M+ G + + A+L++ +PG
Sbjct: 490 AHSLADLNLQGAQSELIAALAKTGK-PLVTVVMA--GRQLTIGKEAEESDAVLYSFHPGT 546
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL----------TSMPLRPVDSLGYPG 623
GG AIAD++FGK P G+ P+T+ V +PL S+ +P++ + P
Sbjct: 547 MGGPAIADLLFGKAVPSGKTPVTFLKA--VGQIPLYYAHNNSGRPASLNYKPLEEI--PV 602
Query: 624 RTYKFYNGPT----------LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
+ G + LYPFGYGLSYT FKY
Sbjct: 603 EAGQTSEGSSSSYMDAGVQPLYPFGYGLSYTTFKYG------------------------ 638
Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
P + +L D D +N G +G++VV +Y + +K++
Sbjct: 639 -------KPKISSRELSSKDVLTVVFDLENTGRYEGTEVVQLYVQDKVASVTRPVKELKR 691
Query: 734 FQRVFVRAGRNKRIKF 749
F RV +++G K + F
Sbjct: 692 FTRVTLKSGEKKTVTF 707
>gi|429745624|ref|ZP_19279029.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
380 str. F0488]
gi|429168470|gb|EKY10301.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
380 str. F0488]
Length = 770
Score = 256 bits (655), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 195/655 (29%), Positives = 327/655 (49%), Gaps = 74/655 (11%)
Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
DVI G T FP + + S++ +L +K + + EA A G+ + ++P +++ RD
Sbjct: 124 DVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREASA------DGINWTFAPMVDITRD 177
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
RWGR E GEDP++ A V+G Q G +N L+S P + +C KH+A Y
Sbjct: 178 ARWGRAMEGAGEDPYLGSLIAEARVKGFQ---GGDNWQMLSS-PHTLLACGKHFAGYGAA 233
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
G D + A ++ + +L P+E + G S+M S N +NG+P+ AD LL +
Sbjct: 234 E-SGKD--YNTAELSMHTLRNVYLPPYEATLNAG-VGSIMASLNEINGVPATADKWLLTE 289
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGN 353
+R EW +G +V+D I +V H D K+ A + AG+++D G + +
Sbjct: 290 VLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQ-AANLSANAGIEMDMNGATFIKYLSA 347
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGKQDICSDENIELAAEAARE 411
V++GKV E IDK+++++ + LG FD +Y+ + K++ ++E +++A +A
Sbjct: 348 LVKEGKVTEAQIDKAVRHILEMKFLLGLFDDPYRYLDETRAKENTFTEEYLKVARQAVAS 407
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSGY-- 467
+VLLKN+ LP+ KT+AV+GP N T + G++ G + +S + G +
Sbjct: 408 SVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLTGLTEKYK 467
Query: 468 ---ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
+ Y GC S + A A+ AD ++ G S ES R D+ LP
Sbjct: 468 GTNVKLLYAEGCG-FTTISTEQLKEAVAIARKADRVLVAVGEQSSWAGESAVRTDIRLPQ 526
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q QL+ + + K P+ +V S +D+++ N N++AIL A +PG +GG IADV+
Sbjct: 527 AQRQLLEALKAINK-PIAIVTFSGRPLDLSWE--NENVQAILQAWFPGTQGGNGIADVIA 583
Query: 585 GKFNPGGRLPITWYNGDYVQMLPL------TSMPL----RPVDSLGYPGRTYKFYNGPTL 634
G NP G L +++ V +P+ T P+ VD + Y + L
Sbjct: 584 GDVNPSGHLTMSFPRS--VGQIPIYYNYKSTGRPVYTNNEEVDHRPHYNAGYLDSSITPL 641
Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
YPFGYGLSYT F + V+LNK +++ +D+ ++VN
Sbjct: 642 YPFGYGLSYTTFAIS--------NVHLNK----KSIKRYNDS-------IIVN------- 675
Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
QN G+T+G VV +Y++ + +K++ GFQ++ ++AG +K+++F
Sbjct: 676 ----ASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQKISLKAGESKQVRF 726
>gi|149280000|ref|ZP_01886125.1| putative beta-glucosidase [Pedobacter sp. BAL39]
gi|149229197|gb|EDM34591.1| putative beta-glucosidase [Pedobacter sp. BAL39]
Length = 793
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 212/722 (29%), Positives = 345/722 (47%), Gaps = 109/722 (15%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + E HG +G T FPT I +++++ +L K++ A++
Sbjct: 133 RLGIPML-FSEECPHGHMAIG-----------TTVFPTSIGQSSTWDPALIKEMAAAIAM 180
Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
E R + G + P +++AR+PRW R+ ET GEDP + R V G Q
Sbjct: 181 ETRL-----QGGHIGYGPVLDLAREPRWSRVEETYGEDPVLNSRMGEAMVSGFQ------ 229
Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT--EQDMEETFLRPFEMCVKE 267
T++ S + + S KH+ AY V + H VT +++ +++L PF+ VK
Sbjct: 230 -GTNIGS-GVNILSTLKHFTAYGVP-----EGGHNGGSVTVGNRELFQSYLPPFKAAVKA 282
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
G A SVM +YN V+GIP ++ LL +RG+W +G++V+D +SI + NH +A S
Sbjct: 283 G-ALSVMTAYNSVDGIPCSSNRYLLTDILRGQWGFNGFVVSDLNSISGLEGNH-HVASSA 340
Query: 328 EDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
+A A + AGLD D Y Y AV G VK +D +L + + +G F+
Sbjct: 341 TEAAALAMNAGLDADLSGYGYGPALVKAVNGGLVKMATVDTALARVLRLKFNMGLFENPY 400
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
++ + + +++ LA + A+E +VLLKN++N LPL+ A +K +AV+GP+A+
Sbjct: 401 VNPKQAEKQVMNAKHVTLARKVAQESVVLLKNEKNILPLSKA-LKNIAVIGPNADNVYNQ 459
Query: 447 IGNYA-----GIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
+G+Y G ++ I A S V Y+ GC S A + A+K+ A
Sbjct: 460 LGDYTAPQADGKVITVLNGIRAKVSKETGVFYQKGCAIRDTASAGIAAAVALASKSDVAI 519
Query: 501 IILAG---LDLSVE---------------------AESLDREDLWLPGYQTQLINQVAEV 536
++L G D E E DR L L G Q +L+ V +
Sbjct: 520 VVLGGSSARDFKTEYQNTGAAEVKASAVAVSDMESGEGFDRSTLDLMGRQMELLRAVVKT 579
Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
PV++V++ + + +A N+ A++ A YPG+EGG AIADV+FG +NP GRL ++
Sbjct: 580 GT-PVVVVLIKGRPLTLNWAA--ENVAAMVDAWYPGQEGGNAIADVLFGDYNPAGRLSVS 636
Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
V LP+ RP+ Y + LY FGYGLSY+ F+Y+ L
Sbjct: 637 VPKS--VGQLPVYYNKKRPLP------HNYVELDEQPLYSFGYGLSYSTFEYSNL----- 683
Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
KT G D+R F D +N GS DG +VV +Y
Sbjct: 684 ---------------------KTNVSG-RGKDVRVQVTF----DLKNTGSRDGDEVVQLY 717
Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
+ T ++Q+ F+R+ +++G+ +++ F +A + L +++ + G+ ++
Sbjct: 718 LRDEQSSVVTPMQQLKQFRRLSLKSGQQQQLSFELSA-EDLQLMNQQMEWQVEPGDFSLM 776
Query: 777 VG 778
VG
Sbjct: 777 VG 778
>gi|300778434|ref|ZP_07088292.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
gi|300503944|gb|EFK35084.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
Length = 740
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 215/762 (28%), Positives = 351/762 (46%), Gaps = 109/762 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFD------- 116
+V +L+S+MTL+EKV Q+ ++ G PQ+ + L + G+ +
Sbjct: 26 KVAELLSKMTLEEKVGQMVQYS-GFEYATGPQHSNSAAVLDEIKKGKVGSMLNVAGSEET 84
Query: 117 --------------------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
DVI G T+FP I AS++ + +K + +TEA A Y
Sbjct: 85 RAFQKLAMQSRLKIPLLFGQDVIHGYRTTFPVNIGQAASWDLGMIEKSERIAATEA-AAY 143
Query: 156 NLGRAGLTYWS--PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
+ +W+ P +++ARDPRWGR+ E GED ++ + + ++G Q +
Sbjct: 144 GI------HWTFAPMVDIARDPRWGRVMEGSGEDTYLGTKIGLARIKGFQG----KGLGS 193
Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
L++ V +C KH+AAY G D D + + + ET+L PF+ + G ++
Sbjct: 194 LDA----VMACAKHFAAYGA-AVGGRDYNSVDMSLRQ--LNETYLPPFKAAAEAG-VATF 245
Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
M S+N +NGIP+ A+ + ++G+W+ G++V+D SI M+ H + D+ + A +
Sbjct: 246 MNSFNDINGIPATANQYIQRNLLKGKWNYKGFVVSDWGSIGEMIP-HGYAKDAAQ-AAER 303
Query: 334 TLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG 392
++ G D+D + Y V++GKV +D + + T ++G FD ++ +
Sbjct: 304 AVQGGSDMDMESRVYMAELPKLVKEGKVDAKLVDDAAGRILTKKFQMGLFDDPYRFSNEK 363
Query: 393 KQDICSD--ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
+Q +D EN + E + IVLLKN N LPL S KTVA++GP TVA G +
Sbjct: 364 RQKEQTDNQENRKFGREFGSKSIVLLKNHGNILPL-SKNTKTVALIGPFGKETVANHGFW 422
Query: 451 A-GIPCRYMSPIAGFSGYAN-------VTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
+ ++ F G N + Y GC+ V + A E A+ AD I+
Sbjct: 423 SVAFKDDNQRIVSQFDGIKNQLDKNSTLLYAKGCN-VDDQDKTQFAEAIETARRADVVIM 481
Query: 503 LAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
G ++ E+ R ++ G Q L+ ++A+ K P+IL+I + G + F + NI
Sbjct: 482 TLGEGHAMSGEAKSRSNIGFTGVQEDLLQEIAKTGK-PIILMINA--GRPLIFNWASDNI 538
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPV 616
AI++ + G E G +IADV+FGK NPGG+LP+T+ + +P+ T P +
Sbjct: 539 PAIMYTWWLGTEAGNSIADVLFGKVNPGGKLPMTFPRTE--GQIPVYYNHYNTGRPAKNN 596
Query: 617 DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
Y N P YPFGYGLSYT FKY+ + +
Sbjct: 597 TDRNYVSAYIDLDNDPK-YPFGYGLSYTDFKYSDMVLSSA-------------------- 635
Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
+L + V N G DG +VV +Y + +K++ GFQ+
Sbjct: 636 -----------NLTGNQTLNISVTVSNTGKYDGEEVVQLYVRDLFGKVVRPVKELKGFQK 684
Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
VF++ G +K+I F + L D N GE I +G
Sbjct: 685 VFIKKGESKKIDFKLTP-EDLKFFDDELNFDWEGGEFDIMIG 725
>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
Length = 814
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 240/853 (28%), Positives = 360/853 (42%), Gaps = 147/853 (17%)
Query: 8 LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
L+CF + +F A + G F L + + S P RV+
Sbjct: 5 LICFLMLSVFFIFPVRAKNTFGKKKDKVTRL--HFYDLNKNGRMDTYENPSAPVEYRVEH 62
Query: 68 LVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT-- 113
L+S+MTL+EKV Q+ G+ P+L E+ +L G P T
Sbjct: 63 LLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIGSLWGFMRADPWTQR 122
Query: 114 ------------------------HFDDVIP--------------GATSFPTVILTTASF 135
H IP G T FPT I +++
Sbjct: 123 TLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIGTTVFPTSIGQASTW 182
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
N L +++G+ ++ EA A + + P +++ARDPRW R+ ET GEDP++ G
Sbjct: 183 NPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGAMG 237
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEE 255
VRG Q E D S V + KH+A+Y W A + E+++EE
Sbjct: 238 TALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGHNGGTAHIGERELEE 286
Query: 256 TFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQV 315
PF V G A SVM SYN ++G P LL ++ W G++V+D ++
Sbjct: 287 AIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGG 345
Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
+ ++ +A + +A + + AG+D D G Y AV++G V IDK+++ + +
Sbjct: 346 LREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILS 403
Query: 375 VLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
+ ++G FD Q + S E+ LA E AR+ IVLLKN LPL ++T+A
Sbjct: 404 LKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKK-DIRTLA 462
Query: 435 VVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIF 488
V+GP+A+ M+G+Y G + I S V Y GC V S
Sbjct: 463 VIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGC-AVRDSSRTGFK 521
Query: 489 AASEAAKTADATIILAG----LDLSVE-------------------AESLDREDLWLPGY 525
A E A+ ADA +++ G D S E E DR L L G
Sbjct: 522 DAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGR 581
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +L+ +++ + K PV+LV++ G + +AI+ A YPG +GG A+ADV+FG
Sbjct: 582 QLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYPGMQGGNAVADVLFG 638
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
+NP GRL ++ V LP+ R G R Y G YPFGYGLSYT
Sbjct: 639 DYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPGTPRYPFGYGLSYTT 691
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F Y + +QV +D R D V QN G
Sbjct: 692 FSYTDMK----VQVTEGS-----------------------DDCRVD----VTVTIQNQG 720
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
+ DG +V +Y + T KQ+ F R+ ++AG ++ + F + KSL +
Sbjct: 721 TADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGE 779
Query: 766 TLLPAGEHTIFVG 778
++ G TI VG
Sbjct: 780 WVVEPGRFTIMVG 792
>gi|429756169|ref|ZP_19288778.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
324 str. F0483]
gi|429171889|gb|EKY13478.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
324 str. F0483]
Length = 755
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 210/738 (28%), Positives = 354/738 (47%), Gaps = 109/738 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLG---LPQYEWWSEA------LHGVSNVG---- 110
RV ++ MTL+EK+ Q+ F+ G +Y+ + E + S VG
Sbjct: 31 RVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVVGIRKL 90
Query: 111 ------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
P DVI G T FP + + S++ +L +K + + EA A
Sbjct: 91 QKIAVEQTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREATA---- 146
Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
G+ + ++P +++ RD RWGR E GEDP++ A V+G Q G +N L+S
Sbjct: 147 --DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQ---GGDNWQTLSS 201
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
P + +C KH+A Y G D + A ++ + +L P+E + G S+M S
Sbjct: 202 -PHTLLACGKHFAGYGAAE-SGKD--YNTAELSMHTLRNVYLPPYEATLNAG-VGSIMAS 256
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
N +NG+P+ AD LL + +R EW +G +V+D I +V H D K+ A +
Sbjct: 257 LNEINGVPATADKWLLTEELRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQ-AANLSAN 314
Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGK 393
AG+++D G + + V++GK E IDK+++++ + LG FD +Y+ + K
Sbjct: 315 AGIEMDMNGATFIKYLSALVKEGKATEAQIDKAVRHILEMKFLLGLFDDPYRYLDETRAK 374
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-- 451
++ ++E +++A +A +VLLKN+ LP+ KT+AV+GP N T + G++
Sbjct: 375 ENTFTEEYLKVARQAVASSVVLLKNEAEVLPIKKNSGKTIAVIGPMMNNTSDINGSWTCL 434
Query: 452 GIPCRYMSPIAGFSGY-----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
G + +S ++G + + Y GC S + A A+ AD ++ G
Sbjct: 435 GDGKQSVSLLSGLTQKYKGTNVKLLYAEGCG-FTTISTEQLKEAVAIARKADRVLVAVGE 493
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
S ES R D+ LP Q QL+ + + K P+ ++ S +D+++ N N++AIL
Sbjct: 494 QSSWAGESAVRTDIRLPQAQRQLLEALKAINK-PITIITFSGRPLDLSWE--NENVQAIL 550
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMP-------- 612
A +PG +GG IADV+ G NP G L +++ V +P+ T P
Sbjct: 551 QAWFPGTQGGNGIADVIAGDVNPSGHLTMSFPRS--VGQIPIYYNYKNTGRPVYTNNEEV 608
Query: 613 -LRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
LRP + GY + LYPFGYGLSYT F + V+LNK +++
Sbjct: 609 DLRPHYNAGYLDSSIT-----PLYPFGYGLSYTTFAIS--------NVHLNK----KSMK 651
Query: 672 YTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
+D+ ++VN QN G+T+G V+ +Y++ + +K++
Sbjct: 652 RYNDS-------IIVN-----------ASVQNTGTTEGEIVLQLYTRQLVASVSRPVKEL 693
Query: 732 IGFQRVFVRAGRNKRIKF 749
GFQ++ ++AG +K+++F
Sbjct: 694 KGFQKISLKAGESKQVRF 711
>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
Length = 814
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 240/853 (28%), Positives = 360/853 (42%), Gaps = 147/853 (17%)
Query: 8 LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
L+CF + +F A + G F L + + S P RV+
Sbjct: 5 LICFLMLSVFFIFPVRAKNTFGKKKDKVTRL--HFYDLNKNGRMDTYENPSAPVEYRVEH 62
Query: 68 LVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT-- 113
L+S+MTL+EKV Q+ G+ P+L E+ +L G P T
Sbjct: 63 LLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIGSLWGFMRADPWTQR 122
Query: 114 ------------------------HFDDVIP--------------GATSFPTVILTTASF 135
H IP G T FPT I +++
Sbjct: 123 TLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIGTTVFPTSIGQASTW 182
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
N L +++G+ ++ EA A + + P +++ARDPRW R+ ET GEDP++ G
Sbjct: 183 NPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMG 237
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEE 255
VRG Q E D S V + KH+A+Y W A + E+++EE
Sbjct: 238 TALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGHNGGTAHIGERELEE 286
Query: 256 TFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQV 315
PF V G A SVM SYN ++G P LL ++ W G++V+D ++
Sbjct: 287 AIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGG 345
Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
+ ++ +A + +A + + AG+D D G Y AV++G V IDK+++ + +
Sbjct: 346 LREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILS 403
Query: 375 VLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
+ ++G FD Q + S E+ LA E AR+ IVLLKN LPL ++T+A
Sbjct: 404 LKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKK-DIRTLA 462
Query: 435 VVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIF 488
V+GP+A+ M+G+Y G + I S V Y GC V S
Sbjct: 463 VIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGC-TVRDSSRTGFK 521
Query: 489 AASEAAKTADATIILAG----LDLSVE-------------------AESLDREDLWLPGY 525
A E A+ ADA +++ G D S E E DR L L G
Sbjct: 522 DAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGR 581
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +L+ +++ + K PV+LV++ G + +AI+ A YPG +GG A+ADV+FG
Sbjct: 582 QLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYPGMQGGNAVADVLFG 638
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
+NP GRL ++ V LP+ R G R Y G YPFGYGLSYT
Sbjct: 639 DYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPGTPRYPFGYGLSYTT 691
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F Y + +QV +D R D V QN G
Sbjct: 692 FSYTDMK----VQVTEGS-----------------------DDCRVD----VTVTIQNQG 720
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
+ DG +V +Y + T KQ+ F R+ ++AG ++ + F + KSL +
Sbjct: 721 TADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGE 779
Query: 766 TLLPAGEHTIFVG 778
++ G TI VG
Sbjct: 780 WVVEPGRFTIMVG 792
>gi|224025503|ref|ZP_03643869.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
18228]
gi|224018739|gb|EEF76737.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
18228]
Length = 787
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 213/741 (28%), Positives = 344/741 (46%), Gaps = 119/741 (16%)
Query: 90 RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
RLG+P + E HG +G AT FPT + +++NESL +++G+ +
Sbjct: 125 RLGIPVL-FAEECPHGHMAIG-----------ATVFPTSMGQASTWNESLIRQMGEVIGL 172
Query: 150 EARAM-YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
EAR N+G + P +++AR+PRW R+ ET GEDP++ G +V+G+Q +
Sbjct: 173 EARLQGANIG------YGPVLDIAREPRWSRVEETFGEDPYLTGILGTAFVQGMQGKDFK 226
Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD---ARVTEQDMEETFLRPFEMCV 265
+ V S KH AAY GV R + A + + + + +L F+ V
Sbjct: 227 DGR--------HVYSTLKHLAAY------GVPRGGHNGGPADMGLRALLDEYLPGFQRAV 272
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
+ G A++VM SYN ++G+P ++ L++ +R W G++ +D SI + H +A
Sbjct: 273 EVGKAATVMTSYNSIDGVPCTSNKFLIDSLLRKRWGFDGFVYSDLASIDGIAGAH--VAA 330
Query: 326 SKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG- 384
+ EDA Q ++AG D+D G AVQ GKVKE+ I++++ + + R+G F+
Sbjct: 331 NLEDAAIQAVEAGTDMDLGANAYRRLVKAVQTGKVKESAINRAVSNVLRLKFRMGLFEQP 390
Query: 385 --SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
SP+ + + C D + LA + AREG VLLKN+ LPL KVK +AV+GP+A+
Sbjct: 391 YVSPEEAA--RLVNCEDHRM-LARKIAREGTVLLKNN-GILPL--GKVKRIAVIGPNADV 444
Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGYAN------VTYKTGCDDVACKSNNSIFAASEAAKT 496
+G+Y P + N + Y GC + + ++I A EAA+
Sbjct: 445 MYNYLGDYTA-PQERSKVVTLLDALRNRMPDVRIDYVKGC-AIRDTTQSNIKEAVEAARK 502
Query: 497 ADATIILAG----LDLSVE----------------------AESLDREDLWLPGYQTQLI 530
AD I+ G D + E DR L L G Q +LI
Sbjct: 503 ADLVILAVGGSSARDFKTKYINTGAATVDSENSGILSDMECGEGFDRATLDLLGDQEKLI 562
Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
+A K P++ V ++ +++ A ++ A+L A YPGE+GG I DV+ G++NP
Sbjct: 563 RAIAATEK-PLVTVYIAGRPLNMNLASEVSD--ALLTAWYPGEQGGNGIVDVLTGEYNPS 619
Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
GRLP++ +V +P+ D + PG+ LY FGYGLSYT F Y+
Sbjct: 620 GRLPMSVPR--HVGQIPVHYSQGTLRDYMDCPGK--------PLYTFGYGLSYTTFAYSN 669
Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
L + T + AS+ ++ + C N G DG
Sbjct: 670 LKLSATAKA----------------ASQPAGDNEVMQTITC--------TVTNTGDRDGD 705
Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA 770
+VV +Y A ++ GFQ++F++ G ++ + F + L+I D N
Sbjct: 706 EVVQLYLNDEVSSVAVPPIRLKGFQKIFLKKGESREVTFQLTR-QDLSIYDRNMNFTAEP 764
Query: 771 GEHTIFVGNGGVSFPIHLNFN 791
G + +G + P+ +F
Sbjct: 765 GRFNVMIGGSSDNLPLKGSFE 785
>gi|268316106|ref|YP_003289825.1| glycoside hydrolase [Rhodothermus marinus DSM 4252]
gi|262333640|gb|ACY47437.1| glycoside hydrolase family 3 domain protein [Rhodothermus marinus
DSM 4252]
Length = 754
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 229/768 (29%), Positives = 360/768 (46%), Gaps = 116/768 (15%)
Query: 65 VKDLVSRMTLDEKVQQL----GDFAHGVP-------------RLGLPQYEWWSEALHGV- 106
++ L++RMTL+EK+ QL G A P R+G + +EA+ +
Sbjct: 33 IEALLARMTLEEKLGQLTLYNGGMAETGPVVREGEPDAIRRGRVGAVMNFFGAEAVCAMQ 92
Query: 107 ------SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
S +G P DVI G T FP + A+F+ +L ++ + + EA A+
Sbjct: 93 RQAVEESRLGIPLLFALDVIHGFRTIFPVPLAEAATFDPALVEQAARVAAGEASAV---- 148
Query: 159 RAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
GL + ++P +++ARD RWGRI E GEDP++ A VRG Q DL
Sbjct: 149 --GLNWTFAPMVDIARDARWGRIVEGSGEDPYLGAVMAAARVRGFQ-------GRDLRD- 198
Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
P + + KH+AAY G D D V+E+ + E +L PFE V+ G A S+M ++
Sbjct: 199 PTTILATAKHFAAYGAAE-AGRDYNTVD--VSERTLREVYLPPFEAAVRAG-ALSIMSAF 254
Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
N + G+P+ AD LL +R EW G +V+D S+ ++ H ADS E + L+A
Sbjct: 255 NEIGGVPATADRWLLTDVLRHEWGFEGLVVSDYTSVWELL-FHGIAADSAEVG-RKALEA 312
Query: 338 GLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGKQ 394
G+D+D Y V+ G++ E +D++++ + V RLG F+ +Y + +Q
Sbjct: 313 GVDMDMVSGIYVRKLAEEVRAGRLSEAVVDEAVRRVLRVKYRLGLFEDPYRYCRDASREQ 372
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--G 452
+ S + LA E AR+ IVLLKN+ LPL ++ VAV+G AN + +++G +A G
Sbjct: 373 VLLSPAHRRLAREVARKAIVLLKNEGELLPLAD-TLQRVAVIGALANDSASVLGPWAAAG 431
Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDV-----------ACKSNNSIFAASEA-AKTA 497
P ++ + G A V Y G +V A + S FA +EA A+ A
Sbjct: 432 RPEDAVTILEGIRAALPGATVRYAPGYAEVPSGSFQEMVAAALSPDTSGFAEAEAVARWA 491
Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
+ I++ G + E+ R + LPG Q L ++ + + PV++V+M+ G +A E
Sbjct: 492 EVVILVLGEHRELSGEAASRASVELPGVQLALAWRLLALGR-PVVVVLMN--GRPLAIPE 548
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
+ AI+ A + G E G A+ADV+ GK +PGGRLP+++ + L P
Sbjct: 549 LAASAPAIVEAWFLGTEMGHAVADVLLGKASPGGRLPVSFPRATGQEPLYYNHKP----- 603
Query: 618 SLGYPGR-----TYKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
G P R T K+ + P LYPFGYGL+YT F Y+ L ++
Sbjct: 604 -TGRPPRAEEKYTSKYVDVPWTPLYPFGYGLTYTTFAYDSLRLSRRRLG----------- 651
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
DD E V N G G +VV +Y + +K+
Sbjct: 652 --------------------LDDTLEVVVSVTNTGRRRGEEVVQLYVRDEVASVTRPVKE 691
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ GF RV + G K ++F ++L ++ G T++VG
Sbjct: 692 LKGFARVELAPGETKAVQFRLP-VRALRFWGLEGGWVVEPGWFTLWVG 738
>gi|423271149|ref|ZP_17250120.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
CL05T00C42]
gi|423274973|ref|ZP_17253919.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
CL05T12C13]
gi|392699073|gb|EIY92255.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
CL05T00C42]
gi|392704252|gb|EIY97391.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
CL05T12C13]
Length = 859
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 215/769 (27%), Positives = 337/769 (43%), Gaps = 146/769 (18%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
++F + ++SLP +RV+DL+SRMTL+EK+ Q+
Sbjct: 22 TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81
Query: 84 -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
F G+ RLG+P + +E+LHG V G
Sbjct: 82 GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T FP I ++FN L ++ A++ E L G+T +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GEDPF+V R V+ VRG D + VS KH+ A+ G++
Sbjct: 184 EECFGEDPFLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
++++ +L+ FE VKE +VM SYN N P+ + L+ + +R W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
D GY+ +D +I ++ HK +S E A+ Q L AGLD + V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID+++ + T +G F+ + + + ++ LA + A E IVLL+N+
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
N LPL K+K++AV+GP NA G+Y G+ + + G + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
GC D+ + A + AK +D I++ G + A E D DL L
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q L+ + K PVI+V++S G A + NI I+ YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADML 577
Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
GK NP G+L ++ Y LP R S PG+ Y F + L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
+GLSYT F+Y LS T + + D C+D E
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+ +N G DG +V VY + ++++ GF++V ++ G K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715
>gi|423214254|ref|ZP_17200782.1| hypothetical protein HMPREF1074_02314 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693199|gb|EIY86434.1| hypothetical protein HMPREF1074_02314 [Bacteroides xylanisolvens
CL03T12C04]
Length = 735
Score = 255 bits (652), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 213/772 (27%), Positives = 355/772 (45%), Gaps = 105/772 (13%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------------VP-RLGLPQYE 97
L+ D+ P RV DL+SRMTL+EK+ QL + G VP +G Y
Sbjct: 29 LYKDAKAPIEKRVDDLLSRMTLEEKILQLNQYTMGRNNNVNNIGEEVKKVPAEIGSLIYY 88
Query: 98 WWSEALHG--------VSNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQAV 147
+ AL S +G F D I G T +P + S+N L +K
Sbjct: 89 DTNPALRNNVQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLGQACSWNPELVEKACAVT 148
Query: 148 STEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
+ EAR +G+ + +SP I+VARDPRWGR+ E GEDP+ G +A VRG Q
Sbjct: 149 AQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYANGVFAAASVRGYQG-- 200
Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
D S ++++C KHY Y R + ++ Q + +T+L P+EM VK
Sbjct: 201 ------DDMSAEDRIAACLKHYIGYGASE---AGRDYVYTEISAQTLWDTYLLPYEMGVK 251
Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
G A+++M S+N ++G+P A+ + + ++ W G+IV+D +I+ + ++ LA +
Sbjct: 252 AG-AATLMSSFNDISGVPGSANHYTMTEILKERWGHDGFIVSDWGAIEQL--KNQGLAAN 308
Query: 327 KEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
K++A AGL++D + Y + V++GK+ +D+S++ + V RLG F+
Sbjct: 309 KKEAAVYAFNAGLEMDMMSHAYDRYMKELVEEGKITMAQVDESVRRVLRVKFRLGLFERP 368
Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
V+ K+ +++++AA+ A E +VLLKN+ LPL K +AVVGP A
Sbjct: 369 YTPVTNEKERFFRPQSMDIAAQLAAESMVLLKNENGILPLTDK--KKIAVVGPMAKNGWD 426
Query: 446 MIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
++G++ G + Y F G A + Y GC + A EAA+ +D
Sbjct: 427 LLGSWCGHGKDTDVAMLYNGLATEFVGKAELRYALGC-STQGDNRKGFEEALEAARWSDV 485
Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
++ G ++ E+ R + LP Q +L ++ + K P++LV+++ +++ E
Sbjct: 486 VVLCLGEMMTWSGENASRSSIALPQIQEELAKELKKAGK-PIVLVLVNGRPLELNRLEPI 544
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---R 614
++ AIL PG G +A ++ G+ NP G+L +T+ P ++ +P+ R
Sbjct: 545 SD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMTF---------PYSTGQIPIYYNR 593
Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
G+ G YK LY FG+GLSYT+FKY ++ + T KL
Sbjct: 594 RKSGRGHQG-FYKDITSEPLYSFGHGLSYTEFKYGTVTPSVTTVKRGGKLS--------- 643
Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
+V N G DG + V + P +K++ F
Sbjct: 644 ----------------------VEVSVSNTGKRDGLETVHWFISDPYCSITRPVKELKHF 681
Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
++ ++AG K +F + + V+ L GE+ I V + V +
Sbjct: 682 EKQLIKAGETKVFRFDVDLERDFGFVNGNGKRFLEIGEYYIQVKDQKVKIDL 733
>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
Length = 814
Score = 255 bits (652), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 240/853 (28%), Positives = 360/853 (42%), Gaps = 147/853 (17%)
Query: 8 LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
L+CF + +F A + G F L + + S P RV+
Sbjct: 5 LICFLMLSVFFIFPVRAKNTFGKKKDKVTRL--HFYDLNKNGRMDTYENPSAPVEYRVEH 62
Query: 68 LVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT-- 113
L+S+MTL+EKV Q+ G+ P+L E+ +L G P T
Sbjct: 63 LLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIGSLWGFMRADPWTQR 122
Query: 114 ------------------------HFDDVIP--------------GATSFPTVILTTASF 135
H IP G T FPT I +++
Sbjct: 123 TLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIGTTVFPTSIGQASTW 182
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
N L +++G+ ++ EA A + + P +++ARDPRW R+ ET GEDP++ G
Sbjct: 183 NPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMG 237
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEE 255
VRG Q E D S V + KH+A+Y W A + E+++EE
Sbjct: 238 TALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGHNGGTAHIGERELEE 286
Query: 256 TFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQV 315
PF V G A SVM SYN ++G P LL ++ W G++V+D ++
Sbjct: 287 AIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGG 345
Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
+ ++ +A + +A + + AG+D D G Y AV++G V IDK+++ + +
Sbjct: 346 LREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILS 403
Query: 375 VLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
+ ++G FD Q + S E+ LA E AR+ IVLLKN LPL ++T+A
Sbjct: 404 LKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKK-DIRTLA 462
Query: 435 VVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIF 488
V+GP+A+ M+G+Y G + I S V Y GC V S
Sbjct: 463 VIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGC-AVRDSSRTGFK 521
Query: 489 AASEAAKTADATIILAG----LDLSVE-------------------AESLDREDLWLPGY 525
A E A+ ADA +++ G D S E E DR L L G
Sbjct: 522 DAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGR 581
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +L+ +++ + K PV+LV++ G + +AI+ A YPG +GG A+ADV+FG
Sbjct: 582 QLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYPGMQGGNAVADVLFG 638
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
+NP GRL ++ V LP+ R G R Y G YPFGYGLSYT
Sbjct: 639 DYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPGTPRYPFGYGLSYTT 691
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F Y + +QV +D R D V QN G
Sbjct: 692 FSYTDMK----VQVTEGS-----------------------DDCRVD----VTVTIQNQG 720
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
+ DG +V +Y + T KQ+ F R+ ++AG ++ + F + KSL +
Sbjct: 721 TADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGE 779
Query: 766 TLLPAGEHTIFVG 778
++ G TI VG
Sbjct: 780 WVVEPGRFTIMVG 792
>gi|285808617|gb|ADC36136.1| glycoside hydrolase family 3 protein [uncultured bacterium 253]
Length = 752
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 225/751 (29%), Positives = 345/751 (45%), Gaps = 97/751 (12%)
Query: 64 RVKDLVSRMTLDEKVQQL--------GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF 115
++ L+ RMTL EK+ QL G F P L + + G N H
Sbjct: 35 KIDALLKRMTLAEKLGQLQQLDGEGNGSFRPEHPDLIRKGLLGSTLNVRGAKNTNQLQHV 94
Query: 116 D--------------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
DVI G T FP + +S++ + ++ + EARA A
Sbjct: 95 AMDESRLKIPVLFGFDVIHGYRTIFPIPLAEASSWDPTSAERSTSIAAREARA------A 148
Query: 161 GLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
G+ + ++P +++ARDPRWGRITE GED F+ +A VRG Q TD S P
Sbjct: 149 GVRWTFAPMLDIARDPRWGRITEGAGEDQFLGAAFARARVRGFQ-------GTDY-SAPD 200
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
K+ +C KH+ AY G D D ++E + E + PF+ V G +VM +N
Sbjct: 201 KMLACAKHWVAYGATE-GGRDYNTTD--MSENTLREIYFPPFKAAVDAG-VGTVMSGFND 256
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
+NG+P A+ L + +RGEW G++V+D S++ ++ NH LA +DA L AG+
Sbjct: 257 LNGVPVSANHFTLTEVLRGEWKFDGFVVSDYTSVKELI-NHG-LAFGDQDAARLALNAGV 314
Query: 340 DLDCGQYYTNFTG-NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICS 398
D++ N G +++GKV ID++++ + + RLG F + + +
Sbjct: 315 DMEMVSRLFNQQGPQLLKEGKVSPATIDEAVRRILRIKFRLGLFANPYADEARETTSLLT 374
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCR 456
EN A A +VLLKN+ TLPL S ++++AV+GP A+ A +G ++G P
Sbjct: 375 SENRAAARALADRSMVLLKNEGGTLPL-SKGIRSIAVIGPLADDHRAPLGWWSGDGKPED 433
Query: 457 YMSPIAGF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
++P+ G S V Y GCD V S I A A+ ++ I+ G +
Sbjct: 434 TVTPLMGIRAKVSPATKVNYAKGCD-VQGDSTGDIAEAVAVARESELAIVFVGESAEMVG 492
Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
E+ + L L G Q L+ V K P I+V+++ + + + NT A+L A G
Sbjct: 493 EAASKSSLDLTGCQMDLVKAVQATGK-PTIVVLINGRPLTVGWIFDNT--PAVLEAWMGG 549
Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPL---RPVDSLGYPGRTYKFY 629
E G AIADV+FG NPGG+LP+TW V +P+ + RP ++ T K+
Sbjct: 550 TEAGNAIADVLFGDANPGGKLPVTWPR--TVGQVPIYYNHMNTGRPPEANNR--YTSKYL 605
Query: 630 NGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
+ P + FGYGLSYTQFK L + A + G L
Sbjct: 606 DVPWTPQFCFGYGLSYTQFKITNLQLS---------------------APRISATGKLTA 644
Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
V+ +NVG G +VV +Y A +K++ GFQR+ ++ G KR+
Sbjct: 645 S----------VEVENVGKRAGDEVVQLYIHDVAASMTRPVKELKGFQRITLQPGEKKRV 694
Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+FV + + L + GE + VG
Sbjct: 695 EFVLTS-EELGFWNREMRFAAEPGEFKVMVG 724
>gi|294146775|ref|YP_003559441.1| beta-glucosidase [Sphingobium japonicum UT26S]
gi|292677192|dbj|BAI98709.1| beta-glucosidase [Sphingobium japonicum UT26S]
Length = 791
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 220/737 (29%), Positives = 340/737 (46%), Gaps = 109/737 (14%)
Query: 78 VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNE 137
V L +A RLG+P + E LHG + VG ATSFP I +S++
Sbjct: 125 VNALQRWATTQTRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDP 172
Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
L +++ ++ E R+ R SP +++ARDPRWGRI ET GEDP++VG V
Sbjct: 173 DLLREVNAVIAREIRS-----RGVSLVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVA 227
Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEET 256
V GLQ G + L P KV + KH + ++ V A V+E+++ E
Sbjct: 228 AVEGLQ---GKGRSRLLP--PGKVFATLKHLTGHGQPESGTNVG----PAPVSERELREN 278
Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
F PFE VK +VM SYN ++G+PS A+ LL +RGEW G +V+D ++ +
Sbjct: 279 FFPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLRDVLRGEWGFRGAVVSDYSAVDQL 338
Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
+ H AD E A + L AG+D D G Y G V++GK+ E +D++++++
Sbjct: 339 MSIHHVAAD-LEQAAGRALDAGVDADLPDGLSYATL-GRQVREGKIGEALVDRAVRHMLE 396
Query: 375 VLMRLGFFDGSPQYVSLGKQDICSD-ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
+ R G F+ +P + + I +D LA +AA+ I+LLKND LPL ++
Sbjct: 397 LKFRAGLFE-NPYADAAASEKITNDARARALALKAAQRSIILLKND-GMLPLKPE--GSI 452
Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG----YANVTYKTGCD---------DVA 480
AV+GP +A VA +G Y G P +S + G A + + G D
Sbjct: 453 AVIGP--SAAVARLGGYYGQPPHSVSILEGIRAKVGNRAKIVFAQGVRITENDDWWADKV 510
Query: 481 CKSNNS-----IFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQL 529
+S+ + I A EAA+ D ++ G E DR L L G Q +L
Sbjct: 511 TRSDPAENRRLIAQAVEAARHVDRIVLTLGDTEQSSREGWADNHLGDRPSLDLVGEQQEL 570
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ + + K P+ +V+++ G + + + AIL Y GE+GG A+ADV+FG NP
Sbjct: 571 FDALKALGK-PIAVVLIN--GRPASTVKVSEQADAILEGWYLGEQGGHAVADVLFGDVNP 627
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFK 647
GG+LP+T +P ++ L P+ P R Y F LYPFG+GLSYT F
Sbjct: 628 GGKLPVT---------IPRSAGQL-PMFYNVKPSARRGYLFDTTDPLYPFGFGLSYTSFD 677
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
+ P + + VD +N G
Sbjct: 678 LS-------------------------------APRLSAAKISVGGMTRVSVDVRNSGRR 706
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
+G +VV +Y + IK++ GFQRV ++ G + + F ++L + + + +
Sbjct: 707 EGDEVVQLYVRDKVGSVTRPIKELKGFQRVTLKPGEVRTVTFTI-GPEALQMWNDHMDRV 765
Query: 768 LPAGEHTIFVGNGGVSF 784
+ G+ I GN V+
Sbjct: 766 VEPGDFEIMTGNSSVAL 782
>gi|110640149|ref|YP_680359.1| b-glucosidase [Cytophaga hutchinsonii ATCC 33406]
gi|110282830|gb|ABG61016.1| candidate b-glucosidase, Glycoside Hydrolase Family 3 protein
[Cytophaga hutchinsonii ATCC 33406]
Length = 745
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 191/659 (28%), Positives = 311/659 (47%), Gaps = 84/659 (12%)
Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDP 175
DVI G T FP + AS++ L +K + E+ + R ++P +++ RD
Sbjct: 103 DVIHGYKTIFPIPLGLAASWDSVLVEKTAMIAAQESYS-----RCINWTFAPMVDICRDA 157
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RWGRI E+PGEDP++ A Y+ G Q G+ A +P ++ +C KH+AAY
Sbjct: 158 RWGRIAESPGEDPYLASVLARAYINGFQ---GNNPA-----QPGRILACSKHFAAYGAAE 209
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
R + ++ + +L+PF V+ G A++ M S+N +NG+P+ + LL
Sbjct: 210 G---GRDYNTVSMSRSTLWNMYLKPFHASVQAG-AATFMTSFNDLNGVPASGNAYLLKDV 265
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNA 354
+R +W G++V+D +S+ M+ H + D K DA + AGLD++ Q Y +
Sbjct: 266 LRNQWKFPGFVVSDWNSVTEMI-THGYCTDEK-DAALKAFSAGLDMEMTSQAYAHHLKTL 323
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
+ + K+ E +D+ +K + + + G F+ +P + K + + LA ++A + V
Sbjct: 324 IAEKKITEQQLDELVKNILRIKLYAGIFE-NPYFKEKEKFTLLDSAALTLAKKSAVKSFV 382
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFS---GYAN 469
LLKN NTLPL A K +AV+GP A A +G + G +P+A G N
Sbjct: 383 LLKNHNNTLPL--AATKKIAVIGPLAEAPKEQLGTWIFDGDKTNSQTPLAALKKMYGAEN 440
Query: 470 VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQL 529
+ Y G +S++ AA +AAK +D + AG + + E+ R D+ LPG Q +L
Sbjct: 441 IKYVQGLTHSRDESHDDFNAAYKAAKKSDVVLFFAGEEAILSGEAHSRADIRLPGAQERL 500
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
I ++ + K P++LVIM+ G I N+ A++ A +PG G A+ADV+ GK N
Sbjct: 501 IRKLHKAGK-PIVLVIMA--GRPITIEHILPNVSAVVMAWHPGTMAGPALADVLSGKENF 557
Query: 590 GGRLPITWYNGDYVQMLPL---TSMPLRPVDSLGYPG---------------RTYKFYNG 631
GRLP+TW V +P+ + RP DS+ + G ++ G
Sbjct: 558 SGRLPVTW--PKTVGQIPIYYNHTNTGRPADSVSFVGIKDIPIEAWQSSLGNNSHYLDAG 615
Query: 632 PT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
T YPFGYGLSYT+F C N + N L
Sbjct: 616 YTPQYPFGYGLSYTKFV-------------------CTN------------SSIEKNTLT 644
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
D + N GS G + + +Y + ++++ F +V ++AG K ++F
Sbjct: 645 VKDSLIVTLSVSNAGSRSGIETIQLYVQDVTASLVRPVRELKAFAQVELKAGETKTVRF 703
>gi|340616356|ref|YP_004734809.1| xylosidase/arabinosidase [Zobellia galactanivorans]
gi|339731153|emb|CAZ94417.1| Xylosidase/arabinosidase, family GH3 [Zobellia galactanivorans]
Length = 738
Score = 255 bits (651), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 221/790 (27%), Positives = 367/790 (46%), Gaps = 130/790 (16%)
Query: 42 FSKLGLQMSSFLFC-DSSLPYSIRVKDLVSRMTLDEKVQQLG-DFAHGVPRLGLPQYEWW 99
F+ L L M + F D + P +++ L+S+M+L+EKV QL + + RLG+P
Sbjct: 11 FTLLALVMFNMGFAQDKARPSDKKIEKLISKMSLEEKVHQLATQYPNANMRLGIPNLSA- 69
Query: 100 SEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR 159
+E LHG+ + AT FP I ++++ L +++G V+ E+RA
Sbjct: 70 NECLHGIK-----------MDSATVFPQAIAMASTWDTELIERMGHTVAKESRAF----- 113
Query: 160 AGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
G+ ++P + V RD RWGR E+ GEDP++VG+ +Y+RGLQ + G E + +
Sbjct: 114 -GIHQCYTPMLAVVRDVRWGRTEESYGEDPYLVGKIGSSYIRGLQGM-GAERFDENH--- 168
Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
+ + KH+ A D + G + D ++E ++ L PF M ++E + ++M +++
Sbjct: 169 --IMATAKHFVA-DGEPMAGDNGAAHD--ISEYTLQNVHLYPFRMAIEEAEVGAIMPAHH 223
Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
+NGIP A+ ++ +R EW G +V+D ++ + ++ D E A + L+AG
Sbjct: 224 LLNGIPCHANKHVMQTVLRDEWGWDGLVVSDNGDMRSLKRVFNYVPDY-EHAAKKGLEAG 282
Query: 339 LDLDCGQY--------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD------- 383
+ + + + ++ +AV + V +D ++K++ LG FD
Sbjct: 283 IHQELALFQGWSDHRMFGDYLISAVNKKIVPVALVDDAVKHVLQAKFDLGLFDTDIKNDE 342
Query: 384 ----------GSPQ--------------YVSLGKQD----ICSDENIELAAEAAREGIVL 415
G P YV + K+D + + +LA E A++ IVL
Sbjct: 343 RFDVLKNPDNGEPDKVSQHDAEMFKKALYVGIPKKDWKKTVFDQSHNDLALEVAQKSIVL 402
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-GIPCRYMSPIAGFSGY----ANV 470
LKN+ + LPL K K ++VVGP N +G Y+ P Y++ + G Y V
Sbjct: 403 LKNEGDLLPLKKEKYKKISVVGP--NGKAMRLGGYSPDNPKYYINIVEGIQNYLGSDREV 460
Query: 471 TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
++ GCD +N I A A+++D TI+ G E+ DR+DL LPG Q +L+
Sbjct: 461 AFEEGCDFTDSTAN--IPKAVALAESSDITIVAIGGSEETCRENEDRDDLSLPGPQQKLV 518
Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
+ K P ++V+++ + I + N+ +AI+ Y G+E G+AIA+++FGK NP
Sbjct: 519 EAIHATGK-PYVVVLLNGRPLSIEWIAENS--QAIVEGWYLGQETGKAIANILFGKVNPS 575
Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG--PTLYPFGYGLSYTQFKY 648
G+LPIT+ V +PL L GR + YN L+PFGYGLSYT F
Sbjct: 576 GKLPITFPRN--VGQVPLFYNKLE-------TGRPRQIYNSDPEPLFPFGYGLSYTSF-- 624
Query: 649 NLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTD 708
+L R N T + N+L + N G+
Sbjct: 625 --------------ELGEPRLSNET----------IAANELTT-----VNIPITNTGTRS 655
Query: 709 GSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL 768
G VV +Y K++ F+RV ++ G K+I A + L + T+
Sbjct: 656 GETVVQLYVHDVLSERVRPQKELRNFKRVALKPGETKQISIKIGA-QQLEYWNDGKWTIE 714
Query: 769 PAGEHTIFVG 778
P G+ I VG
Sbjct: 715 P-GQFDIMVG 723
>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
CL07T00C01]
gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
CL07T12C05]
gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
CL07T00C01]
gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
CL07T12C05]
Length = 805
Score = 255 bits (651), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 232/807 (28%), Positives = 348/807 (43%), Gaps = 145/807 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
+ + S P RV+ L+S+MTL+EKV Q+ G+ P+L E+
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIG 99
Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
+L G P T H IP G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
T FPT I +++N L +++G+ ++ EA A + + P +++ARDPRW R+
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ET GEDP++ G VRG Q E D S V + KH+A+Y W
Sbjct: 215 ETYGEDPYLNGAMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
A + E+++EE PF V G A SVM SYN ++G P LL ++ W
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
G++V+D ++ + ++ +A + +A + + AG+D D G Y AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
IDK+++ + ++ ++G FD Q + S E+ LA E AR+ IVLLKN
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKD 440
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
LPL ++T+AV+GP+A+ M+G+Y G + I S V Y
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499
Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
GC V S A E A+ ADA +++ G D S E
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E DR L L G Q +L+ +++ + K PV+LV++ G + +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G +GG A+ADV+FG +NP GRL ++ V LP+ R G R Y G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPG 668
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
YPFGYGLSYT F Y + +QV +D R
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK----VQVTEGS-----------------------DDCRV 701
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
D V QN G+ DG +V +Y + T KQ+ F R+ ++AG ++ + F
Sbjct: 702 D----VTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTL 757
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
+ KSL + ++ G TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|410096731|ref|ZP_11291716.1| hypothetical protein HMPREF1076_00894 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225348|gb|EKN18267.1| hypothetical protein HMPREF1076_00894 [Parabacteroides goldsteinii
CL02T12C30]
Length = 746
Score = 255 bits (651), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 216/762 (28%), Positives = 349/762 (45%), Gaps = 108/762 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFA--------HGVPR-------LGLPQYEWWSEALHGV-- 106
RV L+ +MTL EK+ Q+ + G+ R L L E ++A
Sbjct: 32 RVNALLGQMTLQEKIGQMNQLSPFGGLEEMAGLIREGNVGSLLNLTDPELVNKAQRIAVE 91
Query: 107 -SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
S +G P DVI G T FP + A+FN L + + + EA A G+
Sbjct: 92 ESRLGIPLLMSRDVIHGYKTIFPIPLGQAATFNPQLVEDGARVAAVEASA------DGIR 145
Query: 164 Y-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
+ ++P I+++RDPRWGRI E+ GEDP++ V V+G Q D + P V+
Sbjct: 146 WTFAPMIDISRDPRWGRIAESCGEDPYLSSVMGVAMVKGFQG--------DSLNNPTAVA 197
Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
+C KH+ Y R + + E+ + + PFE K G ++ M S+N +G
Sbjct: 198 ACAKHFVGYGASEG---GRDYNSTFIPERQLRNVYFPPFEAAAKAG-CATFMTSFNDNDG 253
Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
IPS + +L +RGEW+ G +V D S M+ +H F D KE A+ +++ AG++++
Sbjct: 254 IPSTGNSFILKDVLRGEWNYDGLVVTDWASSAEMI-SHGFCKDEKEAAM-KSVNAGINME 311
Query: 343 --CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
G + N V++ KV E ID++++ + + RLG FD Y +Q +
Sbjct: 312 MVSGTFIRNLE-ELVKEKKVSEAAIDEAVRNILRLKFRLGLFDNP--YTDTDQQVKYAPT 368
Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYM 458
++ A EAA + ++LLKND+ TLP K++T+AV+GP A+A +G + G
Sbjct: 369 HLAKAKEAAEQSVILLKNDRETLPFTD-KIRTLAVIGPLADAAHDQMGTWVFDGEKAHTQ 427
Query: 459 SPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
+ + + + Y+ G K I A AA ADA ++ AG + + E+
Sbjct: 428 TVLTALKEMYGDKVRIIYEPGLGYSRDKHTAGIAKAVNAAMHADAVLVCAGEESILSGEA 487
Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
DL L G Q++LI +A+ K P++ V+M+ G + + A+L+A +PG
Sbjct: 488 HSLADLHLQGAQSELIAALAKTGK-PLVTVVMA--GRPLTIGQEVEQSDAVLYAFHPGTM 544
Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSL-----GYPG 623
GG A+AD++FGK P G+ P+T+ V +P+ T P ++L G
Sbjct: 545 GGPALADLLFGKAVPSGKTPVTFPK--MVGQIPVYYAHNNTGRPASRQETLIDDIPQEAG 602
Query: 624 RT----YKFYNGP---TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
+T FY L+PFGYGLSYT F Y+ L
Sbjct: 603 QTSLGCTSFYMDAGFDPLFPFGYGLSYTTFGYDNLQLA---------------------- 640
Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
N L D E D N G +G+++V +Y + A +K++ GF+R
Sbjct: 641 ---------TNQLAVDGTLEISFDLTNTGKYEGTEIVQLYIQDKAGSITRPVKELKGFRR 691
Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ ++ G K + F + L + ++ GE ++VG
Sbjct: 692 IPLKQGETKTVSFSL-PVEELAFWNIDRQRVVEPGEFNLWVG 732
>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
Length = 862
Score = 255 bits (651), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 164/464 (35%), Positives = 240/464 (51%), Gaps = 41/464 (8%)
Query: 45 LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
+G+ + D L + R KDL SR+TL+EK + D + +PRLG+ + WWSEALH
Sbjct: 16 VGVNAQQSPYQDPGLSFEARAKDLCSRLTLEEKASLMCDVSPAIPRLGIKPFNWWSEALH 75
Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA---- 160
G +N G DV T FP I ASFN ++ ++ A S EAR YN A
Sbjct: 76 GYANNG------DV----TVFPEPIGMAASFNPTMVYQVFTATSDEARGKYNQSMAEGKE 125
Query: 161 -----GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
L+ W+PN+N+ RDPRWGR ET GEDP++ V V+GLQ E +
Sbjct: 126 DTRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVEVVKGLQGPE--------S 177
Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFD-ARVTEQDMEETFLRPFEMCVKEGDASSVM 274
++ K+ +C KH+A + + R+ + A ++ +D+ ET+L F+ V++ VM
Sbjct: 178 TKYRKLYACAKHFAVHSGPEYT---RHTANLADISPRDLWETYLPAFKATVQQAGVREVM 234
Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
C+Y R++ P C + +LL Q +R EW +V+DC +I NH +D+ A T
Sbjct: 235 CAYQRLDDEPCCGNSRLLQQILRDEWGFRHMVVSDCGAIADFYTNHHVSSDAVHAAAKGT 294
Query: 335 LKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
L AG D++CG Y AV++G V E ++DK + L LG D P+ VS K
Sbjct: 295 L-AGTDVECGFGYAYMKLPEAVRRGLVSEAEVDKHVIRLLKGRFELGVMD-DPKLVSWTK 352
Query: 394 ---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
+ + SD + +LA AR+ + LL+N N LPL AK + +AVVGP+A + GNY
Sbjct: 353 ISPKVVDSDAHRQLALNMARQTMTLLQNRNNVLPL--AKGEKIAVVGPNAADGPMLWGNY 410
Query: 451 AGIPCRYMSPIAGFSGYA--NVTYKTGCDDVACKSNNSIFAASE 492
G P R + + G A ++ Y GCD V S+ A E
Sbjct: 411 NGTPSRTTTILEGIRAKAGKDIPYLQGCDLVNKNVLTSLLAECE 454
Score = 109 bits (273), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 80/280 (28%), Positives = 129/280 (46%), Gaps = 50/280 (17%)
Query: 503 LAGLDLSVEAESL---DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
L G ++ V E DR + LP Q + + K +V ++ G IA
Sbjct: 613 LEGEEMPVHVEGFKGGDRTSIELPAVQRDFLKALKAAGK---TVVFVNCSGSAIALTPEV 669
Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
+ AIL A Y GEEGGRA+ADV++G +NPGG+LP+T+Y ++ L D
Sbjct: 670 ESCDAILQAWYAGEEGGRAVADVLYGDYNPGGKLPVTFYR---------STTQLPAFDDY 720
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
GRTY++++ L+PFGYGLSYT+F S +
Sbjct: 721 SMKGRTYRYFSD-ALFPFGYGLSYTRFAIGKGSLSAPA---------------------- 757
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
++ D V NVG G +VV VY + + A +K + F+RV +
Sbjct: 758 ---------MKADGKVTLTVPVSNVGKRTGDEVVQVYVRDVND-ADGPLKSLKAFRRVSL 807
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVG 778
+AG ++++ A ++ ++ D A+NT+ G++ ++ G
Sbjct: 808 KAGESRKVTIPLTA-ETFSLFDSASNTVRTKPGKYVVYYG 846
>gi|53714352|ref|YP_100344.1| beta-glucosidase [Bacteroides fragilis YCH46]
gi|52217217|dbj|BAD49810.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
Length = 859
Score = 254 bits (650), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 214/769 (27%), Positives = 338/769 (43%), Gaps = 146/769 (18%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
++F + ++SLP +RV+DL+SRMTL+EK+ Q+
Sbjct: 22 TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81
Query: 84 -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
F G+ RLG+P + +E+LHG V G
Sbjct: 82 GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T FP I ++FN L ++ A++ E A G+T +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKELTAQ------GITQSLTPVIDVCRDLRWGRV 183
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GEDP++V R V+ VRG D + VS KH+ A+ G++
Sbjct: 184 EECFGEDPYLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
++++ +L+ FE VKE +VM SYN N P+ + L+ + +R W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
D GY+ +D +I ++ HK +S E A+ Q L AGLD + V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID+++ + T +G F+ + + + ++ LA + A E IVLL+N+
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
N LPL K+K++AV+GP NA G+Y G+ + + G + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
GC D+ + A + AK +D I++ G + A E D DL L
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q L+ + K PVI+V++S G +A + NI I+ YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPLAMSWIKENIPGIVVQWYPGEQGGLALADML 577
Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
GK NP G+L ++ Y LP R S PG+ Y F + L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
+GLSYT F+Y LS T + + D C+D E
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+ +N G DG +V VY + ++++ GF++V ++ G K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715
>gi|365121891|ref|ZP_09338802.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
6_1_58FAA_CT1]
gi|363644131|gb|EHL83433.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
6_1_58FAA_CT1]
Length = 855
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 167/431 (38%), Positives = 236/431 (54%), Gaps = 41/431 (9%)
Query: 40 GRFSKLGL--QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYE 97
G F L L Q L+ D + P RV DL+SRMT++EKV + A G+PRL + +Y
Sbjct: 16 GLFMALTLHAQNEQPLYKDMNAPIHDRVMDLLSRMTVEEKVSLMIHNAPGIPRLEIDKYY 75
Query: 98 WWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN- 156
+EALHG+ V PG T FP I AS+N L KI A+S EAR +N
Sbjct: 76 HGNEALHGI--VRPGKF--------TVFPQAIGMAASWNPELIYKISTAISDEARGKWNA 125
Query: 157 --LGRAGL-------TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
LG+ L ++WSP +N+ARDPRWGR ET GEDP + G +V+GLQ G
Sbjct: 126 LGLGKKQLDGSSDLLSFWSPTVNMARDPRWGRTPETYGEDPHLTGTLGCAFVKGLQ---G 182
Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
+ + + LK + KH+AA + ++ +R H +A ++E+D+ E +L FE C+ E
Sbjct: 183 N------HPKYLKAVATPKHFAANNEEH----NRAHCNAVISERDLREYYLPSFEKCIVE 232
Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
G A S+M +YN VNGIP + L+ + +R +W GY+V DC + MV HK++ D +
Sbjct: 233 GKAQSIMTAYNAVNGIPCTVNTYLIKKVLREDWGFQGYVVTDCSAPAWMVTQHKYVKDYE 292
Query: 328 EDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
AV KAG D++C YT NA +V + DID +L M LG FD
Sbjct: 293 TAAVLMA-KAGSDMECADNVYTQPLLNAYYNYRVSDADIDSIAYHLLRGRMLLGLFDDPE 351
Query: 387 Q--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
+ Y + + + E+ ELA E AR+ +VLLKN+ N LP+N K+K++AVVG NA
Sbjct: 352 KNPYNKISPEKVGCKEHQELALETARQSLVLLKNENNFLPINPKKIKSIAVVG--INADR 409
Query: 445 AMIGNYAGIPC 455
G+Y+G P
Sbjct: 410 CEFGDYSGTPV 420
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 107/342 (31%), Positives = 158/342 (46%), Gaps = 57/342 (16%)
Query: 456 RYMSPIAGFSG----YANVTYKTGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSV 510
RY + F G +A + +K D+ + ++F A +AAK D T+ + G+D S+
Sbjct: 562 RYKIKVEYFDGGGDCFARLYWK--APDLDSRDRINLFGEAGKAAKECDITVAVLGIDKSI 619
Query: 511 EAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
E E DR L LP Q + I ++ ++ P +V++ AG IA + NI AI+ A Y
Sbjct: 620 EREGQDRYTLELPADQQEFIREIYKI--NPKTVVVLVAGS-SIAINWIDENIPAIIDAWY 676
Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP-GRTYKFY 629
PGE+GG A+A+ +FGK+NPGGRLP+T+YN + +P P D GRTY+++
Sbjct: 677 PGEQGGTAVAEALFGKYNPGGRLPLTFYNS-------MDELP--PFDDYAVKKGRTYQYF 727
Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
G LY FGYGLSYT+F Y R LN S
Sbjct: 728 TGKPLYEFGYGLSYTKFNY-------------------RKLNIASK-------------- 754
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
D + N G DG +V VY + P IKQ+ GF+RV ++ G+ + +
Sbjct: 755 --QDTINIQFSISNTGKYDGDEVAQVYVQYPETGTYMPIKQLKGFKRVHIKKGQTQNVSI 812
Query: 750 VFNACKSLNIVDYAANTLL-PAGEHTIFVGNGGVSFPIHLNF 790
K L D + P+G + VG+ + F
Sbjct: 813 SIPK-KELRYWDEKTRKFVTPSGNYIFQVGSSSQRINLQKTF 853
>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
CL05T00C42]
gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
CL05T12C13]
gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
CL05T00C42]
gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
CL05T12C13]
Length = 805
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 232/807 (28%), Positives = 348/807 (43%), Gaps = 145/807 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
+ + S P RV+ L+S+MTL+EKV Q+ G+ P+L E+
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIG 99
Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
+L G P T H IP G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
T FPT I +++N L +++G+ ++ EA A + + P +++ARDPRW R+
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ET GEDP++ G VRG Q E D S V + KH+A+Y W
Sbjct: 215 ETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
A + E+++EE PF V G A SVM SYN ++G P LL ++ W
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
G++V+D ++ + ++ +A + +A + + AG+D D G Y AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
IDK+++ + ++ ++G FD Q + S E+ LA E AR+ IVLLKN
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKD 440
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
LPL ++T+AV+GP+A+ M+G+Y G + I S V Y
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499
Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
GC V S A E A+ ADA +++ G D S E
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E DR L L G Q +L+ +++ + K PV+LV++ G + +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G +GG A+ADV+FG +NP GRL ++ V LP+ R G R Y G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPG 668
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
YPFGYGLSYT F Y + +QV +D R
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK----VQVTEGS-----------------------DDCRV 701
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
D V QN G+ DG +V +Y + T KQ+ F R+ ++AG ++ + F
Sbjct: 702 D----VTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTL 757
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
+ KSL + ++ G TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|255690204|ref|ZP_05413879.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
gi|260624223|gb|EEX47094.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 954
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 223/749 (29%), Positives = 350/749 (46%), Gaps = 109/749 (14%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
+ D+SLP RV+ L++ MT +K++ + G G+P L +P EA+HG S
Sbjct: 170 YMDASLPVDERVESLLAAMTPADKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227
Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
GAT FP + A++N L +++ A+ E + N +A WSP ++V
Sbjct: 228 ---------GATIFPQALAMGATWNRQLTEEVAMAIGDET-VIANTKQA----WSPVLDV 273
Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
A+D RWGR ET GEDP +V + +++G Q + L + P KH+ +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQMGGAWIKGYQ-------SKGLFTTP-------KHFGGH 319
Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
G D + D ++E++M E L PF ++ D S+M +Y+ GIP +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSDYMGIPIAKSTEL 376
Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
L + +R EW +G+IV+DC +I + + A K +A Q L AG+ +CG Y N
Sbjct: 377 LQRILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKE 436
Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGK--QDICSDENIELAAE 407
A + G++ ++D + + + R F+ +P + + K SD + +A
Sbjct: 437 VIQAAKDGRINMENLDNVCRTMLATMFRNELFEKNPCKPLDWNKIYPGWNSDSHKAMAHR 496
Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI--PCRYMSPIAGF- 464
AA E IV+L+N N LPL S +++T+AV+GP A+ G+Y P + S + G
Sbjct: 497 AACESIVMLENKDNLLPL-SKELRTIAVLGPGADDLQP--GDYTPKLQPGQLKSVLTGIK 553
Query: 465 ---SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE---------- 511
S V Y+ GCD + I A + A AD +++ G D S+
Sbjct: 554 AAVSKQTKVLYEKGCDFTETGMTD-IPKAVKTASQADVVVMVLG-DCSISEATKDVRKTC 611
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ D L LPG Q +L+ V K PVIL++ + D+ A + KAIL P
Sbjct: 612 GENNDLATLVLPGKQQELLEAVCATGK-PVILILQAGRPYDLLKA--SEMCKAILVNWLP 668
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G+EGG A ADV+FG +NPGGRLP+T+ +V LPL + GR Y++ +
Sbjct: 669 GQEGGPATADVLFGDYNPGGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 719
Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
LY FGYGLSYT F+Y+ L K+Q N N T +A+
Sbjct: 720 EYYPLYRFGYGLSYTSFEYSGL-----------KVQEKPNGNVTVEAT------------ 756
Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
+NVG G +V +Y T + ++ F R+ + G +K + F
Sbjct: 757 -----------VKNVGGRAGDEVAQLYVTDMYASVKTRVMELKDFARIHLNPGESKTVSF 805
Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
L++++ + ++ GE I VG
Sbjct: 806 ELTPY-DLSLLNDHMDRVVEKGEFKICVG 833
>gi|119476117|ref|ZP_01616469.1| periplasmic beta-glucosidase [marine gamma proteobacterium
HTCC2143]
gi|119450744|gb|EAW31978.1| periplasmic beta-glucosidase [marine gamma proteobacterium
HTCC2143]
Length = 748
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 216/766 (28%), Positives = 358/766 (46%), Gaps = 111/766 (14%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVP---------RLGLPQY-------------EWWSE 101
RV+ L+++MTL EK+ Q+ AHG L L Q E
Sbjct: 20 RVEILLAKMTLAEKIGQMAQ-AHGSEDGVSDDQRRALELGQLGSVLNIVSIDVICELQRI 78
Query: 102 ALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
AL P DVI G T FP + AS+N L ++ + + EA +
Sbjct: 79 ALEDSRLGIPLLIGRDVIHGYKTIFPIPLGQAASWNPELIEQGARVAALEAATV------ 132
Query: 161 GLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
G+ + ++P I++ RDPRWGRI E+ GEDP++ G VRG Q DL++
Sbjct: 133 GVNWTFAPMIDITRDPRWGRIAESLGEDPYLCGELGAAMVRGFQ-------GKDLSAIG- 184
Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
+++C KH+A Y GVD + A + E ++ +L PF+ + G +S M ++N
Sbjct: 185 SIAACAKHFAGYGAAE-GGVD--YNTAIIAENELRNVYLPPFKAALDSG-VASFMTAFND 240
Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
+NG+P+ + LL Q +R EW G +V+D +SI V + H F A+ KE A + AG+
Sbjct: 241 LNGVPASGNEFLLKQILREEWCYQGMVVSDWESI-VQLTEHGFTANDKEAAF-EAANAGI 298
Query: 340 DLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDIC 397
D++ Y+ + + +G++ +D+ +K + + RLG F+ PQ L +
Sbjct: 299 DMEMVSNTYSQHLESLIIEGRISLAQVDEMVKNILRLKFRLGLFENPYPQPDKLPA--LV 356
Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY-----AG 452
+ ++ + A + A E +VLLKN +LPL + + ++A++GP A+ +G + A
Sbjct: 357 NHDHRQAAKKLALESVVLLKNSHQSLPLRLSALSSIALIGPLADDAYEQLGTWIFDGDAD 416
Query: 453 IPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
+ I F+G + V + + I AA+++DA ++ G + +
Sbjct: 417 DSETVLQAINAFAGDSLTVNVDRALETTRSNTFIDIDRTMAAAQSSDAIVLCLGEESILS 476
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E+ R D+ LPG Q QLI+ +A+ AK P+IL++M+ G + ++ AIL+A +P
Sbjct: 477 GEAHSRADISLPGAQEQLIHLLAKTAK-PMILIVMA--GRPLTLEPIIDHVDAILYAWHP 533
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITW---------YNGDYVQMLPLTS---------MPL 613
G G A+ D++FG+ +P G+LPIT+ Y G P ++ P
Sbjct: 534 GTMAGTALTDLLFGEVSPSGKLPITFPRMVGQVPIYYGKKNTGKPPSAESVVHMNDIAPR 593
Query: 614 RPVDSLGYPGRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
SLG + G T L+PFG+GLSYT F Y NL+
Sbjct: 594 AAQTSLGM--SAFHLDAGFTPLFPFGFGLSYTSFTY-------------------ENLHL 632
Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
+S + + D VD N G +G +VV +Y++ A +K++
Sbjct: 633 SS------------STMNIDGVITVTVDVINCGEREGQEVVQLYTRDLAANVTRPVKELK 680
Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
FQ+V + AG +++KF+ A +L D N ++ G ++ G
Sbjct: 681 QFQKVHLSAGERQQVKFLLKAS-ALAFYDRKMNRIIEPGVFHLWTG 725
>gi|60682370|ref|YP_212514.1| hydrolase [Bacteroides fragilis NCTC 9343]
gi|60493804|emb|CAH08594.1| putative exported hydrolase [Bacteroides fragilis NCTC 9343]
Length = 859
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 215/769 (27%), Positives = 336/769 (43%), Gaps = 146/769 (18%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
++F + ++SLP +RV+DL+SRMTL+EK+ Q+
Sbjct: 22 TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81
Query: 84 -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
F G+ RLG+P + +E+LHG V G
Sbjct: 82 GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T FP I ++FN L ++ A++ E L G+T +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GEDPF+V R V+ VRG D + VS KH+ A+ G++
Sbjct: 184 EECFGEDPFLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
++++ +L+ FE VKE +VM SYN N P+ + L+ + +R W
Sbjct: 229 LA--SVLCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
D GY+ +D +I ++ HK +S E A+ Q L AGLD + V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID+++ + T +G F+ + + + ++ LA + A E IVLL+N
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNKN 405
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
N LPL K+K++AV+GP NA G+Y G+ + + G + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
GC D+ + A + AK +D I++ G + A E D DL L
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q L+ + K PVI+V++S G A + NI I+ YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADML 577
Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
GK NP G+L ++ Y LP R S PG+ Y F + L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
+GLSYT F+Y LS T + + D C+D E
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+ +N G DG +V VY + ++++ GF++V ++ G K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715
>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 864
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 156/440 (35%), Positives = 232/440 (52%), Gaps = 38/440 (8%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
F + D+SL R DL+ R+TL+EK + + + +PRL + Y WW+EALHG++
Sbjct: 25 EKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLART 84
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-------NLGR-AG 161
G AT FP I ASF++SL ++ AVS EARA NL R
Sbjct: 85 GL----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQA 134
Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
LT W+PN+N+ RDPRWGR ET GEDP++ R V V GLQ + +R K+
Sbjct: 135 LTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPD--------TARYNKL 186
Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
+C KHYA + W +R+ F+A ++ +D+ ET+L F+ V+E VMC+YNR
Sbjct: 187 HACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRF 243
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-SKEDAVAQTLKAGL 339
G P C +LL Q +R EW G +V+DC ++ K A A + G
Sbjct: 244 EGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGT 303
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSD 399
D++CG Y + +AV+ G + E ID S+K L LG D + + + + S
Sbjct: 304 DVECGNSYKSLP-DAVKAGLITENQIDISVKRLLKARFELGEMDEN-VWTGISSDVVDSP 361
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
++ +LA + ARE + LL+N+ N LPL +K +A++GP+AN +V GNY G+P ++
Sbjct: 362 KHRQLALQMARETMTLLQNNNNILPL--SKQAKIALIGPNANDSVMQWGNYNGLPSHTIT 419
Query: 460 PIAGFSGY---ANVTYKTGC 476
+ G Y +N+ Y+ C
Sbjct: 420 LLEGMQRYLPTSNLIYEPVC 439
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 150/326 (46%), Gaps = 61/326 (18%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
F AN+++ D+ + + E K D I G+ ++E E +
Sbjct: 573 FDKTANLSF-----DMGVNAQIDVKGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFR 627
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR ++ LP Q +++ + K +V ++ G IA + N +AIL A YPG+
Sbjct: 628 GGDRTEIELPAVQRRVVEALKTAGKR---IVFVNFSGAAIALEPESLNCEAILQAWYPGQ 684
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A+A+V+FG +NP G+LP+T+Y L +P + GRTY++
Sbjct: 685 AGGQAVAEVLFGDYNPAGKLPLTFYRN-------LAQIP--DFEDYNMTGRTYRYMKETP 735
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFG+GLSYT FKY L ++N +K+ +NLN
Sbjct: 736 LFPFGHGLSYTTFKYGKL------KMNDDKIAAGQNLNLV-------------------- 769
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
+ N GS DG +VV VY K + +K + F+RV + AG+ +KF +
Sbjct: 770 -----IPVTNTGSRDGDEVVQVYLKKMDDTEGP-VKTLRAFKRVRIPAGKTVEVKFSLDD 823
Query: 754 CKSLNIVDYAANTL-LPAGEHTIFVG 778
+ L D +NT+ + G +T+ +G
Sbjct: 824 TQ-LEWWDEQSNTMRVCPGNYTVMIG 848
>gi|163849391|ref|YP_001637435.1| glycoside hydrolase family 3 [Chloroflexus aurantiacus J-10-fl]
gi|222527388|ref|YP_002571859.1| glycoside hydrolase family protein [Chloroflexus sp. Y-400-fl]
gi|163670680|gb|ABY37046.1| glycoside hydrolase family 3 domain protein [Chloroflexus
aurantiacus J-10-fl]
gi|222451267|gb|ACM55533.1| glycoside hydrolase family 3 domain protein [Chloroflexus sp.
Y-400-fl]
Length = 702
Score = 254 bits (650), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 211/732 (28%), Positives = 337/732 (46%), Gaps = 117/732 (15%)
Query: 64 RVKDLVSRMTLDEKVQQLGD-FAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFD------ 116
RV L+ +MTL+EK+ QL HG+P L L + ++ + G FD
Sbjct: 7 RVNTLLGQMTLEEKIGQLNQPMIHGLPGLDLLRQGKAGSIINAFGALS-GQGFDHLNSAE 65
Query: 117 ----------------------DVIPGA-TSFPTVILTTASFNESLWKKIGQAVSTEARA 153
D+I G T FP + ASFN SL ++I Q + EA A
Sbjct: 66 QCNALQRAALESRLGIPLLFGRDIIHGQRTVFPIPLAQAASFNPSLVEQINQIAAREASA 125
Query: 154 MYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
+ G+ + ++P +++ARD RWGRI E GEDP + R A VRG Q
Sbjct: 126 L------GIRWTFAPMLDIARDARWGRIAEGYGEDPLLTSRMAAAAVRGFQG-------- 171
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
D S+P ++ +C KHY Y R + A ++E + + +L PF V G +
Sbjct: 172 DDVSQPDRLVACAKHYVGYGAAEG---GRDYEQAEISEPTLRDVYLPPFRAAVAAG-VGT 227
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
+M ++ +NG+P+ A+ +LL +R EW G++V+D +S+ +V + +A+ + A A
Sbjct: 228 IMSAFLDLNGMPATANRRLLTDVLRNEWGFDGFVVSDWESVGELVQHG--IAEDRAHAAA 285
Query: 333 QTLKAGLDLD--CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS 390
L+AG+D+D G Y N V+ G+V +ID++++ + + R G F+
Sbjct: 286 LALRAGVDMDMVSGAYLETLAEN-VRCGRVTLAEIDEAVRRILRIKCRAGLFEHPLTDPE 344
Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
DI + + ELA +AARE +VLLKN+++ LPL + + V GP +AT + G +
Sbjct: 345 RAIHDILTPKARELARQAARETMVLLKNERHLLPLRD--FRRILVAGPFVHATGELFGTW 402
Query: 451 AGIPCRYMSPIAGFSGYAN--VTYKTGCDDVACKSNNSIFAAS-----EAAKTADATIIL 503
G A V +A + FAA+ A ADA ++L
Sbjct: 403 T------------MDGRAEDAVPLDQAFQAIAPAGTDLWFAAAPDLALSRAHYADAVVLL 450
Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
G + E+ + DL LP Q + I +A + K PV+LV+ + G +A +
Sbjct: 451 VGEHPARSGENANVSDLGLPPGQLEWITAMAAIGK-PVVLVVFA--GRPLAITRAVAQAQ 507
Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPL-RPVDSLGYP 622
A+++A +PG EG A+A+++FG P GRLP++ L P RP+++ G P
Sbjct: 508 AVIYAWHPGLEGAAALAEILFGLATPTGRLPVSMPRTTGQAPLYYAHKPSGRPLEADG-P 566
Query: 623 GRTYKFYNGPT--LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
RT ++ + PT L+PFGYGLSYT F Y+ L + H R
Sbjct: 567 FRT-RYVDIPTAPLFPFGYGLSYTSFSYSDLRLSSA---------HMRG----------- 605
Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
E N G GS+VV +Y + ++++ FQR+ ++
Sbjct: 606 -------------TLEISALITNTGERTGSEVVQLYVRDLVGSLTRPVRELKDFQRITLQ 652
Query: 741 AGRNKRIKFVFN 752
G +R+ F+
Sbjct: 653 PGEARRVSFILR 664
>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
[Paenibacillus sp. HGF5]
Length = 765
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 198/711 (27%), Positives = 324/711 (45%), Gaps = 109/711 (15%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
E V + +A RLG+P E HG +G T FP + +++
Sbjct: 89 EAVNHIQRYAIEQSRLGIPIL-IGEECSHGHMAIG-----------GTVFPVPLSIGSTW 136
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
N L++ + +AV+ E R+ + G +SP ++V RDPRWGR E GEDP+++ YA
Sbjct: 137 NLDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYA 191
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDME 254
V V GLQ L+S P V++ KH+ Y + + H R ++
Sbjct: 192 VASVEGLQ-------GESLDS-PSSVAATLKHFVGYGSSEGGRNAGPVHMGTR----ELM 239
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
E + PF+ V+ G A+S+M +YN ++G+P + +LL+ +R EW G ++ DC +I
Sbjct: 240 EVDMLPFKKAVEAG-AASIMPAYNEIDGVPCTVNTELLDGILRKEWGFDGMVITDCGAID 298
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
++ H D DA Q ++AG+DL+ G+ + AV+ K++ + +D++++ +
Sbjct: 299 MLASGHDTAEDGM-DAAVQAIRAGIDLEMSGEMFGKHLQKAVESNKLEVSVLDEAVRRVL 357
Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
T+ +LG F+ + I S ++I LA + A EGIVLLKN+ LPL S + +
Sbjct: 358 TLKFKLGLFENPYVDPQTAENVIGSGQHIGLARQLAAEGIVLLKNEAKALPL-SKEGGVI 416
Query: 434 AVVGPHANATVAMIGNYAG--IPCRYMSPIAGFSGY-----ANVTYKTGCDDVACKSNNS 486
AV+GP+A+ +G+Y P + + G V Y GC + S
Sbjct: 417 AVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPGC-RIKDDSREG 475
Query: 487 IFAASEAAKTADATIILAG-----------LDLSVEA--------------ESLDREDLW 521
A A+ AD +++ G +DL A E +DR L
Sbjct: 476 FEFALSCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSDMDCGEGIDRMTLQ 535
Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
L G Q L ++ ++ K +++ I G IA + + AIL A YPG+EGG AIAD
Sbjct: 536 LSGVQLDLAQEIHKLGKRMIVVYI---NGRPIAEPWIDEHADAILEAWYPGQEGGHAIAD 592
Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
++FG NP G+L ++ +V LP+ R G+ Y + YPFGYGL
Sbjct: 593 ILFGDVNPSGKLTMSIPK--HVGQLPVYYNGKRS------RGKRYLEEDSQPRYPFGYGL 644
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
SYT+F Y+ + T + + D V+
Sbjct: 645 SYTEFSYSDIQMTPEV-------------------------------IGTDGTAVVSVNV 673
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
N G +GS+VV +Y A +++ GFQ++ ++ G ++++F
Sbjct: 674 TNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKISLQPGERRKVEFTIG 724
>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
DSM 14838]
Length = 864
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 156/440 (35%), Positives = 232/440 (52%), Gaps = 38/440 (8%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
F + D+SL R DL+ R+TL+EK + + + +PRL + Y WW+EALHG++
Sbjct: 25 EKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLART 84
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-------NLGR-AG 161
G AT FP I ASF++SL ++ AVS EARA NL R
Sbjct: 85 GL----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQA 134
Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
LT W+PN+N+ RDPRWGR ET GEDP++ R V V GLQ + +R K+
Sbjct: 135 LTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPD--------TARYNKL 186
Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
+C KHYA + W +R+ F+A ++ +D+ ET+L F+ V+E VMC+YNR
Sbjct: 187 HACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRF 243
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-SKEDAVAQTLKAGL 339
G P C +LL Q +R EW G +V+DC ++ K A A + G
Sbjct: 244 EGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGT 303
Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSD 399
D++CG Y + +AV+ G + E ID S+K L LG D + + + + S
Sbjct: 304 DVECGNSYKSLP-DAVKAGLITENQIDISVKRLLKARFELGEMDEN-VWTGISSDVVDSP 361
Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
++ +LA + ARE + LL+N+ N LPL +K +A++GP+AN +V GNY G+P ++
Sbjct: 362 KHRQLALQMARETMTLLQNNNNILPL--SKQAKIALIGPNANDSVMQWGNYNGLPSHTIT 419
Query: 460 PIAGFSGY---ANVTYKTGC 476
+ G Y +N+ Y+ C
Sbjct: 420 LLEGMQRYLPTSNLIYEPVC 439
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 150/326 (46%), Gaps = 61/326 (18%)
Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
F AN+++ D+ + + E K D I G+ ++E E +
Sbjct: 573 FDKTANLSF-----DMGVNAQIDVKGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFR 627
Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
DR ++ LP Q +++ + K +V ++ G IA + N +AIL A YPG+
Sbjct: 628 GGDRTEIELPAVQRRVVEALKTAGKR---IVFVNFSGAAIALEPESQNCEAILQAWYPGQ 684
Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
GG+A+A+V+FG +NP G+LP+T+Y L +P + GRTY++
Sbjct: 685 AGGQAVAEVLFGDYNPAGKLPLTFYRN-------LAQIP--DFEDYNMTGRTYRYMKETP 735
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
L+PFG+GLSYT FKY L ++N +K+ +NLN
Sbjct: 736 LFPFGHGLSYTTFKYGKL------KMNDDKIAAGQNLN---------------------- 767
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
+ N GS DG +VV VY K + +K + F+RV + AG+ +KF +
Sbjct: 768 ---LAIPVTNTGSRDGDEVVQVYLKKMDDTEGP-VKTLRAFKRVRIPAGKTVEVKFSLDD 823
Query: 754 CKSLNIVDYAANTL-LPAGEHTIFVG 778
+ L D +NT+ + G +T+ +G
Sbjct: 824 TQ-LEWWDEQSNTMRVCPGNYTVMIG 848
>gi|265766195|ref|ZP_06094236.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
gi|263253863|gb|EEZ25328.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
Length = 859
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 214/768 (27%), Positives = 336/768 (43%), Gaps = 144/768 (18%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
++F + ++SLP +RV+DL+SRMTL+EK+ Q+
Sbjct: 22 TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81
Query: 84 -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
F G+ RLG+P + +E+LHG V G
Sbjct: 82 GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T FP I ++FN L ++ A++ E A G+T +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKELTAQ------GITQSLTPVIDVCRDLRWGRV 183
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GEDP++V R V+ VRG D + VS KH+ A+ G++
Sbjct: 184 EECFGEDPYLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
++++ +L+ FE VKE +VM SYN N P+ + L+ + +R W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
D GY+ +D +I ++ HK +S E A+ Q L AGLD + V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID+++ + T +G F+ + + + ++ LA + A E IVLL+N+
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSGYANVTYK 473
N LPL K+K++AV+GP NA G+Y G+ + S + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL-LEALKERVSNQLTLNYA 462
Query: 474 TGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLPG 524
GC D+ + A + AK +D I++ G + A E D DL L G
Sbjct: 463 KGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTG 521
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q L+ + K PVI+V++S G A + NI I+ YPGE+GG A+AD++
Sbjct: 522 VQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADMLL 578
Query: 585 GKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
GK NP G+L ++ Y LP R S PG+ Y F + L+ FG+
Sbjct: 579 GKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGH 638
Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV 699
GLSYT F+Y LS T + + D C+D E +
Sbjct: 639 GLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVTI 667
Query: 700 DFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+N G DG +V VY + ++++ GF++V ++ G K++
Sbjct: 668 AIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715
>gi|423260853|ref|ZP_17241755.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
CL07T00C01]
gi|423266988|ref|ZP_17245970.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
CL07T12C05]
gi|387774614|gb|EIK36724.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
CL07T00C01]
gi|392697691|gb|EIY90874.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
CL07T12C05]
Length = 859
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 212/769 (27%), Positives = 335/769 (43%), Gaps = 146/769 (18%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
++F + ++SLP +RV+DL+SRMTL+EK+ Q+
Sbjct: 22 TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81
Query: 84 -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
F G+ RLG+P + +E+LHG V G
Sbjct: 82 GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T FP I ++FN L ++ A++ E L G+T +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GEDP++V R V+ VRG D + VS KH+ A+ G++
Sbjct: 184 EECFGEDPYLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
++++ +L+ FE VKE +VM SYN N P+ + L+ + +R W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
D GY+ +D +I ++ HK +S E A+ Q L AGLD + V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID+++ + T +G F+ + + + ++ LA + A E IVLL+N+
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
N LPL K+K++AV+GP NA G+Y G+ + + G + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
GC D+ + A + AK +D I++ G + A E D DL L
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q L+ + K PVI+V++S G A + NI I+ YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADML 577
Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
GK NP G+L ++ Y LP R S PG+ Y F + L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
+GLSYT F+Y + +K D C+D E
Sbjct: 638 HGLSYTDFEYLSATISK-------------------------------EDYACEDVIEVT 666
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+ +N G DG +V VY + ++++ GF++V ++ G K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715
>gi|404404031|ref|ZP_10995615.1| glycoside hydrolase family protein [Alistipes sp. JC136]
Length = 740
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 203/679 (29%), Positives = 330/679 (48%), Gaps = 80/679 (11%)
Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
DVI G T P + + S++ + + + EA A AGL + ++P +++ARD
Sbjct: 111 DVIHGYKTISPVPLAESCSWDMETIEASARMAAVEASA------AGLQWTFAPMVDIARD 164
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR+ E GEDP++ A VRG Q DL S P + +C KH+A Y
Sbjct: 165 PRWGRVMEGAGEDPYLGSHIARARVRGFQ-------GDDL-SAPNTILACAKHFAGYGAS 216
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
G D D +++Q + E +L PF+ A++ M S+N ++G+P+ + L+ Q
Sbjct: 217 E-GGRDYNTVD--ISDQRLRELYLPPFKAAADA-GAATFMNSFNELSGVPATGNRFLVKQ 272
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGN 353
+R EW G IV+D S+ M+ + +A+ K+ A +K D+D G Y +
Sbjct: 273 ILRNEWGWDGVIVSDWGSVAEMIPHG--IAEDKKQAALLAVKNECDIDMEGNCYPSSLEE 330
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGKQDICSDENIELAAEAARE 411
V++GKV E +ID+S++ + + LG FD +Y K+ S + E A + AR+
Sbjct: 331 LVKEGKVSEKEIDRSVRRILRLKYELGLFDDPYRYCDEQREKEVTLSAAHREAARDMARK 390
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYMSPIAGFSGYA- 468
IVLL+N ++ LPL K +++AVVGP A++ V M+G + G P ++ + G A
Sbjct: 391 SIVLLENRKSVLPL--GKPRSIAVVGPLADSPVDMLGEWRAKGDPKEVVTILRGIEKTAG 448
Query: 469 ---NVTYKTGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
VT+ GCD S+ S FA A AA++AD I G + E R +L LPG
Sbjct: 449 AGTRVTHAKGCD--VTGSDRSGFAEAVRAARSADVVIACLGESADMSGEGYCRSELGLPG 506
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q +L+ ++ + K P++L++ + G + A NI+ I+ + G E G A+ADV+F
Sbjct: 507 VQQELLKELKKTGK-PIVLLL--SNGRPLTLAWEKENIETIVETWFLGTEAGNAVADVLF 563
Query: 585 GKFNPGGRLPITW-YNGDYVQMLPLTSMPLRPVDSLGYPGRTY--KFYNGP--TLYPFGY 639
GK+NP G+L +++ YN + + RP + P + Y + + P LYPFGY
Sbjct: 564 GKYNPSGKLVMSFPYNVGQIPVYYNHKHTGRPFE----PNQRYVMHYIDAPVDALYPFGY 619
Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV 699
GLSYT+F+Y P + + + D V
Sbjct: 620 GLSYTRFEYGE-------------------------------PTLSSDRMAAGDTITATV 648
Query: 700 DFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNI 759
N G DG +VV +Y + +K++ GF+++F++ G + + F + L
Sbjct: 649 KVTNAGDYDGEEVVQLYIRDLKAQITRPVKELKGFRKIFLKKGESADVTFDITRAE-LEY 707
Query: 760 VDYAANTLLPAGEHTIFVG 778
V + + GE +F+G
Sbjct: 708 VLADGSVVSDPGEFELFIG 726
>gi|254514842|ref|ZP_05126903.1| periplasmic beta-glucosidase [gamma proteobacterium NOR5-3]
gi|219677085|gb|EED33450.1| periplasmic beta-glucosidase [gamma proteobacterium NOR5-3]
Length = 740
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 211/755 (27%), Positives = 343/755 (45%), Gaps = 114/755 (15%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG---VPR-------------- 90
+ + + D L RV +L+ M LDEK+ Q+ G +P
Sbjct: 4 ETAQTIAVDEQLSIDSRVAELLGSMGLDEKIGQMSQLQAGGGWIPDELADSIRRGQVGSV 63
Query: 91 LGLPQYEWWSEALHGV---SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQ 145
L P +E S +G P DVI G T FP + AS+N S+ + G
Sbjct: 64 LNEPDVNIVNELQRLAVEESRLGIPLLIGRDVIHGFKTIFPIPLGQAASWNPSV-VEAGA 122
Query: 146 AVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
VS E RAG+ + ++P I++ RDPRWGRI E+ GEDP++ + VRG Q
Sbjct: 123 RVSAEEAV-----RAGINWTFAPMIDITRDPRWGRIAESLGEDPYLCSKLGAAMVRGFQ- 176
Query: 205 VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMC 264
+D S P +++C KH+A Y R + A + E +M +LRPF+
Sbjct: 177 -------SDDLSAPDAIAACAKHFAGYGAAEGG---RDYNTANIPENEMRNVYLRPFKAA 226
Query: 265 VKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA 324
+ G ++ M ++ +NG+P+ + L+++ +R EW G +V+D +S+ V + H F
Sbjct: 227 AEAG-VATFMSAFCDLNGVPATGNRWLMDEILRQEWSYQGMVVSDWESV-VEMSVHGFTH 284
Query: 325 DSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
D E A + AG+D++ Y + V + K+ ID+ + + + LG F+
Sbjct: 285 DD-EQAAYEAAMAGIDMEMASSSYRDHLEGLVGENKITLEQIDRMVARVLRLKFELGLFE 343
Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
P ++ + N++ A +AA + VLLKN TLPL AK+ ++A++GP A+
Sbjct: 344 -QPYTDPAQHPELLNKANLKAAKQAATQSCVLLKNAHQTLPLVPAKLDSIALIGPLADDG 402
Query: 444 VAMIGNYA-------GIPCRY-MSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
+G + + CR + + G + + Y+ + S ++ AA AA+
Sbjct: 403 YEQMGTWVFDGDAAHSVTCRQALDELLGRT--VEIHYEKALETTRAASPDNFAAAKNAAQ 460
Query: 496 TADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAF 555
+DA II+ G + + E+ R ++ LPG+Q LI VA K P+I+VIM+ G +
Sbjct: 461 QSDAAIIVVGEEAFMSGEAHSRANIDLPGHQQALIEAVASAGK-PIIVVIMA--GRPLTI 517
Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------- 608
+ A+L+A +PG GG AIAD++ G +P G+LP+T+ V +P+
Sbjct: 518 EPVLEHADAVLYAWHPGTMGGPAIADLLLGLESPSGKLPVTFPR--VVGQVPIHYAQKNT 575
Query: 609 -------------TSMPLRPVDSLGYPGRTYKFYNG-PTLYPFGYGLSYTQFKYNLLSFT 654
+ P P SLG ++ G L+PFGYGLSY +F+Y
Sbjct: 576 GRPATQESCVDINEAPPRAPQTSLGM--TSFHLDAGFKPLFPFGYGLSYGRFQY------ 627
Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
V + H +R + D N+GS G +VV
Sbjct: 628 ----VKITTSHHS---------------------IRMGQSLDISADVVNMGSHAGEEVVQ 662
Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
+Y + IK++ GF+RV ++ G +RI F
Sbjct: 663 LYIRDLVGSVTRPIKELKGFRRVRLKPGERQRISF 697
>gi|237721943|ref|ZP_04552424.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
gi|229448812|gb|EEO54603.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
Length = 792
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 226/816 (27%), Positives = 355/816 (43%), Gaps = 156/816 (19%)
Query: 42 FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW 98
F+K G++ ++ D S P R+ DL+S+MTL+EK Q+ +G R+ P W
Sbjct: 39 FNKNGIKD---VYEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDACPTAGW 94
Query: 99 WSEALH-GVSNV-----GPGTHFDDV-------------------------IP------- 120
+E G+ N+ G G ++ IP
Sbjct: 95 LAEIWKDGIGNIDEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEG 154
Query: 121 -------GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
AT FP A++N+ L ++I + + EA+A LG + +SP +++A+
Sbjct: 155 IRGLCHDRATMFPAQCGQGATWNKKLIREIAKVTANEAKA---LGYTNI--YSPILDIAQ 209
Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
DPRWGR+ E+ GEDP++ G + GLQ+ EG + + KH+A Y +
Sbjct: 210 DPRWGRVVESYGEDPYLAGELGKQMILGLQN-EG-------------IVATPKHFAVYSI 255
Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
D V ++M+ +L PF ++E A VM SYN +G P L
Sbjct: 256 PVGGRDGGTRTDPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLT 315
Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT-- 351
+ +R +W GY+V+D ++++ + H+ + ++E+ AQ + AGL++ TNFT
Sbjct: 316 EILRQQWGFKGYVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPP 369
Query: 352 -------GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIE 403
A+ +GKV +D+ + + V +G FD P + + +D +
Sbjct: 370 QDFILPLRRAIDEGKVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKA 429
Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
++ +AA E +VLLKN+ LPL S K +AV+GP+A + Y + G
Sbjct: 430 VSMKAALESVVLLKNENQMLPL-SKNFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQG 488
Query: 464 FSGY---ANVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGL 506
Y + V Y GCD + + I A E AK +D I++ G
Sbjct: 489 IKEYLPNSEVRYAKGCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGG 548
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
+ E R +L L G Q QL+ V K PV+LV++ I +A N I AI+
Sbjct: 549 NEKTVREEFSRTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYIPAII 605
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRT 625
A +PGE G AIA V+FG +NPGGRL +T+ V +P + P +P DS G
Sbjct: 606 HAWFPGEFMGDAIAKVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG----- 657
Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCP 682
K LYPFGYGLSYT F Y+ L +K + Q N+
Sbjct: 658 -KVRVDGALYPFGYGLSYTTFGYSDLKISKPVIGPQENIT-------------------- 696
Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
L C +N G G +VV +Y + TY K + GF+R+ ++ G
Sbjct: 697 ------LSC--------TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPG 742
Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ + F + L + D + G ++ VG
Sbjct: 743 EEQTVSFTLTP-QDLGLWDKNNRFTVEPGSFSVMVG 777
>gi|423250669|ref|ZP_17231684.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
CL03T00C08]
gi|423253995|ref|ZP_17234925.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
CL03T12C07]
gi|392651626|gb|EIY45288.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
CL03T00C08]
gi|392654553|gb|EIY48200.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
CL03T12C07]
Length = 859
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 214/769 (27%), Positives = 337/769 (43%), Gaps = 146/769 (18%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
++F + ++SLP +RV+DL+SRMTL+EK+ Q+
Sbjct: 22 TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81
Query: 84 -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
F G+ RLG+P + +E+LHG V G
Sbjct: 82 GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T FP I ++FN L ++ A++ E L G+T +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GEDP++V R V+ VRG D + VS KH+ A+ G++
Sbjct: 184 EECFGEDPYLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
++++ +L+ FE VKE +VM SYN N P+ + L+ + +R W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
D GY+ +D +I ++ HK +S E A+ Q L AGLD + V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID+++ + T +G F+ + + + ++ LA + A E IVLL+N+
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
N LPL K+K++AV+GP NA G+Y G+ + + G + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
GC D+ + A + AK +D I++ G + A E D DL L
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q L+ + K PVI+V++S G A + NI I+ YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADML 577
Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
GK NP G+L ++ Y LP R S PG+ Y F + L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
+GLSYT F+Y LS T + + D C+D E
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+ +N G DG +V VY + ++++ GF++V ++ G K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVIPVQELKGFEKVLIKKGETKQV 715
>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
CL03T00C08]
gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
CL03T12C07]
gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
CL03T12C07]
gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
CL03T00C08]
Length = 805
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 232/807 (28%), Positives = 348/807 (43%), Gaps = 145/807 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
+ + S P RV+ L+S+MTL+EKV Q+ G+ P+L E+
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIG 99
Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
+L G P T H IP G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
T FPT I +++N L +++G+ ++ EA A + + P +++ARDPRW R+
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ET GEDP++ G VRG Q E D S V + KH+A+Y W
Sbjct: 215 ETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
A + E+++EE PF V G A SVM SYN ++G P LL ++ W
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
G++V+D ++ + ++ +A + +A + + AG+D D G Y AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
IDK+++ + ++ ++G FD Q + S E+ LA E AR+ IVLLKN
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKD 440
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
LPL ++T+AV+GP+A+ M+G+Y G + I S V Y
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499
Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
GC V S A E A+ ADA +++ G D S E
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E DR L L G Q +L+ +++ + K PV+LV++ G + +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G +GG A+ADV+FG +NP GRL ++ V LP+ R G R Y G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YVEEPG 668
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
YPFGYGLSYT F Y + +QV +D R
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK----VQVTEGS-----------------------DDCRV 701
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
D V QN G+ DG +V +Y + T KQ+ F R+ ++AG ++ + F
Sbjct: 702 D----VTVTIQNQGTADGDEVAQLYFQDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTL 757
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
+ KSL + ++ G TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|255689965|ref|ZP_05413640.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
gi|260624572|gb|EEX47443.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 688
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 190/676 (28%), Positives = 326/676 (48%), Gaps = 83/676 (12%)
Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
D I G T +P + S+N L ++ + EAR +G+ + +SP I+VARD
Sbjct: 70 DAIHGFRTVYPISLAQACSWNPDLVEQACAVSAQEARM------SGVDWTFSPMIDVARD 123
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR+ E GEDP+ G + VRG Q D S +V++C KHY Y
Sbjct: 124 PRWGRVAEGYGEDPYANGVFGAASVRGYQG--------DNMSAENRVAACLKHYVGYGAS 175
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
R + +++Q + +T+L P+EM VK G A+++M S+N ++G+P A+P + +
Sbjct: 176 E---AGRDYVYTEISQQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYTMTE 231
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY-YTNFTGN 353
++ W G+IV+D +I+ + ++ LA +K++A AGL++D + Y
Sbjct: 232 ILKNRWRHDGFIVSDWGAIEQL--KNQGLAATKKEAARYAFTAGLEMDMMSHAYDRHLQE 289
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGI 413
V++GKV +D++++ + + RLG F+ + K+ +++++AA A E +
Sbjct: 290 LVEEGKVSMAQVDEAVRRVLLLKFRLGLFERPYTPATTEKERFFRPKSMDIAARLAAESM 349
Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG------IPCRYMSPIAGFSGY 467
VLLKN+ N LPL K +AV+GP A ++G++ G + Y A F+G
Sbjct: 350 VLLKNENNVLPLTDK--KKIAVIGPMAKNGWDLLGSWRGHGKDTDVAMLYDGLAAEFAGK 407
Query: 468 ANVTYKTGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
A + Y GC+ N FA A EAA+ +D ++ G ++ E+ R + LP Q
Sbjct: 408 AELRYALGCNTQG--DNREGFAEALEAARWSDVVVLCLGEMMTWSGENASRSSIALPQMQ 465
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
+L ++ + K PV+LV+++ +++ E ++ AIL PG G +A ++ G+
Sbjct: 466 EELAKELKKAGK-PVVLVLVNGRPLELNRLEPVSD--AILEIWQPGVNGALPMAGILSGR 522
Query: 587 FNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
NP G+L +T P ++ +P+ R G+ G YK LYPFG+GL
Sbjct: 523 INPSGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQG-FYKDITSDPLYPFGHGL 572
Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
SYT+FKY T+ + K++ L+ +V
Sbjct: 573 SYTEFKYG------TVTPSATKVKRGEKLSA-------------------------EVTV 601
Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
N+G+ DG++ V + P +K++ F++ ++AG K +F + + V+
Sbjct: 602 TNIGARDGAETVHWFISDPYCSITRPVKELKHFEKQLIKAGETKTFRFDIDLERDFGFVN 661
Query: 762 YAANTLLPAGEHTIFV 777
L GE+ I V
Sbjct: 662 EDGKRFLETGEYNIHV 677
>gi|390167927|ref|ZP_10219905.1| beta-glucosidase, partial [Sphingobium indicum B90A]
gi|389589522|gb|EIM67539.1| beta-glucosidase, partial [Sphingobium indicum B90A]
Length = 771
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 220/737 (29%), Positives = 341/737 (46%), Gaps = 109/737 (14%)
Query: 78 VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNE 137
V L +A RLG+P + E LHG + VG ATSFP I +S++
Sbjct: 105 VNALQRWATTQTRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDP 152
Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
L +++ ++ E R+ R SP +++ARDPRWGRI ET GEDP++VG V
Sbjct: 153 DLLREVNAVIAREIRS-----RGVSLVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVA 207
Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEET 256
V GLQ G + L P KV + KH + ++ V A V+E+++ E
Sbjct: 208 AVEGLQ---GKGRSRLLP--PGKVFATLKHLTGHGQPESGTNVG----PAPVSERELREN 258
Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
F PFE VK +VM SYN ++G+PS A+ LL +RGEW G +V+D ++ +
Sbjct: 259 FFPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLRDVLRGEWGFRGAVVSDYSAVDQL 318
Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
++ H AD E A + L AG+D D G Y G V++GK+ E +D++++++
Sbjct: 319 MNIHHVAAD-LEQAAGRALDAGVDADLPDGLSYATL-GRQVREGKIGEALVDRAVRHMLE 376
Query: 375 VLMRLGFFDGSPQYVSLGKQDICSD-ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
+ R G F+ +P + + I +D LA +AA+ I+LLKND LPL ++
Sbjct: 377 LKFRAGLFE-NPYADAAASEKITNDGRARALALKAAQRSIILLKND-GMLPLKPE--GSI 432
Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG----YANVTYKTGCD---------DVA 480
AV+GP +A VA +G Y G P +S + G A + + G D
Sbjct: 433 AVIGP--SAAVARLGGYYGQPPHSVSILEGIRAKVGNRAKIVFAQGVRITENDDWWADKV 490
Query: 481 CKSNNS-----IFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQL 529
+S+ + I A EAA+ D ++ G E DR L L G Q +L
Sbjct: 491 TRSDPAENRRLIAQAVEAARHVDRIVLTLGDTEQSSREGWADNHLGDRPSLDLMGEQQEL 550
Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
+ + + K P+ +V+++ G + + + AIL Y GE+GG A+ADV+FG NP
Sbjct: 551 FDALKALGK-PIAVVLIN--GRPASTVKVSEQADAILEGWYLGEQGGHAVADVLFGDVNP 607
Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFK 647
GG+LP+T +P ++ L P+ P R Y F LYPFG+GLSYT F
Sbjct: 608 GGKLPVT---------IPRSAGQL-PMFYNVKPSARRGYLFDTTDPLYPFGFGLSYTSFD 657
Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
+ P + + VD +N G
Sbjct: 658 LS-------------------------------APRLSAAKIGVGGTTRVSVDVRNSGRR 686
Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
+G +VV +Y + IK++ GFQRV ++ G + + F ++L + + + +
Sbjct: 687 EGDEVVQLYVRDKVGSVTRPIKELKGFQRVTLKPGEVRTVTFTV-GPEALQMWNDHMDRV 745
Query: 768 LPAGEHTIFVGNGGVSF 784
+ G+ I GN V+
Sbjct: 746 VEPGDFEIMTGNSSVAL 762
>gi|262405837|ref|ZP_06082387.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294647798|ref|ZP_06725350.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294806192|ref|ZP_06765039.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345510348|ref|ZP_08789916.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
gi|262356712|gb|EEZ05802.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292636706|gb|EFF55172.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CC 2a]
gi|294446448|gb|EFG15068.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
xylanisolvens SD CC 1b]
gi|345454537|gb|EEO48843.2| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
Length = 800
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 232/862 (26%), Positives = 371/862 (43%), Gaps = 165/862 (19%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
+ LLC +L ++ + ++ AN +S ++ F+K G++ ++
Sbjct: 1 MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57
Query: 56 DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
D S P R+ DL+S+MTL+EK Q+ +G R+ P W +E G+ N+
Sbjct: 58 DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116
Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
G G ++ IP AT FP
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDLTNEGIRGLCHDRATMFPA 176
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
A++N+ L ++I + + EA+A LG + +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTANEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P++ G + GLQ EG + + KH+A Y + D
Sbjct: 232 PYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V ++M+ +L PF ++E A VM SYN +G P L + +R +W GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
+D ++++ + H+ + ++E+ AQ + AGL++ TNFT +A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRHAINEG 391
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
KV +D+ + + V +G FD P + + +D + ++ +AA E +VLLK
Sbjct: 392 KVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVVLLK 451
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
N LPL S K +AV+GP+A + Y + G Y + V Y
Sbjct: 452 NKNQMLPL-SKNFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510
Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
GCD + + I A E AK +D I++ G + E R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L G Q QL+ V K PV+LV++ I +A N + AI+ A +PGE G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
V+FG +NPGGRL +T+ V +P + P +P DS G K LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678
Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
GLSYT F Y+ L +K + Q N+ L C
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N G G +VV +Y + TY K + GF+R+ ++ G + + F +
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L + D + G ++ VG
Sbjct: 764 LGLWDKNNQFTVEPGSFSVMVG 785
>gi|451821117|ref|YP_007457318.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451787096|gb|AGF58064.1| periplasmic beta-glucosidase BglX [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 750
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 214/718 (29%), Positives = 341/718 (47%), Gaps = 94/718 (13%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTAS 134
EK +L A RLG+P L G+ DVI G T FP + S
Sbjct: 95 EKSNELQKIAVEESRLGIP-------ILFGL----------DVIHGYRTIFPIPLAEACS 137
Query: 135 FNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGR 193
F+ K+ + + EA A AGL + ++P ++++RDPRWGR+ E GEDP++
Sbjct: 138 FDIEKIKESARIAAKEASA------AGLHWTFAPMVDISRDPRWGRVAEGAGEDPYLGSV 191
Query: 194 YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDM 253
A V G Q E +N P + +C KH+A Y + G D D + Q +
Sbjct: 192 IAKARVEGFQG-ESLDN-------PESILACAKHFAGYGAPDG-GRDYNTVDMSL--QTL 240
Query: 254 EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI 313
+ +L PF+ + G + M ++N +NGIP + LL +R ++ +G++V+D +SI
Sbjct: 241 HDVYLPPFKAAAEAG-VGTFMSAFNDLNGIPCTVNKYLLTDVLREKFGFNGFVVSDANSI 299
Query: 314 -QVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKY 371
+V+V H + D+K A + L AGLD+D Q Y N V++G + E +D++++
Sbjct: 300 PEVVV--HGYAEDNKA-ASKKALNAGLDMDMSQGTYRNELPELVKEGDILEEVLDEAVRR 356
Query: 372 LYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
+ V LG FD +P K++ + E++E A + +R IVLLKN+ N LPL
Sbjct: 357 VLRVKFLLGLFD-NPYRTDAKKEEKTLLCKEHLEAARDISRRSIVLLKNENNALPLKK-D 414
Query: 430 VKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGF----SGYANVTYKTGCDDVACKS 483
+K +AVVGP A M+G ++ G P ++ I+G S + Y GC + +
Sbjct: 415 LKKIAVVGPLAENAAEMLGTWSHTGNPSDVVTIISGIKAAVSTETEILYAEGC-KITGEE 473
Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
A AK +D I + G + + E+ R D+ LPG Q +L+ ++ ++ K P+I+
Sbjct: 474 CIDFEGAVRVAKESDVIIAVVGENSDMSGEAASRIDINLPGKQEELLKELRKIGK-PLIV 532
Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW-YNGDY 602
V+++ + I + N+ A++ A G + G AIADV+FG +NP G+L T+ Y+
Sbjct: 533 VLINGRPLTIPW--EAENVDALVEAWQLGTQSGNAIADVLFGDYNPSGKLVATFPYSVGQ 590
Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT--LYPFGYGLSYTQFKYNLLSFTKTIQVN 660
V + M RP + + T K+ +GP LYPFG+GLSYT FKY LS
Sbjct: 591 VPIYYNNPMTGRPAGKIKF---TSKYIDGPAEPLYPFGFGLSYTTFKYENLS-------- 639
Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
+L + + D KV N G G +VV +Y
Sbjct: 640 -----------------------ILSAENKIGDTVAVKVYVTNTGEVSGEEVVQLYVSDV 676
Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+K++ F++V ++ K I F N K L D N ++ G ++VG
Sbjct: 677 VASRVRPVKELKSFEKVLLQPKECKTIIFKLN-TKDLGFHDENMNYVVEPGLFKVYVG 733
>gi|402494058|ref|ZP_10840805.1| b-glucosidase [Aquimarina agarilytica ZC1]
Length = 708
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 209/746 (28%), Positives = 346/746 (46%), Gaps = 94/746 (12%)
Query: 62 SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWS--EALHGVSNVGPGTHFDDVI 119
S R KDL ++ D K ++G F + + + + + + + E+ G+ P DVI
Sbjct: 16 SSRSKDLPEQLKQDVKNGKIGAFLNVMNKAYVDELQRIAIEESPQGI----PLIFARDVI 71
Query: 120 PG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRW 177
G T FP + AS++ K + + EA + G+ + ++P +++A+D RW
Sbjct: 72 HGFKTIFPIPLGLAASWDAETAKSAARVSAIEASSF------GIRWTFAPMLDIAQDSRW 125
Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
GRI E+PGEDP++ A YV G Q+ DL S+P +++C KH+ Y
Sbjct: 126 GRIAESPGEDPYLASILAKAYVEGFQN-------NDL-SQPTSLAACAKHFIGYGA---- 173
Query: 238 GVDRYHFDARVTEQDM-EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
+ ++ + Q + T+L+PFE + G A +VM S+N +NG+P+ + LLN +
Sbjct: 174 AIGGRDYNTAIIHQPLLHNTYLKPFEAALAAG-APTVMTSFNEINGVPASGNKWLLNDIL 232
Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAV 355
RG+ D G++V+D +S M+D H + + K A + AGLD++ + Y N +
Sbjct: 233 RGKLDFKGFVVSDWNSTTGMID-HGYAKNEKHTA-ELSFNAGLDMEMTSKSYENHLKELL 290
Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
++ K+ ET +D + + V +L F +P + ++++LA +A + VL
Sbjct: 291 EEKKITETQLDFLVANILRVKFQLDLFK-NPYRSKTFTGNYYDQKHLDLAKKAVIKSSVL 349
Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSG-YANVTY 472
LKN+ LPLN K VAV+GP ANA + +G + G +P + F+ N +
Sbjct: 350 LKNNA-ILPLN--KNTKVAVIGPLANAPLEQLGTWIFDGDKKHTQTPTSAFTNNKVNFKF 406
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQ 532
G S A E A+ +D + G + + E+ R + LPG Q LI
Sbjct: 407 TEGLSYSRDTSTQGFKKALEIAEASDVILFFGGEEAILSGEAHSRASIDLPGKQEALIKA 466
Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
+A+ K P++LVIM GG ++ ++ A+L A +PG GG AI ++++GK P GR
Sbjct: 467 LAKTGK-PIVLVIM--GGRPLSITNIIDDVDAVLMAWHPGTMGGPAIYEMLWGKSEPQGR 523
Query: 593 LPITWYNGDYVQMLPL------TSMPLRP-----VDSL------GYPGRTYKFYN--GPT 633
LP++W LPL T P P +DS+ G T + +
Sbjct: 524 LPVSW--PKTAGQLPLFYNHKSTGRPFDPKSFVQMDSIPVGAWQSSLGNTTHYLDLGAAP 581
Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
+PFGYGL YT+F Y L +KT + ++
Sbjct: 582 HFPFGYGLGYTRFSYKNLKISKTT-------------------------------ISKNE 610
Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
V N G GSD+V +Y + +K++ F+ +F+ G K ++F
Sbjct: 611 TVSLSVTITNTGKNAGSDIVQLYIQDIVGSLTRPVKELKRFKPIFLEKGETKTVEFTITP 670
Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGN 779
K L V+ +L +G+ +FVGN
Sbjct: 671 -KDLMFVNNTLQPVLESGDFNVFVGN 695
>gi|423303939|ref|ZP_17281938.1| hypothetical protein HMPREF1072_00878 [Bacteroides uniformis
CL03T00C23]
gi|423307339|ref|ZP_17285329.1| hypothetical protein HMPREF1073_00079 [Bacteroides uniformis
CL03T12C37]
gi|392686630|gb|EIY79933.1| hypothetical protein HMPREF1072_00878 [Bacteroides uniformis
CL03T00C23]
gi|392690354|gb|EIY83622.1| hypothetical protein HMPREF1073_00079 [Bacteroides uniformis
CL03T12C37]
Length = 736
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 214/775 (27%), Positives = 356/775 (45%), Gaps = 109/775 (14%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA--------------HGVP-RLGLPQY- 96
++ D+ P RV DLVSRMTL+EKVQQL + +P LG Y
Sbjct: 28 IYKDAKAPIEERVNDLVSRMTLEEKVQQLNQYTLGRNNNENNRGEEVKKIPATLGSLIYF 87
Query: 97 --------EWWSEALHGVSNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQA 146
E +A+ S +G F DVI G T +P + S+N L ++
Sbjct: 88 DEDANLRNEAQRKAMEE-SRLGIPILFGYDVIHGFRTIYPISLGQACSWNPQLVEQACAV 146
Query: 147 VSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
+ EAR +G+ + +SP I+VARD RWGR+ E GEDP+ N V G+ +
Sbjct: 147 AAQEARM------SGVDWTFSPMIDVARDGRWGRVAEGYGEDPYT------NAVFGVASI 194
Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
+G++ +S+ +V++C KHY Y R + ++ Q + +T++ P+E V
Sbjct: 195 KGYQGEDMSDSK--RVAACLKHYIGYGASE---AGRDYVYTEISNQTLWDTYIPPYEAGV 249
Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
K G A+++M S+N ++G P A+ + + ++ W G++V+D ++ ++D AD
Sbjct: 250 KAG-AATLMSSFNDISGTPGSANHYTMTEILKNRWKHDGFVVSDWSAVPQLID-QGHAAD 307
Query: 326 SKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
KE A AGL++D G Y V++GK+ +D ++K + + RLG FD
Sbjct: 308 RKE-AARLAFNAGLEMDMMGHCYDKHMAKLVEEGKISMQLVDDAVKRVLRIKFRLGLFDN 366
Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
S K+ +++ +A + A E IVLLKN+ LPL + T+AV+GP +
Sbjct: 367 PYTPTSTEKERFLLPQSLTIAEKLAEETIVLLKNENKVLPLANGNKPTIAVMGPLVQNSA 426
Query: 445 AMIGNYAG-------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEA-AKT 496
++G++ G +P + + A F+G A + Y GCD ++ S F+ + A A+
Sbjct: 427 ELLGSWYGHGHAEDVLPIK-KALDAEFAGKAELIYTEGCDFDG--NDTSKFSEALAVARK 483
Query: 497 ADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFA 556
AD ++ G E+ R + LP Q + I ++ + K P++L + A G + +
Sbjct: 484 ADIILLCMGEKKKWSGENASRSIIELPAIQEKFIAEMKKAGK-PIVLAL--ANGRPLGLS 540
Query: 557 ETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL---TSMPL 613
+ AI+ PG GG+ +A V+ G+ NP G+L IT+ +P+
Sbjct: 541 KVEPLCDAIVEMWQPGVPGGKPLAGVLSGRVNPSGKLSITFPRS--TGQIPIYYNQRKTA 598
Query: 614 RPVDSLGYPGRTYKFYNGPT--LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
RP ++ K+ N P+ LY FGYGLSYT F Y +NL K
Sbjct: 599 RP--------QSGKYQNIPSTPLYEFGYGLSYTTFNYG--------NINLPK-------- 634
Query: 672 YTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
+R + ++ NVG DG++VV + P K++
Sbjct: 635 ---------------ETIRRGEKLVMEIPVTNVGKRDGAEVVHWFISDPFSTITRPCKEL 679
Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
F++ ++AG +F + + L V+ L GE+ + V + V F +
Sbjct: 680 KHFEKQLIKAGETHIFRFEIDPMRDLAFVNANGEHFLENGEYYVIVKDQKVKFTV 734
>gi|420148909|ref|ZP_14656095.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 335 str. F0486]
gi|394754508|gb|EJF37885.1| glycosyl hydrolase family 3, N-terminal domain protein
[Capnocytophaga sp. oral taxon 335 str. F0486]
Length = 770
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 210/733 (28%), Positives = 352/733 (48%), Gaps = 99/733 (13%)
Query: 64 RVKDLVSRMTLDEKVQQLGDFAHGVPRLG---LPQYEWWSEA------LHGVSNVG---- 110
RV ++ MTL+EK+ Q+ F+ G +Y+ + E + S VG
Sbjct: 46 RVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVVGMRKL 105
Query: 111 ------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
P DVI G T FP + + S++ +L +K + + EA A
Sbjct: 106 QKIAVEQTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREATA---- 161
Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
G+ + ++P +++ RD RWGR E GEDP++ A V+G Q G +N L+S
Sbjct: 162 --DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQ---GGDNWQTLSS 216
Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
P + +C KH+A Y G D + A ++ + +L P+E + G S+M S
Sbjct: 217 -PHTLLACGKHFAGYGAAE-SGKD--YNTAELSMHTLRNVYLPPYEATLNAG-VGSIMAS 271
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
N +NG+P+ A LL + +R EW +G +V+D I +V H D K+ A +
Sbjct: 272 LNEINGVPATAYKWLLTEVLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQ-AANLSAN 329
Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGK 393
AG+++D G + + V++GKV E IDK+++++ + LG FD +Y+ + K
Sbjct: 330 AGIEMDMNGATFIKYLSALVKEGKVTEAQIDKAVRHILEMKFLLGLFDDPYRYLDETRAK 389
Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-- 451
++ ++E +++A +A +VLLKN+ LP+ KT+AV+GP N T + G++
Sbjct: 390 ENTFTEEYLKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCL 449
Query: 452 GIPCRYMSPIAGFSGY-----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
G + +S + G + + Y GC S + A A+ AD ++ G
Sbjct: 450 GDGKQSVSLLTGLTEKYKGTNVKLLYAEGCG-FTTISTEQLKEAVAIARKADRVLVAVGE 508
Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
S ES R D+ LP Q QL+ + + K P+ ++ S +D+++ N N++AIL
Sbjct: 509 QSSWSGESAVRTDIRLPQAQRQLLEALKAINK-PIAIITFSGRPLDLSWE--NENVQAIL 565
Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPL----RPV 616
A +PG +GG IADV+ G NP G+L +++ V +P+ T P+ V
Sbjct: 566 QAWFPGTQGGYGIADVIAGDVNPSGQLTMSFPRS--VGQIPIYYNYKSTGRPVYTNNEEV 623
Query: 617 DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
D + Y + LYPFGYGLSYT F N V+LNK +++ +D+
Sbjct: 624 DHRPHYNAGYLDSSITPLYPFGYGLSYTTFAIN--------NVHLNK----KSIKRYNDS 671
Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
++VN QN G+T+G VV +Y++ + +K++ GFQ+
Sbjct: 672 -------IIVN-----------ASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQK 713
Query: 737 VFVRAGRNKRIKF 749
+ ++AG +K++ F
Sbjct: 714 IPLKAGESKQVHF 726
>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
Length = 802
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 234/814 (28%), Positives = 346/814 (42%), Gaps = 159/814 (19%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE------------ 101
+ + S+P RV+ L+S+MTL+EKV Q+ + LG P YE E
Sbjct: 37 YENPSVPVEERVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEEIRLTARLEKEI 90
Query: 102 ------ALHGVSNVGPGT--------------------------HFDDVIP--------- 120
AL G P T H IP
Sbjct: 91 SEYHIGALWGFMRADPWTQRTLHTGLNPSLAARASNRLQAFVMEHSRLGIPLFLAEECPH 150
Query: 121 -----GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDP 175
G T FPT I +++N L +++G+ ++TEA A + + P +++ARDP
Sbjct: 151 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIATEASA-----QGAHIGYGPVLDLARDP 205
Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
RW R+ ET GEDP++ G VRG Q L R V + KH+A+Y
Sbjct: 206 RWSRVEETYGEDPYLNGVMGAALVRGFQ-------GDTLRGRK-SVIATLKHFASY---G 254
Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
W A + E+++EE PF V G A SVM SYN ++G P LL
Sbjct: 255 WTEGGHNGGTAHLGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDI 313
Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNA 354
++ W G++V+D +I + ++ +A S +A + + AG+D D G Y A
Sbjct: 314 LKDRWQFKGFVVSDLYAIGGLREHG--VAGSDYEAAVKAVNAGVDSDLGTNVYAEQLVAA 371
Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
V++G V +DK+++ + + +G FD Q + S E+I LA E AR+ IV
Sbjct: 372 VRKGDVAMETVDKAVRRILFLKFHMGLFDAPFVDDKRPAQLVASPEHIGLAREVARQSIV 431
Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY------- 467
LLKN+ LPL ++T+AV+GP+A+ M+G+Y P S + G
Sbjct: 432 LLKNEDKLLPLKK-DIRTLAVIGPNADNGYNMLGDYTA-PQADGSVVTVLEGIRQKVSKD 489
Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------ 511
V Y GC V S A EAA++AD +++ G D S E
Sbjct: 490 TRVLYAKGCA-VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSAN 548
Query: 512 -------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
E DR L L G Q +L+ +V ++ K P++LV++ G + A
Sbjct: 549 RVSDMESGEGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIK--GRPLLMEGVIQEADA 605
Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
IL A YPG +GG A+ADV+FG +NP GRL ++ V LP+ R + Y
Sbjct: 606 ILDAWYPGMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTKRKGNRSRYIEE 663
Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
G YPFGYGLSYT F Y + + + N HCR
Sbjct: 664 A-----GTPRYPFGYGLSYTTFSYTGMKVRVSEESN-----HCR---------------- 697
Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
+ V +N G+ DG +VV +Y + T +Q+ F RV ++AG
Sbjct: 698 ----------VDVSVTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGET 747
Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ I F + KSL + + G T+ G
Sbjct: 748 REITFTLDK-KSLALYMRDGEWAVEPGRFTVMAG 780
>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
615]
gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
615]
Length = 805
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 232/807 (28%), Positives = 348/807 (43%), Gaps = 145/807 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
+ + S P RV+ L+S+MTL+EKV Q+ G+ P+L E+
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIG 99
Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
+L G P T H IP G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
T FPT I +++N L +++G+ ++ EA A + + P +++ARDPRW R+
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ET GEDP++ G VRG Q E D S V + KH+A+Y W
Sbjct: 215 ETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
A + E+++EE PF V G A SVM SYN ++G P LL ++ W
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
G++V+D ++ + ++ +A + +A + + AG+D D G Y AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
IDK+++ + ++ ++G FD Q + S E+ LA E AR+ IVLLKN
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAAQLVASSEHTGLAREVARQSIVLLKNKD 440
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
LPL ++T+AV+GP+A+ M+G+Y G + I S V Y
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499
Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
GC V S A E A+ ADA +++ G D S E
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E DR L L G Q +L+ +++ + K PV+LV++ G + +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G +GG A+ADV+FG +NP GRL ++ V LP+ R G R Y G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPG 668
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
YPFGYGLSYT F Y + +QV +D R
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK----VQVTEGS-----------------------DDCRV 701
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
D V QN G+ DG +V +Y + T KQ+ F R+ ++AG ++ + F
Sbjct: 702 D----VTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTL 757
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
+ KSL + ++ G TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGLFTIMVG 783
>gi|404449838|ref|ZP_11014826.1| periplasmic beta-glucosidase [Indibacter alkaliphilus LW1]
gi|403764685|gb|EJZ25578.1| periplasmic beta-glucosidase [Indibacter alkaliphilus LW1]
Length = 763
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 205/699 (29%), Positives = 328/699 (46%), Gaps = 98/699 (14%)
Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
DVI G T F + +++++ L +K + + EA A G+ + +SP ++V+RD
Sbjct: 112 DVIHGYETLFSIPLGLSSTWDMELIEKSARIAAIEASA------DGINWTFSPMVDVSRD 165
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGR++E GEDPF+ + A +RG Q ++ T N+ + +C KH+A Y
Sbjct: 166 PRWGRVSEGNGEDPFLGAKIAQAMIRGYQ----GDDLTAYNT----IMACVKHFALYGAP 217
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
G D D ++ Q M + P++ V+ G SVM ++N V+GIP+ A+ L+
Sbjct: 218 E-AGRDYNTVD--MSRQRMYNEYFLPYQAAVEAG-VGSVMTAFNDVDGIPASANKWLMTD 273
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGN 353
+R +W G++V D +I M + L D ++ A L AG+D+D G+ +
Sbjct: 274 VLREQWGFDGFVVTDYTAINEMTSHG--LGD-LQNVSALALLAGVDMDMVGEGFLTTLEK 330
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAARE 411
++++GK+ E+ ID ++K + +LG FD +Y LG+ ++I + E+ + A E A +
Sbjct: 331 SLEEGKISESHIDTAVKRILVAKYKLGLFDDPYRYSDLGRSEKEIFTQEHRKTAREIAAQ 390
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN-- 469
VLLKN+ + LPL K +A+VGP A+ M G ++ + R+ I+ G N
Sbjct: 391 SFVLLKNEGSILPLK--KSGKIALVGPMADNRENMSGTWS-VAGRFTEAISLKDGLENAL 447
Query: 470 ---VTYKTG-----CDDVACKSNNSIFA----------------ASEAAKTADATIILAG 505
VT T +D + SIF A E A+ +D I G
Sbjct: 448 GNEVTLLTARGANVVEDAEYEERVSIFGKPTYRDERPEETLISEALEIARESDVIIAAMG 507
Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
+ E+ R D+ LP Q +L+ + + K PV+LV+ + G +A ++ I
Sbjct: 508 ESAEMSGEAASRSDIELPANQRRLLEALLDTGK-PVVLVLFT--GRPLAIKWEAEHVSGI 564
Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSL 619
L + G E G AIADV+FG NP G+L T+ V +P+ T PL
Sbjct: 565 LNVWFAGSEAGDAIADVLFGDVNPSGKLTATFPQN--VGQIPIFYNHKNTGRPLPEGQWF 622
Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
Y + LYPFGYGLSYT+F Y+ L +
Sbjct: 623 QKFRSNYLDVSNEPLYPFGYGLSYTEFDYSGLQLS------------------------- 657
Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
+L D+ + VD +N GS DGS+VV +Y + +K++ GF++VFV
Sbjct: 658 ------AEELSGDETLQITVDVRNAGSLDGSEVVQLYVRDLVASITRPVKELKGFEKVFV 711
Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+AG + + F + L + A + AGE I VG
Sbjct: 712 KAGETRSVTFELTK-RDLMFYNQDAEFVWEAGEFEIMVG 749
>gi|298482587|ref|ZP_07000772.1| xylosidase [Bacteroides sp. D22]
gi|336405443|ref|ZP_08586122.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
gi|295085727|emb|CBK67250.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
XB1A]
gi|298271294|gb|EFI12870.1| xylosidase [Bacteroides sp. D22]
gi|335938024|gb|EGM99918.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
Length = 800
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 232/862 (26%), Positives = 370/862 (42%), Gaps = 165/862 (19%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
+ LLC +L ++ + ++ AN +S ++ F+K G++ ++
Sbjct: 1 MKKLLCLALLVSAGSIYSESISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57
Query: 56 DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
D S P R+ DL+S+MTL+EK Q+ +G R+ P W +E G+ N+
Sbjct: 58 DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116
Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
G G ++ IP AT FP
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
A++N+ L ++I + + EA+A LG + +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTANEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P++ G + GLQ EG + + KH+A Y + D
Sbjct: 232 PYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V ++M+ +L PF ++E A VM SYN +G P L + +R +W GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
+D ++++ + H+ + ++E+ AQ + AGL++ TNFT A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAIDEG 391
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
KV +D+ + + V +G FD P + + +D + ++ +AA E +VLLK
Sbjct: 392 KVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVVLLK 451
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
N LPL S K +AV+GP+A + Y + G Y + V Y
Sbjct: 452 NKNQMLPL-SKNFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510
Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
GCD + + I A E AK +D I++ G + E R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L G Q QL+ V K PV+LV++ I +A N + AI+ A +PGE G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
V+FG +NPGGRL +T+ V +P + P +P DS G K LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678
Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
GLSYT F Y+ L +K + Q N+ L C
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N G G +VV +Y + TY K + GF+R+ ++ G + + F +
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QD 763
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L + D + G ++ VG
Sbjct: 764 LGLWDKNNQFTVEPGSFSVMVG 785
>gi|189462809|ref|ZP_03011594.1| hypothetical protein BACCOP_03507 [Bacteroides coprocola DSM 17136]
gi|189430425|gb|EDU99409.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
coprocola DSM 17136]
Length = 754
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 195/661 (29%), Positives = 312/661 (47%), Gaps = 87/661 (13%)
Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
DVI G T FP + ASFN L ++ + + EA A G+ + ++P I+V+RD
Sbjct: 110 DVIHGYKTIFPICLGQAASFNPDLVRESARVAAIEASA------DGIRWTFAPMIDVSRD 163
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGRI E+ GEDP++ + G Q D + P +++C KH+ Y
Sbjct: 164 PRWGRIAESCGEDPYLTAVLGKAMIEGFQG--------DSLNDPTSIAACAKHFVGYGAA 215
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
R + + E+ + +L PFE K A++ M S+N +G+PS + +L
Sbjct: 216 E---SGRDYNSTFLPERLLRNVYLPPFEAAAKA-GAATFMTSFNDNDGVPSTGNKFILKN 271
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD--CGQYYTNFTG 352
+R EW G +V D S M+ H F D+ DA ++L AG+D+D G + N
Sbjct: 272 VLREEWKYDGMVVTDWASATEMI-THGFCKDAA-DAAKKSLDAGVDMDMVSGAFSGNLE- 328
Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREG 412
N V++ K+ E ID++++ + + RLG F+ YVS + S E++ A +A +
Sbjct: 329 NLVKENKISEKQIDEAVRNILRLKFRLGLFENP--YVSTPQSVKYSPEHLAKAKQAVEQS 386
Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAG----FSG 466
++LLKN TLPLN+ +V TVAVVGP A+A +G + G +P+A +
Sbjct: 387 VILLKNTNQTLPLNADEVHTVAVVGPLADAPHDQMGTWVFDGEKAHTQTPLAALRAVYGD 446
Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
+ Y+ K + A AAK AD + G + + E+ DL L G Q
Sbjct: 447 KVRIIYEPALAYSRDKQTTGLAKAVNAAKQADVVLAFVGEESILSGEAHSLADLNLQGLQ 506
Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
++LI ++++ K P++ V+M+ G + A+ A+L+A +PG GG A+AD++FGK
Sbjct: 507 SELIEKLSQTGK-PLVTVVMA--GRPLTIAKEVEESDAVLYAFHPGTMGGPALADILFGK 563
Query: 587 FNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSL-----------GYPGRTYKFY 629
NP G+ P+T+ V LP+ T P + L R++
Sbjct: 564 VNPSGKTPVTFPK--MVGQLPMYYAHNNTGRPALEKEMLLDEIPMEAGQTSVGCRSFFLD 621
Query: 630 NGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
G T L+PFGYGLSYT F Y L ++
Sbjct: 622 AGSTPLFPFGYGLSYTTFSYGNLK-------------------------------IVSGK 650
Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
L D + V+ +N G +G++VV +Y + +K++ FQRV ++ G +K++
Sbjct: 651 LTVSDTLKVSVELKNTGRYEGTEVVQLYVQDKVGSVTRPVKELKRFQRVNLQPGESKQVM 710
Query: 749 F 749
F
Sbjct: 711 F 711
>gi|333377431|ref|ZP_08469165.1| hypothetical protein HMPREF9456_00760 [Dysgonomonas mossii DSM
22836]
gi|332884165|gb|EGK04433.1| hypothetical protein HMPREF9456_00760 [Dysgonomonas mossii DSM
22836]
Length = 743
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 211/778 (27%), Positives = 359/778 (46%), Gaps = 119/778 (15%)
Query: 53 LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF--------------AHGVPRLGLPQYEW 98
LF ++ Y R++ L+ +MTL+EK+ Q+ H + +
Sbjct: 18 LFAQVNIEY--RIEALLKQMTLEEKIGQMNQLHCEDWNKLKEETEKGHVGSVMSITDPNL 75
Query: 99 WSEALHGV---SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
++E S +G P + DVI G T FP + A+FN + +K Q +TEA A
Sbjct: 76 FNEIQKIAVEESRLGIPLINARDVIHGFKTIFPIPLGQAATFNPEIVEKSSQIAATEASA 135
Query: 154 MYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
AG+ + ++P I++ DPRWGRI E GEDP++V +RG Q H
Sbjct: 136 ------AGIRWTFAPMIDITHDPRWGRIAEGFGEDPYLVSEMGKASIRGFQGRSLH---- 185
Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
P + +C KH+ AY R + V+E+ + +LRPFE V+ G A+
Sbjct: 186 ----NPRSILACAKHFVAYGAAEG---GRDYNSTFVSERRLRNLYLRPFEEAVQSGVAT- 237
Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
+M S+N +GIP+ LL +R EW+ +G++++D S+ M H + + KE A+
Sbjct: 238 IMTSFNDNDGIPASGSKFLLTDILRNEWEFNGFVISDWASVIEMA-KHGYCKNGKEAAM- 295
Query: 333 QTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSL 391
+ + AGLD++ + Y N +++G+V +DID +++ + + LG F+ Y+
Sbjct: 296 KAVNAGLDMEMVSETYINHLPQLLKEGEVSLSDIDNAVRNILRIKFELGIFEQP--YIQD 353
Query: 392 GKQDIC-SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
+++I ++ ++E A EA + +LLKN+ N LPLN +K + V GP ANA +G +
Sbjct: 354 EREEIYYAESHLEAAQEAVEQSTILLKNENNVLPLNMNNIKRILVTGPMANAPHDQLGTW 413
Query: 451 A--GIPCRYMSPIAGF---SGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
G +P+ SG+ + Y+ S + E AK D +
Sbjct: 414 VFDGDKKYTRTPLISLQEQSGHIIEIIYEPALSISRDTSKYNFSKVVELAKKVDVILAFV 473
Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG----GVDIAFAETNT 560
G + + E+ L L G Q+ LI ++A K P++ M+ G ++A ++
Sbjct: 474 GEEAILSGEAHSLTTLNLLGAQSALIEELANTGK-PLVTTFMAGRPLSIGKEVALSD--- 529
Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------------ 608
A+L++ +PG GG A+ ++ GK P G+LP+T+ V +P+
Sbjct: 530 ---AVLYSFHPGTMGGPALVSLLTGKVIPSGKLPVTFPKN--VGQIPIYYNHNNTGRPAD 584
Query: 609 ---TSMPLRPVD----SLGYPGRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVN 660
T++ P++ SLG ++Y G LYPFGYGLSYT F Y+ L
Sbjct: 585 GNETTLYQIPIEAEQTSLG--NKSYYLDAGKDPLYPFGYGLSYTTFIYSNL--------- 633
Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
+L H N ++ DD E D N G D ++V+ +Y +
Sbjct: 634 --QLSH--------------------NKIKKDDTLEVSFDLSNTGKYDATEVIQIYFRDI 671
Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+K+++ F R+ ++AG+ ++K K L + ++ +G+ +FVG
Sbjct: 672 VANIIRPVKELVHFDRINLQAGKTMKVKVEIPVSK-LAFWNIDMQKVVESGQFELFVG 728
>gi|430736195|gb|AGA60127.1| glycoside hydrolase [Aminobacter sp. Gsoil204]
Length = 772
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 201/662 (30%), Positives = 320/662 (48%), Gaps = 89/662 (13%)
Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
DVI G T FP + AS++ +K + +TEA A G+ + ++P ++VARD
Sbjct: 134 DVIHGHRTIFPISLGEAASWDLKAIEKAARISATEASA------EGIHWTFAPMVDVARD 187
Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
PRWGRI+E GED ++ R A VRG Q DL + V + KH+AAY
Sbjct: 188 PRWGRISEGAGEDVYLGSRIAEARVRGFQ-------GNDLKAVD-TVLATAKHFAAYGAA 239
Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
R + ++E+ + + +L PF+ A++ M S+N V+GIP+ + LL
Sbjct: 240 Q---AGRDYGTVDISERTLRDVYLPPFKAAADA-GAATFMTSFNDVDGIPASGNHHLLTD 295
Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGN 353
+R +W G++V D SI MV H + D ++ A Q + AG+D+D G +
Sbjct: 296 VLRDKWGFKGFVVTDYTSINEMV-AHGYSKDLQQ-AGEQAINAGVDMDLQGAVFMEHLAK 353
Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAARE 411
+V +GKV ID ++K + + RLG F+ +Y ++ + + +E A + AR+
Sbjct: 354 SVAEGKVDVARIDAAVKAILEMKYRLGLFEDPYRYSDEAREKATVYRPDFLEAARDVARK 413
Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA--- 468
+VLLKN N LPL +A K++AV+GP ++ MIG+++ R P+ G
Sbjct: 414 SMVLLKNANNALPL-AASAKSIAVIGPLGDSKADMIGSWSAAGDRKTRPVTLLEGMQARA 472
Query: 469 ----NVTYKTGCD---DVACKSNNSIFAASEA-AKTADATIILAGLDLSVEAESLDREDL 520
+V Y G + A K++ FA + A A+ +D + G + E+ R L
Sbjct: 473 PKGQSVAYVRGASYAFEDAGKTDG--FAEAIALAQKSDVIVAAMGERWDMTGEAASRTSL 530
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
LPG Q L+ ++ + K P+ILV+MS I +A + N+ AIL A YPG GG AIA
Sbjct: 531 DLPGNQQALLQELKKTGK-PIILVLMSGRPNSIEWA--DANVDAILEAWYPGTMGGHAIA 587
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSLGYPGRTY--KFYNGP 632
DV++G +NP G+LP T+ V +PL T P+ P P Y ++ N P
Sbjct: 588 DVLYGDYNPSGKLPATFPRN--VGQVPLYYDMKNTGRPIDPAK----PDAKYVSRYLNTP 641
Query: 633 T--LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
LYPFGYGLSYT F Y+ ++ +K ++
Sbjct: 642 NTPLYPFGYGLSYTSFTYSPVTLSKA-------------------------------RIK 670
Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
+ V N G+ DG +VV +Y + ++++ GF+++ ++ G +K + F
Sbjct: 671 PGEPLTASVTVTNSGARDGEEVVQLYVRDLVGSVTRPVRELKGFRKIPLKKGESKTVSFT 730
Query: 751 FN 752
Sbjct: 731 LT 732
>gi|299144988|ref|ZP_07038056.1| xylosidase [Bacteroides sp. 3_1_23]
gi|298515479|gb|EFI39360.1| xylosidase [Bacteroides sp. 3_1_23]
Length = 800
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 232/862 (26%), Positives = 371/862 (43%), Gaps = 165/862 (19%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
+ LLC +L ++ + ++ AN +S ++ F+K G++ ++
Sbjct: 1 MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKKGIKD---VYE 57
Query: 56 DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
D S P R+ DL+S+MTL+EK Q+ +G R+ P W +E G+ N+
Sbjct: 58 DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116
Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
G G ++ IP AT FP
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
A++N+ L ++I + + EA+A LG + +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTADEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P++VG + GLQ+ EG + + KH+A Y + D
Sbjct: 232 PYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V ++M+ +L PF ++E A VM SYN +G P L + +R +W GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
+D ++++ + H+ + ++E+ AQ + AGL++ TNFT A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAINEG 391
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
KV +D+ + + V +G FD P + + +D + ++ +AA E IVLLK
Sbjct: 392 KVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPEAVVHNDAHKAVSMKAALESIVLLK 451
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
N+ LPL S +AV+GP+ + Y + G Y + V Y
Sbjct: 452 NENQMLPL-SKNFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYVK 510
Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
GCD + + I A E AK +D I++ G + E R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNL 570
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L G Q QL+ V K PV+LV++ I +A N + AI+ A +PGE G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
V+FG +NPGGRL +T+ V +P + P +P DS G K LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678
Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
GLSYT F Y+ L +K + Q N+ L C
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N G G +VV +Y + TY K + GF+R+ ++ G + + F +
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L + D + G ++ VG
Sbjct: 764 LGLWDKNNRFTVEPGSFSVMVG 785
>gi|448360576|ref|ZP_21549207.1| beta-glucosidase [Natrialba asiatica DSM 12278]
gi|445653189|gb|ELZ06061.1| beta-glucosidase [Natrialba asiatica DSM 12278]
Length = 777
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 211/744 (28%), Positives = 334/744 (44%), Gaps = 124/744 (16%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
E+ +L + RLG+P E L G P T+FP +I +++
Sbjct: 81 ERTNELQTYLREETRLGIPAIPH-EECLSGYMG-----------PEGTTFPQMIGMASTW 128
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
+ +L + + + + + A +G A SP ++VARD RWGR+ ET GEDP++V A
Sbjct: 129 SPALLETVTETIRDQLEA---IGTA--HALSPVLDVARDLRWGRVEETFGEDPYLVAAMA 183
Query: 196 VNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
YV GLQ D +G +S+ KH+ + G +R + +++
Sbjct: 184 CGYVDGLQGDGDG-------------ISATLKHFVGHAA-GAGGKNRSSVS--IGRRELR 227
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
ET + PFE V+ +A SVM +Y+ ++GIP +D +LL +RGEW G +V+D S++
Sbjct: 228 ETHMFPFEAAVRTANAESVMNAYHDIDGIPCASDERLLTDILRGEWSFDGTVVSDYYSVE 287
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSL 369
+ H AD +E VA ++AG+D+ DC Y + NAV+ G++ E +D++
Sbjct: 288 YLRSEHGVAADEREAGVA-AVEAGIDVELPATDC---YGDHLVNAVEAGELAEETVDEAA 343
Query: 370 KYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
+ + R G D +DE L AARE + LL+ND + LPL +
Sbjct: 344 RRVLRAKARKGLLDDPTVDADAATAPFGTDEARALTERAARESMTLLQNDGDLLPLTGEE 403
Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRY---------MSPIAGFSGYA-----NVTYKTG 475
+VAVVGP A+ ++G+YA P Y +P+ V ++ G
Sbjct: 404 TNSVAVVGPKADDAQELLGDYA-YPAHYPEEEIEFDATTPLDAVRARGEEHGFEVRHERG 462
Query: 476 CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE----------------- 518
C + AA+ AA AD T+ G +V+ DR+
Sbjct: 463 CTTTGPDTEG-FDAAANAAADADVTLAFVGARSAVDFSDSDRDRINKPSVATSGEGCDVV 521
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
DL LPG Q +L+ +V E PV +V++S G A + A++ A PGE GG
Sbjct: 522 DLGLPGVQRELVERVHETGT-PVAVVVVS--GRPHAMERIAATVPAVVQAWLPGERGGEG 578
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLT--SMPLRPVDSLGYPGRTYKFYNGPTLYP 636
IA V+FG+ NP G LP++ +P T +P+ Y + LYP
Sbjct: 579 IAAVLFGEHNPAGHLPVS---------VPRTVGQLPVHYNRKPNTATEEYVYTESDPLYP 629
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP-GVLVNDLRCDDYF 695
FG+GLSYT+F Y LS + + + P G +V
Sbjct: 630 FGHGLSYTEFAYGDLSLS----------------------TDSLSPAGTIVA-------- 659
Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
V +N G T G DV+ +Y+ A +++++GF+RV + G KR+ F +A +
Sbjct: 660 --TVTVENAGDTAGDDVLQLYASAENPDLARPVQELVGFERVSLDPGETKRVSFAVDASQ 717
Query: 756 SLNIVDYAANTLLPAGEHTIFVGN 779
L D N ++ G + +G+
Sbjct: 718 -LAYYDRDFNLVVEEGPYEFRIGH 740
>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
21150]
Length = 867
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 163/466 (34%), Positives = 236/466 (50%), Gaps = 41/466 (8%)
Query: 40 GRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWW 99
G F+ G+ ++ ++ D+S R DL+ +TL+EKV + D + RLG+ +Y WW
Sbjct: 12 GIFTLAGVGCNTEIWKDNSYSPEERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWW 71
Query: 100 SEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-- 157
+EALHGV+ G AT FP + A+F+ + + A S EARA ++
Sbjct: 72 NEALHGVARAGQ----------ATVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFK 121
Query: 158 -----GR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
GR GLT W+PNINV RDPRWGR E GEDPF+ G V+GLQ
Sbjct: 122 ERGERGRYQGLTMWTPNINVFRDPRWGRGMEAYGEDPFMNGVLGTAVVKGLQ-------- 173
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDA 270
D + + K+ +C KHYA + W +R+ F+A + +D+ ET+L F+ V +GD
Sbjct: 174 GDRSGKYDKLHACAKHYAVHSGPEW---NRHSFNAENIRPRDLHETYLPAFKKLVIDGDV 230
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV--DNHKFLADSKE 328
VMC+YNR G P C + +LL +R EW G +V+DC +I D H D+K
Sbjct: 231 RMVMCAYNRFEGEPCCGNNQLLRDILRNEWGFDGVVVSDCWAINDFFNKDAHAMYPDAK- 289
Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-- 386
A + AG DL+CG Y + AV+QG + E +D SL+ L LG D
Sbjct: 290 TASTDAVLAGTDLNCGDSYPSLV-EAVEQGLITEEQLDISLRRLLIARFELGEMDPDEEV 348
Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
++ + + S + E+A EAAR+ + LL N LPL + TVAV+GP+AN ++
Sbjct: 349 EWSKIPHSVVSSPTHSEMALEAARKSMTLLMNKNGALPLKKEGL-TVAVMGPNANDSLMQ 407
Query: 447 IGNYAGIPCRYMSPIAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
GNY G P + + G V Y+ G V + S+F
Sbjct: 408 WGNYNGTPATTTTILQGIRNALGNDDQVIYEQGTQWVDDRIFKSVF 453
Score = 125 bits (314), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 90/301 (29%), Positives = 141/301 (46%), Gaps = 58/301 (19%)
Query: 491 SEAAKTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
S AK ADA +++ +G+ +E E + DR D+ LP Q +++ + + K
Sbjct: 595 SSVAKVADADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKALHKAGK 654
Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
+++++ G I F E AIL A YPG+ GG+A+A+V+FG +NP GRLP+T+Y
Sbjct: 655 E---IILVNCSGSAIGFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNPAGRLPVTFY 711
Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
V LP RTY+++ G LYPFGYGLSYT F Y+ ++T
Sbjct: 712 KS--VDQLP-------DFQDYNMTNRTYRYFEGEPLYPFGYGLSYTTFSYDQPELSQT-- 760
Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
+++ +AS KV N G DG +VV +Y +
Sbjct: 761 ----------SISTEEEAS-------------------LKVSVANTGDYDGEEVVQLYLQ 791
Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFV 777
P + + + GFQRVF+ G ++F + L + A + P AG++ + V
Sbjct: 792 KPDDTEGPSLT-LRGFQRVFIPKGETVEVEFQLTE-EVLEWWNADAQRMTPLAGDYRLLV 849
Query: 778 G 778
G
Sbjct: 850 G 850
>gi|383117091|ref|ZP_09937838.1| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
gi|382973702|gb|EES87886.2| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
Length = 805
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 229/807 (28%), Positives = 346/807 (42%), Gaps = 145/807 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
+ + S P RV+ L+S+MTL+EKV Q+ G+ P+L E+
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIG 99
Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
+L G P T H IP G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
T FPT I +++N L +++G+ ++ EA A + + P +++ARDPRW R+
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ET GEDP++ G VRG Q E D S V + KH+A+Y W
Sbjct: 215 ETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
A + E+++EE PF V G A SVM SYN ++G P LL ++ W
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
G++V+D ++ + ++ +A + +A + + AG+D D G Y AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
IDK+++ + ++ ++G FD Q + S E+ LA E AR+ IVLLKN
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKD 440
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
LPL ++T+AV+GP+A+ M+G+Y G + I S V Y
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499
Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
GC V S A E A+ AD +++ G D S E
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADTVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E DR L L G Q +L+ +++ + K PV+LV++ G + +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G +GG A+ADV+FG +NP GRL ++ V LP+ R G R Y G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YVEEPG 668
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
YPFGYGLSYT F Y + V V +
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK-------------------------------VQVTEGSD 697
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
D + + V QN G+ DG +V +Y + T KQ+ F R+ ++AG ++ + F
Sbjct: 698 DCWVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTL 757
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
+ KSL + ++ G TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|374311316|ref|YP_005057746.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
gi|358753326|gb|AEU36716.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
Length = 773
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 226/802 (28%), Positives = 371/802 (46%), Gaps = 121/802 (15%)
Query: 30 SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG-- 87
SS +F + G + ++ S+LP R+ DL+ RMTL+EKV+QL DF G
Sbjct: 25 SSVTLFPAYSQSIASSGKTKTVLIYEQSNLPLETRLADLLGRMTLEEKVRQL-DFYSGTD 83
Query: 88 ----------VPRLGLPQYEWWSEALHGVSNVG--------------------------- 110
+P P ++AL G G
Sbjct: 84 SLLDRGSKNSLPSKQSPFSTAKADALFGSLGAGAIHDLDPTPEQYNTIQRWVIEHNRLHI 143
Query: 111 PGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
P ++ + G T FP + ++++ S+ +K G A++ EARA G+ +P
Sbjct: 144 PALFIEEGLHGFDTGTVFPAPLNLASTWDPSVAEKTGSAIAAEARAT----GVGMIL-AP 198
Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
+++ARDPRWGRI E GEDP++ G+ + YVRG Q G TD N V + KH
Sbjct: 199 VLDLARDPRWGRIEEDFGEDPYLTGQMGLAYVRGAQ---GESLNTDHN-----VVAEPKH 250
Query: 228 YAAY-DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
+AA+ + H + E+++ L+ FE ++G A + M +Y+ ++GIP
Sbjct: 251 FAAHGSPEGGTNTSPVH----IGERELRSVMLKSFEPAFRQGHAMATMAAYHEIDGIPVT 306
Query: 287 ADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
ADP LL +R EW G +++D +I+ + H+ +A S + A +K+G+D+ +
Sbjct: 307 ADPYLLKTILRQEWGFQGMVLSDLGAIRRLYQLHQ-VASSPKAASCLAIKSGVDMQFYDF 365
Query: 347 YTNFTGNA----VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
+ A V +G + + D+D++ + + LG FD +L + S ++
Sbjct: 366 DHDVFQKALIDCVHEGSLPQADVDRAASAVLRLKFTLGLFDRPYVDPTLNAKAYRSKPHL 425
Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA----GIPCRYM 458
+++ ++ARE +VLLKN+ LP S ++ +AV+GP NA VA G+Y G+ +
Sbjct: 426 DVSLQSARESLVLLKNENGLLPF-SKSIQRIAVIGP--NADVARYGDYEEEANGLHISIL 482
Query: 459 SPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
+ + +A V + +G D I AA AK+AD I+ G + E+ DR
Sbjct: 483 QGVKAEAPHAQVEFDSGKD---------IAAAVAKAKSADVVILGLGEWRGISGEAFDRT 533
Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
L LPG Q +L+ + K PV+LV+ + + I +A+ ++ AI+ A YPGE GG+A
Sbjct: 534 SLDLPGEQEKLLEAITATNK-PVVLVLENGRPLTIGWAK--AHVGAIVEAWYPGEFGGQA 590
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLP--LTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
IA+ +FG NP GRL IT+ V +P + P R DS + Y + L+P
Sbjct: 591 IAETLFGDNNPAGRLTITFPK--TVGQIPDYYNTDPSRAYDSDLTRRKVYVDNDSQPLFP 648
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FGYGLSYT F Y C +L T A+K+ D
Sbjct: 649 FGYGLSYTTFHY------------------C-DLQVTPPAAKS----------NEDVSVT 679
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
F V N G+ G +V VY + T ++ + F R+ ++ ++ + +
Sbjct: 680 FTV--TNTGTKAGDEVSQVYLREQFSSVETPVRSLKAFTRMPLQPQESRTVTLKIPRSE- 736
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L + + + G++T++VG
Sbjct: 737 LAVWNADEKWAVEGGKYTVWVG 758
>gi|423214394|ref|ZP_17200922.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692809|gb|EIY86045.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
CL03T12C04]
Length = 800
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 232/862 (26%), Positives = 371/862 (43%), Gaps = 165/862 (19%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
+ LLC +L ++ + ++ AN +S ++ F+K G++ ++
Sbjct: 1 MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57
Query: 56 DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
D S P R+ DL+S+MTL+EK Q+ +G R+ P W +E G+ N+
Sbjct: 58 DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116
Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
G G ++ IP AT FP
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
A++N+ L ++I + + EA+A LG + +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTANEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P++ G + GLQ EG + + KH+A Y + D
Sbjct: 232 PYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V ++M+ +L PF ++E A VM SYN +G P L + +R +W GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
+D ++++ + H+ + ++E+ AQ + AGL++ TNFT +A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRHAINEG 391
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
KV +D+ + + V +G FD P + + +D + ++ +AA E +VLLK
Sbjct: 392 KVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVVLLK 451
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
N LPL S K +AV+GP+A + Y + G Y + V Y
Sbjct: 452 NKNQMLPL-SKNFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510
Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
GCD + + I A E AK +D I++ G + E R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L G Q QL+ V K PV+LV++ I +A N + AI+ A +PGE G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
V+FG +NPGGRL +T+ V +P + P +P DS G K LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678
Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
GLSYT F Y+ L +K + Q N+ L C
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N G G +VV +Y + TY K + GF+R+ ++ G + + F +
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L + D + G ++ VG
Sbjct: 764 LGLWDKNNQFTVEPGSFSVMVG 785
>gi|375359159|ref|YP_005111931.1| putative exported hydrolase [Bacteroides fragilis 638R]
gi|423283738|ref|ZP_17262622.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
615]
gi|301163840|emb|CBW23395.1| putative exported hydrolase [Bacteroides fragilis 638R]
gi|404580776|gb|EKA85484.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
615]
Length = 859
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 214/769 (27%), Positives = 337/769 (43%), Gaps = 146/769 (18%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
++F + ++SLP +RV+DL+SRMTL+EK+ Q+
Sbjct: 22 TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81
Query: 84 -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
F G+ RLG+P + +E+LHG V G
Sbjct: 82 GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
+T FP I ++FN L ++ A++ E A G+T +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKELTAQ------GITQSLTPVIDVCRDLRWGRV 183
Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
E GEDP++V R V+ VRG D + VS KH+ A+ G++
Sbjct: 184 EECFGEDPYLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228
Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
++++ +L+ FE VKE +VM SYN N P+ + L+ + +R W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286
Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
D GY+ +D +I ++ HK +S E A+ Q L AGLD + V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
ID+++ + T +G F+ + + + ++ LA + A E IVLL+N+
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
N LPL K+K++AV+GP NA G+Y G+ + + G + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461
Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
GC D+ + A + AK +D I++ G + A E D DL L
Sbjct: 462 AKGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520
Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
G Q L+ + K PVI+V++S G A + NI I+ YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADML 577
Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
GK NP G+L ++ Y LP R S PG+ Y F + L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
+GLSYT F+Y LS T + + D C+D E
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
+ +N G DG +V VY + ++++ GF++V ++ G K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715
>gi|402307522|ref|ZP_10826545.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
gi|400378572|gb|EJP31427.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
sp. MSX73]
Length = 858
Score = 253 bits (645), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 162/463 (34%), Positives = 247/463 (53%), Gaps = 43/463 (9%)
Query: 45 LGLQMSS----FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWS 100
LGL +S+ +C+ L R +DL+SR+TL+EK + + D + +PRLG+ ++ WWS
Sbjct: 11 LGLSLSATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWS 70
Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR- 159
EALHG +N+G G T FP + ASFN+ L +++ A S E RA YN
Sbjct: 71 EALHGAANMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRML 120
Query: 160 --------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
L+ W+PN+N+ RDPRWGR ET GEDP++ VRGLQ E
Sbjct: 121 NGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPE----- 175
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD-ARVTEQDMEETFLRPFEMCVKEGDA 270
++ K+ +C KHYA + + R+ + A V+ +D+ ET+L F+ V E
Sbjct: 176 ---TAKYRKLWACAKHYAVHSGPEYT---RHTANVADVSPRDLWETYLPAFKTLVTEAKV 229
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
VMC+Y R++ P C++ +LL Q +R EW + +V+DC ++ + NHK +D+ A
Sbjct: 230 REVMCAYQRLDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVH-A 288
Query: 331 VAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
A+ AG D++CG Y T AV++G + E ++DK + L LG D P+ V
Sbjct: 289 AAKAAVAGTDVECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLV 347
Query: 390 SLGK---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
K + S + +LA + AR+ +VLL+N LPL + + +AV+GP+A+ M
Sbjct: 348 EWSKIPASVMDSKAHRQLALDMARQSLVLLQNKGGVLPLKAGG-EPIAVIGPNADDGPMM 406
Query: 447 IGNYAGIPCRYMSPIAGFSG-YANVTYKTGCDDVACKSNNSIF 488
GNY G P R ++ + G + VTY GCD K+ NS+
Sbjct: 407 WGNYNGTPNRTVTILDGIKARHKRVTYLKGCDLTDTKTVNSLL 449
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 129/291 (44%), Gaps = 63/291 (21%)
Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
+ + G+ ++E E + DR ++ LP Q + + E K +V ++ G
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK---TVVFVNCSG 659
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
IA AIL A Y G+EGG A++DV+FG NP G+LP+T+Y LP
Sbjct: 660 SAIALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYK--RTDQLP--- 714
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
+ GRTY++++ P L+ FGYGLSYT F++
Sbjct: 715 ----DYEDYSMRGRTYRYFSDP-LFAFGYGLSYTTFRFG--------------------- 748
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
++A+ + + V N G+ G +VV VY + A+ +K
Sbjct: 749 RAHAEAA--------------EGGYRLSVPLTNTGTRPGEEVVQVYIRRVADTNGP-LKS 793
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL--LPAGEHTIFVGN 779
+ F+RV ++AG + ++ + KS D + NT+ LP G++ + GN
Sbjct: 794 LRAFRRVALKAGESTTVEIPLSR-KSFECFDESTNTMRTLP-GDYELMYGN 842
>gi|423293673|ref|ZP_17271800.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
CL03T12C18]
gi|392677631|gb|EIY71047.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
CL03T12C18]
Length = 800
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 237/862 (27%), Positives = 372/862 (43%), Gaps = 165/862 (19%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
+ LLC +L ++ + ++ AN +S ++ F+K G++ ++
Sbjct: 1 MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57
Query: 56 DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
D S P R+ DL+S+MTL+EK Q+ +G R+ P W +E G+ N+
Sbjct: 58 DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTTGWSTEIWKDGIGNIDE 116
Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
G G ++ IP AT FP
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
A++N+ L +I + + EA+A LG + +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIGEIAKVTADEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P++VG + GLQ+ EG + + KH+A Y + D
Sbjct: 232 PYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V ++M+ +L PF ++E A VM SYN +G P L + +R +W GYIV
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIV 337
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
+D ++++ + H+ + ++E+ AQ + AGL++ TNFT A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAINEG 391
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
KV +D+ + + V +G FD P + + +D + ++ +AA E IVLLK
Sbjct: 392 KVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIVLLK 451
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
N+ LPL S +AV+GP+ + Y + G Y + V Y
Sbjct: 452 NENQMLPL-SKNFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510
Query: 475 GCDDV-----ACKSNN---------SIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
GCD + + NN I A E AK +D I++ G + E R +L
Sbjct: 511 GCDIIDKYFPESELNNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L G Q QL+ V K PVILV++ I +A N I AI+ A +PGE G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMGDAIA 627
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
V+FG +NPGGRL +T+ V +P + P +P DS G K LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678
Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
GLSYT F Y+ L +K + Q N+ L C
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N G G +VV +Y + TY K + GF+R+ ++ G + + F +
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L + D + G ++ VG
Sbjct: 764 LGLWDKNNRFTVEPGSFSVMVG 785
>gi|53712134|ref|YP_098126.1| beta-glucosidase [Bacteroides fragilis YCH46]
gi|52214999|dbj|BAD47592.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
Length = 812
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 238/853 (27%), Positives = 359/853 (42%), Gaps = 149/853 (17%)
Query: 8 LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
L+CF + +F A + G F L + + S P RV+
Sbjct: 5 LICFLMLSVFFIFPVRAKNTFGKKKDKVTRL--HFYDLNKNGRMDTYENPSAPVEYRVEH 62
Query: 68 LVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT-- 113
L+S+MTL+EKV Q+ G+ P+L E+ +L G P T
Sbjct: 63 LLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIGSLWGFMRADPWTQR 122
Query: 114 ------------------------HFDDVIP--------------GATSFPTVILTTASF 135
H IP G T FPT I +++
Sbjct: 123 TLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIGTTVFPTSIGQASTW 182
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
N L +++G+ ++ EA A + + P +++ARDPRW R+ ET GEDP++ G
Sbjct: 183 NPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMG 237
Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEE 255
VRG Q E D S V + KH+A+Y W A + E+++EE
Sbjct: 238 TALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGHNGGTAHIGERELEE 286
Query: 256 TFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQV 315
PF V G A SVM SYN ++G P LL ++ W G++V+D ++
Sbjct: 287 AIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGG 345
Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
+ ++ +A + +A + + AG+D D G Y AV++G V IDK+++ + +
Sbjct: 346 LREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILS 403
Query: 375 VLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
+ ++G FD Q + S E+ LA E AR+ IVLLKN LPL ++T+A
Sbjct: 404 LKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKK-DIRTLA 462
Query: 435 VVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIF 488
V+GP+A+ M+G+Y G + I S V Y GC V S
Sbjct: 463 VIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGC-AVRDSSRTGFK 521
Query: 489 AASEAAKTADATIILAG----LDLSVE-------------------AESLDREDLWLPGY 525
A E A+ ADA +++ G D S E E DR L L G
Sbjct: 522 DAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGR 581
Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
Q +L+ +++ + K PV+L+ ++ A E +AI+ A YPG +GG A+ADV+FG
Sbjct: 582 QLELLEEISRLGK-PVVLIKGRPLLMEGAIQEA----EAIVDAWYPGMQGGNAVADVLFG 636
Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
+NP GRL ++ V LP+ R G R Y G YPFGYGLSYT
Sbjct: 637 DYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YVEEPGTPRYPFGYGLSYTT 689
Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
F Y + V V + D + + V QN G
Sbjct: 690 FSYTDMK-------------------------------VQVTEGSDDCWVDVTVTIQNQG 718
Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
+ DG +V +Y + T KQ+ F R+ ++AG ++ + F + KSL +
Sbjct: 719 TADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGE 777
Query: 766 TLLPAGEHTIFVG 778
++ G TI VG
Sbjct: 778 WVVEPGRFTIMVG 790
>gi|293373755|ref|ZP_06620101.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
gi|292631245|gb|EFF49877.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus SD CMC 3f]
Length = 800
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 232/862 (26%), Positives = 371/862 (43%), Gaps = 165/862 (19%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
+ LLC +L ++ + ++ AN +S ++ F+K G++ ++
Sbjct: 1 MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57
Query: 56 DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
D S P R+ DL+S+MTL+EK Q+ +G R+ P W +E G+ N+
Sbjct: 58 DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116
Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
G G ++ IP AT FP
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
A++N+ L ++I + + EA+A LG + +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTADEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P++VG + GLQ+ EG + + KH+A Y + D
Sbjct: 232 PYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V ++M+ +L PF ++E A VM SYN +G P L + +R +W GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
+D ++++ + H+ + ++E+ AQ + AGL++ TNFT A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAINEG 391
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
KV +D+ + + V +G FD P + + +D + ++ +AA E IVLLK
Sbjct: 392 KVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIVLLK 451
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
N+ LPL S +AV+GP+ + Y + G Y + V Y
Sbjct: 452 NENQMLPL-SKNFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYVK 510
Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
GCD + + I A E AK +D I++ G + E R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNL 570
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L G Q QL+ V K PV+LV++ I +A N + AI+ A +PGE G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
V+FG +NPGGRL +T+ V +P + P +P DS G K LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678
Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
GLSYT F Y+ L +K + Q N+ L C
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N G G +VV +Y + TY K + GF+R+ ++ G + + F +
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L + D + G ++ VG
Sbjct: 764 LGLWDKNNRFTVEPGSFSVMVG 785
>gi|372223664|ref|ZP_09502085.1| glycoside hydrolase family 3 protein [Mesoflavibacter
zeaxanthinifaciens S86]
Length = 768
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 225/812 (27%), Positives = 364/812 (44%), Gaps = 136/812 (16%)
Query: 41 RFSKLGLQMSSFLFCDS--------SLPYSIRVKDLVSRMTLDEKVQQL-----GDFAHG 87
+F LGL + L C+ LPY V +++ MTL+EK+ QL GD G
Sbjct: 4 KFIALGLLVLITLSCNEQKPAQPSQELPYQKEVDSILALMTLEEKIGQLNLPSSGDITTG 63
Query: 88 VPRLGLPQYEWWSEALHGVSNVGPGTHFD--------------------DVIPG-ATSFP 126
+ + + + G+ N+ DVI G ++FP
Sbjct: 64 QAKSSDIASKIAAGKVGGLFNIKTAAKIKEVQRIAVEESRLKIPLLFGMDVIHGYQSTFP 123
Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPG 185
+ AS++ L ++ + + EA A G+ + +SP ++++RDPRWGRI+E G
Sbjct: 124 IPLGLAASWDMDLIQQTARVAAQEASA------DGINWTFSPMVDISRDPRWGRISEGSG 177
Query: 186 EDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
EDPF+ G+ A VRG Q + N T L +C KH+A Y G D D
Sbjct: 178 EDPFLGGKIAAAMVRGYQGDDLSANNTLL--------ACVKHFALYGASE-AGRDYNTVD 228
Query: 246 ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY 305
++ M +L P++ + G +SVM S+N V+GIP+ A+ LL +R +W +G+
Sbjct: 229 --MSRVRMYNDYLPPYKAAIDAG-VASVMASFNEVDGIPATANKWLLTDVLREQWGFNGF 285
Query: 306 IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETD 364
+V+D I MV + + + A+ L AGLD+D G+ + ++++G V ET
Sbjct: 286 VVSDYTGINEMVAHG---IGNLQQVSARALNAGLDMDMVGEGFLTTLKKSLEEGLVSETT 342
Query: 365 IDKSLKYLYTVLMRLGFFDGSPQY--VSLGKQDICSDENIELAAEAAREGIVLLKNDQNT 422
ID ++K + T +LG FD +Y + K ++ + EN + A + + E +VLLKN +
Sbjct: 343 IDTAVKRILTAKYQLGLFDDPYKYCDTTRTKNEVFTKENRDFARKVSAESMVLLKN-EGL 401
Query: 423 LPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYMSPIAGFSGYA----NVTYKTGC 476
LPL K ++A++GP AN M G + A + +S + G A + Y G
Sbjct: 402 LPLK--KSGSIALIGPLANTPHNMAGTWSVATQQEKSISVLEGLKEVAGEAVTINYAKGS 459
Query: 477 D---DVACKSNNSIFA----------------ASEAAKTADATIILAGLDLSVEAESLDR 517
+ D A + ++F A AK +D + G ES
Sbjct: 460 NVAYDEAYEKRITMFGKEITRDGRTDAQLLAEALAVAKKSDVVVAAIGETAERSGESSSI 519
Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
+L +P Q L++ + K PV++V+ + G +A + AI+ A +PG E G
Sbjct: 520 TNLQIPKAQQDLLDALLATGK-PVVVVLFT--GRPLAITKIQEEAPAIINAWFPGSEAGL 576
Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSL--GYPGRTYKFY 629
AIADV+FG NP G+L T+ V +PL T PL P + G+ T +
Sbjct: 577 AIADVLFGAVNPSGKLTATFPRN--VGQVPLFYAHKNTGRPLDPAKTADCGFQKFTSNYL 634
Query: 630 ---NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
N P LYPFG+GLSYT F Y+ ++ K
Sbjct: 635 DVCNTP-LYPFGFGLSYTTFSYSDITLDKA------------------------------ 663
Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
+L +D V +N G+ DG +VV +Y + ++++ GF+++F++ +
Sbjct: 664 -ELGPNDSITVSVKVKNTGNFDGKEVVQLYVRDVVRSTTPPVRELKGFKKIFLKKDEEQI 722
Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
++F + L D N + GE +FVG
Sbjct: 723 VQFKLQ-TEDLKFYDTDLNFIAEPGEFQVFVG 753
>gi|86141717|ref|ZP_01060241.1| beta-glucosidase [Leeuwenhoekiella blandensis MED217]
gi|85831280|gb|EAQ49736.1| beta-glucosidase [Leeuwenhoekiella blandensis MED217]
Length = 758
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 213/740 (28%), Positives = 345/740 (46%), Gaps = 117/740 (15%)
Query: 76 EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
+K++Q + A RLG+P S+ +HG T+FP + ++S+
Sbjct: 85 DKIRQAQEIAVKNTRLGIPLL-IGSDIIHGYK---------------TTFPIPLGLSSSW 128
Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
+ L +K Q + EA A G+ + +SP +++ARDPRWGRI+E GEDP++
Sbjct: 129 DMELIEKTAQIAAKEATA------DGINWNFSPMVDIARDPRWGRISEGAGEDPYLGSAI 182
Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
A V G Q E+ T+ N+ + S KH+A Y R + ++ M
Sbjct: 183 AKAMVTGYQ----QEDLTEENT----MISTVKHFALYGAAEG---GRDYNTTDMSRVKMF 231
Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
+L P++ + G A SVM S+N V+G+P+ + LL +R +W G++V+D S+
Sbjct: 232 NEYLPPYKAAIDAG-AESVMSSFNDVDGVPASGNKWLLTHLLREQWGFEGFVVSDYTSVN 290
Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
M+ + L D + A ++ AGLD+D G+ + +V +GKV E I + + +
Sbjct: 291 EMIAHG--LGDLQA-VSALSINAGLDMDMVGEGFLTTLKKSVDEGKVSEATITNACRRIL 347
Query: 374 TVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
+LG FD +Y + +DI + N E+A +AAR+ VLLKN+ TLPL+ K
Sbjct: 348 EAKYKLGLFDDPYKYSDSKRPERDILTAANKEIARDAARKSFVLLKNENKTLPLD--KTA 405
Query: 432 TVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSGYANV------TYKTGC---DDVA 480
+A++GP AN M+G +A G P + +PI F G NV +Y G +D A
Sbjct: 406 KIALIGPLANNKNNMLGTWAPTGDP-QLSTPI--FEGLKNVAPNAEISYTKGANISNDTA 462
Query: 481 CKSNNSIFA----------------ASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
++F A + A+TAD + + G + ES R D+ +P
Sbjct: 463 YAKKINVFGPRIEISEATPETLLEEALQNAETADVVVAVVGEATEMSGESSSRTDITIPE 522
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q LI ++ ++ K PV+LV+MS +DI E +IL +PG + G A+ADV+F
Sbjct: 523 SQKTLIQELVKIGK-PVVLVLMSGRPLDI--TEELALPVSILQVWHPGIQAGNAVADVLF 579
Query: 585 GKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
G +NP G+L +W V +P+ T P + L + + N P L FG
Sbjct: 580 GDYNPSGKLTASWPQN--VGQIPVYHSMKTTGRPAPSAEFLKFKSQYLDTPNAPAL-AFG 636
Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
YGLSYT F+Y+ L + +++ D +
Sbjct: 637 YGLSYTTFEYSNLKLS------------SKSIGQNEDVT-------------------VM 665
Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
VD N G+ DG++VV +Y ++ + GFQ+V ++ G K ++ A + L
Sbjct: 666 VDVTNTGAYDGTEVVQLYIHDVVRSITPPMRTLKGFQKVSLKQGETKTVELTLKA-EDLK 724
Query: 759 IVDYAANTLLPAGEHTIFVG 778
+ + + GE +FVG
Sbjct: 725 FYNGSLEFISEPGEFEVFVG 744
>gi|160884749|ref|ZP_02065752.1| hypothetical protein BACOVA_02738 [Bacteroides ovatus ATCC 8483]
gi|156109784|gb|EDO11529.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
ovatus ATCC 8483]
Length = 800
Score = 253 bits (645), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 235/862 (27%), Positives = 370/862 (42%), Gaps = 165/862 (19%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
+ LLC +L ++ + ++ AN +S ++ F+K G++ ++
Sbjct: 1 MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57
Query: 56 DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
D S P R+ DL+S+MTL+EK Q+ +G R+ P W +E G+ N+
Sbjct: 58 DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTTGWSTEIWKDGIGNIDE 116
Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
G G ++ IP AT FP
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
A++N+ L +I + + EA+A LG + +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIGEIAKVTADEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P++VG + GLQ+ EG + + KH+A Y + D
Sbjct: 232 PYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V ++M+ +L PF ++E A VM SYN +G P L + +R +W GYIV
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIV 337
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
+D ++++ + H+ + ++E+ AQ + AGL++ TNFT A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAINEG 391
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
KV +D+ + + V +G FD P + + +D + ++ +AA E IVLLK
Sbjct: 392 KVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIVLLK 451
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
N+ LPL S +AV+GP+ + Y + G Y + V Y
Sbjct: 452 NENQMLPL-SKNFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510
Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
GCD + + I A E AK +D I++ G + E R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L G Q QL+ V K PVILV++ I +A N I AI+ A +PGE G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMGDAIA 627
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
V+FG +NPGGRL +T+ V +P + P +P DS G K LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678
Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
GLSYT F Y+ L +K + Q N+ L C
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N G G +VV +Y + TY K + GF+R+ ++ G + + F +
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L + D + G ++ VG
Sbjct: 764 LGLWDKNNQFTVEPGSFSVMVG 785
>gi|336412865|ref|ZP_08593218.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
3_8_47FAA]
gi|335942911|gb|EGN04753.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
3_8_47FAA]
Length = 800
Score = 252 bits (644), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 232/862 (26%), Positives = 372/862 (43%), Gaps = 165/862 (19%)
Query: 5 VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
+ LLC +L ++ + ++ AN +S ++ F+K G++ ++
Sbjct: 1 MKKLLCLALLVSAGSIYSESISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57
Query: 56 DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
D S P R+ DL+S+MTL+EK Q+ +G R+ P W +E G+ N+
Sbjct: 58 DLSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116
Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
G G ++ IP AT FP
Sbjct: 117 QANGLGKFGSEISYSYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
A++N+ L ++I + + EA+A LG + +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTANEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P++VG + GLQ+ EG + + KH+A Y + D
Sbjct: 232 PYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V ++M+ +L PF ++E A VM SYN +G P L + +R +W GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
+D ++++ + H+ + ++E+ AQ + AGL++ TNFT A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAIDEG 391
Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
KV +++ + + V +G FD P + + +D + ++ +AA E IVLLK
Sbjct: 392 KVSLHTLNQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIVLLK 451
Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
N+ LPL S K +AV+GP+ + Y + G Y + V Y
Sbjct: 452 NENQMLPL-SKNFKKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510
Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
GCD + + I A E AK +D I++ G + E R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570
Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
L G Q QL+ V K PV+LV++ I +A N + AI+ A +PGE G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627
Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
V+FG +NPGGRL +T+ V +P + P +P DS G K LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPDSDSKG------KVRVDGVLYPFGY 678
Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
GLSYT F Y+ L +K + Q N+ L C
Sbjct: 679 GLSYTIFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
+N G G +VV +Y + TY K + GF+R+ ++ G + + F +
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QD 763
Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
L + D + G ++ VG
Sbjct: 764 LGLWDKNNQFTVEPGSFSVMVG 785
>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
CL02T12C05]
Length = 864
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 158/453 (34%), Positives = 238/453 (52%), Gaps = 40/453 (8%)
Query: 50 SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
S + D +L R DL+ R+T++EKV + + + G+ RLG+ YEWW+EALHGV+
Sbjct: 26 SQLPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVARA 85
Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA----MYNLGR----AG 161
G AT FP I ASF+++L ++ A+S EARA LG+ G
Sbjct: 86 GL----------ATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQYKRYQG 135
Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
LT W+PNIN+ RDPRWGR ET GEDP++ R V V+GLQ + ++R K+
Sbjct: 136 LTMWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPD--------SARYNKL 187
Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
+C KH+A + W +R+ F+A + +D+ ET+L F+ V+E D VMC+YNR
Sbjct: 188 HACAKHFAVHSGPEW---NRHSFNAENIIPRDLWETYLPAFKTLVQEADVKEVMCAYNRF 244
Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM--VDNHKFLADSKEDAVAQTLKAG 338
G P C +LL Q +R EW G +V+DC +I H D+ A A+ + G
Sbjct: 245 EGDPCCGSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKKHNTHPDAAH-ASAEAVLNG 303
Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICS 398
DL+CG Y T A++ G + E I+ S+K L LG + + +L + S
Sbjct: 304 TDLECGSNYRKLT-EAIKAGIISEKQINVSVKRLLKARFELGEMENIHPW-TLPYSIVDS 361
Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
++ LA + A E + LL+N LPL+ K +A++GP+AN +V GNY G P
Sbjct: 362 PKHRCLALKMAHETMTLLQNKGKVLPLD--KQARIAIIGPNANDSVMQWGNYNGTPSHTS 419
Query: 459 SPIAGFSG---YANVTYKTGCDDVACKSNNSIF 488
+ ++ F +++ Y+ C + NS+F
Sbjct: 420 TLLSAFRKRLPISHLIYEPVCGLTDSITYNSLF 452
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 134/299 (44%), Gaps = 56/299 (18%)
Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
E K D I G+ S+E E + DR D+ P Q +++ + E K V
Sbjct: 596 EKLKDIDIIIFAGGISPSLEGEEMNVSATGFKGGDRTDIEFPAVQRKVLAALKEAGK-KV 654
Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
ILV S G +A + AIL A YPGEEGG AI +V+FG +NP GRLPIT+Y
Sbjct: 655 ILVNFS--GSAMALTPETKSCDAILQAWYPGEEGGMAIVNVLFGDYNPAGRLPITFYKS- 711
Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
+ LP ++ GRTY++ L+PFGYGLSYT F + ++++
Sbjct: 712 -IDQLP-------DFENYSMKGRTYRYMQEEPLFPFGYGLSYTTFAFG--------KIHI 755
Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
NK N L + + +N+G DG +VV +Y + A
Sbjct: 756 NK-----------------------NSLSAGEKVTLHIPIKNIGDRDGVEVVQIYIQRQA 792
Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVGN 779
+ +K + F+RV + G+ + +K + D NT+ P GE+ I GN
Sbjct: 793 DKEGP-VKTLRAFKRVEIPKGKTQEVKIELPYV-AFEWFDPTTNTMRPIQGEYNILYGN 849
>gi|153807033|ref|ZP_01959701.1| hypothetical protein BACCAC_01310 [Bacteroides caccae ATCC 43185]
gi|423219984|ref|ZP_17206480.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
CL03T12C61]
gi|149130153|gb|EDM21363.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
caccae ATCC 43185]
gi|392624247|gb|EIY18340.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
CL03T12C61]
Length = 786
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 234/858 (27%), Positives = 375/858 (43%), Gaps = 167/858 (19%)
Query: 1 MAKVVSSL-LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSL 59
M K+V L LC S+ +F+ GS+ ++ + F+K G++ ++ D +
Sbjct: 1 MKKLVCGLTLCLSVGN---IFA-------GSTKDIYKKNWIDFNKNGVKD---VYEDPAA 47
Query: 60 PYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WSEAL-------HG 105
P RV DL+S+MTL+EK Q+ +G R+ P EW W + + +G
Sbjct: 48 PIEARVADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAEWSKEIWKDGIGNIDEQANG 106
Query: 106 VSNVG-----------------------------PGTHFDDVIPG-----ATSFPTVILT 131
+ G P ++ I G AT FP
Sbjct: 107 LGKFGSELSYPYANSVKNRHEIQRWFVEQTRLGIPVDFTNEGIRGLCHNRATMFPAQCGQ 166
Query: 132 TASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVV 191
A++N+ L ++I + + EA+A LG + ++P +++A+DPRWGR+ E+ GEDP++
Sbjct: 167 GATWNKKLIREIAKVTADEAKA---LGYTNI--YAPILDIAQDPRWGRVVESYGEDPYLA 221
Query: 192 GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQ 251
G + GLQ EG +++ KH+A Y + D V +
Sbjct: 222 GELGKQMILGLQ-AEG-------------LAATPKHFAVYSIPVGGRDGGTRTDPHVAPR 267
Query: 252 DMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCD 311
+M+ +L PF ++E A VM SYN +G P L + +R +W GY+V+D +
Sbjct: 268 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 327
Query: 312 SIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQGKVKE 362
+++ + H+ + ++E+ AQ + AGL++ TNFT A+ +GK+
Sbjct: 328 AVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAISEGKISL 381
Query: 363 TDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
+D+ + + V LG FD P + + + + E++ +AA E IVLLKN+
Sbjct: 382 HTLDQRVGEILRVKFMLGLFDNPYPGDDRHPETVVHNAAHQEVSMKAALESIVLLKNENQ 441
Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGCDD 478
LPL S + +AV+GP+A + Y + G Y A V+Y GC+
Sbjct: 442 MLPL-SKSLNKIAVIGPNAEEVKELTCRYGPAHAPIKTVYQGIKEYLPNAEVSYAKGCNI 500
Query: 479 V--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
+ + I A E AK +D I++ G + E R L L G
Sbjct: 501 IDKYFPESELYNVPLDTQEQAMINEAVELAKVSDIAILVLGGNEKTVREEFSRTSLDLCG 560
Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
Q QL+ V K PV+LV++ I +A N + AI+ A +PGE G AIA V+F
Sbjct: 561 RQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIVHAWFPGEFMGNAIAKVLF 617
Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
G +NPGGRL +T+ V +P + P +P DS G + LYPFGYGLSY
Sbjct: 618 GDYNPGGRLAVTFPKS--VGQVPF-AFPFKPGSDSKG------RVRVDGVLYPFGYGLSY 668
Query: 644 TQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
T F+Y+ L +K + Q N+ L C
Sbjct: 669 TTFEYSALKISKPVIGPQENMT--------------------------LSC--------I 694
Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
+N G G +VV +Y + TY K + GF+R+ ++ G + I F + L +
Sbjct: 695 VKNTGKRAGDEVVQLYIRDDFSSVTTYDKMLRGFERIHLQPGEEQTISFTLTP-QDLGLW 753
Query: 761 DYAANTLLPAGEHTIFVG 778
D + G +I +G
Sbjct: 754 DKNNQFTVEPGSFSIMIG 771
>gi|329963878|ref|ZP_08301220.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
gi|328527131|gb|EGF54137.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
Length = 766
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 200/703 (28%), Positives = 335/703 (47%), Gaps = 109/703 (15%)
Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
+A+HG +N PG T +PT I SF+ + +I + + E RAM
Sbjct: 132 DAIHGNANA----------PGNTVYPTNINLACSFDTLMAYRIARETAKEMRAMN----- 176
Query: 161 GLTYWS--PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
+W+ PN+ VARD RWGR+ ET GEDP++V R G+Q V+G++ + D
Sbjct: 177 --MHWTFNPNVEVARDARWGRVGETFGEDPYLVTRM------GVQSVKGYQGSLDSKE-- 226
Query: 219 LKVSSCCKHY--AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
V +C KH+ + ++ G A ++E+ + E F PFE VK G A S+M +
Sbjct: 227 -DVLACIKHFVGGSEPINGTNGSP-----ADLSERTLREVFFPPFEAGVKAG-AMSLMTA 279
Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
+N +NG+P ++ L+ +RGEW+ G++V+D I+ D H A++ ++A Q++
Sbjct: 280 HNELNGVPCHSNEWLMADVLRGEWNFPGFVVSDWMDIEHTHDLHA-TAENLKEAFYQSIM 338
Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD 395
+G+D+ G ++ V++G++ E+ ID+S++ + + RLG F+ V +
Sbjct: 339 SGMDMHMHGIHWNEMVVELVKEGRIPESRIDESVRRILDIKFRLGLFEQPYADVEETMKI 398
Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
E+ A EAAR GIVLLKN + LPL+ +K K + V G +A+ ++G+++ P
Sbjct: 399 RLCGEHRATALEAARNGIVLLKN-EGVLPLDPSKYKKIMVTGINADDQ-NILGDWSA-PE 455
Query: 456 RYMSPIAGFSGYANVTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDL- 508
+ + G + T D D + A+ AK AD I++AG +
Sbjct: 456 KEENVTTILEGLRMIAPDTQFDFVDQGWDPRNMDPKKVDEAAAHAKNADLNIVVAGEYMM 515
Query: 509 ------SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
+ E DR DL L G Q +LI +VA K P +LV+++ + + +A N +
Sbjct: 516 RFRWNDRTDGEDTDRSDLDLVGLQEELIEKVAASGK-PTVLVLVNGRPLSVRWAAEN--L 572
Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
AI+ A PG +GG+A+A++++GK NP +L IT +P + L+ + Y
Sbjct: 573 PAIVEAWAPGMQGGQAVAEILYGKVNPSAKLAIT---------IPHSVGQLQMI----YN 619
Query: 623 GRTYKFYN-------GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
+ ++++ LYPFGYGLSYT +KY L+ +
Sbjct: 620 HKPSQYFHPYVAGKPSTPLYPFGYGLSYTTYKYEDLNLDR-------------------- 659
Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
++ D V N GS DG ++V +Y + +K++ F
Sbjct: 660 -----------KEIEKDGSVGVSVKVTNTGSRDGVEIVQLYIRDKFSCVTRPVKELKDFA 708
Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
RV ++AG ++ + F K L D ++ GE + VG
Sbjct: 709 RVPLKAGESRVVNFKITPDK-LAFYDIKMKKVVEPGEFIVMVG 750
>gi|288925400|ref|ZP_06419334.1| beta-glucosidase [Prevotella buccae D17]
gi|288337871|gb|EFC76223.1| beta-glucosidase [Prevotella buccae D17]
Length = 858
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 162/463 (34%), Positives = 247/463 (53%), Gaps = 43/463 (9%)
Query: 45 LGLQMSS----FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWS 100
LGL +S+ +C+ L R +DL+SR+TL+EK + + D + +PRLG+ ++ WWS
Sbjct: 11 LGLSLSATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWS 70
Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR- 159
EALHG +N+G G T FP + ASFN+ L +++ A S E RA YN
Sbjct: 71 EALHGAANMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRML 120
Query: 160 --------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
L+ W+PN+N+ RDPRWGR ET GEDP++ VRGLQ E
Sbjct: 121 NGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPE----- 175
Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD-ARVTEQDMEETFLRPFEMCVKEGDA 270
++ K+ +C KHYA + + R+ + A V+ +D+ ET+L F+ V E
Sbjct: 176 ---TAKYRKLWACAKHYAVHSGPEYT---RHTANVADVSPRDLWETYLPAFKTLVTEAKV 229
Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
VMC+Y R++ P C++ +LL Q +R EW + +V+DC ++ + NHK +D+ A
Sbjct: 230 REVMCAYQRLDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVH-A 288
Query: 331 VAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
A+ AG D++CG Y T AV++G + E ++DK + L LG D P+ V
Sbjct: 289 AAKAAVAGTDVECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLV 347
Query: 390 SLGK---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
K + S + +LA + AR+ +VLL+N LPL + + +AV+GP+A+ M
Sbjct: 348 EWSKIPASVMDSKAHRQLALDMARQSLVLLQNKGGVLPLKAGG-EPIAVIGPNADDGPMM 406
Query: 447 IGNYAGIPCRYMSPIAGFS-GYANVTYKTGCDDVACKSNNSIF 488
GNY G P R ++ + G + VTY GCD K+ NS+
Sbjct: 407 WGNYNGTPNRTVTILNGIKVRHKRVTYLKGCDLTDTKTVNSLL 449
Score = 96.7 bits (239), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 127/291 (43%), Gaps = 63/291 (21%)
Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
+ + G+ ++E E + DR ++ LP Q + + E K +V ++ G
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK---TVVFVNCSG 659
Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
IA AIL A Y G+EGG A++DV+FG NP G+LP+T+Y LP
Sbjct: 660 SAIALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYK--RTDQLP--- 714
Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
+ GRTY++++ P L+ FGYGLSYT F++
Sbjct: 715 ----DYEDYSMRGRTYRYFSDP-LFAFGYGLSYTTFRF---------------------- 747
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
+ R + + V N G+ G +VV VY + A+ +K
Sbjct: 748 ------GRARAEA-------AEGGYRLSVPLTNTGTRPGEEVVQVYIRRVADTNGP-LKS 793
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL--LPAGEHTIFVGN 779
+ F+RV ++AG + ++ + KS D + NT+ LP G++ + GN
Sbjct: 794 LRAFRRVALKAGESTTVEIPLSR-KSFECFDESTNTMRTLP-GDYELMYGN 842
>gi|224536364|ref|ZP_03676903.1| hypothetical protein BACCELL_01238, partial [Bacteroides
cellulosilyticus DSM 14838]
gi|224522024|gb|EEF91129.1| hypothetical protein BACCELL_01238 [Bacteroides cellulosilyticus
DSM 14838]
Length = 808
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 155/407 (38%), Positives = 233/407 (57%), Gaps = 41/407 (10%)
Query: 73 TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
T++EK+ L + G+ RL +P+Y +EALHGV V PG T FP I
Sbjct: 1 TVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPGRF--------TVFPQAIGLA 50
Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRAG----------LTYWSPNINVARDPRWGRITE 182
A++N L ++ +S EARA +N G LT+WSP +N+ARDPRWGR E
Sbjct: 51 ATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMARDPRWGRTPE 110
Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
T GEDP++ G +V+GLQ G ++ R LK+ S KH+AA + ++ +R+
Sbjct: 111 TYGEDPYLSGIMGTAFVKGLQ---GDDD------RYLKIVSTPKHFAANNEEH----NRF 157
Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
+ +++E+ + E +L FE CVK+G ++S+M +YN +N +P + LL + +R +W
Sbjct: 158 VCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLRKDWGF 217
Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVK 361
GY+V+DC ++V+ HK++ +KE A A ++KAGLDL+CG Y +A +Q V
Sbjct: 218 KGYVVSDCGGPSLLVNAHKYVK-TKEAAAALSIKAGLDLECGDDVYDQPLLSAYRQYMVT 276
Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKND 419
+ DID + + M LG FD Q Y + I S E+ E+A AARE IVLLKN
Sbjct: 277 DADIDSAAYRVLRARMELGLFDSGEQNPYTKISPAVIGSAEHQEVALNAARECIVLLKNQ 336
Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
+ LPLN+ KVK++AVVG NA + G+Y+G+P ++PI+ G
Sbjct: 337 KKMLPLNARKVKSIAVVG--INAGSSEFGDYSGLPV--IAPISVLQG 379
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 100/293 (34%), Positives = 147/293 (50%), Gaps = 54/293 (18%)
Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
A +A + + + + G++ S+E E DR D+ LP Q + + ++ +V P I+V++ AG
Sbjct: 549 AGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKV--NPNIVVVLVAG 606
Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
+A + +I AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y L
Sbjct: 607 S-SLAINWMDEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 658
Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
+P P D GRTYK++ G LYPFGYGLSYT FKY+ +QV
Sbjct: 659 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYS------NLQV--------- 701
Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGSDVVIVYSKPPAEIAAT 726
D E V FQ N G G +V VY K P
Sbjct: 702 ----------------------ADGEEEINVSFQLKNSGKYAGDEVAQVYVKLPERDEIM 739
Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
IK++ GF+RV +++G NK++ L D A + + P+G++TI VG
Sbjct: 740 PIKELKGFERVTLKSGENKKVTLKLRK-DLLRYWDEAKDKFVCPSGDYTIMVG 791
>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
Length = 805
Score = 252 bits (644), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 231/807 (28%), Positives = 347/807 (42%), Gaps = 145/807 (17%)
Query: 54 FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
+ + S P RV+ L+S+MTL+EKV Q+ G+ P+L E+
Sbjct: 40 YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYKRVGEDIRLTPQLEKEIGEYHIG 99
Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
+L G P T H IP G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159
Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
T FPT I +++N L +++G+ ++ EA A + + P +++ARDPRW R+
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214
Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
ET GEDP++ G VRG Q E D S V + KH+A+Y W
Sbjct: 215 ETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263
Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
A + E+++EE PF V G A SVM SYN ++G P LL ++ W
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322
Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
G++V+D ++ + ++ +A + +A + + AG+D D G Y AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380
Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
IDK+++ + ++ ++G FD Q + S E+ LA E AR+ IVLLKN
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKD 440
Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
LPL ++T+AV+GP+A+ M+G+Y G + I S V Y
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499
Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
GC V S A E A+ ADA +++ G D S E
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558
Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
E DR L L G Q +L+ +++ + K PV+LV++ G + +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615
Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
G +GG A+ADV+FG +NP GRL ++ V LP+ R G R Y G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPG 668
Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
YPFGYGLSYT F Y + +QV +D R
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK----VQVTEGS-----------------------DDCRV 701
Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
D V QN G+ DG +V +Y + T KQ+ F R+ ++A ++ + F
Sbjct: 702 D----VTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAAESREVTFTL 757
Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
+ KSL + ++ G TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGRFTIMVG 783
>gi|336255157|ref|YP_004598264.1| beta-glucosidase [Halopiger xanaduensis SH-6]
gi|335339146|gb|AEH38385.1| Beta-glucosidase [Halopiger xanaduensis SH-6]
Length = 774
Score = 252 bits (644), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 225/803 (28%), Positives = 369/803 (45%), Gaps = 138/803 (17%)
Query: 48 QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD---------------------FAH 86
++S+ + D S RV+DL+ RMT++EK QLG AH
Sbjct: 4 ELSTAAYQDESESVENRVEDLLERMTVEEKAAQLGSVNADRLLDEDGEIDWDAVDEWLAH 63
Query: 87 GV---PRLGLPQYEWWSEA----------LHGVSNVG-PGTHFDDVI-----PGATSFPT 127
G+ RLG SEA L + +G P ++ + P AT+FP
Sbjct: 64 GIGHFTRLGGEGSLAPSEAARVTNELQTYLREETRLGIPAIPHEECLSGYMGPEATTFPQ 123
Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
++ +S+N L + + + + E G + SP ++VARD RWGR+ ET GED
Sbjct: 124 MLGMASSWNPELLQTVTETIRGELE-----GIGTVHALSPVLDVARDLRWGRVEETFGED 178
Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
P++V A YV GLQ + R +S+ KH+ + + G +R +
Sbjct: 179 PYMVAEMARAYVSGLQG----------DGRADGISATLKHFVGHGATD-GGKNRSSLN-- 225
Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
V +++ ET L P+E + E +A SVM +Y+ ++G+P LL + +RGE+ G +V
Sbjct: 226 VGPRELRETHLFPYEAVISEANAESVMNAYHDLDGVPCANSEWLLTEVLRGEFGFDGTVV 285
Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDI 365
+D S++ +V H+ A +K +A Q L+AG+D++ +YY AV++G + E +
Sbjct: 286 SDYYSVRHLVTEHE-TASTKPEAAVQALEAGIDVELPYTEYYGEHLVEAVEEGDLAEETL 344
Query: 366 DKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
++S++ + R G FD V +DE E+ EAAR+ + LLKN+ +
Sbjct: 345 NESVRRILREKFRKGVFDDPAVDVDAAADAFHTDEAREVTREAARQSMTLLKNEDDL--- 401
Query: 426 NSAKVKTVAVVGPHANATVAMIGNYA--------GIPCRYMSPIAGFSGY--ANVTYKTG 475
V VAVVGP A+ ++G+YA ++P+ +VTY+ G
Sbjct: 402 LPLDVDDVAVVGPKADNPKELMGDYAYAAHYPEEEYEADAVTPLEALEARDGLDVTYEQG 461
Query: 476 CDDVACKSNNSIFAASEAAKTADATIILAG----LDLS-VEAESLDRED----------- 519
C ++ S + AA++AA AD + G +D S VEAE ++
Sbjct: 462 C-TISGPSTDGFDAAADAAADADVALAFVGARSAVDFSDVEAEKEEKPSVPTSGEGCDVT 520
Query: 520 -LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
L LPG Q +L+ ++ E PV++V++S G A + AIL+A PG+EGG A
Sbjct: 521 HLGLPGVQEELVAELLET-DTPVVVVLVS--GKPHAIEDIAAEAPAILYAWLPGDEGGTA 577
Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLP--LTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
IA+ +FG+ NP G+LP++ LP + +P+ + Y + + +YP
Sbjct: 578 IAETLFGENNPAGKLPVS---------LPKSVGQLPVYYNRKENTANKDYVYTDSEPVYP 628
Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
FG+G SYT+F+Y +S + L F
Sbjct: 629 FGHGESYTEFEYGDVSLSTDSVTPLGS-------------------------------FT 657
Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
V NVG G ++V Y + A +++++GF+RV + G +KR+ F +A +
Sbjct: 658 ASVTVANVGDRAGDEIVQCYGRATNASQARPVQELLGFERVSLEPGESKRVAFDLSATQ- 716
Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
L D + N + G + I +G
Sbjct: 717 LAFHDLSMNLAVEEGPYEIRIGR 739
>gi|345302417|ref|YP_004824319.1| beta-glucosidase [Rhodothermus marinus SG0.5JP17-172]
gi|345111650|gb|AEN72482.1| Beta-glucosidase [Rhodothermus marinus SG0.5JP17-172]
Length = 754
Score = 252 bits (643), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 229/768 (29%), Positives = 359/768 (46%), Gaps = 116/768 (15%)
Query: 65 VKDLVSRMTLDEKVQQL----GDFAHGVP-------------RLGLPQYEWWSEALHGV- 106
++ L++RMTL+EK+ QL G A P R+G + +EA+ +
Sbjct: 33 IEALLARMTLEEKLGQLTLYNGGMAETGPVVREGEPDAIRRGRVGAVMNFFGAEAVCAMQ 92
Query: 107 ------SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
S +G P DVI G T FP + A+F+ +L ++ + + EA A+
Sbjct: 93 RQAVEESRLGIPLLFALDVIHGFRTIFPVPLAEAATFDPALVEQAARVAAGEASAV---- 148
Query: 159 RAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
GL + ++P +++ARD RWGRI E GEDP++ A VRG Q DL
Sbjct: 149 --GLNWTFAPMVDIARDARWGRIVEGSGEDPYLGAVMAAARVRGFQ-------GRDLRD- 198
Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
P + + KH+AAY G D D V+E+ + E +L PFE V+ G A S+M ++
Sbjct: 199 PTTILATAKHFAAYGAAE-AGRDYNTVD--VSERTLREVYLPPFEAAVRAG-ALSIMSAF 254
Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
N + G+P+ AD LL +R EW G +V+D S+ ++ H ADS E + L+A
Sbjct: 255 NEIGGVPATADRWLLTDVLRHEWGFEGLVVSDYTSVWELL-FHGIAADSAEVG-RKALEA 312
Query: 338 GLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGKQ 394
G+D+D Y V+ G++ E +D++++ + V RLG F+ +Y + +Q
Sbjct: 313 GVDMDMVSGIYVRKLAEEVRAGRLSEAVVDEAVRRVLRVKYRLGLFEDPYRYCRDASREQ 372
Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--G 452
+ S + LA E AR+ IVLLKN+ LPL ++ VAV+G AN + +++G +A G
Sbjct: 373 VLLSPAHRRLAREVARKAIVLLKNEGELLPLAD-TLQRVAVIGALANDSASVLGPWAAAG 431
Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDV-----------ACKSNNSIFAASEA-AKTA 497
P ++ + G A V Y G +V A + S FA +EA A+ A
Sbjct: 432 RPEDAVTILEGIRAALPGATVRYAPGYAEVPSGSFQEMVAAALSPDTSGFAEAEAVARWA 491
Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
+ I++ G + E+ R + LPG Q L ++ + + PV++V+M+ G +A E
Sbjct: 492 EVVILVLGEHRELSGEAASRASVELPGVQLALARRLLALGR-PVVVVLMN--GRPLAIPE 548
Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
AI+ A + G E G A+ADV+ GK +PGGRLP+++ + L P
Sbjct: 549 LAALAPAIVEAWFLGTEMGHAVADVLLGKASPGGRLPVSFPRATGQEPLYYNHKP----- 603
Query: 618 SLGYPGR-----TYKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
G P R T K+ + P LYPFGYGL+YT F Y+ L ++
Sbjct: 604 -TGRPPRAEEKYTSKYVDVPWTPLYPFGYGLTYTTFAYDSLRLSRRRLG----------- 651
Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
DD E V N G G +VV +Y + +K+
Sbjct: 652 --------------------LDDTLEVVVSVTNTGRRRGEEVVQLYVRDEVASVTRPVKE 691
Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
+ GF RV + G K ++F ++L ++ G T++VG
Sbjct: 692 LKGFARVELAPGETKAVQFRLP-VRALRFWGLEGGWVVEPGWFTLWVG 738
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,641,970,642
Number of Sequences: 23463169
Number of extensions: 547030438
Number of successful extensions: 1193614
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6086
Number of HSP's successfully gapped in prelim test: 1462
Number of HSP's that attempted gapping in prelim test: 1142885
Number of HSP's gapped (non-prelim): 17018
length of query: 792
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 641
effective length of database: 8,816,256,848
effective search space: 5651220639568
effective search space used: 5651220639568
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)